Sample records for rna structures present

  1. Structure-seq2: sensitive and accurate genome-wide profiling of RNA structure in vivo

    PubMed Central

    Ritchey, Laura E.; Su, Zhao; Tang, Yin; Tack, David C.

    2017-01-01

    Abstract RNA serves many functions in biology such as splicing, temperature sensing, and innate immunity. These functions are often determined by the structure of RNA. There is thus a pressing need to understand RNA structure and how it changes during diverse biological processes both in vivo and genome-wide. Here, we present Structure-seq2, which provides nucleotide-resolution RNA structural information in vivo and genome-wide. This optimized version of our original Structure-seq method increases sensitivity by at least 4-fold and improves data quality by minimizing formation of a deleterious by-product, reducing ligation bias, and improving read coverage. We also present a variation of Structure-seq2 in which a biotinylated nucleotide is incorporated during reverse transcription, which greatly facilitates the protocol by eliminating two PAGE purification steps. We benchmark Structure-seq2 on both mRNA and rRNA structure in rice (Oryza sativa). We demonstrate that Structure-seq2 can lead to new biological insights. Our Structure-seq2 datasets uncover hidden breaks in chloroplast rRNA and identify a previously unreported N1-methyladenosine (m1A) in a nuclear-encoded Oryza sativa rRNA. Overall, Structure-seq2 is a rapid, sensitive, and unbiased method to probe RNA in vivo and genome-wide that facilitates new insights into RNA biology. PMID:28637286

  2. CompaRNA: a server for continuous benchmarking of automated methods for RNA secondary structure prediction

    PubMed Central

    Puton, Tomasz; Kozlowski, Lukasz P.; Rother, Kristian M.; Bujnicki, Janusz M.

    2013-01-01

    We present a continuous benchmarking approach for the assessment of RNA secondary structure prediction methods implemented in the CompaRNA web server. As of 3 October 2012, the performance of 28 single-sequence and 13 comparative methods has been evaluated on RNA sequences/structures released weekly by the Protein Data Bank. We also provide a static benchmark generated on RNA 2D structures derived from the RNAstrand database. Benchmarks on both data sets offer insight into the relative performance of RNA secondary structure prediction methods on RNAs of different size and with respect to different types of structure. According to our tests, on the average, the most accurate predictions obtained by a comparative approach are generated by CentroidAlifold, MXScarna, RNAalifold and TurboFold. On the average, the most accurate predictions obtained by single-sequence analyses are generated by CentroidFold, ContextFold and IPknot. The best comparative methods typically outperform the best single-sequence methods if an alignment of homologous RNA sequences is available. This article presents the results of our benchmarks as of 3 October 2012, whereas the rankings presented online are continuously updated. We will gladly include new prediction methods and new measures of accuracy in the new editions of CompaRNA benchmarks. PMID:23435231

  3. Automated 3D structure composition for large RNAs

    PubMed Central

    Popenda, Mariusz; Szachniuk, Marta; Antczak, Maciej; Purzycka, Katarzyna J.; Lukasiak, Piotr; Bartol, Natalia; Blazewicz, Jacek; Adamiak, Ryszard W.

    2012-01-01

    Understanding the numerous functions that RNAs play in living cells depends critically on knowledge of their three-dimensional structure. Due to the difficulties in experimentally assessing structures of large RNAs, there is currently great demand for new high-resolution structure prediction methods. We present the novel method for the fully automated prediction of RNA 3D structures from a user-defined secondary structure. The concept is founded on the machine translation system. The translation engine operates on the RNA FRABASE database tailored to the dictionary relating the RNA secondary structure and tertiary structure elements. The translation algorithm is very fast. Initial 3D structure is composed in a range of seconds on a single processor. The method assures the prediction of large RNA 3D structures of high quality. Our approach needs neither structural templates nor RNA sequence alignment, required for comparative methods. This enables the building of unresolved yet native and artificial RNA structures. The method is implemented in a publicly available, user-friendly server RNAComposer. It works in an interactive mode and a batch mode. The batch mode is designed for large-scale modelling and accepts atomic distance restraints. Presently, the server is set to build RNA structures of up to 500 residues. PMID:22539264

  4. Automated and fast building of three-dimensional RNA structures.

    PubMed

    Zhao, Yunjie; Huang, Yangyu; Gong, Zhou; Wang, Yanjie; Man, Jianfen; Xiao, Yi

    2012-01-01

    Building tertiary structures of non-coding RNA is required to understand their functions and design new molecules. Current algorithms of RNA tertiary structure prediction give satisfactory accuracy only for small size and simple topology and many of them need manual manipulation. Here, we present an automated and fast program, 3dRNA, for RNA tertiary structure prediction with reasonable accuracy for RNAs of larger size and complex topology.

  5. Structure, recognition and adaptive binding in RNA aptamer complexes.

    PubMed

    Patel, D J; Suri, A K; Jiang, F; Jiang, L; Fan, P; Kumar, R A; Nonin, S

    1997-10-10

    Novel features of RNA structure, recognition and discrimination have been recently elucidated through the solution structural characterization of RNA aptamers that bind cofactors, aminoglycoside antibiotics, amino acids and peptides with high affinity and specificity. This review presents the solution structures of RNA aptamer complexes with adenosine monophosphate, flavin mononucleotide, arginine/citrulline and tobramycin together with an example of hydrogen exchange measurements of the base-pair kinetics for the AMP-RNA aptamer complex. A comparative analysis of the structures of these RNA aptamer complexes yields the principles, patterns and diversity associated with RNA architecture, molecular recognition and adaptive binding associated with complex formation.

  6. Swellix: a computational tool to explore RNA conformational space.

    PubMed

    Sloat, Nathan; Liu, Jui-Wen; Schroeder, Susan J

    2017-11-21

    The sequence of nucleotides in an RNA determines the possible base pairs for an RNA fold and thus also determines the overall shape and function of an RNA. The Swellix program presented here combines a helix abstraction with a combinatorial approach to the RNA folding problem in order to compute all possible non-pseudoknotted RNA structures for RNA sequences. The Swellix program builds on the Crumple program and can include experimental constraints on global RNA structures such as the minimum number and lengths of helices from crystallography, cryoelectron microscopy, or in vivo crosslinking and chemical probing methods. The conceptual advance in Swellix is to count helices and generate all possible combinations of helices rather than counting and combining base pairs. Swellix bundles similar helices and includes improvements in memory use and efficient parallelization. Biological applications of Swellix are demonstrated by computing the reduction in conformational space and entropy due to naturally modified nucleotides in tRNA sequences and by motif searches in Human Endogenous Retroviral (HERV) RNA sequences. The Swellix motif search reveals occurrences of protein and drug binding motifs in the HERV RNA ensemble that do not occur in minimum free energy or centroid predicted structures. Swellix presents significant improvements over Crumple in terms of efficiency and memory use. The efficient parallelization of Swellix enables the computation of sequences as long as 418 nucleotides with sufficient experimental constraints. Thus, Swellix provides a practical alternative to free energy minimization tools when multiple structures, kinetically determined structures, or complex RNA-RNA and RNA-protein interactions are present in an RNA folding problem.

  7. RNA secondary structure prediction using soft computing.

    PubMed

    Ray, Shubhra Sankar; Pal, Sankar K

    2013-01-01

    Prediction of RNA structure is invaluable in creating new drugs and understanding genetic diseases. Several deterministic algorithms and soft computing-based techniques have been developed for more than a decade to determine the structure from a known RNA sequence. Soft computing gained importance with the need to get approximate solutions for RNA sequences by considering the issues related with kinetic effects, cotranscriptional folding, and estimation of certain energy parameters. A brief description of some of the soft computing-based techniques, developed for RNA secondary structure prediction, is presented along with their relevance. The basic concepts of RNA and its different structural elements like helix, bulge, hairpin loop, internal loop, and multiloop are described. These are followed by different methodologies, employing genetic algorithms, artificial neural networks, and fuzzy logic. The role of various metaheuristics, like simulated annealing, particle swarm optimization, ant colony optimization, and tabu search is also discussed. A relative comparison among different techniques, in predicting 12 known RNA secondary structures, is presented, as an example. Future challenging issues are then mentioned.

  8. Biochemical and Structural Studies of RNA Modification and Repair

    ERIC Educational Resources Information Center

    Chan, Chio Mui

    2009-01-01

    RNA modification, RNA interference, and RNA repair are important events in the cell. This thesis presents three projects related to these three fields. By using both biochemical and structural methods, we characterized enzymatic activities of pseudouridine synthase TruD, solved the structure of "A. aeolicus" GidA, and reconstituted a novel…

  9. High-throughput determination of RNA structure by proximity ligation.

    PubMed

    Ramani, Vijay; Qiu, Ruolan; Shendure, Jay

    2015-09-01

    We present an unbiased method to globally resolve RNA structures through pairwise contact measurements between interacting regions. RNA proximity ligation (RPL) uses proximity ligation of native RNA followed by deep sequencing to yield chimeric reads with ligation junctions in the vicinity of structurally proximate bases. We apply RPL in both baker's yeast (Saccharomyces cerevisiae) and human cells and generate contact probability maps for ribosomal and other abundant RNAs, including yeast snoRNAs, the RNA subunit of the signal recognition particle and the yeast U2 spliceosomal RNA homolog. RPL measurements correlate with established secondary structures for these RNA molecules, including stem-loop structures and long-range pseudoknots. We anticipate that RPL will complement the current repertoire of computational and experimental approaches in enabling the high-throughput determination of secondary and tertiary RNA structures.

  10. Computational strategies for the automated design of RNA nanoscale structures from building blocks using NanoTiler.

    PubMed

    Bindewald, Eckart; Grunewald, Calvin; Boyle, Brett; O'Connor, Mary; Shapiro, Bruce A

    2008-10-01

    One approach to designing RNA nanoscale structures is to use known RNA structural motifs such as junctions, kissing loops or bulges and to construct a molecular model by connecting these building blocks with helical struts. We previously developed an algorithm for detecting internal loops, junctions and kissing loops in RNA structures. Here we present algorithms for automating or assisting many of the steps that are involved in creating RNA structures from building blocks: (1) assembling building blocks into nanostructures using either a combinatorial search or constraint satisfaction; (2) optimizing RNA 3D ring structures to improve ring closure; (3) sequence optimisation; (4) creating a unique non-degenerate RNA topology descriptor. This effectively creates a computational pipeline for generating molecular models of RNA nanostructures and more specifically RNA ring structures with optimized sequences from RNA building blocks. We show several examples of how the algorithms can be utilized to generate RNA tecto-shapes.

  11. Computational strategies for the automated design of RNA nanoscale structures from building blocks using NanoTiler☆

    PubMed Central

    Bindewald, Eckart; Grunewald, Calvin; Boyle, Brett; O’Connor, Mary; Shapiro, Bruce A.

    2013-01-01

    One approach to designing RNA nanoscale structures is to use known RNA structural motifs such as junctions, kissing loops or bulges and to construct a molecular model by connecting these building blocks with helical struts. We previously developed an algorithm for detecting internal loops, junctions and kissing loops in RNA structures. Here we present algorithms for automating or assisting many of the steps that are involved in creating RNA structures from building blocks: (1) assembling building blocks into nanostructures using either a combinatorial search or constraint satisfaction; (2) optimizing RNA 3D ring structures to improve ring closure; (3) sequence optimisation; (4) creating a unique non-degenerate RNA topology descriptor. This effectively creates a computational pipeline for generating molecular models of RNA nanostructures and more specifically RNA ring structures with optimized sequences from RNA building blocks. We show several examples of how the algorithms can be utilized to generate RNA tecto-shapes. PMID:18838281

  12. A guide to large-scale RNA sample preparation.

    PubMed

    Baronti, Lorenzo; Karlsson, Hampus; Marušič, Maja; Petzold, Katja

    2018-05-01

    RNA is becoming more important as an increasing number of functions, both regulatory and enzymatic, are being discovered on a daily basis. As the RNA boom has just begun, most techniques are still in development and changes occur frequently. To understand RNA functions, revealing the structure of RNA is of utmost importance, which requires sample preparation. We review the latest methods to produce and purify a variation of RNA molecules for different purposes with the main focus on structural biology and biophysics. We present a guide aimed at identifying the most suitable method for your RNA and your biological question and highlighting the advantages of different methods. Graphical abstract In this review we present different methods for large-scale production and purification of RNAs for structural and biophysical studies.

  13. Quantifying the relationship between sequence and three-dimensional structure conservation in RNA

    PubMed Central

    2010-01-01

    Background In recent years, the number of available RNA structures has rapidly grown reflecting the increased interest on RNA biology. Similarly to the studies carried out two decades ago for proteins, which gave the fundamental grounds for developing comparative protein structure prediction methods, we are now able to quantify the relationship between sequence and structure conservation in RNA. Results Here we introduce an all-against-all sequence- and three-dimensional (3D) structure-based comparison of a representative set of RNA structures, which have allowed us to quantitatively confirm that: (i) there is a measurable relationship between sequence and structure conservation that weakens for alignments resulting in below 60% sequence identity, (ii) evolution tends to conserve more RNA structure than sequence, and (iii) there is a twilight zone for RNA homology detection. Discussion The computational analysis here presented quantitatively describes the relationship between sequence and structure for RNA molecules and defines a twilight zone region for detecting RNA homology. Our work could represent the theoretical basis and limitations for future developments in comparative RNA 3D structure prediction. PMID:20550657

  14. Advancing viral RNA structure prediction: measuring the thermodynamics of pyrimidine-rich internal loops

    PubMed Central

    Phan, Andy; Mailey, Katherine; Saeki, Jessica; Gu, Xiaobo

    2017-01-01

    Accurate thermodynamic parameters improve RNA structure predictions and thus accelerate understanding of RNA function and the identification of RNA drug binding sites. Many viral RNA structures, such as internal ribosome entry sites, have internal loops and bulges that are potential drug target sites. Current models used to predict internal loops are biased toward small, symmetric purine loops, and thus poorly predict asymmetric, pyrimidine-rich loops with >6 nucleotides (nt) that occur frequently in viral RNA. This article presents new thermodynamic data for 40 pyrimidine loops, many of which can form UU or protonated CC base pairs. Uracil and protonated cytosine base pairs stabilize asymmetric internal loops. Accurate prediction rules are presented that account for all thermodynamic measurements of RNA asymmetric internal loops. New loop initiation terms for loops with >6 nt are presented that do not follow previous assumptions that increasing asymmetry destabilizes loops. Since the last 2004 update, 126 new loops with asymmetry or sizes greater than 2 × 2 have been measured. These new measurements significantly deepen and diversify the thermodynamic database for RNA. These results will help better predict internal loops that are larger, pyrimidine-rich, and occur within viral structures such as internal ribosome entry sites. PMID:28213527

  15. Structural imprints in vivo decode RNA regulatory mechanisms.

    PubMed

    Spitale, Robert C; Flynn, Ryan A; Zhang, Qiangfeng Cliff; Crisalli, Pete; Lee, Byron; Jung, Jong-Wha; Kuchelmeister, Hannes Y; Batista, Pedro J; Torre, Eduardo A; Kool, Eric T; Chang, Howard Y

    2015-03-26

    Visualizing the physical basis for molecular behaviour inside living cells is a great challenge for biology. RNAs are central to biological regulation, and the ability of RNA to adopt specific structures intimately controls every step of the gene expression program. However, our understanding of physiological RNA structures is limited; current in vivo RNA structure profiles include only two of the four nucleotides that make up RNA. Here we present a novel biochemical approach, in vivo click selective 2'-hydroxyl acylation and profiling experiment (icSHAPE), which enables the first global view, to our knowledge, of RNA secondary structures in living cells for all four bases. icSHAPE of the mouse embryonic stem cell transcriptome versus purified RNA folded in vitro shows that the structural dynamics of RNA in the cellular environment distinguish different classes of RNAs and regulatory elements. Structural signatures at translational start sites and ribosome pause sites are conserved from in vitro conditions, suggesting that these RNA elements are programmed by sequence. In contrast, focal structural rearrangements in vivo reveal precise interfaces of RNA with RNA-binding proteins or RNA-modification sites that are consistent with atomic-resolution structural data. Such dynamic structural footprints enable accurate prediction of RNA-protein interactions and N(6)-methyladenosine (m(6)A) modification genome wide. These results open the door for structural genomics of RNA in living cells and reveal key physiological structures controlling gene expression.

  16. Dawn of the in vivo RNA structurome and interactome.

    PubMed

    Kwok, Chun Kit

    2016-10-15

    RNA is one of the most fascinating biomolecules in living systems given its structural versatility to fold into elaborate architectures for important biological functions such as gene regulation, catalysis, and information storage. Knowledge of RNA structures and interactions can provide deep insights into their functional roles in vivo For decades, RNA structural studies have been conducted on a transcript-by-transcript basis. The advent of next-generation sequencing (NGS) has enabled the development of transcriptome-wide structural probing methods to profile the global landscape of RNA structures and interactions, also known as the RNA structurome and interactome, which transformed our understanding of the RNA structure-function relationship on a transcriptomic scale. In this review, molecular tools and NGS methods used for RNA structure probing are presented, novel insights uncovered by RNA structurome and interactome studies are highlighted, and perspectives on current challenges and potential future directions are discussed. A more complete understanding of the RNA structures and interactions in vivo will help illuminate the novel roles of RNA in gene regulation, development, and diseases. © 2016 The Author(s); published by Portland Press Limited on behalf of the Biochemical Society.

  17. Advancing viral RNA structure prediction: measuring the thermodynamics of pyrimidine-rich internal loops.

    PubMed

    Phan, Andy; Mailey, Katherine; Saeki, Jessica; Gu, Xiaobo; Schroeder, Susan J

    2017-05-01

    Accurate thermodynamic parameters improve RNA structure predictions and thus accelerate understanding of RNA function and the identification of RNA drug binding sites. Many viral RNA structures, such as internal ribosome entry sites, have internal loops and bulges that are potential drug target sites. Current models used to predict internal loops are biased toward small, symmetric purine loops, and thus poorly predict asymmetric, pyrimidine-rich loops with >6 nucleotides (nt) that occur frequently in viral RNA. This article presents new thermodynamic data for 40 pyrimidine loops, many of which can form UU or protonated CC base pairs. Uracil and protonated cytosine base pairs stabilize asymmetric internal loops. Accurate prediction rules are presented that account for all thermodynamic measurements of RNA asymmetric internal loops. New loop initiation terms for loops with >6 nt are presented that do not follow previous assumptions that increasing asymmetry destabilizes loops. Since the last 2004 update, 126 new loops with asymmetry or sizes greater than 2 × 2 have been measured. These new measurements significantly deepen and diversify the thermodynamic database for RNA. These results will help better predict internal loops that are larger, pyrimidine-rich, and occur within viral structures such as internal ribosome entry sites. © 2017 Phan et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  18. bpRNA: large-scale automated annotation and analysis of RNA secondary structure.

    PubMed

    Danaee, Padideh; Rouches, Mason; Wiley, Michelle; Deng, Dezhong; Huang, Liang; Hendrix, David

    2018-05-09

    While RNA secondary structure prediction from sequence data has made remarkable progress, there is a need for improved strategies for annotating the features of RNA secondary structures. Here, we present bpRNA, a novel annotation tool capable of parsing RNA structures, including complex pseudoknot-containing RNAs, to yield an objective, precise, compact, unambiguous, easily-interpretable description of all loops, stems, and pseudoknots, along with the positions, sequence, and flanking base pairs of each such structural feature. We also introduce several new informative representations of RNA structure types to improve structure visualization and interpretation. We have further used bpRNA to generate a web-accessible meta-database, 'bpRNA-1m', of over 100 000 single-molecule, known secondary structures; this is both more fully and accurately annotated and over 20-times larger than existing databases. We use a subset of the database with highly similar (≥90% identical) sequences filtered out to report on statistical trends in sequence, flanking base pairs, and length. Both the bpRNA method and the bpRNA-1m database will be valuable resources both for specific analysis of individual RNA molecules and large-scale analyses such as are useful for updating RNA energy parameters for computational thermodynamic predictions, improving machine learning models for structure prediction, and for benchmarking structure-prediction algorithms.

  19. Structural imprints in vivo decode RNA regulatory mechanisms

    PubMed Central

    Spitale, Robert C.; Flynn, Ryan A.; Zhang, Qiangfeng Cliff; Crisalli, Pete; Lee, Byron; Jung, Jong-Wha; Kuchelmeister, Hannes Y.; Batista, Pedro J.; Torre, Eduardo A.; Kool, Eric T.; Chang, Howard Y.

    2015-01-01

    Visualizing the physical basis for molecular behavior inside living cells is a grand challenge in biology. RNAs are central to biological regulation, and RNA’s ability to adopt specific structures intimately controls every step of the gene expression program1. However, our understanding of physiological RNA structures is limited; current in vivo RNA structure profiles view only two of four nucleotides that make up RNA2,3. Here we present a novel biochemical approach, In Vivo Click SHAPE (icSHAPE), that enables the first global view of RNA secondary structures of all four bases in living cells. icSHAPE of mouse embryonic stem cell transcriptome versus purified RNA folded in vitro shows that the structural dynamics of RNA in the cellular environment distinguishes different classes of RNAs and regulatory elements. Structural signatures at translational start sites and ribosome pause sites are conserved from in vitro, suggesting that these RNA elements are programmed by sequence. In contrast, focal structural rearrangements in vivo reveal precise interfaces of RNA with RNA binding proteins or RNA modification sites that are consistent with atomic-resolution structural data. Such dynamic structural footprints enable accurate prediction of RNA-protein interactions and N6-methyladenosine (m6A) modification genome-wide. These results open the door for structural genomics of RNA in living cells and reveal key physiological structures controlling gene expression. PMID:25799993

  20. An efficient algorithm for planar drawing of RNA structures with pseudoknots of any type.

    PubMed

    Byun, Yanga; Han, Kyungsook

    2016-06-01

    An RNA pseudoknot is a tertiary structural element in which bases of a loop pair with complementary bases are outside the loop. A drawing of RNA secondary structures is a tree, but a drawing of RNA pseudoknots is a graph that has an inner cycle within a pseudoknot and possibly outer cycles formed between the pseudoknot and other structural elements. Visualizing a large-scale RNA structure with pseudoknots as a planar drawing is challenging because a planar drawing of an RNA structure requires both pseudoknots and an entire structure enclosing the pseudoknots to be embedded into a plane without overlapping or crossing. This paper presents an efficient heuristic algorithm for visualizing a pseudoknotted RNA structure as a planar drawing. The algorithm consists of several parts for finding crossing stems and page mapping the stems, for the layout of stem-loops and pseudoknots, and for overlap detection between structural elements and resolving it. Unlike previous algorithms, our algorithm generates a planar drawing for a large RNA structure with pseudoknots of any type and provides a bracket view of the structure. It generates a compact and aesthetic structure graph for a large pseudoknotted RNA structure in O([Formula: see text]) time, where n is the number of stems of the RNA structure.

  1. Sequence-structure relationships in RNA loops: establishing the basis for loop homology modeling.

    PubMed

    Schudoma, Christian; May, Patrick; Nikiforova, Viktoria; Walther, Dirk

    2010-01-01

    The specific function of RNA molecules frequently resides in their seemingly unstructured loop regions. We performed a systematic analysis of RNA loops extracted from experimentally determined three-dimensional structures of RNA molecules. A comprehensive loop-structure data set was created and organized into distinct clusters based on structural and sequence similarity. We detected clear evidence of the hallmark of homology present in the sequence-structure relationships in loops. Loops differing by <25% in sequence identity fold into very similar structures. Thus, our results support the application of homology modeling for RNA loop model building. We established a threshold that may guide the sequence divergence-based selection of template structures for RNA loop homology modeling. Of all possible sequences that are, under the assumption of isosteric relationships, theoretically compatible with actual sequences observed in RNA structures, only a small fraction is contained in the Rfam database of RNA sequences and classes implying that the actual RNA loop space may consist of a limited number of unique loop structures and conserved sequences. The loop-structure data sets are made available via an online database, RLooM. RLooM also offers functionalities for the modeling of RNA loop structures in support of RNA engineering and design efforts.

  2. RBind: computational network method to predict RNA binding sites.

    PubMed

    Wang, Kaili; Jian, Yiren; Wang, Huiwen; Zeng, Chen; Zhao, Yunjie

    2018-04-26

    Non-coding RNA molecules play essential roles by interacting with other molecules to perform various biological functions. However, it is difficult to determine RNA structures due to their flexibility. At present, the number of experimentally solved RNA-ligand and RNA-protein structures is still insufficient. Therefore, binding sites prediction of non-coding RNA is required to understand their functions. Current RNA binding site prediction algorithms produce many false positive nucleotides that are distance away from the binding sites. Here, we present a network approach, RBind, to predict the RNA binding sites. We benchmarked RBind in RNA-ligand and RNA-protein datasets. The average accuracy of 0.82 in RNA-ligand and 0.63 in RNA-protein testing showed that this network strategy has a reliable accuracy for binding sites prediction. The codes and datasets are available at https://zhaolab.com.cn/RBind. yjzhaowh@mail.ccnu.edu.cn. Supplementary data are available at Bioinformatics online.

  3. Accurate multiple sequence-structure alignment of RNA sequences using combinatorial optimization.

    PubMed

    Bauer, Markus; Klau, Gunnar W; Reinert, Knut

    2007-07-27

    The discovery of functional non-coding RNA sequences has led to an increasing interest in algorithms related to RNA analysis. Traditional sequence alignment algorithms, however, fail at computing reliable alignments of low-homology RNA sequences. The spatial conformation of RNA sequences largely determines their function, and therefore RNA alignment algorithms have to take structural information into account. We present a graph-based representation for sequence-structure alignments, which we model as an integer linear program (ILP). We sketch how we compute an optimal or near-optimal solution to the ILP using methods from combinatorial optimization, and present results on a recently published benchmark set for RNA alignments. The implementation of our algorithm yields better alignments in terms of two published scores than the other programs that we tested: This is especially the case with an increasing number of input sequences. Our program LARA is freely available for academic purposes from http://www.planet-lisa.net.

  4. Structure of T7 RNA polymerase complexed to the transcriptional inhibitor T7 lysozyme.

    PubMed Central

    Jeruzalmi, D; Steitz, T A

    1998-01-01

    The T7 RNA polymerase-T7 lysozyme complex regulates phage gene expression during infection of Escherichia coli. The 2.8 A crystal structure of the complex reveals that lysozyme binds at a site remote from the polymerase active site, suggesting an indirect mechanism of inhibition. Comparison of the T7 RNA polymerase structure with that of the homologous pol I family of DNA polymerases reveals identities in the catalytic site but also differences specific to RNA polymerase function. The structure of T7 RNA polymerase presented here differs significantly from a previously published structure. Sequence similarities between phage RNA polymerases and those from mitochondria and chloroplasts, when interpreted in the context of our revised model of T7 RNA polymerase, suggest a conserved fold. PMID:9670025

  5. On topological RNA interaction structures.

    PubMed

    Qin, Jing; Reidys, Christian M

    2013-07-01

    Recently a folding algorithm of topological RNA pseudoknot structures was presented in Reidys et al. (2011). This algorithm folds single-stranded γ-structures, that is, RNA structures composed by distinct motifs of bounded topological genus. In this article, we set the theoretical foundations for the folding of the two backbone analogues of γ structures: the RNA γ-interaction structures. These are RNA-RNA interaction structures that are constructed by a finite number of building blocks over two backbones having genus at most γ. Combinatorial properties of γ-interaction structures are of practical interest since they have direct implications for the folding of topological interaction structures. We compute the generating function of γ-interaction structures and show that it is algebraic, which implies that the numbers of interaction structures can be computed recursively. We obtain simple asymptotic formulas for 0- and 1-interaction structures. The simplest class of interaction structures are the 0-interaction structures, which represent the two backbone analogues of secondary structures.

  6. R-chie: a web server and R package for visualizing RNA secondary structures

    PubMed Central

    Lai, Daniel; Proctor, Jeff R.; Zhu, Jing Yun A.; Meyer, Irmtraud M.

    2012-01-01

    Visually examining RNA structures can greatly aid in understanding their potential functional roles and in evaluating the performance of structure prediction algorithms. As many functional roles of RNA structures can already be studied given the secondary structure of the RNA, various methods have been devised for visualizing RNA secondary structures. Most of these methods depict a given RNA secondary structure as a planar graph consisting of base-paired stems interconnected by roundish loops. In this article, we present an alternative method of depicting RNA secondary structure as arc diagrams. This is well suited for structures that are difficult or impossible to represent as planar stem-loop diagrams. Arc diagrams can intuitively display pseudo-knotted structures, as well as transient and alternative structural features. In addition, they facilitate the comparison of known and predicted RNA secondary structures. An added benefit is that structure information can be displayed in conjunction with a corresponding multiple sequence alignments, thereby highlighting structure and primary sequence conservation and variation. We have implemented the visualization algorithm as a web server R-chie as well as a corresponding R package called R4RNA, which allows users to run the software locally and across a range of common operating systems. PMID:22434875

  7. SimRNA: a coarse-grained method for RNA folding simulations and 3D structure prediction.

    PubMed

    Boniecki, Michal J; Lach, Grzegorz; Dawson, Wayne K; Tomala, Konrad; Lukasz, Pawel; Soltysinski, Tomasz; Rother, Kristian M; Bujnicki, Janusz M

    2016-04-20

    RNA molecules play fundamental roles in cellular processes. Their function and interactions with other biomolecules are dependent on the ability to form complex three-dimensional (3D) structures. However, experimental determination of RNA 3D structures is laborious and challenging, and therefore, the majority of known RNAs remain structurally uncharacterized. Here, we present SimRNA: a new method for computational RNA 3D structure prediction, which uses a coarse-grained representation, relies on the Monte Carlo method for sampling the conformational space, and employs a statistical potential to approximate the energy and identify conformations that correspond to biologically relevant structures. SimRNA can fold RNA molecules using only sequence information, and, on established test sequences, it recapitulates secondary structure with high accuracy, including correct prediction of pseudoknots. For modeling of complex 3D structures, it can use additional restraints, derived from experimental or computational analyses, including information about secondary structure and/or long-range contacts. SimRNA also can be used to analyze conformational landscapes and identify potential alternative structures. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  8. Solution structure and thermodynamics of 2',5' RNA intercalation.

    PubMed

    Horowitz, Eric D; Lilavivat, Seth; Holladay, Benjamin W; Germann, Markus W; Hud, Nicholas V

    2009-04-29

    As a means to explore the influence of the nucleic acid backbone on the intercalative binding of ligands to DNA and RNA, we have determined the solution structure of a proflavine-bound 2',5'-linked octamer duplex with the sequence GCCGCGGC. This structure represents the first NMR structure of an intercalated RNA duplex, of either backbone structural isomer. By comparison with X-ray crystal structures, we have identified similarities and differences between intercalated 3',5' and 2',5'-linked RNA duplexes. First, the two forms of RNA have different sugar pucker geometries at the intercalated nucleotide steps, yet have the same interphosphate distances. Second, as in intercalated 3',5' RNA, the phosphate backbone angle zeta at the 2',5' RNA intercalation site prefers to be in the trans conformation, whereas unintercalated 2',5' and 3',5' RNA prefer the -gauche conformation. These observations provide new insights regarding the transitions required for intercalation of a phosphodiester-ribose backbone and suggest a possible contribution of the backbone to the origin of the nearest-neighbor exclusion principle. Thermodynamic studies presented for intercalation of both structural RNA isomers also reveal a surprising sensitivity of intercalator binding enthalpy and entropy to the details of RNA backbone structure.

  9. RAG-3D: A search tool for RNA 3D substructures

    DOE PAGES

    Zahran, Mai; Sevim Bayrak, Cigdem; Elmetwaly, Shereef; ...

    2015-08-24

    In this study, to address many challenges in RNA structure/function prediction, the characterization of RNA's modular architectural units is required. Using the RNA-As-Graphs (RAG) database, we have previously explored the existence of secondary structure (2D) submotifs within larger RNA structures. Here we present RAG-3D—a dataset of RNA tertiary (3D) structures and substructures plus a web-based search tool—designed to exploit graph representations of RNAs for the goal of searching for similar 3D structural fragments. The objects in RAG-3D consist of 3D structures translated into 3D graphs, cataloged based on the connectivity between their secondary structure elements. Each graph is additionally describedmore » in terms of its subgraph building blocks. The RAG-3D search tool then compares a query RNA 3D structure to those in the database to obtain structurally similar structures and substructures. This comparison reveals conserved 3D RNA features and thus may suggest functional connections. Though RNA search programs based on similarity in sequence, 2D, and/or 3D structural elements are available, our graph-based search tool may be advantageous for illuminating similarities that are not obvious; using motifs rather than sequence space also reduces search times considerably. Ultimately, such substructuring could be useful for RNA 3D structure prediction, structure/function inference and inverse folding.« less

  10. RAG-3D: a search tool for RNA 3D substructures

    PubMed Central

    Zahran, Mai; Sevim Bayrak, Cigdem; Elmetwaly, Shereef; Schlick, Tamar

    2015-01-01

    To address many challenges in RNA structure/function prediction, the characterization of RNA's modular architectural units is required. Using the RNA-As-Graphs (RAG) database, we have previously explored the existence of secondary structure (2D) submotifs within larger RNA structures. Here we present RAG-3D—a dataset of RNA tertiary (3D) structures and substructures plus a web-based search tool—designed to exploit graph representations of RNAs for the goal of searching for similar 3D structural fragments. The objects in RAG-3D consist of 3D structures translated into 3D graphs, cataloged based on the connectivity between their secondary structure elements. Each graph is additionally described in terms of its subgraph building blocks. The RAG-3D search tool then compares a query RNA 3D structure to those in the database to obtain structurally similar structures and substructures. This comparison reveals conserved 3D RNA features and thus may suggest functional connections. Though RNA search programs based on similarity in sequence, 2D, and/or 3D structural elements are available, our graph-based search tool may be advantageous for illuminating similarities that are not obvious; using motifs rather than sequence space also reduces search times considerably. Ultimately, such substructuring could be useful for RNA 3D structure prediction, structure/function inference and inverse folding. PMID:26304547

  11. RAG-3D: A search tool for RNA 3D substructures

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zahran, Mai; Sevim Bayrak, Cigdem; Elmetwaly, Shereef

    In this study, to address many challenges in RNA structure/function prediction, the characterization of RNA's modular architectural units is required. Using the RNA-As-Graphs (RAG) database, we have previously explored the existence of secondary structure (2D) submotifs within larger RNA structures. Here we present RAG-3D—a dataset of RNA tertiary (3D) structures and substructures plus a web-based search tool—designed to exploit graph representations of RNAs for the goal of searching for similar 3D structural fragments. The objects in RAG-3D consist of 3D structures translated into 3D graphs, cataloged based on the connectivity between their secondary structure elements. Each graph is additionally describedmore » in terms of its subgraph building blocks. The RAG-3D search tool then compares a query RNA 3D structure to those in the database to obtain structurally similar structures and substructures. This comparison reveals conserved 3D RNA features and thus may suggest functional connections. Though RNA search programs based on similarity in sequence, 2D, and/or 3D structural elements are available, our graph-based search tool may be advantageous for illuminating similarities that are not obvious; using motifs rather than sequence space also reduces search times considerably. Ultimately, such substructuring could be useful for RNA 3D structure prediction, structure/function inference and inverse folding.« less

  12. RNA design rules from a massive open laboratory

    PubMed Central

    Lee, Jeehyung; Kladwang, Wipapat; Lee, Minjae; Cantu, Daniel; Azizyan, Martin; Kim, Hanjoo; Limpaecher, Alex; Gaikwad, Snehal; Yoon, Sungroh; Treuille, Adrien; Das, Rhiju

    2014-01-01

    Self-assembling RNA molecules present compelling substrates for the rational interrogation and control of living systems. However, imperfect in silico models—even at the secondary structure level—hinder the design of new RNAs that function properly when synthesized. Here, we present a unique and potentially general approach to such empirical problems: the Massive Open Laboratory. The EteRNA project connects 37,000 enthusiasts to RNA design puzzles through an online interface. Uniquely, EteRNA participants not only manipulate simulated molecules but also control a remote experimental pipeline for high-throughput RNA synthesis and structure mapping. We show herein that the EteRNA community leveraged dozens of cycles of continuous wet laboratory feedback to learn strategies for solving in vitro RNA design problems on which automated methods fail. The top strategies—including several previously unrecognized negative design rules—were distilled by machine learning into an algorithm, EteRNABot. Over a rigorous 1-y testing phase, both the EteRNA community and EteRNABot significantly outperformed prior algorithms in a dozen RNA secondary structure design tests, including the creation of dendrimer-like structures and scaffolds for small molecule sensors. These results show that an online community can carry out large-scale experiments, hypothesis generation, and algorithm design to create practical advances in empirical science. PMID:24469816

  13. Structure determination of an 11-subunit exosome in complex with RNA by molecular replacement

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Makino, Debora Lika, E-mail: dmakino@biochem.mpg.de; Conti, Elena

    The crystallographic steps towards the structure determination of a complete eukaryotic exosome complex bound to RNA are presented. Phasing of this 11-protein subunit complex was carried out via molecular replacement. The RNA exosome is an evolutionarily conserved multi-protein complex involved in the 3′ degradation of a variety of RNA transcripts. In the nucleus, the exosome participates in the maturation of structured RNAs, in the surveillance of pre-mRNAs and in the decay of a variety of noncoding transcripts. In the cytoplasm, the exosome degrades mRNAs in constitutive and regulated turnover pathways. Several structures of subcomplexes of eukaryotic exosomes or related prokaryoticmore » exosome-like complexes are known, but how the complete assembly is organized to fulfil processive RNA degradation has been unclear. An atomic snapshot of a Saccharomyces cerevisiae 420 kDa exosome complex bound to an RNA substrate in the pre-cleavage state of a hydrolytic reaction has been determined. Here, the crystallographic steps towards the structural elucidation, which was carried out by molecular replacement, are presented.« less

  14. Efficient RNA structure comparison algorithms.

    PubMed

    Arslan, Abdullah N; Anandan, Jithendar; Fry, Eric; Monschke, Keith; Ganneboina, Nitin; Bowerman, Jason

    2017-12-01

    Recently proposed relative addressing-based ([Formula: see text]) RNA secondary structure representation has important features by which an RNA structure database can be stored into a suffix array. A fast substructure search algorithm has been proposed based on binary search on this suffix array. Using this substructure search algorithm, we present a fast algorithm that finds the largest common substructure of given multiple RNA structures in [Formula: see text] format. The multiple RNA structure comparison problem is NP-hard in its general formulation. We introduced a new problem for comparing multiple RNA structures. This problem has more strict similarity definition and objective, and we propose an algorithm that solves this problem efficiently. We also develop another comparison algorithm that iteratively calls this algorithm to locate nonoverlapping large common substructures in compared RNAs. With the new resulting tools, we improved the RNASSAC website (linked from http://faculty.tamuc.edu/aarslan ). This website now also includes two drawing tools: one specialized for preparing RNA substructures that can be used as input by the search tool, and another one for automatically drawing the entire RNA structure from a given structure sequence.

  15. Triplet repeat RNA structure and its role as pathogenic agent and therapeutic target

    PubMed Central

    Krzyzosiak, Wlodzimierz J.; Sobczak, Krzysztof; Wojciechowska, Marzena; Fiszer, Agnieszka; Mykowska, Agnieszka; Kozlowski, Piotr

    2012-01-01

    This review presents detailed information about the structure of triplet repeat RNA and addresses the simple sequence repeats of normal and expanded lengths in the context of the physiological and pathogenic roles played in human cells. First, we discuss the occurrence and frequency of various trinucleotide repeats in transcripts and classify them according to the propensity to form RNA structures of different architectures and stabilities. We show that repeats capable of forming hairpin structures are overrepresented in exons, which implies that they may have important functions. We further describe long triplet repeat RNA as a pathogenic agent by presenting human neurological diseases caused by triplet repeat expansions in which mutant RNA gains a toxic function. Prominent examples of these diseases include myotonic dystrophy type 1 and fragile X-associated tremor ataxia syndrome, which are triggered by mutant CUG and CGG repeats, respectively. In addition, we discuss RNA-mediated pathogenesis in polyglutamine disorders such as Huntington's disease and spinocerebellar ataxia type 3, in which expanded CAG repeats may act as an auxiliary toxic agent. Finally, triplet repeat RNA is presented as a therapeutic target. We describe various concepts and approaches aimed at the selective inhibition of mutant transcript activity in experimental therapies developed for repeat-associated diseases. PMID:21908410

  16. RNA Bricks—a database of RNA 3D motifs and their interactions

    PubMed Central

    Chojnowski, Grzegorz; Waleń, Tomasz; Bujnicki, Janusz M.

    2014-01-01

    The RNA Bricks database (http://iimcb.genesilico.pl/rnabricks), stores information about recurrent RNA 3D motifs and their interactions, found in experimentally determined RNA structures and in RNA–protein complexes. In contrast to other similar tools (RNA 3D Motif Atlas, RNA Frabase, Rloom) RNA motifs, i.e. ‘RNA bricks’ are presented in the molecular environment, in which they were determined, including RNA, protein, metal ions, water molecules and ligands. All nucleotide residues in RNA bricks are annotated with structural quality scores that describe real-space correlation coefficients with the electron density data (if available), backbone geometry and possible steric conflicts, which can be used to identify poorly modeled residues. The database is also equipped with an algorithm for 3D motif search and comparison. The algorithm compares spatial positions of backbone atoms of the user-provided query structure and of stored RNA motifs, without relying on sequence or secondary structure information. This enables the identification of local structural similarities among evolutionarily related and unrelated RNA molecules. Besides, the search utility enables searching ‘RNA bricks’ according to sequence similarity, and makes it possible to identify motifs with modified ribonucleotide residues at specific positions. PMID:24220091

  17. Aggregation and folding phase transitions of RNA molecules

    NASA Astrophysics Data System (ADS)

    Bundschuh, Ralf

    2007-03-01

    RNA is a biomolecule that is involved in nearly all aspects of cellular functions. In order to perform many of these functions, RNA molecules have to fold into specific secondary structures. This folding is driven by the tendency of the bases to form Watson-Crick base pairs. Beyond the biological importance of RNA, the relatively simple rules for structure formation of RNA make it a very interesting system from the statistical physics point of view. We will present examples of phase transitions in RNA secondary structure formation that are amenable to analytical descriptions. A special focus will be on aggregation between several RNA molecules which is important for some regulatory circuits based on RNA structure, triplet repeat diseases like Huntington's, and as a model for prion diseases. We show that depending on the relative strength of the intramolecular and the intermolecular base pairing, RNA molecules undergo a transition into an aggregated phase and quantitatively characterize this transition.

  18. ModeRNA: a tool for comparative modeling of RNA 3D structure

    PubMed Central

    Rother, Magdalena; Rother, Kristian; Puton, Tomasz; Bujnicki, Janusz M.

    2011-01-01

    RNA is a large group of functionally important biomacromolecules. In striking analogy to proteins, the function of RNA depends on its structure and dynamics, which in turn is encoded in the linear sequence. However, while there are numerous methods for computational prediction of protein three-dimensional (3D) structure from sequence, with comparative modeling being the most reliable approach, there are very few such methods for RNA. Here, we present ModeRNA, a software tool for comparative modeling of RNA 3D structures. As an input, ModeRNA requires a 3D structure of a template RNA molecule, and a sequence alignment between the target to be modeled and the template. It must be emphasized that a good alignment is required for successful modeling, and for large and complex RNA molecules the development of a good alignment usually requires manual adjustments of the input data based on previous expertise of the respective RNA family. ModeRNA can model post-transcriptional modifications, a functionally important feature analogous to post-translational modifications in proteins. ModeRNA can also model DNA structures or use them as templates. It is equipped with many functions for merging fragments of different nucleic acid structures into a single model and analyzing their geometry. Windows and UNIX implementations of ModeRNA with comprehensive documentation and a tutorial are freely available. PMID:21300639

  19. Information-Theoretic Uncertainty of SCFG-Modeled Folding Space of The Non-coding RNA

    PubMed Central

    Manzourolajdad, Amirhossein; Wang, Yingfeng; Shaw, Timothy I.; Malmberg, Russell L.

    2012-01-01

    RNA secondary structure ensembles define probability distributions for alternative equilibrium secondary structures of an RNA sequence. Shannon’s Entropy is a measure for the amount of diversity present in any ensemble. In this work, Shannon’s entropy of the SCFG ensemble on an RNA sequence is derived and implemented in polynomial time for both structurally ambiguous and unambiguous grammars. Micro RNA sequences generally have low folding entropy, as previously discovered. Surprisingly, signs of significantly high folding entropy were observed in certain ncRNA families. More effective models coupled with targeted randomization tests can lead to a better insight into folding features of these families. PMID:23160142

  20. GARN: Sampling RNA 3D Structure Space with Game Theory and Knowledge-Based Scoring Strategies.

    PubMed

    Boudard, Mélanie; Bernauer, Julie; Barth, Dominique; Cohen, Johanne; Denise, Alain

    2015-01-01

    Cellular processes involve large numbers of RNA molecules. The functions of these RNA molecules and their binding to molecular machines are highly dependent on their 3D structures. One of the key challenges in RNA structure prediction and modeling is predicting the spatial arrangement of the various structural elements of RNA. As RNA folding is generally hierarchical, methods involving coarse-grained models hold great promise for this purpose. We present here a novel coarse-grained method for sampling, based on game theory and knowledge-based potentials. This strategy, GARN (Game Algorithm for RNa sampling), is often much faster than previously described techniques and generates large sets of solutions closely resembling the native structure. GARN is thus a suitable starting point for the molecular modeling of large RNAs, particularly those with experimental constraints. GARN is available from: http://garn.lri.fr/.

  1. RNA FRABASE 2.0: an advanced web-accessible database with the capacity to search the three-dimensional fragments within RNA structures

    PubMed Central

    2010-01-01

    Background Recent discoveries concerning novel functions of RNA, such as RNA interference, have contributed towards the growing importance of the field. In this respect, a deeper knowledge of complex three-dimensional RNA structures is essential to understand their new biological functions. A number of bioinformatic tools have been proposed to explore two major structural databases (PDB, NDB) in order to analyze various aspects of RNA tertiary structures. One of these tools is RNA FRABASE 1.0, the first web-accessible database with an engine for automatic search of 3D fragments within PDB-derived RNA structures. This search is based upon the user-defined RNA secondary structure pattern. In this paper, we present and discuss RNA FRABASE 2.0. This second version of the system represents a major extension of this tool in terms of providing new data and a wide spectrum of novel functionalities. An intuitionally operated web server platform enables very fast user-tailored search of three-dimensional RNA fragments, their multi-parameter conformational analysis and visualization. Description RNA FRABASE 2.0 has stored information on 1565 PDB-deposited RNA structures, including all NMR models. The RNA FRABASE 2.0 search engine algorithms operate on the database of the RNA sequences and the new library of RNA secondary structures, coded in the dot-bracket format extended to hold multi-stranded structures and to cover residues whose coordinates are missing in the PDB files. The library of RNA secondary structures (and their graphics) is made available. A high level of efficiency of the 3D search has been achieved by introducing novel tools to formulate advanced searching patterns and to screen highly populated tertiary structure elements. RNA FRABASE 2.0 also stores data and conformational parameters in order to provide "on the spot" structural filters to explore the three-dimensional RNA structures. An instant visualization of the 3D RNA structures is provided. RNA FRABASE 2.0 is freely available at http://rnafrabase.cs.put.poznan.pl. Conclusions RNA FRABASE 2.0 provides a novel database and powerful search engine which is equipped with new data and functionalities that are unavailable elsewhere. Our intention is that this advanced version of the RNA FRABASE will be of interest to all researchers working in the RNA field. PMID:20459631

  2. RNA FRABASE 2.0: an advanced web-accessible database with the capacity to search the three-dimensional fragments within RNA structures.

    PubMed

    Popenda, Mariusz; Szachniuk, Marta; Blazewicz, Marek; Wasik, Szymon; Burke, Edmund K; Blazewicz, Jacek; Adamiak, Ryszard W

    2010-05-06

    Recent discoveries concerning novel functions of RNA, such as RNA interference, have contributed towards the growing importance of the field. In this respect, a deeper knowledge of complex three-dimensional RNA structures is essential to understand their new biological functions. A number of bioinformatic tools have been proposed to explore two major structural databases (PDB, NDB) in order to analyze various aspects of RNA tertiary structures. One of these tools is RNA FRABASE 1.0, the first web-accessible database with an engine for automatic search of 3D fragments within PDB-derived RNA structures. This search is based upon the user-defined RNA secondary structure pattern. In this paper, we present and discuss RNA FRABASE 2.0. This second version of the system represents a major extension of this tool in terms of providing new data and a wide spectrum of novel functionalities. An intuitionally operated web server platform enables very fast user-tailored search of three-dimensional RNA fragments, their multi-parameter conformational analysis and visualization. RNA FRABASE 2.0 has stored information on 1565 PDB-deposited RNA structures, including all NMR models. The RNA FRABASE 2.0 search engine algorithms operate on the database of the RNA sequences and the new library of RNA secondary structures, coded in the dot-bracket format extended to hold multi-stranded structures and to cover residues whose coordinates are missing in the PDB files. The library of RNA secondary structures (and their graphics) is made available. A high level of efficiency of the 3D search has been achieved by introducing novel tools to formulate advanced searching patterns and to screen highly populated tertiary structure elements. RNA FRABASE 2.0 also stores data and conformational parameters in order to provide "on the spot" structural filters to explore the three-dimensional RNA structures. An instant visualization of the 3D RNA structures is provided. RNA FRABASE 2.0 is freely available at http://rnafrabase.cs.put.poznan.pl. RNA FRABASE 2.0 provides a novel database and powerful search engine which is equipped with new data and functionalities that are unavailable elsewhere. Our intention is that this advanced version of the RNA FRABASE will be of interest to all researchers working in the RNA field.

  3. Discrete RNA libraries from pseudo-torsional space

    PubMed Central

    Humphris-Narayanan, Elisabeth

    2012-01-01

    The discovery that RNA molecules can fold into complex structures and carry out diverse cellular roles has led to interest in developing tools for modeling RNA tertiary structure. While significant progress has been made in establishing that the RNA backbone is rotameric, few libraries of discrete conformations specifically for use in RNA modeling have been validated. Here, we present six libraries of discrete RNA conformations based on a simplified pseudo-torsional notation of the RNA backbone, comparable to phi and psi in the protein backbone. We evaluate the ability of each library to represent single nucleotide backbone conformations and we show how individual library fragments can be assembled into dinucleotides that are consistent with established RNA backbone descriptors spanning from sugar to sugar. We then use each library to build all-atom models of 20 test folds and we show how the composition of a fragment library can limit model quality. Despite the limitations inherent in using discretized libraries, we find that several hundred discrete fragments can rebuild RNA folds up to 174 nucleotides in length with atomic-level accuracy (<1.5Å RMSD). We anticipate the libraries presented here could easily be incorporated into RNA structural modeling, analysis, or refinement tools. PMID:22425640

  4. antaRNA: ant colony-based RNA sequence design.

    PubMed

    Kleinkauf, Robert; Mann, Martin; Backofen, Rolf

    2015-10-01

    RNA sequence design is studied at least as long as the classical folding problem. Although for the latter the functional fold of an RNA molecule is to be found ,: inverse folding tries to identify RNA sequences that fold into a function-specific target structure. In combination with RNA-based biotechnology and synthetic biology ,: reliable RNA sequence design becomes a crucial step to generate novel biochemical components. In this article ,: the computational tool antaRNA is presented. It is capable of compiling RNA sequences for a given structure that comply in addition with an adjustable full range objective GC-content distribution ,: specific sequence constraints and additional fuzzy structure constraints. antaRNA applies ant colony optimization meta-heuristics and its superior performance is shown on a biological datasets. http://www.bioinf.uni-freiburg.de/Software/antaRNA CONTACT: backofen@informatik.uni-freiburg.de Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.

  5. Bringing RNA into View: RNA and Its Roles in Biology.

    ERIC Educational Resources Information Center

    Atkins, John F.; Ellington, Andrew; Friedman, B. Ellen; Gesteland, Raymond F.; Noller, Harry F.; Pasquale, Stephen M.; Storey, Richard D.; Uhlenbeck, Olke C.; Weiner, Alan M.

    This guide presents a module for college students on ribonucleic acid (RNA) and its role in biology. The module aims to integrate the latest research and its findings into college-level biology and provide an opportunity for students to understand biological processes. Four activities are presented: (1) "RNA Structure: Tapes to Shapes"; (2) "RNA…

  6. A Data Driven Model for Predicting RNA-Protein Interactions based on Gradient Boosting Machine.

    PubMed

    Jain, Dharm Skandh; Gupte, Sanket Rajan; Aduri, Raviprasad

    2018-06-22

    RNA protein interactions (RPI) play a pivotal role in the regulation of various biological processes. Experimental validation of RPI has been time-consuming, paving the way for computational prediction methods. The major limiting factor of these methods has been the accuracy and confidence of the predictions, and our in-house experiments show that they fail to accurately predict RPI involving short RNA sequences such as TERRA RNA. Here, we present a data-driven model for RPI prediction using a gradient boosting classifier. Amino acids and nucleotides are classified based on the high-resolution structural data of RNA protein complexes. The minimum structural unit consisting of five residues is used as the descriptor. Comparative analysis of existing methods shows the consistently higher performance of our method irrespective of the length of RNA present in the RPI. The method has been successfully applied to map RPI networks involving both long noncoding RNA as well as TERRA RNA. The method is also shown to successfully predict RNA and protein hubs present in RPI networks of four different organisms. The robustness of this method will provide a way for predicting RPI networks of yet unknown interactions for both long noncoding RNA and microRNA.

  7. De novo discovery of structural motifs in RNA 3D structures through clustering.

    PubMed

    Ge, Ping; Islam, Shahidul; Zhong, Cuncong; Zhang, Shaojie

    2018-05-18

    As functional components in three-dimensional (3D) conformation of an RNA, the RNA structural motifs provide an easy way to associate the molecular architectures with their biological mechanisms. In the past years, many computational tools have been developed to search motif instances by using the existing knowledge of well-studied families. Recently, with the rapidly increasing number of resolved RNA 3D structures, there is an urgent need to discover novel motifs with the newly presented information. In this work, we classify all the loops in non-redundant RNA 3D structures to detect plausible RNA structural motif families by using a clustering pipeline. Compared with other clustering approaches, our method has two benefits: first, the underlying alignment algorithm is tolerant to the variations in 3D structures. Second, sophisticated downstream analysis has been performed to ensure the clusters are valid and easily applied to further research. The final clustering results contain many interesting new variants of known motif families, such as GNAA tetraloop, kink-turn, sarcin-ricin and T-loop. We have also discovered potential novel functional motifs conserved in ribosomal RNA, sgRNA, SRP RNA, riboswitch and ribozyme.

  8. Temperature-responsive in vitro RNA structurome of Yersinia pseudotuberculosis.

    PubMed

    Righetti, Francesco; Nuss, Aaron M; Twittenhoff, Christian; Beele, Sascha; Urban, Kristina; Will, Sebastian; Bernhart, Stephan H; Stadler, Peter F; Dersch, Petra; Narberhaus, Franz

    2016-06-28

    RNA structures are fundamentally important for RNA function. Dynamic, condition-dependent structural changes are able to modulate gene expression as shown for riboswitches and RNA thermometers. By parallel analysis of RNA structures, we mapped the RNA structurome of Yersinia pseudotuberculosis at three different temperatures. This human pathogen is exquisitely responsive to host body temperature (37 °C), which induces a major metabolic transition. Our analysis profiles the structure of more than 1,750 RNAs at 25 °C, 37 °C, and 42 °C. Average mRNAs tend to be unstructured around the ribosome binding site. We searched for 5'-UTRs that are folded at low temperature and identified novel thermoresponsive RNA structures from diverse gene categories. The regulatory potential of 16 candidates was validated. In summary, we present a dynamic bacterial RNA structurome and find that the expression of virulence-relevant functions in Y. pseudotuberculosis and reprogramming of its metabolism in response to temperature is associated with a restructuring of numerous mRNAs.

  9. nextPARS: parallel probing of RNA structures in Illumina

    PubMed Central

    Saus, Ester; Willis, Jesse R.; Pryszcz, Leszek P.; Hafez, Ahmed; Llorens, Carlos; Himmelbauer, Heinz

    2018-01-01

    RNA molecules play important roles in virtually every cellular process. These functions are often mediated through the adoption of specific structures that enable RNAs to interact with other molecules. Thus, determining the secondary structures of RNAs is central to understanding their function and evolution. In recent years several sequencing-based approaches have been developed that allow probing structural features of thousands of RNA molecules present in a sample. Here, we describe nextPARS, a novel Illumina-based implementation of in vitro parallel probing of RNA structures. Our approach achieves comparable accuracy to previous implementations, while enabling higher throughput and sample multiplexing. PMID:29358234

  10. RNA-Puzzles Round II: assessment of RNA structure prediction programs applied to three large RNA structures

    PubMed Central

    Miao, Zhichao; Adamiak, Ryszard W.; Blanchet, Marc-Frédérick; Boniecki, Michal; Bujnicki, Janusz M.; Chen, Shi-Jie; Cheng, Clarence; Chojnowski, Grzegorz; Chou, Fang-Chieh; Cordero, Pablo; Cruz, José Almeida; Ferré-D'Amaré, Adrian R.; Das, Rhiju; Ding, Feng; Dokholyan, Nikolay V.; Dunin-Horkawicz, Stanislaw; Kladwang, Wipapat; Krokhotin, Andrey; Lach, Grzegorz; Magnus, Marcin; Major, François; Mann, Thomas H.; Masquida, Benoît; Matelska, Dorota; Meyer, Mélanie; Peselis, Alla; Popenda, Mariusz; Purzycka, Katarzyna J.; Serganov, Alexander; Stasiewicz, Juliusz; Szachniuk, Marta; Tandon, Arpit; Tian, Siqi; Wang, Jian; Xiao, Yi; Xu, Xiaojun; Zhang, Jinwei; Zhao, Peinan; Zok, Tomasz; Westhof, Eric

    2015-01-01

    This paper is a report of a second round of RNA-Puzzles, a collective and blind experiment in three-dimensional (3D) RNA structure prediction. Three puzzles, Puzzles 5, 6, and 10, represented sequences of three large RNA structures with limited or no homology with previously solved RNA molecules. A lariat-capping ribozyme, as well as riboswitches complexed to adenosylcobalamin and tRNA, were predicted by seven groups using RNAComposer, ModeRNA/SimRNA, Vfold, Rosetta, DMD, MC-Fold, 3dRNA, and AMBER refinement. Some groups derived models using data from state-of-the-art chemical-mapping methods (SHAPE, DMS, CMCT, and mutate-and-map). The comparisons between the predictions and the three subsequently released crystallographic structures, solved at diffraction resolutions of 2.5–3.2 Å, were carried out automatically using various sets of quality indicators. The comparisons clearly demonstrate the state of present-day de novo prediction abilities as well as the limitations of these state-of-the-art methods. All of the best prediction models have similar topologies to the native structures, which suggests that computational methods for RNA structure prediction can already provide useful structural information for biological problems. However, the prediction accuracy for non-Watson–Crick interactions, key to proper folding of RNAs, is low and some predicted models had high Clash Scores. These two difficulties point to some of the continuing bottlenecks in RNA structure prediction. All submitted models are available for download at http://ahsoka.u-strasbg.fr/rnapuzzles/. PMID:25883046

  11. Turning limited experimental information into 3D models of RNA.

    PubMed

    Flores, Samuel Coulbourn; Altman, Russ B

    2010-09-01

    Our understanding of RNA functions in the cell is evolving rapidly. As for proteins, the detailed three-dimensional (3D) structure of RNA is often key to understanding its function. Although crystallography and nuclear magnetic resonance (NMR) can determine the atomic coordinates of some RNA structures, many 3D structures present technical challenges that make these methods difficult to apply. The great flexibility of RNA, its charged backbone, dearth of specific surface features, and propensity for kinetic traps all conspire with its long folding time, to challenge in silico methods for physics-based folding. On the other hand, base-pairing interactions (either in runs to form helices or isolated tertiary contacts) and motifs are often available from relatively low-cost experiments or informatics analyses. We present RNABuilder, a novel code that uses internal coordinate mechanics to satisfy user-specified base pairing and steric forces under chemical constraints. The code recapitulates the topology and characteristic L-shape of tRNA and obtains an accurate noncrystallographic structure of the Tetrahymena ribozyme P4/P6 domain. The algorithm scales nearly linearly with molecule size, opening the door to the modeling of significantly larger structures.

  12. Nanomanipulation of Single RNA Molecules by Optical Tweezers

    PubMed Central

    Stephenson, William; Wan, Gorby; Tenenbaum, Scott A.; Li, Pan T. X.

    2014-01-01

    A large portion of the human genome is transcribed but not translated. In this post genomic era, regulatory functions of RNA have been shown to be increasingly important. As RNA function often depends on its ability to adopt alternative structures, it is difficult to predict RNA three-dimensional structures directly from sequence. Single-molecule approaches show potentials to solve the problem of RNA structural polymorphism by monitoring molecular structures one molecule at a time. This work presents a method to precisely manipulate the folding and structure of single RNA molecules using optical tweezers. First, methods to synthesize molecules suitable for single-molecule mechanical work are described. Next, various calibration procedures to ensure the proper operations of the optical tweezers are discussed. Next, various experiments are explained. To demonstrate the utility of the technique, results of mechanically unfolding RNA hairpins and a single RNA kissing complex are used as evidence. In these examples, the nanomanipulation technique was used to study folding of each structural domain, including secondary and tertiary, independently. Lastly, the limitations and future applications of the method are discussed. PMID:25177917

  13. Global Organization of a Positive-strand RNA Virus Genome

    PubMed Central

    Wu, Baodong; Grigull, Jörg; Ore, Moriam O.; Morin, Sylvie; White, K. Andrew

    2013-01-01

    The genomes of plus-strand RNA viruses contain many regulatory sequences and structures that direct different viral processes. The traditional view of these RNA elements are as local structures present in non-coding regions. However, this view is changing due to the discovery of regulatory elements in coding regions and functional long-range intra-genomic base pairing interactions. The ∼4.8 kb long RNA genome of the tombusvirus tomato bushy stunt virus (TBSV) contains these types of structural features, including six different functional long-distance interactions. We hypothesized that to achieve these multiple interactions this viral genome must utilize a large-scale organizational strategy and, accordingly, we sought to assess the global conformation of the entire TBSV genome. Atomic force micrographs of the genome indicated a mostly condensed structure composed of interconnected protrusions extending from a central hub. This configuration was consistent with the genomic secondary structure model generated using high-throughput selective 2′-hydroxyl acylation analysed by primer extension (i.e. SHAPE), which predicted different sized RNA domains originating from a central region. Known RNA elements were identified in both domain and inter-domain regions, and novel structural features were predicted and functionally confirmed. Interestingly, only two of the six long-range interactions known to form were present in the structural model. However, for those interactions that did not form, complementary partner sequences were positioned relatively close to each other in the structure, suggesting that the secondary structure level of viral genome structure could provide a basic scaffold for the formation of different long-range interactions. The higher-order structural model for the TBSV RNA genome provides a snapshot of the complex framework that allows multiple functional components to operate in concert within a confined context. PMID:23717202

  14. Thermodynamic heuristics with case-based reasoning: combined insights for RNA pseudoknot secondary structure.

    PubMed

    Al-Khatib, Ra'ed M; Rashid, Nur'Aini Abdul; Abdullah, Rosni

    2011-08-01

    The secondary structure of RNA pseudoknots has been extensively inferred and scrutinized by computational approaches. Experimental methods for determining RNA structure are time consuming and tedious; therefore, predictive computational approaches are required. Predicting the most accurate and energy-stable pseudoknot RNA secondary structure has been proven to be an NP-hard problem. In this paper, a new RNA folding approach, termed MSeeker, is presented; it includes KnotSeeker (a heuristic method) and Mfold (a thermodynamic algorithm). The global optimization of this thermodynamic heuristic approach was further enhanced by using a case-based reasoning technique as a local optimization method. MSeeker is a proposed algorithm for predicting RNA pseudoknot structure from individual sequences, especially long ones. This research demonstrates that MSeeker improves the sensitivity and specificity of existing RNA pseudoknot structure predictions. The performance and structural results from this proposed method were evaluated against seven other state-of-the-art pseudoknot prediction methods. The MSeeker method had better sensitivity than the DotKnot, FlexStem, HotKnots, pknotsRG, ILM, NUPACK and pknotsRE methods, with 79% of the predicted pseudoknot base-pairs being correct.

  15. A new model for approximating RNA folding trajectories and population kinetics

    NASA Astrophysics Data System (ADS)

    Kirkpatrick, Bonnie; Hajiaghayi, Monir; Condon, Anne

    2013-01-01

    RNA participates both in functional aspects of the cell and in gene regulation. The interactions of these molecules are mediated by their secondary structure which can be viewed as a planar circle graph with arcs for all the chemical bonds between pairs of bases in the RNA sequence. The problem of predicting RNA secondary structure, specifically the chemically most probable structure, has many useful and efficient algorithms. This leaves RNA folding, the problem of predicting the dynamic behavior of RNA structure over time, as the main open problem. RNA folding is important for functional understanding because some RNA molecules change secondary structure in response to interactions with the environment. The full RNA folding model on at most O(3n) secondary structures is the gold standard. We present a new subset approximation model for the full model, give methods to analyze its accuracy and discuss the relative merits of our model as compared with a pre-existing subset approximation. The main advantage of our model is that it generates Monte Carlo folding pathways with the same probabilities with which they are generated under the full model. The pre-existing subset approximation does not have this property.

  16. Simultaneous NMR characterisation of multiple minima in the free energy landscape of an RNA UUCG tetraloop.

    PubMed

    Borkar, Aditi N; Vallurupalli, Pramodh; Camilloni, Carlo; Kay, Lewis E; Vendruscolo, Michele

    2017-01-25

    RNA molecules in solution tend to undergo structural fluctuations of relatively large amplitude and to populate a range of different conformations some of which with low populations. It is still very challenging, however, to characterise the structures of these low populated states and to understand their functional roles. In the present study, we address this problem by using NMR residual dipolar couplings (RDCs) as structural restraints in replica-averaged metadynamics (RAM) simulations. By applying this approach to a 14-mer RNA hairpin containing the prototypical UUCG tetraloop motif, we show that it is possible to construct the free energy landscape of this RNA molecule. This free energy landscapes reveals the surprisingly rich dynamics of the UUCG tetraloop and identifies the multiple substates that exist in equilibrium owing to thermal fluctuations. The approach that we present is general and can be applied to the study of the free energy landscapes of other RNA or RNA-protein systems.

  17. Insights into Structural and Mechanistic Features of Viral IRES Elements

    PubMed Central

    Martinez-Salas, Encarnacion; Francisco-Velilla, Rosario; Fernandez-Chamorro, Javier; Embarek, Azman M.

    2018-01-01

    Internal ribosome entry site (IRES) elements are cis-acting RNA regions that promote internal initiation of protein synthesis using cap-independent mechanisms. However, distinct types of IRES elements present in the genome of various RNA viruses perform the same function despite lacking conservation of sequence and secondary RNA structure. Likewise, IRES elements differ in host factor requirement to recruit the ribosomal subunits. In spite of this diversity, evolutionarily conserved motifs in each family of RNA viruses preserve sequences impacting on RNA structure and RNA–protein interactions important for IRES activity. Indeed, IRES elements adopting remarkable different structural organizations contain RNA structural motifs that play an essential role in recruiting ribosomes, initiation factors and/or RNA-binding proteins using different mechanisms. Therefore, given that a universal IRES motif remains elusive, it is critical to understand how diverse structural motifs deliver functions relevant for IRES activity. This will be useful for understanding the molecular mechanisms beyond cap-independent translation, as well as the evolutionary history of these regulatory elements. Moreover, it could improve the accuracy to predict IRES-like motifs hidden in genome sequences. This review summarizes recent advances on the diversity and biological relevance of RNA structural motifs for viral IRES elements. PMID:29354113

  18. K-Partite RNA Secondary Structures

    NASA Astrophysics Data System (ADS)

    Jiang, Minghui; Tejada, Pedro J.; Lasisi, Ramoni O.; Cheng, Shanhong; Fechser, D. Scott

    RNA secondary structure prediction is a fundamental problem in structural bioinformatics. The prediction problem is difficult because RNA secondary structures may contain pseudoknots formed by crossing base pairs. We introduce k-partite secondary structures as a simple classification of RNA secondary structures with pseudoknots. An RNA secondary structure is k-partite if it is the union of k pseudoknot-free sub-structures. Most known RNA secondary structures are either bipartite or tripartite. We show that there exists a constant number k such that any secondary structure can be modified into a k-partite secondary structure with approximately the same free energy. This offers a partial explanation of the prevalence of k-partite secondary structures with small k. We give a complete characterization of the computational complexities of recognizing k-partite secondary structures for all k ≥ 2, and show that this recognition problem is essentially the same as the k-colorability problem on circle graphs. We present two simple heuristics, iterated peeling and first-fit packing, for finding k-partite RNA secondary structures. For maximizing the number of base pair stackings, our iterated peeling heuristic achieves a constant approximation ratio of at most k for 2 ≤ k ≤ 5, and at most frac6{1-(1-6/k)^k} le frac6{1-e^{-6}} < 6.01491 for k ≥ 6. Experiment on sequences from PseudoBase shows that our first-fit packing heuristic outperforms the leading method HotKnots in predicting RNA secondary structures with pseudoknots. Source code, data set, and experimental results are available at http://www.cs.usu.edu/ mjiang/rna/kpartite/.

  19. A parallel implementation of the Wuchty algorithm with additional experimental filters to more thoroughly explore RNA conformational space.

    PubMed

    Stone, Jonathan W; Bleckley, Samuel; Lavelle, Sean; Schroeder, Susan J

    2015-01-01

    We present new modifications to the Wuchty algorithm in order to better define and explore possible conformations for an RNA sequence. The new features, including parallelization, energy-independent lonely pair constraints, context-dependent chemical probing constraints, helix filters, and optional multibranch loops, provide useful tools for exploring the landscape of RNA folding. Chemical probing alone may not necessarily define a single unique structure. The helix filters and optional multibranch loops are global constraints on RNA structure that are an especially useful tool for generating models of encapsidated viral RNA for which cryoelectron microscopy or crystallography data may be available. The computations generate a combinatorially complete set of structures near a free energy minimum and thus provide data on the density and diversity of structures near the bottom of a folding funnel for an RNA sequence. The conformational landscapes for some RNA sequences may resemble a low, wide basin rather than a steep funnel that converges to a single structure.

  20. New algorithms to represent complex pseudoknotted RNA structures in dot-bracket notation.

    PubMed

    Antczak, Maciej; Popenda, Mariusz; Zok, Tomasz; Zurkowski, Michal; Adamiak, Ryszard W; Szachniuk, Marta

    2018-04-15

    Understanding the formation, architecture and roles of pseudoknots in RNA structures are one of the most difficult challenges in RNA computational biology and structural bioinformatics. Methods predicting pseudoknots typically perform this with poor accuracy, often despite experimental data incorporation. Existing bioinformatic approaches differ in terms of pseudoknots' recognition and revealing their nature. A few ways of pseudoknot classification exist, most common ones refer to a genus or order. Following the latter one, we propose new algorithms that identify pseudoknots in RNA structure provided in BPSEQ format, determine their order and encode in dot-bracket-letter notation. The proposed encoding aims to illustrate the hierarchy of RNA folding. New algorithms are based on dynamic programming and hybrid (combining exhaustive search and random walk) approaches. They evolved from elementary algorithm implemented within the workflow of RNA FRABASE 1.0, our database of RNA structure fragments. They use different scoring functions to rank dissimilar dot-bracket representations of RNA structure. Computational experiments show an advantage of new methods over the others, especially for large RNA structures. Presented algorithms have been implemented as new functionality of RNApdbee webserver and are ready to use at http://rnapdbee.cs.put.poznan.pl. mszachniuk@cs.put.poznan.pl. Supplementary data are available at Bioinformatics online.

  1. RNACompress: Grammar-based compression and informational complexity measurement of RNA secondary structure.

    PubMed

    Liu, Qi; Yang, Yu; Chen, Chun; Bu, Jiajun; Zhang, Yin; Ye, Xiuzi

    2008-03-31

    With the rapid emergence of RNA databases and newly identified non-coding RNAs, an efficient compression algorithm for RNA sequence and structural information is needed for the storage and analysis of such data. Although several algorithms for compressing DNA sequences have been proposed, none of them are suitable for the compression of RNA sequences with their secondary structures simultaneously. This kind of compression not only facilitates the maintenance of RNA data, but also supplies a novel way to measure the informational complexity of RNA structural data, raising the possibility of studying the relationship between the functional activities of RNA structures and their complexities, as well as various structural properties of RNA based on compression. RNACompress employs an efficient grammar-based model to compress RNA sequences and their secondary structures. The main goals of this algorithm are two fold: (1) present a robust and effective way for RNA structural data compression; (2) design a suitable model to represent RNA secondary structure as well as derive the informational complexity of the structural data based on compression. Our extensive tests have shown that RNACompress achieves a universally better compression ratio compared with other sequence-specific or common text-specific compression algorithms, such as Gencompress, winrar and gzip. Moreover, a test of the activities of distinct GTP-binding RNAs (aptamers) compared with their structural complexity shows that our defined informational complexity can be used to describe how complexity varies with activity. These results lead to an objective means of comparing the functional properties of heteropolymers from the information perspective. A universal algorithm for the compression of RNA secondary structure as well as the evaluation of its informational complexity is discussed in this paper. We have developed RNACompress, as a useful tool for academic users. Extensive tests have shown that RNACompress is a universally efficient algorithm for the compression of RNA sequences with their secondary structures. RNACompress also serves as a good measurement of the informational complexity of RNA secondary structure, which can be used to study the functional activities of RNA molecules.

  2. RNACompress: Grammar-based compression and informational complexity measurement of RNA secondary structure

    PubMed Central

    Liu, Qi; Yang, Yu; Chen, Chun; Bu, Jiajun; Zhang, Yin; Ye, Xiuzi

    2008-01-01

    Background With the rapid emergence of RNA databases and newly identified non-coding RNAs, an efficient compression algorithm for RNA sequence and structural information is needed for the storage and analysis of such data. Although several algorithms for compressing DNA sequences have been proposed, none of them are suitable for the compression of RNA sequences with their secondary structures simultaneously. This kind of compression not only facilitates the maintenance of RNA data, but also supplies a novel way to measure the informational complexity of RNA structural data, raising the possibility of studying the relationship between the functional activities of RNA structures and their complexities, as well as various structural properties of RNA based on compression. Results RNACompress employs an efficient grammar-based model to compress RNA sequences and their secondary structures. The main goals of this algorithm are two fold: (1) present a robust and effective way for RNA structural data compression; (2) design a suitable model to represent RNA secondary structure as well as derive the informational complexity of the structural data based on compression. Our extensive tests have shown that RNACompress achieves a universally better compression ratio compared with other sequence-specific or common text-specific compression algorithms, such as Gencompress, winrar and gzip. Moreover, a test of the activities of distinct GTP-binding RNAs (aptamers) compared with their structural complexity shows that our defined informational complexity can be used to describe how complexity varies with activity. These results lead to an objective means of comparing the functional properties of heteropolymers from the information perspective. Conclusion A universal algorithm for the compression of RNA secondary structure as well as the evaluation of its informational complexity is discussed in this paper. We have developed RNACompress, as a useful tool for academic users. Extensive tests have shown that RNACompress is a universally efficient algorithm for the compression of RNA sequences with their secondary structures. RNACompress also serves as a good measurement of the informational complexity of RNA secondary structure, which can be used to study the functional activities of RNA molecules. PMID:18373878

  3. Structural landscape of base pairs containing post-transcriptional modifications in RNA

    PubMed Central

    Seelam, Preethi P.; Sharma, Purshotam

    2017-01-01

    Base pairs involving post-transcriptionally modified nucleobases are believed to play important roles in a wide variety of functional RNAs. Here we present our attempts toward understanding the structural and functional role of naturally occurring modified base pairs using a combination of X-ray crystal structure database analysis, sequence analysis, and advanced quantum chemical methods. Our bioinformatics analysis reveals that despite their presence in all major secondary structural elements, modified base pairs are most prevalent in tRNA crystal structures and most commonly involve guanine or uridine modifications. Further, analysis of tRNA sequences reveals additional examples of modified base pairs at structurally conserved tRNA regions and highlights the conservation patterns of these base pairs in three domains of life. Comparison of structures and binding energies of modified base pairs with their unmodified counterparts, using quantum chemical methods, allowed us to classify the base modifications in terms of the nature of their electronic structure effects on base-pairing. Analysis of specific structural contexts of modified base pairs in RNA crystal structures revealed several interesting scenarios, including those at the tRNA:rRNA interface, antibiotic-binding sites on the ribosome, and the three-way junctions within tRNA. These scenarios, when analyzed in the context of available experimental data, allowed us to correlate the occurrence and strength of modified base pairs with their specific functional roles. Overall, our study highlights the structural importance of modified base pairs in RNA and points toward the need for greater appreciation of the role of modified bases and their interactions, in the context of many biological processes involving RNA. PMID:28341704

  4. Exact calculation of loop formation probability identifies folding motifs in RNA secondary structures

    PubMed Central

    Sloma, Michael F.; Mathews, David H.

    2016-01-01

    RNA secondary structure prediction is widely used to analyze RNA sequences. In an RNA partition function calculation, free energy nearest neighbor parameters are used in a dynamic programming algorithm to estimate statistical properties of the secondary structure ensemble. Previously, partition functions have largely been used to estimate the probability that a given pair of nucleotides form a base pair, the conditional stacking probability, the accessibility to binding of a continuous stretch of nucleotides, or a representative sample of RNA structures. Here it is demonstrated that an RNA partition function can also be used to calculate the exact probability of formation of hairpin loops, internal loops, bulge loops, or multibranch loops at a given position. This calculation can also be used to estimate the probability of formation of specific helices. Benchmarking on a set of RNA sequences with known secondary structures indicated that loops that were calculated to be more probable were more likely to be present in the known structure than less probable loops. Furthermore, highly probable loops are more likely to be in the known structure than the set of loops predicted in the lowest free energy structures. PMID:27852924

  5. DMS-MaPseq for genome-wide or targeted RNA structure probing in vivo.

    PubMed

    Zubradt, Meghan; Gupta, Paromita; Persad, Sitara; Lambowitz, Alan M; Weissman, Jonathan S; Rouskin, Silvi

    2017-01-01

    Coupling of structure-specific in vivo chemical modification to next-generation sequencing is transforming RNA secondary structure studies in living cells. The dominant strategy for detecting in vivo chemical modifications uses reverse transcriptase truncation products, which introduce biases and necessitate population-average assessments of RNA structure. Here we present dimethyl sulfate (DMS) mutational profiling with sequencing (DMS-MaPseq), which encodes DMS modifications as mismatches using a thermostable group II intron reverse transcriptase. DMS-MaPseq yields a high signal-to-noise ratio, can report multiple structural features per molecule, and allows both genome-wide studies and focused in vivo investigations of even low-abundance RNAs. We apply DMS-MaPseq for the first analysis of RNA structure within an animal tissue and to identify a functional structure involved in noncanonical translation initiation. Additionally, we use DMS-MaPseq to compare the in vivo structure of pre-mRNAs with their mature isoforms. These applications illustrate DMS-MaPseq's capacity to dramatically expand in vivo analysis of RNA structure.

  6. Structural basis of RNA folding and recognition in an AMP-RNA aptamer complex.

    PubMed

    Jiang, F; Kumar, R A; Jones, R A; Patel, D J

    1996-07-11

    The catalytic properties of RNA and its well known role in gene expression and regulation are the consequence of its unique solution structures. Identification of the structural determinants of ligand recognition by RNA molecules is of fundamental importance for understanding the biological functions of RNA, as well as for the rational design of RNA Sequences with specific catalytic activities. Towards this latter end, Szostak et al. used in vitro selection techniques to isolate RNA sequences ('aptamers') containing a high-affinity binding site for ATP, the universal currency of cellular energy, and then used this motif to engineer ribozymes with polynucleotide kinase activity. Here we present the solution structure, as determined by multidimensional NMR spectroscopy and molecular dynamics calculations, of both uniformly and specifically 13C-, 15N-labelled 40-mer RNA containing the ATP-binding motif complexed with AMP. The aptamer adopts an L-shaped structure with two nearly orthogonal stems, each capped proximally by a G x G mismatch pair, binding the AMP ligand at their junction in a GNRA-like motif.

  7. Exact calculation of loop formation probability identifies folding motifs in RNA secondary structures.

    PubMed

    Sloma, Michael F; Mathews, David H

    2016-12-01

    RNA secondary structure prediction is widely used to analyze RNA sequences. In an RNA partition function calculation, free energy nearest neighbor parameters are used in a dynamic programming algorithm to estimate statistical properties of the secondary structure ensemble. Previously, partition functions have largely been used to estimate the probability that a given pair of nucleotides form a base pair, the conditional stacking probability, the accessibility to binding of a continuous stretch of nucleotides, or a representative sample of RNA structures. Here it is demonstrated that an RNA partition function can also be used to calculate the exact probability of formation of hairpin loops, internal loops, bulge loops, or multibranch loops at a given position. This calculation can also be used to estimate the probability of formation of specific helices. Benchmarking on a set of RNA sequences with known secondary structures indicated that loops that were calculated to be more probable were more likely to be present in the known structure than less probable loops. Furthermore, highly probable loops are more likely to be in the known structure than the set of loops predicted in the lowest free energy structures. © 2016 Sloma and Mathews; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  8. Assembly of Q{beta} viral RNA polymerase with host translational elongation factors EF-Tu and -Ts.

    PubMed

    Takeshita, Daijiro; Tomita, Kozo

    2010-09-07

    Replication and transcription of viral RNA genomes rely on host-donated proteins. Qbeta virus infects Escherichia coli and replicates and transcribes its own genomic RNA by Qbeta replicase. Qbeta replicase requires the virus-encoded RNA-dependent RNA polymerase (beta-subunit), and the host-donated translational elongation factors EF-Tu and -Ts, as active core subunits for its RNA polymerization activity. Here, we present the crystal structure of the core Qbeta replicase, comprising the beta-subunit, EF-Tu and -Ts. The beta-subunit has a right-handed structure, and the EF-Tu:Ts binary complex maintains the structure of the catalytic core crevasse of the beta-subunit through hydrophobic interactions, between the finger and thumb domains of the beta-subunit and domain-2 of EF-Tu and the coiled-coil motif of EF-Ts, respectively. These hydrophobic interactions are required for the expression and assembly of the Qbeta replicase complex. Thus, EF-Tu and -Ts have chaperone-like functions in the maintenance of the structure of the active Qbeta replicase. Modeling of the template RNA and the growing RNA in the catalytic site of the Qbeta replicase structure also suggests that structural changes of the RNAs and EF-Tu:Ts should accompany processive RNA polymerization and that EF-Tu:Ts in the Qbeta replicase could function to modulate the RNA folding and structure.

  9. Principles for Predicting RNA Secondary Structure Design Difficulty.

    PubMed

    Anderson-Lee, Jeff; Fisker, Eli; Kosaraju, Vineet; Wu, Michelle; Kong, Justin; Lee, Jeehyung; Lee, Minjae; Zada, Mathew; Treuille, Adrien; Das, Rhiju

    2016-02-27

    Designing RNAs that form specific secondary structures is enabling better understanding and control of living systems through RNA-guided silencing, genome editing and protein organization. Little is known, however, about which RNA secondary structures might be tractable for downstream sequence design, increasing the time and expense of design efforts due to inefficient secondary structure choices. Here, we present insights into specific structural features that increase the difficulty of finding sequences that fold into a target RNA secondary structure, summarizing the design efforts of tens of thousands of human participants and three automated algorithms (RNAInverse, INFO-RNA and RNA-SSD) in the Eterna massive open laboratory. Subsequent tests through three independent RNA design algorithms (NUPACK, DSS-Opt and MODENA) confirmed the hypothesized importance of several features in determining design difficulty, including sequence length, mean stem length, symmetry and specific difficult-to-design motifs such as zigzags. Based on these results, we have compiled an Eterna100 benchmark of 100 secondary structure design challenges that span a large range in design difficulty to help test future efforts. Our in silico results suggest new routes for improving computational RNA design methods and for extending these insights to assess "designability" of single RNA structures, as well as of switches for in vitro and in vivo applications. Copyright © 2015 The Authors. Published by Elsevier Ltd.. All rights reserved.

  10. RNase MRP and the RNA processing cascade in the eukaryotic ancestor.

    PubMed

    Woodhams, Michael D; Stadler, Peter F; Penny, David; Collins, Lesley J

    2007-02-08

    Within eukaryotes there is a complex cascade of RNA-based macromolecules that process other RNA molecules, especially mRNA, tRNA and rRNA. An example is RNase MRP processing ribosomal RNA (rRNA) in ribosome biogenesis. One hypothesis is that this complexity was present early in eukaryotic evolution; an alternative is that an initial simpler network later gained complexity by gene duplication in lineages that led to animals, fungi and plants. Recently there has been a rapid increase in support for the complexity-early theory because the vast majority of these RNA-processing reactions are found throughout eukaryotes, and thus were likely to be present in the last common ancestor of living eukaryotes, herein called the Eukaryotic Ancestor. We present an overview of the RNA processing cascade in the Eukaryotic Ancestor and investigate in particular, RNase MRP which was previously thought to have evolved later in eukaryotes due to its apparent limited distribution in fungi and animals and plants. Recent publications, as well as our own genomic searches, find previously unknown RNase MRP RNAs, indicating that RNase MRP has a wide distribution in eukaryotes. Combining secondary structure and promoter region analysis of RNAs for RNase MRP, along with analysis of the target substrate (rRNA), allows us to discuss this distribution in the light of eukaryotic evolution. We conclude that RNase MRP can now be placed in the RNA-processing cascade of the Eukaryotic Ancestor, highlighting the complexity of RNA-processing in early eukaryotes. Promoter analyses of MRP-RNA suggest that regulation of the critical processes of rRNA cleavage can vary, showing that even these key cellular processes (for which we expect high conservation) show some species-specific variability. We present our consensus MRP-RNA secondary structure as a useful model for further searches.

  11. Four RNA families with functional transient structures

    PubMed Central

    Zhu, Jing Yun A; Meyer, Irmtraud M

    2015-01-01

    Protein-coding and non-coding RNA transcripts perform a wide variety of cellular functions in diverse organisms. Several of their functional roles are expressed and modulated via RNA structure. A given transcript, however, can have more than a single functional RNA structure throughout its life, a fact which has been previously overlooked. Transient RNA structures, for example, are only present during specific time intervals and cellular conditions. We here introduce four RNA families with transient RNA structures that play distinct and diverse functional roles. Moreover, we show that these transient RNA structures are structurally well-defined and evolutionarily conserved. Since Rfam annotates one structure for each family, there is either no annotation for these transient structures or no such family. Thus, our alignments either significantly update and extend the existing Rfam families or introduce a new RNA family to Rfam. For each of the four RNA families, we compile a multiple-sequence alignment based on experimentally verified transient and dominant (dominant in terms of either the thermodynamic stability and/or attention received so far) RNA secondary structures using a combination of automated search via covariance model and manual curation. The first alignment is the Trp operon leader which regulates the operon transcription in response to tryptophan abundance through alternative structures. The second alignment is the HDV ribozyme which we extend to the 5′ flanking sequence. This flanking sequence is involved in the regulation of the transcript's self-cleavage activity. The third alignment is the 5′ UTR of the maturation protein from Levivirus which contains a transient structure that temporarily postpones the formation of the final inhibitory structure to allow translation of maturation protein. The fourth and last alignment is the SAM riboswitch which regulates the downstream gene expression by assuming alternative structures upon binding of SAM. All transient and dominant structures are mapped to our new alignments introduced here. PMID:25751035

  12. Four RNA families with functional transient structures.

    PubMed

    Zhu, Jing Yun A; Meyer, Irmtraud M

    2015-01-01

    Protein-coding and non-coding RNA transcripts perform a wide variety of cellular functions in diverse organisms. Several of their functional roles are expressed and modulated via RNA structure. A given transcript, however, can have more than a single functional RNA structure throughout its life, a fact which has been previously overlooked. Transient RNA structures, for example, are only present during specific time intervals and cellular conditions. We here introduce four RNA families with transient RNA structures that play distinct and diverse functional roles. Moreover, we show that these transient RNA structures are structurally well-defined and evolutionarily conserved. Since Rfam annotates one structure for each family, there is either no annotation for these transient structures or no such family. Thus, our alignments either significantly update and extend the existing Rfam families or introduce a new RNA family to Rfam. For each of the four RNA families, we compile a multiple-sequence alignment based on experimentally verified transient and dominant (dominant in terms of either the thermodynamic stability and/or attention received so far) RNA secondary structures using a combination of automated search via covariance model and manual curation. The first alignment is the Trp operon leader which regulates the operon transcription in response to tryptophan abundance through alternative structures. The second alignment is the HDV ribozyme which we extend to the 5' flanking sequence. This flanking sequence is involved in the regulation of the transcript's self-cleavage activity. The third alignment is the 5' UTR of the maturation protein from Levivirus which contains a transient structure that temporarily postpones the formation of the final inhibitory structure to allow translation of maturation protein. The fourth and last alignment is the SAM riboswitch which regulates the downstream gene expression by assuming alternative structures upon binding of SAM. All transient and dominant structures are mapped to our new alignments introduced here.

  13. Guide-bound structures of an RNA-targeting A-cleaving CRISPR–Cas13a enzyme

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Knott, Gavin J.; East-Seletsky, Alexandra; Cofsky, Joshua C.

    CRISPR adaptive immune systems protect bacteria from infections by deploying CRISPR RNA (crRNA)-guided enzymes to recognize and cut foreign nucleic acids. Type VI-A CRISPR–Cas systems include the Cas13a enzyme, an RNA-activated RNase capable of crRNA processing and single-stranded RNA degradation upon target-transcript binding. Here we present the 2.0-Å resolution crystal structure of a crRNA-bound Lachnospiraceae bacterium Cas13a (LbaCas13a), representing a recently discovered Cas13a enzyme subtype. This structure and accompanying biochemical experiments define the Cas13a catalytic residues that are directly responsible for crRNA maturation. In addition, the orientation of the foreign-derived target-RNA-specifying sequence in the protein interior explains the conformational gatingmore » of Cas13a nuclease activation. These results describe how Cas13a enzymes generate functional crRNAs and how catalytic activity is blocked before target-RNA recognition, with implications for both bacterial immunity and diagnostic applications.« less

  14. Guide-bound structures of an RNA-targeting A-cleaving CRISPR–Cas13a enzyme

    DOE PAGES

    Knott, Gavin J.; East-Seletsky, Alexandra; Cofsky, Joshua C.; ...

    2017-09-11

    CRISPR adaptive immune systems protect bacteria from infections by deploying CRISPR RNA (crRNA)-guided enzymes to recognize and cut foreign nucleic acids. Type VI-A CRISPR–Cas systems include the Cas13a enzyme, an RNA-activated RNase capable of crRNA processing and single-stranded RNA degradation upon target-transcript binding. Here we present the 2.0-Å resolution crystal structure of a crRNA-bound Lachnospiraceae bacterium Cas13a (LbaCas13a), representing a recently discovered Cas13a enzyme subtype. This structure and accompanying biochemical experiments define the Cas13a catalytic residues that are directly responsible for crRNA maturation. In addition, the orientation of the foreign-derived target-RNA-specifying sequence in the protein interior explains the conformational gatingmore » of Cas13a nuclease activation. These results describe how Cas13a enzymes generate functional crRNAs and how catalytic activity is blocked before target-RNA recognition, with implications for both bacterial immunity and diagnostic applications.« less

  15. The structural analysis of the mitochondrial SSUrRNA implies a close phylogenetic relationship between mitochondria from plants and from the heterotrophic alga Prototheca wickerhamii.

    PubMed

    Wolff, G; Kück, U

    1990-04-01

    The gene for the mitochondrial small subunit rRNA (SSUrRNA) from the heterotrophic alga Prototheca wickerhamii has been isolated from a gene library of extranuclear DNA. Sequence and structural analyses allow the determination of a secondary structure model for this rRNA. In addition, several sequence motifs are present which are typically found in SSUrRNAs of various mitochondrial origins. Unexpectedly, the Prototheca RNA sequence has more features in common with mitochondrial SSUrRNAs from plants than with that from the green alga Chlamydomonas reinhardtii. The phylogenetic relationship between mitochondria from plants and algae is discussed.

  16. Mining for recurrent long-range interactions in RNA structures reveals embedded hierarchies in network families.

    PubMed

    Reinharz, Vladimir; Soulé, Antoine; Westhof, Eric; Waldispühl, Jérôme; Denise, Alain

    2018-05-04

    The wealth of the combinatorics of nucleotide base pairs enables RNA molecules to assemble into sophisticated interaction networks, which are used to create complex 3D substructures. These interaction networks are essential to shape the 3D architecture of the molecule, and also to provide the key elements to carry molecular functions such as protein or ligand binding. They are made of organised sets of long-range tertiary interactions which connect distinct secondary structure elements in 3D structures. Here, we present a de novo data-driven approach to extract automatically from large data sets of full RNA 3D structures the recurrent interaction networks (RINs). Our methodology enables us for the first time to detect the interaction networks connecting distinct components of the RNA structure, highlighting their diversity and conservation through non-related functional RNAs. We use a graphical model to perform pairwise comparisons of all RNA structures available and to extract RINs and modules. Our analysis yields a complete catalog of RNA 3D structures available in the Protein Data Bank and reveals the intricate hierarchical organization of the RNA interaction networks and modules. We assembled our results in an online database (http://carnaval.lri.fr) which will be regularly updated. Within the site, a tool allows users with a novel RNA structure to detect automatically whether the novel structure contains previously observed RINs.

  17. ssHMM: extracting intuitive sequence-structure motifs from high-throughput RNA-binding protein data

    PubMed Central

    Krestel, Ralf; Ohler, Uwe; Vingron, Martin; Marsico, Annalisa

    2017-01-01

    Abstract RNA-binding proteins (RBPs) play an important role in RNA post-transcriptional regulation and recognize target RNAs via sequence-structure motifs. The extent to which RNA structure influences protein binding in the presence or absence of a sequence motif is still poorly understood. Existing RNA motif finders either take the structure of the RNA only partially into account, or employ models which are not directly interpretable as sequence-structure motifs. We developed ssHMM, an RNA motif finder based on a hidden Markov model (HMM) and Gibbs sampling which fully captures the relationship between RNA sequence and secondary structure preference of a given RBP. Compared to previous methods which output separate logos for sequence and structure, it directly produces a combined sequence-structure motif when trained on a large set of sequences. ssHMM’s model is visualized intuitively as a graph and facilitates biological interpretation. ssHMM can be used to find novel bona fide sequence-structure motifs of uncharacterized RBPs, such as the one presented here for the YY1 protein. ssHMM reaches a high motif recovery rate on synthetic data, it recovers known RBP motifs from CLIP-Seq data, and scales linearly on the input size, being considerably faster than MEMERIS and RNAcontext on large datasets while being on par with GraphProt. It is freely available on Github and as a Docker image. PMID:28977546

  18. An Adaptive Defect Weighted Sampling Algorithm to Design Pseudoknotted RNA Secondary Structures

    PubMed Central

    Zandi, Kasra; Butler, Gregory; Kharma, Nawwaf

    2016-01-01

    Computational design of RNA sequences that fold into targeted secondary structures has many applications in biomedicine, nanotechnology and synthetic biology. An RNA molecule is made of different types of secondary structure elements and an important RNA element named pseudoknot plays a key role in stabilizing the functional form of the molecule. However, due to the computational complexities associated with characterizing pseudoknotted RNA structures, most of the existing RNA sequence designer algorithms generally ignore this important structural element and therefore limit their applications. In this paper we present a new algorithm to design RNA sequences for pseudoknotted secondary structures. We use NUPACK as the folding algorithm to compute the equilibrium characteristics of the pseudoknotted RNAs, and describe a new adaptive defect weighted sampling algorithm named Enzymer to design low ensemble defect RNA sequences for targeted secondary structures including pseudoknots. We used a biological data set of 201 pseudoknotted structures from the Pseudobase library to benchmark the performance of our algorithm. We compared the quality characteristics of the RNA sequences we designed by Enzymer with the results obtained from the state of the art MODENA and antaRNA. Our results show our method succeeds more frequently than MODENA and antaRNA do, and generates sequences that have lower ensemble defect, lower probability defect and higher thermostability. Finally by using Enzymer and by constraining the design to a naturally occurring and highly conserved Hammerhead motif, we designed 8 sequences for a pseudoknotted cis-acting Hammerhead ribozyme. Enzymer is available for download at https://bitbucket.org/casraz/enzymer. PMID:27499762

  19. SMARTIV: combined sequence and structure de-novo motif discovery for in-vivo RNA binding data.

    PubMed

    Polishchuk, Maya; Paz, Inbal; Yakhini, Zohar; Mandel-Gutfreund, Yael

    2018-05-25

    Gene expression regulation is highly dependent on binding of RNA-binding proteins (RBPs) to their RNA targets. Growing evidence supports the notion that both RNA primary sequence and its local secondary structure play a role in specific Protein-RNA recognition and binding. Despite the great advance in high-throughput experimental methods for identifying sequence targets of RBPs, predicting the specific sequence and structure binding preferences of RBPs remains a major challenge. We present a novel webserver, SMARTIV, designed for discovering and visualizing combined RNA sequence and structure motifs from high-throughput RNA-binding data, generated from in-vivo experiments. The uniqueness of SMARTIV is that it predicts motifs from enriched k-mers that combine information from ranked RNA sequences and their predicted secondary structure, obtained using various folding methods. Consequently, SMARTIV generates Position Weight Matrices (PWMs) in a combined sequence and structure alphabet with assigned P-values. SMARTIV concisely represents the sequence and structure motif content as a single graphical logo, which is informative and easy for visual perception. SMARTIV was examined extensively on a variety of high-throughput binding experiments for RBPs from different families, generated from different technologies, showing consistent and accurate results. Finally, SMARTIV is a user-friendly webserver, highly efficient in run-time and freely accessible via http://smartiv.technion.ac.il/.

  20. A parallel strategy for predicting the secondary structure of polycistronic microRNAs.

    PubMed

    Han, Dianwei; Tang, Guiliang; Zhang, Jun

    2013-01-01

    The biogenesis of a functional microRNA is largely dependent on the secondary structure of the microRNA precursor (pre-miRNA). Recently, it has been shown that microRNAs are present in the genome as the form of polycistronic transcriptional units in plants and animals. It will be important to design efficient computational methods to predict such structures for microRNA discovery and its applications in gene silencing. In this paper, we propose a parallel algorithm based on the master-slave architecture to predict the secondary structure from an input sequence. We conducted some experiments to verify the effectiveness of our parallel algorithm. The experimental results show that our algorithm is able to produce the optimal secondary structure of polycistronic microRNAs.

  1. SETTER: web server for RNA structure comparison

    PubMed Central

    Čech, Petr; Svozil, Daniel; Hoksza, David

    2012-01-01

    The recent discoveries of regulatory non-coding RNAs changed our view of RNA as a simple information transfer molecule. Understanding the architecture and function of active RNA molecules requires methods for comparing and analyzing their 3D structures. While structural alignment of short RNAs is achievable in a reasonable amount of time, large structures represent much bigger challenge. Here, we present the SETTER web server for the RNA structure pairwise comparison utilizing the SETTER (SEcondary sTructure-based TERtiary Structure Similarity Algorithm) algorithm. The SETTER method divides an RNA structure into the set of non-overlapping structural elements called generalized secondary structure units (GSSUs). The SETTER algorithm scales as O(n2) with the size of a GSSUs and as O(n) with the number of GSSUs in the structure. This scaling gives SETTER its high speed as the average size of the GSSU remains constant irrespective of the size of the structure. However, the favorable speed of the algorithm does not compromise its accuracy. The SETTER web server together with the stand-alone implementation of the SETTER algorithm are freely accessible at http://siret.cz/setter. PMID:22693209

  2. Automated classification of RNA 3D motifs and the RNA 3D Motif Atlas

    PubMed Central

    Petrov, Anton I.; Zirbel, Craig L.; Leontis, Neocles B.

    2013-01-01

    The analysis of atomic-resolution RNA three-dimensional (3D) structures reveals that many internal and hairpin loops are modular, recurrent, and structured by conserved non-Watson–Crick base pairs. Structurally similar loops define RNA 3D motifs that are conserved in homologous RNA molecules, but can also occur at nonhomologous sites in diverse RNAs, and which often vary in sequence. To further our understanding of RNA motif structure and sequence variability and to provide a useful resource for structure modeling and prediction, we present a new method for automated classification of internal and hairpin loop RNA 3D motifs and a new online database called the RNA 3D Motif Atlas. To classify the motif instances, a representative set of internal and hairpin loops is automatically extracted from a nonredundant list of RNA-containing PDB files. Their structures are compared geometrically, all-against-all, using the FR3D program suite. The loops are clustered into motif groups, taking into account geometric similarity and structural annotations and making allowance for a variable number of bulged bases. The automated procedure that we have implemented identifies all hairpin and internal loop motifs previously described in the literature. All motif instances and motif groups are assigned unique and stable identifiers and are made available in the RNA 3D Motif Atlas (http://rna.bgsu.edu/motifs), which is automatically updated every four weeks. The RNA 3D Motif Atlas provides an interactive user interface for exploring motif diversity and tools for programmatic data access. PMID:23970545

  3. RNA 3D Modules in Genome-Wide Predictions of RNA 2D Structure

    PubMed Central

    Theis, Corinna; Zirbel, Craig L.; zu Siederdissen, Christian Höner; Anthon, Christian; Hofacker, Ivo L.; Nielsen, Henrik; Gorodkin, Jan

    2015-01-01

    Recent experimental and computational progress has revealed a large potential for RNA structure in the genome. This has been driven by computational strategies that exploit multiple genomes of related organisms to identify common sequences and secondary structures. However, these computational approaches have two main challenges: they are computationally expensive and they have a relatively high false discovery rate (FDR). Simultaneously, RNA 3D structure analysis has revealed modules composed of non-canonical base pairs which occur in non-homologous positions, apparently by independent evolution. These modules can, for example, occur inside structural elements which in RNA 2D predictions appear as internal loops. Hence one question is if the use of such RNA 3D information can improve the prediction accuracy of RNA secondary structure at a genome-wide level. Here, we use RNAz in combination with 3D module prediction tools and apply them on a 13-way vertebrate sequence-based alignment. We find that RNA 3D modules predicted by metaRNAmodules and JAR3D are significantly enriched in the screened windows compared to their shuffled counterparts. The initially estimated FDR of 47.0% is lowered to below 25% when certain 3D module predictions are present in the window of the 2D prediction. We discuss the implications and prospects for further development of computational strategies for detection of RNA 2D structure in genomic sequence. PMID:26509713

  4. On the combinatorics of sparsification.

    PubMed

    Huang, Fenix Wd; Reidys, Christian M

    2012-10-22

    We study the sparsification of dynamic programming based on folding algorithms of RNA structures. Sparsification is a method that improves significantly the computation of minimum free energy (mfe) RNA structures. We provide a quantitative analysis of the sparsification of a particular decomposition rule, Λ∗. This rule splits an interval of RNA secondary and pseudoknot structures of fixed topological genus. Key for quantifying sparsifications is the size of the so called candidate sets. Here we assume mfe-structures to be specifically distributed (see Assumption 1) within arbitrary and irreducible RNA secondary and pseudoknot structures of fixed topological genus. We then present a combinatorial framework which allows by means of probabilities of irreducible sub-structures to obtain the expectation of the Λ∗-candidate set w.r.t. a uniformly random input sequence. We compute these expectations for arc-based energy models via energy-filtered generating functions (GF) in case of RNA secondary structures as well as RNA pseudoknot structures. Furthermore, for RNA secondary structures we also analyze a simplified loop-based energy model. Our combinatorial analysis is then compared to the expected number of Λ∗-candidates obtained from the folding mfe-structures. In case of the mfe-folding of RNA secondary structures with a simplified loop-based energy model our results imply that sparsification provides a significant, constant improvement of 91% (theory) to be compared to an 96% (experimental, simplified arc-based model) reduction. However, we do not observe a linear factor improvement. Finally, in case of the "full" loop-energy model we can report a reduction of 98% (experiment). Sparsification was initially attributed a linear factor improvement. This conclusion was based on the so called polymer-zeta property, which stems from interpreting polymer chains as self-avoiding walks. Subsequent findings however reveal that the O(n) improvement is not correct. The combinatorial analysis presented here shows that, assuming a specific distribution (see Assumption 1), of mfe-structures within irreducible and arbitrary structures, the expected number of Λ∗-candidates is Θ(n2). However, the constant reduction is quite significant, being in the range of 96%. We furthermore show an analogous result for the sparsification of the Λ∗-decomposition rule for RNA pseudoknotted structures of genus one. Finally we observe that the effect of sparsification is sensitive to the employed energy model.

  5. Chemical Approaches for Structure and Function of RNA in Postgenomic Era

    PubMed Central

    Ro-Choi, Tae Suk; Choi, Yong Chun

    2012-01-01

    In the study of cellular RNA chemistry, a major thrust of research focused upon sequence determinations for decades. Structures of snRNAs (4.5S RNA I (Alu), U1, U2, U3, U4, U5, and U6) were determined at Baylor College of Medicine, Houston, Tex, in an earlier time of pregenomic era. They show novel modifications including base methylation, sugar methylation, 5′-cap structures (types 0–III) and sequence heterogeneity. This work offered an exciting problem of posttranscriptional modification and underwent numerous significant advances through technological revolutions during pregenomic, genomic, and postgenomic eras. Presently, snRNA research is making progresses involved in enzymology of snRNA modifications, molecular evolution, mechanism of spliceosome assembly, chemical mechanism of intron removal, high-order structure of snRNA in spliceosome, and pathology of splicing. These works are destined to reach final pathway of work “Function and Structure of Spliceosome” in addition to exciting new exploitation of other noncoding RNAs in all aspects of regulatory functions. PMID:22347623

  6. Importance of Diffuse Metal Ion Binding to RNA

    PubMed Central

    Tan, Zhi-Jie; Chen, Shi-Jie

    2016-01-01

    RNAs are highly charged polyanionic molecules. RNA structure and function are strongly correlated with the ionic condition of the solution. The primary focus of this article is on the role of diffusive ions in RNA folding. Due to the long-range nature of electrostatic interactions, the diffuse ions can contribute significantly to RNA structural stability and folding kinetics. We present an overview of the experimental findings as well as the theoretical developments on the diffuse ion effects in RNA folding. This review places heavy emphasis on the effect of magnesium ions. Magnesium ions play a highly efficient role in stabilizing RNA tertiary structures and promoting tertiary structural folding. The highly efficient role goes beyond the mean-field effect such as the ionic strength. In addition to the effects of specific ion binding and ion dehydration, ion-ion correlation for the diffuse ions can contribute to the efficient role of the multivalent ions such as the magnesium ions in RNA folding. PMID:22010269

  7. Importance of diffuse metal ion binding to RNA.

    PubMed

    Tan, Zhi-Jie; Chen, Shi-Jie

    2011-01-01

    RNAs are highly charged polyanionic molecules. RNA structure and function are strongly correlated with the ionic condition of the solution. The primary focus of this article is on the role of diffusive ions in RNA folding. Due to the long-range nature of electrostatic interactions, the diffuse ions can contribute significantly to RNA structural stability and folding kinetics. We present an overview of the experimental findings as well as the theoretical developments on the diffuse ion effects in RNA folding. This review places heavy emphasis on the effect of magnesium ions. Magnesium ions play a highly efficient role in stabilizing RNA tertiary structures and promoting tertiary structural folding. The highly efficient role goes beyond the mean-field effect such as the ionic strength. In addition to the effects of specific ion binding and ion dehydration, ion-ion correlation for the diffuse ions can contribute to the efficient role of the multivalent ions such as the magnesium ions in RNA folding.

  8. Structural analyses of EBER1 and EBER2 ribonucleoprotein particles present in Epstein-Barr virus-infected cells.

    PubMed Central

    Glickman, J N; Howe, J G; Steitz, J A

    1988-01-01

    The ribonucleoprotein (RNP) particles containing the Epstein-Barr virus-associated small RNAs EBER1 and EBER2 were analyzed to determine their RNA secondary structures and sites of RNA-protein interaction. The secondary structures were probed with nucleases and by chemical modification with single-strand-specific reagents, and the sites of modification or cleavage were mapped by primer extension. These data were used to develop secondary structures for the two RNAs, and likely sites of close RNA-protein contact were identified by comparing modification patterns for naked RNA and RNA in RNP particles. In addition, sites of interaction between each Epstein-Barr virus-encoded RNA (EBER) and the La antigen were identified by analyzing RNA fragments resistant to digestion by RNase A or T1 after immunoprecipitation by an anti-La serum sample from a lupus patient. Our results confirm earlier findings that the La protein binds to the 3' terminus of each molecule. Possible functions for the EBER RNPs are discussed. Images PMID:2828685

  9. The Spot 42 RNA: A regulatory small RNA with roles in the central metabolism.

    PubMed

    Bækkedal, Cecilie; Haugen, Peik

    2015-01-01

    The Spot 42 RNA is a 109 nucleotide long (in Escherichia coli) noncoding small regulatory RNA (sRNA) encoded by the spf (spot fourty-two) gene. spf is found in gamma-proteobacteria and the majority of experimental work on Spot 42 RNA has been performed using E. coli, and recently Aliivibrio salmonicida. In the cell Spot 42 RNA plays essential roles as a regulator in carbohydrate metabolism and uptake, and its expression is activated by glucose, and inhibited by the cAMP-CRP complex. Here we summarize the current knowledge on Spot 42, and present the natural distribution of spf, show family-specific secondary structural features of Spot 42, and link highly conserved structural regions to mRNA target binding.

  10. Structural and functional analysis of 5S rRNA in Saccharomyces cerevisiae

    PubMed Central

    Kiparisov, S.; Sergiev, P. V.; Dontsova, O. A.; Petrov, A.; Meskauskas, A.; Dinman, J. D.

    2005-01-01

    5S rRNA extends from the central protuberance of the large ribosomal subunit, through the A-site finger, and down to the GTPase-associated center. Here, we present a structure-function analysis of seven 5S rRNA alleles which are sufficient for viability in the yeast Saccharomyces cerevisiae when expressed in the absence of wild-type 5S rRNAs, and extend this analysis using a large bank of mutant alleles that show semidominant phenotypes in the presence of wild-type 5S rRNA. This analysis supports the hypothesis that 5S rRNA serves to link together several different functional centers of the ribosome. Data are also presented which suggest that in eukaryotic genomes selection has favored the maintenance of multiple alleles of 5S rRNA, and that these may provide cells with a mechanism to post-transcriptionally regulate gene expression. PMID:16047201

  11. Fast prediction of RNA-RNA interaction using heuristic algorithm.

    PubMed

    Montaseri, Soheila

    2015-01-01

    Interaction between two RNA molecules plays a crucial role in many medical and biological processes such as gene expression regulation. In this process, an RNA molecule prohibits the translation of another RNA molecule by establishing stable interactions with it. Some algorithms have been formed to predict the structure of the RNA-RNA interaction. High computational time is a common challenge in most of the presented algorithms. In this context, a heuristic method is introduced to accurately predict the interaction between two RNAs based on minimum free energy (MFE). This algorithm uses a few dot matrices for finding the secondary structure of each RNA and binding sites between two RNAs. Furthermore, a parallel version of this method is presented. We describe the algorithm's concurrency and parallelism for a multicore chip. The proposed algorithm has been performed on some datasets including CopA-CopT, R1inv-R2inv, Tar-Tar*, DIS-DIS, and IncRNA54-RepZ in Escherichia coli bacteria. The method has high validity and efficiency, and it is run in low computational time in comparison to other approaches.

  12. RNA-dependent RNA polymerase of hepatitis C virus binds to its coding region RNA stem-loop structure, 5BSL3.2, and its negative strand.

    PubMed

    Kanamori, Hiroshi; Yuhashi, Kazuhito; Ohnishi, Shin; Koike, Kazuhiko; Kodama, Tatsuhiko

    2010-05-01

    The hepatitis C virus NS5B RNA-dependent RNA polymerase (RdRp) is a key enzyme involved in viral replication. Interaction between NS5B RdRp and the viral RNA sequence is likely to be an important step in viral RNA replication. The C-terminal half of the NS5B-coding sequence, which contains the important cis-acting replication element, has been identified as an NS5B-binding sequence. In the present study, we confirm the specific binding of NS5B to one of the RNA stem-loop structures in the region, 5BSL3.2. In addition, we show that NS5B binds to the complementary strand of 5BSL3.2 (5BSL3.2N). The bulge structure of 5BSL3.2N was shown to be indispensable for tight binding to NS5B. In vitro RdRp activity was inhibited by 5BSL3.2N, indicating the importance of the RNA element in the polymerization by RdRp. These results suggest the involvement of the RNA stem-loop structure of the negative strand in the replication process.

  13. Crystal Structures of the E. coli Transcription Initiation Complexes with a Complete Bubble

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zuo, Yuhong; Steitz, Thomas A.

    2015-05-01

    During transcription initiation, RNA polymerase binds to promoter DNA to form an initiation complex containing a DNA bubble and enters into abortive cycles of RNA synthesis before escaping the promoter to transit into the elongation phase for processive RNA synthesis. Here we present the crystal structures of E. coli transcription initiation complexes containing a complete transcription bubble and de novo synthesized RNA oligonucleotides at about 6-Å resolution. The structures show how RNA polymerase recognizes DNA promoters that contain spacers of different lengths and reveal a bridging interaction between the 5'-triphosphate of the nascent RNA and the σ factor that maymore » function to stabilize the short RNA-DNA hybrids during the early stage of transcription initiation. The conformation of the RNA oligonucleotides and the paths of the DNA strands in the complete initiation complexes provide insights into the mechanism that controls both the abortive and productive RNA synthesis.« less

  14. F-RAG: Generating Atomic Coordinates from RNA Graphs by Fragment Assembly.

    PubMed

    Jain, Swati; Schlick, Tamar

    2017-11-24

    Coarse-grained models represent attractive approaches to analyze and simulate ribonucleic acid (RNA) molecules, for example, for structure prediction and design, as they simplify the RNA structure to reduce the conformational search space. Our structure prediction protocol RAGTOP (RNA-As-Graphs Topology Prediction) represents RNA structures as tree graphs and samples graph topologies to produce candidate graphs. However, for a more detailed study and analysis, construction of atomic from coarse-grained models is required. Here we present our graph-based fragment assembly algorithm (F-RAG) to convert candidate three-dimensional (3D) tree graph models, produced by RAGTOP into atomic structures. We use our related RAG-3D utilities to partition graphs into subgraphs and search for structurally similar atomic fragments in a data set of RNA 3D structures. The fragments are edited and superimposed using common residues, full atomic models are scored using RAGTOP's knowledge-based potential, and geometries of top scoring models is optimized. To evaluate our models, we assess all-atom RMSDs and Interaction Network Fidelity (a measure of residue interactions) with respect to experimentally solved structures and compare our results to other fragment assembly programs. For a set of 50 RNA structures, we obtain atomic models with reasonable geometries and interactions, particularly good for RNAs containing junctions. Additional improvements to our protocol and databases are outlined. These results provide a good foundation for further work on RNA structure prediction and design applications. Copyright © 2017 Elsevier Ltd. All rights reserved.

  15. 5' termini of poliovirus RNA: difference between virion and nonencapsidated 35S RNA.

    PubMed Central

    Fernandez-Muñoz, R; Lavi, U

    1977-01-01

    Poliovirus cytoplasmic, nonencapsidated 35S RNA yields approximately one pUp per molecule upon T2 RNase digestion, indicating that this RNA has the same 5' end as the polyribosome-associated viral RNA fraction. Double-stranded, replicative form RNA after the same treatment yielded approximately four pNp structures per molecule, 65% of which was pUp. In contrast, the 35S RNA from mature virions contained no detectable pNp, indicating that the 5' end of the virion RNA is different from that of the nonencapsidated RNA. None of the above molecules contained pppNp, ppNp, or GpppNp structures present in host mRNA. The virion RNA molecules, as we have shown previously for thenonencapsidated 35S viral RNA (Fernandez-Muñoz and Darnell, 1976), is not labeled with [methyl-3H]methionine. PMID:189096

  16. A surprisingly large RNase P RNA in Candida glabrata

    PubMed Central

    KACHOURI, RYM; STRIBINSKIS, VILIUS; ZHU, YANGLONG; RAMOS, KENNETH S.; WESTHOF, ERIC; LI, YONG

    2005-01-01

    We have found an extremely large ribonuclease P (RNase P) RNA (RPR1) in the human pathogen Candida glabrata and verified that this molecule is expressed and present in the active enzyme complex of this hemiascomycete yeast. A structural alignment of the C. glabrata sequence with 36 other hemiascomycete RNase P RNAs (abbreviated as P RNAs) allows us to characterize the types of insertions. In addition, 15 P RNA sequences were newly characterized by searching in the recently sequenced genomes Candida albicans, C. glabrata, Debaryomyces hansenii, Eremothecium gossypii, Kluyveromyces lactis, Kluyveromyces waltii, Naumovia castellii, Saccharomyces kudriavzevii, Saccharomyces mikatae, and Yarrowia lipolytica; and by PCR amplification for other Candida species (Candida guilliermondii, Candida krusei, Candida parapsilosis, Candida stellatoidea, and Candida tropicalis). The phylogenetic comparative analysis identifies a hemiascomycete secondary structure consensus that presents a conserved core in all species with variable insertions or deletions. The most significant variability is found in C. glabrata P RNA in which three insertions exceeding in total 700 nt are present in the Specificity domain. This P RNA is more than twice the length of any other homologous P RNAs known in the three domains of life and is eight times the size of the smallest. RNase P RNA, therefore, represents one of the most diversified noncoding RNAs in terms of size variation and structural diversity. PMID:15987816

  17. RSRE: RNA structural robustness evaluator

    PubMed Central

    Shu, Wenjie; Zheng, Zhiqiang; Wang, Shengqi

    2007-01-01

    Biological robustness, defined as the ability to maintain stable functioning in the face of various perturbations, is an important and fundamental topic in current biology, and has become a focus of numerous studies in recent years. Although structural robustness has been explored in several types of RNA molecules, the origins of robustness are still controversial. Computational analysis results are needed to make up for the lack of evidence of robustness in natural biological systems. The RNA structural robustness evaluator (RSRE) web server presented here provides a freely available online tool to quantitatively evaluate the structural robustness of RNA based on the widely accepted definition of neutrality. Several classical structure comparison methods are employed; five randomization methods are implemented to generate control sequences; sub-optimal predicted structures can be optionally utilized to mitigate the uncertainty of secondary structure prediction. With a user-friendly interface, the web application is easy to use. Intuitive illustrations are provided along with the original computational results to facilitate analysis. The RSRE will be helpful in the wide exploration of RNA structural robustness and will catalyze our understanding of RNA evolution. The RSRE web server is freely available at http://biosrv1.bmi.ac.cn/RSRE/ or http://biotech.bmi.ac.cn/RSRE/. PMID:17567615

  18. In vivo genome-wide profiling of RNA secondary structure reveals novel regulatory features.

    PubMed

    Ding, Yiliang; Tang, Yin; Kwok, Chun Kit; Zhang, Yu; Bevilacqua, Philip C; Assmann, Sarah M

    2014-01-30

    RNA structure has critical roles in processes ranging from ligand sensing to the regulation of translation, polyadenylation and splicing. However, a lack of genome-wide in vivo RNA structural data has limited our understanding of how RNA structure regulates gene expression in living cells. Here we present a high-throughput, genome-wide in vivo RNA structure probing method, structure-seq, in which dimethyl sulphate methylation of unprotected adenines and cytosines is identified by next-generation sequencing. Application of this method to Arabidopsis thaliana seedlings yielded the first in vivo genome-wide RNA structure map at nucleotide resolution for any organism, with quantitative structural information across more than 10,000 transcripts. Our analysis reveals a three-nucleotide periodic repeat pattern in the structure of coding regions, as well as a less-structured region immediately upstream of the start codon, and shows that these features are strongly correlated with translation efficiency. We also find patterns of strong and weak secondary structure at sites of alternative polyadenylation, as well as strong secondary structure at 5' splice sites that correlates with unspliced events. Notably, in vivo structures of messenger RNAs annotated for stress responses are poorly predicted in silico, whereas mRNA structures of genes related to cell function maintenance are well predicted. Global comparison of several structural features between these two categories shows that the mRNAs associated with stress responses tend to have more single-strandedness, longer maximal loop length and higher free energy per nucleotide, features that may allow these RNAs to undergo conformational changes in response to environmental conditions. Structure-seq allows the RNA structurome and its biological roles to be interrogated on a genome-wide scale and should be applicable to any organism.

  19. DMS-MaPseq for genome-wide or targeted RNA structure probing in vivo

    PubMed Central

    Zubradt, Meghan; Gupta, Paromita; Persad, Sitara; Lambowitz, Alan M.; Weissman, Jonathan S.; Rouskin, Silvi

    2017-01-01

    Coupling structure-specific in vivo chemical modification to next-generation sequencing is transforming RNA secondary structural studies in living cells. The dominant strategy for detecting in vivo chemical modifications uses reverse transcriptase truncation products, which introduces biases and necessitates population-average assessments of RNA structure. Here we present dimethyl sulfate mutational profiling with sequencing (DMS-MaPseq), which encodes DMS modifications as mismatches using a thermostable group II intron reverse transcriptase (TGIRT). DMS-MaPseq yields a high signal-to-noise ratio, can report multiple structural features per molecule, and allows both genome-wide studies and focused in vivo investigations of even low abundance RNAs. We apply DMS-MaPseq for the first analysis of RNA structure within an animal tissue and to identify a functional structure involved in non-canonical translation initiation. Additionally, we use DMS-MaPseq to compare the in vivo structure of pre-mRNAs to their mature isoforms. These applications illustrate DMS-MaPseq’s capacity to dramatically expand in vivo analysis of RNA structure. PMID:27819661

  20. Structure of an archaeal non-discriminating glutamyl-tRNA synthetase: a missing link in the evolution of Gln-tRNAGln formation.

    PubMed

    Nureki, Osamu; O'Donoghue, Patrick; Watanabe, Nobuhisa; Ohmori, Atsuhiko; Oshikane, Hiroyuki; Araiso, Yuhei; Sheppard, Kelly; Söll, Dieter; Ishitani, Ryuichiro

    2010-11-01

    The molecular basis of the genetic code relies on the specific ligation of amino acids to their cognate tRNA molecules. However, two pathways exist for the formation of Gln-tRNA(Gln). The evolutionarily older indirect route utilizes a non-discriminating glutamyl-tRNA synthetase (ND-GluRS) that can form both Glu-tRNA(Glu) and Glu-tRNA(Gln). The Glu-tRNA(Gln) is then converted to Gln-tRNA(Gln) by an amidotransferase. Since the well-characterized bacterial ND-GluRS enzymes recognize tRNA(Glu) and tRNA(Gln) with an unrelated α-helical cage domain in contrast to the β-barrel anticodon-binding domain in archaeal and eukaryotic GluRSs, the mode of tRNA(Glu)/tRNA(Gln) discrimination in archaea and eukaryotes was unknown. Here, we present the crystal structure of the Methanothermobacter thermautotrophicus ND-GluRS, which is the evolutionary predecessor of both the glutaminyl-tRNA synthetase (GlnRS) and the eukaryotic discriminating GluRS. Comparison with the previously solved structure of the Escherichia coli GlnRS-tRNA(Gln) complex reveals the structural determinants responsible for specific tRNA(Gln) recognition by GlnRS compared to promiscuous recognition of both tRNAs by the ND-GluRS. The structure also shows the amino acid recognition pocket of GluRS is more variable than that found in GlnRS. Phylogenetic analysis is used to reconstruct the key events in the evolution from indirect to direct genetic encoding of glutamine.

  1. Structural insights into the role of diphthamide on elongation factor 2 in messenger RNA reading frame maintenance.

    PubMed

    Pellegrino, Simone; Demeshkina, Natalia; Mancera-Martinez, Eder; Melnikov, Sergey; Simonetti, Angelita; Myasnikov, Alexander; Yusupov, Marat; Yusupova, Gulnara; Hashem, Yaser

    2018-06-07

    One of the most critical steps of protein biosynthesis is the coupled movement of messenger RNA (mRNA), that encodes genetic information, with transfer RNAs (tRNAs) on the ribosome. In eukaryotes this process is catalyzed by a conserved G-protein, the elongation factor 2 (eEF2), which carries a unique post-translational modification, called diphthamide, found in all eukaryotic species. Here we present near-atomic resolution cryo-EM structures of yeast 80S ribosome complexes containing mRNA, tRNA and eEF2 trapped in different GTP-hydrolysis states which provide further structural insights on the role of diphthamide in the mechanism of translation fidelity in eukaryotes. Copyright © 2018. Published by Elsevier Ltd.

  2. Analysis of secondary structural elements in human microRNA hairpin precursors.

    PubMed

    Liu, Biao; Childs-Disney, Jessica L; Znosko, Brent M; Wang, Dan; Fallahi, Mohammad; Gallo, Steven M; Disney, Matthew D

    2016-03-01

    MicroRNAs (miRNAs) regulate gene expression by targeting complementary mRNAs for destruction or translational repression. Aberrant expression of miRNAs has been associated with various diseases including cancer, thus making them interesting therapeutic targets. The composite of secondary structural elements that comprise miRNAs could aid the design of small molecules that modulate their function. We analyzed the secondary structural elements, or motifs, present in all human miRNA hairpin precursors and compared them to highly expressed human RNAs with known structures and other RNAs from various organisms. Amongst human miRNAs, there are 3808 are unique motifs, many residing in processing sites. Further, we identified motifs in miRNAs that are not present in other highly expressed human RNAs, desirable targets for small molecules. MiRNA motifs were incorporated into a searchable database that is freely available. We also analyzed the most frequently occurring bulges and internal loops for each RNA class and found that the smallest loops possible prevail. However, the distribution of loops and the preferred closing base pairs were unique to each class. Collectively, we have completed a broad survey of motifs found in human miRNA precursors, highly expressed human RNAs, and RNAs from other organisms. Interestingly, unique motifs were identified in human miRNA processing sites, binding to which could inhibit miRNA maturation and hence function.

  3. Single TRAM domain RNA-binding proteins in Archaea: functional insight from Ctr3 from the Antarctic methanogen Methanococcoides burtonii.

    PubMed

    Taha; Siddiqui, K S; Campanaro, S; Najnin, T; Deshpande, N; Williams, T J; Aldrich-Wright, J; Wilkins, M; Curmi, P M G; Cavicchioli, R

    2016-09-01

    TRAM domain proteins present in Archaea and Bacteria have a β-barrel shape with anti-parallel β-sheets that form a nucleic acid binding surface; a structure also present in cold shock proteins (Csps). Aside from protein structures, experimental data defining the function of TRAM domains is lacking. Here, we explore the possible functional properties of a single TRAM domain protein, Ctr3 (cold-responsive TRAM domain protein 3) from the Antarctic archaeon Methanococcoides burtonii that has increased abundance during low temperature growth. Ribonucleic acid (RNA) bound by Ctr3 in vitro was determined using RNA-seq. Ctr3-bound M. burtonii RNA with a preference for transfer (t)RNA and 5S ribosomal RNA, and a potential binding motif was identified. In tRNA, the motif represented the C loop; a region that is conserved in tRNA from all domains of life and appears to be solvent exposed, potentially providing access for Ctr3 to bind. Ctr3 and Csps are structurally similar and are both inferred to function in low temperature translation. The broad representation of single TRAM domain proteins within Archaea compared with their apparent absence in Bacteria, and scarcity of Csps in Archaea but prevalence in Bacteria, suggests they represent distinct evolutionary lineages of functionally equivalent RNA-binding proteins. © 2016 Society for Applied Microbiology and John Wiley & Sons Ltd.

  4. RNA Secondary Structure Prediction by Using Discrete Mathematics: An Interdisciplinary Research Experience for Undergraduate Students

    PubMed Central

    Ellington, Roni; Wachira, James

    2010-01-01

    The focus of this Research Experience for Undergraduates (REU) project was on RNA secondary structure prediction by using a lattice walk approach. The lattice walk approach is a combinatorial and computational biology method used to enumerate possible secondary structures and predict RNA secondary structure from RNA sequences. The method uses discrete mathematical techniques and identifies specified base pairs as parameters. The goal of the REU was to introduce upper-level undergraduate students to the principles and challenges of interdisciplinary research in molecular biology and discrete mathematics. At the beginning of the project, students from the biology and mathematics departments of a mid-sized university received instruction on the role of secondary structure in the function of eukaryotic RNAs and RNA viruses, RNA related to combinatorics, and the National Center for Biotechnology Information resources. The student research projects focused on RNA secondary structure prediction on a regulatory region of the yellow fever virus RNA genome and on an untranslated region of an mRNA of a gene associated with the neurological disorder epilepsy. At the end of the project, the REU students gave poster and oral presentations, and they submitted written final project reports to the program director. The outcome of the REU was that the students gained transferable knowledge and skills in bioinformatics and an awareness of the applications of discrete mathematics to biological research problems. PMID:20810968

  5. RNA secondary structure prediction by using discrete mathematics: an interdisciplinary research experience for undergraduate students.

    PubMed

    Ellington, Roni; Wachira, James; Nkwanta, Asamoah

    2010-01-01

    The focus of this Research Experience for Undergraduates (REU) project was on RNA secondary structure prediction by using a lattice walk approach. The lattice walk approach is a combinatorial and computational biology method used to enumerate possible secondary structures and predict RNA secondary structure from RNA sequences. The method uses discrete mathematical techniques and identifies specified base pairs as parameters. The goal of the REU was to introduce upper-level undergraduate students to the principles and challenges of interdisciplinary research in molecular biology and discrete mathematics. At the beginning of the project, students from the biology and mathematics departments of a mid-sized university received instruction on the role of secondary structure in the function of eukaryotic RNAs and RNA viruses, RNA related to combinatorics, and the National Center for Biotechnology Information resources. The student research projects focused on RNA secondary structure prediction on a regulatory region of the yellow fever virus RNA genome and on an untranslated region of an mRNA of a gene associated with the neurological disorder epilepsy. At the end of the project, the REU students gave poster and oral presentations, and they submitted written final project reports to the program director. The outcome of the REU was that the students gained transferable knowledge and skills in bioinformatics and an awareness of the applications of discrete mathematics to biological research problems.

  6. Influence of Na+ and Mg2+ ions on RNA structures studied with molecular dynamics simulations.

    PubMed

    Fischer, Nina M; Polêto, Marcelo D; Steuer, Jakob; van der Spoel, David

    2018-06-01

    The structure of ribonucleic acid (RNA) polymers is strongly dependent on the presence of, in particular Mg2+ cations to stabilize structural features. Only in high-resolution X-ray crystallography structures can ions be identified reliably. Here, we perform molecular dynamics simulations of 24 RNA structures with varying ion concentrations. Twelve of the structures were helical and the others complex folded. The aim of the study is to predict ion positions but also to evaluate the impact of different types of ions (Na+ or Mg2+) and the ionic strength on structural stability and variations of RNA. As a general conclusion Mg2+ is found to conserve the experimental structure better than Na+ and, where experimental ion positions are available, they can be reproduced with reasonable accuracy. If a large surplus of ions is present the added electrostatic screening makes prediction of binding-sites less reproducible. Distinct differences in ion-binding between helical and complex folded structures are found. The strength of binding (ΔG‡ for breaking RNA atom-ion interactions) is found to differ between roughly 10 and 26 kJ/mol for the different RNA atoms. Differences in stability between helical and complex folded structures and of the influence of metal ions on either are discussed.

  7. LCS-TA to identify similar fragments in RNA 3D structures.

    PubMed

    Wiedemann, Jakub; Zok, Tomasz; Milostan, Maciej; Szachniuk, Marta

    2017-10-23

    In modern structural bioinformatics, comparison of molecular structures aimed to identify and assess similarities and differences between them is one of the most commonly performed procedures. It gives the basis for evaluation of in silico predicted models. It constitutes the preliminary step in searching for structural motifs. In particular, it supports tracing the molecular evolution. Faced with an ever-increasing amount of available structural data, researchers need a range of methods enabling comparative analysis of the structures from either global or local perspective. Herein, we present a new, superposition-independent method which processes pairs of RNA 3D structures to identify their local similarities. The similarity is considered in the context of structure bending and bonds' rotation which are described by torsion angles. In the analyzed RNA structures, the method finds the longest continuous segments that show similar torsion within a user-defined threshold. The length of the segment is provided as local similarity measure. The method has been implemented as LCS-TA algorithm (Longest Continuous Segments in Torsion Angle space) and is incorporated into our MCQ4Structures application, freely available for download from http://www.cs.put.poznan.pl/tzok/mcq/ . The presented approach ties torsion-angle-based method of structure analysis with the idea of local similarity identification by handling continuous 3D structure segments. The first method, implemented in MCQ4Structures, has been successfully utilized in RNA-Puzzles initiative. The second one, originally applied in Euclidean space, is a component of LGA (Local-Global Alignment) algorithm commonly used in assessing protein models submitted to CASP. This unique combination of concepts implemented in LCS-TA provides a new perspective on structure quality assessment in local and quantitative aspect. A series of computational experiments show the first results of applying our method to comparison of RNA 3D models. LCS-TA can be used for identifying strengths and weaknesses in the prediction of RNA tertiary structures.

  8. The origin and evolution of tRNA inferred from phylogenetic analysis of structure.

    PubMed

    Sun, Feng-Jie; Caetano-Anollés, Gustavo

    2008-01-01

    The evolutionary history of the two structural and functional domains of tRNA is controversial but harbors the secrets of early translation and the genetic code. To explore the origin and evolution of tRNA, we reconstructed phylogenetic trees directly from molecular structure. Forty-two structural characters describing the geometry of 571 tRNAs and three statistical parameters describing thermodynamic and mechanical features of molecules quantitatively were used to derive phylogenetic trees of molecules and molecular substructures. Trees of molecules failed to group tRNA according to amino acid specificity and did not reveal the tripartite nature of life, probably due to loss of phylogenetic signal or because tRNA diversification predated organismal diversification. Trees of substructures derived from both structural and statistical characters support the origin of tRNA in the acceptor arm and the hypothesis that the top half domain composed of acceptor and pseudouridine (TPsiC) arms is more ancient than the bottom half domain composed of dihydrouridine (DHU) and anticodon arms. This constitutes the cornerstone of the genomic tag hypothesis that postulates tRNAs were ancient telomeres in the RNA world. The trees of substructures suggest a model for the evolution of the major functional and structural components of tRNA. In this model, short RNA hairpins with stems homologous to the acceptor arm of present day tRNAs were extended with regions homologous to TPsiC and anticodon arms. The DHU arm was then incorporated into the resulting three-stemmed structure to form a proto-cloverleaf structure. The variable region was the last structural addition to the molecular repertoire of evolving tRNA substructures.

  9. Structure of Arabidopsis HYPONASTIC LEAVES1 and Its Molecular Implications for miRNA Processing

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yang, S.; Chen, H; Yang, J

    2010-01-01

    The Arabidopsis HYPONASTIC LEAVES1 (HYL1) is a double-stranded RNA-binding protein that forms a complex with DICER-LIKE1 (DCL1) and SERRATE to facilitate processing of primary miRNAs into microRNAs (miRNAs). However, the structural mechanisms of miRNA maturation by this complex are poorly understood. Here, we present the crystal structures of double-stranded RNA binding domains (dsRBD1 and dsRBD2) of HYL1 and HYL1 dsRBD1 (HR1)/dsRNA complex as well as human TRBP2 dsRBD2 (TR2)/dsRNA complex for comparison analysis. Structural and functional study demonstrates that both HR1 and TR2 are canonical dsRBDs for dsRNA binding, whereas HR2 of HYL1 is a non-canonical dsRBD harboring a putativemore » dimerization interface. Domain swapping within the context of HYL1 demonstrates that TR2 can supplant the function of HR1 in vitro and in vivo. Further biochemical analyses suggest that HYL1 probably binds to the miRNA/miRNA* region of precursors as a dimer mediated by HR2.« less

  10. Guide-bound structures of an RNA-targeting A-cleaving CRISPR-Cas13a enzyme

    PubMed Central

    Knott, Gavin J.; East-Seletsky, Alexandra; Cofsky, Joshua C.; Holton, James M.; Charles, Emeric; O’Connell, Mitchell R.; Doudna, Jennifer A.

    2018-01-01

    CRISPR adaptive immune systems protect bacteria from infections by deploying CRISPR RNA (crRNA)-guided enzymes to recognize and cut foreign nucleic acids. Type VI-A CRISPR-Cas systems include the Cas13a enzyme, an RNA-activated ribonuclease (RNase) capable of crRNA processing and single-stranded RNA degradation upon target transcript binding. Here we present the 2.0 Å resolution crystal structure of a crRNA-bound L. bacterium Cas13a (LbaCas13a), representing a recently discovered Cas13a enzyme subtype. This structure and accompanying biochemical experiments define for the first time the Cas13a catalytic residues that are directly responsible for crRNA maturation. In addition, the orientation of the foreign-derived target RNA-specifying sequence in the protein interior explains the conformational gating of Cas13a nuclease activation. These results describe how Cas13a enzymes generate functional crRNAs and how catalytic activity is blocked prior to target RNA recognition, with implications for both bacterial immunity and diagnostic applications. PMID:28892041

  11. Secondary RNA structure and its role in RNA interference to silence the respiratory syncytial virus fusion protein gene.

    PubMed

    Vig, Komal; Lewis, Nuruddeen; Moore, Eddie G; Pillai, Shreekumar; Dennis, Vida A; Singh, Shree R

    2009-11-01

    RNA interference (RNAi) is a post-transcriptional, gene silencing mechanism which uses small interfering RNA molecules (siRNA) for gene silencing. Respiratory Syncytial Virus (RSV) is an important respiratory pathogen of medical significance that causes high mortality in infants. The fusion (F) protein of RSV is a good target for therapeutic purposes as it is primarily responsible for penetration of the virus into host cells and subsequent syncytium formation during infection. In the present study, four siRNAs were designed and used individually as well as a mixture, to silence the RSV F gene. The relationship between siRNA design, target RNA structure, and their thermodynamics was also investigated. Silencing of F gene was observed using indirect immunofluorescence, western blot, reverse transcription PCR, and progeny viral titers. Our results show F gene silencing by all the four siRNAs individually and collectively. RT-PCR analysis revealed a decrease in mRNA level which corresponded to decreased F protein expression. siRNAs also inhibited RSV progeny as shown by viral titer estimation on infected HEp-2 cells. The present study demonstrates the silencing of the F gene using siRNA. Thermodynamic characteristics of the target RSV mRNA and siRNA seem to play an important role in siRNA gene silencing efficiency.

  12. The Spot 42 RNA: A regulatory small RNA with roles in the central metabolism

    PubMed Central

    Bækkedal, Cecilie; Haugen, Peik

    2015-01-01

    The Spot 42 RNA is a 109 nucleotide long (in Escherichia coli) noncoding small regulatory RNA (sRNA) encoded by the spf (spot fourty-two) gene. spf is found in gamma-proteobacteria and the majority of experimental work on Spot 42 RNA has been performed using E. coli, and recently Aliivibrio salmonicida. In the cell Spot 42 RNA plays essential roles as a regulator in carbohydrate metabolism and uptake, and its expression is activated by glucose, and inhibited by the cAMP-CRP complex. Here we summarize the current knowledge on Spot 42, and present the natural distribution of spf, show family-specific secondary structural features of Spot 42, and link highly conserved structural regions to mRNA target binding. PMID:26327359

  13. MultiSETTER: web server for multiple RNA structure comparison.

    PubMed

    Čech, Petr; Hoksza, David; Svozil, Daniel

    2015-08-12

    Understanding the architecture and function of RNA molecules requires methods for comparing and analyzing their tertiary and quaternary structures. While structural superposition of short RNAs is achievable in a reasonable time, large structures represent much bigger challenge. Therefore, we have developed a fast and accurate algorithm for RNA pairwise structure superposition called SETTER and implemented it in the SETTER web server. However, though biological relationships can be inferred by a pairwise structure alignment, key features preserved by evolution can be identified only from a multiple structure alignment. Thus, we extended the SETTER algorithm to the alignment of multiple RNA structures and developed the MultiSETTER algorithm. In this paper, we present the updated version of the SETTER web server that implements a user friendly interface to the MultiSETTER algorithm. The server accepts RNA structures either as the list of PDB IDs or as user-defined PDB files. After the superposition is computed, structures are visualized in 3D and several reports and statistics are generated. To the best of our knowledge, the MultiSETTER web server is the first publicly available tool for a multiple RNA structure alignment. The MultiSETTER server offers the visual inspection of an alignment in 3D space which may reveal structural and functional relationships not captured by other multiple alignment methods based either on a sequence or on secondary structure motifs.

  14. Structural organizations of yeast RNase P and RNase MRP holoenzymes as revealed by UV-crosslinking studies of RNA-protein interactions.

    PubMed

    Khanova, Elena; Esakova, Olga; Perederina, Anna; Berezin, Igor; Krasilnikov, Andrey S

    2012-04-01

    Eukaryotic ribonuclease (RNase) P and RNase MRP are closely related ribonucleoprotein complexes involved in the metabolism of various RNA molecules including tRNA, rRNA, and some mRNAs. While evolutionarily related to bacterial RNase P, eukaryotic enzymes of the RNase P/MRP family are much more complex. Saccharomyces cerevisiae RNase P consists of a catalytic RNA component and nine essential proteins; yeast RNase MRP has an RNA component resembling that in RNase P and 10 essential proteins, most of which are shared with RNase P. The structural organizations of eukaryotic RNases P/MRP are not clear. Here we present the results of RNA-protein UV crosslinking studies performed on RNase P and RNase MRP holoenzymes isolated from yeast. The results indicate locations of specific protein-binding sites in the RNA components of RNase P and RNase MRP and shed light on the structural organizations of these large ribonucleoprotein complexes.

  15. Sequence, Structure, and Context Preferences of Human RNA Binding Proteins.

    PubMed

    Dominguez, Daniel; Freese, Peter; Alexis, Maria S; Su, Amanda; Hochman, Myles; Palden, Tsultrim; Bazile, Cassandra; Lambert, Nicole J; Van Nostrand, Eric L; Pratt, Gabriel A; Yeo, Gene W; Graveley, Brenton R; Burge, Christopher B

    2018-06-07

    RNA binding proteins (RBPs) orchestrate the production, processing, and function of mRNAs. Here, we present the affinity landscapes of 78 human RBPs using an unbiased assay that determines the sequence, structure, and context preferences of these proteins in vitro by deep sequencing of bound RNAs. These data enable construction of "RNA maps" of RBP activity without requiring crosslinking-based assays. We found an unexpectedly low diversity of RNA motifs, implying frequent convergence of binding specificity toward a relatively small set of RNA motifs, many with low compositional complexity. Offsetting this trend, however, we observed extensive preferences for contextual features distinct from short linear RNA motifs, including spaced "bipartite" motifs, biased flanking nucleotide composition, and bias away from or toward RNA structure. Our results emphasize the importance of contextual features in RNA recognition, which likely enable targeting of distinct subsets of transcripts by different RBPs that recognize the same linear motif. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

  16. Sparse RNA folding revisited: space-efficient minimum free energy structure prediction.

    PubMed

    Will, Sebastian; Jabbari, Hosna

    2016-01-01

    RNA secondary structure prediction by energy minimization is the central computational tool for the analysis of structural non-coding RNAs and their interactions. Sparsification has been successfully applied to improve the time efficiency of various structure prediction algorithms while guaranteeing the same result; however, for many such folding problems, space efficiency is of even greater concern, particularly for long RNA sequences. So far, space-efficient sparsified RNA folding with fold reconstruction was solved only for simple base-pair-based pseudo-energy models. Here, we revisit the problem of space-efficient free energy minimization. Whereas the space-efficient minimization of the free energy has been sketched before, the reconstruction of the optimum structure has not even been discussed. We show that this reconstruction is not possible in trivial extension of the method for simple energy models. Then, we present the time- and space-efficient sparsified free energy minimization algorithm SparseMFEFold that guarantees MFE structure prediction. In particular, this novel algorithm provides efficient fold reconstruction based on dynamically garbage-collected trace arrows. The complexity of our algorithm depends on two parameters, the number of candidates Z and the number of trace arrows T; both are bounded by [Formula: see text], but are typically much smaller. The time complexity of RNA folding is reduced from [Formula: see text] to [Formula: see text]; the space complexity, from [Formula: see text] to [Formula: see text]. Our empirical results show more than 80 % space savings over RNAfold [Vienna RNA package] on the long RNAs from the RNA STRAND database (≥2500 bases). The presented technique is intentionally generalizable to complex prediction algorithms; due to their high space demands, algorithms like pseudoknot prediction and RNA-RNA-interaction prediction are expected to profit even stronger than "standard" MFE folding. SparseMFEFold is free software, available at http://www.bioinf.uni-leipzig.de/~will/Software/SparseMFEFold.

  17. Fast online and index-based algorithms for approximate search of RNA sequence-structure patterns

    PubMed Central

    2013-01-01

    Background It is well known that the search for homologous RNAs is more effective if both sequence and structure information is incorporated into the search. However, current tools for searching with RNA sequence-structure patterns cannot fully handle mutations occurring on both these levels or are simply not fast enough for searching large sequence databases because of the high computational costs of the underlying sequence-structure alignment problem. Results We present new fast index-based and online algorithms for approximate matching of RNA sequence-structure patterns supporting a full set of edit operations on single bases and base pairs. Our methods efficiently compute semi-global alignments of structural RNA patterns and substrings of the target sequence whose costs satisfy a user-defined sequence-structure edit distance threshold. For this purpose, we introduce a new computing scheme to optimally reuse the entries of the required dynamic programming matrices for all substrings and combine it with a technique for avoiding the alignment computation of non-matching substrings. Our new index-based methods exploit suffix arrays preprocessed from the target database and achieve running times that are sublinear in the size of the searched sequences. To support the description of RNA molecules that fold into complex secondary structures with multiple ordered sequence-structure patterns, we use fast algorithms for the local or global chaining of approximate sequence-structure pattern matches. The chaining step removes spurious matches from the set of intermediate results, in particular of patterns with little specificity. In benchmark experiments on the Rfam database, our improved online algorithm is faster than the best previous method by up to factor 45. Our best new index-based algorithm achieves a speedup of factor 560. Conclusions The presented methods achieve considerable speedups compared to the best previous method. This, together with the expected sublinear running time of the presented index-based algorithms, allows for the first time approximate matching of RNA sequence-structure patterns in large sequence databases. Beyond the algorithmic contributions, we provide with RaligNAtor a robust and well documented open-source software package implementing the algorithms presented in this manuscript. The RaligNAtor software is available at http://www.zbh.uni-hamburg.de/ralignator. PMID:23865810

  18. Structator: fast index-based search for RNA sequence-structure patterns

    PubMed Central

    2011-01-01

    Background The secondary structure of RNA molecules is intimately related to their function and often more conserved than the sequence. Hence, the important task of searching databases for RNAs requires to match sequence-structure patterns. Unfortunately, current tools for this task have, in the best case, a running time that is only linear in the size of sequence databases. Furthermore, established index data structures for fast sequence matching, like suffix trees or arrays, cannot benefit from the complementarity constraints introduced by the secondary structure of RNAs. Results We present a novel method and readily applicable software for time efficient matching of RNA sequence-structure patterns in sequence databases. Our approach is based on affix arrays, a recently introduced index data structure, preprocessed from the target database. Affix arrays support bidirectional pattern search, which is required for efficiently handling the structural constraints of the pattern. Structural patterns like stem-loops can be matched inside out, such that the loop region is matched first and then the pairing bases on the boundaries are matched consecutively. This allows to exploit base pairing information for search space reduction and leads to an expected running time that is sublinear in the size of the sequence database. The incorporation of a new chaining approach in the search of RNA sequence-structure patterns enables the description of molecules folding into complex secondary structures with multiple ordered patterns. The chaining approach removes spurious matches from the set of intermediate results, in particular of patterns with little specificity. In benchmark experiments on the Rfam database, our method runs up to two orders of magnitude faster than previous methods. Conclusions The presented method's sublinear expected running time makes it well suited for RNA sequence-structure pattern matching in large sequence databases. RNA molecules containing several stem-loop substructures can be described by multiple sequence-structure patterns and their matches are efficiently handled by a novel chaining method. Beyond our algorithmic contributions, we provide with Structator a complete and robust open-source software solution for index-based search of RNA sequence-structure patterns. The Structator software is available at http://www.zbh.uni-hamburg.de/Structator. PMID:21619640

  19. Relicts and models of the RNA world

    NASA Astrophysics Data System (ADS)

    Lehto, Kirsi; Karetnikov, Alexey

    2005-01-01

    It is widely believed that the current DNA-RNA-protein-based life forms have evolved from preceding RNA-protein-based life forms, and these again, from mere RNA replicons. By rationale, it can be assumed that the early RNA replicons were fully heterotrophic in terms of obtaining all their building blocks from their environment. In the absence of protein catalysts, their essential life functions had to be mediated by simple functional structures and mechanisms, such as RNA secondary structures, RNA-RNA interactions and RNA-mediated catalysis, and possibly by catalytic minerals or clays. The central role of RNA catalysts in early life forms is supported by the fact that several catalytic RNAs still perform central biological functions in current life forms, and at least some of these may be derived as molecular relicts from the early RNA-based life. The RNA-catalysed metabolic reactions and molecular fossils are more conserved in the eukaryotic life forms than in the prokaryotes, suggesting that the linear eukaryote genomes may more closely resemble the structure and function of the early RNA replicons, than what do the circular prokaryote genomes. Present-day RNA viruses and viroids utilize ultimately simple life strategies, which may be similar to those used by the early RNA replicons. Thus, molecular and functional properties of viruses and viroids may be considered as examples or models of the structures and replication mechanisms, which might have been used for the replication of the early biopolymers.

  20. TRANSAT-- method for detecting the conserved helices of functional RNA structures, including transient, pseudo-knotted and alternative structures.

    PubMed

    Wiebe, Nicholas J P; Meyer, Irmtraud M

    2010-06-24

    The prediction of functional RNA structures has attracted increased interest, as it allows us to study the potential functional roles of many genes. RNA structure prediction methods, however, assume that there is a unique functional RNA structure and also do not predict functional features required for in vivo folding. In order to understand how functional RNA structures form in vivo, we require sophisticated experiments or reliable prediction methods. So far, there exist only a few, experimentally validated transient RNA structures. On the computational side, there exist several computer programs which aim to predict the co-transcriptional folding pathway in vivo, but these make a range of simplifying assumptions and do not capture all features known to influence RNA folding in vivo. We want to investigate if evolutionarily related RNA genes fold in a similar way in vivo. To this end, we have developed a new computational method, Transat, which detects conserved helices of high statistical significance. We introduce the method, present a comprehensive performance evaluation and show that Transat is able to predict the structural features of known reference structures including pseudo-knotted ones as well as those of known alternative structural configurations. Transat can also identify unstructured sub-sequences bound by other molecules and provides evidence for new helices which may define folding pathways, supporting the notion that homologous RNA sequence not only assume a similar reference RNA structure, but also fold similarly. Finally, we show that the structural features predicted by Transat differ from those assuming thermodynamic equilibrium. Unlike the existing methods for predicting folding pathways, our method works in a comparative way. This has the disadvantage of not being able to predict features as function of time, but has the considerable advantage of highlighting conserved features and of not requiring a detailed knowledge of the cellular environment.

  1. RNA inverse folding using Monte Carlo tree search.

    PubMed

    Yang, Xiufeng; Yoshizoe, Kazuki; Taneda, Akito; Tsuda, Koji

    2017-11-06

    Artificially synthesized RNA molecules provide important ways for creating a variety of novel functional molecules. State-of-the-art RNA inverse folding algorithms can design simple and short RNA sequences of specific GC content, that fold into the target RNA structure. However, their performance is not satisfactory in complicated cases. We present a new inverse folding algorithm called MCTS-RNA, which uses Monte Carlo tree search (MCTS), a technique that has shown exceptional performance in Computer Go recently, to represent and discover the essential part of the sequence space. To obtain high accuracy, initial sequences generated by MCTS are further improved by a series of local updates. Our algorithm has an ability to control the GC content precisely and can deal with pseudoknot structures. Using common benchmark datasets for evaluation, MCTS-RNA showed a lot of promise as a standard method of RNA inverse folding. MCTS-RNA is available at https://github.com/tsudalab/MCTS-RNA .

  2. SHAPE Selection (SHAPES) enrich for RNA structure signal in SHAPE sequencing-based probing data

    PubMed Central

    Poulsen, Line Dahl; Kielpinski, Lukasz Jan; Salama, Sofie R.; Krogh, Anders; Vinther, Jeppe

    2015-01-01

    Selective 2′ Hydroxyl Acylation analyzed by Primer Extension (SHAPE) is an accurate method for probing of RNA secondary structure. In existing SHAPE methods, the SHAPE probing signal is normalized to a no-reagent control to correct for the background caused by premature termination of the reverse transcriptase. Here, we introduce a SHAPE Selection (SHAPES) reagent, N-propanone isatoic anhydride (NPIA), which retains the ability of SHAPE reagents to accurately probe RNA structure, but also allows covalent coupling between the SHAPES reagent and a biotin molecule. We demonstrate that SHAPES-based selection of cDNA–RNA hybrids on streptavidin beads effectively removes the large majority of background signal present in SHAPE probing data and that sequencing-based SHAPES data contain the same amount of RNA structure data as regular sequencing-based SHAPE data obtained through normalization to a no-reagent control. Moreover, the selection efficiently enriches for probed RNAs, suggesting that the SHAPES strategy will be useful for applications with high-background and low-probing signal such as in vivo RNA structure probing. PMID:25805860

  3. A dual-specific Glu-tRNA(Gln) and Asp-tRNA(Asn) amidotransferase is involved in decoding glutamine and asparagine codons in Acidithiobacillus ferrooxidans.

    PubMed

    Salazar, J C; Zúñiga, R; Raczniak, G; Becker, H; Söll, D; Orellana, O

    2001-07-06

    The gatC, gatA and gatB genes encoding the three subunits of glutamyl-tRNA(Gln) amidotransferase from Acidithiobacillus ferrooxidans, an acidophilic bacterium used in bioleaching of minerals, have been cloned and expressed in Escherichia coli. As in Bacillus subtilis the three gat genes are organized in an operon-like structure in A. ferrooxidans. The heterologously overexpressed enzyme converts Glu-tRNA(Gln) to Gln-tRNA(Gln) and Asp-tRNA(Asn) to Asn-tRNA(Asn). Biochemical analysis revealed that neither glutaminyl-tRNA synthetase nor asparaginyl-tRNA synthetase is present in A. ferrooxidans, but that glutamyl-tRNA synthetase and aspartyl-tRNA synthetase enzymes are present in the organism. These data suggest that the transamidation pathway is responsible for the formation of Gln-tRNA and Asn-tRNA in A. ferrooxidans.

  4. Ensemble-based prediction of RNA secondary structures.

    PubMed

    Aghaeepour, Nima; Hoos, Holger H

    2013-04-24

    Accurate structure prediction methods play an important role for the understanding of RNA function. Energy-based, pseudoknot-free secondary structure prediction is one of the most widely used and versatile approaches, and improved methods for this task have received much attention over the past five years. Despite the impressive progress that as been achieved in this area, existing evaluations of the prediction accuracy achieved by various algorithms do not provide a comprehensive, statistically sound assessment. Furthermore, while there is increasing evidence that no prediction algorithm consistently outperforms all others, no work has been done to exploit the complementary strengths of multiple approaches. In this work, we present two contributions to the area of RNA secondary structure prediction. Firstly, we use state-of-the-art, resampling-based statistical methods together with a previously published and increasingly widely used dataset of high-quality RNA structures to conduct a comprehensive evaluation of existing RNA secondary structure prediction procedures. The results from this evaluation clarify the performance relationship between ten well-known existing energy-based pseudoknot-free RNA secondary structure prediction methods and clearly demonstrate the progress that has been achieved in recent years. Secondly, we introduce AveRNA, a generic and powerful method for combining a set of existing secondary structure prediction procedures into an ensemble-based method that achieves significantly higher prediction accuracies than obtained from any of its component procedures. Our new, ensemble-based method, AveRNA, improves the state of the art for energy-based, pseudoknot-free RNA secondary structure prediction by exploiting the complementary strengths of multiple existing prediction procedures, as demonstrated using a state-of-the-art statistical resampling approach. In addition, AveRNA allows an intuitive and effective control of the trade-off between false negative and false positive base pair predictions. Finally, AveRNA can make use of arbitrary sets of secondary structure prediction procedures and can therefore be used to leverage improvements in prediction accuracy offered by algorithms and energy models developed in the future. Our data, MATLAB software and a web-based version of AveRNA are publicly available at http://www.cs.ubc.ca/labs/beta/Software/AveRNA.

  5. Cryo-EM structure of a late pre-40S ribosomal subunit from Saccharomyces cerevisiae

    PubMed Central

    Schmidt, Christian; Berninghausen, Otto; Becker, Thomas

    2017-01-01

    Mechanistic understanding of eukaryotic ribosome formation requires a detailed structural knowledge of the numerous assembly intermediates, generated along a complex pathway. Here, we present the structure of a late pre-40S particle at 3.6 Å resolution, revealing in molecular detail how assembly factors regulate the timely folding of pre-18S rRNA. The structure shows that, rather than sterically blocking 40S translational active sites, the associated assembly factors Tsr1, Enp1, Rio2 and Pno1 collectively preclude their final maturation, thereby preventing untimely tRNA and mRNA binding and error prone translation. Moreover, the structure explains how Pno1 coordinates the 3’end cleavage of the 18S rRNA by Nob1 and how the late factor’s removal in the cytoplasm ensures the structural integrity of the maturing 40S subunit. PMID:29155690

  6. Crystallographic and Computational Analyses of AUUCU Repeating RNA That Causes Spinocerebellar Ataxia Type 10 (SCA10).

    PubMed

    Park, HaJeung; González, Àlex L; Yildirim, Ilyas; Tran, Tuan; Lohman, Jeremy R; Fang, Pengfei; Guo, Min; Disney, Matthew D

    2015-06-23

    Spinocerebellar ataxia type 10 (SCA10) is caused by a pentanucleotide repeat expansion of r(AUUCU) within intron 9 of the ATXN10 pre-mRNA. The RNA causes disease by a gain-of-function mechanism in which it inactivates proteins involved in RNA biogenesis. Spectroscopic studies showed that r(AUUCU) repeats form a hairpin structure; however, there were no high-resolution structural models prior to this work. Herein, we report the first crystal structure of model r(AUUCU) repeats refined to 2.8 Å and analysis of the structure via molecular dynamics simulations. The r(AUUCU) tracts adopt an overall A-form geometry in which 3 × 3 nucleotide (5')UCU(3')/(3')UCU(5') internal loops are closed by AU pairs. Helical parameters of the refined structure as well as the corresponding electron density map on the crystallographic model reflect dynamic features of the internal loop. The computational analyses captured dynamic motion of the loop closing pairs, which can form single-stranded conformations with relatively low energies. Overall, the results presented here suggest the possibility for r(AUUCU) repeats to form metastable A-from structures, which can rearrange into single-stranded conformations and attract proteins such as heterogeneous nuclear ribonucleoprotein K (hnRNP K). The information presented here may aid in the rational design of therapeutics targeting this RNA.

  7. Crystallographic and Computational Analyses of AUUCU Repeating RNA That Causes Spinocerebellar Ataxia Type 10 (SCA10)

    PubMed Central

    Park, HaJeung; González, Àlex L.; Yildirim, Ilyas; Tran, Tuan; Lohman, Jeremy R.; Fang, Pengfei; Guo, Min; Disney, Matthew D.

    2016-01-01

    Spinocerebellar ataxia type 10 (SCA10) is caused by a pentanucleotide repeat expansion of r(AUUCU) within intron 9 of the ATXN10 pre-mRNA. The RNA causes disease by a gain-of-function mechanism in which it inactivates proteins involved in RNA biogenesis. Spectroscopic studies showed that r(AUUCU) repeats form a hairpin structure; however, there were no high-resolution structural models prior to this work. Herein, we report the first crystal structure of model r(AUUCU) repeats refined to 2.8 Å and analysis of the structure via molecular dynamics simulations. The r(AUUCU) tracts adopt an overall A-form geometry in which 3 × 3 nucleotide 5′UCU3′/3′UCU5′ internal loops are closed by AU pairs. Helical parameters of the refined structure as well as the corresponding electron density map on the crystallographic model reflect dynamic features of the internal loop. The computational analyses captured dynamic motion of the loop closing pairs, which can form single-stranded conformations with relatively low energies. Overall, the results presented here suggest the possibility for r(AUUCU) repeats to form metastable A-from structures, which can rearrange into single-stranded conformations and attract proteins such as heterogeneous nuclear ribonucleoprotein K (hnRNP K). The information presented here may aid in the rational design of therapeutics targeting this RNA. PMID:26039897

  8. Identification of novel RNA secondary structures within the hepatitis C virus genome reveals a cooperative involvement in genome packaging

    PubMed Central

    Stewart, H.; Bingham, R.J.; White, S. J.; Dykeman, E. C.; Zothner, C.; Tuplin, A. K.; Stockley, P. G.; Twarock, R.; Harris, M.

    2016-01-01

    The specific packaging of the hepatitis C virus (HCV) genome is hypothesised to be driven by Core-RNA interactions. To identify the regions of the viral genome involved in this process, we used SELEX (systematic evolution of ligands by exponential enrichment) to identify RNA aptamers which bind specifically to Core in vitro. Comparison of these aptamers to multiple HCV genomes revealed the presence of a conserved terminal loop motif within short RNA stem-loop structures. We postulated that interactions of these motifs, as well as sub-motifs which were present in HCV genomes at statistically significant levels, with the Core protein may drive virion assembly. We mutated 8 of these predicted motifs within the HCV infectious molecular clone JFH-1, thereby producing a range of mutant viruses predicted to possess altered RNA secondary structures. RNA replication and viral titre were unaltered in viruses possessing only one mutated structure. However, infectivity titres were decreased in viruses possessing a higher number of mutated regions. This work thus identified multiple novel RNA motifs which appear to contribute to genome packaging. We suggest that these structures act as cooperative packaging signals to drive specific RNA encapsidation during HCV assembly. PMID:26972799

  9. Regulatory RNAs in Bacillus subtilis: a Gram-Positive Perspective on Bacterial RNA-Mediated Regulation of Gene Expression.

    PubMed

    Mars, Ruben A T; Nicolas, Pierre; Denham, Emma L; van Dijl, Jan Maarten

    2016-12-01

    Bacteria can employ widely diverse RNA molecules to regulate their gene expression. Such molecules include trans-acting small regulatory RNAs, antisense RNAs, and a variety of transcriptional attenuation mechanisms in the 5' untranslated region. Thus far, most regulatory RNA research has focused on Gram-negative bacteria, such as Escherichia coli and Salmonella. Hence, there is uncertainty about whether the resulting insights can be extrapolated directly to other bacteria, such as the Gram-positive soil bacterium Bacillus subtilis. A recent study identified 1,583 putative regulatory RNAs in B. subtilis, whose expression was assessed across 104 conditions. Here, we review the current understanding of RNA-based regulation in B. subtilis, and we categorize the newly identified putative regulatory RNAs on the basis of their conservation in other bacilli and the stability of their predicted secondary structures. Our present evaluation of the publicly available data indicates that RNA-mediated gene regulation in B. subtilis mostly involves elements at the 5' ends of mRNA molecules. These can include 5' secondary structure elements and metabolite-, tRNA-, or protein-binding sites. Importantly, sense-independent segments are identified as the most conserved and structured potential regulatory RNAs in B. subtilis. Altogether, the present survey provides many leads for the identification of new regulatory RNA functions in B. subtilis. Copyright © 2016, American Society for Microbiology. All Rights Reserved.

  10. Regulatory RNAs in Bacillus subtilis: a Gram-Positive Perspective on Bacterial RNA-Mediated Regulation of Gene Expression

    PubMed Central

    Mars, Ruben A. T.; Nicolas, Pierre; Denham, Emma L.

    2016-01-01

    SUMMARY Bacteria can employ widely diverse RNA molecules to regulate their gene expression. Such molecules include trans-acting small regulatory RNAs, antisense RNAs, and a variety of transcriptional attenuation mechanisms in the 5′ untranslated region. Thus far, most regulatory RNA research has focused on Gram-negative bacteria, such as Escherichia coli and Salmonella. Hence, there is uncertainty about whether the resulting insights can be extrapolated directly to other bacteria, such as the Gram-positive soil bacterium Bacillus subtilis. A recent study identified 1,583 putative regulatory RNAs in B. subtilis, whose expression was assessed across 104 conditions. Here, we review the current understanding of RNA-based regulation in B. subtilis, and we categorize the newly identified putative regulatory RNAs on the basis of their conservation in other bacilli and the stability of their predicted secondary structures. Our present evaluation of the publicly available data indicates that RNA-mediated gene regulation in B. subtilis mostly involves elements at the 5′ ends of mRNA molecules. These can include 5′ secondary structure elements and metabolite-, tRNA-, or protein-binding sites. Importantly, sense-independent segments are identified as the most conserved and structured potential regulatory RNAs in B. subtilis. Altogether, the present survey provides many leads for the identification of new regulatory RNA functions in B. subtilis. PMID:27784798

  11. The 1.3 A resolution structure of the RNA tridecamer r(GCGUUUGAAACGC): metal ion binding correlates with base unstacking and groove contraction.

    PubMed

    Timsit, Youri; Bombard, Sophie

    2007-12-01

    Metal ions play a key role in RNA folding and activity. Elucidating the rules that govern the binding of metal ions is therefore an essential step for better understanding the RNA functions. High-resolution data are a prerequisite for a detailed structural analysis of ion binding on RNA and, in particular, the observation of monovalent cations. Here, the high-resolution crystal structures of the tridecamer duplex r(GCGUUUGAAACGC) crystallized under different conditions provides new structural insights on ion binding on GAAA/UUU sequences that exhibit both unusual structural and functional properties in RNA. The present study extends the repertory of RNA ion binding sites in showing that the two first bases of UUU triplets constitute a specific site for sodium ions. A striking asymmetric pattern of metal ion binding in the two equivalent halves of the palindromic sequence demonstrates that sequence and its environment act together to bind metal ions. A highly ionophilic half that binds six metal ions allows, for the first time, the observation of a disodium cluster in RNA. The comparison of the equivalent halves of the duplex provides experimental evidences that ion binding correlates with structural alterations and groove contraction.

  12. Evidence for a Complex Class of Nonadenylated mRNA in Drosophila

    PubMed Central

    Zimmerman, J. Lynn; Fouts, David L.; Manning, Jerry E.

    1980-01-01

    The amount, by mass, of poly(A+) mRNA present in the polyribosomes of third-instar larvae of Drosophila melanogaster, and the relative contribution of the poly(A+) mRNA to the sequence complexity of total polysomal RNA, has been determined. Selective removal of poly(A+) mRNA from total polysomal RNA by use of either oligo-dT-cellulose, or poly(U)-sepharose affinity chromatography, revealed that only 0.15% of the mass of the polysomal RNA was present as poly(A+) mRNA. The present study shows that this RNA hybridized at saturation with 3.3% of the single-copy DNA in the Drosophila genome. After correction for asymmetric transcription and reactability of the DNA, 7.4% of the single-copy DNA in the Drosophila genome is represented in larval poly(A+) mRNA. This corresponds to 6.73 x 106 nucleotides of mRNA coding sequences, or approximately 5,384 diverse RNA sequences of average size 1,250 nucleotides. However, total polysomal RNA hybridizes at saturation to 10.9% of the single-copy DNA sequences. After correcting this value for asymmetric transcription and tracer DNA reactability, 24% of the single-copy DNA in Drosophila is represented in total polysomal RNA. This corresponds to 2.18 x 107 nucleotides of RNA coding sequences or 17,440 diverse RNA molecules of size 1,250 nucleotides. This value is 3.2 times greater than that observed for poly(A+) mRNA, and indicates that ≃69% of the polysomal RNA sequence complexity is contributed by nonadenylated RNA. Furthermore, if the number of different structural genes represented in total polysomal RNA is ≃1.7 x 104, then the number of genes expressed in third-instar larvae exceeds the number of chromomeres in Drosophila by about a factor of three. This numerology indicates that the number of chromomeres observed in polytene chromosomes does not reflect the number of structural gene sequences in the Drosophila genome. PMID:6777246

  13. Structure of RDE-4 dsRBDs and mutational studies provide insights into dsRNA recognition in the Caenorhabditis elegans RNAi pathway.

    PubMed

    Chiliveri, Sai Chaitanya; Deshmukh, Mandar V

    2014-02-15

    The association of RDE-4 (RNAi defective 4), a protein containing two dsRBDs (dsRNA-binding domains), with long dsRNA and Dcr-1 (Dicer1 homologue) initiates the siRNA pathway in Caenorhabditis elegans. Unlike its homologues in higher eukaryotes, RDE-4 dsRBDs possess weak (micromolar) affinity for short dsRNA. With increasing length of dsRNA, RDE-4 exhibits enhanced affinity due to co-operativity. The linker and dsRBD2 are indispensable for RDE-4's simultaneous interaction with dsRNA and Dcr-1. In the present study, we have determined the solution structures of RDE-4 constructs that contain both dsRBDs and the linker region. In addition to the canonical dsRBD fold, both dsRBDs of RDE-4 show modified structural features such as truncation in the β1-β2 loop that rationalize RDE-4's relatively weak dsRNA affinity. Structure and binding studies demonstrate that dsRBD2 plays a decisive role in the RDE-4-dsRNA interaction; however, in contrast with previous findings, we found ephemeral interaction of RDE-4 dsRBD1 with dsRNA. More importantly, mutations in two tandem lysine residues (Lys217 and Lys218) in dsRBD2 impair RDE-4's dsRNA-binding ability and could obliterate RNAi initiation in C. elegans. Additionally, we postulate a structural basis for the minimal requirement of linker and dsRBD2 for RDE-4's association with dsRNA and Dcr-1.

  14. The separation between the 5'-3' ends in long RNA molecules is short and nearly constant.

    PubMed

    Leija-Martínez, Nehemías; Casas-Flores, Sergio; Cadena-Nava, Rubén D; Roca, Joan A; Mendez-Cabañas, José A; Gomez, Eduardo; Ruiz-Garcia, Jaime

    2014-12-16

    RNA molecules play different roles in coding, decoding and gene expression regulation. Such roles are often associated to the RNA secondary or tertiary structures. The folding dynamics lead to multiple secondary structures of long RNA molecules, since an RNA molecule might fold into multiple distinct native states. Despite an ensemble of different structures, it has been theoretically proposed that the separation between the 5' and 3' ends of long single-stranded RNA molecules (ssRNA) remains constant, independent of their base content and length. Here, we present the first experimental measurements of the end-to-end separation in long ssRNA molecules. To determine this separation, we use single molecule Fluorescence Resonance Energy Transfer of fluorescently end-labeled ssRNA molecules ranging from 500 to 5500 nucleotides in length, obtained from two viruses and a fungus. We found that the end-to-end separation is indeed short, within 5-9 nm. It is remarkable that the separation of the ends of all RNA molecules studied remains small and similar, despite the origin, length and differences in their secondary structure. This implies that the ssRNA molecules are 'effectively circularized' something that might be a general feature of RNAs, and could result in fine-tuning for translation and gene expression regulation. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  15. Computational Approaches to Nucleic Acid Origami.

    PubMed

    Jabbari, Hosna; Aminpour, Maral; Montemagno, Carlo

    2015-10-12

    Recent advances in experimental DNA origami have dramatically expanded the horizon of DNA nanotechnology. Complex 3D suprastructures have been designed and developed using DNA origami with applications in biomaterial science, nanomedicine, nanorobotics, and molecular computation. Ribonucleic acid (RNA) origami has recently been realized as a new approach. Similar to DNA, RNA molecules can be designed to form complex 3D structures through complementary base pairings. RNA origami structures are, however, more compact and more thermodynamically stable due to RNA's non-canonical base pairing and tertiary interactions. With all these advantages, the development of RNA origami lags behind DNA origami by a large gap. Furthermore, although computational methods have proven to be effective in designing DNA and RNA origami structures and in their evaluation, advances in computational nucleic acid origami is even more limited. In this paper, we review major milestones in experimental and computational DNA and RNA origami and present current challenges in these fields. We believe collaboration between experimental nanotechnologists and computer scientists are critical for advancing these new research paradigms.

  16. Structural model of an mRNA in complex with the bacterial chaperone Hfq

    DOE PAGES

    Peng, Yi; Curtis, Joseph E.; Fang, Xianyang; ...

    2014-11-17

    The Sm-like protein Hfq (host factor Q-beta phage) facilitates regulation by bacterial small noncoding RNAs (sRNAs) in response to stress and other environmental signals. In this paper, we present a low-resolution model of Escherichia coli Hfq bound to the rpoS mRNA, a bacterial stress response gene that is targeted by three different sRNAs. Selective 2'-hydroxyl acylation and primer extension, small-angle X-ray scattering, and Monte Carlo molecular dynamics simulations show that the distal face and lateral rim of Hfq interact with three sites in the rpoS leader, folding the RNA into a compact tertiary structure. These interactions are needed for sRNAmore » regulation of rpoS translation and position the sRNA target adjacent to an sRNA binding region on the proximal face of Hfq. Finally, our results show how Hfq specifically distorts the structure of the rpoS mRNA to enable sRNA base pairing and translational control.« less

  17. Prediction of RNA secondary structures: from theory to models and real molecules

    NASA Astrophysics Data System (ADS)

    Schuster, Peter

    2006-05-01

    RNA secondary structures are derived from RNA sequences, which are strings built form the natural four letter nucleotide alphabet, {AUGC}. These coarse-grained structures, in turn, are tantamount to constrained strings over a three letter alphabet. Hence, the secondary structures are discrete objects and the number of sequences always exceeds the number of structures. The sequences built from two letter alphabets form perfect structures when the nucleotides can form a base pair, as is the case with {GC} or {AU}, but the relation between the sequences and structures differs strongly from the four letter alphabet. A comprehensive theory of RNA structure is presented, which is based on the concepts of sequence space and shape space, being a space of structures. It sets the stage for modelling processes in ensembles of RNA molecules like evolutionary optimization or kinetic folding as dynamical phenomena guided by mappings between the two spaces. The number of minimum free energy (mfe) structures is always smaller than the number of sequences, even for two letter alphabets. Folding of RNA molecules into mfe energy structures constitutes a non-invertible mapping from sequence space onto shape space. The preimage of a structure in sequence space is defined as its neutral network. Similarly the set of suboptimal structures is the preimage of a sequence in shape space. This set represents the conformation space of a given sequence. The evolutionary optimization of structures in populations is a process taking place in sequence space, whereas kinetic folding occurs in molecular ensembles that optimize free energy in conformation space. Efficient folding algorithms based on dynamic programming are available for the prediction of secondary structures for given sequences. The inverse problem, the computation of sequences for predefined structures, is an important tool for the design of RNA molecules with tailored properties. Simultaneous folding or cofolding of two or more RNA molecules can be modelled readily at the secondary structure level and allows prediction of the most stable (mfe) conformations of complexes together with suboptimal states. Cofolding algorithms are important tools for efficient and highly specific primer design in the polymerase chain reaction (PCR) and help to explain the mechanisms of small interference RNA (si-RNA) molecules in gene regulation. The evolutionary optimization of RNA structures is illustrated by the search for a target structure and mimics aptamer selection in evolutionary biotechnology. It occurs typically in steps consisting of short adaptive phases interrupted by long epochs of little or no obvious progress in optimization. During these quasi-stationary epochs the populations are essentially confined to neutral networks where they search for sequences that allow a continuation of the adaptive process. Modelling RNA evolution as a simultaneous process in sequence and shape space provides answers to questions of the optimal population size and mutation rates. Kinetic folding is a stochastic process in conformation space. Exact solutions are derived by direct simulation in the form of trajectory sampling or by solving the master equation. The exact solutions can be approximated straightforwardly by Arrhenius kinetics on barrier trees, which represent simplified versions of conformational energy landscapes. The existence of at least one sequence forming any arbitrarily chosen pair of structures is granted by the intersection theorem. Folding kinetics is the key to understanding and designing multistable RNA molecules or RNA switches. These RNAs form two or more long lived conformations, and conformational changes occur either spontaneously or are induced through binding of small molecules or other biopolymers. RNA switches are found in nature where they act as elements in genetic and metabolic regulation. The reliability of RNA secondary structure prediction is limited by the accuracy with which the empirical parameters can be determined and by principal deficiencies, for example by the lack of energy contributions resulting from tertiary interactions. In addition, native structures may be determined by folding kinetics rather than by thermodynamics. We address the first problem by considering base pair probabilities or base pairing entropies, which are derived from the partition function of conformations. A high base pair probability corresponding to a low pairing entropy is taken as an indicator of a high reliability of prediction. Pseudoknots are discussed as an example of a tertiary interaction that is highly important for RNA function. Moreover, pseudoknot formation is readily incorporated into structure prediction algorithms. Some examples of experimental data on RNA secondary structures that are readily explained using the landscape concept are presented. They deal with (i) properties of RNA molecules with random sequences, (ii) RNA molecules from restricted alphabets, (iii) existence of neutral networks, (iv) shape space covering, (v) riboswitches and (vi) evolution of non-coding RNAs as an example of evolution restricted to neutral networks.

  18. Structure of RNA 3′-phosphate cyclase bound to substrate RNA

    PubMed Central

    Desai, Kevin K.; Bingman, Craig A.; Cheng, Chin L.; Phillips, George N.

    2014-01-01

    RNA 3′-phosphate cyclase (RtcA) catalyzes the ATP-dependent cyclization of a 3′-phosphate to form a 2′,3′-cyclic phosphate at RNA termini. Cyclization proceeds through RtcA–AMP and RNA(3′)pp(5′)A covalent intermediates, which are analogous to intermediates formed during catalysis by the tRNA ligase RtcB. Here we present a crystal structure of Pyrococcus horikoshii RtcA in complex with a 3′-phosphate terminated RNA and adenosine in the AMP-binding pocket. Our data reveal that RtcA recognizes substrate RNA by ensuring that the terminal 3′-phosphate makes a large contribution to RNA binding. Furthermore, the RNA 3′-phosphate is poised for in-line attack on the P–N bond that links the phosphorous atom of AMP to Nε of His307. Thus, we provide the first insights into RNA 3′-phosphate termini recognition and the mechanism of 3′-phosphate activation by an Rtc enzyme. PMID:25161314

  19. RNApdbee--a webserver to derive secondary structures from pdb files of knotted and unknotted RNAs.

    PubMed

    Antczak, Maciej; Zok, Tomasz; Popenda, Mariusz; Lukasiak, Piotr; Adamiak, Ryszard W; Blazewicz, Jacek; Szachniuk, Marta

    2014-07-01

    In RNA structural biology and bioinformatics an access to correct RNA secondary structure and its proper representation is of crucial importance. This is true especially in the field of secondary and 3D RNA structure prediction. Here, we introduce RNApdbee-a new tool that allows to extract RNA secondary structure from the pdb file, and presents it in both textual and graphical form. RNApdbee supports processing of knotted and unknotted structures of large RNAs, also within protein complexes. The method works not only for first but also for high order pseudoknots, and gives an information about canonical and non-canonical base pairs. A combination of these features is unique among existing applications for RNA structure analysis. Additionally, a function of converting between the text notations, i.e. BPSEQ, CT and extended dot-bracket, is provided. In order to facilitate a more comprehensive study, the webserver integrates the functionality of RNAView, MC-Annotate and 3DNA/DSSR, being the most common tools used for automated identification and classification of RNA base pairs. RNApdbee is implemented as a publicly available webserver with an intuitive interface and can be freely accessed at http://rnapdbee.cs.put.poznan.pl/. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  20. Interspecific variation in mitochondrial serine transfer RNA (UCN) in Euptychiina butterflies (Lepidoptera: Satyrinae): structure and alignment.

    PubMed

    Marín, Mario Alejandro; López, Andrés; Uribe, Sandra Inés

    2012-06-01

    The nucleotide variation and structural patterns of mitochondrial RNA molecule have been proposed as useful tools in molecular systematics; however, their usefulness is always subject to a proper assessment of homology in the sequence alignment. The present study describes the secondary structure of mitochondrial tRNA for the amino acid serine (UCN) on 13 Euptychiina species and the evaluation of its potential use for evolutionary studies in this group of butterflies. The secondary structure of tRNAs showed variation among the included species except between Hermeuptychia sp1 and sp2. Variation was concentrated in the ribotimidina-pseudouridine-cystosine (TψC), dihydrouridine (DHU) and variable loops and in the DHU and TψC arms. These results suggest this region as a potential marker useful for taxonomic differentiation of species in this group and also confirm the importance of including information from the secondary structure of tRNA to optimize the alignments.

  1. PARTS: Probabilistic Alignment for RNA joinT Secondary structure prediction

    PubMed Central

    Harmanci, Arif Ozgun; Sharma, Gaurav; Mathews, David H.

    2008-01-01

    A novel method is presented for joint prediction of alignment and common secondary structures of two RNA sequences. The joint consideration of common secondary structures and alignment is accomplished by structural alignment over a search space defined by the newly introduced motif called matched helical regions. The matched helical region formulation generalizes previously employed constraints for structural alignment and thereby better accommodates the structural variability within RNA families. A probabilistic model based on pseudo free energies obtained from precomputed base pairing and alignment probabilities is utilized for scoring structural alignments. Maximum a posteriori (MAP) common secondary structures, sequence alignment and joint posterior probabilities of base pairing are obtained from the model via a dynamic programming algorithm called PARTS. The advantage of the more general structural alignment of PARTS is seen in secondary structure predictions for the RNase P family. For this family, the PARTS MAP predictions of secondary structures and alignment perform significantly better than prior methods that utilize a more restrictive structural alignment model. For the tRNA and 5S rRNA families, the richer structural alignment model of PARTS does not offer a benefit and the method therefore performs comparably with existing alternatives. For all RNA families studied, the posterior probability estimates obtained from PARTS offer an improvement over posterior probability estimates from a single sequence prediction. When considering the base pairings predicted over a threshold value of confidence, the combination of sensitivity and positive predictive value is superior for PARTS than for the single sequence prediction. PARTS source code is available for download under the GNU public license at http://rna.urmc.rochester.edu. PMID:18304945

  2. Secretion of RNA-Containing Extracellular Vesicles by the Porcine Whipworm, Trichuris suis.

    PubMed

    Hansen, Eline Palm; Kringel, Helene; Williams, Andrew R; Nejsum, Peter

    2015-06-01

    Trichuris suis is a common parasitic helminth of pigs. As with many other parasites, T. suis ensures its own survival by evading host immune responses, but little is known about how this is achieved. MicroRNAs (miRNA) have been shown to be involved in various immunological processes by post-transcriptional regulation of specific genes, and the potential of using these molecules as biomarkers of disease is currently being examined. It has recently been shown that parasites may secrete extracellular structures such as exosomes and microvesicles, containing proteins and miRNA. The fusion of these structures with host cells has been demonstrated, and a role of exosome-derived miRNA in host gene regulation has been suggested. In the present study, we show that exosome- and microvesicular-like structures are secreted by T. suis L1 larvae and also demonstrate the presence of miRNA-sized RNA inside these structures. A potential role of these molecules in host-parasite interactions is suggested. In addition, an electron-dense layer covering the surface of the larvae was observed, which may play a function in host immune evasion.

  3. Insights into molecular plasticity in protein complexes from Trm9-Trm112 tRNA modifying enzyme crystal structure

    PubMed Central

    Létoquart, Juliette; van Tran, Nhan; Caroline, Vonny; Aleksandrov, Alexey; Lazar, Noureddine; van Tilbeurgh, Herman; Liger, Dominique; Graille, Marc

    2015-01-01

    Most of the factors involved in translation (tRNA, rRNA and proteins) are subject to post-transcriptional and post-translational modifications, which participate in the fine-tuning and tight control of ribosome and protein synthesis processes. In eukaryotes, Trm112 acts as an obligate activating platform for at least four methyltransferases (MTase) involved in the modification of 18S rRNA (Bud23), tRNA (Trm9 and Trm11) and translation termination factor eRF1 (Mtq2). Trm112 is then at a nexus between ribosome synthesis and function. Here, we present a structure-function analysis of the Trm9-Trm112 complex, which is involved in the 5-methoxycarbonylmethyluridine (mcm5U) modification of the tRNA anticodon wobble position and hence promotes translational fidelity. We also compare the known crystal structures of various Trm112-MTase complexes, highlighting the structural plasticity allowing Trm112 to interact through a very similar mode with its MTase partners, although those share less than 20% sequence identity. PMID:26438534

  4. Fragile X mental retardation protein recognizes a G quadruplex structure within the survival motor neuron domain containing 1 mRNA 5'-UTR.

    PubMed

    McAninch, Damian S; Heinaman, Ashley M; Lang, Cara N; Moss, Kathryn R; Bassell, Gary J; Rita Mihailescu, Mihaela; Evans, Timothy L

    2017-07-25

    G quadruplex structures have been predicted by bioinformatics to form in the 5'- and 3'-untranslated regions (UTRs) of several thousand mature mRNAs and are believed to play a role in translation regulation. Elucidation of these roles has primarily been focused on the 3'-UTR, with limited focus on characterizing the G quadruplex structures and functions in the 5'-UTR. Investigation of the affinity and specificity of RNA binding proteins for 5'-UTR G quadruplexes and the resulting regulatory effects have also been limited. Among the mRNAs predicted to form a G quadruplex structure within the 5'-UTR is the survival motor neuron domain containing 1 (SMNDC1) mRNA, encoding a protein that is critical to the spliceosome. Additionally, this mRNA has been identified as a potential target of the fragile X mental retardation protein (FMRP), whose loss of expression leads to fragile X syndrome. FMRP is an RNA binding protein involved in translation regulation that has been shown to bind mRNA targets that form G quadruplex structures. In this study we have used biophysical methods to investigate G quadruplex formation in the 5'-UTR of SMNDC1 mRNA and analyzed its interactions with FMRP. Our results show that SMNDC1 mRNA 5'-UTR forms an intramolecular, parallel G quadruplex structure comprised of three G quartet planes, which is bound specifically by FMRP both in vitro and in mouse brain lysates. These findings suggest a model by which FMRP might regulate the translation of a subset of its mRNA targets by recognizing the G quadruplex structure present in their 5'-UTR, and affecting their accessibility by the protein synthesis machinery.

  5. Structural and biochemical studies on ATP binding and hydrolysis by the Escherichia coli RNA chaperone Hfq.

    PubMed

    Hämmerle, Hermann; Beich-Frandsen, Mads; Večerek, Branislav; Rajkowitsch, Lukas; Carugo, Oliviero; Djinović-Carugo, Kristina; Bläsi, Udo

    2012-01-01

    In Escherichia coli the RNA chaperone Hfq is involved in riboregulation by assisting base-pairing between small regulatory RNAs (sRNAs) and mRNA targets. Several structural and biochemical studies revealed RNA binding sites on either surface of the donut shaped Hfq-hexamer. Whereas sRNAs are believed to contact preferentially the YKH motifs present on the proximal site, poly(A)(15) and ADP were shown to bind to tripartite binding motifs (ARE) circularly positioned on the distal site. Hfq has been reported to bind and to hydrolyze ATP. Here, we present the crystal structure of a C-terminally truncated variant of E. coli Hfq (Hfq(65)) in complex with ATP, showing that it binds to the distal R-sites. In addition, we revisited the reported ATPase activity of full length Hfq purified to homogeneity. At variance with previous reports, no ATPase activity was observed for Hfq. In addition, FRET assays neither indicated an impact of ATP on annealing of two model oligoribonucleotides nor did the presence of ATP induce strand displacement. Moreover, ATP did not lead to destabilization of binary and ternary Hfq-RNA complexes, unless a vast stoichiometric excess of ATP was used. Taken together, these studies strongly suggest that ATP is dispensable for and does not interfere with Hfq-mediated RNA transactions.

  6. Analysis of the tertiary structure of the ribonuclease P ribozyme-substrate complex by site-specific photoaffinity crosslinking.

    PubMed Central

    Harris, M E; Kazantsev, A V; Chen, J L; Pace, N R

    1997-01-01

    Bacterial ribonuclease P (RNase P), an endonuclease involved in tRNA maturation, is a ribonucleoprotein containing a catalytic RNA. The secondary structure of this ribozyme is well-established, and a low-resolution model of the three-dimensional structure of the ribozyme-substrate complex has been proposed based on site-specific crosslinking and phylogenetic comparative data [Harris ME et al., 1994 EMBO J 13:3953-3963]. However, several substructures of that model were poorly constrained by the available data. In the present analysis, additional constraints between elements within the Escherichia coli RNase P RNA-pre-tRNA complex were determined by intra- and intermolecular crosslinking experiments. Circularly permuted RNase P RNAs were used to position an azidophenacyl photoactive crosslinking agent specifically at strategic sites within the ribozyme-substrate complex. Crosslink sites were mapped by primer extension and confirmed by analysis of the mobility of the crosslinked RNA lariats on denaturing acrylamide gels relative to circular and linear RNA standards. Crosslinked species generally retained significant catalytic activity, indicating that the results reflect the native ribozyme structure. The crosslinking results support the general configuration of the structure model and predicate new positions and orientations for helices that were previously poorly constrained by the data set. The expanded library of crosslinking constraints was used, together with secondary and tertiary structure identified by phylogenetic sequence comparisons, to refine significantly the model of RNase P RNA with bound substrate pre-tRNA. The crosslinking results and data from chemical-modification and mutational studies are discussed in the context of the current structural perspective on this ribozyme. PMID:9174092

  7. Structure and function of Zika virus NS5 protein: perspectives for drug design.

    PubMed

    Wang, Boxiao; Thurmond, Stephanie; Hai, Rong; Song, Jikui

    2018-05-01

    Zika virus (ZIKV) belongs to the positive-sense single-stranded RNA-containing Flaviviridae family. Its recent outbreak and association with human diseases (e.g. neurological disorders) have raised global health concerns, and an urgency to develop a therapeutic strategy against ZIKV infection. However, there is no currently approved antiviral against ZIKV. Here we present a comprehensive overview on recent progress in structure-function investigation of ZIKV NS5 protein, the largest non-structural protein of ZIKV, which is responsible for replication of the viral genome, RNA capping and suppression of host interferon responses. Structural comparison of the N-terminal methyltransferase domain and C-terminal RNA-dependent RNA polymerase domain of ZIKV NS5 with their counterparts from related viruses provides mechanistic insights into ZIKV NS5-mediated RNA replication, and identifies residues critical for its enzymatic activities. Finally, a collection of recently identified small molecule inhibitors against ZIKV NS5 or its closely related flavivirus homologues are also discussed.

  8. DSSR-enhanced visualization of nucleic acid structures in Jmol

    PubMed Central

    Hanson, Robert M.

    2017-01-01

    Abstract Sophisticated and interactive visualizations are essential for making sense of the intricate 3D structures of macromolecules. For proteins, secondary structural components are routinely featured in molecular graphics visualizations. However, the field of RNA structural bioinformatics is still lagging behind; for example, current molecular graphics tools lack built-in support even for base pairs, double helices, or hairpin loops. DSSR (Dissecting the Spatial Structure of RNA) is an integrated and automated command-line tool for the analysis and annotation of RNA tertiary structures. It calculates a comprehensive and unique set of features for characterizing RNA, as well as DNA structures. Jmol is a widely used, open-source Java viewer for 3D structures, with a powerful scripting language. JSmol, its reincarnation based on native JavaScript, has a predominant position in the post Java-applet era for web-based visualization of molecular structures. The DSSR-Jmol integration presented here makes salient features of DSSR readily accessible, either via the Java-based Jmol application itself, or its HTML5-based equivalent, JSmol. The DSSR web service accepts 3D coordinate files (in mmCIF or PDB format) initiated from a Jmol or JSmol session and returns DSSR-derived structural features in JSON format. This seamless combination of DSSR and Jmol/JSmol brings the molecular graphics of 3D RNA structures to a similar level as that for proteins, and enables a much deeper analysis of structural characteristics. It fills a gap in RNA structural bioinformatics, and is freely accessible (via the Jmol application or the JSmol-based website http://jmol.x3dna.org). PMID:28472503

  9. A Sponge-like Structure Involved in the Association and Transport of Maternal Products during Drosophila Oogenesis

    PubMed Central

    Wilsch-Bräuninger, Michaela; Schwarz, Heinz; Nüsslein-Volhard, Christiane

    1997-01-01

    Localization of maternally provided RNAs during oogenesis is required for formation of the antero–posterior axis of the Drosophila embryo. Here we describe a subcellular structure in nurse cells and oocytes which may function as an intracellular compartment for assembly and transport of maternal products involved in RNA localization. This structure, which we have termed “sponge body,” consists of ER-like cisternae, embedded in an amorphous electron-dense mass. It lacks a surrounding membrane and is frequently associated with mitochondria. The sponge bodies are not identical to the Golgi complexes. We suggest that the sponge bodies are homologous to the mitochondrial cloud in Xenopus oocytes, a granulo-fibrillar structure that contains RNAs involved in patterning of the embryo. Exuperantia protein, the earliest factor known to be required for the localization of bicoid mRNA to the anterior pole of the Drosophila oocyte, is highly enriched in the sponge bodies but not an essential structural component of these. RNA staining indicates that sponge bodies contain RNA. However, neither the intensity of this staining nor the accumulation of Exuperantia in the sponge bodies is dependent on the amount of bicoid mRNA present in the ovaries. Sponge bodies surround nuage, a possible polar granule precursor. Microtubules and microfilaments are not present in sponge bodies, although transport of the sponge bodies through the cells is implied by their presence in cytoplasmic bridges. We propose that the sponge bodies are structures that, by assembly and transport of included molecules or associated structures, are involved in localization of mRNAs in Drosophila oocytes. PMID:9348297

  10. Probing the Structures of Viral RNA Regulatory Elements with SHAPE and Related Methodologies

    PubMed Central

    Rausch, Jason W.; Sztuba-Solinska, Joanna; Le Grice, Stuart F. J.

    2018-01-01

    Viral RNAs were selected by evolution to possess maximum functionality in a minimal sequence. Depending on the classification of the virus and the type of RNA in question, viral RNAs must alternately be replicated, spliced, transcribed, transported from the nucleus into the cytoplasm, translated and/or packaged into nascent virions, and in most cases, provide the sequence and structural determinants to facilitate these processes. One consequence of this compact multifunctionality is that viral RNA structures can be exquisitely complex, often involving intermolecular interactions with RNA or protein, intramolecular interactions between sequence segments separated by several thousands of nucleotides, or specialized motifs such as pseudoknots or kissing loops. The fluidity of viral RNA structure can also present a challenge when attempting to characterize it, as genomic RNAs especially are likely to sample numerous conformations at various stages of the virus life cycle. Here we review advances in chemoenzymatic structure probing that have made it possible to address such challenges with respect to cis-acting elements, full-length viral genomes and long non-coding RNAs that play a major role in regulating viral gene expression. PMID:29375504

  11. Crystal structure of the human heterogeneous ribonucleoprotein A18 RNA-recognition motif

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Coburn, Katherine; Melville, Zephan; Aligholizadeh, Ehson

    The heterogeneous ribonucleoprotein A18 (hnRNP A18) is upregulated in hypoxic regions of various solid tumors and promotes tumor growthviathe coordination of mRNA transcripts associated with pro-survival genes. Thus, hnRNP A18 represents an important therapeutic target in tumor cells. Presented here is the first X-ray crystal structure to be reported for the RNA-recognition motif of hnRNP A18. By comparing this structure with those of homologous RNA-binding proteins (i.e.hnRNP A1), three residues on one face of an antiparallel β-sheet (Arg48, Phe50 and Phe52) and one residue in an unstructured loop (Arg41) were identified as likely to be involved in protein–nucleic acid interactions.more » This structure helps to serve as a foundation for biophysical studies of this RNA-binding protein and structure-based drug-design efforts for targeting hnRNP A18 in cancer, such as malignant melanoma, where hnRNP A18 levels are elevated and contribute to disease progression.« less

  12. Structural organizations of yeast RNase P and RNase MRP holoenzymes as revealed by UV-crosslinking studies of RNA–protein interactions

    PubMed Central

    Khanova, Elena; Esakova, Olga; Perederina, Anna; Berezin, Igor; Krasilnikov, Andrey S.

    2012-01-01

    Eukaryotic ribonuclease (RNase) P and RNase MRP are closely related ribonucleoprotein complexes involved in the metabolism of various RNA molecules including tRNA, rRNA, and some mRNAs. While evolutionarily related to bacterial RNase P, eukaryotic enzymes of the RNase P/MRP family are much more complex. Saccharomyces cerevisiae RNase P consists of a catalytic RNA component and nine essential proteins; yeast RNase MRP has an RNA component resembling that in RNase P and 10 essential proteins, most of which are shared with RNase P. The structural organizations of eukaryotic RNases P/MRP are not clear. Here we present the results of RNA–protein UV crosslinking studies performed on RNase P and RNase MRP holoenzymes isolated from yeast. The results indicate locations of specific protein-binding sites in the RNA components of RNase P and RNase MRP and shed light on the structural organizations of these large ribonucleoprotein complexes. PMID:22332141

  13. Structural basis for 16S ribosomal RNA cleavage by the cytotoxic domain of colicin E3.

    PubMed

    Ng, C Leong; Lang, Kathrin; Meenan, Nicola Ag; Sharma, Amit; Kelley, Ann C; Kleanthous, Colin; Ramakrishnan, V

    2010-10-01

    The toxin colicin E3 targets the 30S subunit of bacterial ribosomes and cleaves a phosphodiester bond in the decoding center. We present the crystal structure of the 70S ribosome in complex with the cytotoxic domain of colicin E3 (E3-rRNase). The structure reveals how the rRNase domain of colicin binds to the A site of the decoding center in the 70S ribosome and cleaves the 16S ribosomal RNA (rRNA) between A1493 and G1494. The cleavage mechanism involves the concerted action of conserved residues Glu62 and His58 of the cytotoxic domain of colicin E3. These residues activate the 16S rRNA for 2' OH-induced hydrolysis. Conformational changes observed for E3-rRNase, 16S rRNA and helix 69 of 23S rRNA suggest that a dynamic binding platform is required for colicin E3 binding and function.

  14. Structure of an Rrp6-RNA exosome complex bound to poly(A) RNA

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wasmuth, Elizabeth V.; Januszyk, Kurt; Lima, Christopher D.

    The eukaryotic RNA exosome processes and degrades RNA by directing substrates to the distributive or processive 3' to 5' exoribonuclease activities of Rrp6 or Rrp44, respectively. The non-catalytic nine-subunit exosome core (Exo9) features a prominent central channel. Although RNA can pass through the channel to engage Rrp44, it is not clear how RNA is directed to Rrp6 or whether Rrp6 uses the central channel. Here we report a 3.3 Å crystal structure of a ten-subunit RNA exosome complex from Saccharomyces cerevisiae composed of the Exo9 core and Rrp6 bound to single-stranded poly(A) RNA. The Rrp6 catalytic domain rests on topmore » of the Exo9 S1/KH ring above the central channel, the RNA 3' end is anchored in the Rrp6 active site, and the remaining RNA traverses the S1/KH ring in an opposite orientation to that observed in a structure of a Rrp44-containing exosome complex. Solution studies with human and yeast RNA exosome complexes suggest that the RNA path to Rrp6 is conserved and dependent on the integrity of the S1/KH ring. Although path selection to Rrp6 or Rrp44 is stochastic in vitro, the fate of a particular RNA may be determined in vivo by the manner in which cofactors present RNA to the RNA exosome.« less

  15. A Three-Dimensional RNA Motif in Potato spindle tuber viroid Mediates Trafficking from Palisade Mesophyll to Spongy Mesophyll in Nicotiana benthamiana[W

    PubMed Central

    Takeda, Ryuta; Petrov, Anton I.; Leontis, Neocles B.; Ding, Biao

    2011-01-01

    Cell-to-cell trafficking of RNA is an emerging biological principle that integrates systemic gene regulation, viral infection, antiviral response, and cell-to-cell communication. A key mechanistic question is how an RNA is specifically selected for trafficking from one type of cell into another type. Here, we report the identification of an RNA motif in Potato spindle tuber viroid (PSTVd) required for trafficking from palisade mesophyll to spongy mesophyll in Nicotiana benthamiana leaves. This motif, called loop 6, has the sequence 5′-CGA-3′...5′-GAC-3′ flanked on both sides by cis Watson-Crick G/C and G/U wobble base pairs. We present a three-dimensional (3D) structural model of loop 6 that specifies all non-Watson-Crick base pair interactions, derived by isostericity-based sequence comparisons with 3D RNA motifs from the RNA x-ray crystal structure database. The model is supported by available chemical modification patterns, natural sequence conservation/variations in PSTVd isolates and related species, and functional characterization of all possible mutants for each of the loop 6 base pairs. Our findings and approaches have broad implications for studying the 3D RNA structural motifs mediating trafficking of diverse RNA species across specific cellular boundaries and for studying the structure-function relationships of RNA motifs in other biological processes. PMID:21258006

  16. A three-dimensional RNA motif in Potato spindle tuber viroid mediates trafficking from palisade mesophyll to spongy mesophyll in Nicotiana benthamiana.

    PubMed

    Takeda, Ryuta; Petrov, Anton I; Leontis, Neocles B; Ding, Biao

    2011-01-01

    Cell-to-cell trafficking of RNA is an emerging biological principle that integrates systemic gene regulation, viral infection, antiviral response, and cell-to-cell communication. A key mechanistic question is how an RNA is specifically selected for trafficking from one type of cell into another type. Here, we report the identification of an RNA motif in Potato spindle tuber viroid (PSTVd) required for trafficking from palisade mesophyll to spongy mesophyll in Nicotiana benthamiana leaves. This motif, called loop 6, has the sequence 5'-CGA-3'...5'-GAC-3' flanked on both sides by cis Watson-Crick G/C and G/U wobble base pairs. We present a three-dimensional (3D) structural model of loop 6 that specifies all non-Watson-Crick base pair interactions, derived by isostericity-based sequence comparisons with 3D RNA motifs from the RNA x-ray crystal structure database. The model is supported by available chemical modification patterns, natural sequence conservation/variations in PSTVd isolates and related species, and functional characterization of all possible mutants for each of the loop 6 base pairs. Our findings and approaches have broad implications for studying the 3D RNA structural motifs mediating trafficking of diverse RNA species across specific cellular boundaries and for studying the structure-function relationships of RNA motifs in other biological processes.

  17. Footprinting analysis of interactions between the largest eukaryotic RNase P/MRP protein Pop1 and RNase P/MRP RNA components.

    PubMed

    Fagerlund, Robert D; Perederina, Anna; Berezin, Igor; Krasilnikov, Andrey S

    2015-09-01

    Ribonuclease (RNase) P and RNase MRP are closely related catalytic ribonucleoproteins involved in the metabolism of a wide range of RNA molecules, including tRNA, rRNA, and some mRNAs. The catalytic RNA component of eukaryotic RNase P retains the core elements of the bacterial RNase P ribozyme; however, the peripheral RNA elements responsible for the stabilization of the global architecture are largely absent in the eukaryotic enzyme. At the same time, the protein makeup of eukaryotic RNase P is considerably more complex than that of the bacterial RNase P. RNase MRP, an essential and ubiquitous eukaryotic enzyme, has a structural organization resembling that of eukaryotic RNase P, and the two enzymes share most of their protein components. Here, we present the results of the analysis of interactions between the largest protein component of yeast RNases P/MRP, Pop1, and the RNA moieties of the enzymes, discuss structural implications of the results, and suggest that Pop1 plays the role of a scaffold for the stabilization of the global architecture of eukaryotic RNase P RNA, substituting for the network of RNA-RNA tertiary interactions that maintain the global RNA structure in bacterial RNase P. © 2015 Fagerlund et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  18. Structure–function studies of STAR family Quaking proteins bound to their in vivo RNA target sites

    PubMed Central

    Teplova, Marianna; Hafner, Markus; Teplov, Dmitri; Essig, Katharina; Tuschl, Thomas; Patel, Dinshaw J.

    2013-01-01

    Mammalian Quaking (QKI) and its Caenorhabditis elegans homolog, GLD-1 (defective in germ line development), are evolutionarily conserved RNA-binding proteins, which post-transcriptionally regulate target genes essential for developmental processes and myelination. We present X-ray structures of the STAR (signal transduction and activation of RNA) domain, composed of Qua1, K homology (KH), and Qua2 motifs of QKI and GLD-1 bound to high-affinity in vivo RNA targets containing YUAAY RNA recognition elements (RREs). The KH and Qua2 motifs of the STAR domain synergize to specifically interact with bases and sugar-phosphate backbones of the bound RRE. Qua1-mediated homodimerization generates a scaffold that enables concurrent recognition of two RREs, thereby plausibly targeting tandem RREs present in many QKI-targeted transcripts. Structure-guided mutations reduced QKI RNA-binding affinity in vitro and in vivo, and expression of QKI mutants in human embryonic kidney cells (HEK293) significantly decreased the abundance of QKI target mRNAs. Overall, our studies define principles underlying RNA target selection by STAR homodimers and provide insights into the post-transcriptional regulatory function of mammalian QKI proteins. PMID:23630077

  19. Footprinting analysis of interactions between the largest eukaryotic RNase P/MRP protein Pop1 and RNase P/MRP RNA components

    PubMed Central

    Fagerlund, Robert D.; Perederina, Anna; Berezin, Igor; Krasilnikov, Andrey S.

    2015-01-01

    Ribonuclease (RNase) P and RNase MRP are closely related catalytic ribonucleoproteins involved in the metabolism of a wide range of RNA molecules, including tRNA, rRNA, and some mRNAs. The catalytic RNA component of eukaryotic RNase P retains the core elements of the bacterial RNase P ribozyme; however, the peripheral RNA elements responsible for the stabilization of the global architecture are largely absent in the eukaryotic enzyme. At the same time, the protein makeup of eukaryotic RNase P is considerably more complex than that of the bacterial RNase P. RNase MRP, an essential and ubiquitous eukaryotic enzyme, has a structural organization resembling that of eukaryotic RNase P, and the two enzymes share most of their protein components. Here, we present the results of the analysis of interactions between the largest protein component of yeast RNases P/MRP, Pop1, and the RNA moieties of the enzymes, discuss structural implications of the results, and suggest that Pop1 plays the role of a scaffold for the stabilization of the global architecture of eukaryotic RNase P RNA, substituting for the network of RNA–RNA tertiary interactions that maintain the global RNA structure in bacterial RNase P. PMID:26135751

  20. Customized workflow development and data modularization concepts for RNA-Sequencing and metatranscriptome experiments.

    PubMed

    Lott, Steffen C; Wolfien, Markus; Riege, Konstantin; Bagnacani, Andrea; Wolkenhauer, Olaf; Hoffmann, Steve; Hess, Wolfgang R

    2017-11-10

    RNA-Sequencing (RNA-Seq) has become a widely used approach to study quantitative and qualitative aspects of transcriptome data. The variety of RNA-Seq protocols, experimental study designs and the characteristic properties of the organisms under investigation greatly affect downstream and comparative analyses. In this review, we aim to explain the impact of structured pre-selection, classification and integration of best-performing tools within modularized data analysis workflows and ready-to-use computing infrastructures towards experimental data analyses. We highlight examples for workflows and use cases that are presented for pro-, eukaryotic and mixed dual RNA-Seq (meta-transcriptomics) experiments. In addition, we are summarizing the expertise of the laboratories participating in the project consortium "Structured Analysis and Integration of RNA-Seq experiments" (de.STAIR) and its integration with the Galaxy-workbench of the RNA Bioinformatics Center (RBC). Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.

  1. A divergent Pumilio repeat protein family for pre-rRNA processing and mRNA localization

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Qiu, Chen; McCann, Kathleen L.; Wine, Robert N.

    Pumilio/feminization of XX and XO animals (fem)-3 mRNA-binding factor (PUF) proteins bind sequence specifically to mRNA targets using a single-stranded RNA-binding domain comprising eight Pumilio (PUM) repeats. PUM repeats have now been identified in proteins that function in pre-rRNA processing, including human Puf-A and yeast Puf6. This is a role not previously ascribed to PUF proteins. In this paper we present crystal structures of human Puf-A that reveal a class of nucleic acid-binding proteins with 11 PUM repeats arranged in an “L”-like shape. In contrast to classical PUF proteins, Puf-A forms sequence-independent interactions with DNA or RNA, mediated by conservedmore » basic residues. We demonstrate that equivalent basic residues in yeast Puf6 are important for RNA binding, pre-rRNA processing, and mRNA localization. Finally, PUM repeats can be assembled into alternative folds that bind to structured nucleic acids in addition to forming canonical eight-repeat crescent-shaped RNA-binding domains found in classical PUF proteins.« less

  2. A divergent Pumilio repeat protein family for pre-rRNA processing and mRNA localization

    DOE PAGES

    Qiu, Chen; McCann, Kathleen L.; Wine, Robert N.; ...

    2014-12-15

    Pumilio/feminization of XX and XO animals (fem)-3 mRNA-binding factor (PUF) proteins bind sequence specifically to mRNA targets using a single-stranded RNA-binding domain comprising eight Pumilio (PUM) repeats. PUM repeats have now been identified in proteins that function in pre-rRNA processing, including human Puf-A and yeast Puf6. This is a role not previously ascribed to PUF proteins. In this paper we present crystal structures of human Puf-A that reveal a class of nucleic acid-binding proteins with 11 PUM repeats arranged in an “L”-like shape. In contrast to classical PUF proteins, Puf-A forms sequence-independent interactions with DNA or RNA, mediated by conservedmore » basic residues. We demonstrate that equivalent basic residues in yeast Puf6 are important for RNA binding, pre-rRNA processing, and mRNA localization. Finally, PUM repeats can be assembled into alternative folds that bind to structured nucleic acids in addition to forming canonical eight-repeat crescent-shaped RNA-binding domains found in classical PUF proteins.« less

  3. RCrane: semi-automated RNA model building.

    PubMed

    Keating, Kevin S; Pyle, Anna Marie

    2012-08-01

    RNA crystals typically diffract to much lower resolutions than protein crystals. This low-resolution diffraction results in unclear density maps, which cause considerable difficulties during the model-building process. These difficulties are exacerbated by the lack of computational tools for RNA modeling. Here, RCrane, a tool for the partially automated building of RNA into electron-density maps of low or intermediate resolution, is presented. This tool works within Coot, a common program for macromolecular model building. RCrane helps crystallographers to place phosphates and bases into electron density and then automatically predicts and builds the detailed all-atom structure of the traced nucleotides. RCrane then allows the crystallographer to review the newly built structure and select alternative backbone conformations where desired. This tool can also be used to automatically correct the backbone structure of previously built nucleotides. These automated corrections can fix incorrect sugar puckers, steric clashes and other structural problems.

  4. Reducing the worst case running times of a family of RNA and CFG problems, using Valiant's approach.

    PubMed

    Zakov, Shay; Tsur, Dekel; Ziv-Ukelson, Michal

    2011-08-18

    RNA secondary structure prediction is a mainstream bioinformatic domain, and is key to computational analysis of functional RNA. In more than 30 years, much research has been devoted to defining different variants of RNA structure prediction problems, and to developing techniques for improving prediction quality. Nevertheless, most of the algorithms in this field follow a similar dynamic programming approach as that presented by Nussinov and Jacobson in the late 70's, which typically yields cubic worst case running time algorithms. Recently, some algorithmic approaches were applied to improve the complexity of these algorithms, motivated by new discoveries in the RNA domain and by the need to efficiently analyze the increasing amount of accumulated genome-wide data. We study Valiant's classical algorithm for Context Free Grammar recognition in sub-cubic time, and extract features that are common to problems on which Valiant's approach can be applied. Based on this, we describe several problem templates, and formulate generic algorithms that use Valiant's technique and can be applied to all problems which abide by these templates, including many problems within the world of RNA Secondary Structures and Context Free Grammars. The algorithms presented in this paper improve the theoretical asymptotic worst case running time bounds for a large family of important problems. It is also possible that the suggested techniques could be applied to yield a practical speedup for these problems. For some of the problems (such as computing the RNA partition function and base-pair binding probabilities), the presented techniques are the only ones which are currently known for reducing the asymptotic running time bounds of the standard algorithms.

  5. Reducing the worst case running times of a family of RNA and CFG problems, using Valiant's approach

    PubMed Central

    2011-01-01

    Background RNA secondary structure prediction is a mainstream bioinformatic domain, and is key to computational analysis of functional RNA. In more than 30 years, much research has been devoted to defining different variants of RNA structure prediction problems, and to developing techniques for improving prediction quality. Nevertheless, most of the algorithms in this field follow a similar dynamic programming approach as that presented by Nussinov and Jacobson in the late 70's, which typically yields cubic worst case running time algorithms. Recently, some algorithmic approaches were applied to improve the complexity of these algorithms, motivated by new discoveries in the RNA domain and by the need to efficiently analyze the increasing amount of accumulated genome-wide data. Results We study Valiant's classical algorithm for Context Free Grammar recognition in sub-cubic time, and extract features that are common to problems on which Valiant's approach can be applied. Based on this, we describe several problem templates, and formulate generic algorithms that use Valiant's technique and can be applied to all problems which abide by these templates, including many problems within the world of RNA Secondary Structures and Context Free Grammars. Conclusions The algorithms presented in this paper improve the theoretical asymptotic worst case running time bounds for a large family of important problems. It is also possible that the suggested techniques could be applied to yield a practical speedup for these problems. For some of the problems (such as computing the RNA partition function and base-pair binding probabilities), the presented techniques are the only ones which are currently known for reducing the asymptotic running time bounds of the standard algorithms. PMID:21851589

  6. Structural basis for recognition of human 7SK long noncoding RNA by the La-related protein Larp7.

    PubMed

    Eichhorn, Catherine D; Yang, Yuan; Repeta, Lucas; Feigon, Juli

    2018-06-26

    The La and the La-related protein (LARP) superfamily is a diverse class of RNA binding proteins involved in RNA processing, folding, and function. Larp7 binds to the abundant long noncoding 7SK RNA and is required for 7SK ribonucleoprotein (RNP) assembly and function. The 7SK RNP sequesters a pool of the positive transcription elongation factor b (P-TEFb) in an inactive state; on release, P-TEFb phosphorylates RNA Polymerase II to stimulate transcription elongation. Despite its essential role in transcription, limited structural information is available for the 7SK RNP, particularly for protein-RNA interactions. Larp7 contains an N-terminal La module that binds UUU-3'OH and a C-terminal atypical RNA recognition motif (xRRM) required for specific binding to 7SK and P-TEFb assembly. Deletion of the xRRM is linked to gastric cancer in humans. We report the 2.2-Å X-ray crystal structure of the human La-related protein group 7 (hLarp7) xRRM bound to the 7SK stem-loop 4, revealing a unique binding interface. Contributions of observed interactions to binding affinity were investigated by mutagenesis and isothermal titration calorimetry. NMR 13 C spin relaxation data and comparison of free xRRM, RNA, and xRRM-RNA structures show that the xRRM is preordered to bind a flexible loop 4. Combining structures of the hLarp7 La module and the xRRM-7SK complex presented here, we propose a structural model for Larp7 binding to the 7SK 3' end and mechanism for 7SK RNP assembly. This work provides insight into how this domain contributes to 7SK recognition and assembly of the core 7SK RNP.

  7. Coarse-grained modeling of RNA 3D structure.

    PubMed

    Dawson, Wayne K; Maciejczyk, Maciej; Jankowska, Elzbieta J; Bujnicki, Janusz M

    2016-07-01

    Functional RNA molecules depend on three-dimensional (3D) structures to carry out their tasks within the cell. Understanding how these molecules interact to carry out their biological roles requires a detailed knowledge of RNA 3D structure and dynamics as well as thermodynamics, which strongly governs the folding of RNA and RNA-RNA interactions as well as a host of other interactions within the cellular environment. Experimental determination of these properties is difficult, and various computational methods have been developed to model the folding of RNA 3D structures and their interactions with other molecules. However, computational methods also have their limitations, especially when the biological effects demand computation of the dynamics beyond a few hundred nanoseconds. For the researcher confronted with such challenges, a more amenable approach is to resort to coarse-grained modeling to reduce the number of data points and computational demand to a more tractable size, while sacrificing as little critical information as possible. This review presents an introduction to the topic of coarse-grained modeling of RNA 3D structures and dynamics, covering both high- and low-resolution strategies. We discuss how physics-based approaches compare with knowledge based methods that rely on databases of information. In the course of this review, we discuss important aspects in the reasoning process behind building different models and the goals and pitfalls that can result. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.

  8. Fast, clash-free RNA conformational morphing using molecular junctions

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Heliou, Amelie; Budday, Dominik; Fonseca, Rasmus

    Non-coding ribonucleic acids (ncRNA) are functional RNA molecules that are not translated into protein. They are extremely dynamic, adopting diverse conformational substates, which enables them to modulate their interaction with a large number of other molecules. The flexibility of ncRNA provides a challenge for probing their complex 3D conformational landscape, both experimentally and computationally. As a result, despite their conformational diversity, ncRNAs mostly preserve their secondary structure throughout the dynamic ensemble. Here we present a kinematics-based procedure to morph an RNA molecule between conformational substates, while avoiding inter-atomic clashes. We represent an RNA as a kinematic linkage, with fixed groupsmore » of atoms as rigid bodies and rotatable bonds as degrees of freedom. Our procedure maintains RNA secondary structure by treating hydrogen bonds between base pairs as constraints. The constraints define a lower-dimensional, secondary-structure constraint manifold in conformation space, where motions are largely governed by molecular junctions of unpaired nucleotides. On a large benchmark set, we show that our morphing procedure compares favorably to peer algorithms, and can approach goal conformations to within a low all-atom RMSD by directing fewer than 1% of its atoms. Furthermore, our results suggest that molecular junctions can modulate 3D structural rearrangements, while secondary structure elements guide large parts of the molecule along the transition to the correct final conformation.« less

  9. Secondary structure prediction for complete rDNA sequences (18S, 5.8S, and 28S rDNA) of Demodex folliculorum, and comparison of divergent domains structures across Acari.

    PubMed

    Zhao, Ya-E; Wang, Zheng-Hang; Xu, Yang; Wu, Li-Ping; Hu, Li

    2013-10-01

    According to base pairing, the rRNA folds into corresponding secondary structures, which contain additional phylogenetic information. On the basis of sequencing for complete rDNA sequences (18S, ITS1, 5.8S, ITS2 and 28S rDNA) of Demodex, we predicted the secondary structure of the complete rDNA sequence (18S, 5.8S, and 28S rDNA) of Demodex folliculorum, which was in concordance with that of the main arthropod lineages in past studies. And together with the sequence data from GenBank, we also predicted the secondary structures of divergent domains in SSU rRNA of 51 species and in LSU rRNA of 43 species from four superfamilies in Acari (Cheyletoidea, Tetranychoidea, Analgoidea and Ixodoidea). The multiple alignment among the four superfamilies in Acari showed that, insertions from Tetranychoidea SSU rRNA formed two newly proposed helixes, and helix c3-2b of LSU rRNA was absent in Demodex (Cheyletoidea) taxa. Generally speaking, LSU rRNA presented more remarkable differences than SSU rRNA did, mainly in D2, D3, D5, D7a, D7b, D8 and D10. Copyright © 2013 Elsevier Inc. All rights reserved.

  10. Fast, clash-free RNA conformational morphing using molecular junctions

    DOE PAGES

    Heliou, Amelie; Budday, Dominik; Fonseca, Rasmus; ...

    2017-03-13

    Non-coding ribonucleic acids (ncRNA) are functional RNA molecules that are not translated into protein. They are extremely dynamic, adopting diverse conformational substates, which enables them to modulate their interaction with a large number of other molecules. The flexibility of ncRNA provides a challenge for probing their complex 3D conformational landscape, both experimentally and computationally. As a result, despite their conformational diversity, ncRNAs mostly preserve their secondary structure throughout the dynamic ensemble. Here we present a kinematics-based procedure to morph an RNA molecule between conformational substates, while avoiding inter-atomic clashes. We represent an RNA as a kinematic linkage, with fixed groupsmore » of atoms as rigid bodies and rotatable bonds as degrees of freedom. Our procedure maintains RNA secondary structure by treating hydrogen bonds between base pairs as constraints. The constraints define a lower-dimensional, secondary-structure constraint manifold in conformation space, where motions are largely governed by molecular junctions of unpaired nucleotides. On a large benchmark set, we show that our morphing procedure compares favorably to peer algorithms, and can approach goal conformations to within a low all-atom RMSD by directing fewer than 1% of its atoms. Furthermore, our results suggest that molecular junctions can modulate 3D structural rearrangements, while secondary structure elements guide large parts of the molecule along the transition to the correct final conformation.« less

  11. Computational methods for prediction of RNA interactions with metal ions and small organic ligands.

    PubMed

    Philips, Anna; Łach, Grzegorz; Bujnicki, Janusz M

    2015-01-01

    In the recent years, it has become clear that a wide range of regulatory functions in bacteria are performed by riboswitches--regions of mRNA that change their structure upon external stimuli. Riboswitches are therefore attractive targets for drug design, molecular engineering, and fundamental research on regulatory circuitry of living cells. Several mechanisms are known for riboswitches controlling gene expression, but most of them perform their roles by ligand binding. As with other macromolecules, knowledge of the 3D structure of riboswitches is crucial for the understanding of their function. The development of experimental methods allowed for investigation of RNA structure and its complexes with ligands (which are either riboswitches' substrates or inhibitors) and metal cations (which stabilize the structure and are also known to be riboswitches' inhibitors). The experimental probing of different states of riboswitches is however time consuming, costly, and difficult to resolve without theoretical support. The natural consequence is the use of computational methods at least for initial research, such as the prediction of putative binding sites of ligands or metal ions. Here, we present a review on such methods, with a special focus on knowledge-based methods developed in our laboratory: LigandRNA--a scoring function for the prediction of RNA-small molecule interactions and MetalionRNA--a predictor of metal ions-binding sites in RNA structures. Both programs are available free of charge as a Web servers, LigandRNA at http://ligandrna.genesilico.pl and MetalionRNA at http://metalionrna.genesilico.pl/. © 2015 Elsevier Inc. All rights reserved.

  12. A Grammatical Approach to RNA-RNA Interaction Prediction

    NASA Astrophysics Data System (ADS)

    Kato, Yuki; Akutsu, Tatsuya; Seki, Hiroyuki

    2007-11-01

    Much attention has been paid to two interacting RNA molecules involved in post-transcriptional control of gene expression. Although there have been a few studies on RNA-RNA interaction prediction based on dynamic programming algorithm, no grammar-based approach has been proposed. The purpose of this paper is to provide a new modeling for RNA-RNA interaction based on multiple context-free grammar (MCFG). We present a polynomial time parsing algorithm for finding the most likely derivation tree for the stochastic version of MCFG, which is applicable to RNA joint secondary structure prediction including kissing hairpin loops. Also, elementary tests on RNA-RNA interaction prediction have shown that the proposed method is comparable to Alkan et al.'s method.

  13. Characterization and visualization of RNA secondary structure Boltzmann ensemble via information theory.

    PubMed

    Lin, Luan; McKerrow, Wilson H; Richards, Bryce; Phonsom, Chukiat; Lawrence, Charles E

    2018-03-05

    The nearest neighbor model and associated dynamic programming algorithms allow for the efficient estimation of the RNA secondary structure Boltzmann ensemble. However because a given RNA secondary structure only contains a fraction of the possible helices that could form from a given sequence, the Boltzmann ensemble is multimodal. Several methods exist for clustering structures and finding those modes. However less focus is given to exploring the underlying reasons for this multimodality: the presence of conflicting basepairs. Information theory, or more specifically mutual information, provides a method to identify those basepairs that are key to the secondary structure. To this end we find most informative basepairs and visualize the effect of these basepairs on the secondary structure. Knowing whether a most informative basepair is present tells us not only the status of the particular pair but also provides a large amount of information about which other pairs are present or not present. We find that a few basepairs account for a large amount of the structural uncertainty. The identification of these pairs indicates small changes to sequence or stability that will have a large effect on structure. We provide a novel algorithm that uses mutual information to identify the key basepairs that lead to a multimodal Boltzmann distribution. We then visualize the effect of these pairs on the overall Boltzmann ensemble.

  14. The Aminoacyl-tRNA Synthetase Complex.

    PubMed

    Mirande, Marc

    2017-01-01

    Aminoacyl-tRNA synthetases (AARSs) are essential enzymes that specifically aminoacylate one tRNA molecule by the cognate amino acid. They are a family of twenty enzymes, one for each amino acid. By coupling an amino acid to a specific RNA triplet, the anticodon, they are responsible for interpretation of the genetic code. In addition to this translational, canonical role, several aminoacyl-tRNA synthetases also fulfill nontranslational, moonlighting functions. In mammals, nine synthetases, those specific for amino acids Arg, Asp, Gln, Glu, Ile, Leu, Lys, Met and Pro, associate into a multi-aminoacyl-tRNA synthetase complex, an association which is believed to play a key role in the cellular organization of translation, but also in the regulation of the translational and nontranslational functions of these enzymes. Because the balance between their alternative functions rests on the assembly and disassembly of this supramolecular entity, it is essential to get precise insight into the structural organization of this complex. The high-resolution 3D-structure of the native particle, with a molecular weight of about 1.5 MDa, is not yet known. Low-resolution structures of the multi-aminoacyl-tRNA synthetase complex, as determined by cryo-EM or SAXS, have been reported. High-resolution data have been reported for individual enzymes of the complex, or for small subcomplexes. This review aims to present a critical view of our present knowledge of the aminoacyl-tRNA synthetase complex in 3D. These preliminary data shed some light on the mechanisms responsible for the balance between the translational and nontranslational functions of some of its components.

  15. DSSR-enhanced visualization of nucleic acid structures in Jmol.

    PubMed

    Hanson, Robert M; Lu, Xiang-Jun

    2017-07-03

    Sophisticated and interactive visualizations are essential for making sense of the intricate 3D structures of macromolecules. For proteins, secondary structural components are routinely featured in molecular graphics visualizations. However, the field of RNA structural bioinformatics is still lagging behind; for example, current molecular graphics tools lack built-in support even for base pairs, double helices, or hairpin loops. DSSR (Dissecting the Spatial Structure of RNA) is an integrated and automated command-line tool for the analysis and annotation of RNA tertiary structures. It calculates a comprehensive and unique set of features for characterizing RNA, as well as DNA structures. Jmol is a widely used, open-source Java viewer for 3D structures, with a powerful scripting language. JSmol, its reincarnation based on native JavaScript, has a predominant position in the post Java-applet era for web-based visualization of molecular structures. The DSSR-Jmol integration presented here makes salient features of DSSR readily accessible, either via the Java-based Jmol application itself, or its HTML5-based equivalent, JSmol. The DSSR web service accepts 3D coordinate files (in mmCIF or PDB format) initiated from a Jmol or JSmol session and returns DSSR-derived structural features in JSON format. This seamless combination of DSSR and Jmol/JSmol brings the molecular graphics of 3D RNA structures to a similar level as that for proteins, and enables a much deeper analysis of structural characteristics. It fills a gap in RNA structural bioinformatics, and is freely accessible (via the Jmol application or the JSmol-based website http://jmol.x3dna.org). © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  16. The Ribosomal RNA is a Useful Marker to Visualize Rhizobia Interacting with Legume Plants

    ERIC Educational Resources Information Center

    Rinaudi, Luciana; Isola, Maria C.; Giordano, Walter

    2004-01-01

    Symbiosis between rhizobia and leguminous plants leads to the formation of nitrogen-fixing root nodules. In the present article, we recommend the use of the ribosomal RNA (rRNA) isolated from legume nodules in an experimental class with the purpose of introducing students to the structure of eukaryotic and prokaryotic ribosomes and of…

  17. Structure and variation of the mitochondrial genome of fishes.

    PubMed

    Satoh, Takashi P; Miya, Masaki; Mabuchi, Kohji; Nishida, Mutsumi

    2016-09-07

    The mitochondrial (mt) genome has been used as an effective tool for phylogenetic and population genetic analyses in vertebrates. However, the structure and variability of the vertebrate mt genome are not well understood. A potential strategy for improving our understanding is to conduct a comprehensive comparative study of large mt genome data. The aim of this study was to characterize the structure and variability of the fish mt genome through comparative analysis of large datasets. An analysis of the secondary structure of proteins for 250 fish species (248 ray-finned and 2 cartilaginous fishes) illustrated that cytochrome c oxidase subunits (COI, COII, and COIII) and a cytochrome bc1 complex subunit (Cyt b) had substantial amino acid conservation. Among the four proteins, COI was the most conserved, as more than half of all amino acid sites were invariable among the 250 species. Our models identified 43 and 58 stems within 12S rRNA and 16S rRNA, respectively, with larger numbers than proposed previously for vertebrates. The models also identified 149 and 319 invariable sites in 12S rRNA and 16S rRNA, respectively, in all fishes. In particular, the present result verified that a region corresponding to the peptidyl transferase center in prokaryotic 23S rRNA, which is homologous to mt 16S rRNA, is also conserved in fish mt 16S rRNA. Concerning the gene order, we found 35 variations (in 32 families) that deviated from the common gene order in vertebrates. These gene rearrangements were mostly observed in the area spanning the ND5 gene to the control region as well as two tRNA gene cluster regions (IQM and WANCY regions). Although many of such gene rearrangements were unique to a specific taxon, some were shared polyphyletically between distantly related species. Through a large-scale comparative analysis of 250 fish species mt genomes, we elucidated various structural aspects of the fish mt genome and the encoded genes. The present results will be important for understanding functions of the mt genome and developing programs for nucleotide sequence analysis. This study demonstrated the significance of extensive comparisons for understanding the structure of the mt genome.

  18. New Era of Studying RNA Secondary Structure and Its Influence on Gene Regulation in Plants.

    PubMed

    Yang, Xiaofei; Yang, Minglei; Deng, Hongjing; Ding, Yiliang

    2018-01-01

    The dynamic structure of RNA plays a central role in post-transcriptional regulation of gene expression such as RNA maturation, degradation, and translation. With the rise of next-generation sequencing, the study of RNA structure has been transformed from in vitro low-throughput RNA structure probing methods to in vivo high-throughput RNA structure profiling. The development of these methods enables incremental studies on the function of RNA structure to be performed, revealing new insights of novel regulatory mechanisms of RNA structure in plants. Genome-wide scale RNA structure profiling allows us to investigate general RNA structural features over 10s of 1000s of mRNAs and to compare RNA structuromes between plant species. Here, we provide a comprehensive and up-to-date overview of: (i) RNA structure probing methods; (ii) the biological functions of RNA structure; (iii) genome-wide RNA structural features corresponding to their regulatory mechanisms; and (iv) RNA structurome evolution in plants.

  19. Helix-length compensation studies reveal the adaptability of the VS ribozyme architecture.

    PubMed

    Lacroix-Labonté, Julie; Girard, Nicolas; Lemieux, Sébastien; Legault, Pascale

    2012-03-01

    Compensatory mutations in RNA are generally regarded as those that maintain base pairing, and their identification forms the basis of phylogenetic predictions of RNA secondary structure. However, other types of compensatory mutations can provide higher-order structural and evolutionary information. Here, we present a helix-length compensation study for investigating structure-function relationships in RNA. The approach is demonstrated for stem-loop I and stem-loop V of the Neurospora VS ribozyme, which form a kissing-loop interaction important for substrate recognition. To rapidly characterize the substrate specificity (k(cat)/K(M)) of several substrate/ribozyme pairs, a procedure was established for simultaneous kinetic characterization of multiple substrates. Several active substrate/ribozyme pairs were identified, indicating the presence of limited substrate promiscuity for stem Ib variants and helix-length compensation between stems Ib and V. 3D models of the I/V interaction were generated that are compatible with the kinetic data. These models further illustrate the adaptability of the VS ribozyme architecture for substrate cleavage and provide global structural information on the I/V kissing-loop interaction. By exploring higher-order compensatory mutations in RNA our approach brings a deeper understanding of the adaptability of RNA structure, while opening new avenues for RNA research.

  20. The nucleolus: a raft adrift in the nuclear sea or the keystone in nuclear structure?

    PubMed Central

    O’Sullivan, Justin M.; Pai, Dave A.; Cridge, Andrew G.; Engelke, David R.; Ganley, Austen R. D.

    2016-01-01

    The nucleolus is a prominent nuclear structure that is the site of ribosomal RNA (rRNA) transcription, and hence ribosome biogenesis. Cellular demand for ribosomes, and hence rRNA, is tightly linked to cell growth and the rRNA makes up the majority of all the RNA within a cell. To fulfil the cellular demand for rRNA, the ribosomal RNA genes (rDNA) genes are amplified to high copy number and transcribed at very high rates. As such, understanding the rDNA has profound consequences for our comprehension of genome and transcriptional organization in cells. In this review we address the question of whether the nucleolus is a raft adrift the sea of nuclear DNA, or actively contributes to genome organization. We present evidence supporting the idea that the nucleolus, and the rDNA contained therein, play more roles in the biology of the cell than simply ribosome biogenesis. We propose that the nucleolus and the rDNA are central factors in the spatial organization of the genome, and that rapid alterations in nucleolar structure in response to changing conditions manifest themselves in altered genomic structures that have functional consequences. Finally, we discuss some predictions that result from the nucleolus having a central role in nuclear organization. PMID:25436580

  1. Structure of the Nucleoprotein Binding Domain of Mokola Virus Phosphoprotein▿

    PubMed Central

    Assenberg, René; Delmas, Olivier; Ren, Jingshan; Vidalain, Pierre-Olivier; Verma, Anil; Larrous, Florence; Graham, Stephen C.; Tangy, Frédéric; Grimes, Jonathan M.; Bourhy, Hervé

    2010-01-01

    Mokola virus (MOKV) is a nonsegmented, negative-sense RNA virus that belongs to the Lyssavirus genus and Rhabdoviridae family. MOKV phosphoprotein P is an essential component of the replication and transcription complex and acts as a cofactor for the viral RNA-dependent RNA polymerase. P recruits the viral polymerase to the nucleoprotein-bound viral RNA (N-RNA) via an interaction between its C-terminal domain and the N-RNA complex. Here we present a structure for this domain of MOKV P, obtained by expression of full-length P in Escherichia coli, which was subsequently truncated during crystallization. The structure has a high degree of homology with P of rabies virus, another member of Lyssavirus genus, and to a lesser degree with P of vesicular stomatitis virus (VSV), a member of the related Vesiculovirus genus. In addition, analysis of the crystal packing of this domain reveals a potential binding site for the nucleoprotein N. Using both site-directed mutagenesis and yeast two-hybrid experiments to measure P-N interaction, we have determined the relative roles of key amino acids involved in this interaction to map the region of P that binds N. This analysis also reveals a structural relationship between the N-RNA binding domain of the P proteins of the Rhabdoviridae and the Paramyxoviridae. PMID:19906936

  2. The nucleolus: a raft adrift in the nuclear sea or the keystone in nuclear structure?

    PubMed

    O'Sullivan, Justin M; Pai, Dave A; Cridge, Andrew G; Engelke, David R; Ganley, Austen R D

    2013-06-01

    The nucleolus is a prominent nuclear structure that is the site of ribosomal RNA (rRNA) transcription, and hence ribosome biogenesis. Cellular demand for ribosomes, and hence rRNA, is tightly linked to cell growth and the rRNA makes up the majority of all the RNA within a cell. To fulfill the cellular demand for rRNA, the ribosomal RNA (rDNA) genes are amplified to high copy number and transcribed at very high rates. As such, understanding the rDNA has profound consequences for our comprehension of genome and transcriptional organization in cells. In this review, we address the question of whether the nucleolus is a raft adrift the sea of nuclear DNA, or actively contributes to genome organization. We present evidence supporting the idea that the nucleolus, and the rDNA contained therein, play more roles in the biology of the cell than simply ribosome biogenesis. We propose that the nucleolus and the rDNA are central factors in the spatial organization of the genome, and that rapid alterations in nucleolar structure in response to changing conditions manifest themselves in altered genomic structures that have functional consequences. Finally, we discuss some predictions that result from the nucleolus having a central role in nuclear organization.

  3. Structural and molecular basis of mismatch correction and ribavirin excision from coronavirus RNA.

    PubMed

    Ferron, François; Subissi, Lorenzo; Silveira De Morais, Ana Theresa; Le, Nhung Thi Tuyet; Sevajol, Marion; Gluais, Laure; Decroly, Etienne; Vonrhein, Clemens; Bricogne, Gérard; Canard, Bruno; Imbert, Isabelle

    2018-01-09

    Coronaviruses (CoVs) stand out among RNA viruses because of their unusually large genomes (∼30 kb) associated with low mutation rates. CoVs code for nsp14, a bifunctional enzyme carrying RNA cap guanine N7-methyltransferase (MTase) and 3'-5' exoribonuclease (ExoN) activities. ExoN excises nucleotide mismatches at the RNA 3'-end in vitro, and its inactivation in vivo jeopardizes viral genetic stability. Here, we demonstrate for severe acute respiratory syndrome (SARS)-CoV an RNA synthesis and proofreading pathway through association of nsp14 with the low-fidelity nsp12 viral RNA polymerase. Through this pathway, the antiviral compound ribavirin 5'-monophosphate is significantly incorporated but also readily excised from RNA, which may explain its limited efficacy in vivo. The crystal structure at 3.38 Å resolution of SARS-CoV nsp14 in complex with its cofactor nsp10 adds to the uniqueness of CoVs among RNA viruses: The MTase domain presents a new fold that differs sharply from the canonical Rossmann fold.

  4. Insights into molecular plasticity in protein complexes from Trm9-Trm112 tRNA modifying enzyme crystal structure.

    PubMed

    Létoquart, Juliette; van Tran, Nhan; Caroline, Vonny; Aleksandrov, Alexey; Lazar, Noureddine; van Tilbeurgh, Herman; Liger, Dominique; Graille, Marc

    2015-12-15

    Most of the factors involved in translation (tRNA, rRNA and proteins) are subject to post-transcriptional and post-translational modifications, which participate in the fine-tuning and tight control of ribosome and protein synthesis processes. In eukaryotes, Trm112 acts as an obligate activating platform for at least four methyltransferases (MTase) involved in the modification of 18S rRNA (Bud23), tRNA (Trm9 and Trm11) and translation termination factor eRF1 (Mtq2). Trm112 is then at a nexus between ribosome synthesis and function. Here, we present a structure-function analysis of the Trm9-Trm112 complex, which is involved in the 5-methoxycarbonylmethyluridine (mcm(5)U) modification of the tRNA anticodon wobble position and hence promotes translational fidelity. We also compare the known crystal structures of various Trm112-MTase complexes, highlighting the structural plasticity allowing Trm112 to interact through a very similar mode with its MTase partners, although those share less than 20% sequence identity. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  5. Structural and Functional Characterization of an Archaeal Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR)-associated Complex for Antiviral Defense (CASCADE)*

    PubMed Central

    Lintner, Nathanael G.; Kerou, Melina; Brumfield, Susan K.; Graham, Shirley; Liu, Huanting; Naismith, James H.; Sdano, Matthew; Peng, Nan; She, Qunxin; Copié, Valérie; Young, Mark J.; White, Malcolm F.; Lawrence, C. Martin

    2011-01-01

    In response to viral infection, many prokaryotes incorporate fragments of virus-derived DNA into loci called clustered regularly interspaced short palindromic repeats (CRISPRs). The loci are then transcribed, and the processed CRISPR transcripts are used to target invading viral DNA and RNA. The Escherichia coli “CRISPR-associated complex for antiviral defense” (CASCADE) is central in targeting invading DNA. Here we report the structural and functional characterization of an archaeal CASCADE (aCASCADE) from Sulfolobus solfataricus. Tagged Csa2 (Cas7) expressed in S. solfataricus co-purifies with Cas5a-, Cas6-, Csa5-, and Cas6-processed CRISPR-RNA (crRNA). Csa2, the dominant protein in aCASCADE, forms a stable complex with Cas5a. Transmission electron microscopy reveals a helical complex of variable length, perhaps due to substoichiometric amounts of other CASCADE components. A recombinant Csa2-Cas5a complex is sufficient to bind crRNA and complementary ssDNA. The structure of Csa2 reveals a crescent-shaped structure unexpectedly composed of a modified RNA-recognition motif and two additional domains present as insertions in the RNA-recognition motif. Conserved residues indicate potential crRNA- and target DNA-binding sites, and the H160A variant shows significantly reduced affinity for crRNA. We propose a general subunit architecture for CASCADE in other bacteria and Archaea. PMID:21507944

  6. The organization and contribution of helicases to RNA splicing.

    PubMed

    De, Inessa; Schmitzová, Jana; Pena, Vladimir

    2016-01-01

    Splicing is an essential step of gene expression. It occurs in two consecutive chemical reactions catalyzed by a large protein-RNA complex named the spliceosome. Assembled on the pre-mRNA substrate from five small nuclear proteins, the spliceosome acts as a protein-controlled ribozyme to catalyze the two reactions and finally dissociates into its components, which are re-used for a new round of splicing. Upon following this cyclic pathway, the spliceosome undergoes numerous intermediate stages that differ in composition as well as in their internal RNA-RNA and RNA-protein contacts. The driving forces and control mechanisms of these remodeling processes are provided by specific molecular motors called RNA helicases. While eight spliceosomal helicases are present in all organisms, higher eukaryotes contain five additional ones potentially required to drive a more intricate splicing pathway and link it to an RNA metabolism of increasing complexity. Spliceosomal helicases exhibit a notable structural diversity in their accessory domains and overall architecture, in accordance with the diversity of their task-specific functions. This review summarizes structure-function knowledge about all spliceosomal helicases, including the latter five, which traditionally are treated separately from the conserved ones. The implications of the structural characteristics of helicases for their functions, as well as for their structural communication within the multi-subunits environment of the spliceosome, are pointed out. © 2016 Wiley Periodicals, Inc.

  7. Structural and functional characterization of an archaeal clustered regularly interspaced short palindromic repeat (CRISPR)-associated complex for antiviral defense (CASCADE).

    PubMed

    Lintner, Nathanael G; Kerou, Melina; Brumfield, Susan K; Graham, Shirley; Liu, Huanting; Naismith, James H; Sdano, Matthew; Peng, Nan; She, Qunxin; Copié, Valérie; Young, Mark J; White, Malcolm F; Lawrence, C Martin

    2011-06-17

    In response to viral infection, many prokaryotes incorporate fragments of virus-derived DNA into loci called clustered regularly interspaced short palindromic repeats (CRISPRs). The loci are then transcribed, and the processed CRISPR transcripts are used to target invading viral DNA and RNA. The Escherichia coli "CRISPR-associated complex for antiviral defense" (CASCADE) is central in targeting invading DNA. Here we report the structural and functional characterization of an archaeal CASCADE (aCASCADE) from Sulfolobus solfataricus. Tagged Csa2 (Cas7) expressed in S. solfataricus co-purifies with Cas5a-, Cas6-, Csa5-, and Cas6-processed CRISPR-RNA (crRNA). Csa2, the dominant protein in aCASCADE, forms a stable complex with Cas5a. Transmission electron microscopy reveals a helical complex of variable length, perhaps due to substoichiometric amounts of other CASCADE components. A recombinant Csa2-Cas5a complex is sufficient to bind crRNA and complementary ssDNA. The structure of Csa2 reveals a crescent-shaped structure unexpectedly composed of a modified RNA-recognition motif and two additional domains present as insertions in the RNA-recognition motif. Conserved residues indicate potential crRNA- and target DNA-binding sites, and the H160A variant shows significantly reduced affinity for crRNA. We propose a general subunit architecture for CASCADE in other bacteria and Archaea.

  8. Synergy between NMR measurements and MD simulations of protein/RNA complexes: application to the RRMs, the most common RNA recognition motifs

    PubMed Central

    Krepl, Miroslav; Cléry, Antoine; Blatter, Markus; Allain, Frederic H.T.; Sponer, Jiri

    2016-01-01

    RNA recognition motif (RRM) proteins represent an abundant class of proteins playing key roles in RNA biology. We present a joint atomistic molecular dynamics (MD) and experimental study of two RRM-containing proteins bound with their single-stranded target RNAs, namely the Fox-1 and SRSF1 complexes. The simulations are used in conjunction with NMR spectroscopy to interpret and expand the available structural data. We accumulate more than 50 μs of simulations and show that the MD method is robust enough to reliably describe the structural dynamics of the RRM–RNA complexes. The simulations predict unanticipated specific participation of Arg142 at the protein–RNA interface of the SRFS1 complex, which is subsequently confirmed by NMR and ITC measurements. Several segments of the protein–RNA interface may involve competition between dynamical local substates rather than firmly formed interactions, which is indirectly consistent with the primary NMR data. We demonstrate that the simulations can be used to interpret the NMR atomistic models and can provide qualified predictions. Finally, we propose a protocol for ‘MD-adapted structure ensemble’ as a way to integrate the simulation predictions and expand upon the deposited NMR structures. Unbiased μs-scale atomistic MD could become a technique routinely complementing the NMR measurements of protein–RNA complexes. PMID:27193998

  9. Macromolecular recognition: Structural aspects of the origin of the genetic system

    NASA Technical Reports Server (NTRS)

    Rein, Robert; Sokalski, W. Andrzej; Barak, Dov; Luo, Ning; Zielinski, Theresa Julia; Shibata, Masayuki

    1991-01-01

    Theoretical simulation of prebiotic chemical processes is an invaluable tool for probing the phenomenon of the evolution of life. Using computational and modeling techniques and guided by analogies from present day systems, we seek to understand the emergence of the genetic apparatus, enzymatic catalysis and protein synthesis under prebiotic conditions. Modeling of the ancestral aminoacyl-tRNA-synthetases (aRS) may provide important clues to the emergence of the genetic code and the protein synthetic machinery. The minimal structural requirements for the catalysis of tRNA aminoacylation are being explored. A formation of an aminoacyl adenylate was studied in the framework of ab initio molecular orbital theory. The role of individual residues in the vicinity of the TyrRS active site was examined, and the effect of all possible amino acids substitutions near the active site was examined. A formation of aminoacyl tRNA was studied by the molecular modeling system SYBYL with the high resolution crystallographic structures of the present day tRNA, aRS's complexes. The ultimate goal is to propose a simple RNA segment that is small enough to be build in the primordial chemical environment but maintains the specificity and catalytic activity of the contemporary RNA enzyme. To understand the mechanism of ribozyme catalyzed reactions, ab initio and semi-empirical (ZINDO) programs were used to investigate the reaction path of transphosphorylation. A special emphasis was placed on the possible catalytic and structural roles played by the coordinated magnesium cation. Both the inline and adjacent mechanisms of transphosphorylation were studied. The structural characteristics of the target helices, particularly a possible role for the G-T pair, is also studied by a molecular dynamics (MD) simulation technique.

  10. Lessons from an evolving rRNA: 16S and 23S rRNA structures from a comparative perspective

    NASA Technical Reports Server (NTRS)

    Gutell, R. R.; Larsen, N.; Woese, C. R.

    1994-01-01

    The 16S and 23S rRNA higher-order structures inferred from comparative analysis are now quite refined. The models presented here differ from their immediate predecessors only in minor detail. Thus, it is safe to assert that all of the standard secondary-structure elements in (prokaryotic) rRNAs have been identified, with approximately 90% of the individual base pairs in each molecule having independent comparative support, and that at least some of the tertiary interactions have been revealed. It is interesting to compare the rRNAs in this respect with tRNA, whose higher-order structure is known in detail from its crystal structure (36) (Table 2). It can be seen that rRNAs have as great a fraction of their sequence in established secondary-structure elements as does tRNA. However, the fact that the former show a much lower fraction of identified tertiary interactions and a greater fraction of unpaired nucleotides than the latter implies that many of the rRNA tertiary interactions remain to be located. (Alternatively, the ribosome might involve protein-rRNA rather than intramolecular rRNA interactions to stabilize three-dimensional structure.) Experimental studies on rRNA are consistent to a first approximation with the structures proposed here, confirming the basic assumption of comparative analysis, i.e., that bases whose compositions strictly covary are physically interacting. In the exhaustive study of Moazed et al. (45) on protection of the bases in the small-subunit rRNA against chemical modification, the vast majority of bases inferred to pair by covariation are found to be protected from chemical modification, both in isolated small-subunit rRNA and in the 30S subunit. The majority of the tertiary interactions are reflected in the chemical protection data as well (45). On the other hand, many of the bases not shown as paired in Fig. 1 are accessible to chemical attack (45). However, in this case a sizeable fraction of them are also protected against chemical modification (in the isolated rRNA), which suggests that considerable higher-order structure remains to be found (although all of it may not involve base-base interactions and so may not be detectable by comparative analysis). The agreement between the higher-order structure of the small-subunit rRNA and protection against chemical modification is not perfect, however; some bases shown to covary canonically are accessible to chemical modification (45).(ABSTRACT TRUNCATED AT 400 WORDS).

  11. Structural and Biochemical Studies on ATP Binding and Hydrolysis by the Escherichia coli RNA Chaperone Hfq

    PubMed Central

    Večerek, Branislav; Rajkowitsch, Lukas; Carugo, Oliviero; Djinović-Carugo, Kristina; Bläsi, Udo

    2012-01-01

    In Escherichia coli the RNA chaperone Hfq is involved in riboregulation by assisting base-pairing between small regulatory RNAs (sRNAs) and mRNA targets. Several structural and biochemical studies revealed RNA binding sites on either surface of the donut shaped Hfq-hexamer. Whereas sRNAs are believed to contact preferentially the YKH motifs present on the proximal site, poly(A)15 and ADP were shown to bind to tripartite binding motifs (ARE) circularly positioned on the distal site. Hfq has been reported to bind and to hydrolyze ATP. Here, we present the crystal structure of a C-terminally truncated variant of E. coli Hfq (Hfq65) in complex with ATP, showing that it binds to the distal R-sites. In addition, we revisited the reported ATPase activity of full length Hfq purified to homogeneity. At variance with previous reports, no ATPase activity was observed for Hfq. In addition, FRET assays neither indicated an impact of ATP on annealing of two model oligoribonucleotides nor did the presence of ATP induce strand displacement. Moreover, ATP did not lead to destabilization of binary and ternary Hfq-RNA complexes, unless a vast stoichiometric excess of ATP was used. Taken together, these studies strongly suggest that ATP is dispensable for and does not interfere with Hfq-mediated RNA transactions. PMID:23226421

  12. ESI-MS Investigation of an Equilibrium between a Bimolecular Quadruplex DNA and a Duplex DNA/RNA Hybrid

    NASA Astrophysics Data System (ADS)

    Birrento, Monica L.; Bryan, Tracy M.; Samosorn, Siritron; Beck, Jennifer L.

    2015-07-01

    Electrospray ionization mass spectrometry (ESI-MS) conditions were optimized for simultaneous observation of a bimolecular qDNA and a Watson-Crick base-paired duplex DNA/RNA hybrid. The DNA sequence used was telomeric DNA, and the RNA contained the template for telomerase-mediated telomeric DNA synthesis. Addition of RNA to the quadruplex DNA (qDNA) resulted in formation of the duplex DNA/RNA hybrid. Melting profiles obtained using circular dichroism spectroscopy confirmed that the DNA/RNA hybrid exhibited greater thermal stability than the bimolecular qDNA in solution. Binding of a 13-substituted berberine ( 1) derivative to the bimolecular qDNA stabilized its structure as evidenced by an increase in its stability in the mass spectrometer, and an increase in its circular dichroism (CD) melting temperature of 10°C. The DNA/RNA hybrid did not bind the ligand extensively and its thermal stability was unchanged in the presence of ( 1). The qDNA-ligand complex resisted unfolding in the presence of excess RNA, limiting the formation of the DNA/RNA hybrid. Previously, it has been proposed that DNA secondary structures, such as qDNA, may be involved in the telomerase mechanism. DNA/RNA hybrid structures occur at the active site of telomerase. The results presented in the current work show that if telomeric DNA was folded into a qDNA structure, it is possible for a DNA/RNA hybrid to form as is required during template alignment. The discrimination of ligand ( 1) for binding to the bimolecular qDNA over the DNA/RNA hybrid positions it as a useful compound for probing the role(s), if any, of antiparallel qDNA in the telomerase mechanism.

  13. microRNAs Databases: Developmental Methodologies, Structural and Functional Annotations.

    PubMed

    Singh, Nagendra Kumar

    2017-09-01

    microRNA (miRNA) is an endogenous and evolutionary conserved non-coding RNA, involved in post-transcriptional process as gene repressor and mRNA cleavage through RNA-induced silencing complex (RISC) formation. In RISC, miRNA binds in complementary base pair with targeted mRNA along with Argonaut proteins complex, causes gene repression or endonucleolytic cleavage of mRNAs and results in many diseases and syndromes. After the discovery of miRNA lin-4 and let-7, subsequently large numbers of miRNAs were discovered by low-throughput and high-throughput experimental techniques along with computational process in various biological and metabolic processes. The miRNAs are important non-coding RNA for understanding the complex biological phenomena of organism because it controls the gene regulation. This paper reviews miRNA databases with structural and functional annotations developed by various researchers. These databases contain structural and functional information of animal, plant and virus miRNAs including miRNAs-associated diseases, stress resistance in plant, miRNAs take part in various biological processes, effect of miRNAs interaction on drugs and environment, effect of variance on miRNAs, miRNAs gene expression analysis, sequence of miRNAs, structure of miRNAs. This review focuses on the developmental methodology of miRNA databases such as computational tools and methods used for extraction of miRNAs annotation from different resources or through experiment. This study also discusses the efficiency of user interface design of every database along with current entry and annotations of miRNA (pathways, gene ontology, disease ontology, etc.). Here, an integrated schematic diagram of construction process for databases is also drawn along with tabular and graphical comparison of various types of entries in different databases. Aim of this paper is to present the importance of miRNAs-related resources at a single place.

  14. Scanning the Human Genome for Novel Therapeutic Targets for Breast Cancer

    DTIC Science & Technology

    2006-04-01

    action of this class of non-coding regulatory RNAs13,14. MicroRNAs are transcribed by RNA polymerase II as long primary polyadenylated transcripts...Artificial miRNAs can be expressed from both RNA polymerase II and III promoters resulting in silencing to varying degrees. At present there...the highest levels of mature microRNA in RISC and generally effective silencing. These structures can be transcribed by either RNA polymerase II or

  15. Semiautomated model building for RNA crystallography using a directed rotameric approach.

    PubMed

    Keating, Kevin S; Pyle, Anna Marie

    2010-05-04

    Structured RNA molecules play essential roles in a variety of cellular processes; however, crystallographic studies of such RNA molecules present a large number of challenges. One notable complication arises from the low resolutions typical of RNA crystallography, which results in electron density maps that are imprecise and difficult to interpret. This problem is exacerbated by the lack of computational tools for RNA modeling, as many of the techniques commonly used in protein crystallography have no equivalents for RNA structure. This leads to difficulty and errors in the model building process, particularly in modeling of the RNA backbone, which is highly error prone due to the large number of variable torsion angles per nucleotide. To address this, we have developed a method for accurately building the RNA backbone into maps of intermediate or low resolution. This method is semiautomated, as it requires a crystallographer to first locate phosphates and bases in the electron density map. After this initial trace of the molecule, however, an accurate backbone structure can be built without further user intervention. To accomplish this, backbone conformers are first predicted using RNA pseudotorsions and the base-phosphate perpendicular distance. Detailed backbone coordinates are then calculated to conform both to the predicted conformer and to the previously located phosphates and bases. This technique is shown to produce accurate backbone structure even when starting from imprecise phosphate and base coordinates. A program implementing this methodology is currently available, and a plugin for the Coot model building program is under development.

  16. Computing the Partition Function for Kinetically Trapped RNA Secondary Structures

    PubMed Central

    Lorenz, William A.; Clote, Peter

    2011-01-01

    An RNA secondary structure is locally optimal if there is no lower energy structure that can be obtained by the addition or removal of a single base pair, where energy is defined according to the widely accepted Turner nearest neighbor model. Locally optimal structures form kinetic traps, since any evolution away from a locally optimal structure must involve energetically unfavorable folding steps. Here, we present a novel, efficient algorithm to compute the partition function over all locally optimal secondary structures of a given RNA sequence. Our software, RNAlocopt runs in time and space. Additionally, RNAlocopt samples a user-specified number of structures from the Boltzmann subensemble of all locally optimal structures. We apply RNAlocopt to show that (1) the number of locally optimal structures is far fewer than the total number of structures – indeed, the number of locally optimal structures approximately equal to the square root of the number of all structures, (2) the structural diversity of this subensemble may be either similar to or quite different from the structural diversity of the entire Boltzmann ensemble, a situation that depends on the type of input RNA, (3) the (modified) maximum expected accuracy structure, computed by taking into account base pairing frequencies of locally optimal structures, is a more accurate prediction of the native structure than other current thermodynamics-based methods. The software RNAlocopt constitutes a technical breakthrough in our study of the folding landscape for RNA secondary structures. For the first time, locally optimal structures (kinetic traps in the Turner energy model) can be rapidly generated for long RNA sequences, previously impossible with methods that involved exhaustive enumeration. Use of locally optimal structure leads to state-of-the-art secondary structure prediction, as benchmarked against methods involving the computation of minimum free energy and of maximum expected accuracy. Web server and source code available at http://bioinformatics.bc.edu/clotelab/RNAlocopt/. PMID:21297972

  17. RNase MRP cleaves pre-tRNASer-Met in the tRNA maturation pathway.

    PubMed

    Saito, Yuichiro; Takeda, Jun; Adachi, Kousuke; Nobe, Yuko; Kobayashi, Junya; Hirota, Kouji; Oliveira, Douglas V; Taoka, Masato; Isobe, Toshiaki

    2014-01-01

    Ribonuclease mitochondrial RNA processing (RNase MRP) is a multifunctional ribonucleoprotein (RNP) complex that is involved in the maturation of various types of RNA including ribosomal RNA. RNase MRP consists of a potential catalytic RNA and several protein components, all of which are required for cell viability. We show here that the temperature-sensitive mutant of rmp1, the gene for a unique protein component of RNase MRP, accumulates the dimeric tRNA precursor, pre-tRNA(Ser-Met). To examine whether RNase MRP mediates tRNA maturation, we purified the RNase MRP holoenzyme from the fission yeast Schizosaccharomyces pombe and found that the enzyme directly and selectively cleaves pre-tRNA(Ser-Met), suggesting that RNase MRP participates in the maturation of specific tRNA in vivo. In addition, mass spectrometry-based ribonucleoproteomic analysis demonstrated that this RNase MRP consists of one RNA molecule and 11 protein components, including a previously unknown component Rpl701. Notably, limited nucleolysis of RNase MRP generated an active catalytic core consisting of partial mrp1 RNA fragments, which constitute "Domain 1" in the secondary structure of RNase MRP, and 8 proteins. Thus, the present study provides new insight into the structure and function of RNase MRP.

  18. Eukaryotic RNA polymerase subunit RPB8 is a new relative of the OB family.

    PubMed

    Krapp, S; Kelly, G; Reischl, J; Weinzierl, R O; Matthews, S

    1998-02-01

    RNA polymerase II subunit RPB8 is an essential subunit that is highly conserved throughout eukaryotic evolution and is present in all three types of nuclear RNA polymerases. We report the first high resolution structural insight into eukaryotic RNA polymerase architecture with the solution structure of RPB8 from Saccharomyces cerevisiae. It consists of an eight stranded, antiparallel beta-barrel, four short helical regions and a large, unstructured omega-loop. The strands are connected in classic Greek-key fashion. The overall topology is unusual and contains a striking C2 rotational symmetry. Furthermore, it is most likely a novel associate of the oligonucleotide/oligosaccharide (OB) binding protein class.

  19. Analysis of RNA binding by the dengue virus NS5 RNA capping enzyme.

    PubMed

    Henderson, Brittney R; Saeedi, Bejan J; Campagnola, Grace; Geiss, Brian J

    2011-01-01

    Flaviviruses are small, capped positive sense RNA viruses that replicate in the cytoplasm of infected cells. Dengue virus and other related flaviviruses have evolved RNA capping enzymes to form the viral RNA cap structure that protects the viral genome and directs efficient viral polyprotein translation. The N-terminal domain of NS5 possesses the methyltransferase and guanylyltransferase activities necessary for forming mature RNA cap structures. The mechanism for flavivirus guanylyltransferase activity is currently unknown, and how the capping enzyme binds its diphosphorylated RNA substrate is important for deciphering how the flavivirus guanylyltransferase functions. In this report we examine how flavivirus NS5 N-terminal capping enzymes bind to the 5' end of the viral RNA using a fluorescence polarization-based RNA binding assay. We observed that the K(D) for RNA binding is approximately 200 nM Dengue, Yellow Fever, and West Nile virus capping enzymes. Removal of one or both of the 5' phosphates reduces binding affinity, indicating that the terminal phosphates contribute significantly to binding. RNA binding affinity is negatively affected by the presence of GTP or ATP and positively affected by S-adensyl methoninine (SAM). Structural superpositioning of the dengue virus capping enzyme with the Vaccinia virus VP39 protein bound to RNA suggests how the flavivirus capping enzyme may bind RNA, and mutagenesis analysis of residues in the putative RNA binding site demonstrate that several basic residues are critical for RNA binding. Several mutants show differential binding to 5' di-, mono-, and un-phosphorylated RNAs. The mode of RNA binding appears similar to that found with other methyltransferase enzymes, and a discussion of diphosphorylated RNA binding is presented.

  20. Solution structure of an ATP-binding RNA aptamer reveals a novel fold.

    PubMed Central

    Dieckmann, T; Suzuki, E; Nakamura, G K; Feigon, J

    1996-01-01

    In vitro selection has been used to isolate several RNA aptamers that bind specifically to biological cofactors. A well-characterized example in the ATP-binding RNA aptamer family, which contains a conserved 11-base loop opposite a bulged G and flanked by regions of double-stranded RNA. The nucleotides in the consensus sequence provide a binding pocket for ATP (or AMP), which binds with a Kd in the micromolar range. Here we present the three-dimensional solution structure of a 36-nucleotide ATP-binding RNA aptamer complexed with AMP, determined from NMR-derived distance and dihedral angle restraints. The conserved loop and bulged G form a novel compact, folded structure around the AMP. The backbone tracing of the loop nucleotides can be described by a Greek zeta (zeta). Consecutive loop nucleotides G, A, A form a U-turn at the bottom of the zeta, and interact with the AMP to form a structure similar to a GNRA tetraloop, with AMP standing in for the final A. Two asymmetric G. G base pairs close the stems flanking the internal loop. Mutated aptamers support the existence of the tertiary interactions within the consensus nucleotides and with the AMP found in the calculated structures. PMID:8756406

  1. The Crystal Structure of the Drosophila Germline Inducer Oskar Identifies Two Domains with Distinct Vasa Helicase- and RNA-Binding Activities.

    PubMed

    Jeske, Mandy; Bordi, Matteo; Glatt, Sebastian; Müller, Sandra; Rybin, Vladimir; Müller, Christoph W; Ephrussi, Anne

    2015-07-28

    In many animals, the germ plasm segregates germline from soma during early development. Oskar protein is known for its ability to induce germ plasm formation and germ cells in Drosophila. However, the molecular basis of germ plasm formation remains unclear. Here, we show that Oskar is an RNA-binding protein in vivo, crosslinking to nanos, polar granule component, and germ cell-less mRNAs, each of which has a role in germline formation. Furthermore, we present high-resolution crystal structures of the two Oskar domains. RNA-binding maps in vitro to the C-terminal domain, which shows structural similarity to SGNH hydrolases. The highly conserved N-terminal LOTUS domain forms dimers and mediates Oskar interaction with the germline-specific RNA helicase Vasa in vitro. Our findings suggest a dual function of Oskar in RNA and Vasa binding, providing molecular clues to its germ plasm function. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.

  2. Poly(A) RNA a new component of Cajal bodies.

    PubMed

    Kołowerzo, Agnieszka; Smoliński, Dariusz Jan; Bednarska, Elzbieta

    2009-07-01

    In European larch microsporocytes, spherical structures 0.5 to 6 microm in diameter are present in which poly(A) RNA accumulates. There were one to several bodies per cell and they were often present in the vicinity of the nucleolus. No nascent transcripts were observed within them. Splicing factors of the SR family, including protein SC35, which participates in bringing the 3' and 5' sites closer in the splicing reaction, were also not observed. The absence of the above-mentioned elements within bodies containing poly(A) RNA disqualifies them as sites of synthesis and preliminary stages of primary transcript maturation. However, they contained abundant elements of the splicing machinery commonly occurring in Cajal bodies, i.e., Sm proteins or small nuclear RNA (snRNA). The molecular composition as well as the characteristic ultrastructure of bodies containing poly(A) RNA proves that these were Cajal bodies. This is the first report of such poly(A) RNA localization.

  3. Nuclear matrix and hnRNP share a common structural constituent associated with premessenger RNA.

    PubMed Central

    Gallinaro, H; Puvion, E; Kister, L; Jacob, M

    1983-01-01

    Nuclear matrix and heterogeneous nuclear ribonucleoprotein (hnRNP) were compared to establish whether premessenger RNA (premRNA) was associated with a same constituent in both structures. The isolation of nuclear matrix included the removal of chromatin and of 0.4 M KCl-soluble material. HnRNP, isolated by a standard method was also treated by 0.4 M KCl. Both isolation procedures caused the removal of DNA, histones, a fraction of small nuclear RNA and of nonhistone proteins including the hnRNP proteins in the 30 000-40 000 mol. wt. range. High resolution autoradiography showed that hnRNA remained associated with the residual fibrils in both structures. They both contained the same premRNA and maturation products as shown by the analysis of the transcripts of the early region 3 of adenovirus 2. In addition, the small nuclear RNA and protein of the salt-resistant complexes were also present in the matrix. The results are compatible with the idea that the salt-resistant complexes from hnRNP constitute the fibrils associated with premRNA in the nucleoplasmic matrix. The fibrils may be the basic unit of splicing and their organization in matrix might provide the spatial configuration necessary for regulation. Images Fig. 2. Fig. 3. Fig. 4. Fig. 5. Fig. 7. PMID:6557026

  4. PRince: a web server for structural and physicochemical analysis of protein-RNA interface.

    PubMed

    Barik, Amita; Mishra, Abhishek; Bahadur, Ranjit Prasad

    2012-07-01

    We have developed a web server, PRince, which analyzes the structural features and physicochemical properties of the protein-RNA interface. Users need to submit a PDB file containing the atomic coordinates of both the protein and the RNA molecules in complex form (in '.pdb' format). They should also mention the chain identifiers of interacting protein and RNA molecules. The size of the protein-RNA interface is estimated by measuring the solvent accessible surface area buried in contact. For a given protein-RNA complex, PRince calculates structural, physicochemical and hydration properties of the interacting surfaces. All these parameters generated by the server are presented in a tabular format. The interacting surfaces can also be visualized with software plug-in like Jmol. In addition, the output files containing the list of the atomic coordinates of the interacting protein, RNA and interface water molecules can be downloaded. The parameters generated by PRince are novel, and users can correlate them with the experimentally determined biophysical and biochemical parameters for better understanding the specificity of the protein-RNA recognition process. This server will be continuously upgraded to include more parameters. PRince is publicly accessible and free for use. Available at http://www.facweb.iitkgp.ernet.in/~rbahadur/prince/home.html.

  5. The nucleotide sequence of a major glycine transfer RNA from the posterior silk gland of Bombyx mori L.

    PubMed Central

    Zúñiga, M C; Steitz, J A

    1977-01-01

    The nucleotide sequence of tRNA1Gly isolated from the posterior silk gland of Bombyx mori has been determined. This transfer RNA is present in high amounts in the posterior silk gland during the fifth larval instar. It has a GCC anticodon, capable of decoding a major glycine codon in the fibroin messenger RNA, GGU. Structural features of Bombyx tRNA1Gly and its homology to other eukaryotic glycine tRNAs are discussed. Images PMID:414206

  6. tRNA modification profiles of the fast-proliferating cancer cells

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Dong, Chao; Niu, Leilei; Song, Wei

    Despite the recent progress in RNA modification study, a comprehensive modification profile is still lacking for mammalian cells. Using a quantitative HPLC/MS/MS assay, we present here a study where RNA modifications are examined in term of the major RNA species. With paired slow- and fast-proliferating cell lines, distinct RNA modification profiles are first revealed for diverse RNA species. Compared to mRNAs, increased ribose and nucleobase modifications are shown for the highly-structured tRNAs and rRNAs, lending support to their contribution to the formation of high-order structures. This study also reveals a dynamic tRNA modification profile in the fast-proliferating cells. In additionmore » to cultured cells, this unique tRNA profile has been further confirmed with endometrial cancers and their adjacent normal tissues. Taken together, the results indicate that tRNA is a actively regulated RNA species in the fast-proliferating cancer cells, and suggest that they may play a more active role in biological process than expected. -- Highlights: •RNA modifications were first examined in term of the major RNA species. •A dynamic tRNA modifications was characterized for the fast-proliferating cells. •The unique tRNA profile was confirmed with endometrial cancers and their adjacent normal tissues. •tRNA was predicted as an actively regulated RNA species in the fast-proliferating cancer cells.« less

  7. Crystal structure of full-length Zika virus NS5 protein reveals a conformation similar to Japanese encephalitis virus NS5

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Upadhyay, Anup K.; Cyr, Matthew; Longenecker, Kenton

    The rapid spread of the recentZika virus(ZIKV) epidemic across various countries in the American continent poses a major health hazard for the unborn fetuses of pregnant women. To date, there is no effective medical intervention. The nonstructural protein 5 ofZika virus(ZIKV-NS5) is critical for ZIKV replication through the 5'-RNA capping and RNA polymerase activities present in its N-terminal methyltransferase (MTase) and C-terminal RNA-dependent RNA polymerase (RdRp) domains, respectively. The crystal structure of the full-length ZIKV-NS5 protein has been determined at 3.05 Å resolution from a crystal belonging to space groupP2 12 12 and containing two protein molecules in the asymmetricmore » unit. The structure is similar to that reported for the NS5 protein fromJapanese encephalitis virusand suggests opportunities for structure-based drug design targeting either its MTase or RdRp domain.« less

  8. Improved systematic tRNA gene annotation allows new insights into the evolution of mitochondrial tRNA structures and into the mechanisms of mitochondrial genome rearrangements

    PubMed Central

    Jühling, Frank; Pütz, Joern; Bernt, Matthias; Donath, Alexander; Middendorf, Martin; Florentz, Catherine; Stadler, Peter F.

    2012-01-01

    Transfer RNAs (tRNAs) are present in all types of cells as well as in organelles. tRNAs of animal mitochondria show a low level of primary sequence conservation and exhibit ‘bizarre’ secondary structures, lacking complete domains of the common cloverleaf. Such sequences are hard to detect and hence frequently missed in computational analyses and mitochondrial genome annotation. Here, we introduce an automatic annotation procedure for mitochondrial tRNA genes in Metazoa based on sequence and structural information in manually curated covariance models. The method, applied to re-annotate 1876 available metazoan mitochondrial RefSeq genomes, allows to distinguish between remaining functional genes and degrading ‘pseudogenes’, even at early stages of divergence. The subsequent analysis of a comprehensive set of mitochondrial tRNA genes gives new insights into the evolution of structures of mitochondrial tRNA sequences as well as into the mechanisms of genome rearrangements. We find frequent losses of tRNA genes concentrated in basal Metazoa, frequent independent losses of individual parts of tRNA genes, particularly in Arthropoda, and wide-spread conserved overlaps of tRNAs in opposite reading direction. Direct evidence for several recent Tandem Duplication-Random Loss events is gained, demonstrating that this mechanism has an impact on the appearance of new mitochondrial gene orders. PMID:22139921

  9. The rearrangement of motif F in the flavivirus RNA-directed RNA polymerase.

    PubMed

    Potapova, Ulyana; Feranchuk, Sergey; Leonova, Galina; Belikov, Sergei

    2018-03-01

    In the flavivirus genus, the non-structural protein NS5 plays a central role in RNA viral replication and constitutes a major target for drug discovery. One of the prime challenges in the study of NS5 protein is to investigate the interplay between the two protein domains, namely, the RNA-dependent RNA polymerase (RdRp) domain and the methyltransferase (MTase) domain. These investigations could clarify the multiple roles of NS5 protein in the virus life cycle. Here we present the results of sequence analyses and structural bioinformatics studies of NS5 protein, which suggest that the conserved motif F in the NS5 protein could act as a lock which controls the rearrangement of the domains and as a switch in the protein enzymatic activity. Copyright © 2017 Elsevier B.V. All rights reserved.

  10. Common 5S rRNA variants are likely to be accepted in many sequence contexts

    NASA Technical Reports Server (NTRS)

    Zhang, Zhengdong; D'Souza, Lisa M.; Lee, Youn-Hyung; Fox, George E.

    2003-01-01

    Over evolutionary time RNA sequences which are successfully fixed in a population are selected from among those that satisfy the structural and chemical requirements imposed by the function of the RNA. These sequences together comprise the structure space of the RNA. In principle, a comprehensive understanding of RNA structure and function would make it possible to enumerate which specific RNA sequences belong to a particular structure space and which do not. We are using bacterial 5S rRNA as a model system to attempt to identify principles that can be used to predict which sequences do or do not belong to the 5S rRNA structure space. One promising idea is the very intuitive notion that frequently seen sequence changes in an aligned data set of naturally occurring 5S rRNAs would be widely accepted in many other 5S rRNA sequence contexts. To test this hypothesis, we first developed well-defined operational definitions for a Vibrio region of the 5S rRNA structure space and what is meant by a highly variable position. Fourteen sequence variants (10 point changes and 4 base-pair changes) were identified in this way, which, by the hypothesis, would be expected to incorporate successfully in any of the known sequences in the Vibrio region. All 14 of these changes were constructed and separately introduced into the Vibrio proteolyticus 5S rRNA sequence where they are not normally found. Each variant was evaluated for its ability to function as a valid 5S rRNA in an E. coli cellular context. It was found that 93% (13/14) of the variants tested are likely valid 5S rRNAs in this context. In addition, seven variants were constructed that, although present in the Vibrio region, did not meet the stringent criteria for a highly variable position. In this case, 86% (6/7) are likely valid. As a control we also examined seven variants that are seldom or never seen in the Vibrio region of 5S rRNA sequence space. In this case only two of seven were found to be potentially valid. The results demonstrate that changes that occur multiple times in a local region of RNA sequence space in fact usually will be accepted in any sequence context in that same local region.

  11. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jackson, R.N.; Robinson, H.; Klauer, A. A.

    The essential RNA helicase, Mtr4, performs a critical role in RNA processing and degradation as an activator of the nuclear exosome. The molecular basis for this vital function is not understood and detailed analysis is significantly limited by the lack of structural data. In this study, we present the crystal structure of Mtr4. The structure reveals a new arch-like domain that is specific to Mtr4 and Ski2 (the cytosolic homologue of Mtr4). In vivo and in vitro analyses demonstrate that the Mtr4 arch domain is required for proper 5.8S rRNA processing, and suggest that the arch functions independently of canonicalmore » helicase activity. In addition, extensive conservation along the face of the putative RNA exit site highlights a potential interface with the exosome. These studies provide a molecular framework for understanding fundamental aspects of helicase function in exosome activation, and more broadly define the molecular architecture of Ski2-like helicases.« less

  12. RNA-SSPT: RNA Secondary Structure Prediction Tools.

    PubMed

    Ahmad, Freed; Mahboob, Shahid; Gulzar, Tahsin; Din, Salah U; Hanif, Tanzeela; Ahmad, Hifza; Afzal, Muhammad

    2013-01-01

    The prediction of RNA structure is useful for understanding evolution for both in silico and in vitro studies. Physical methods like NMR studies to predict RNA secondary structure are expensive and difficult. Computational RNA secondary structure prediction is easier. Comparative sequence analysis provides the best solution. But secondary structure prediction of a single RNA sequence is challenging. RNA-SSPT is a tool that computationally predicts secondary structure of a single RNA sequence. Most of the RNA secondary structure prediction tools do not allow pseudoknots in the structure or are unable to locate them. Nussinov dynamic programming algorithm has been implemented in RNA-SSPT. The current studies shows only energetically most favorable secondary structure is required and the algorithm modification is also available that produces base pairs to lower the total free energy of the secondary structure. For visualization of RNA secondary structure, NAVIEW in C language is used and modified in C# for tool requirement. RNA-SSPT is built in C# using Dot Net 2.0 in Microsoft Visual Studio 2005 Professional edition. The accuracy of RNA-SSPT is tested in terms of Sensitivity and Positive Predicted Value. It is a tool which serves both secondary structure prediction and secondary structure visualization purposes.

  13. RNA-SSPT: RNA Secondary Structure Prediction Tools

    PubMed Central

    Ahmad, Freed; Mahboob, Shahid; Gulzar, Tahsin; din, Salah U; Hanif, Tanzeela; Ahmad, Hifza; Afzal, Muhammad

    2013-01-01

    The prediction of RNA structure is useful for understanding evolution for both in silico and in vitro studies. Physical methods like NMR studies to predict RNA secondary structure are expensive and difficult. Computational RNA secondary structure prediction is easier. Comparative sequence analysis provides the best solution. But secondary structure prediction of a single RNA sequence is challenging. RNA-SSPT is a tool that computationally predicts secondary structure of a single RNA sequence. Most of the RNA secondary structure prediction tools do not allow pseudoknots in the structure or are unable to locate them. Nussinov dynamic programming algorithm has been implemented in RNA-SSPT. The current studies shows only energetically most favorable secondary structure is required and the algorithm modification is also available that produces base pairs to lower the total free energy of the secondary structure. For visualization of RNA secondary structure, NAVIEW in C language is used and modified in C# for tool requirement. RNA-SSPT is built in C# using Dot Net 2.0 in Microsoft Visual Studio 2005 Professional edition. The accuracy of RNA-SSPT is tested in terms of Sensitivity and Positive Predicted Value. It is a tool which serves both secondary structure prediction and secondary structure visualization purposes. PMID:24250115

  14. Fragmentation of the large subunit ribosomal RNA gene in oyster mitochondrial genomes.

    PubMed

    Milbury, Coren A; Lee, Jung C; Cannone, Jamie J; Gaffney, Patrick M; Gutell, Robin R

    2010-09-02

    Discontinuous genes have been observed in bacteria, archaea, and eukaryotic nuclei, mitochondria and chloroplasts. Gene discontinuity occurs in multiple forms: the two most frequent forms result from introns that are spliced out of the RNA and the resulting exons are spliced together to form a single transcript, and fragmented gene transcripts that are not covalently attached post-transcriptionally. Within the past few years, fragmented ribosomal RNA (rRNA) genes have been discovered in bilateral metazoan mitochondria, all within a group of related oysters. In this study, we have characterized this fragmentation with comparative analysis and experimentation. We present secondary structures, modeled using comparative sequence analysis of the discontinuous mitochondrial large subunit rRNA genes of the cupped oysters C. virginica, C. gigas, and C. hongkongensis. Comparative structure models for the large subunit rRNA in each of the three oyster species are generally similar to those for other bilateral metazoans. We also used RT-PCR and analyzed ESTs to determine if the two fragmented LSU rRNAs are spliced together. The two segments are transcribed separately, and not spliced together although they still form functional rRNAs and ribosomes. Although many examples of discontinuous ribosomal genes have been documented in bacteria and archaea, as well as the nuclei, chloroplasts, and mitochondria of eukaryotes, oysters are some of the first characterized examples of fragmented bilateral animal mitochondrial rRNA genes. The secondary structures of the oyster LSU rRNA fragments have been predicted on the basis of previous comparative metazoan mitochondrial LSU rRNA structure models.

  15. Single-molecule FRET-Rosetta reveals RNA structural rearrangements during human telomerase catalysis

    PubMed Central

    Parks, Joseph W.; Kappel, Kalli; Das, Rhiju; Stone, Michael D.

    2017-01-01

    Maintenance of telomeres by telomerase permits continuous proliferation of rapidly dividing cells, including the majority of human cancers. Despite its direct biomedical significance, the architecture of the human telomerase complex remains unknown. Generating homogeneous telomerase samples has presented a significant barrier to developing improved structural models. Here we pair single-molecule Förster resonance energy transfer (smFRET) measurements with Rosetta modeling to map the conformations of the essential telomerase RNA core domain within the active ribonucleoprotein. FRET-guided modeling places the essential pseudoknot fold distal to the active site on a protein surface comprising the C-terminal element, a domain that shares structural homology with canonical polymerase thumb domains. An independently solved medium-resolution structure of Tetrahymena telomerase provides a blind test of our modeling methodology and sheds light on the structural homology of this domain across diverse organisms. Our smFRET-Rosetta models reveal nanometer-scale rearrangements within the RNA core domain during catalysis. Taken together, our FRET data and pseudoatomic molecular models permit us to propose a possible mechanism for how RNA core domain rearrangement is coupled to template hybrid elongation. PMID:28096444

  16. The system with zwitterionic lactose-based surfactant for complexation and delivery of small interfering ribonucleic acid—A structural and spectroscopic study

    NASA Astrophysics Data System (ADS)

    Skupin, Michalina; Sobczak, Krzysztof; Zieliński, Ryszard; Kozak, Maciej

    2016-05-01

    Systems suitable for the effective preparation of complexes with siRNA (small interfering RNA) are at the center of interest in the area of research work on the delivery of the RNA-based drugs (RNA-therapeutics). This article presents results of a study on the structural effects associated with siRNA complexation by a surfactant comprising a lactose group (N-(3-propanesulfone)-N-dodecyl-amino-beta-D-lactose hydrochloride, LA12). The double stranded siRNA oligomer (21 base pairs) used in this study is responsible for silencing a gene that can be important in the therapy of myotonic dystrophy type 1. The obtained siRNA/LA12 lipoplexes were studied using the methods of small angle scattering of synchrotron radiation, circular dichroism spectroscopy, Fourier transform infrared spectroscopy, and electrophoretic mobility tests. Lipoplexes form in solution stable lamellar or cubic phases. The surfactant selected for the study shows much lower cytotoxicity and good complexation abilities of siRNA than dicationic or polycationic surfactants.

  17. The system with zwitterionic lactose-based surfactant for complexation and delivery of small interfering ribonucleic acid—A structural and spectroscopic study

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Skupin, Michalina; Sobczak, Krzysztof; Zieliński, Ryszard

    Systems suitable for the effective preparation of complexes with siRNA (small interfering RNA) are at the center of interest in the area of research work on the delivery of the RNA-based drugs (RNA-therapeutics). This article presents results of a study on the structural effects associated with siRNA complexation by a surfactant comprising a lactose group (N-(3-propanesulfone)-N-dodecyl-amino-beta-D-lactose hydrochloride, LA12). The double stranded siRNA oligomer (21 base pairs) used in this study is responsible for silencing a gene that can be important in the therapy of myotonic dystrophy type 1. The obtained siRNA/LA12 lipoplexes were studied using the methods of small anglemore » scattering of synchrotron radiation, circular dichroism spectroscopy, Fourier transform infrared spectroscopy, and electrophoretic mobility tests. Lipoplexes form in solution stable lamellar or cubic phases. The surfactant selected for the study shows much lower cytotoxicity and good complexation abilities of siRNA than dicationic or polycationic surfactants.« less

  18. Protein interactions and complexes in human microRNA biogenesis and function

    PubMed Central

    Perron, Marjorie P.; Provost, Patrick

    2010-01-01

    Encoded in the genome of most eukaryotes, microRNAs (miRNAs) have been proposed to regulate specifically up to 90% of human genes through a process known as miRNA-guided RNA silencing. The aim of this review is to present this process as the integration of a succession of specialized molecular machines exerting well defined functions. The nuclear microprocessor complex initially recognizes and processes its primary miRNA substrate into a miRNA precursor (pre-miRNA). This structure is then exported to the cytoplasm by the Exportin-5 complex where it is presented to the pre-miRNA processing complex. Following pre-miRNA conversion into a miRNA:miRNA* duplex, this complex is assembled into a miRNA-containing ribonucleoprotein (miRNP) complex, after which the miRNA strand is selected. The degree of complementarity of the miRNA for its messenger RNA (mRNA) target guides the recruitment of the miRNP complex. Initially repressing its translation, the miRNP-silenced mRNA is directed to the P-bodies, where the mRNA is either released from its inhibition upon a cellular signal and/or actively degraded. The potency and specificity of miRNA biogenesis and function rely on the distinct protein·protein, protein·RNA and RNA:RNA interactions found in different complexes, each of which fulfill a specific function in a well orchestrated process. PMID:17981733

  19. A conformational switch is responsible for the reversal of the 6S RNA-dependent RNA polymerase inhibition in Escherichia coli.

    PubMed

    Steuten, Benedikt; Wagner, Rolf

    2012-12-01

    6S RNA is a bacterial transcriptional regulator,which accumulates during stationary phase and inhibits transcription from many promoters due to stable association with σ 70 -containing RNA polymerase. This inhibitory RNA polymerase ∼ 6S RNA complex dissociates during nutritional upshift, when cells undergo outgrowth from stationary phase, releasing active RNA polymerase ready for transcription. The release reaction depends on a characteristic property of 6S RNAs, namely to act as template for the de novo synthesis of small RNAs, termed pRNAs.Here, we used limited hydrolysis with structure-specific RNases and in-line probing of isolated 6S RNA and 6SRNA ∼ pRNA complexes to investigate the molecular details leading to the release reaction. Our results indicate that pRNA transcription induces the refolding of the 6S RNA secondary structure by disrupting part of the closing stem(conserved sequence regions CRI and CRIV) and formation of a new hairpin (conserved sequence regions CRIII and CRIV). Comparison of the dimethylsulfate modification pattern of 6S RNA in living cells at stationary growth and during outgrowth confirmed the conformational change observed in vitro. Based on our results, a model describing the individual steps of the release reaction is presented.

  20. Optimized guide RNA structure for genome editing via Cas9

    PubMed Central

    Xu, Jianyong; Lian, Wei; Jia, Yuning; Li, Lingyun; Huang, Zhong

    2017-01-01

    The genome editing tool Cas9-gRNA (guide RNA) has been successfully applied in different cell types and organisms with high efficiency. However, more efforts need to be made to enhance both efficiency and specificity. In the current study, we optimized the guide RNA structure of Streptococcus pyogenes CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats)/Cas (CRISPR-associated) system to improve its genome editing efficiency. Comparing with the original functional structure of guide RNA, which is composed of crRNA and tracrRNA, the widely used chimeric gRNA has shorter crRNA and tracrRNA sequence. The deleted RNA sequence could form extra loop structure, which might enhance the stability of the guide RNA structure and subsequently the genome editing efficiency. Thus the genome editing efficiency of different forms of guide RNA was tested. And we found that the chimeric structure of gRNA with original full length of crRNA and tracrRNA showed higher genome editing efficiency than the conventional chimeric structure or other types of gRNA we tested. Therefore our data here uncovered the new type of gRNA structure with higher genome editing efficiency. PMID:29212218

  1. Macromolecular recognition: Structural aspects of the origin of the genetic system

    NASA Technical Reports Server (NTRS)

    Rein, Robert; Barak, Dov; Luo, Ning; Zielinski, Theresa Julia; Shibata, Masayuki

    1991-01-01

    Theoretical simulation of prebiotic chemical processes is an invaluable tool for probing the phenomenon of evolution of life. Using computational and modeling techniques and guided by analogies from present day systems we, seek to understand the emergence of genetic apparatus, enzymatic catalysis and protein synthesis under prebiotic conditions. In one possible scenario, the RNA enzymatic reaction plays a key role in the emergence of the self-replicating and offers a clue to the onset of enzymatic catalysis prior to the existence of the protein biosynthetic machinery. Our ultimate goal is to propose a simple RNA segment which contains the specificity and catalytic activity of the contemporary RNA enzyme and which could emerge in a primordial chemical environment. To understand the mechanism of ribozyme catalyzed reactions, ab initio and semi-empirical (ZINDO) programs were used to investigate the reaction path of transphosphorylation. A special emphasis was placed on the possible catalytic and structural roles played by the coordinated magnesium cation. Both the inline and adjacent mechanisms of transphosphorylation have been studied. Another important aspect of this reaction is the identity of the functional groups which are essential for the acid base catalysis. The structural characteristics of the target helices, particularly a possible role of G center dot T pair, is under examination by molecular dynamics (MD) simulation technique. Modeling of the ancestral aminoacyl-tRNA synthetases (aRS) may provide important clues to the emergence of the genetic code and the protein synthetic machinery. Assuming that the catalytic function evolved before the elements of specific recognition of a particular amino acid, we are exploring the minimal structural requirements for the catalysis of tRNA aminoacylation. The molecular modeling system SYBYL was used for this study based on the high resolution crystallographic structures of the present day tyrosyl-adenylate:tyrRS and tRNA(Gln): ATP:glnRS complexes. The trinucleotide CCA of the 3'-end tRNA is placed into the active site pocket of tyrRS, based on the scheme of interaction between tRNA(Gln) and glnRS, and upon the stereochemistry of the tyrRS:tRNA:Tyr-AMP transition state. This provides a model of the non-specific recognition of a tRNA's 3'-end by an aRS, which might be similar to that of the ancestral aRS's. In the next step, modeling of the rest of the acceptor stem of tRNA (Tyr) with tyrRS is carried out.

  2. RNA Relics and Origin of Life

    PubMed Central

    Demongeot, Jacques; Glade, Nicolas; Moreira, Andrés; Vial, Laurent

    2009-01-01

    A number of small RNA sequences, located in different non-coding sequences and highly preserved across the tree of life, have been suggested to be molecular fossils, of ancient (and possibly primordial) origin. On the other hand, recent years have revealed the existence of ubiquitous roles for small RNA sequences in modern organisms, in functions ranging from cell regulation to antiviral activity. We propose that a single thread can be followed from the beginning of life in RNA structures selected only for stability reasons through the RNA relics and up to the current coevolution of RNA sequences; such an understanding would shed light both on the history and on the present development of the RNA machinery and interactions. After presenting the evidence (by comparing their sequences) that points toward a common thread, we discuss a scenario of genome coevolution (with emphasis on viral infectious processes) and finally propose a plan for the reevaluation of the stereochemical theory of the genetic code; we claim that it may still be relevant, and not only for understanding the origin of life, but also for a comprehensive picture of regulation in present-day cells. PMID:20111682

  3. Comparative Structural and Functional Analysis of Bunyavirus and Arenavirus Cap-Snatching Endonucleases

    PubMed Central

    Reguera, Juan; Gerlach, Piotr; Rosenthal, Maria; Gaudon, Stephanie; Coscia, Francesca; Günther, Stephan; Cusack, Stephen

    2016-01-01

    Segmented negative strand RNA viruses of the arena-, bunya- and orthomyxovirus families uniquely carry out viral mRNA transcription by the cap-snatching mechanism. This involves cleavage of host mRNAs close to their capped 5′ end by an endonuclease (EN) domain located in the N-terminal region of the viral polymerase. We present the structure of the cap-snatching EN of Hantaan virus, a bunyavirus belonging to hantavirus genus. Hantaan EN has an active site configuration, including a metal co-ordinating histidine, and nuclease activity similar to the previously reported La Crosse virus and Influenza virus ENs (orthobunyavirus and orthomyxovirus respectively), but is more active in cleaving a double stranded RNA substrate. In contrast, Lassa arenavirus EN has only acidic metal co-ordinating residues. We present three high resolution structures of Lassa virus EN with different bound ion configurations and show in comparative biophysical and biochemical experiments with Hantaan, La Crosse and influenza ENs that the isolated Lassa EN is essentially inactive. The results are discussed in the light of EN activation mechanisms revealed by recent structures of full-length influenza virus polymerase. PMID:27304209

  4. RNAiFOLD: a constraint programming algorithm for RNA inverse folding and molecular design.

    PubMed

    Garcia-Martin, Juan Antonio; Clote, Peter; Dotu, Ivan

    2013-04-01

    Synthetic biology is a rapidly emerging discipline with long-term ramifications that range from single-molecule detection within cells to the creation of synthetic genomes and novel life forms. Truly phenomenal results have been obtained by pioneering groups--for instance, the combinatorial synthesis of genetic networks, genome synthesis using BioBricks, and hybridization chain reaction (HCR), in which stable DNA monomers assemble only upon exposure to a target DNA fragment, biomolecular self-assembly pathways, etc. Such work strongly suggests that nanotechnology and synthetic biology together seem poised to constitute the most transformative development of the 21st century. In this paper, we present a Constraint Programming (CP) approach to solve the RNA inverse folding problem. Given a target RNA secondary structure, we determine an RNA sequence which folds into the target structure; i.e. whose minimum free energy structure is the target structure. Our approach represents a step forward in RNA design--we produce the first complete RNA inverse folding approach which allows for the specification of a wide range of design constraints. We also introduce a Large Neighborhood Search approach which allows us to tackle larger instances at the cost of losing completeness, while retaining the advantages of meeting design constraints (motif, GC-content, etc.). Results demonstrate that our software, RNAiFold, performs as well or better than all state-of-the-art approaches; nevertheless, our approach is unique in terms of completeness, flexibility, and the support of various design constraints. The algorithms presented in this paper are publicly available via the interactive webserver http://bioinformatics.bc.edu/clotelab/RNAiFold; additionally, the source code can be downloaded from that site.

  5. NMR studies of two spliced leader RNAs using isotope labeling

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lapham, J.; Crothers, D.M.

    1994-12-01

    Spliced leader RNAs are a class of RNA molecules (<200 nts) involved in the trans splicing of messenger RNA found in trypanosomes, nematodes, and other lower eukaryotes. The spliced leader RNA from the trypanosome Leptomonas Collosoma exists in two alternate structural forms with similar thermal stabilities. The 54 nucleotides on the 5{prime} end of the SL molecule is structurally independent from the 3{prime} half of the RNA, and displays the two structural forms. Furthermore, the favored of the two structures was shown to contain anomalous nuclease sensitivity and thermal stability features, which suggests that there may be tertiary interactions betweenmore » the splice site and other nucleotides in the 5{prime} end. Multidimensional NMR studies are underway to elucidate the structural elements present in the SL RNAs that give rise to their physical properties. Two spliced leader sequences have been studied. The first, the 54 nucleotides on the 5{prime} end of the L. Collosoma sequence, was selected because of earlier studies in our laboratory. The second sequence is the 5{prime} end of the trypanosome Crithidia Fasciculata, which was chosen because of its greater sequence homology to other SL sequences. Given the complexity of the NMR spectra for RNA molecules of this size, we have incorporated {sup 15}N/{sup 13}C-labeled nucleotides into the RNA. One of the techniques we have developed to simplify the spectra of these RNA molecules is isotope labeling of specific regions of the RNA. This has been especially helpful in assigning the secondary structure of molecules that may be able to adopt multiple conformations. Using this technique one can examine a part of the molecule without spectral interference from the unlabeled portion. We hope this approach will promote an avenue for studying the structure of larger RNAs in their native surroundings.« less

  6. Assessing the 5S ribosomal RNA heterogeneity in Arabidopsis thaliana using short RNA next generation sequencing data.

    PubMed

    Szymanski, Maciej; Karlowski, Wojciech M

    2016-01-01

    In eukaryotes, ribosomal 5S rRNAs are products of multigene families organized within clusters of tandemly repeated units. Accumulation of genomic data obtained from a variety of organisms demonstrated that the potential 5S rRNA coding sequences show a large number of variants, often incompatible with folding into a correct secondary structure. Here, we present results of an analysis of a large set of short RNA sequences generated by the next generation sequencing techniques, to address the problem of heterogeneity of the 5S rRNA transcripts in Arabidopsis and identification of potentially functional rRNA-derived fragments.

  7. Structural basis of Brr2-Prp8 interactions and implications for U5 snRNP biogenesis and the spliceosome active site.

    PubMed

    Nguyen, Thi Hoang Duong; Li, Jade; Galej, Wojciech P; Oshikane, Hiroyuki; Newman, Andrew J; Nagai, Kiyoshi

    2013-06-04

    The U5 small nuclear ribonucleoprotein particle (snRNP) helicase Brr2 disrupts the U4/U6 small nuclear RNA (snRNA) duplex and allows U6 snRNA to engage in an intricate RNA network at the active center of the spliceosome. Here, we present the structure of yeast Brr2 in complex with the Jab1/MPN domain of Prp8, which stimulates Brr2 activity. Contrary to previous reports, our crystal structure and mutagenesis data show that the Jab1/MPN domain binds exclusively to the N-terminal helicase cassette. The residues in the Jab1/MPN domain, whose mutations in human Prp8 cause the degenerative eye disease retinitis pigmentosa, are found at or near the interface with Brr2, clarifying its molecular pathology. In the cytoplasm, Prp8 forms a precursor complex with U5 snRNA, seven Smproteins, Snu114, and Aar2, but after nuclear import, Brr2 replaces Aar2 to form mature U5 snRNP. Our structure explains why Aar2 and Brr2 are mutually exclusive and provides important insights into the assembly of U5 snRNP.

  8. RNA Helicase DDX1 Converts RNA G-Quadruplex Structures into R-Loops to Promote IgH Class Switch Recombination.

    PubMed

    Ribeiro de Almeida, Claudia; Dhir, Somdutta; Dhir, Ashish; Moghaddam, Amin E; Sattentau, Quentin; Meinhart, Anton; Proudfoot, Nicholas J

    2018-05-17

    Class switch recombination (CSR) at the immunoglobulin heavy-chain (IgH) locus is associated with the formation of R-loop structures over switch (S) regions. While these often occur co-transcriptionally between nascent RNA and template DNA, we now show that they also form as part of a post-transcriptional mechanism targeting AID to IgH S-regions. This depends on the RNA helicase DDX1 that is also required for CSR in vivo. DDX1 binds to G-quadruplex (G4) structures present in intronic switch transcripts and converts them into S-region R-loops. This in turn targets the cytidine deaminase enzyme AID to S-regions so promoting CSR. Notably R-loop levels over S-regions are diminished by chemical stabilization of G4 RNA or by the expression of a DDX1 ATPase-deficient mutant that acts as a dominant-negative protein to reduce CSR efficiency. In effect, we provide evidence for how S-region transcripts interconvert between G4 and R-loop structures to promote CSR in the IgH locus. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.

  9. Reviving the RNA World: An Insight into the Appearance of RNA Methyltransferases

    PubMed Central

    Rana, Ajay K.; Ankri, Serge

    2016-01-01

    RNA, the earliest genetic and catalytic molecule, has a relatively delicate and labile chemical structure, when compared to DNA. It is prone to be damaged by alkali, heat, nucleases, or stress conditions. One mechanism to protect RNA or DNA from damage is through site-specific methylation. Here, we propose that RNA methylation began prior to DNA methylation in the early forms of life evolving on Earth. In this article, the biochemical properties of some RNA methyltransferases (MTases), such as 2′-O-MTases (Rlml/RlmN), spOUT MTases and the NSun2 MTases are dissected for the insight they provide on the transition from an RNA world to our present RNA/DNA/protein world. PMID:27375676

  10. Eukaryotic ribonucleases P/MRP: the crystal structure of the P3 domain.

    PubMed

    Perederina, Anna; Esakova, Olga; Quan, Chao; Khanova, Elena; Krasilnikov, Andrey S

    2010-02-17

    Ribonuclease (RNase) P is a site-specific endoribonuclease found in all kingdoms of life. Typical RNase P consists of a catalytic RNA component and a protein moiety. In the eukaryotes, the RNase P lineage has split into two, giving rise to a closely related enzyme, RNase MRP, which has similar components but has evolved to have different specificities. The eukaryotic RNases P/MRP have acquired an essential helix-loop-helix protein-binding RNA domain P3 that has an important function in eukaryotic enzymes and distinguishes them from bacterial and archaeal RNases P. Here, we present a crystal structure of the P3 RNA domain from Saccharomyces cerevisiae RNase MRP in a complex with RNase P/MRP proteins Pop6 and Pop7 solved to 2.7 A. The structure suggests similar structural organization of the P3 RNA domains in RNases P/MRP and possible functions of the P3 domains and proteins bound to them in the stabilization of the holoenzymes' structures as well as in interactions with substrates. It provides the first insight into the structural organization of the eukaryotic enzymes of the RNase P/MRP family.

  11. Structural elements and organization of the ancestral translational machinery

    NASA Technical Reports Server (NTRS)

    Rein, R.; Srinivasan, S.; Mcdonald, J.; Raghunathan, G.; Shibata, M.

    1987-01-01

    The molecular mechanisms of the primitive translational apparatus are discussed in the framework of present-day protein biosynthesis. The structural necessities of an early adaptor and the multipoint recognition properties of such an adaptor are investigated on the basis of structure/function relationships found in a contemporary system and a molecular model of the contemporary transpeptidation complex. A model of the tRNA(Tyr)-tyrosyl tRNA synthetase complex including the positioning of the disordered region is proposed; the model is used to illustrate the required recognition properties of the ancestor aminoacyl synthetase.

  12. A semi-supervised learning approach for RNA secondary structure prediction.

    PubMed

    Yonemoto, Haruka; Asai, Kiyoshi; Hamada, Michiaki

    2015-08-01

    RNA secondary structure prediction is a key technology in RNA bioinformatics. Most algorithms for RNA secondary structure prediction use probabilistic models, in which the model parameters are trained with reliable RNA secondary structures. Because of the difficulty of determining RNA secondary structures by experimental procedures, such as NMR or X-ray crystal structural analyses, there are still many RNA sequences that could be useful for training whose secondary structures have not been experimentally determined. In this paper, we introduce a novel semi-supervised learning approach for training parameters in a probabilistic model of RNA secondary structures in which we employ not only RNA sequences with annotated secondary structures but also ones with unknown secondary structures. Our model is based on a hybrid of generative (stochastic context-free grammars) and discriminative models (conditional random fields) that has been successfully applied to natural language processing. Computational experiments indicate that the accuracy of secondary structure prediction is improved by incorporating RNA sequences with unknown secondary structures into training. To our knowledge, this is the first study of a semi-supervised learning approach for RNA secondary structure prediction. This technique will be useful when the number of reliable structures is limited. Copyright © 2015 Elsevier Ltd. All rights reserved.

  13. CID-miRNA: A web server for prediction of novel miRNA precursors in human genome

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tyagi, Sonika; Vaz, Candida; Gupta, Vipin

    2008-08-08

    microRNAs (miRNA) are a class of non-protein coding functional RNAs that are thought to regulate expression of target genes by direct interaction with mRNAs. miRNAs have been identified through both experimental and computational methods in a variety of eukaryotic organisms. Though these approaches have been partially successful, there is a need to develop more tools for detection of these RNAs as they are also thought to be present in abundance in many genomes. In this report we describe a tool and a web server, named CID-miRNA, for identification of miRNA precursors in a given DNA sequence, utilising secondary structure-based filteringmore » systems and an algorithm based on stochastic context free grammar trained on human miRNAs. CID-miRNA analyses a given sequence using a web interface, for presence of putative miRNA precursors and the generated output lists all the potential regions that can form miRNA-like structures. It can also scan large genomic sequences for the presence of potential miRNA precursors in its stand-alone form. The web server can be accessed at (http://mirna.jnu.ac.in/cidmirna/)« less

  14. Detection of 224 candidate structured RNAs by comparative analysis of specific subsets of intergenic regions

    PubMed Central

    Lünse, Christina E.; Corbino, Keith A.; Ames, Tyler D.; Nelson, James W.; Roth, Adam; Perkins, Kevin R.; Sherlock, Madeline E.

    2017-01-01

    Abstract The discovery of structured non-coding RNAs (ncRNAs) in bacteria can reveal new facets of biology and biochemistry. Comparative genomics analyses executed by powerful computer algorithms have successfully been used to uncover many novel bacterial ncRNA classes in recent years. However, this general search strategy favors the discovery of more common ncRNA classes, whereas progressively rarer classes are correspondingly more difficult to identify. In the current study, we confront this problem by devising several methods to select subsets of intergenic regions that can concentrate these rare RNA classes, thereby increasing the probability that comparative sequence analysis approaches will reveal their existence. By implementing these methods, we discovered 224 novel ncRNA classes, which include ROOL RNA, an RNA class averaging 581 nt and present in multiple phyla, several highly conserved and widespread ncRNA classes with properties that suggest sophisticated biochemical functions and a multitude of putative cis-regulatory RNA classes involved in a variety of biological processes. We expect that further research on these newly found RNA classes will reveal additional aspects of novel biology, and allow for greater insights into the biochemistry performed by ncRNAs. PMID:28977401

  15. The gene coding for small ribosomal subunit RNA in the basidiomycete Ustilago maydis contains a group I intron.

    PubMed Central

    De Wachter, R; Neefs, J M; Goris, A; Van de Peer, Y

    1992-01-01

    The nucleotide sequence of the gene coding for small ribosomal subunit RNA in the basidiomycete Ustilago maydis was determined. It revealed the presence of a group I intron with a length of 411 nucleotides. This is the third occurrence of such an intron discovered in a small subunit rRNA gene encoded by a eukaryotic nuclear genome. The other two occurrences are in Pneumocystis carinii, a fungus of uncertain taxonomic status, and Ankistrodesmus stipitatus, a green alga. The nucleotides of the conserved core structure of 101 group I intron sequences present in different genes and genome types were aligned and their evolutionary relatedness was examined. This revealed a cluster including all group I introns hitherto found in eukaryotic nuclear genes coding for small and large subunit rRNAs. A secondary structure model was designed for the area of the Ustilago maydis small ribosomal subunit RNA precursor where the intron is situated. It shows that the internal guide sequence pairing with the intron boundaries fits between two helices of the small subunit rRNA, and that minimal rearrangement of base pairs suffices to achieve the definitive secondary structure of the 18S rRNA upon splicing. PMID:1561081

  16. DNA recognition by an RNA-guided bacterial Argonaute

    PubMed Central

    Doudna, Jennifer A.

    2017-01-01

    Argonaute (Ago) proteins are widespread in prokaryotes and eukaryotes and share a four-domain architecture capable of RNA- or DNA-guided nucleic acid recognition. Previous studies identified a prokaryotic Argonaute protein from the eubacterium Marinitoga piezophila (MpAgo), which binds preferentially to 5′-hydroxylated guide RNAs and cleaves single-stranded RNA (ssRNA) and DNA (ssDNA) targets. Here we present a 3.2 Å resolution crystal structure of MpAgo bound to a 21-nucleotide RNA guide and a complementary 21-nucleotide ssDNA substrate. Comparison of this ternary complex to other target-bound Argonaute structures reveals a unique orientation of the N-terminal domain, resulting in a straight helical axis of the entire RNA-DNA heteroduplex through the central cleft of the protein. Additionally, mismatches introduced into the heteroduplex reduce MpAgo cleavage efficiency with a symmetric profile centered around the middle of the helix. This pattern differs from the canonical mismatch tolerance of other Argonautes, which display decreased cleavage efficiency for substrates bearing sequence mismatches to the 5′ region of the guide strand. This structural analysis of MpAgo bound to a hybrid helix advances our understanding of the diversity of target recognition mechanisms by Argonaute proteins. PMID:28520746

  17. Overlapping local and long-range RNA-RNA interactions modulate dengue virus genome cyclization and replication.

    PubMed

    de Borba, Luana; Villordo, Sergio M; Iglesias, Nestor G; Filomatori, Claudia V; Gebhard, Leopoldo G; Gamarnik, Andrea V

    2015-03-01

    The dengue virus genome is a dynamic molecule that adopts different conformations in the infected cell. Here, using RNA folding predictions, chemical probing analysis, RNA binding assays, and functional studies, we identified new cis-acting elements present in the capsid coding sequence that facilitate cyclization of the viral RNA by hybridization with a sequence involved in a local dumbbell structure at the viral 3' untranslated region (UTR). The identified interaction differentially enhances viral replication in mosquito and mammalian cells. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  18. The impact of dyskeratosis congenita mutations on the structure and dynamics of the human telomerase RNA pseudoknot domain | Center for Cancer Research

    Cancer.gov

    The pseudoknot domain is a functionally crucial part of telomerase RNA and influences the activity and stability of the ribonucleoprotein complex. Autosomal dominant dyskeratosis congenita (DKC) is an inherited disease that is linked to mutations in telomerase RNA and impairs telomerase function. In this paper, we present a computational prediction of the influence of two base

  19. New Insights into the Biological Role of Mammalian ADARs; the RNA Editing Proteins

    PubMed Central

    Mannion, Niamh; Arieti, Fabiana; Gallo, Angela; Keegan, Liam P.; O’Connell, Mary A.

    2015-01-01

    The ADAR proteins deaminate adenosine to inosine in double-stranded RNA which is one of the most abundant modifications present in mammalian RNA. Inosine can have a profound effect on the RNAs that are edited, not only changing the base-pairing properties, but can also result in recoding, as inosine behaves as if it were guanosine. In mammals there are three ADAR proteins and two ADAR-related proteins (ADAD) expressed. All have a very similar modular structure; however, both their expression and biological function differ significantly. Only two of the ADAR proteins have enzymatic activity. However, both ADAR and ADAD proteins possess the ability to bind double-strand RNA. Mutations in ADARs have been associated with many diseases ranging from cancer, innate immunity to neurological disorders. Here, we will discuss in detail the domain structure of mammalian ADARs, the effects of RNA editing, and the role of ADARs in human diseases. PMID:26437436

  20. Frnakenstein: multiple target inverse RNA folding.

    PubMed

    Lyngsø, Rune B; Anderson, James W J; Sizikova, Elena; Badugu, Amarendra; Hyland, Tomas; Hein, Jotun

    2012-10-09

    RNA secondary structure prediction, or folding, is a classic problem in bioinformatics: given a sequence of nucleotides, the aim is to predict the base pairs formed in its three dimensional conformation. The inverse problem of designing a sequence folding into a particular target structure has only more recently received notable interest. With a growing appreciation and understanding of the functional and structural properties of RNA motifs, and a growing interest in utilising biomolecules in nano-scale designs, the interest in the inverse RNA folding problem is bound to increase. However, whereas the RNA folding problem from an algorithmic viewpoint has an elegant and efficient solution, the inverse RNA folding problem appears to be hard. In this paper we present a genetic algorithm approach to solve the inverse folding problem. The main aims of the development was to address the hitherto mostly ignored extension of solving the inverse folding problem, the multi-target inverse folding problem, while simultaneously designing a method with superior performance when measured on the quality of designed sequences. The genetic algorithm has been implemented as a Python program called Frnakenstein. It was benchmarked against four existing methods and several data sets totalling 769 real and predicted single structure targets, and on 292 two structure targets. It performed as well as or better at finding sequences which folded in silico into the target structure than all existing methods, without the heavy bias towards CG base pairs that was observed for all other top performing methods. On the two structure targets it also performed well, generating a perfect design for about 80% of the targets. Our method illustrates that successful designs for the inverse RNA folding problem does not necessarily have to rely on heavy biases in base pair and unpaired base distributions. The design problem seems to become more difficult on larger structures when the target structures are real structures, while no deterioration was observed for predicted structures. Design for two structure targets is considerably more difficult, but far from impossible, demonstrating the feasibility of automated design of artificial riboswitches. The Python implementation is available at http://www.stats.ox.ac.uk/research/genome/software/frnakenstein.

  1. Frnakenstein: multiple target inverse RNA folding

    PubMed Central

    2012-01-01

    Background RNA secondary structure prediction, or folding, is a classic problem in bioinformatics: given a sequence of nucleotides, the aim is to predict the base pairs formed in its three dimensional conformation. The inverse problem of designing a sequence folding into a particular target structure has only more recently received notable interest. With a growing appreciation and understanding of the functional and structural properties of RNA motifs, and a growing interest in utilising biomolecules in nano-scale designs, the interest in the inverse RNA folding problem is bound to increase. However, whereas the RNA folding problem from an algorithmic viewpoint has an elegant and efficient solution, the inverse RNA folding problem appears to be hard. Results In this paper we present a genetic algorithm approach to solve the inverse folding problem. The main aims of the development was to address the hitherto mostly ignored extension of solving the inverse folding problem, the multi-target inverse folding problem, while simultaneously designing a method with superior performance when measured on the quality of designed sequences. The genetic algorithm has been implemented as a Python program called Frnakenstein. It was benchmarked against four existing methods and several data sets totalling 769 real and predicted single structure targets, and on 292 two structure targets. It performed as well as or better at finding sequences which folded in silico into the target structure than all existing methods, without the heavy bias towards CG base pairs that was observed for all other top performing methods. On the two structure targets it also performed well, generating a perfect design for about 80% of the targets. Conclusions Our method illustrates that successful designs for the inverse RNA folding problem does not necessarily have to rely on heavy biases in base pair and unpaired base distributions. The design problem seems to become more difficult on larger structures when the target structures are real structures, while no deterioration was observed for predicted structures. Design for two structure targets is considerably more difficult, but far from impossible, demonstrating the feasibility of automated design of artificial riboswitches. The Python implementation is available at http://www.stats.ox.ac.uk/research/genome/software/frnakenstein. PMID:23043260

  2. RNApdbee 2.0: multifunctional tool for RNA structure annotation.

    PubMed

    Zok, Tomasz; Antczak, Maciej; Zurkowski, Michal; Popenda, Mariusz; Blazewicz, Jacek; Adamiak, Ryszard W; Szachniuk, Marta

    2018-04-30

    In the field of RNA structural biology and bioinformatics, an access to correctly annotated RNA structure is of crucial importance, especially in the secondary and 3D structure predictions. RNApdbee webserver, introduced in 2014, primarily aimed to address the problem of RNA secondary structure extraction from the PDB files. Its new version, RNApdbee 2.0, is a highly advanced multifunctional tool for RNA structure annotation, revealing the relationship between RNA secondary and 3D structure given in the PDB or PDBx/mmCIF format. The upgraded version incorporates new algorithms for recognition and classification of high-ordered pseudoknots in large RNA structures. It allows analysis of isolated base pairs impact on RNA structure. It can visualize RNA secondary structures-including that of quadruplexes-with depiction of non-canonical interactions. It also annotates motifs to ease identification of stems, loops and single-stranded fragments in the input RNA structure. RNApdbee 2.0 is implemented as a publicly available webserver with an intuitive interface and can be freely accessed at http://rnapdbee.cs.put.poznan.pl/.

  3. Empirical analysis of RNA robustness and evolution using high-throughput sequencing of ribozyme reactions.

    PubMed

    Hayden, Eric J

    2016-08-15

    RNA molecules provide a realistic but tractable model of a genotype to phenotype relationship. This relationship has been extensively investigated computationally using secondary structure prediction algorithms. Enzymatic RNA molecules, or ribozymes, offer access to genotypic and phenotypic information in the laboratory. Advancements in high-throughput sequencing technologies have enabled the analysis of sequences in the lab that now rivals what can be accomplished computationally. This has motivated a resurgence of in vitro selection experiments and opened new doors for the analysis of the distribution of RNA functions in genotype space. A body of computational experiments has investigated the persistence of specific RNA structures despite changes in the primary sequence, and how this mutational robustness can promote adaptations. This article summarizes recent approaches that were designed to investigate the role of mutational robustness during the evolution of RNA molecules in the laboratory, and presents theoretical motivations, experimental methods and approaches to data analysis. Copyright © 2016 Elsevier Inc. All rights reserved.

  4. Invariant U2 snRNA nucleotides form a stem loop to recognize the intron early in splicing

    PubMed Central

    Perriman, Rhonda; Ares, Manuel

    2010-01-01

    U2 snRNA-intron branchpoint pairing is a critical step in pre-mRNA recognition by the splicing apparatus, but the mechanism by which these two RNAs engage each other is unknown. Here we identify a new U2 snRNA structure, the branchpoint interaction stem-loop (BSL), that presents the U2 nucleotides that will contact the intron. We provide evidence that the BSL forms prior to interaction with the intron, and is disrupted by the DExD/H protein Prp5p during engagement of the snRNA with the intron. In vitro splicing complex assembly in a BSL-destabilized mutant extract suggests that the BSL is required at a previously unrecognized step between commitment complex and prespliceosome formation. The extreme evolutionary conservation of the BSL suggests it represents an ancient structural solution to the problem of intron branchpoint recognition by dynamic RNA elements that must serve multiple functions at other times during splicing. PMID:20471947

  5. The molecular dynamics of long noncoding RNA control of transcription in PTEN and its pseudogene

    PubMed Central

    Lister, Nicholas; Shevchenko, Galina; Walshe, James L.; Groen, Jessica; Johnsson, Per; Vidarsdóttir, Linda; Grander, Dan; Ataide, Sandro F.; Morris, Kevin V.

    2017-01-01

    RNA has been found to interact with chromatin and modulate gene transcription. In human cells, little is known about how long noncoding RNAs (lncRNAs) interact with target loci in the context of chromatin. We find here, using the phosphatase and tensin homolog (PTEN) pseudogene as a model system, that antisense lncRNAs interact first with a 5′ UTR-containing promoter-spanning transcript, which is then followed by the recruitment of DNA methyltransferase 3a (DNMT3a), ultimately resulting in the transcriptional and epigenetic control of gene expression. Moreover, we find that the lncRNA and promoter-spanning transcript interaction are based on a combination of structural and sequence components of the antisense lncRNA. These observations suggest, on the basis of this one example, that evolutionary pressures may be placed on RNA structure more so than sequence conservation. Collectively, the observations presented here suggest a much more complex and vibrant RNA regulatory world may be operative in the regulation of gene expression. PMID:28847966

  6. dsRNA-protein interactions studied by molecular dynamics techniques. Unravelling dsRNA recognition by DCL1.

    PubMed

    Drusin, Salvador I; Suarez, Irina P; Gauto, Diego F; Rasia, Rodolfo M; Moreno, Diego M

    2016-04-15

    Double stranded RNA (dsRNA) participates in several biological processes, where RNA molecules acquire secondary structure inside the cell through base complementarity. The double stranded RNA binding domain (dsRBD) is one of the main protein folds that is able to recognize and bind to dsRNA regions. The N-terminal dsRBD of DCL1 in Arabidopsis thaliana (DCL1-1), in contrast to other studied dsRBDs, lacks a stable structure, behaving as an intrinsically disordered protein. DCL1-1 does however recognize dsRNA by acquiring a canonical fold in the presence of its substrate. Here we present a detailed modeling and molecular dynamics study of dsRNA recognition by DCL1-1. We found that DCL1-1 forms stable complexes with different RNAs and we characterized the residues involved in binding. Although the domain shows a binding loop substantially shorter than other homologs, it can still interact with the dsRNA and results in bending of the dsRNA A-type helix. Furthermore, we found that R8, a non-conserved residue located in the first dsRNA binding region, recognizes preferentially mismatched base pairs. We discuss our findings in the context of the function of DCL1-1 within the microRNA processing complex. Copyright © 2016 Elsevier Inc. All rights reserved.

  7. Mechanism of mRNA-STAR domain interaction: Molecular dynamics simulations of Mammalian Quaking STAR protein.

    PubMed

    Sharma, Monika; Anirudh, C R

    2017-10-03

    STAR proteins are evolutionary conserved mRNA-binding proteins that post-transcriptionally regulate gene expression at all stages of RNA metabolism. These proteins possess conserved STAR domain that recognizes identical RNA regulatory elements as YUAAY. Recently reported crystal structures show that STAR domain is composed of N-terminal QUA1, K-homology domain (KH) and C-terminal QUA2, and mRNA binding is mediated by KH-QUA2 domain. Here, we present simulation studies done to investigate binding of mRNA to STAR protein, mammalian Quaking protein (QKI). We carried out conventional MD simulations of STAR domain in presence and absence of mRNA, and studied the impact of mRNA on the stability, dynamics and underlying allosteric mechanism of STAR domain. Our unbiased simulations results show that presence of mRNA stabilizes the overall STAR domain by reducing the structural deviations, correlating the 'within-domain' motions, and maintaining the native contacts information. Absence of mRNA not only influenced the essential modes of motion of STAR domain, but also affected the connectivity of networks within STAR domain. We further explored the dissociation of mRNA from STAR domain using umbrella sampling simulations, and the results suggest that mRNA binding to STAR domain occurs in multi-step: first conformational selection of mRNA backbone conformations, followed by induced fit mechanism as nucleobases interact with STAR domain.

  8. Comparative structural analysis of human DEAD-box RNA helicases.

    PubMed

    Schütz, Patrick; Karlberg, Tobias; van den Berg, Susanne; Collins, Ruairi; Lehtiö, Lari; Högbom, Martin; Holmberg-Schiavone, Lovisa; Tempel, Wolfram; Park, Hee-Won; Hammarström, Martin; Moche, Martin; Thorsell, Ann-Gerd; Schüler, Herwig

    2010-09-30

    DEAD-box RNA helicases play various, often critical, roles in all processes where RNAs are involved. Members of this family of proteins are linked to human disease, including cancer and viral infections. DEAD-box proteins contain two conserved domains that both contribute to RNA and ATP binding. Despite recent advances the molecular details of how these enzymes convert chemical energy into RNA remodeling is unknown. We present crystal structures of the isolated DEAD-domains of human DDX2A/eIF4A1, DDX2B/eIF4A2, DDX5, DDX10/DBP4, DDX18/myc-regulated DEAD-box protein, DDX20, DDX47, DDX52/ROK1, and DDX53/CAGE, and of the helicase domains of DDX25 and DDX41. Together with prior knowledge this enables a family-wide comparative structural analysis. We propose a general mechanism for opening of the RNA binding site. This analysis also provides insights into the diversity of DExD/H- proteins, with implications for understanding the functions of individual family members.

  9. Comparative Structural Analysis of Human DEAD-Box RNA Helicases

    PubMed Central

    Schütz, Patrick; Karlberg, Tobias; van den Berg, Susanne; Collins, Ruairi; Lehtiö, Lari; Högbom, Martin; Holmberg-Schiavone, Lovisa; Tempel, Wolfram; Park, Hee-Won; Hammarström, Martin; Moche, Martin; Thorsell, Ann-Gerd; Schüler, Herwig

    2010-01-01

    DEAD-box RNA helicases play various, often critical, roles in all processes where RNAs are involved. Members of this family of proteins are linked to human disease, including cancer and viral infections. DEAD-box proteins contain two conserved domains that both contribute to RNA and ATP binding. Despite recent advances the molecular details of how these enzymes convert chemical energy into RNA remodeling is unknown. We present crystal structures of the isolated DEAD-domains of human DDX2A/eIF4A1, DDX2B/eIF4A2, DDX5, DDX10/DBP4, DDX18/myc-regulated DEAD-box protein, DDX20, DDX47, DDX52/ROK1, and DDX53/CAGE, and of the helicase domains of DDX25 and DDX41. Together with prior knowledge this enables a family-wide comparative structural analysis. We propose a general mechanism for opening of the RNA binding site. This analysis also provides insights into the diversity of DExD/H- proteins, with implications for understanding the functions of individual family members. PMID:20941364

  10. Structures of human SRP72 complexes provide insights into SRP RNA remodeling and ribosome interaction

    PubMed Central

    Becker, Matthias M. M.; Lapouge, Karine; Segnitz, Bernd; Wild, Klemens; Sinning, Irmgard

    2017-01-01

    Co-translational protein targeting and membrane protein insertion is a fundamental process and depends on the signal recognition particle (SRP). In mammals, SRP is composed of the SRP RNA crucial for SRP assembly and function and six proteins. The two largest proteins SRP68 and SRP72 form a heterodimer and bind to a regulatory site of the SRP RNA. Despite their essential roles in the SRP pathway, structural information has been available only for the SRP68 RNA-binding domain (RBD). Here we present the crystal structures of the SRP68 protein-binding domain (PBD) in complex with SRP72-PBD and of the SRP72-RBD bound to the SRP S domain (SRP RNA, SRP19 and SRP68) detailing all interactions of SRP72 within SRP. The SRP72-PBD is a tetratricopeptide repeat, which binds an extended linear motif of SRP68 with high affinity. The SRP72-RBD is a flexible peptide crawling along the 5e- and 5f-loops of SRP RNA. A conserved tryptophan inserts into the 5e-loop forming a novel type of RNA kink-turn stabilized by a potassium ion, which we define as K+-turn. In addition, SRP72-RBD remodels the 5f-loop involved in ribosome binding and visualizes SRP RNA plasticity. Docking of the S domain structure into cryo-electron microscopy density maps reveals multiple contact sites between SRP68/72 and the ribosome, and explains the role of SRP72 in the SRP pathway. PMID:27899666

  11. Prevention of cross-talk in conserved regulatory systems: identification of specificity determinants in RNA-binding anti-termination proteins of the BglG family

    PubMed Central

    Hübner, Sebastian; Declerck, Nathalie; Diethmaier, Christine; Le Coq, Dominique; Aymerich, Stephane; Stülke, Jörg

    2011-01-01

    Each family of signal transduction systems requires specificity determinants that link individual signals to the correct regulatory output. In Bacillus subtilis, a family of four anti-terminator proteins controls the expression of genes for the utilisation of alternative sugars. These regulatory systems contain the anti-terminator proteins and a RNA structure, the RNA anti-terminator (RAT) that is bound by the anti-terminator proteins. We have studied three of these proteins (SacT, SacY, and LicT) to understand how they can transmit a specific signal in spite of their strong structural homology. A screen for random mutations that render SacT capable to bind a RNA structure recognized by LicT only revealed a substitution (P26S) at one of the few non-conserved residues that are in contact with the RNA. We have randomly modified this position in SacT together with another non-conserved RNA-contacting residue (Q31). Surprisingly, the mutant proteins could bind all RAT structures that are present in B. subtilis. In a complementary approach, reciprocal amino acid exchanges have been introduced in LicT and SacY at non-conserved positions of the RNA-binding site. This analysis revealed the key role of an arginine side-chain for both the high affinity and specificity of LicT for its cognate RAT. Introduction of this Arg at the equivalent position of SacY (A26) increased the RNA binding in vitro but also resulted in a relaxed specificity. Altogether our results suggest that this family of anti-termination proteins has evolved to reach a compromise between RNA binding efficacy and specific interaction with individual target sequences. PMID:21278164

  12. Ab initio RNA folding by discrete molecular dynamics: From structure prediction to folding mechanisms

    PubMed Central

    Ding, Feng; Sharma, Shantanu; Chalasani, Poornima; Demidov, Vadim V.; Broude, Natalia E.; Dokholyan, Nikolay V.

    2008-01-01

    RNA molecules with novel functions have revived interest in the accurate prediction of RNA three-dimensional (3D) structure and folding dynamics. However, existing methods are inefficient in automated 3D structure prediction. Here, we report a robust computational approach for rapid folding of RNA molecules. We develop a simplified RNA model for discrete molecular dynamics (DMD) simulations, incorporating base-pairing and base-stacking interactions. We demonstrate correct folding of 150 structurally diverse RNA sequences. The majority of DMD-predicted 3D structures have <4 Å deviations from experimental structures. The secondary structures corresponding to the predicted 3D structures consist of 94% native base-pair interactions. Folding thermodynamics and kinetics of tRNAPhe, pseudoknots, and mRNA fragments in DMD simulations are in agreement with previous experimental findings. Folding of RNA molecules features transient, non-native conformations, suggesting non-hierarchical RNA folding. Our method allows rapid conformational sampling of RNA folding, with computational time increasing linearly with RNA length. We envision this approach as a promising tool for RNA structural and functional analyses. PMID:18456842

  13. Structure of the prolyl-tRNA synthetase from the eukaryotic pathogen Giardia lamblia

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Larson, Eric T.; Kim, Jessica E.; Napuli, Alberto J.

    2012-09-01

    The structure of Giardia prolyl-tRNA synthetase cocrystallized with proline and ATP shows evidence for half-of-the-sites activity, leading to a corresponding mixture of reaction substrates and product (prolyl-AMP) in the two active sites of the dimer. The genome of the human intestinal parasite Giardia lamblia contains only a single aminoacyl-tRNA synthetase gene for each amino acid. The Giardia prolyl-tRNA synthetase gene product was originally misidentified as a dual-specificity Pro/Cys enzyme, in part owing to its unexpectedly high off-target activation of cysteine, but is now believed to be a normal representative of the class of archaeal/eukaryotic prolyl-tRNA synthetases. The 2.2 Å resolutionmore » crystal structure of the G. lamblia enzyme presented here is thus the first structure determination of a prolyl-tRNA synthetase from a eukaryote. The relative occupancies of substrate (proline) and product (prolyl-AMP) in the active site are consistent with half-of-the-sites reactivity, as is the observed biphasic thermal denaturation curve for the protein in the presence of proline and MgATP. However, no corresponding induced asymmetry is evident in the structure of the protein. No thermal stabilization is observed in the presence of cysteine and ATP. The implied low affinity for the off-target activation product cysteinyl-AMP suggests that translational fidelity in Giardia is aided by the rapid release of misactivated cysteine.« less

  14. Analyses of a whole-genome inter-clade recombination map of hepatitis delta virus suggest a host polymerase-driven and viral RNA structure-promoted template-switching mechanism for viral RNA recombination

    PubMed Central

    Chao, Mei; Wang, Tzu-Chi; Lin, Chia-Chi; Yung-Liang Wang, Robert; Lin, Wen-Bin; Lee, Shang-En; Cheng, Ying-Yu; Yeh, Chau-Ting; Iang, Shan-Bei

    2017-01-01

    The genome of hepatitis delta virus (HDV) is a 1.7-kb single-stranded circular RNA that folds into an unbranched rod-like structure and has ribozyme activity. HDV redirects host RNA polymerase(s) (RNAP) to perform viral RNA-directed RNA transcription. RNA recombination is known to contribute to the genetic heterogeneity of HDV, but its molecular mechanism is poorly understood. Here, we established a whole-genome HDV-1/HDV-4 recombination map using two cloned sequences coexisting in cultured cells. Our functional analyses of the resulting chimeric delta antigens (the only viral-encoded protein) and recombinant genomes provide insights into how recombination promotes the genotypic and phenotypic diversity of HDV. Our examination of crossover distribution and subsequent mutagenesis analyses demonstrated that ribozyme activity on HDV genome, which is required for viral replication, also contributes to the generation of an inter-clade junction. These data provide circumstantial evidence supporting our contention that HDV RNA recombination occurs via a replication-dependent mechanism. Furthermore, we identify an intrinsic asymmetric bulge on the HDV genome, which appears to promote recombination events in the vicinity. We therefore propose a mammalian RNAP-driven and viral-RNA-structure-promoted template-switching mechanism for HDV genetic recombination. The present findings improve our understanding of the capacities of the host RNAP beyond typical DNA-directed transcription. PMID:28977829

  15. MicroRNA-like viral small RNA from porcine reproductive and respiratory syndrome virus negatively regulates viral replication by targeting the viral nonstructural protein 2.

    PubMed

    Li, Na; Yan, Yunhuan; Zhang, Angke; Gao, Jiming; Zhang, Chong; Wang, Xue; Hou, Gaopeng; Zhang, Gaiping; Jia, Jinbu; Zhou, En-Min; Xiao, Shuqi

    2016-12-13

    Many viruses encode microRNAs (miRNAs) that are small non-coding single-stranded RNAs which play critical roles in virus-host interactions. Porcine reproductive and respiratory syndrome virus (PRRSV) is one of the most economically impactful viruses in the swine industry. The present study sought to determine whether PRRSV encodes miRNAs that could regulate PRRSV replication. Four viral small RNAs (vsRNAs) were mapped to the stem-loop structures in the ORF1a, ORF1b and GP2a regions of the PRRSV genome by bioinformatics prediction and experimental verification. Of these, the structures with the lowest minimum free energy (MFE) values predicted for PRRSV-vsRNA1 corresponded to typical stem-loop, hairpin structures. Inhibition of PRRSV-vsRNA1 function led to significant increases in viral replication. Transfection with PRRSV-vsRNA1 mimics significantly inhibited PRRSV replication in primary porcine alveolar macrophages (PAMs). The time-dependent increase in the abundance of PRRSV-vsRNA1 mirrored the gradual upregulation of PRRSV RNA expression. Knockdown of proteins associated with cellular miRNA biogenesis demonstrated that Drosha and Argonaute (Ago2) are involved in PRRSV-vsRNA1 biogenesis. Moreover, PRRSV-vsRNA1 bound specifically to the nonstructural protein 2 (NSP2)-coding sequence of PRRSV genome RNA. Collectively, the results reveal that PRRSV encodes a functional PRRSV-vsRNA1 which auto-regulates PRRSV replication by directly targeting and suppressing viral NSP2 gene expression. These findings not only provide new insights into the mechanism of the pathogenesis of PRRSV, but also explore a potential avenue for controlling PRRSV infection using viral small RNAs.

  16. Circularly polarized luminescence of helically assembled pyrene π-stacks on RNA and DNA duplexes.

    PubMed

    Nakamura, Mitsunobu; Ota, Fuyuki; Takada, Tadao; Akagi, Kazuo; Yamana, Kazushige

    2018-05-01

    In this report, we describe the circularly polarized luminescence (CPL) of the RNA duplexes having one to four 2'-O-pyrene modified uridines (Upy) and the DNA duplexes having two, four, and six pyrene modified non-nucleosidic linkers (Py). Both the pyrene π-stack arrays formed on the RNA and DNA double helical structures exhibited pyrene excimer fluorescence. In the pyrene-modified RNA systems, the RNA duplex having four Upys gives CPL emission with g lum value of <0.01 at 480 nm. The structure of pyrene stacks on the RNA duplex may be rigidly regulated with increase in the Upy domains, which resulted in the CPL emission. In the DNA systems, the pyrene-modified duplexes containing two and four Pys exhibited CPL emission with g lum values of <0.001 at 505 nm. The pyrene π-stack arrays presented here show CPL emission. However, the g lum values are relatively small when compared with our previous system consisting of the pyrene-zipper arrays on RNA. © 2018 Wiley Periodicals, Inc.

  17. The True Story and Advantages of RNA Phage Capsids as Nanotools.

    PubMed

    Pumpens, Paul; Renhofa, Regina; Dishlers, Andris; Kozlovska, Tatjana; Ose, Velta; Pushko, Peter; Tars, Kaspars; Grens, Elmars; Bachmann, Martin F

    2016-01-01

    RNA phages are often used as prototypes for modern recombinant virus-like particle (VLP) technologies. Icosahedral RNA phage VLPs can be formed from coat proteins (CPs) and are efficiently produced in bacteria and yeast. Both genetic fusion and chemical coupling have been successfully used for the production of numerous chimeras based on RNA phage VLPs. In this review, we describe advances in RNA phage VLP technology along with the history of the Leviviridae family, including its taxonomical organization, genomic structure, and important role in the development of molecular biology. Comparative 3D structures of different RNA phage VLPs are used to explain the level of VLP tolerance to foreign elements displayed on VLP surfaces. We also summarize data that demonstrate the ability of CPs to tolerate different organic (peptides, oligonucleotides, and carbohydrates) and inorganic (metal ions) compounds either chemically coupled or noncovalently added to the outer and/or inner surfaces of VLPs. Finally, we present lists of nanotechnological RNA phage VLP applications, such as experimental vaccines constructed by genetic fusion and chemical coupling methodologies, nanocontainers for targeted drug delivery, and bioimaging tools. © 2016 S. Karger AG, Basel.

  18. The 5S rRNA loop E: chemical probing and phylogenetic data versus crystal structure.

    PubMed

    Leontis, N B; Westhof, E

    1998-09-01

    A significant fraction of the bases in a folded, structured RNA molecule participate in noncanonical base pairing interactions, often in the context of internal loops or multi-helix junction loops. The appearance of each new high-resolution RNA structure provides welcome data to guide efforts to understand and predict RNA 3D structure, especially when the RNA in question is a functionally conserved molecule. The recent publication of the crystal structure of the "Loop E" region of bacterial 5S ribosomal RNA is such an event [Correll CC, Freeborn B, Moore PB, Steitz TA, 1997, Cell 91:705-712]. In addition to providing more examples of already established noncanonical base pairs, such as purine-purine sheared pairings, trans-Hoogsteen UA, and GU wobble pairs, the structure provides the first high-resolution views of two new purine-purine pairings and a new GU pairing. The goal of the present analysis is to expand the capabilities of both chemical probing and phylogenetic analysis to predict with greater accuracy the structures of RNA molecules. First, in light of existing chemical probing data, we investigate what lessons could be learned regarding the interpretation of this widely used method of RNA structure probing. Then we analyze the 3D structure with reference to molecular phylogeny data (assuming conservation of function) to discover what alternative base pairings are geometrically compatible with the structure. The comparisons between previous modeling efforts and crystal structures show that the intricate involvements of ions and water molecules in the maintenance of non-Watson-Crick pairs render the process of correctly identifying the interacting sites in such pairs treacherous, except in cases of trans-Hoogsteen A/U or sheared A/G pairs for the adenine N1 site. The phylogenetic analysis identifies A/A, A/C, A/U and C/A, C/C, and C/U pairings isosteric with sheared A/G, as well as A/A and A/C pairings isosteric with both G/U and G/G bifurcated pairings. Thus, each non-Watson-Crick pair could be characterized by a phylogenetic signature of variations between isosteric-like pairings. In addition to the conservative changes, which form a dictionary of pairings isosterically compatible with those observed in the crystal structure, concerted changes involving several base pairs also occur. The latter covariations may indicate transitions between related but distinctive motifs within the loop E of 5S ribosomal RNA.

  19. RNA polymerase pausing and nascent RNA structure formation are linked through clamp domain movement

    PubMed Central

    Hein, Pyae P.; Kolb, Kellie E.; Windgassen, Tricia; Bellecourt, Michael J.; Darst, Seth A.; Mooney, Rachel A.; Landick, Robert

    2014-01-01

    The rates of RNA synthesis and nascent RNA folding into biologically active structures are linked via pausing by RNA polymerase (RNAP). Structures that form within the RNA exit channel can increase pausing by interacting with bacterial RNAP or decrease pausing by preventing backtracking. Conversely, pausing is required for proper folding of some RNAs. Opening of the RNAP clamp domain is proposed to mediate some effects of nascent RNA structures. However, the connections among RNA structure formation, clamp movement, and catalytic activity remain uncertain. We assayed exit-channel structure formation in Escherichia coli RNAP together with disulfide crosslinks that favor closed or open clamp conformations and found that clamp position directly influences RNA structure formation and catalytic activity. We report that exit-channel RNA structures slow pause escape by favoring clamp opening and through interactions with the flap that slow translocation. PMID:25108353

  20. RNase MRP Cleaves Pre-tRNASer-Met in the tRNA Maturation Pathway

    PubMed Central

    Adachi, Kousuke; Nobe, Yuko; Kobayashi, Junya; Hirota, Kouji; Oliveira, Douglas V.; Taoka, Masato; Isobe, Toshiaki

    2014-01-01

    Ribonuclease mitochondrial RNA processing (RNase MRP) is a multifunctional ribonucleoprotein (RNP) complex that is involved in the maturation of various types of RNA including ribosomal RNA. RNase MRP consists of a potential catalytic RNA and several protein components, all of which are required for cell viability. We show here that the temperature-sensitive mutant of rmp1, the gene for a unique protein component of RNase MRP, accumulates the dimeric tRNA precursor, pre-tRNASer-Met. To examine whether RNase MRP mediates tRNA maturation, we purified the RNase MRP holoenzyme from the fission yeast Schizosaccharomyces pombe and found that the enzyme directly and selectively cleaves pre-tRNASer-Met, suggesting that RNase MRP participates in the maturation of specific tRNA in vivo. In addition, mass spectrometry–based ribonucleoproteomic analysis demonstrated that this RNase MRP consists of one RNA molecule and 11 protein components, including a previously unknown component Rpl701. Notably, limited nucleolysis of RNase MRP generated an active catalytic core consisting of partial mrp1 RNA fragments, which constitute “Domain 1” in the secondary structure of RNase MRP, and 8 proteins. Thus, the present study provides new insight into the structure and function of RNase MRP. PMID:25401760

  1. RNA2DMut: a web tool for the design and analysis of RNA structure mutations.

    PubMed

    Moss, Walter N

    2018-03-01

    With the widespread application of high-throughput sequencing, novel RNA sequences are being discovered at an astonishing rate. The analysis of function, however, lags behind. In both the cis - and trans -regulatory functions of RNA, secondary structure (2D base-pairing) plays essential regulatory roles. In order to test RNA function, it is essential to be able to design and analyze mutations that can affect structure. This was the motivation for the creation of the RNA2DMut web tool. With RNA2DMut, users can enter in RNA sequences to analyze, constrain mutations to specific residues, or limit changes to purines/pyrimidines. The sequence is analyzed at each base to determine the effect of every possible point mutation on 2D structure. The metrics used in RNA2DMut rely on the calculation of the Boltzmann structure ensemble and do not require a robust 2D model of RNA structure for designing mutations. This tool can facilitate a wide array of uses involving RNA: for example, in designing and evaluating mutants for biological assays, interrogating RNA-protein interactions, identifying key regions to alter in SELEX experiments, and improving RNA folding and crystallization properties for structural biology. Additional tools are available to help users introduce other mutations (e.g., indels and substitutions) and evaluate their effects on RNA structure. Example calculations are shown for five RNAs that require 2D structure for their function: the MALAT1 mascRNA, an influenza virus splicing regulatory motif, the EBER2 viral noncoding RNA, the Xist lncRNA repA region, and human Y RNA 5. RNA2DMut can be accessed at https://rna2dmut.bb.iastate.edu/. © 2018 Moss; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  2. Thermodynamic insights into 2-thiouridine-enhanced RNA hybridization

    PubMed Central

    Larsen, Aaron T.; Fahrenbach, Albert C.; Sheng, Jia; Pian, Julia; Szostak, Jack W.

    2015-01-01

    Nucleobase modifications dramatically alter nucleic acid structure and thermodynamics. 2-thiouridine (s2U) is a modified nucleobase found in tRNAs and known to stabilize U:A base pairs and destabilize U:G wobble pairs. The recently reported crystal structures of s2U-containing RNA duplexes do not entirely explain the mechanisms responsible for the stabilizing effect of s2U or whether this effect is entropic or enthalpic in origin. We present here thermodynamic evaluations of duplex formation using ITC and UV thermal denaturation with RNA duplexes containing internal s2U:A and s2U:U pairs and their native counterparts. These results indicate that s2U stabilizes both duplexes. The stabilizing effect is entropic in origin and likely results from the s2U-induced preorganization of the single-stranded RNA prior to hybridization. The same preorganizing effect is likely responsible for structurally resolving the s2U:U pair-containing duplex into a single conformation with a well-defined H-bond geometry. We also evaluate the effect of s2U on single strand conformation using UV- and CD-monitored thermal denaturation and on nucleoside conformation using 1H NMR spectroscopy, MD and umbrella sampling. These results provide insights into the effects that nucleobase modification has on RNA structure and thermodynamics and inform efforts toward improving both ribozyme-catalyzed and nonenzymatic RNA copying. PMID:26240387

  3. A global sampling approach to designing and reengineering RNA secondary structures.

    PubMed

    Levin, Alex; Lis, Mieszko; Ponty, Yann; O'Donnell, Charles W; Devadas, Srinivas; Berger, Bonnie; Waldispühl, Jérôme

    2012-11-01

    The development of algorithms for designing artificial RNA sequences that fold into specific secondary structures has many potential biomedical and synthetic biology applications. To date, this problem remains computationally difficult, and current strategies to address it resort to heuristics and stochastic search techniques. The most popular methods consist of two steps: First a random seed sequence is generated; next, this seed is progressively modified (i.e. mutated) to adopt the desired folding properties. Although computationally inexpensive, this approach raises several questions such as (i) the influence of the seed; and (ii) the efficiency of single-path directed searches that may be affected by energy barriers in the mutational landscape. In this article, we present RNA-ensign, a novel paradigm for RNA design. Instead of taking a progressive adaptive walk driven by local search criteria, we use an efficient global sampling algorithm to examine large regions of the mutational landscape under structural and thermodynamical constraints until a solution is found. When considering the influence of the seeds and the target secondary structures, our results show that, compared to single-path directed searches, our approach is more robust, succeeds more often and generates more thermodynamically stable sequences. An ensemble approach to RNA design is thus well worth pursuing as a complement to existing approaches. RNA-ensign is available at http://csb.cs.mcgill.ca/RNAensign.

  4. A global sampling approach to designing and reengineering RNA secondary structures

    PubMed Central

    Levin, Alex; Lis, Mieszko; Ponty, Yann; O’Donnell, Charles W.; Devadas, Srinivas; Berger, Bonnie; Waldispühl, Jérôme

    2012-01-01

    The development of algorithms for designing artificial RNA sequences that fold into specific secondary structures has many potential biomedical and synthetic biology applications. To date, this problem remains computationally difficult, and current strategies to address it resort to heuristics and stochastic search techniques. The most popular methods consist of two steps: First a random seed sequence is generated; next, this seed is progressively modified (i.e. mutated) to adopt the desired folding properties. Although computationally inexpensive, this approach raises several questions such as (i) the influence of the seed; and (ii) the efficiency of single-path directed searches that may be affected by energy barriers in the mutational landscape. In this article, we present RNA-ensign, a novel paradigm for RNA design. Instead of taking a progressive adaptive walk driven by local search criteria, we use an efficient global sampling algorithm to examine large regions of the mutational landscape under structural and thermodynamical constraints until a solution is found. When considering the influence of the seeds and the target secondary structures, our results show that, compared to single-path directed searches, our approach is more robust, succeeds more often and generates more thermodynamically stable sequences. An ensemble approach to RNA design is thus well worth pursuing as a complement to existing approaches. RNA-ensign is available at http://csb.cs.mcgill.ca/RNAensign. PMID:22941632

  5. Structure and Function of the N-Terminal Domain of the Vesicular Stomatitis Virus RNA Polymerase

    PubMed Central

    Qiu, Shihong; Ogino, Minako; Luo, Ming

    2015-01-01

    ABSTRACT Viruses have various mechanisms to duplicate their genomes and produce virus-specific mRNAs. Negative-strand RNA viruses encode their own polymerases to perform each of these processes. For the nonsegmented negative-strand RNA viruses, the polymerase is comprised of the large polymerase subunit (L) and the phosphoprotein (P). L proteins from members of the Rhabdoviridae, Paramyxoviridae, and Filoviridae share sequence and predicted secondary structure homology. Here, we present the structure of the N-terminal domain (conserved region I) of the L protein from a rhabdovirus, vesicular stomatitis virus, at 1.8-Å resolution. The strictly and strongly conserved residues in this domain cluster in a single area of the protein. Serial mutation of these residues shows that many of the amino acids are essential for viral transcription but not for mRNA capping. Three-dimensional alignments show that this domain shares structural homology with polymerases from other viral families, including segmented negative-strand RNA and double-stranded RNA (dsRNA) viruses. IMPORTANCE Negative-strand RNA viruses include a diverse set of viral families that infect animals and plants, causing serious illness and economic impact. The members of this group of viruses share a set of functionally conserved proteins that are essential to their replication cycle. Among this set of proteins is the viral polymerase, which performs a unique set of reactions to produce genome- and subgenome-length RNA transcripts. In this article, we study the polymerase of vesicular stomatitis virus, a member of the rhabdoviruses, which has served in the past as a model to study negative-strand RNA virus replication. We have identified a site in the N-terminal domain of the polymerase that is essential to viral transcription and that shares sequence homology with members of the paramyxoviruses and the filoviruses. Newly identified sites such as that described here could prove to be useful targets in the design of new therapeutics against negative-strand RNA viruses. PMID:26512087

  6. Design, implementation and evaluation of a practical pseudoknot folding algorithm based on thermodynamics

    PubMed Central

    Reeder, Jens; Giegerich, Robert

    2004-01-01

    Background The general problem of RNA secondary structure prediction under the widely used thermodynamic model is known to be NP-complete when the structures considered include arbitrary pseudoknots. For restricted classes of pseudoknots, several polynomial time algorithms have been designed, where the O(n6)time and O(n4) space algorithm by Rivas and Eddy is currently the best available program. Results We introduce the class of canonical simple recursive pseudoknots and present an algorithm that requires O(n4) time and O(n2) space to predict the energetically optimal structure of an RNA sequence, possible containing such pseudoknots. Evaluation against a large collection of known pseudoknotted structures shows the adequacy of the canonization approach and our algorithm. Conclusions RNA pseudoknots of medium size can now be predicted reliably as well as efficiently by the new algorithm. PMID:15294028

  7. The crystal structure of Zika virus NS5 reveals conserved drug targets.

    PubMed

    Duan, Wenqian; Song, Hao; Wang, Haiyuan; Chai, Yan; Su, Chao; Qi, Jianxun; Shi, Yi; Gao, George F

    2017-04-03

    Zika virus (ZIKV) has emerged as major health concern, as ZIKV infection has been shown to be associated with microcephaly, severe neurological disease and possibly male sterility. As the largest protein component within the ZIKV replication complex, NS5 plays key roles in the life cycle and survival of the virus through its N-terminal methyltransferase (MTase) and C-terminal RNA-dependent RNA polymerase (RdRp) domains. Here, we present the crystal structures of ZIKV NS5 MTase in complex with an RNA cap analogue ( m7 GpppA) and the free NS5 RdRp. We have identified the conserved features of ZIKV NS5 MTase and RdRp structures that could lead to development of current antiviral inhibitors being used against flaviviruses, including dengue virus and West Nile virus, to treat ZIKV infection. These results should inform and accelerate the structure-based design of antiviral compounds against ZIKV. © 2017 The Authors.

  8. e23D: database and visualization of A-to-I RNA editing sites mapped to 3D protein structures.

    PubMed

    Solomon, Oz; Eyal, Eran; Amariglio, Ninette; Unger, Ron; Rechavi, Gidi

    2016-07-15

    e23D, a database of A-to-I RNA editing sites from human, mouse and fly mapped to evolutionary related protein 3D structures, is presented. Genomic coordinates of A-to-I RNA editing sites are converted to protein coordinates and mapped onto 3D structures from PDB or theoretical models from ModBase. e23D allows visualization of the protein structure, modeling of recoding events and orientation of the editing with respect to nearby genomic functional sites from databases of disease causing mutations and genomic polymorphism. http://www.sheba-cancer.org.il/e23D CONTACT: oz.solomon@live.biu.ac.il or Eran.Eyal@sheba.health.gov.il. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  9. Structural basis for activity of highly efficient RNA mimics of green fluorescent protein

    PubMed Central

    Warner, Katherine Deigan; Chen, Michael C.; Song, Wenjiao; Strack, Rita L.; Thorn, Andrea; Jaffrey, Samie R.; Ferré-D’Amaré, Adrian R.

    2014-01-01

    Green fluorescent protein (GFP) and its derivatives revolutionized the study of proteins. Spinach is a recently reported in vitro evolved RNA mimic of GFP, which as genetically encoded fusions, makes possible live-cell, real-time imaging of biological RNAs, without resorting to large RNA-binding protein-GFP fusions. To elucidate the molecular basis of Spinach fluorescence, we have solved its co-crystal structure bound to its cognate exogenous chromophore, revealing that Spinach activates the small molecule by immobilizing it between a base triple, a G-quadruplex, and an unpaired guanine. Mutational and NMR analyses indicate that the G-quadruplex is essential for Spinach fluorescence, is also present in other fluorogenic RNAs, and may represent a general strategy for RNAs to induce fluorescence of chromophores. The structure has guided the design of a miniaturized 'Baby Spinach', and provides the foundation for structure-driven design and tuning of fluorescent RNAs. PMID:25026079

  10. Structures of two aptamers with differing ligand specificity reveal ruggedness in the functional landscape of RNA

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Knappenberger, Andrew John; Reiss, Caroline Wetherington; Strobel, Scott A.

    Two classes of riboswitches related to the ykkC guanidine-I riboswitch bind phosphoribosyl pyrophosphate (PRPP) and guanosine tetraphosphate (ppGpp). Here we report the co-crystal structure of the PRPP aptamer and its ligand. We also report the structure of the G96A point mutant that prefers ppGpp over PRPP with a dramatic 40,000-fold switch in specificity. The ends of the aptamer form a helix that is not present in the guanidine aptamer and is involved in the expression platform. In the mutant, the base of ppGpp replaces G96 in three-dimensional space. This disrupts the S-turn, which is a primary structural feature of themore » ykkC RNA motif. These dramatic differences in ligand specificity are achieved with minimal mutations. ykkC aptamers are therefore a prime example of an RNA fold with a rugged fitness landscape. The ease with which the ykkC aptamer acquires new specificity represents a striking case of evolvability in RNA.« less

  11. Structures of two aptamers with differing ligand specificity reveal ruggedness in the functional landscape of RNA.

    PubMed

    Knappenberger, Andrew John; Reiss, Caroline Wetherington; Strobel, Scott A

    2018-06-07

    Two classes of riboswitches related to the ykkC guanidine-I riboswitch bind phosphoribosyl pyrophosphate (PRPP) and guanosine tetraphosphate (ppGpp). Here we report the co-crystal structure of the PRPP aptamer and its ligand. We also report the structure of the G96A point mutant that prefers ppGpp over PRPP with a dramatic 40,000-fold switch in specificity. The ends of the aptamer form a helix that is not present in the guanidine aptamer and is involved in the expression platform. In the mutant, the base of ppGpp replaces G96 in three-dimensional space. This disrupts the S-turn, which is a primary structural feature of the ykkC RNA motif. These dramatic differences in ligand specificity are achieved with minimal mutations. ykkC aptamers are therefore a prime example of an RNA fold with a rugged fitness landscape. The ease with which the ykkC aptamer acquires new specificity represents a striking case of evolvability in RNA. © 2018, Knappenberger et al.

  12. Brickworx builds recurrent RNA and DNA structural motifs into medium- and low-resolution electron-density maps

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chojnowski, Grzegorz, E-mail: gchojnowski@genesilico.pl; Waleń, Tomasz; University of Warsaw, Banacha 2, 02-097 Warsaw

    2015-03-01

    A computer program that builds crystal structure models of nucleic acid molecules is presented. Brickworx is a computer program that builds crystal structure models of nucleic acid molecules using recurrent motifs including double-stranded helices. In a first step, the program searches for electron-density peaks that may correspond to phosphate groups; it may also take into account phosphate-group positions provided by the user. Subsequently, comparing the three-dimensional patterns of the P atoms with a database of nucleic acid fragments, it finds the matching positions of the double-stranded helical motifs (A-RNA or B-DNA) in the unit cell. If the target structure ismore » RNA, the helical fragments are further extended with recurrent RNA motifs from a fragment library that contains single-stranded segments. Finally, the matched motifs are merged and refined in real space to find the most likely conformations, including a fit of the sequence to the electron-density map. The Brickworx program is available for download and as a web server at http://iimcb.genesilico.pl/brickworx.« less

  13. 5-Fluoro pyrimidines: labels to probe DNA and RNA secondary structures by 1D 19F NMR spectroscopy

    PubMed Central

    Puffer, Barbara; Kreutz, Christoph; Rieder, Ulrike; Ebert, Marc-Olivier; Konrat, Robert; Micura, Ronald

    2009-01-01

    19F NMR spectroscopy has proved to be a valuable tool to monitor functionally important conformational transitions of nucleic acids. Here, we present a systematic investigation on the application of 5-fluoro pyrimidines to probe DNA and RNA secondary structures. Oligonucleotides with the propensity to adapt secondary structure equilibria were chosen as model systems and analyzed by 1D 19F and 1H NMR spectroscopy. A comparison with the unmodified analogs revealed that the equilibrium characteristics of the bistable DNA and RNA oligonucleotides were hardly affected upon fluorine substitution at C5 of pyrimidines. This observation was in accordance with UV spectroscopic melting experiments which demonstrated that single 5-fluoro substitutions in double helices lead to comparable thermodynamic stabilities. Thus, 5-fluoro pyrimidine labeling of DNA and RNA can be reliably applied for NMR based nucleic acid secondary structure evaluation. Furthermore, we developed a facile synthetic route towards 5-fluoro cytidine phosphoramidites that enables their convenient site-specific incorporation into oligonucleotides by solid-phase synthesis. PMID:19843610

  14. 5-Fluoro pyrimidines: labels to probe DNA and RNA secondary structures by 1D 19F NMR spectroscopy.

    PubMed

    Puffer, Barbara; Kreutz, Christoph; Rieder, Ulrike; Ebert, Marc-Olivier; Konrat, Robert; Micura, Ronald

    2009-12-01

    (19)F NMR spectroscopy has proved to be a valuable tool to monitor functionally important conformational transitions of nucleic acids. Here, we present a systematic investigation on the application of 5-fluoro pyrimidines to probe DNA and RNA secondary structures. Oligonucleotides with the propensity to adapt secondary structure equilibria were chosen as model systems and analyzed by 1D (19)F and (1)H NMR spectroscopy. A comparison with the unmodified analogs revealed that the equilibrium characteristics of the bistable DNA and RNA oligonucleotides were hardly affected upon fluorine substitution at C5 of pyrimidines. This observation was in accordance with UV spectroscopic melting experiments which demonstrated that single 5-fluoro substitutions in double helices lead to comparable thermodynamic stabilities. Thus, 5-fluoro pyrimidine labeling of DNA and RNA can be reliably applied for NMR based nucleic acid secondary structure evaluation. Furthermore, we developed a facile synthetic route towards 5-fluoro cytidine phosphoramidites that enables their convenient site-specific incorporation into oligonucleotides by solid-phase synthesis.

  15. 5S rRNA and ribosome.

    PubMed

    Gongadze, G M

    2011-12-01

    5S rRNA is an integral component of the ribosome of all living organisms. It is known that the ribosome without 5S rRNA is functionally inactive. However, the question about the specific role of this RNA in functioning of the translation apparatus is still open. This review presents a brief history of the discovery of 5S rRNA and studies of its origin and localization in the ribosome. The previously expressed hypotheses about the role of this RNA in the functioning of the ribosome are discussed considering the unique location of 5S rRNA in the ribosome and its intermolecular contacts. Based on analysis of the current data on ribosome structure and its functional complexes, the role of 5S rRNA as an intermediary between ribosome functional domains is discussed.

  16. RiboSketch: Versatile Visualization of Multi-stranded RNA and DNA Secondary Structure.

    PubMed

    Lu, Jacob S; Bindewald, Eckart; Kasprzak, Wojciech; Shapiro, Bruce A

    2018-06-15

    Creating clear, visually pleasing 2D depictions of RNA and DNA strands and their interactions is important to facilitate and communicate insights related to nucleic acid structure. Here we present RiboSketch, a secondary structure image production application that enables the visualization of multistranded structures via layout algorithms, comprehensive editing capabilities, and a multitude of simulation modes. These interactive features allow RiboSketch to create publication quality diagrams for structures with a wide range of composition, size, and complexity. The program may be run in any web browser without the need for installation, or as a standalone Java application. https://binkley2.ncifcrf.gov/users/bindewae/ribosketch_web.

  17. Efficient pairwise RNA structure prediction using probabilistic alignment constraints in Dynalign

    PubMed Central

    2007-01-01

    Background Joint alignment and secondary structure prediction of two RNA sequences can significantly improve the accuracy of the structural predictions. Methods addressing this problem, however, are forced to employ constraints that reduce computation by restricting the alignments and/or structures (i.e. folds) that are permissible. In this paper, a new methodology is presented for the purpose of establishing alignment constraints based on nucleotide alignment and insertion posterior probabilities. Using a hidden Markov model, posterior probabilities of alignment and insertion are computed for all possible pairings of nucleotide positions from the two sequences. These alignment and insertion posterior probabilities are additively combined to obtain probabilities of co-incidence for nucleotide position pairs. A suitable alignment constraint is obtained by thresholding the co-incidence probabilities. The constraint is integrated with Dynalign, a free energy minimization algorithm for joint alignment and secondary structure prediction. The resulting method is benchmarked against the previous version of Dynalign and against other programs for pairwise RNA structure prediction. Results The proposed technique eliminates manual parameter selection in Dynalign and provides significant computational time savings in comparison to prior constraints in Dynalign while simultaneously providing a small improvement in the structural prediction accuracy. Savings are also realized in memory. In experiments over a 5S RNA dataset with average sequence length of approximately 120 nucleotides, the method reduces computation by a factor of 2. The method performs favorably in comparison to other programs for pairwise RNA structure prediction: yielding better accuracy, on average, and requiring significantly lesser computational resources. Conclusion Probabilistic analysis can be utilized in order to automate the determination of alignment constraints for pairwise RNA structure prediction methods in a principled fashion. These constraints can reduce the computational and memory requirements of these methods while maintaining or improving their accuracy of structural prediction. This extends the practical reach of these methods to longer length sequences. The revised Dynalign code is freely available for download. PMID:17445273

  18. Influenza Polymerase Can Adopt an Alternative Configuration Involving a Radical Repacking of PB2 Domains.

    PubMed

    Thierry, Eric; Guilligay, Delphine; Kosinski, Jan; Bock, Thomas; Gaudon, Stephanie; Round, Adam; Pflug, Alexander; Hengrung, Narin; El Omari, Kamel; Baudin, Florence; Hart, Darren J; Beck, Martin; Cusack, Stephen

    2016-01-07

    Influenza virus polymerase transcribes or replicates the segmented RNA genome (vRNA) into respectively viral mRNA or full-length copies and initiates RNA synthesis by binding the conserved 3' and 5' vRNA ends (the promoter). In recent structures of promoter-bound polymerase, the cap-binding and endonuclease domains are configured for cap snatching, which generates capped transcription primers. Here, we present a FluB polymerase structure with a bound complementary cRNA 5' end that exhibits a major rearrangement of the subdomains within the C-terminal two-thirds of PB2 (PB2-C). Notably, the PB2 nuclear localization signal (NLS)-containing domain translocates ∼90 Å to bind to the endonuclease domain. FluA PB2-C alone and RNA-free FluC polymerase are similarly arranged. Biophysical and cap-dependent endonuclease assays show that in solution the polymerase explores different conformational distributions depending on which RNA is bound. The inherent flexibility of the polymerase allows it to adopt alternative conformations that are likely important during polymerase maturation into active progeny RNPs. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.

  19. Cooperative mechanism of RNA packaging motor.

    PubMed

    Lísal, Jirí; Tuma, Roman

    2005-06-17

    P4 is a hexameric ATPase that serves as the RNA packaging motor in double-stranded RNA bacteriophages from the Cystoviridae family. P4 shares sequence and structural similarities with hexameric helicases. A structure-based mechanism for mechano-chemical coupling has recently been proposed for P4 from bacteriophage phi12. However, coordination of ATP hydrolysis among the subunits and coupling with RNA translocation remains elusive. Here we present detailed kinetic study of nucleotide binding, hydrolysis, and product release by phi12 P4 in the presence of different RNA and DNA substrates. Whereas binding affinities for ATP and ADP are not affected by RNA binding, the hydrolysis step is accelerated and the apparent cooperativity is increased. No nucleotide binding cooperativity is observed. We propose a stochastic-sequential cooperativity model to describe the coordination of ATP hydrolysis within the hexamer. In this model the apparent cooperativity is a result of hydrolysis stimulation by ATP and RNA binding to neighboring subunits rather than cooperative nucleotide binding. The translocation step appears coupled to hydrolysis, which is coordinated among three neighboring subunits. Simultaneous interaction of neighboring subunits with RNA makes the otherwise random hydrolysis sequential and processive.

  20. Mg2+ Effect on Argonaute and RNA Duplex by Molecular Dynamics and Bioinformatics Implications

    PubMed Central

    Nam, Seungyoon; Ryu, Hyojung; Son, Won-joon; Kim, Yon Hui; Kim, Kyung Tae; Balch, Curt; Nephew, Kenneth P.; Lee, Jinhyuk

    2014-01-01

    RNA interference (RNAi), mediated by small non-coding RNAs (e.g., miRNAs, siRNAs), influences diverse cellular functions. Highly complementary miRNA-target RNA (or siRNA-target RNA) duplexes are recognized by an Argonaute family protein (Ago2), and recent observations indicate that the concentration of Mg2+ ions influences miRNA targeting of specific mRNAs, thereby modulating miRNA-mRNA networks. In the present report, we studied the thermodynamic effects of differential [Mg2+] on slicing (RNA silencing cycle) through molecular dynamics simulation analysis, and its subsequent statistical analysis. Those analyses revealed different structural conformations of the RNA duplex in Ago2, depending on Mg2+ concentration. We also demonstrate that cation effects on Ago2 structural flexibility are critical to its catalytic/functional activity, with low [Mg2+] favoring greater Ago2 flexibility (e.g., greater entropy) and less miRNA/mRNA duplex stability, thus favoring slicing. The latter finding was supported by a negative correlation between expression of an Mg2+ influx channel, TRPM7, and one miRNA’s (miR-378) ability to downregulate its mRNA target, TMEM245. These results imply that thermodynamics could be applied to siRNA-based therapeutic strategies, using highly complementary binding targets, because Ago2 is also involved in RNAi slicing by exogenous siRNAs. However, the efficacy of a siRNA-based approach will differ, to some extent, based on the Mg2+ concentration even within the same disease type; therefore, different siRNA-based approaches might be considered for patient-to-patient needs. PMID:25330448

  1. Domain organization and crystal structure of the catalytic domain of E.coli RluF, a pseudouridine synthase that acts on 23S rRNA

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sunita,S.; Zhenxing, H.; Swaathi, J.

    2006-01-01

    Pseudouridine synthases catalyze the isomerization of uridine to pseudouridine ({psi}) in rRNA and tRNA. The pseudouridine synthase RluF from Escherichia coli (E.C. 4.2.1.70) modifies U2604 in 23S rRNA, and belongs to a large family of pseudouridine synthases present in all kingdoms of life. Here we report the domain architecture and crystal structure of the catalytic domain of E. coli RluF at 2.6 Angstroms resolution. Limited proteolysis, mass spectrometry and N-terminal sequencing indicate that RluF has a distinct domain architecture, with the catalytic domain flanked at the N and C termini by additional domains connected to it by flexible linkers. Themore » structure of the catalytic domain of RluF is similar to those of RsuA and TruB. RluF is a member of the RsuA sequence family of {psi}-synthases, along with RluB and RluE. Structural comparison of RluF with its closest structural homologues, RsuA and TruB, suggests possible functional roles for the N-terminal and C-terminal domains of RluF.« less

  2. A structural-alphabet-based strategy for finding structural motifs across protein families

    PubMed Central

    Wu, Chih Yuan; Chen, Yao Chi; Lim, Carmay

    2010-01-01

    Proteins with insignificant sequence and overall structure similarity may still share locally conserved contiguous structural segments; i.e. structural/3D motifs. Most methods for finding 3D motifs require a known motif to search for other similar structures or functionally/structurally crucial residues. Here, without requiring a query motif or essential residues, a fully automated method for discovering 3D motifs of various sizes across protein families with different folds based on a 16-letter structural alphabet is presented. It was applied to structurally non-redundant proteins bound to DNA, RNA, obligate/non-obligate proteins as well as free DNA-binding proteins (DBPs) and proteins with known structures but unknown function. Its usefulness was illustrated by analyzing the 3D motifs found in DBPs. A non-specific motif was found with a ‘corner’ architecture that confers a stable scaffold and enables diverse interactions, making it suitable for binding not only DNA but also RNA and proteins. Furthermore, DNA-specific motifs present ‘only’ in DBPs were discovered. The motifs found can provide useful guidelines in detecting binding sites and computational protein redesign. PMID:20525797

  3. Recent structural studies on Dom34/aPelota and Hbs1/aEF1α: important factors for solving general problems of ribosomal stall in translation

    PubMed Central

    Kobayashi, Kan; Ishitani, Ryuichiro; Nureki, Osamu

    2013-01-01

    In the translation process, translating ribosomes usually move on an mRNA until they reach the stop codon. However, when ribosomes translate an aberrant mRNA, they stall. Then, ribosomes are rescued from the aberrant mRNA, and the aberrant mRNA is subsequently degraded. In eukaryotes, Pelota (Dom34 in yeast) and Hbs1 are responsible for solving general problems of ribosomal stall in translation. In archaea, aPelota and aEF1α, homologous to Pelota and Hbs1, respectively, are considered to be involved in that process. In recent years, great progress has been made in determining structures of Dom34/aPelota and Hbs1/aEF1α. In this review, we focus on the functional roles of Dom34/aPelota and Hbs1/aEF1α in ribosome rescue, based on recent structural studies of them. We will also present questions to be answered by future work. PMID:27493551

  4. Ab initio reconstruction of transcriptomes of pluripotent and lineage committed cells reveals gene structures of thousands of lincRNAs

    PubMed Central

    Guttman, Mitchell; Garber, Manuel; Levin, Joshua Z.; Donaghey, Julie; Robinson, James; Adiconis, Xian; Fan, Lin; Koziol, Magdalena J.; Gnirke, Andreas; Nusbaum, Chad; Rinn, John L.; Lander, Eric S.; Regev, Aviv

    2010-01-01

    RNA-Seq provides an unbiased way to study a transcriptome, including both coding and non-coding genes. To date, most RNA-Seq studies have critically depended on existing annotations, and thus focused on expression levels and variation in known transcripts. Here, we present Scripture, a method to reconstruct the transcriptome of a mammalian cell using only RNA-Seq reads and the genome sequence. We apply it to mouse embryonic stem cells, neuronal precursor cells, and lung fibroblasts to accurately reconstruct the full-length gene structures for the vast majority of known expressed genes. We identify substantial variation in protein-coding genes, including thousands of novel 5′-start sites, 3′-ends, and internal coding exons. We then determine the gene structures of over a thousand lincRNA and antisense loci. Our results open the way to direct experimental manipulation of thousands of non-coding RNAs, and demonstrate the power of ab initio reconstruction to render a comprehensive picture of mammalian transcriptomes. PMID:20436462

  5. Computational study of RNA folding kinetics and thermodynamics

    NASA Astrophysics Data System (ADS)

    Morgan, Steven Robert

    RNA in its many forms is involved in the processes of protein manufacture, gene splicing, catalysis and gene regulation. It is also the store of genetic information in some viruses. The function of the RNA is determined by its structure, and it is the purpose of this thesis to investigate kinetic and thermodynamic properties of RNA secondary structures in order to obtain a better understanding of their formation and function. Our main tenet is that kinetic formation of RNA structure is necessary to explain features found in natural RNA structures, as well as aspects of the biological function of RNA. Firstly we show that examination of the energies of fragments of RNA secondary structure provides evidence for kinetic formation of structure. Local regions of RNA of length less than about 100 nucleotides adopt a conformation with energy near or equal to the minimum possible for those regions, whilst the energies of larger domains are much further from the their respective minima. This is consistent with the patterns that would be expected if RNA structure is folded Idneticatic during transcription. A Monte-Carlo algorithm is then used to model the kinetic folding of RNA during transcriptional growth. The algorithm is capable of finding the correct structure of a natural RNA for which the minimum free energy approach is unsuccessful. In the viral phage MS2 Idneticatic formed RNA structure plays an important role in the regulation of gene expression. The folding algorithm can accurately model this by IdneticaUy controlling access to the gene initiation region. The algorithm is also successfully used to model the control of replication in the ColEl plasmid. Taking a different approach, we then use a simplified model of RNA secondary structure to investigate the size of energy barriers between degenerate minimum energy structures. This model has much in common with physical systems such as spin glasses, and in fact shows similar behaviour to these systems in that energy barriers between structures grow quickly with the length of the RNA sequence. These barriers will serve to trap RNA in non-optimal structures. Together these studies demonstrate the necessity of studying RNA secondary structure from a kinetic point of view, and provide clear directions in which further work may be taken. Kinetic models of RNA secondary structure should continue to prove useful in modelling the structure and function of RNA.

  6. Dendrimer Nanovectors for SiRNA Delivery.

    PubMed

    Liu, Xiaoxuan; Peng, Ling

    2016-01-01

    Small interfering RNA (SiRNA) delivery remains a major challenge in RNAi-based therapy. Dendrimers are emerging as appealing nonviral vectors for SiRNA delivery thanks to their well-defined architecture and their unique cooperativity and multivalency confined within a nanostructure. We have recently demonstrated that structurally flexible poly(amidoamine) (PAMAM) dendrimers are safe and effective nanovectors for SiRNA delivery in various disease models in vitro and in vivo. The present chapter showcases these dendrimers can package different SiRNA molecules into stable and nanosized particles, which protect SiRNA from degradation and promote cellular uptake of SiRNA, resulting in potent gene silencing at both mRNA and protein level in the prostate cancer cell model. Our results demonstrate this set of flexible PAMAM dendrimers are indeed safe and effective nonviral vectors for SiRNA delivery and hold great promise for further applications in functional genomics and RNAi-based therapies.

  7. Characteristics of the nuclear (18S, 5.8S, 28S and 5S) and mitochondrial (12S and 16S) rRNA genes of Apis mellifera (Insecta: Hymenoptera): structure, organization, and retrotransposable elements

    PubMed Central

    Gillespie, J J; Johnston, J S; Cannone, J J; Gutell, R R

    2006-01-01

    As an accompanying manuscript to the release of the honey bee genome, we report the entire sequence of the nuclear (18S, 5.8S, 28S and 5S) and mitochondrial (12S and 16S) ribosomal RNA (rRNA)-encoding gene sequences (rDNA) and related internally and externally transcribed spacer regions of Apis mellifera (Insecta: Hymenoptera: Apocrita). Additionally, we predict secondary structures for the mature rRNA molecules based on comparative sequence analyses with other arthropod taxa and reference to recently published crystal structures of the ribosome. In general, the structures of honey bee rRNAs are in agreement with previously predicted rRNA models from other arthropods in core regions of the rRNA, with little additional expansion in non-conserved regions. Our multiple sequence alignments are made available on several public databases and provide a preliminary establishment of a global structural model of all rRNAs from the insects. Additionally, we provide conserved stretches of sequences flanking the rDNA cistrons that comprise the externally transcribed spacer regions (ETS) and part of the intergenic spacer region (IGS), including several repetitive motifs. Finally, we report the occurrence of retrotransposition in the nuclear large subunit rDNA, as R2 elements are present in the usual insertion points found in other arthropods. Interestingly, functional R1 elements usually present in the genomes of insects were not detected in the honey bee rRNA genes. The reverse transcriptase products of the R2 elements are deduced from their putative open reading frames and structurally aligned with those from another hymenopteran insect, the jewel wasp Nasonia (Pteromalidae). Stretches of conserved amino acids shared between Apis and Nasonia are illustrated and serve as potential sites for primer design, as target amplicons within these R2 elements may serve as novel phylogenetic markers for Hymenoptera. Given the impending completion of the sequencing of the Nasonia genome, we expect our report eventually to shed light on the evolution of the hymenopteran genome within higher insects, particularly regarding the relative maintenance of conserved rDNA genes, related variable spacer regions and retrotransposable elements. PMID:17069639

  8. An efficient method for the prediction of deleterious multiple-point mutations in the secondary structure of RNAs using suboptimal folding solutions

    PubMed Central

    Churkin, Alexander; Barash, Danny

    2008-01-01

    Background RNAmute is an interactive Java application which, given an RNA sequence, calculates the secondary structure of all single point mutations and organizes them into categories according to their similarity to the predicted structure of the wild type. The secondary structure predictions are performed using the Vienna RNA package. A more efficient implementation of RNAmute is needed, however, to extend from the case of single point mutations to the general case of multiple point mutations, which may often be desired for computational predictions alongside mutagenesis experiments. But analyzing multiple point mutations, a process that requires traversing all possible mutations, becomes highly expensive since the running time is O(nm) for a sequence of length n with m-point mutations. Using Vienna's RNAsubopt, we present a method that selects only those mutations, based on stability considerations, which are likely to be conformational rearranging. The approach is best examined using the dot plot representation for RNA secondary structure. Results Using RNAsubopt, the suboptimal solutions for a given wild-type sequence are calculated once. Then, specific mutations are selected that are most likely to cause a conformational rearrangement. For an RNA sequence of about 100 nts and 3-point mutations (n = 100, m = 3), for example, the proposed method reduces the running time from several hours or even days to several minutes, thus enabling the practical application of RNAmute to the analysis of multiple-point mutations. Conclusion A highly efficient addition to RNAmute that is as user friendly as the original application but that facilitates the practical analysis of multiple-point mutations is presented. Such an extension can now be exploited prior to site-directed mutagenesis experiments by virologists, for example, who investigate the change of function in an RNA virus via mutations that disrupt important motifs in its secondary structure. A complete explanation of the application, called MultiRNAmute, is available at [1]. PMID:18445289

  9. Cryo-EM structure of Hepatitis C virus IRES bound to the human ribosome at 3.9-Å resolution

    NASA Astrophysics Data System (ADS)

    Quade, Nick; Boehringer, Daniel; Leibundgut, Marc; van den Heuvel, Joop; Ban, Nenad

    2015-07-01

    Hepatitis C virus (HCV), a widespread human pathogen, is dependent on a highly structured 5'-untranslated region of its mRNA, referred to as internal ribosome entry site (IRES), for the translation of all of its proteins. The HCV IRES initiates translation by directly binding to the small ribosomal subunit (40S), circumventing the need for many eukaryotic translation initiation factors required for mRNA scanning. Here we present the cryo-EM structure of the human 40S ribosomal subunit in complex with the HCV IRES at 3.9 Å resolution, determined by focused refinement of an 80S ribosome-HCV IRES complex. The structure reveals the molecular details of the interactions between the IRES and the 40S, showing that expansion segment 7 (ES7) of the 18S rRNA acts as a central anchor point for the HCV IRES. The structural data rationalizes previous biochemical and genetic evidence regarding the initiation mechanism of the HCV and other related IRESs.

  10. An integrated, structure- and energy-based view of the genetic code.

    PubMed

    Grosjean, Henri; Westhof, Eric

    2016-09-30

    The principles of mRNA decoding are conserved among all extant life forms. We present an integrative view of all the interaction networks between mRNA, tRNA and rRNA: the intrinsic stability of codon-anticodon duplex, the conformation of the anticodon hairpin, the presence of modified nucleotides, the occurrence of non-Watson-Crick pairs in the codon-anticodon helix and the interactions with bases of rRNA at the A-site decoding site. We derive a more information-rich, alternative representation of the genetic code, that is circular with an unsymmetrical distribution of codons leading to a clear segregation between GC-rich 4-codon boxes and AU-rich 2:2-codon and 3:1-codon boxes. All tRNA sequence variations can be visualized, within an internal structural and energy framework, for each organism, and each anticodon of the sense codons. The multiplicity and complexity of nucleotide modifications at positions 34 and 37 of the anticodon loop segregate meaningfully, and correlate well with the necessity to stabilize AU-rich codon-anticodon pairs and to avoid miscoding in split codon boxes. The evolution and expansion of the genetic code is viewed as being originally based on GC content with progressive introduction of A/U together with tRNA modifications. The representation we present should help the engineering of the genetic code to include non-natural amino acids. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  11. Visualization of RNA structure models within the Integrative Genomics Viewer.

    PubMed

    Busan, Steven; Weeks, Kevin M

    2017-07-01

    Analyses of the interrelationships between RNA structure and function are increasingly important components of genomic studies. The SHAPE-MaP strategy enables accurate RNA structure probing and realistic structure modeling of kilobase-length noncoding RNAs and mRNAs. Existing tools for visualizing RNA structure models are not suitable for efficient analysis of long, structurally heterogeneous RNAs. In addition, structure models are often advantageously interpreted in the context of other experimental data and gene annotation information, for which few tools currently exist. We have developed a module within the widely used and well supported open-source Integrative Genomics Viewer (IGV) that allows visualization of SHAPE and other chemical probing data, including raw reactivities, data-driven structural entropies, and data-constrained base-pair secondary structure models, in context with linear genomic data tracks. We illustrate the usefulness of visualizing RNA structure in the IGV by exploring structure models for a large viral RNA genome, comparing bacterial mRNA structure in cells with its structure under cell- and protein-free conditions, and comparing a noncoding RNA structure modeled using SHAPE data with a base-pairing model inferred through sequence covariation analysis. © 2017 Busan and Weeks; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  12. Base pairing and structural insights into the 5-formylcytosine in RNA duplex

    PubMed Central

    Wang, Rui; Luo, Zhipu; He, Kaizhang; Delaney, Michael O.; Chen, Doris; Sheng, Jia

    2016-01-01

    Abstract 5-Formylcytidine (f5C), a previously discovered natural nucleotide in the mitochondrial tRNA of many species including human, has been recently detected as the oxidative product of 5-methylcytidine (m5C) through 5-hydroxymethylcytidine (hm5C) in total RNA of mammalian cells. The discovery indicated that these cytosine derivatives in RNA might also play important epigenetic roles similar as in DNA, which has been intensively investigated in the past few years. In this paper, we studied the base pairing specificity of f5C in different RNA duplex contexts. We found that the 5-formyl group could increase duplex thermal stability and enhance base pairing specificity. We present three high-resolution crystal structures of an octamer RNA duplex [5′-GUA(f5C)GUAC-3′]2 that have been solved under three crystallization conditions with different buffers and pH values. Our results showed that the 5-formyl group is located in the same plane as the cytosine base and forms an intra-residue hydrogen bond with the amino group in the N4 position. In addition, this modification increases the base stacking between the f5C and the neighboring bases while not causing significant global and local structure perturbations. This work provides insights into the effects of 5-formylcytosine on RNA duplex. PMID:27079978

  13. Dithiothreitol (DTT) Acts as a Specific, UV-inducible Cross-linker in Elucidation of Protein–RNA Interactions*

    PubMed Central

    Zaman, Uzma; Richter, Florian M.; Hofele, Romina; Kramer, Katharina; Sachsenberg, Timo; Kohlbacher, Oliver; Lenz, Christof; Urlaub, Henning

    2015-01-01

    Protein–RNA cross-linking by UV irradiation at 254 nm wavelength has been established as an unbiased method to identify proteins in direct contact with RNA, and has been successfully applied to investigate the spatial arrangement of protein and RNA in large macromolecular assemblies, e.g. ribonucleoprotein-complex particles (RNPs). The mass spectrometric analysis of such peptide-RNA cross-links provides high resolution structural data to the point of mapping protein–RNA interactions to specific peptides or even amino acids. However, the approach suffers from the low yield of cross-linking products, which can be addressed by improving enrichment and analysis methods. In the present article, we introduce dithiothreitol (DTT) as a potent protein–RNA cross-linker. In order to evaluate the efficiency and specificity of DTT, we used two systems, a small synthetic peptide from smB protein incubated with U1 snRNA oligonucleotide and native ribonucleoprotein complexes from S. cerevisiae. Our results unambiguously show that DTT covalently participates in cysteine-uracil crosslinks, which is observable as a mass increment of 151.9966 Da (C4H8S2O2) upon mass spectrometric analysis. DTT presents advantages for cross-linking of cysteine containing regions of proteins. This is evidenced by comparison to experiments where (tris(2-carboxyethyl)phosphine) is used as reducing agent, and significantly less cross-links encompassing cysteine residues are found. We further propose insertion of DTT between the cysteine and uracil reactive sites as the most probable structure of the cross-linking products. PMID:26450613

  14. Posttranscriptional modification of tRNA in thermophilic archaea (Archaebacteria).

    PubMed Central

    Edmonds, C G; Crain, P F; Gupta, R; Hashizume, T; Hocart, C H; Kowalak, J A; Pomerantz, S C; Stetter, K O; McCloskey, J A

    1991-01-01

    Nucleoside modification has been studied in unfractionated tRNA from 11 thermophilic archaea (archaebacteria), including phylogenetically diverse representatives of thermophilic methanogens and sulfur-metabolizing hyperthermophiles which grow optimally in the temperature range of 56 (Thermoplasma acidophilum) to 105 degrees C (Pyrodictium occultum), and for comparison from the most thermophilic bacterium (eubacterium) known, Thermotoga maritima (80 degrees C). Nine nucleosides are found to be unique to the archaea, six of which are structurally novel in being modified both in the base and by methylation in ribose and occur primarily in tRNA from the extreme thermophiles in the Crenarchaeota of the archaeal phylogenetic tree. 2-Thiothymine occurs in tRNA from Thermococcus sp., and constitutes the only known occurrence of the thymine moiety in archaeal RNA, in contrast to its near-ubiquitous presence in tRNA from bacteria and eukarya. A total of 33 modified nucleosides are rigorously characterized in archaeal tRNA in the present study, demonstrating that the structural range of posttranscriptional modifications in archaeal tRNA is more extensive than previously known. From a phylogenetic standpoint, certain tRNA modifications occur in the archaea which are otherwise unique to either the bacterial or eukaryal domain, although the overall patterns of modification are more typical of eukaryotes than bacteria. PMID:1708763

  15. Structure of a novel antibacterial toxin that exploits elongation factor Tu to cleave specific transfer RNAs

    DOE PAGES

    Michalska, Karolina; Gucinski, Grant C.; Garza-Sanchez, Fernando; ...

    2017-08-11

    Contact-dependent growth inhibition (CDI) is a mechanism of inter-cellular competition in which Gram-negative bacteria exchange polymorphic toxins using type V secretion systems. Here, we present structures of the CDI toxin from Escherichia coli NC101 in ternary complex with its cognate immunity protein and elongation factor Tu (EF-Tu). The toxin binds exclusively to domain 2 of EF-Tu, partially overlapping the site that interacts with the 3'-end of aminoacyl-tRNA (aa-tRNA). The toxin exerts a unique ribonuclease activity that cleaves the single-stranded 3'-end from tRNAs that contain guanine discriminator nucleotides. EF-Tu is required to support this tRNase activity in vitro, suggesting the toxinmore » specifically cleaves substrate in the context of GTP·EF-Tu·aa-tRNA complexes. However, superimposition of the toxin domain onto previously solved GTP·EF-Tu·aa-tRNA structures reveals potential steric clashes with both aa-tRNA and the switch I region of EF-Tu. Further, the toxin induces conformational changes in EF-Tu, displacing a β-hairpin loop that forms a critical salt-bridge contact with the 3'-terminal adenylate of aa-tRNA. Altogether, these observations suggest that the toxin remodels GTP·EF-Tu·aa-tRNA complexes to free the 3'-end of aa-tRNA for entry into the nuclease active site.« less

  16. Structure of a novel antibacterial toxin that exploits elongation factor Tu to cleave specific transfer RNAs

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Michalska, Karolina; Gucinski, Grant C.; Garza-Sanchez, Fernando

    Contact-dependent growth inhibition (CDI) is a mechanism of inter-cellular competition in which Gram-negative bacteria exchange polymorphic toxins using type V secretion systems. Here, we present structures of the CDI toxin from Escherichia coli NC101 in ternary complex with its cognate immunity protein and elongation factor Tu (EF-Tu). The toxin binds exclusively to domain 2 of EF-Tu, partially overlapping the site that interacts with the 3'-end of aminoacyl-tRNA (aa-tRNA). The toxin exerts a unique ribonuclease activity that cleaves the single-stranded 3'-end from tRNAs that contain guanine discriminator nucleotides. EF-Tu is required to support this tRNase activity in vitro, suggesting the toxinmore » specifically cleaves substrate in the context of GTP·EF-Tu·aa-tRNA complexes. However, superimposition of the toxin domain onto previously solved GTP·EF-Tu·aa-tRNA structures reveals potential steric clashes with both aa-tRNA and the switch I region of EF-Tu. Further, the toxin induces conformational changes in EF-Tu, displacing a β-hairpin loop that forms a critical salt-bridge contact with the 3'-terminal adenylate of aa-tRNA. Altogether, these observations suggest that the toxin remodels GTP·EF-Tu·aa-tRNA complexes to free the 3'-end of aa-tRNA for entry into the nuclease active site.« less

  17. RNA 3D Structural Motifs: Definition, Identification, Annotation, and Database Searching

    NASA Astrophysics Data System (ADS)

    Nasalean, Lorena; Stombaugh, Jesse; Zirbel, Craig L.; Leontis, Neocles B.

    Structured RNA molecules resemble proteins in the hierarchical organization of their global structures, folding and broad range of functions. Structured RNAs are composed of recurrent modular motifs that play specific functional roles. Some motifs direct the folding of the RNA or stabilize the folded structure through tertiary interactions. Others bind ligands or proteins or catalyze chemical reactions. Therefore, it is desirable, starting from the RNA sequence, to be able to predict the locations of recurrent motifs in RNA molecules. Conversely, the potential occurrence of one or more known 3D RNA motifs may indicate that a genomic sequence codes for a structured RNA molecule. To identify known RNA structural motifs in new RNA sequences, precise structure-based definitions are needed that specify the core nucleotides of each motif and their conserved interactions. By comparing instances of each recurrent motif and applying base pair isosteriCity relations, one can identify neutral mutations that preserve its structure and function in the contexts in which it occurs.

  18. Improve the prediction of RNA-binding residues using structural neighbours.

    PubMed

    Li, Quan; Cao, Zanxia; Liu, Haiyan

    2010-03-01

    The interactions between RNA-binding proteins (RBPs) with RNA play key roles in managing some of the cell's basic functions. The identification and prediction of RNA binding sites is important for understanding the RNA-binding mechanism. Computational approaches are being developed to predict RNA-binding residues based on the sequence- or structure-derived features. To achieve higher prediction accuracy, improvements on current prediction methods are necessary. We identified that the structural neighbors of RNA-binding and non-RNA-binding residues have different amino acid compositions. Combining this structure-derived feature with evolutionary (PSSM) and other structural information (secondary structure and solvent accessibility) significantly improves the predictions over existing methods. Using a multiple linear regression approach and 6-fold cross validation, our best model can achieve an overall correct rate of 87.8% and MCC of 0.47, with a specificity of 93.4%, correctly predict 52.4% of the RNA-binding residues for a dataset containing 107 non-homologous RNA-binding proteins. Compared with existing methods, including the amino acid compositions of structure neighbors lead to clearly improvement. A web server was developed for predicting RNA binding residues in a protein sequence (or structure),which is available at http://mcgill.3322.org/RNA/.

  19. Regulatory effects of cotranscriptional RNA structure formation and transitions.

    PubMed

    Liu, Sheng-Rui; Hu, Chun-Gen; Zhang, Jin-Zhi

    2016-09-01

    RNAs, which play significant roles in many fundamental biological processes of life, fold into sophisticated and precise structures. RNA folding is a dynamic and intricate process, which conformation transition of coding and noncoding RNAs form the primary elements of genetic regulation. The cellular environment contains various intrinsic and extrinsic factors that potentially affect RNA folding in vivo, and experimental and theoretical evidence increasingly indicates that the highly flexible features of the RNA structure are affected by these factors, which include the flanking sequence context, physiochemical conditions, cis RNA-RNA interactions, and RNA interactions with other molecules. Furthermore, distinct RNA structures have been identified that govern almost all steps of biological processes in cells, including transcriptional activation and termination, transcriptional mutagenesis, 5'-capping, splicing, 3'-polyadenylation, mRNA export and localization, and translation. Here, we briefly summarize the dynamic and complex features of RNA folding along with a wide variety of intrinsic and extrinsic factors that affect RNA folding. We then provide several examples to elaborate RNA structure-mediated regulation at the transcriptional and posttranscriptional levels. Finally, we illustrate the regulatory roles of RNA structure and discuss advances pertaining to RNA structure in plants. WIREs RNA 2016, 7:562-574. doi: 10.1002/wrna.1350 For further resources related to this article, please visit the WIREs website. © 2016 Wiley Periodicals, Inc.

  20. Probing Xist RNA Structure in Cells Using Targeted Structure-Seq

    PubMed Central

    Rutenberg-Schoenberg, Michael; Simon, Matthew D.

    2015-01-01

    The long non-coding RNA (lncRNA) Xist is a master regulator of X-chromosome inactivation in mammalian cells. Models for how Xist and other lncRNAs function depend on thermodynamically stable secondary and higher-order structures that RNAs can form in the context of a cell. Probing accessible RNA bases can provide data to build models of RNA conformation that provide insight into RNA function, molecular evolution, and modularity. To study the structure of Xist in cells, we built upon recent advances in RNA secondary structure mapping and modeling to develop Targeted Structure-Seq, which combines chemical probing of RNA structure in cells with target-specific massively parallel sequencing. By enriching for signals from the RNA of interest, Targeted Structure-Seq achieves high coverage of the target RNA with relatively few sequencing reads, thus providing a targeted and scalable approach to analyze RNA conformation in cells. We use this approach to probe the full-length Xist lncRNA to develop new models for functional elements within Xist, including the repeat A element in the 5’-end of Xist. This analysis also identified new structural elements in Xist that are evolutionarily conserved, including a new element proximal to the C repeats that is important for Xist function. PMID:26646615

  1. Desthiobiotin-Streptavidin-Affinity Mediated Purification of RNA-Interacting Proteins in Mesothelioma Cells.

    PubMed

    Kresoja-Rakic, Jelena; Felley-Bosco, Emanuela

    2018-04-25

    The in vitro RNA-pulldown is still largely used in the first steps of protocols aimed at identifying RNA-binding proteins that recognize specific RNA structures and motifs. In this RNA-pulldown protocol, commercially synthesized RNA probes are labeled with a modified form of biotin, desthiobiotin, at the 3' terminus of the RNA strand, which reversibly binds to streptavidin and thus allows elution of proteins under more physiological conditions. The RNA-desthiobiotin is immobilized through interaction with streptavidin on magnetic beads, which are used to pull down proteins that specifically interact with the RNA of interest. Non-denatured and active proteins from the cytosolic fraction of mesothelioma cells are used as the source of proteins. The method described here can be applied to detect the interaction between known RNA binding proteins and a 25-nucleotide (nt) long RNA probe containing a sequence of interest. This is useful to complete the functional characterization of stabilizing or destabilizing elements present in RNA molecules achieved using a reporter vector assay.

  2. RNA-TVcurve: a Web server for RNA secondary structure comparison based on a multi-scale similarity of its triple vector curve representation.

    PubMed

    Li, Ying; Shi, Xiaohu; Liang, Yanchun; Xie, Juan; Zhang, Yu; Ma, Qin

    2017-01-21

    RNAs have been found to carry diverse functionalities in nature. Inferring the similarity between two given RNAs is a fundamental step to understand and interpret their functional relationship. The majority of functional RNAs show conserved secondary structures, rather than sequence conservation. Those algorithms relying on sequence-based features usually have limitations in their prediction performance. Hence, integrating RNA structure features is very critical for RNA analysis. Existing algorithms mainly fall into two categories: alignment-based and alignment-free. The alignment-free algorithms of RNA comparison usually have lower time complexity than alignment-based algorithms. An alignment-free RNA comparison algorithm was proposed, in which novel numerical representations RNA-TVcurve (triple vector curve representation) of RNA sequence and corresponding secondary structure features are provided. Then a multi-scale similarity score of two given RNAs was designed based on wavelet decomposition of their numerical representation. In support of RNA mutation and phylogenetic analysis, a web server (RNA-TVcurve) was designed based on this alignment-free RNA comparison algorithm. It provides three functional modules: 1) visualization of numerical representation of RNA secondary structure; 2) detection of single-point mutation based on secondary structure; and 3) comparison of pairwise and multiple RNA secondary structures. The inputs of the web server require RNA primary sequences, while corresponding secondary structures are optional. For the primary sequences alone, the web server can compute the secondary structures using free energy minimization algorithm in terms of RNAfold tool from Vienna RNA package. RNA-TVcurve is the first integrated web server, based on an alignment-free method, to deliver a suite of RNA analysis functions, including visualization, mutation analysis and multiple RNAs structure comparison. The comparison results with two popular RNA comparison tools, RNApdist and RNAdistance, showcased that RNA-TVcurve can efficiently capture subtle relationships among RNAs for mutation detection and non-coding RNA classification. All the relevant results were shown in an intuitive graphical manner, and can be freely downloaded from this server. RNA-TVcurve, along with test examples and detailed documents, are available at: http://ml.jlu.edu.cn/tvcurve/ .

  3. Towards Long-Range RNA Structure Prediction in Eukaryotic Genes.

    PubMed

    Pervouchine, Dmitri D

    2018-06-15

    The ability to form an intramolecular structure plays a fundamental role in eukaryotic RNA biogenesis. Proximate regions in the primary transcripts fold into a local secondary structure, which is then hierarchically assembled into a tertiary structure that is stabilized by RNA-binding proteins and long-range intramolecular base pairings. While the local RNA structure can be predicted reasonably well for short sequences, long-range structure at the scale of eukaryotic genes remains problematic from the computational standpoint. The aim of this review is to list functional examples of long-range RNA structures, to summarize current comparative methods of structure prediction, and to highlight their advances and limitations in the context of long-range RNA structures. Most comparative methods implement the “first-align-then-fold” principle, i.e., they operate on multiple sequence alignments, while functional RNA structures often reside in non-conserved parts of the primary transcripts. The opposite “first-fold-then-align” approach is currently explored to a much lesser extent. Developing novel methods in both directions will improve the performance of comparative RNA structure analysis and help discover novel long-range structures, their higher-order organization, and RNA⁻RNA interactions across the transcriptome.

  4. Shared Sulfur Mobilization Routes for tRNA Thiolation and Molybdenum Cofactor Biosynthesis in Prokaryotes and Eukaryotes

    PubMed Central

    Leimkühler, Silke; Bühning, Martin; Beilschmidt, Lena

    2017-01-01

    Modifications of transfer RNA (tRNA) have been shown to play critical roles in the biogenesis, metabolism, structural stability and function of RNA molecules, and the specific modifications of nucleobases with sulfur atoms in tRNA are present in pro- and eukaryotes. Here, especially the thiomodifications xm5s2U at the wobble position 34 in tRNAs for Lys, Gln and Glu, were suggested to have an important role during the translation process by ensuring accurate deciphering of the genetic code and by stabilization of the tRNA structure. The trafficking and delivery of sulfur nucleosides is a complex process carried out by sulfur relay systems involving numerous proteins, which not only deliver sulfur to the specific tRNAs but also to other sulfur-containing molecules including iron–sulfur clusters, thiamin, biotin, lipoic acid and molybdopterin (MPT). Among the biosynthesis of these sulfur-containing molecules, the biosynthesis of the molybdenum cofactor (Moco) and the synthesis of thio-modified tRNAs in particular show a surprising link by sharing protein components for sulfur mobilization in pro- and eukaryotes. PMID:28098827

  5. Contributions of 3'-overhang to the dissociation of small interfering RNAs from the PAZ domain: molecular dynamics simulation study.

    PubMed

    Lee, Hui Sun; Lee, Soo Nam; Joo, Chul Hyun; Lee, Heuiran; Lee, Han Saem; Yoon, Seung Yong; Kim, Yoo Kyum; Choe, Han

    2007-03-01

    RNA interference (RNAi) is a 'knock-down' reaction to reduce expression of a specific gene through highly regulated, enzyme-mediated processes. Small interfering RNAs (siRNAs) are RNA molecules that play an effector role in RNAi and can bind the PAZ domains present in Dicer and RISC. We investigated the interaction between the PAZ domain and the siRNA-like duplexes through dissociation molecular dynamics (DMD) simulations. Specifically, we focused on the response of the PAZ domain to various 3'-overhang structures of the siRNA-like duplexes. We found that the siRNA-like duplex with the 3' UU-overhang made relatively more stable complex with the PAZ domain compared to those with 3' CC-, AA-, and GG-overhangs. The siRNA-like duplex with UU-overhang was easily dissociated from the PAZ domain once the structural stability of the complex is impaired. Interestingly, the 3' UU-overhang spent the least time at the periphery region of the binding pocket during the dissociation process, which can be mainly attributable to UU-overhang's smallest number of hydrogen bonds.

  6. Targeted delivery of miRNA therapeutics for cardiovascular diseases: opportunities and challenges.

    PubMed

    Kwekkeboom, Rick F J; Lei, Zhiyong; Doevendans, Pieter A; Musters, René J P; Sluijter, Joost P G

    2014-09-01

    Dysregulation of miRNA expression has been associated with many cardiovascular diseases in animal models, as well as in patients. In the present review, we summarize recent findings on the role of miRNAs in cardiovascular diseases and discuss the opportunities, possibilities and challenges of using miRNAs as future therapeutic targets. Furthermore, we focus on the different approaches that can be used to deliver these newly developed miRNA therapeutics to their sites of action. Since siRNAs are structurally homologous with the miRNA therapeutics, important lessons learned from siRNA delivery strategies are discussed that might be applicable to targeted delivery of miRNA therapeutics, thereby reducing costs and potential side effects, and improving efficacy.

  7. Eukaryotic 5S rRNA biogenesis

    PubMed Central

    Ciganda, Martin; Williams, Noreen

    2012-01-01

    The ribosome is a large complex containing both protein and RNA which must be assembled in a precise manner to allow proper functioning in the critical role of protein synthesis. 5S rRNA is the smallest of the RNA components of the ribosome, and although it has been studied for decades, we still do not have a clear understanding of its function within the complex ribosome machine. It is the only RNA species that binds ribosomal proteins prior to its assembly into the ribosome. Its transport into the nucleolus requires this interaction. Here we present an overview of some of the key findings concerning the structure and function of 5S rRNA and how its association with specific proteins impacts its localization and function. PMID:21957041

  8. Evaluation of sequence alignments and oligonucleotide probes with respect to three-dimensional structure of ribosomal RNA using ARB software package

    PubMed Central

    Kumar, Yadhu; Westram, Ralf; Kipfer, Peter; Meier, Harald; Ludwig, Wolfgang

    2006-01-01

    Background Availability of high-resolution RNA crystal structures for the 30S and 50S ribosomal subunits and the subsequent validation of comparative secondary structure models have prompted the biologists to use three-dimensional structure of ribosomal RNA (rRNA) for evaluating sequence alignments of rRNA genes. Furthermore, the secondary and tertiary structural features of rRNA are highly useful and successfully employed in designing rRNA targeted oligonucleotide probes intended for in situ hybridization experiments. RNA3D, a program to combine sequence alignment information with three-dimensional structure of rRNA was developed. Integration into ARB software package, which is used extensively by the scientific community for phylogenetic analysis and molecular probe designing, has substantially extended the functionality of ARB software suite with 3D environment. Results Three-dimensional structure of rRNA is visualized in OpenGL 3D environment with the abilities to change the display and overlay information onto the molecule, dynamically. Phylogenetic information derived from the multiple sequence alignments can be overlaid onto the molecule structure in a real time. Superimposition of both statistical and non-statistical sequence associated information onto the rRNA 3D structure can be done using customizable color scheme, which is also applied to a textual sequence alignment for reference. Oligonucleotide probes designed by ARB probe design tools can be mapped onto the 3D structure along with the probe accessibility models for evaluation with respect to secondary and tertiary structural conformations of rRNA. Conclusion Visualization of three-dimensional structure of rRNA in an intuitive display provides the biologists with the greater possibilities to carry out structure based phylogenetic analysis. Coupled with secondary structure models of rRNA, RNA3D program aids in validating the sequence alignments of rRNA genes and evaluating probe target sites. Superimposition of the information derived from the multiple sequence alignment onto the molecule dynamically allows the researchers to observe any sequence inherited characteristics (phylogenetic information) in real-time environment. The extended ARB software package is made freely available for the scientific community via . PMID:16672074

  9. RNA nanopatterning on graphene

    NASA Astrophysics Data System (ADS)

    Li, Q.; Froning, J. P.; Pykal, M.; Zhang, S.; Wang, Z.; Vondrák, M.; Banáš, P.; Čépe, K.; Jurečka, P.; Šponer, J.; Zbořil, R.; Dong, M.; Otyepka, M.

    2018-07-01

    Graphene-based materials enable the sensing of diverse biomolecules using experimental approaches based on electrochemistry, spectroscopy, or other methods. Although basic sensing was achieved, it had until now not been possible to understand and control biomolecules’ structural and morphological organization on graphene surfaces (i.e. their stacking, folding/unfolding, self-assembly, and nano-patterning). Here we present the insight into structural and morphological organization of biomolecules on graphene in water, using an RNA hairpin as a model system. We show that the key parameters governing the RNA’s behavior on the graphene surface are the number of graphene layers, RNA concentration, and temperature. At high concentrations, the RNA forms a film on the graphene surface with entrapped nanobubbles. The density and the size of the bubbles depend on the number of graphene layers. At lower concentrations, unfolded RNA stacks on the graphene and forms molecular clusters on the surface. Such a control over the conformational behavior of interacting biomolecules at graphene/water interfaces would facilitate new applications of graphene derivatives in biotechnology and biomedicine.

  10. Protocols for Molecular Dynamics Simulations of RNA Nanostructures.

    PubMed

    Kim, Taejin; Kasprzak, Wojciech K; Shapiro, Bruce A

    2017-01-01

    Molecular dynamics (MD) simulations have been used as one of the main research tools to study a wide range of biological systems and bridge the gap between X-ray crystallography or NMR structures and biological mechanism. In the field of RNA nanostructures, MD simulations have been used to fix steric clashes in computationally designed RNA nanostructures, characterize the dynamics, and investigate the interaction between RNA and other biomolecules such as delivery agents and membranes.In this chapter we present examples of computational protocols for molecular dynamics simulations in explicit and implicit solvent using the Amber Molecular Dynamics Package. We also show examples of post-simulation analysis steps and briefly mention selected tools beyond the Amber package. Limitations of the methods, tools, and protocols are also discussed. Most of the examples are illustrated for a small RNA duplex (helix), but the protocols are applicable to any nucleic acid structure, subject only to the computational speed and memory limitations of the hardware available to the user.

  11. The Hexamer Structure of the Rift Valley Fever Virus Nucleoprotein Suggests a Mechanism for its Assembly into Ribonucleoprotein Complexes

    PubMed Central

    Ferron, François; Li, Zongli; Danek, Eric I.; Luo, Dahai; Wong, Yeehwa; Coutard, Bruno; Lantez, Violaine; Charrel, Rémi; Canard, Bruno; Walz, Thomas; Lescar, Julien

    2011-01-01

    Rift Valley fever virus (RVFV), a Phlebovirus with a genome consisting of three single-stranded RNA segments, is spread by infected mosquitoes and causes large viral outbreaks in Africa. RVFV encodes a nucleoprotein (N) that encapsidates the viral RNA. The N protein is the major component of the ribonucleoprotein complex and is also required for genomic RNA replication and transcription by the viral polymerase. Here we present the 1.6 Å crystal structure of the RVFV N protein in hexameric form. The ring-shaped hexamers form a functional RNA binding site, as assessed by mutagenesis experiments. Electron microscopy (EM) demonstrates that N in complex with RNA also forms rings in solution, and a single-particle EM reconstruction of a hexameric N-RNA complex is consistent with the crystallographic N hexamers. The ring-like organization of the hexamers in the crystal is stabilized by circular interactions of the N terminus of RVFV N, which forms an extended arm that binds to a hydrophobic pocket in the core domain of an adjacent subunit. The conformation of the N-terminal arm differs from that seen in a previous crystal structure of RVFV, in which it was bound to the hydrophobic pocket in its own core domain. The switch from an intra- to an inter-molecular interaction mode of the N-terminal arm may be a general principle that underlies multimerization and RNA encapsidation by N proteins from Bunyaviridae. Furthermore, slight structural adjustments of the N-terminal arm would allow RVFV N to form smaller or larger ring-shaped oligomers and potentially even a multimer with a super-helical subunit arrangement. Thus, the interaction mode between subunits seen in the crystal structure would allow the formation of filamentous ribonucleocapsids in vivo. Both the RNA binding cleft and the multimerization site of the N protein are promising targets for the development of antiviral drugs. PMID:21589902

  12. The quantitative determination of metabolites of 6-mercaptopurine in biological materials. VI. Evidence for posttranscriptional modification of 6-thioguanosine residues in RNA from L5178Y cells treated with 6-mercaptopurine.

    PubMed

    Breter, H J

    1985-05-24

    Mammalian cells incorporate 6-thioguanosine into their nucleic acids when grown in the presence of 6-mercaptopurine. 35S-labeled total RNA was prepared from L5178Y murine lymphoma cells grown in vitro in the presence of 6-[35S]mercaptopurine. Base analyses of this RNA suggested that 6-thioguanosine residues in RNA molecules undergo posttranscriptional modification. Thus, enzymatic peak-shifting analyses using anion-exchange high-performance liquid chromatography were applied to the hydrolysis products released from total RNA preparations by digestion with nuclease P1 or nuclease P1 plus nucleotide pyrophosphatase. At least eight 35S-labeled, phosphatase-sensitive compounds structurally different from [35S]6thioGMP were found in nuclease P1 digests. Four of these compounds were susceptible to cleavage with nucleotide pyrophosphatase, thus indicating that they contained phosphoric acid anhydride bonds. Individual RNA species were not separately examined, the radiochromatographic data, however, which were obtained from digests of total RNA preparations, present evidence that 6-thioguanosine 5'-diphosphate and 6-thioguanosine 5'-triphosphate exist as 5'-terminal starting nucleotides (in tRNA and rRNA) and that 6-thioguanosine becomes incorporated into the highly modified dinucleoside triphosphate structures (caps) which commonly block the 5'-termini of eukaryotic poly(A)+ mRNA-molecules.

  13. Global identification of target recognition and cleavage by the Microprocessor in human ES cells

    PubMed Central

    Seong, Youngmo; Lim, Do-Hwan; Kim, Augustine; Seo, Jae Hong; Lee, Young Sik; Song, Hoseok; Kwon, Young-Soo

    2014-01-01

    The Microprocessor plays an essential role in canonical miRNA biogenesis by facilitating cleavage of stem-loop structures in primary transcripts to yield pre-miRNAs. Although miRNA biogenesis has been extensively studied through biochemical and molecular genetic approaches, it has yet to be addressed to what extent the current miRNA biogenesis models hold true in intact cells. To address the issues of in vivo recognition and cleavage by the Microprocessor, we investigate RNAs that are associated with DGCR8 and Drosha by using immunoprecipitation coupled with next-generation sequencing. Here, we present global protein–RNA interactions with unprecedented sensitivity and specificity. Our data indicate that precursors of canonical miRNAs and miRNA-like hairpins are the major substrates of the Microprocessor. As a result of specific enrichment of nascent cleavage products, we are able to pinpoint the Microprocessor-mediated cleavage sites per se at single-nucleotide resolution. Unexpectedly, a 2-nt 3′ overhang invariably exists at the ends of cleaved bases instead of nascent pre-miRNAs. Besides canonical miRNA precursors, we find that two novel miRNA-like structures embedded in mRNAs are cleaved to yield pre-miRNA-like hairpins, uncoupled from miRNA maturation. Our data provide a framework for in vivo Microprocessor-mediated cleavage and a foundation for experimental and computational studies on miRNA biogenesis in living cells. PMID:25326327

  14. 3D RNA and functional interactions from evolutionary couplings

    PubMed Central

    Weinreb, Caleb; Riesselman, Adam; Ingraham, John B.; Gross, Torsten; Sander, Chris; Marks, Debora S.

    2016-01-01

    Summary Non-coding RNAs are ubiquitous, but the discovery of new RNA gene sequences far outpaces research on their structure and functional interactions. We mine the evolutionary sequence record to derive precise information about function and structure of RNAs and RNA-protein complexes. As in protein structure prediction, we use maximum entropy global probability models of sequence co-variation to infer evolutionarily constrained nucleotide-nucleotide interactions within RNA molecules, and nucleotide-amino acid interactions in RNA-protein complexes. The predicted contacts allow all-atom blinded 3D structure prediction at good accuracy for several known RNA structures and RNA-protein complexes. For unknown structures, we predict contacts in 160 non-coding RNA families. Beyond 3D structure prediction, evolutionary couplings help identify important functional interactions, e.g., at switch points in riboswitches and at a complex nucleation site in HIV. Aided by accelerating sequence accumulation, evolutionary coupling analysis can accelerate the discovery of functional interactions and 3D structures involving RNA. PMID:27087444

  15. Rclick: a web server for comparison of RNA 3D structures.

    PubMed

    Nguyen, Minh N; Verma, Chandra

    2015-03-15

    RNA molecules play important roles in key biological processes in the cell and are becoming attractive for developing therapeutic applications. Since the function of RNA depends on its structure and dynamics, comparing and classifying the RNA 3D structures is of crucial importance to molecular biology. In this study, we have developed Rclick, a web server that is capable of superimposing RNA 3D structures by using clique matching and 3D least-squares fitting. Our server Rclick has been benchmarked and compared with other popular servers and methods for RNA structural alignments. In most cases, Rclick alignments were better in terms of structure overlap. Our server also recognizes conformational changes between structures. For this purpose, the server produces complementary alignments to maximize the extent of detectable similarity. Various examples showcase the utility of our web server for comparison of RNA, RNA-protein complexes and RNA-ligand structures. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  16. miRCat2: accurate prediction of plant and animal microRNAs from next-generation sequencing datasets

    PubMed Central

    Paicu, Claudia; Mohorianu, Irina; Stocks, Matthew; Xu, Ping; Coince, Aurore; Billmeier, Martina; Dalmay, Tamas; Moulton, Vincent; Moxon, Simon

    2017-01-01

    Abstract Motivation MicroRNAs are a class of ∼21–22 nt small RNAs which are excised from a stable hairpin-like secondary structure. They have important gene regulatory functions and are involved in many pathways including developmental timing, organogenesis and development in eukaryotes. There are several computational tools for miRNA detection from next-generation sequencing datasets. However, many of these tools suffer from high false positive and false negative rates. Here we present a novel miRNA prediction algorithm, miRCat2. miRCat2 incorporates a new entropy-based approach to detect miRNA loci, which is designed to cope with the high sequencing depth of current next-generation sequencing datasets. It has a user-friendly interface and produces graphical representations of the hairpin structure and plots depicting the alignment of sequences on the secondary structure. Results We test miRCat2 on a number of animal and plant datasets and present a comparative analysis with miRCat, miRDeep2, miRPlant and miReap. We also use mutants in the miRNA biogenesis pathway to evaluate the predictions of these tools. Results indicate that miRCat2 has an improved accuracy compared with other methods tested. Moreover, miRCat2 predicts several new miRNAs that are differentially expressed in wild-type versus mutants in the miRNA biogenesis pathway. Availability and Implementation miRCat2 is part of the UEA small RNA Workbench and is freely available from http://srna-workbench.cmp.uea.ac.uk/. Contact v.moulton@uea.ac.uk or s.moxon@uea.ac.uk Supplementary information Supplementary data are available at Bioinformatics online. PMID:28407097

  17. RNA structures as mediators of neurological diseases and as drug targets

    PubMed Central

    Bernat, Viachaslau; Disney, Matthew D.

    2015-01-01

    RNAs adopt diverse folded structures that are essential for function and thus play critical roles in cellular biology. A striking example of this is the ribosome, a complex, three-dimensionally folded macromolecular machine that orchestrates protein synthesis. Advances in RNA biochemistry, structural and molecular biology, and bioinformatics have revealed other non-coding RNAs whose functions are dictated by their structure. It is not surprising that aberrantly folded RNA structures contribute to disease. In this review, we provide a brief introduction into RNA structural biology and then describe how RNA structures function in cells and cause or contribute to neurological disease. Finally, we highlight successful applications of rational design principles to provide chemical probes and lead compounds targeting structured RNAs. Based on several examples of well-characterized RNA-driven neurological disorders, we demonstrate how designed small molecules can facilitate study of RNA dysfunction, elucidating previously unknown roles for RNA in disease, and provide lead therapeutics. PMID:26139368

  18. The conservation and function of RNA secondary structure in plants

    PubMed Central

    Vandivier, Lee E.; Anderson, Stephen J.; Foley, Shawn W.; Gregory, Brian D.

    2016-01-01

    RNA transcripts fold into secondary structures via intricate patterns of base pairing. These secondary structures impart catalytic, ligand binding, and scaffolding functions to a wide array of RNAs, forming a critical node of biological regulation. Among their many functions, RNA structural elements modulate epigenetic marks, alter mRNA stability and translation, regulate alternative splicing, transduce signals, and scaffold large macromolecular complexes. Thus, the study of RNA secondary structure is critical to understanding the function and regulation of RNA transcripts. Here, we review the origins, form, and function of RNA secondary structure, focusing on plants. We then provide an overview of methods for probing secondary structure, from physical methods such as X-ray crystallography and nuclear magnetic resonance imaging (NMR) to chemical and nuclease probing methods. Marriage with high-throughput sequencing has enabled these latter methods to scale across whole transcriptomes, yielding tremendous new insights into the form and function of RNA secondary structure. PMID:26865341

  19. ModeRNA server: an online tool for modeling RNA 3D structures.

    PubMed

    Rother, Magdalena; Milanowska, Kaja; Puton, Tomasz; Jeleniewicz, Jaroslaw; Rother, Kristian; Bujnicki, Janusz M

    2011-09-01

    The diverse functional roles of non-coding RNA molecules are determined by their underlying structure. ModeRNA server is an online tool for RNA 3D structure modeling by the comparative approach, based on a template RNA structure and a user-defined target-template sequence alignment. It offers an option to search for potential templates, given the target sequence. The server also provides tools for analyzing, editing and formatting of RNA structure files. It facilitates the use of the ModeRNA software and offers new options in comparison to the standalone program. ModeRNA server was implemented using the Python language and the Django web framework. It is freely available at http://iimcb.genesilico.pl/modernaserver. iamb@genesilico.pl.

  20. The complete mitochondrial genome of the gall-forming fly, Fergusonina taylori Nelson and Yeates (Diptera: Fergusoninidae).

    PubMed

    Nelson, Leigh A; Cameron, Stephen L; Yeates, David K

    2011-10-01

    The monogeneric family Fergusoninidae consists of gall-forming flies that, together with Fergusobia (Tylenchida: Neotylenchidae) nematodes, form the only known mutualistic association between insects and nematodes. In this study, the entire 16,000 bp mitochondrial genome of Fergusonina taylori Nelson and Yeates was sequenced. The circular genome contains one encoding region including 27 genes and one non-coding A+T-rich region. The arrangement of the protein-coding, ribosomal RNA (rRNA) and transfer RNA (tRNA) genes was the same as that found in the ancestral insect. Nucleotide composition is highly A+T biased. All of the protein initiation codons are ATN, except for nad1 which begins with TTT. All 22 tRNA anticodons of F. taylori match those observed in Drosophila yakuba, and all form the typical cloverleaf structure except for tRNA-Ser((AGN)) which lacks a dihydrouridine (DHU) arm. Secondary structural features of the rRNA genes of Fergusonina are similar to those proposed for other insects, with minor modifications. The mitochondrial genome of Fergusonina presented here may prove valuable for resolving the sister group to the Fergusoninidae, and expands the available mtDNA data sources for acalyptrates overall.

  1. In silico methods for co-transcriptional RNA secondary structure prediction and for investigating alternative RNA structure expression.

    PubMed

    Meyer, Irmtraud M

    2017-05-01

    RNA transcripts are the primary products of active genes in any living organism, including many viruses. Their cellular destiny not only depends on primary sequence signals, but can also be determined by RNA structure. Recent experimental evidence shows that many transcripts can be assigned more than a single functional RNA structure throughout their cellular life and that structure formation happens co-transcriptionally, i.e. as the transcript is synthesised in the cell. Moreover, functional RNA structures are not limited to non-coding transcripts, but can also feature in coding transcripts. The picture that now emerges is that RNA structures constitute an additional layer of information that can be encoded in any RNA transcript (and on top of other layers of information such as protein-context) in order to exert a wide range of functional roles. Moreover, different encoded RNA structures can be expressed at different stages of a transcript's life in order to alter the transcript's behaviour depending on its actual cellular context. Similar to the concept of alternative splicing for protein-coding genes, where a single transcript can yield different proteins depending on cellular context, it is thus appropriate to propose the notion of alternative RNA structure expression for any given transcript. This review introduces several computational strategies that my group developed to detect different aspects of RNA structure expression in vivo. Two aspects are of particular interest to us: (1) RNA secondary structure features that emerge during co-transcriptional folding and (2) functional RNA structure features that are expressed at different times of a transcript's life and potentially mutually exclusive. Copyright © 2017. Published by Elsevier Inc.

  2. Structural architecture of the human long non-coding RNA, steroid receptor RNA activator

    PubMed Central

    Novikova, Irina V.; Hennelly, Scott P.; Sanbonmatsu, Karissa Y.

    2012-01-01

    While functional roles of several long non-coding RNAs (lncRNAs) have been determined, the molecular mechanisms are not well understood. Here, we report the first experimentally derived secondary structure of a human lncRNA, the steroid receptor RNA activator (SRA), 0.87 kB in size. The SRA RNA is a non-coding RNA that coactivates several human sex hormone receptors and is strongly associated with breast cancer. Coding isoforms of SRA are also expressed to produce proteins, making the SRA gene a unique bifunctional system. Our experimental findings (SHAPE, in-line, DMS and RNase V1 probing) reveal that this lncRNA has a complex structural organization, consisting of four domains, with a variety of secondary structure elements. We examine the coevolution of the SRA gene at the RNA structure and protein structure levels using comparative sequence analysis across vertebrates. Rapid evolutionary stabilization of RNA structure, combined with frame-disrupting mutations in conserved regions, suggests that evolutionary pressure preserves the RNA structural core rather than its translational product. We perform similar experiments on alternatively spliced SRA isoforms to assess their structural features. PMID:22362738

  3. RNA 3D Structure Modeling by Combination of Template-Based Method ModeRNA, Template-Free Folding with SimRNA, and Refinement with QRNAS.

    PubMed

    Piatkowski, Pawel; Kasprzak, Joanna M; Kumar, Deepak; Magnus, Marcin; Chojnowski, Grzegorz; Bujnicki, Janusz M

    2016-01-01

    RNA encompasses an essential part of all known forms of life. The functions of many RNA molecules are dependent on their ability to form complex three-dimensional (3D) structures. However, experimental determination of RNA 3D structures is laborious and challenging, and therefore, the majority of known RNAs remain structurally uncharacterized. To address this problem, computational structure prediction methods were developed that either utilize information derived from known structures of other RNA molecules (by way of template-based modeling) or attempt to simulate the physical process of RNA structure formation (by way of template-free modeling). All computational methods suffer from various limitations that make theoretical models less reliable than high-resolution experimentally determined structures. This chapter provides a protocol for computational modeling of RNA 3D structure that overcomes major limitations by combining two complementary approaches: template-based modeling that is capable of predicting global architectures based on similarity to other molecules but often fails to predict local unique features, and template-free modeling that can predict the local folding, but is limited to modeling the structure of relatively small molecules. Here, we combine the use of a template-based method ModeRNA with a template-free method SimRNA. ModeRNA requires a sequence alignment of the target RNA sequence to be modeled with a template of the known structure; it generates a model that predicts the structure of a conserved core and provides a starting point for modeling of variable regions. SimRNA can be used to fold small RNAs (<80 nt) without any additional structural information, and to refold parts of models for larger RNAs that have a correctly modeled core. ModeRNA can be either downloaded, compiled and run locally or run through a web interface at http://genesilico.pl/modernaserver/ . SimRNA is currently available to download for local use as a precompiled software package at http://genesilico.pl/software/stand-alone/simrna and as a web server at http://genesilico.pl/SimRNAweb . For model optimization we use QRNAS, available at http://genesilico.pl/qrnas .

  4. Freiburg RNA tools: a central online resource for RNA-focused research and teaching.

    PubMed

    Raden, Martin; Ali, Syed M; Alkhnbashi, Omer S; Busch, Anke; Costa, Fabrizio; Davis, Jason A; Eggenhofer, Florian; Gelhausen, Rick; Georg, Jens; Heyne, Steffen; Hiller, Michael; Kundu, Kousik; Kleinkauf, Robert; Lott, Steffen C; Mohamed, Mostafa M; Mattheis, Alexander; Miladi, Milad; Richter, Andreas S; Will, Sebastian; Wolff, Joachim; Wright, Patrick R; Backofen, Rolf

    2018-05-21

    The Freiburg RNA tools webserver is a well established online resource for RNA-focused research. It provides a unified user interface and comprehensive result visualization for efficient command line tools. The webserver includes RNA-RNA interaction prediction (IntaRNA, CopraRNA, metaMIR), sRNA homology search (GLASSgo), sequence-structure alignments (LocARNA, MARNA, CARNA, ExpaRNA), CRISPR repeat classification (CRISPRmap), sequence design (antaRNA, INFO-RNA, SECISDesign), structure aberration evaluation of point mutations (RaSE), and RNA/protein-family models visualization (CMV), and other methods. Open education resources offer interactive visualizations of RNA structure and RNA-RNA interaction prediction as well as basic and advanced sequence alignment algorithms. The services are freely available at http://rna.informatik.uni-freiburg.de.

  5. RNA secondary structure prediction with pseudoknots: Contribution of algorithm versus energy model.

    PubMed

    Jabbari, Hosna; Wark, Ian; Montemagno, Carlo

    2018-01-01

    RNA is a biopolymer with various applications inside the cell and in biotechnology. Structure of an RNA molecule mainly determines its function and is essential to guide nanostructure design. Since experimental structure determination is time-consuming and expensive, accurate computational prediction of RNA structure is of great importance. Prediction of RNA secondary structure is relatively simpler than its tertiary structure and provides information about its tertiary structure, therefore, RNA secondary structure prediction has received attention in the past decades. Numerous methods with different folding approaches have been developed for RNA secondary structure prediction. While methods for prediction of RNA pseudoknot-free structure (structures with no crossing base pairs) have greatly improved in terms of their accuracy, methods for prediction of RNA pseudoknotted secondary structure (structures with crossing base pairs) still have room for improvement. A long-standing question for improving the prediction accuracy of RNA pseudoknotted secondary structure is whether to focus on the prediction algorithm or the underlying energy model, as there is a trade-off on computational cost of the prediction algorithm versus the generality of the method. The aim of this work is to argue when comparing different methods for RNA pseudoknotted structure prediction, the combination of algorithm and energy model should be considered and a method should not be considered superior or inferior to others if they do not use the same scoring model. We demonstrate that while the folding approach is important in structure prediction, it is not the only important factor in prediction accuracy of a given method as the underlying energy model is also as of great value. Therefore we encourage researchers to pay particular attention in comparing methods with different energy models.

  6. High-performance liquid chromatography purification of homogenous-length RNA produced by trans cleavage with a hammerhead ribozyme.

    PubMed Central

    Shields, T P; Mollova, E; Ste Marie, L; Hansen, M R; Pardi, A

    1999-01-01

    An improved method is presented for the preparation of milligram quantities of homogenous-length RNAs suitable for nuclear magnetic resonance or X-ray crystallographic structural studies. Heterogeneous-length RNA transcripts are processed with a hammerhead ribozyme to yield homogenous-length products that are then readily purified by anion exchange high-performance liquid chromatography. This procedure eliminates the need for denaturing polyacrylamide gel electrophoresis, which is the most laborious step in the standard procedure for large-scale production of RNA by in vitro transcription. The hammerhead processing of the heterogeneous-length RNA transcripts also substantially improves the overall yield and purity of the desired RNA product. PMID:10496226

  7. Structural Basis for the Altered PAM Recognition by Engineered CRISPR-Cpf1.

    PubMed

    Nishimasu, Hiroshi; Yamano, Takashi; Gao, Linyi; Zhang, Feng; Ishitani, Ryuichiro; Nureki, Osamu

    2017-07-06

    The RNA-guided Cpf1 nuclease cleaves double-stranded DNA targets complementary to the CRISPR RNA (crRNA), and it has been harnessed for genome editing technologies. Recently, Acidaminococcus sp. BV3L6 (AsCpf1) was engineered to recognize altered DNA sequences as the protospacer adjacent motif (PAM), thereby expanding the target range of Cpf1-mediated genome editing. Whereas wild-type AsCpf1 recognizes the TTTV PAM, the RVR (S542R/K548V/N552R) and RR (S542R/K607R) variants can efficiently recognize the TATV and TYCV PAMs, respectively. However, their PAM recognition mechanisms remained unknown. Here we present the 2.0 Å resolution crystal structures of the RVR and RR variants bound to a crRNA and its target DNA. The structures revealed that the RVR and RR variants primarily recognize the PAM-complementary nucleotides via the substituted residues. Our high-resolution structures delineated the altered PAM recognition mechanisms of the AsCpf1 variants, providing a basis for the further engineering of CRISPR-Cpf1. Copyright © 2017 Elsevier Inc. All rights reserved.

  8. Computational Assessment of Potassium and Magnesium Ion Binding to a Buried Pocket in GTPase-Associating Center RNA

    PubMed Central

    2016-01-01

    An experimentally well-studied model of RNA tertiary structures is a 58mer rRNA fragment, known as GTPase-associating center (GAC) RNA, in which a highly negative pocket walled by phosphate oxygen atoms is stabilized by a chelated cation. Although such deep pockets with more than one direct phosphate to ion chelation site normally include magnesium, as shown in one GAC crystal structure, another GAC crystal structure and solution experiments suggest potassium at this site. Both crystal structures also depict two magnesium ions directly bound to the phosphate groups comprising this controversial pocket. Here, we used classical molecular dynamics simulations as well as umbrella sampling to investigate the possibility of binding of potassium versus magnesium inside the pocket and to better characterize the chelation of one of the binding magnesium ions outside the pocket. The results support the preference of the pocket to accommodate potassium rather than magnesium and suggest that one of the closely binding magnesium ions can only bind at high magnesium concentrations, such as might be present during crystallization. This work illustrates the complementary utility of molecular modeling approaches with atomic-level detail in resolving discrepancies between conflicting experimental results. PMID:27983843

  9. Computational Assessment of Potassium and Magnesium Ion Binding to a Buried Pocket in GTPase-Associating Center RNA.

    PubMed

    Hayatshahi, Hamed S; Roe, Daniel R; Galindo-Murillo, Rodrigo; Hall, Kathleen B; Cheatham, Thomas E

    2017-01-26

    An experimentally well-studied model of RNA tertiary structures is a 58mer rRNA fragment, known as GTPase-associating center (GAC) RNA, in which a highly negative pocket walled by phosphate oxygen atoms is stabilized by a chelated cation. Although such deep pockets with more than one direct phosphate to ion chelation site normally include magnesium, as shown in one GAC crystal structure, another GAC crystal structure and solution experiments suggest potassium at this site. Both crystal structures also depict two magnesium ions directly bound to the phosphate groups comprising this controversial pocket. Here, we used classical molecular dynamics simulations as well as umbrella sampling to investigate the possibility of binding of potassium versus magnesium inside the pocket and to better characterize the chelation of one of the binding magnesium ions outside the pocket. The results support the preference of the pocket to accommodate potassium rather than magnesium and suggest that one of the closely binding magnesium ions can only bind at high magnesium concentrations, such as might be present during crystallization. This work illustrates the complementary utility of molecular modeling approaches with atomic-level detail in resolving discrepancies between conflicting experimental results.

  10. Tighter Ligand Binding Can Compensate for Impaired Stability of an RNA-Binding Protein.

    PubMed

    Wallis, Christopher P; Richman, Tara R; Filipovska, Aleksandra; Rackham, Oliver

    2018-06-15

    It has been widely shown that ligand-binding residues, by virtue of their orientation, charge, and solvent exposure, often have a net destabilizing effect on proteins that is offset by stability conferring residues elsewhere in the protein. This structure-function trade-off can constrain possible adaptive evolutionary changes of function and may hamper protein engineering efforts to design proteins with new functions. Here, we present evidence from a large randomized mutant library screen that, in the case of PUF RNA-binding proteins, this structural relationship may be inverted and that active-site mutations that increase protein activity are also able to compensate for impaired stability. We show that certain mutations in RNA-protein binding residues are not necessarily destabilizing and that increased ligand-binding can rescue an insoluble, unstable PUF protein. We hypothesize that these mutations restabilize the protein via thermodynamic coupling of protein folding and RNA binding.

  11. Accurate Classification of RNA Structures Using Topological Fingerprints

    PubMed Central

    Li, Kejie; Gribskov, Michael

    2016-01-01

    While RNAs are well known to possess complex structures, functionally similar RNAs often have little sequence similarity. While the exact size and spacing of base-paired regions vary, functionally similar RNAs have pronounced similarity in the arrangement, or topology, of base-paired stems. Furthermore, predicted RNA structures often lack pseudoknots (a crucial aspect of biological activity), and are only partially correct, or incomplete. A topological approach addresses all of these difficulties. In this work we describe each RNA structure as a graph that can be converted to a topological spectrum (RNA fingerprint). The set of subgraphs in an RNA structure, its RNA fingerprint, can be compared with the fingerprints of other RNA structures to identify and correctly classify functionally related RNAs. Topologically similar RNAs can be identified even when a large fraction, up to 30%, of the stems are omitted, indicating that highly accurate structures are not necessary. We investigate the performance of the RNA fingerprint approach on a set of eight highly curated RNA families, with diverse sizes and functions, containing pseudoknots, and with little sequence similarity–an especially difficult test set. In spite of the difficult test set, the RNA fingerprint approach is very successful (ROC AUC > 0.95). Due to the inclusion of pseudoknots, the RNA fingerprint approach both covers a wider range of possible structures than methods based only on secondary structure, and its tolerance for incomplete structures suggests that it can be applied even to predicted structures. Source code is freely available at https://github.rcac.purdue.edu/mgribsko/XIOS_RNA_fingerprint. PMID:27755571

  12. A Boost for the Emerging Field of RNA Nanotechnology

    PubMed Central

    2011-01-01

    This Nano Focus article highlights recent advances in RNA nanotechnology as presented at the First International Conference of RNA Nanotechnology and Therapeutics, which took place in Cleveland, OH, USA (October 23–25, 2010) (http://www.eng.uc.edu/nanomedicine/RNA2010/), chaired by Peixuan Guo and co-chaired by David Rueda and Scott Tenenbaum. The conference was the first of its kind to bring together more than 30 invited speakers in the frontier of RNA nanotechnology from France, Sweden, South Korea, China, and throughout the United States to discuss RNA nanotechnology and its applications. It provided a platform for researchers from academia, government, and the pharmaceutical industry to share existing knowledge, vision, technology, and challenges in the field and promoted collaborations among researchers interested in advancing this emerging scientific discipline. The meeting covered a range of topics, including biophysical and single-molecule approaches for characterization of RNA nanostructures; structure studies on RNA nanoparticles by chemical or biochemical approaches, computation, prediction, and modeling of RNA nanoparticle structures; methods for the assembly of RNA nanoparticles; chemistry for RNA synthesis, conjugation, and labeling; and application of RNA nanoparticles in therapeutics. A special invited talk on the well-established principles of DNA nanotechnology was arranged to provide models for RNA nanotechnology. An Administrator from National Institutes of Health (NIH) National Cancer Institute (NCI) Alliance for Nanotechnology in Cancer discussed the current nanocancer research directions and future funding opportunities at NCI. As indicated by the feedback received from the invited speakers and the meeting participants, this meeting was extremely successful, exciting, and informative, covering many groundbreaking findings, pioneering ideas, and novel discoveries. PMID:21604810

  13. RNA Characterization by Solid-State NMR Spectroscopy.

    PubMed

    Yang, Yufei; Wang, Shenlin

    2018-06-21

    The structures of RNAs, which play critical roles in various biological processes, provide important clues and insights into the biological functions of these molecules. However, RNA structure determination remains a challenging topic. In recent years, magic-angle-spinning solid-state NMR (MAS SSNMR) has emerged as an alternative technique for structural and dynamic characterization of RNA. MAS SSNMR has been successfully applied to provide atomic-level structural information about several RNA molecules and RNA-protein complexes. In this Minireview, we give an overview of recent progress in the field of MAS SSNMR based RNA structural characterization, and introduce sample preparation strategies and SSNMR spectroscopic techniques that have been incorporated to identify RNA structural elements. We also highlight a few impressive examples of RNAs that have been investigated extensively by SSNMR. Finally, we briefly discuss future technical trends in the use of MAS SSNMR to facilitate RNA structure determination. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  14. Structure of the MazF-mt9 toxin, a tRNA-specific endonuclease from Mycobacterium tuberculosis.

    PubMed

    Chen, Ran; Tu, Jie; Liu, Zhihui; Meng, Fanrong; Ma, Pinyun; Ding, Zhishan; Yang, Chengwen; Chen, Lei; Deng, Xiangyu; Xie, Wei

    2017-05-06

    Tuberculosis (TB) is a severe disease caused by Mycobacterium tuberculosis (M. tb) and the well-characterized M. tb MazE/F proteins play important roles in stress adaptation. Recently, the MazF-mt9 toxin has been found to display endonuclease activities towards tRNAs but the mechanism is unknown. We hereby present the crystal structure of apo-MazF-mt9. The enzyme recognizes tRNA Lys with a central UUU motif within the anticodon loop, but is insensitive to the sequence context outside of the loop. Based on our crystallographic and biochemical studies, we identified key residues for catalysis and proposed the potential tRNA-binding site. Copyright © 2017 Elsevier Inc. All rights reserved.

  15. Structural features of microRNA (miRNA) precursors and their relevance to miRNA biogenesis and small interfering RNA/short hairpin RNA design.

    PubMed

    Krol, Jacek; Sobczak, Krzysztof; Wilczynska, Urszula; Drath, Maria; Jasinska, Anna; Kaczynska, Danuta; Krzyzosiak, Wlodzimierz J

    2004-10-01

    We have established the structures of 10 human microRNA (miRNA) precursors using biochemical methods. Eight of these structures turned out to be different from those that were computer-predicted. The differences localized in the terminal loop region and at the opposite side of the precursor hairpin stem. We have analyzed the features of these structures from the perspectives of miRNA biogenesis and active strand selection. We demonstrated the different thermodynamic stability profiles for pre-miRNA hairpins harboring miRNAs at their 5'- and 3'-sides and discussed their functional implications. Our results showed that miRNA prediction based on predicted precursor structures may give ambiguous results, and the success rate is significantly higher for the experimentally determined structures. On the other hand, the differences between the predicted and experimentally determined structures did not affect the stability of termini produced through "conceptual dicing." This result confirms the value of thermodynamic analysis based on mfold as a predictor of strand section by RNAi-induced silencing complex (RISC).

  16. Stem-Loop RNA Hairpins in Giant Viruses: Invading rRNA-Like Repeats and a Template Free RNA

    PubMed Central

    Seligmann, Hervé; Raoult, Didier

    2018-01-01

    We examine the hypothesis that de novo template-free RNAs still form spontaneously, as they did at the origins of life, invade modern genomes, contribute new genetic material. Previously, analyses of RNA secondary structures suggested that some RNAs resembling ancestral (t)RNAs formed recently de novo, other parasitic sequences cluster with rRNAs. Here positive control analyses of additional RNA secondary structures confirm ancestral and de novo statuses of RNA grouped according to secondary structure. Viroids with branched stems resemble de novo RNAs, rod-shaped viroids resemble rRNA secondary structures, independently of GC contents. 5′ UTR leading regions of West Nile and Dengue flavivirid viruses resemble de novo and rRNA structures, respectively. An RNA homologous with Megavirus, Dengue and West Nile genomes, copperhead snake microsatellites and levant cotton repeats, not templated by Mimivirus' genome, persists throughout Mimivirus' infection. Its secondary structure clusters with candidate de novo RNAs. The saltatory phyletic distribution and secondary structure of Mimivirus' peculiar RNA suggest occasional template-free polymerization of this sequence, rather than noncanonical transcriptions (swinger polymerization, posttranscriptional editing). PMID:29449833

  17. Vfold: a web server for RNA structure and folding thermodynamics prediction.

    PubMed

    Xu, Xiaojun; Zhao, Peinan; Chen, Shi-Jie

    2014-01-01

    The ever increasing discovery of non-coding RNAs leads to unprecedented demand for the accurate modeling of RNA folding, including the predictions of two-dimensional (base pair) and three-dimensional all-atom structures and folding stabilities. Accurate modeling of RNA structure and stability has far-reaching impact on our understanding of RNA functions in human health and our ability to design RNA-based therapeutic strategies. The Vfold server offers a web interface to predict (a) RNA two-dimensional structure from the nucleotide sequence, (b) three-dimensional structure from the two-dimensional structure and the sequence, and (c) folding thermodynamics (heat capacity melting curve) from the sequence. To predict the two-dimensional structure (base pairs), the server generates an ensemble of structures, including loop structures with the different intra-loop mismatches, and evaluates the free energies using the experimental parameters for the base stacks and the loop entropy parameters given by a coarse-grained RNA folding model (the Vfold model) for the loops. To predict the three-dimensional structure, the server assembles the motif scaffolds using structure templates extracted from the known PDB structures and refines the structure using all-atom energy minimization. The Vfold-based web server provides a user friendly tool for the prediction of RNA structure and stability. The web server and the source codes are freely accessible for public use at "http://rna.physics.missouri.edu".

  18. Differential structural status of the RNA counterpart of an undecamer quasi-palindromic DNA sequence present in LCR of human β-globin gene cluster.

    PubMed

    Kaushik, Mahima; Kukreti, Shrikant

    2015-01-01

    Our previous work on structural polymorphism shown at a single nucleotide polymorphism (SNP) (A → G) site located on HS4 region of locus control region (LCR) of β-globin gene has established a hairpin → duplex equilibrium corresponding to A → B like DNA transition (Kaushik M, Kukreti, R., Grover, D., Brahmachari, S.K. and Kukreti S. Nucleic Acids Res. 2003; Kaushik M, Kukreti S. Nucleic Acids Res. 2006). The G-allele of A → G SNP has been shown to be significantly associated with the occurrence of β-thalassemia. Considering the significance of this 11-nt long quasi-palindromic sequence [5'-TGGGG(G/A)CCCCA; HP(G/A)11] of β-globin gene LCR, we further explored the differential behavior of the same DNA sequence with its RNA counterpart, using various biophysical and biochemical techniques. In contrast to its DNA counterpart exhibiting a A → B structural transition and an equilibrium between duplex and hairpin forms, the studied RNA oligonucleotide sequence [5'-UGGGG(G/A)CCCCA; RHP(G/A)11] existed only in duplex form (A-conformation) and did not form hairpin. The single residue difference from A to G led to the unusual thermal stability of the RNA structure formed by the studied sequence. Since, naturally occurring mutations and various SNP sites may stabilize or destabilize the local DNA/RNA secondary structures, these structural transitions may affect the gene expression by a change in the protein-DNA recognition patterns.

  19. Escherichia coli Ribosomal Protein S1 Unfolds Structured mRNAs Onto the Ribosome for Active Translation Initiation

    PubMed Central

    Duval, Mélodie; Korepanov, Alexey; Fuchsbauer, Olivier; Fechter, Pierre; Haller, Andrea; Fabbretti, Attilio; Choulier, Laurence; Micura, Ronald; Klaholz, Bruno P.; Romby, Pascale; Springer, Mathias; Marzi, Stefano

    2013-01-01

    Regulation of translation initiation is well appropriate to adapt cell growth in response to stress and environmental changes. Many bacterial mRNAs adopt structures in their 5′ untranslated regions that modulate the accessibility of the 30S ribosomal subunit. Structured mRNAs interact with the 30S in a two-step process where the docking of a folded mRNA precedes an accommodation step. Here, we used a combination of experimental approaches in vitro (kinetic of mRNA unfolding and binding experiments to analyze mRNA–protein or mRNA–ribosome complexes, toeprinting assays to follow the formation of ribosomal initiation complexes) and in vivo (genetic) to monitor the action of ribosomal protein S1 on the initiation of structured and regulated mRNAs. We demonstrate that r-protein S1 endows the 30S with an RNA chaperone activity that is essential for the docking and the unfolding of structured mRNAs, and for the correct positioning of the initiation codon inside the decoding channel. The first three OB-fold domains of S1 retain all its activities (mRNA and 30S binding, RNA melting activity) on the 30S subunit. S1 is not required for all mRNAs and acts differently on mRNAs according to the signals present at their 5′ ends. This work shows that S1 confers to the ribosome dynamic properties to initiate translation of a large set of mRNAs with diverse structural features. PMID:24339747

  20. Structural variations of single and tandem mismatches in RNA duplexes: a joint MD simulation and crystal structure database analysis.

    PubMed

    Halder, Sukanya; Bhattacharyya, Dhananjay

    2012-10-04

    Internal loops within RNA duplex regions are formed by single or tandem basepairing mismatches with flanking canonical Watson-Crick basepairs on both sides. They are the most common motif observed in RNA secondary structures and play integral functional and structural roles. In this report, we have studied the structural features of 1 × 1, 2 × 2, and 3 × 3 internal loops using all-atom molecular dynamics (MD) simulation technique with explicit solvent model. As MD simulation is intricately dependent on the choice of force-field and these are often rather approximate, we have used both the most popular force-fields for nucleic acids-CHARMM27 and AMBER94-for a comparative analysis. We find that tandem noncanonical basepairs forming 2 × 2 and 3 × 3 internal loops are considerably more stable than the single mismatches forming 1 × 1 internal loops, irrespective of the force field. We have also analyzed crystal structure database to study the conservation of these helical fragments in the corresponding sets of RNA structures. We observe that the nature of stability in MD simulations mimic their fluctuating natures in crystal data sets also, probably indicating reliable natures of both the force fields to reproduce experimental results. We also notice significant structural changes in the wobble G:U basepairs present in these double helical stretches, leading to a biphasic stability for these wobble pairs to release the deformational strains introduced by internal loops within duplex regions.

  1. Visualization of RNA–protein interactions in living cells: FMRP and IMP1 interact on mRNAs

    PubMed Central

    Rackham, Oliver; Brown, Chris M

    2004-01-01

    Protein expression depends significantly on the stability, translation efficiency and localization of mRNA. These qualities are largely dictated by the RNA-binding proteins associated with an mRNA. Here, we report a method to visualize and localize RNA–protein interactions in living mammalian cells. Using this method, we found that the fragile X mental retardation protein (FMRP) isoform 18 and the human zipcode-binding protein 1 ortholog IMP1, an RNA transport factor, were present on common mRNAs. These interactions occurred predominantly in the cytoplasm, in granular structures. In addition, FMRP and IMP1 interacted independently of RNA. Tethering of FMRP to an mRNA caused IMP1 to be recruited to the same mRNA and resulted in granule formation. The intimate association of FMRP and IMP1 suggests a link between mRNA transport and translational repression in mammalian cells. PMID:15282548

  2. Characterization of a Novel Association between Two Trypanosome-Specific Proteins and 5S rRNA

    PubMed Central

    Ciganda, Martin; Williams, Noreen

    2012-01-01

    P34 and P37 are two previously identified RNA binding proteins in the flagellate protozoan Trypanosoma brucei. RNA interference studies have determined that the proteins are essential and are involved in ribosome biogenesis. Here, we show that these proteins interact in vitro with the 5S rRNA with nearly identical binding characteristics in the absence of other cellular factors. The T. brucei 5S rRNA has a complex secondary structure and presents four accessible loops (A to D) for interactions with RNA-binding proteins. In other eukaryotes, loop C is bound by the L5 ribosomal protein and loop A mainly by TFIIIA. The binding of P34 and P37 to T. brucei 5S rRNA involves the LoopA region of the RNA, but these proteins also protect the L5 binding site located on LoopC. PMID:22253864

  3. Small interfering ribonucleic acid induces liquid-to-ripple phase transformation in a phospholipid membrane

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Choubey, Amit; Nomura, Ken-ichi; Kalia, Rajiv K.

    Small interfering ribonucleic acid (siRNA) molecules play a pivotal role in silencing gene expression via the RNA interference mechanism. A key limitation to the widespread implementation of siRNA therapeutics is the difficulty of delivering siRNA-based drugs to cells. Here, we examine changes in the structure and dynamics of a dipalmitoylphosphatidylcholine bilayer in the presence of a siRNA molecule and mechanical barriers to siRNA transfection in the bilayer. Our all-atom molecular dynamics simulation shows that siRNA induces a liquid crystalline-to-ripple phase transformation in the bilayer. The ripple phase consists of a major region of non-interdigitated and a minor region of interdigitatedmore » lipid molecules with an intervening kink. In the ripple phase, hydrocarbon chains of lipid molecules have large compressive stresses, which present a considerable barrier to siRNA transfection.« less

  4. Interpretable dimensionality reduction of single cell transcriptome data with deep generative models.

    PubMed

    Ding, Jiarui; Condon, Anne; Shah, Sohrab P

    2018-05-21

    Single-cell RNA-sequencing has great potential to discover cell types, identify cell states, trace development lineages, and reconstruct the spatial organization of cells. However, dimension reduction to interpret structure in single-cell sequencing data remains a challenge. Existing algorithms are either not able to uncover the clustering structures in the data or lose global information such as groups of clusters that are close to each other. We present a robust statistical model, scvis, to capture and visualize the low-dimensional structures in single-cell gene expression data. Simulation results demonstrate that low-dimensional representations learned by scvis preserve both the local and global neighbor structures in the data. In addition, scvis is robust to the number of data points and learns a probabilistic parametric mapping function to add new data points to an existing embedding. We then use scvis to analyze four single-cell RNA-sequencing datasets, exemplifying interpretable two-dimensional representations of the high-dimensional single-cell RNA-sequencing data.

  5. Crystal structures of a group II intron maturase reveal a missing link in spliceosome evolution.

    PubMed

    Zhao, Chen; Pyle, Anna Marie

    2016-06-01

    Group II introns are self-splicing ribozymes that are essential in many organisms, and they have been hypothesized to share a common evolutionary ancestor with the spliceosome. Although structural similarity of RNA components supports this connection, it is of interest to determine whether associated protein factors also share an evolutionary heritage. Here we present the crystal structures of reverse transcriptase (RT) domains from two group II intron-encoded proteins (maturases) from Roseburia intestinalis and Eubacterium rectale, obtained at 1.2-Å and 2.1-Å resolution, respectively. These domains are more similar in architecture to the spliceosomal Prp8 RT-like domain than to any other RTs, and they share substantial similarity with flaviviral RNA polymerases. The RT domain itself is sufficient for binding intron RNA with high affinity and specificity, and it is contained within an active RT enzyme. These studies provide a foundation for understanding structure-function relationships within group II intron-maturase complexes.

  6. The 28S–18S rDNA intergenic spacer from Crithidia fasciculata: repeated sequences, length heterogeneity, putative processing sites and potential interactions between U3 small nucleolar RNA and the ribosomal RNA precursor

    PubMed Central

    Schnare, Murray N.; Collings, James C.; Spencer, David F.; Gray, Michael W.

    2000-01-01

    In Crithidia fasciculata, the ribosomal RNA (rRNA) gene repeats range in size from ∼11 to 12 kb. This length heterogeneity is localized to a region of the intergenic spacer (IGS) that contains tandemly repeated copies of a 19mer sequence. The IGS also contains four copies of an ∼55 nt repeat that has an internal inverted repeat and is also present in the IGS of Leishmania species. We have mapped the C.fasciculata transcription initiation site as well as two other reverse transcriptase stop sites that may be analogous to the A0 and A′ pre-rRNA processing sites within the 5′ external transcribed spacer (ETS) of other eukaryotes. Features that could influence processing at these sites include two stretches of conserved primary sequence and three secondary structure elements present in the 5′ ETS. We also characterized the C.fasciculata U3 snoRNA, which has the potential for base-pairing with pre-rRNA sequences. Finally, we demonstrate that biosynthesis of large subunit rRNA in both C.fasciculata and Trypanosoma brucei involves 3′-terminal addition of three A residues that are not present in the corresponding DNA sequences. PMID:10982863

  7. Highly-sensitive microRNA detection based on bio-bar-code assay and catalytic hairpin assembly two-stage amplification.

    PubMed

    Tang, Songsong; Gu, Yuan; Lu, Huiting; Dong, Haifeng; Zhang, Kai; Dai, Wenhao; Meng, Xiangdan; Yang, Fan; Zhang, Xueji

    2018-04-03

    Herein, a highly-sensitive microRNA (miRNA) detection strategy was developed by combining bio-bar-code assay (BBA) with catalytic hairpin assembly (CHA). In the proposed system, two nanoprobes of magnetic nanoparticles functionalized with DNA probes (MNPs-DNA) and gold nanoparticles with numerous barcode DNA (AuNPs-DNA) were designed. In the presence of target miRNA, the MNP-DNA and AuNP-DNA hybridized with target miRNA to form a "sandwich" structure. After "sandwich" structures were separated from the solution by the magnetic field and dehybridized by high temperature, the barcode DNA sequences were released by dissolving AuNPs. The released barcode DNA sequences triggered the toehold strand displacement assembly of two hairpin probes, leading to recycle of barcode DNA sequences and producing numerous fluorescent CHA products for miRNA detection. Under the optimal experimental conditions, the proposed two-stage amplification system could sensitively detect target miRNA ranging from 10 pM to 10 aM with a limit of detection (LOD) down to 97.9 zM. It displayed good capability to discriminate single base and three bases mismatch due to the unique sandwich structure. Notably, it presented good feasibility for selective multiplexed detection of various combinations of synthetic miRNA sequences and miRNAs extracted from different cell lysates, which were in agreement with the traditional polymerase chain reaction analysis. The two-stage amplification strategy may be significant implication in the biological detection and clinical diagnosis. Copyright © 2017 Elsevier B.V. All rights reserved.

  8. RNAstructure: software for RNA secondary structure prediction and analysis.

    PubMed

    Reuter, Jessica S; Mathews, David H

    2010-03-15

    To understand an RNA sequence's mechanism of action, the structure must be known. Furthermore, target RNA structure is an important consideration in the design of small interfering RNAs and antisense DNA oligonucleotides. RNA secondary structure prediction, using thermodynamics, can be used to develop hypotheses about the structure of an RNA sequence. RNAstructure is a software package for RNA secondary structure prediction and analysis. It uses thermodynamics and utilizes the most recent set of nearest neighbor parameters from the Turner group. It includes methods for secondary structure prediction (using several algorithms), prediction of base pair probabilities, bimolecular structure prediction, and prediction of a structure common to two sequences. This contribution describes new extensions to the package, including a library of C++ classes for incorporation into other programs, a user-friendly graphical user interface written in JAVA, and new Unix-style text interfaces. The original graphical user interface for Microsoft Windows is still maintained. The extensions to RNAstructure serve to make RNA secondary structure prediction user-friendly. The package is available for download from the Mathews lab homepage at http://rna.urmc.rochester.edu/RNAstructure.html.

  9. An RNA pseudoknot is required for production of yellow fever virus subgenomic RNA by the host nuclease XRN1.

    PubMed

    Silva, Patrícia A G C; Pereira, Carina F; Dalebout, Tim J; Spaan, Willy J M; Bredenbeek, Peter J

    2010-11-01

    Cells and mice infected with arthropod-borne flaviviruses produce a small subgenomic RNA that is colinear with the distal part of the viral 3'-untranslated region (UTR). This small subgenomic flavivirus RNA (sfRNA) results from the incomplete degradation of the viral genome by the host 5'-3' exonuclease XRN1. Production of the sfRNA is important for the pathogenicity of the virus. This study not only presents a detailed description of the yellow fever virus (YFV) sfRNA but, more importantly, describes for the first time the molecular characteristics of the stalling site for XRN1 in the flavivirus genome. Similar to the case for West Nile virus, the YFV sfRNA was produced by XRN1. However, in contrast to the case for other arthropod-borne flaviviruses, not one but two sfRNAs were detected in YFV-infected mammalian cells. The smaller of these two sfRNAs was not observed in infected mosquito cells. The larger sfRNA could also be produced in vitro by incubation with purified XRN1. These two YFV sfRNAs formed a 5'-nested set. The 5' ends of the YFV sfRNAs were found to be just upstream of the previously predicted RNA pseudoknot PSK3. RNA structure probing and mutagenesis studies provided strong evidence that this pseudoknot structure was formed and served as the molecular signal to stall XRN1. The sequence involved in PSK3 formation was cloned into the Sinrep5 expression vector and shown to direct the production of an sfRNA-like RNA. These results underscore the importance of the RNA pseudoknot in stalling XRN1 and also demonstrate that it is the sole viral requirement for sfRNA production.

  10. Kissing-loop interaction between 5′ and 3′ ends of tick-borne Langat virus genome ‘bridges the gap’ between mosquito- and tick-borne flaviviruses in mechanisms of viral RNA cyclization: applications for virus attenuation and vaccine development

    PubMed Central

    Tsetsarkin, Konstantin A.; Liu, Guangping; Shen, Kui; Pletnev, Alexander G.

    2016-01-01

    Insertion of microRNA target sequences into the flavivirus genome results in selective tissue-specific attenuation and host-range restriction of live attenuated vaccine viruses. However, previous strategies for miRNA-targeting did not incorporate a mechanism to prevent target elimination under miRNA-mediated selective pressure, restricting their use in vaccine development. To overcome this limitation, we developed a new approach for miRNA-targeting of tick-borne flavivirus (Langat virus, LGTV) in the duplicated capsid gene region (DCGR). Genetic stability of viruses with DCGR was ensured by the presence of multiple cis-acting elements within the N-terminal capsid coding region, including the stem-loop structure (5′SL6) at the 3′ end of the promoter. We found that the 5′SL6 functions as a structural scaffold for the conserved hexanucleotide motif at its tip and engages in a complementary interaction with the region present in the 3′ NCR to enhance viral RNA replication. The resulting kissing-loop interaction, common in tick-borne flaviviruses, supports a single pair of cyclization elements (CYC) and functions as a homolog of the second pair of CYC that is present in the majority of mosquito-borne flaviviruses. Placing miRNA targets into the DCGR results in superior attenuation of LGTV in the CNS and does not interfere with development of protective immunity in immunized mice. PMID:26850640

  11. Correcting pervasive errors in RNA crystallography through enumerative structure prediction.

    PubMed

    Chou, Fang-Chieh; Sripakdeevong, Parin; Dibrov, Sergey M; Hermann, Thomas; Das, Rhiju

    2013-01-01

    Three-dimensional RNA models fitted into crystallographic density maps exhibit pervasive conformational ambiguities, geometric errors and steric clashes. To address these problems, we present enumerative real-space refinement assisted by electron density under Rosetta (ERRASER), coupled to Python-based hierarchical environment for integrated 'xtallography' (PHENIX) diffraction-based refinement. On 24 data sets, ERRASER automatically corrects the majority of MolProbity-assessed errors, improves the average R(free) factor, resolves functionally important discrepancies in noncanonical structure and refines low-resolution models to better match higher-resolution models.

  12. RNA Structures as Mediators of Neurological Diseases and as Drug Targets.

    PubMed

    Bernat, Viachaslau; Disney, Matthew D

    2015-07-01

    RNAs adopt diverse folded structures that are essential for function and thus play critical roles in cellular biology. A striking example of this is the ribosome, a complex, three-dimensionally folded macromolecular machine that orchestrates protein synthesis. Advances in RNA biochemistry, structural and molecular biology, and bioinformatics have revealed other non-coding RNAs whose functions are dictated by their structure. It is not surprising that aberrantly folded RNA structures contribute to disease. In this Review, we provide a brief introduction into RNA structural biology and then describe how RNA structures function in cells and cause or contribute to neurological disease. Finally, we highlight successful applications of rational design principles to provide chemical probes and lead compounds targeting structured RNAs. Based on several examples of well-characterized RNA-driven neurological disorders, we demonstrate how designed small molecules can facilitate the study of RNA dysfunction, elucidating previously unknown roles for RNA in disease, and provide lead therapeutics. Copyright © 2015 Elsevier Inc. All rights reserved.

  13. Trans-acting RNAs as molecular probes for monitoring time-dependent structural change of an RNA complex adapting two structures.

    PubMed

    Maeda, Yuri; Furuta, Hiroyuki; Ikawa, Yoshiya

    2011-03-01

    As dynamic structural changes are pivotal for the functions of some classes of RNA molecule, it is important to develop methods to monitor structural changes in RNA in a time-dependent manner without chemical modification. Based on previous reports that trans-acting RNAs can be used as probes for analysis and control of 3D structures of target RNAs, we applied this method to monitor time-dependent structural changes in RNA. We designed and performed a proof-of-principle study using a simple model RNA complex that adopts two different structures as a target. The time-dependent structural changes in the target RNA were successfully monitored using two trans-acting RNAs, which stably form a ternary complex with the bimolecular target RNA and act as a catalyst to join two RNA fragments of the target complex, respectively. Copyright © 2010 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.

  14. Structure of a human cap-dependent 48S translation pre-initiation complex

    PubMed Central

    Eliseev, Boris; Yeramala, Lahari; Leitner, Alexander; Karuppasamy, Manikandan; Raimondeau, Etienne; Huard, Karine; Alkalaeva, Elena; Aebersold, Ruedi

    2018-01-01

    Abstract Eukaryotic translation initiation is tightly regulated, requiring a set of conserved initiation factors (eIFs). Translation of a capped mRNA depends on the trimeric eIF4F complex and eIF4B to load the mRNA onto the 43S pre-initiation complex comprising 40S and initiation factors 1, 1A, 2, 3 and 5 as well as initiator-tRNA. Binding of the mRNA is followed by mRNA scanning in the 48S pre-initiation complex, until a start codon is recognised. Here, we use a reconstituted system to prepare human 48S complexes assembled on capped mRNA in the presence of eIF4B and eIF4F. The highly purified h-48S complexes are used for cross-linking/mass spectrometry, revealing the protein interaction network in this complex. We report the electron cryo-microscopy structure of the h-48S complex at 6.3 Å resolution. While the majority of eIF4B and eIF4F appear to be flexible with respect to the ribosome, additional density is detected at the entrance of the 40S mRNA channel which we attribute to the RNA-recognition motif of eIF4B. The eight core subunits of eIF3 are bound at the 40S solvent-exposed side, as well as the subunits eIF3d, eIF3b and eIF3i. elF2 and initiator-tRNA bound to the start codon are present at the 40S intersubunit side. This cryo-EM structure represents a molecular snap-shot revealing the h-48S complex following start codon recognition. PMID:29401259

  15. Structural insights into RISC assembly facilitated by dsRNA-binding domains of human RNA helicase A (DHX9)

    PubMed Central

    Fu, Qinqin; Yuan, Y. Adam

    2013-01-01

    Intensive research interest has focused on small RNA-processing machinery and the RNA-induced silencing complex (RISC), key cellular machines in RNAi pathways. However, the structural mechanism regarding RISC assembly, the primary step linking small RNA processing and RNA-mediated gene silencing, is largely unknown. Human RNA helicase A (DHX9) was reported to function as an RISC-loading factor, and such function is mediated mainly by its dsRNA-binding domains (dsRBDs). Here, we report the crystal structures of human RNA helicase A (RHA) dsRBD1 and dsRBD2 domains in complex with dsRNAs, respectively. Structural analysis not only reveals higher siRNA duplex-binding affinity displayed by dsRBD1, but also identifies a crystallographic dsRBD1 pair of physiological significance in cooperatively recognizing dsRNAs. Structural observations are further validated by isothermal titration calorimetric (ITC) assay. Moreover, co-immunoprecipitation (co-IP) assay coupled with mutagenesis demonstrated that both dsRBDs are required for RISC association, and such association is mediated by dsRNA. Hence, our structural and functional efforts have revealed a potential working model for siRNA recognition by RHA tandem dsRBDs, and together they provide direct structural insights into RISC assembly facilitated by RHA. PMID:23361462

  16. Structural insights into RISC assembly facilitated by dsRNA-binding domains of human RNA helicase A (DHX9).

    PubMed

    Fu, Qinqin; Yuan, Y Adam

    2013-03-01

    Intensive research interest has focused on small RNA-processing machinery and the RNA-induced silencing complex (RISC), key cellular machines in RNAi pathways. However, the structural mechanism regarding RISC assembly, the primary step linking small RNA processing and RNA-mediated gene silencing, is largely unknown. Human RNA helicase A (DHX9) was reported to function as an RISC-loading factor, and such function is mediated mainly by its dsRNA-binding domains (dsRBDs). Here, we report the crystal structures of human RNA helicase A (RHA) dsRBD1 and dsRBD2 domains in complex with dsRNAs, respectively. Structural analysis not only reveals higher siRNA duplex-binding affinity displayed by dsRBD1, but also identifies a crystallographic dsRBD1 pair of physiological significance in cooperatively recognizing dsRNAs. Structural observations are further validated by isothermal titration calorimetric (ITC) assay. Moreover, co-immunoprecipitation (co-IP) assay coupled with mutagenesis demonstrated that both dsRBDs are required for RISC association, and such association is mediated by dsRNA. Hence, our structural and functional efforts have revealed a potential working model for siRNA recognition by RHA tandem dsRBDs, and together they provide direct structural insights into RISC assembly facilitated by RHA.

  17. Structure and function of the N-terminal domain of the yeast telomerase reverse transcriptase

    PubMed Central

    Petrova, Olga A; Mantsyzov, Alexey B; Rodina, Elena V; Efimov, Sergey V; Hackenberg, Claudia; Hakanpää, Johanna; Klochkov, Vladimir V; Lebedev, Andrej A; Chugunova, Anastasia A; Malyavko, Alexander N; Zatsepin, Timofei S; Mishin, Alexey V; Zvereva, Maria I

    2018-01-01

    Abstract The elongation of single-stranded DNA repeats at the 3′-ends of chromosomes by telomerase is a key process in maintaining genome integrity in eukaryotes. Abnormal activation of telomerase leads to uncontrolled cell division, whereas its down-regulation is attributed to ageing and several pathologies related to early cell death. Telomerase function is based on the dynamic interactions of its catalytic subunit (TERT) with nucleic acids—telomerase RNA, telomeric DNA and the DNA/RNA heteroduplex. Here, we present the crystallographic and NMR structures of the N-terminal (TEN) domain of TERT from the thermotolerant yeast Hansenula polymorpha and demonstrate the structural conservation of the core motif in evolutionarily divergent organisms. We identify the TEN residues that are involved in interactions with the telomerase RNA and in the recognition of the ‘fork’ at the distal end of the DNA product/RNA template heteroduplex. We propose that the TEN domain assists telomerase biological function and is involved in restricting the size of the heteroduplex during telomere repeat synthesis. PMID:29294091

  18. Structural analysis of the Quaking homodimerization interface

    PubMed Central

    Beuck, Christine; Qu, Song; Fagg, W. Samuel; Ares, Manuel; Williamson, James R.

    2012-01-01

    Quaking is a prototypical member of the STAR protein family, which plays key roles in posttranscriptional gene regulation by controlling mRNA translation, stability and splicing. QkI-5 has been shown to regulate mRNA expression in the central nervous system, but little is known about its roles in other tissues. STAR proteins function as dimers and bind to bipartite RNA sequences, however, the structural and functional roles of homo- and hetero-dimerization are still unclear. Here, we present the crystal structure of the QkI dimerization domain, which adopts a similar stacked helix-turn-helix arrangement as its homologs GLD-1 and Sam68, but differs by an additional helix inserted in the dimer interface. Variability of the dimer interface residues likely ensures selective homodimerization by preventing association with non-cognate STAR family proteins in the cell. Mutations that inhibit dimerization also significantly impair RNA binding in vitro, alter QkI-5 protein levels, and impair QkI function in a splicing assay in vivo. Together our results indicate that a functional Qua1 homodimerization domain is required for QkI-5 function in mammalian cells. PMID:22982292

  19. Ensemble cryo-EM elucidates the mechanism of translation fidelity

    PubMed Central

    Loveland, Anna B.; Demo, Gabriel; Grigorieff, Nikolaus; Korostelev, Andrei A.

    2017-01-01

    SUMMARY Faithful gene translation depends on accurate decoding, whose structural mechanism remains a matter of debate. Ribosomes decode mRNA codons by selecting cognate aminoacyl-tRNAs delivered by EF-Tu. We present high-resolution structural ensembles of ribosomes with cognate or near-cognate aminoacyl-tRNAs delivered by EF-Tu. Both cognate and near-cognate tRNA anticodons explore the A site of an open 30S subunit, while inactive EF-Tu is separated from the 50S subunit. A transient conformation of decoding-center nucleotide G530 stabilizes the cognate codon-anticodon helix, initiating step-wise “latching” of the decoding center. The resulting 30S domain closure docks EF-Tu at the sarcin-ricin loop of the 50S subunit, activating EF-Tu for GTP hydrolysis and ensuing aminoacyl-tRNA accommodation. By contrast, near-cognate complexes fail to induce the G530 latch, thus favoring open 30S pre-accommodation intermediates with inactive EF-Tu. This work unveils long-sought structural differences between the pre-accommodation of cognate and near-cognate tRNA that elucidate the mechanism of accurate decoding. PMID:28538735

  20. Methylation of miRNA genes and oncogenesis.

    PubMed

    Loginov, V I; Rykov, S V; Fridman, M V; Braga, E A

    2015-02-01

    Interaction between microRNA (miRNA) and messenger RNA of target genes at the posttranscriptional level provides fine-tuned dynamic regulation of cell signaling pathways. Each miRNA can be involved in regulating hundreds of protein-coding genes, and, conversely, a number of different miRNAs usually target a structural gene. Epigenetic gene inactivation associated with methylation of promoter CpG-islands is common to both protein-coding genes and miRNA genes. Here, data on functions of miRNAs in development of tumor-cell phenotype are reviewed. Genomic organization of promoter CpG-islands of the miRNA genes located in inter- and intragenic areas is discussed. The literature and our own results on frequency of CpG-island methylation in miRNA genes from tumors are summarized, and data regarding a link between such modification and changed activity of miRNA genes and, consequently, protein-coding target genes are presented. Moreover, the impact of miRNA gene methylation on key oncogenetic processes as well as affected signaling pathways is discussed.

  1. Solution structure of a modified 2′,5′-linked RNA hairpin involved in an equilibrium with duplex

    PubMed Central

    Plevnik, Miha; Gdaniec, Zofia; Plavec, Janez

    2005-01-01

    The isomerization of phosphodiester functionality of nucleic acids from 3′,5′- to a less common 2′,5′-linkage influences the complex interplay of stereoelectronic effects that drive pseudorotational equilibrium of sugar rings and thus affect the conformational propensities for compact or more extended structures. The present study highlights the subtle balance of non-covalent forces at play in structural equilibrium of 2′,5′-linked RNA analogue, 3′-O-(2-methoxyethyl) substituted dodecamer *CG*CGAA*U*U*CG*CG, 3′-MOE-2′,5′-RNA, where all cytosines and uracils are methylated at C5. The NMR and UV spectroscopic studies have shown that 3′-MOE-2′,5′-RNA adopts both hairpin and duplex secondary structures, which are involved in a dynamic exchange that is slow on the NMR timescale and exhibits strand and salt concentration as well as pH dependence. Unusual effect of pH over a narrow physiological range is observed for imino proton resonances with exchange broadening observed at lower pH and relatively sharp lines observed at higher pH. The solution structure of 3′-MOE-2′,5′-RNA hairpin displays a unique and well-defined loop, which is stabilized by Watson–Crick A5·*U8 base pair and by n → π* stacking interactions of O4′ lone-pair electrons of A6 and *U8 with aromatic rings of A5 and *U7, respectively. In contrast, the stem region of 3′-MOE-2′,5′-RNA hairpin is more flexible. Our data highlight the important feature of backbone modifications that can have pronounced effects on interstrand association of nucleic acids. PMID:15788747

  2. The m1A(58) modification in eubacterial tRNA: An overview of tRNA recognition and mechanism of catalysis by TrmI.

    PubMed

    Dégut, Clément; Ponchon, Luc; Folly-Klan, Marcia; Barraud, Pierre; Tisné, Carine

    2016-03-01

    The enzymes of the TrmI family catalyze the formation of the m(1)A58 modification in tRNA. We previously solved the crystal structure of the Thermus thermophilus enzyme and conducted a biophysical study to characterize the interaction between TrmI and tRNA. TrmI enzymes are active as a tetramer and up to two tRNAs can bind to TrmI simultaneously. In this paper, we present the structures of two TrmI mutants (D170A and Y78A). These residues are conserved in the active site of TrmIs and their mutations result in a dramatic alteration of TrmI activity. Both structures of TrmI mutants revealed the flexibility of the N-terminal domain that is probably important to bind tRNA. The structure of TrmI Y78A catalytic domain is unmodified regarding the binding of the SAM co-factor and the conformation of residues potentially interacting with the substrate adenine. This structure reinforces the previously proposed role of Y78, i.e. stabilize the conformation of the A58 ribose needed to hold the adenosine in the active site. The structure of the D170A mutant shows a flexible active site with one loop occupying in part the place of the co-factor and the second loop moving at the entrance to the active site. This structure and recent data confirms the central role of D170 residue binding the amino moiety of SAM and the exocyclic amino group of adenine. Possible mechanisms for methyl transfer are then discussed. Copyright © 2015 Elsevier B.V. All rights reserved.

  3. Identification and characterization of circular RNAs in zebrafish.

    PubMed

    Shen, Yudong; Guo, Xianwu; Wang, Weimin

    2017-01-01

    Circular RNA (circRNA), a class of RNAs with circular structure, has received little attention until recently, when some new features and functions were discovered. In the present study, we sequenced circRNAs in zebrafish (Danio rerio) and identified 3868 circRNAs using three algorithms (find_circ, CIRI, segemehl). The analysis of microRNA target sites on circRNAs shows that some circRNAs may function as miRNA sponges. Furthermore, we identified the existence of reverse complementary sequences in the flanking regions of only 25 (2.64%) exonic circRNAs, indicating that the mechanism of zebrafish exonic circRNA biogenesis might be different from that in mammals. Moreover, 1122 (29%) zebrafish circRNA sequences showed homology with human, mouse and coelacanth circRNAs. © 2016 Federation of European Biochemical Societies.

  4. Functional Architecture of T7 RNA Polymerase Transcription Complexes

    PubMed Central

    Nayak, Dhananjaya; Guo, Qing; Sousa, Rui

    2007-01-01

    Summary T7 RNA polymerase is the best-characterized member of a widespread family of single-subunit RNA polymerases. Crystal structures of T7 RNA polymerase initiation and elongation complexes have provided a wealth of detailed information on RNA polymerase interactions with the promoter and transcription bubble, but the absence of DNA downstream of the melted region of the template in the initiation complex structure, and the absence of DNA upstream of the transcription bubble in the elongation complex structure means that our picture of the functional architecture of T7 RNA polymerase transcription complexes remains incomplete. Here we use the site-specifically tethered chemical nucleases and functional characterization of directed T7 RNAP mutants to both reveal the architecture of the duplex DNA that flanks the transcription bubble in the T7 RNAP initiation and elongation complexes, and to define the function of the interactions made by these duplex elements. We find that downstream duplex interactions made with a cluster of lysines (K711/K713/K714) are present during both elongation and initiation where they contribute to stabilizing a bend in the downstream DNA that is important for promoter opening. The upstream DNA in the elongation complex is also found to be sharply bent at the upstream edge of the transcription bubble, thereby allowing formation of upstream duplex:polymerase interactions that contribute to elongation complex stability. PMID:17580086

  5. Structural and Functional Analysis of an mRNP Complex That Mediates the High Stability of Human β-Globin mRNA

    PubMed Central

    Yu, Jia; Russell, J. Eric

    2001-01-01

    Human globins are encoded by mRNAs exhibiting high stabilities in transcriptionally silenced erythrocyte progenitors. Unlike α-globin mRNA, whose stability is enhanced by assembly of a specific messenger RNP (mRNP) α complex on its 3′ untranslated region (UTR), neither the structure(s) nor the mechanism(s) that effects the high-level stability of human β-globin mRNA has been identified. The present work describes an mRNP complex assembling on the 3′ UTR of the β-globin mRNA that exhibits many of the properties of the stability-enhancing α complex. The β-globin mRNP complex is shown to contain one or more factors homologous to αCP, a 39-kDa RNA-binding protein that is integral to α-complex assembly. Sequence analysis implicates a specific 14-nucleotide pyrimidine-rich track within its 3′ UTR as the site of β-globin mRNP assembly. The importance of this track to mRNA stability is subsequently verified in vivo using mice expressing human β-globin transgenes that contain informative mutations in this region. In combination, the in vitro and in vivo analyses indicate that the high stabilities of the α- and β-globin mRNAs are maintained through related mRNP complexes that may share a common regulatory pathway. PMID:11486027

  6. Structure of RNA polymerase complex and genome within a dsRNA virus provides insights into the mechanisms of transcription and assembly.

    PubMed

    Wang, Xurong; Zhang, Fuxian; Su, Rui; Li, Xiaowu; Chen, Wenyuan; Chen, Qingxiu; Yang, Tao; Wang, Jiawei; Liu, Hongrong; Fang, Qin; Cheng, Lingpeng

    2018-06-25

    Most double-stranded RNA (dsRNA) viruses transcribe RNA plus strands within a common innermost capsid shell. This process requires coordinated efforts by RNA-dependent RNA polymerase (RdRp) together with other capsid proteins and genomic RNA. Here we report the near-atomic resolution structure of the RdRp protein VP2 in complex with its cofactor protein VP4 and genomic RNA within an aquareovirus capsid using 200-kV cryoelectron microscopy and symmetry-mismatch reconstruction. The structure of these capsid proteins enabled us to observe the elaborate nonicosahedral structure within the double-layered icosahedral capsid. Our structure shows that the RdRp complex is anchored at the inner surface of the capsid shell and interacts with genomic dsRNA and four of the five asymmetrically arranged N termini of the capsid shell proteins under the fivefold axis, implying roles for these N termini in virus assembly. The binding site of the RNA end at VP2 is different from the RNA cap binding site identified in the crystal structure of orthoreovirus RdRp λ3, although the structures of VP2 and λ3 are almost identical. A loop, which was thought to separate the RNA template and transcript, interacts with an apical domain of the capsid shell protein, suggesting a mechanism for regulating RdRp replication and transcription. A conserved nucleoside triphosphate binding site was localized in our RdRp cofactor protein VP4 structure, and interactions between the VP4 and the genomic RNA were identified.

  7. Hydration of nucleic acid fragments: comparison of theory and experiment for high-resolution crystal structures of RNA, DNA, and DNA-drug complexes.

    PubMed Central

    Hummer, G; García, A E; Soumpasis, D M

    1995-01-01

    A computationally efficient method to describe the organization of water around solvated biomolecules is presented. It is based on a statistical mechanical expression for the water-density distribution in terms of particle correlation functions. The method is applied to analyze the hydration of small nucleic acid molecules in the crystal environment, for which high-resolution x-ray crystal structures have been reported. Results for RNA [r(ApU).r(ApU)] and DNA [d(CpG).d(CpG) in Z form and with parallel strand orientation] and for DNA-drug complexes [d(CpG).d(CpG) with the drug proflavine intercalated] are described. A detailed comparison of theoretical and experimental data shows positional agreement for the experimentally observed water sites. The presented method can be used for refinement of the water structure in x-ray crystallography, hydration analysis of nuclear magnetic resonance structures, and theoretical modeling of biological macromolecules such as molecular docking studies. The speed of the computations allows hydration analyses of molecules of almost arbitrary size (tRNA, protein-nucleic acid complexes, etc.) in the crystal environment and in aqueous solution. Images FIGURE 1 FIGURE 2 FIGURE 5 FIGURE 6 FIGURE 9 FIGURE 12 FIGURE 13 PMID:7542034

  8. Functional 5' UTR mRNA structures in eukaryotic translation regulation and how to find them.

    PubMed

    Leppek, Kathrin; Das, Rhiju; Barna, Maria

    2018-03-01

    RNA molecules can fold into intricate shapes that can provide an additional layer of control of gene expression beyond that of their sequence. In this Review, we discuss the current mechanistic understanding of structures in 5' untranslated regions (UTRs) of eukaryotic mRNAs and the emerging methodologies used to explore them. These structures may regulate cap-dependent translation initiation through helicase-mediated remodelling of RNA structures and higher-order RNA interactions, as well as cap-independent translation initiation through internal ribosome entry sites (IRESs), mRNA modifications and other specialized translation pathways. We discuss known 5' UTR RNA structures and how new structure probing technologies coupled with prospective validation, particularly compensatory mutagenesis, are likely to identify classes of structured RNA elements that shape post-transcriptional control of gene expression and the development of multicellular organisms.

  9. Novel Positive-Sense, Single-Stranded RNA (+ssRNA) Virus with Di-Cistronic Genome from Intestinal Content of Freshwater Carp (Cyprinus carpio)

    PubMed Central

    Pankovics, Péter; Simmonds, Peter

    2011-01-01

    A novel positive-sense, single-stranded RNA (+ssRNA) virus (Halastavi árva RNA virus, HalV; JN000306) with di-cistronic genome organization was serendipitously identified in intestinal contents of freshwater carps (Cyprinus carpio) fished by line-fishing from fishpond “Lőrinte halastó” located in Veszprém County, Hungary. The complete nucleotide (nt) sequence of the genomic RNA is 9565 nt in length and contains two long - non-in-frame - open reading frames (ORFs), which are separated by an intergenic region. The ORF1 (replicase) is preceded by an untranslated sequence of 827 nt, while an untranslated region of 139 nt follows the ORF2 (capsid proteins). The deduced amino acid (aa) sequences of the ORFs showed only low (less than 32%) and partial similarity to the non-structural (2C-like helicase, 3C-like cystein protease and 3D-like RNA dependent RNA polymerase) and structural proteins (VP2/VP4/VP3) of virus families in Picornavirales especially to members of the viruses with dicistronic genome. Halastavi árva RNA virus is present in intestinal contents of omnivorous freshwater carps but the origin and the host species of this virus remains unknown. The unique viral sequence and the actual position indicate that Halastavi árva RNA virus seems to be the first member of a new di-cistronic ssRNA virus. Further studies are required to investigate the specific host species (and spectrum), ecology and role of Halastavi árva RNA virus in the nature. PMID:22195010

  10. Inverted repeat Alu elements in the human lincRNA-p21 adopt a conserved secondary structure that regulates RNA function

    PubMed Central

    Chillón, Isabel; Pyle, Anna M.

    2016-01-01

    LincRNA-p21 is a long intergenic non-coding RNA (lincRNA) involved in the p53-mediated stress response. We sequenced the human lincRNA-p21 (hLincRNA-p21) and found that it has a single exon that includes inverted repeat Alu elements (IRAlus). Sense and antisense Alu elements fold independently of one another into a secondary structure that is conserved in lincRNA-p21 among primates. Moreover, the structures formed by IRAlus are involved in the localization of hLincRNA-p21 in the nucleus, where hLincRNA-p21 colocalizes with paraspeckles. Our results underscore the importance of IRAlus structures for the function of hLincRNA-p21 during the stress response. PMID:27378782

  11. Small-interfering RNA (siRNA)-based functional micro- and nanostructures for efficient and selective gene silencing.

    PubMed

    Lee, Soo Hyeon; Chung, Bong Hyun; Park, Tae Gwan; Nam, Yoon Sung; Mok, Hyejung

    2012-07-17

    Because of RNA's ability to encode structure and functional information, researchers have fabricated diverse geometric structures from this polymer at the micro- and nanoscale. With their tunable structures, rigidity, and biocompatibility, novel two-dimensional and three-dimensional RNA structures can serve as a fundamental platform for biomedical applications, including engineered tissues, biosensors, and drug delivery vehicles. The discovery of the potential of small-interfering RNA (siRNA) has underscored the applications of RNA-based micro- and nanostructures in medicine. Small-interfering RNA (siRNA), synthetic double-stranded RNA consisting of approximately 21 base pairs, suppresses problematic target genes in a sequence-specific manner via inherent RNA interference (RNAi) processing. As a result, siRNA offers a potential strategy for treatment of many human diseases. However, due to inefficient delivery to cells and off-target effects, the clinical application of therapeutic siRNA has been very challenging. To address these issues, researchers have studied a variety of nanocarrier systems for siRNA delivery. In this Account, we describe several strategies for efficient siRNA delivery and selective gene silencing. We took advantage of facile chemical conjugation and complementary hybridization to design novel siRNA-based micro- and nanostructures. Using chemical crosslinkers and hydrophobic/hydrophilic polymers at the end of siRNA, we produced various RNA-based structures, including siRNA block copolymers, micelles, linear siRNA homopolymers, and microhydrogels. Because of their increased charge density and flexibility compared with conventional siRNA, these micro- and nanostructures can form polyelectrolyte complexes with poorly charged and biocompatible cationic carriers that are both more condensed and more homogenous than the complexes formed in other carrier systems. In addition, the fabricated siRNA-based structures are linked by cleavable disulfide bonds for facile generation of original siRNA in the cytosol and for target-specific gene silencing. These newly developed siRNA-based structures greatly enhance intracellular uptake and gene silencing both in vitro and in vivo, making them promising biomaterials for siRNA therapeutics.

  12. A high-throughput and rapid computational method for screening of RNA post-transcriptional modifications that can be recognized by target proteins.

    PubMed

    Orr, Asuka A; Gonzalez-Rivera, Juan C; Wilson, Mark; Bhikha, P Reena; Wang, Daiqi; Contreras, Lydia M; Tamamis, Phanourios

    2018-02-01

    There are over 150 currently known, highly diverse chemically modified RNAs, which are dynamic, reversible, and can modulate RNA-protein interactions. Yet, little is known about the wealth of such interactions. This can be attributed to the lack of tools that allow the rapid study of all the potential RNA modifications that might mediate RNA-protein interactions. As a promising step toward this direction, here we present a computational protocol for the characterization of interactions between proteins and RNA containing post-transcriptional modifications. Given an RNA-protein complex structure, potential RNA modified ribonucleoside positions, and molecular mechanics parameters for capturing energetics of RNA modifications, our protocol operates in two stages. In the first stage, a decision-making tool, comprising short simulations and interaction energy calculations, performs a fast and efficient search in a high-throughput fashion, through a list of different types of RNA modifications categorized into trees according to their structural and physicochemical properties, and selects a subset of RNA modifications prone to interact with the target protein. In the second stage, RNA modifications that are selected as recognized by the protein are examined in-detail using all-atom simulations and free energy calculations. We implement and experimentally validate this protocol in a test case involving the study of RNA modifications in complex with Escherichia coli (E. coli) protein Polynucleotide Phosphorylase (PNPase), depicting the favorable interaction between 8-oxo-7,8-dihydroguanosine (8-oxoG) RNA modification and PNPase. Further advancement of the protocol can broaden our understanding of protein interactions with all known RNA modifications in several systems. Copyright © 2018 Elsevier Inc. All rights reserved.

  13. Global identification of target recognition and cleavage by the Microprocessor in human ES cells.

    PubMed

    Seong, Youngmo; Lim, Do-Hwan; Kim, Augustine; Seo, Jae Hong; Lee, Young Sik; Song, Hoseok; Kwon, Young-Soo

    2014-11-10

    The Microprocessor plays an essential role in canonical miRNA biogenesis by facilitating cleavage of stem-loop structures in primary transcripts to yield pre-miRNAs. Although miRNA biogenesis has been extensively studied through biochemical and molecular genetic approaches, it has yet to be addressed to what extent the current miRNA biogenesis models hold true in intact cells. To address the issues of in vivo recognition and cleavage by the Microprocessor, we investigate RNAs that are associated with DGCR8 and Drosha by using immunoprecipitation coupled with next-generation sequencing. Here, we present global protein-RNA interactions with unprecedented sensitivity and specificity. Our data indicate that precursors of canonical miRNAs and miRNA-like hairpins are the major substrates of the Microprocessor. As a result of specific enrichment of nascent cleavage products, we are able to pinpoint the Microprocessor-mediated cleavage sites per se at single-nucleotide resolution. Unexpectedly, a 2-nt 3' overhang invariably exists at the ends of cleaved bases instead of nascent pre-miRNAs. Besides canonical miRNA precursors, we find that two novel miRNA-like structures embedded in mRNAs are cleaved to yield pre-miRNA-like hairpins, uncoupled from miRNA maturation. Our data provide a framework for in vivo Microprocessor-mediated cleavage and a foundation for experimental and computational studies on miRNA biogenesis in living cells. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  14. 2-D Structure of the A Region of Xist RNA and Its Implication for PRC2 Association

    PubMed Central

    Maenner, Sylvain; Blaud, Magali; Fouillen, Laetitia; Savoye, Anne; Marchand, Virginie; Dubois, Agnès; Sanglier-Cianférani, Sarah; Van Dorsselaer, Alain; Clerc, Philippe; Avner, Philip; Visvikis, Athanase; Branlant, Christiane

    2010-01-01

    In placental mammals, inactivation of one of the X chromosomes in female cells ensures sex chromosome dosage compensation. The 17 kb non-coding Xist RNA is crucial to this process and accumulates on the future inactive X chromosome. The most conserved Xist RNA region, the A region, contains eight or nine repeats separated by U-rich spacers. It is implicated in the recruitment of late inactivated X genes to the silencing compartment and likely in the recruitment of complex PRC2. Little is known about the structure of the A region and more generally about Xist RNA structure. Knowledge of its structure is restricted to an NMR study of a single A repeat element. Our study is the first experimental analysis of the structure of the entire A region in solution. By the use of chemical and enzymatic probes and FRET experiments, using oligonucleotides carrying fluorescent dyes, we resolved problems linked to sequence redundancies and established a 2-D structure for the A region that contains two long stem-loop structures each including four repeats. Interactions formed between repeats and between repeats and spacers stabilize these structures. Conservation of the spacer terminal sequences allows formation of such structures in all sequenced Xist RNAs. By combination of RNP affinity chromatography, immunoprecipitation assays, mass spectrometry, and Western blot analysis, we demonstrate that the A region can associate with components of the PRC2 complex in mouse ES cell nuclear extracts. Whilst a single four-repeat motif is able to associate with components of this complex, recruitment of Suz12 is clearly more efficient when the entire A region is present. Our data with their emphasis on the importance of inter-repeat pairing change fundamentally our conception of the 2-D structure of the A region of Xist RNA and support its possible implication in recruitment of the PRC2 complex. PMID:20052282

  15. Designing and Testing Functional RNA Nanoparticles | Center for Cancer Research

    Cancer.gov

    Recent advances in nanotechnology have generated excitement that nanomaterials may provide novel approaches for the diagnosis and treatment of deadly diseases, such as cancer. However, the use of synthetic materials to generate nanoparticles can present challenges with endotoxin content, sterility, or biocompatibility. Employing biological materials may overcome these issues with RNA being particularly attractive given the clinical applications of RNA interference and the abundance of functional RNAs, including aptamers and ribozymes. RNA can form stable three-dimensional nanoparticle structures that can be decorated with other nucleic acids, small molecules, or proteins, potentially increasing local concentrations of therapeutic agents and acting synergistically when combined.

  16. RNA folding: structure prediction, folding kinetics and ion electrostatics.

    PubMed

    Tan, Zhijie; Zhang, Wenbing; Shi, Yazhou; Wang, Fenghua

    2015-01-01

    Beyond the "traditional" functions such as gene storage, transport and protein synthesis, recent discoveries reveal that RNAs have important "new" biological functions including the RNA silence and gene regulation of riboswitch. Such functions of noncoding RNAs are strongly coupled to the RNA structures and proper structure change, which naturally leads to the RNA folding problem including structure prediction and folding kinetics. Due to the polyanionic nature of RNAs, RNA folding structure, stability and kinetics are strongly coupled to the ion condition of solution. The main focus of this chapter is to review the recent progress in the three major aspects in RNA folding problem: structure prediction, folding kinetics and ion electrostatics. This chapter will introduce both the recent experimental and theoretical progress, while emphasize the theoretical modelling on the three aspects in RNA folding.

  17. Structure of Hepatitis C Virus Polymerase in Complex with Primer-Template RNA

    PubMed Central

    Murakami, Eisuke; Lam, Angela M.; Grice, Rena L.; Du, Jinfa; Sofia, Michael J.; Furman, Philip A.; Otto, Michael J.

    2012-01-01

    The replication of the hepatitis C viral (HCV) genome is accomplished by the NS5B RNA-dependent RNA polymerase (RdRp), for which mechanistic understanding and structure-guided drug design efforts have been hampered by its propensity to crystallize in a closed, polymerization-incompetent state. The removal of an autoinhibitory β-hairpin loop from genotype 2a HCV NS5B increases de novo RNA synthesis by >100-fold, promotes RNA binding, and facilitated the determination of the first crystallographic structures of HCV polymerase in complex with RNA primer-template pairs. These crystal structures demonstrate the structural realignment required for primer-template recognition and elongation, provide new insights into HCV RNA synthesis at the molecular level, and may prove useful in the structure-based design of novel antiviral compounds. Additionally, our approach for obtaining the RNA primer-template-bound structure of HCV polymerase may be generally applicable to solving RNA-bound complexes for other viral RdRps that contain similar regulatory β-hairpin loops, including bovine viral diarrhea virus, dengue virus, and West Nile virus. PMID:22496223

  18. Exploration of RNA structure spaces

    NASA Technical Reports Server (NTRS)

    Fox, G. E.

    1991-01-01

    In order to understand the structure of real structure spaces, we are studying the 5S rRNA structure space experimentally. A plasmid containing a synthetic 5S rRNA gene, two rRNA promoters, and transcription terminators has been assembled. Assays are conducted to determine if the foreign 5S rRNA is expressed, and to see whether or not it is incorporated into ribosomes. Evolutionary competition is used to determine the relative fitness of strains containing the foreign 5S rRNA and a control 5S rRNA. By using site directed mutagenesis, a number of mutants can be made in order to study the boundaries of the structure space and how sharply defined they are. By making similar studies in the vicinity of structure space, it will be possible to determine how homogeneous the 5S rRNA structure space is. Useable experimental protocols have been developed, and a number of mutants have already been studied. Initial results suggest an explanation of why single stranded regions of the RNA are less subject to mutation than double stranded regions.

  19. INFO-RNA--a fast approach to inverse RNA folding.

    PubMed

    Busch, Anke; Backofen, Rolf

    2006-08-01

    The structure of RNA molecules is often crucial for their function. Therefore, secondary structure prediction has gained much interest. Here, we consider the inverse RNA folding problem, which means designing RNA sequences that fold into a given structure. We introduce a new algorithm for the inverse folding problem (INFO-RNA) that consists of two parts; a dynamic programming method for good initial sequences and a following improved stochastic local search that uses an effective neighbor selection method. During the initialization, we design a sequence that among all sequences adopts the given structure with the lowest possible energy. For the selection of neighbors during the search, we use a kind of look-ahead of one selection step applying an additional energy-based criterion. Afterwards, the pre-ordered neighbors are tested using the actual optimization criterion of minimizing the structure distance between the target structure and the mfe structure of the considered neighbor. We compared our algorithm to RNAinverse and RNA-SSD for artificial and biological test sets. Using INFO-RNA, we performed better than RNAinverse and in most cases, we gained better results than RNA-SSD, the probably best inverse RNA folding tool on the market. www.bioinf.uni-freiburg.de?Subpages/software.html.

  20. Distinct families of cis-acting RNA replication elements epsilon from hepatitis B viruses

    PubMed Central

    Chen, Augustine; Brown, Chris

    2012-01-01

    The hepadnavirus encapsidation signal, epsilon (ε), is an RNA structure located at the 5′ end of the viral pregenomic RNA. It is essential for viral replication and functions in polymerase protein binding and priming. This structure could also have potential regulatory roles in controlling the expression of viral replicative proteins. In addition to its structure, the primary sequence of this RNA element has crucial functional roles in the viral lifecycle. Although the ε elements in hepadnaviruses share common critical functions, there are some significant differences in mammalian and avian hepadnaviruses, which include both sequence and structural variations.   Here we present several covariance models for ε elements from the Hepadnaviridae. The model building included experimentally determined data from previous studies using chemical probing and NMR analysis. These models have sufficient similarity to comprise a clan. The clan has in common a highly conserved overall structure consisting of a lower-stem, bulge, upper-stem and apical-loop. The models differ in functionally critical regions—notably the two types of avian ε elements have a tetra-loop (UGUU) including a non-canonical UU base pair, while the hepatitis B virus (HBV) epsilon has a tri-loop (UGU). The avian epsilon elements have a less stable dynamic structure in the upper stem. Comparisons between these models and all other Rfam models, and searches of genomes, showed these structures are specific to the Hepadnaviridae. Two family models and the clan are available from the Rfam database. PMID:22418844

  1. RNA-Puzzles: A CASP-like evaluation of RNA three-dimensional structure prediction

    PubMed Central

    Cruz, José Almeida; Blanchet, Marc-Frédérick; Boniecki, Michal; Bujnicki, Janusz M.; Chen, Shi-Jie; Cao, Song; Das, Rhiju; Ding, Feng; Dokholyan, Nikolay V.; Flores, Samuel Coulbourn; Huang, Lili; Lavender, Christopher A.; Lisi, Véronique; Major, François; Mikolajczak, Katarzyna; Patel, Dinshaw J.; Philips, Anna; Puton, Tomasz; Santalucia, John; Sijenyi, Fredrick; Hermann, Thomas; Rother, Kristian; Rother, Magdalena; Serganov, Alexander; Skorupski, Marcin; Soltysinski, Tomasz; Sripakdeevong, Parin; Tuszynska, Irina; Weeks, Kevin M.; Waldsich, Christina; Wildauer, Michael; Leontis, Neocles B.; Westhof, Eric

    2012-01-01

    We report the results of a first, collective, blind experiment in RNA three-dimensional (3D) structure prediction, encompassing three prediction puzzles. The goals are to assess the leading edge of RNA structure prediction techniques; compare existing methods and tools; and evaluate their relative strengths, weaknesses, and limitations in terms of sequence length and structural complexity. The results should give potential users insight into the suitability of available methods for different applications and facilitate efforts in the RNA structure prediction community in ongoing efforts to improve prediction tools. We also report the creation of an automated evaluation pipeline to facilitate the analysis of future RNA structure prediction exercises. PMID:22361291

  2. Sequence characterization of 5S ribosomal RNA from eight gram positive procaryotes

    NASA Technical Reports Server (NTRS)

    Woese, C. R.; Luehrsen, K. R.; Pribula, C. D.; Fox, G. E.

    1976-01-01

    Complete nucleotide sequences are presented for 5S rRNA from Bacillus subtilis, B. firmus, B. pasteurii, B. brevis, Lactobacillus brevis, and Streptococcus faecalis, and 5S rRNA oligonucleotide catalogs and partial sequence data are given for B. cereus and Sporosarcina ureae. These data demonstrate a striking consistency of 5S rRNA primary and secondary structure within a given bacterial grouping. An exception is B. brevis, in which the 5S rRNA sequence varies significantly from that of other bacilli in the tuned helix and the procaryotic loop. The localization of these variations suggests that B. brevis occupies an ecological niche that selects such changes. It is noted that this organism produces antibiotics which affect ribosome function.

  3. SimRNAweb: a web server for RNA 3D structure modeling with optional restraints.

    PubMed

    Magnus, Marcin; Boniecki, Michał J; Dawson, Wayne; Bujnicki, Janusz M

    2016-07-08

    RNA function in many biological processes depends on the formation of three-dimensional (3D) structures. However, RNA structure is difficult to determine experimentally, which has prompted the development of predictive computational methods. Here, we introduce a user-friendly online interface for modeling RNA 3D structures using SimRNA, a method that uses a coarse-grained representation of RNA molecules, utilizes the Monte Carlo method to sample the conformational space, and relies on a statistical potential to describe the interactions in the folding process. SimRNAweb makes SimRNA accessible to users who do not normally use high performance computational facilities or are unfamiliar with using the command line tools. The simplest input consists of an RNA sequence to fold RNA de novo. Alternatively, a user can provide a 3D structure in the PDB format, for instance a preliminary model built with some other technique, to jump-start the modeling close to the expected final outcome. The user can optionally provide secondary structure and distance restraints, and can freeze a part of the starting 3D structure. SimRNAweb can be used to model single RNA sequences and RNA-RNA complexes (up to 52 chains). The webserver is available at http://genesilico.pl/SimRNAweb. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  4. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tan, Kemin; Johnson, Parker M.; Stols, Lucy

    Contact-dependent growth inhibition (CDI) is an important mechanism of intercellular competition between neighboring Gram-negative bacteria. CDI systems encode large surface-exposed CdiA effector proteins that carry a variety of C-terminal toxin domains (CdiA-CTs). All CDI +bacteria also produce CdiI immunity proteins that specifically bind to the cognate CdiA-CT and neutralize its toxin activity to prevent auto-inhibition. Here, the X-ray crystal structure of a CdiI immunity protein fromNeisseria meningitidisMC58 is presented at 1.45 Å resolution. The CdiI protein has structural homology to the Whirly family of RNA-binding proteins, but appears to lack the characteristic nucleic acid-binding motif of this family. Sequence homologymore » suggests that the cognate CdiA-CT is related to the eukaryotic EndoU family of RNA-processing enzymes. A homology model is presented of the CdiA-CT based on the structure of the XendoU nuclease fromXenopus laevis. Molecular-docking simulations predict that the CdiA-CT toxin active site is occluded upon binding to the CdiI immunity protein. Together, these observations suggest that the immunity protein neutralizes toxin activity by preventing access to RNA substrates.« less

  5. Closing loop base pairs in RNA loop-loop complexes: structural behavior, interaction energy and solvation analysis through molecular dynamics simulations.

    PubMed

    Golebiowski, Jérôme; Antonczak, Serge; Fernandez-Carmona, Juan; Condom, Roger; Cabrol-Bass, Daniel

    2004-12-01

    Nanosecond molecular dynamics using the Ewald summation method have been performed to elucidate the structural and energetic role of the closing base pair in loop-loop RNA duplexes neutralized by Mg2+ counterions in aqueous phases. Mismatches GA, CU and Watson-Crick GC base pairs have been considered for closing the loop of an RNA in complementary interaction with HIV-1 TAR. The simulations reveal that the mismatch GA base, mediated by a water molecule, leads to a complex that presents the best compromise between flexibility and energetic contributions. The mismatch CU base pair, in spite of the presence of an inserted water molecule, is too short to achieve a tight interaction at the closing-loop junction and seems to force TAR to reorganize upon binding. An energetic analysis has allowed us to quantify the strength of the interactions of the closing and the loop-loop pairs throughout the simulations. Although the water-mediated GA closing base pair presents an interaction energy similar to that found on fully geometry-optimized structure, the water-mediated CU closing base pair energy interaction reaches less than half the optimal value.

  6. Using in-cell SHAPE-Seq and simulations to probe structure-function design principles of RNA transcriptional regulators.

    PubMed

    Takahashi, Melissa K; Watters, Kyle E; Gasper, Paul M; Abbott, Timothy R; Carlson, Paul D; Chen, Alan A; Lucks, Julius B

    2016-06-01

    Antisense RNA-mediated transcriptional regulators are powerful tools for controlling gene expression and creating synthetic gene networks. RNA transcriptional repressors derived from natural mechanisms called attenuators are particularly versatile, though their mechanistic complexity has made them difficult to engineer. Here we identify a new structure-function design principle for attenuators that enables the forward engineering of new RNA transcriptional repressors. Using in-cell SHAPE-Seq to characterize the structures of attenuator variants within Escherichia coli, we show that attenuator hairpins that facilitate interaction with antisense RNAs require interior loops for proper function. Molecular dynamics simulations of these attenuator variants suggest these interior loops impart structural flexibility. We further observe hairpin flexibility in the cellular structures of natural RNA mechanisms that use antisense RNA interactions to repress translation, confirming earlier results from in vitro studies. Finally, we design new transcriptional attenuators in silico using an interior loop as a structural requirement and show that they function as desired in vivo. This work establishes interior loops as an important structural element for designing synthetic RNA gene regulators. We anticipate that the coupling of experimental measurement of cellular RNA structure and function with computational modeling will enable rapid discovery of structure-function design principles for a diverse array of natural and synthetic RNA regulators. © 2016 Takahashi et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  7. A new way to see RNA

    PubMed Central

    Keating, Kevin S.; Humphris, Elisabeth L.; Pyle, Anna Marie

    2015-01-01

    Unlike proteins, the RNA backbone has numerous degrees of freedom (eight, if one counts the sugar pucker), making RNA modeling, structure building and prediction a multidimensional problem of exceptionally high complexity. And yet RNA tertiary structures are not infinite in their structural morphology; rather, they are built from a limited set of discrete units. In order to reduce the dimensionality of the RNA backbone in a physically reasonable way, a shorthand notation was created that reduced the RNA backbone torsion angles to two (η and θ, analogous to ϕ and ψ in proteins). When these torsion angles are calculated for nucleotides in a crystallographic database and plotted against one another, one obtains a plot analogous to a Ramachandran plot (the η/θ plot), with highly populated and unpopulated regions. Nucleotides that occupy proximal positions on the plot have identical structures and are found in the same units of tertiary structure. In this review, we describe the statistical validation of the η/θ formalism and the exploration of features within the η/θ plot. We also describe the application of the η/θ formalism in RNA motif discovery, structural comparison, RNA structure building and tertiary structure prediction. More than a tool, however, the η/θ formalism has provided new insights into RNA structure itself, revealing its fundamental components and the factors underlying RNA architectural form. PMID:21729350

  8. On structural transitions, thermodynamic equilibrium, and the phase diagram of DNA and RNA duplexes under torque and tension.

    PubMed

    Wereszczynski, Jeff; Andricioaei, Ioan

    2006-10-31

    A precise understanding of the flexibility of double stranded nucleic acids and the nature of their deformed conformations induced by external forces is important for a wide range of biological processes including transcriptional regulation, supercoil and catenane removal, and site-specific recombination. We present, at atomic resolution, a simulation of the dynamics involved in the transitions from B-DNA and A-RNA to Pauling (P) forms and to denatured states driven by application of external torque and tension. We then calculate the free energy profile along a B- to P-transition coordinate and from it, compute a reversible pathway, i.e., an isotherm of tension and torque pairs required to maintain P-DNA in equilibrium. The reversible isotherm maps correctly onto a phase diagram derived from single molecule experiments, and yields values of elongation, twist, and twist-stretch coupling in agreement with measured values. We also show that configurational entropy compensates significantly for the large electrostatic energy increase due to closer-packed P backbones. A similar set of simulations applied to RNA are used to predict a novel structure, P-RNA, with its associated free energy, equilibrium tension, torque and structural parameters, and to assign the location, on the phase-diagram, of a putative force-torque-dependent RNA "triple point."

  9. High-affinity RNA aptamers to C-reactive protein (CRP): newly developed pre-elution methods for aptamer selection

    NASA Astrophysics Data System (ADS)

    Orito, N.; Umekage, S.; Sato, K.; Kawauchi, S.; Tanaka, H.; Sakai, E.; Tanaka, T.; Kikuchi, Y.

    2012-03-01

    We have developed a modified SELEX (systematic evolution of ligands by exponential enrichment) method to obtain RNA aptamers with high affinity to C-reactive protein (CRP). CRP is a clinical biomarker present in plasma, the level of which increases in response to infections and noninfectious inflammation. The CRP level is also an important prognostic indicator in patients with several syndromes. At present, CRP content in blood is measured immunochemically using antibodies. To develop a more sensitive method using RNA aptamers, we have attempted to obtain high-affinity RNA aptamers to CRP. We succeeded in obtaining an RNA aptamer with high affinity to CRP using a CRP-immobilized Sepharose column and pre-elution procedure. Pre-elution is a method that removes the weak binding portion from a selected RNA population by washing for a short time with buffer containing CRP. By surface plasmon-resonance (SPR) analysis, the affinity constant of this aptamer for CRP was calculated to be KD = 2.25×10-9 (M). The secondary structure, contact sites with CRP protein, and application of this aptamer will be described.

  10. RNA-Puzzles Round III: 3D RNA structure prediction of five riboswitches and one ribozyme

    PubMed Central

    Biesiada, Marcin; Boniecki, Michał J.; Chou, Fang-Chieh; Ferré-D'Amaré, Adrian R.; Das, Rhiju; Dunin-Horkawicz, Stanisław; Geniesse, Caleb; Kappel, Kalli; Kladwang, Wipapat; Krokhotin, Andrey; Łach, Grzegorz E.; Major, François; Mann, Thomas H.; Pachulska-Wieczorek, Katarzyna; Patel, Dinshaw J.; Piccirilli, Joseph A.; Popenda, Mariusz; Purzycka, Katarzyna J.; Ren, Aiming; Rice, Greggory M.; Santalucia, John; Tandon, Arpit; Trausch, Jeremiah J.; Wang, Jian; Weeks, Kevin M.; Williams, Benfeard; Xiao, Yi; Zhang, Dong; Zok, Tomasz

    2017-01-01

    RNA-Puzzles is a collective experiment in blind 3D RNA structure prediction. We report here a third round of RNA-Puzzles. Five puzzles, 4, 8, 12, 13, 14, all structures of riboswitch aptamers and puzzle 7, a ribozyme structure, are included in this round of the experiment. The riboswitch structures include biological binding sites for small molecules (S-adenosyl methionine, cyclic diadenosine monophosphate, 5-amino 4-imidazole carboxamide riboside 5′-triphosphate, glutamine) and proteins (YbxF), and one set describes large conformational changes between ligand-free and ligand-bound states. The Varkud satellite ribozyme is the most recently solved structure of a known large ribozyme. All puzzles have established biological functions and require structural understanding to appreciate their molecular mechanisms. Through the use of fast-track experimental data, including multidimensional chemical mapping, and accurate prediction of RNA secondary structure, a large portion of the contacts in 3D have been predicted correctly leading to similar topologies for the top ranking predictions. Template-based and homology-derived predictions could predict structures to particularly high accuracies. However, achieving biological insights from de novo prediction of RNA 3D structures still depends on the size and complexity of the RNA. Blind computational predictions of RNA structures already appear to provide useful structural information in many cases. Similar to the previous RNA-Puzzles Round II experiment, the prediction of non-Watson–Crick interactions and the observed high atomic clash scores reveal a notable need for an algorithm of improvement. All prediction models and assessment results are available at http://ahsoka.u-strasbg.fr/rnapuzzles/. PMID:28138060

  11. Studies on sex-organ development. Changes in chromatin structure during spermatogenesis in maturing rooster testis as demonstrated by the initiation pattern of ribonucleic acid synthesis in vitro.

    PubMed Central

    Mezquita, C; Teng, C S

    1978-01-01

    To probe the structural change in the genome of the differentiating germ cell of the maturing rooster testis, the chromatin from nuclei at various stages of differentiation were transcribed with prokaryotic RNA polymerase from Escherichia coli or with eukaryotic RNA polymerase II from wheat germ. The transcription was performed under conditions of blockage of RNA chain reinitiation in vitro with rifampicin or rifampicin AF/013. With the E. coli enzyme, the changes in (1) the titration curve for the enzyme-chromatin interaction, (2) the number of initiation sites, (3) the rate of elongation of RNA chains, and (4) the kinetics of the formation of stable initiation complexes revealed the unmasking of DNA in elongated spermatids and the masking of DNA in spermatozoa. In both cases the stability of the DNA duplex in the initiation region for RNA synthesis greatly increased. In contrast with the E. coli enzyme, the wheat-germ RNA polymerase II was relatively inefficient at transcribing chromatin of elongated spermatids. Such behaviour can be predicted if unmasked double-stranded DNA is present in elongated spermatids. PMID:346018

  12. ε, a new subunit of RNA polymerase found in gram-positive bacteria.

    PubMed

    Keller, Andrew N; Yang, Xiao; Wiedermannová, Jana; Delumeau, Olivier; Krásný, Libor; Lewis, Peter J

    2014-10-01

    RNA polymerase in bacteria is a multisubunit protein complex that is essential for gene expression. We have identified a new subunit of RNA polymerase present in the high-A+T Firmicutes phylum of Gram-positive bacteria and have named it ε. Previously ε had been identified as a small protein (ω1) that copurified with RNA polymerase. We have solved the structure of ε by X-ray crystallography and show that it is not an ω subunit. Rather, ε bears remarkable similarity to the Gp2 family of phage proteins involved in the inhibition of host cell transcription following infection. Deletion of ε shows no phenotype and has no effect on the transcriptional profile of the cell. Determination of the location of ε within the assembly of RNA polymerase core by single-particle analysis suggests that it binds toward the downstream side of the DNA binding cleft. Due to the structural similarity of ε with Gp2 and the fact they bind similar regions of RNA polymerase, we hypothesize that ε may serve a role in protection from phage infection. Copyright © 2014, American Society for Microbiology. All Rights Reserved.

  13. Cas5d Protein Processes Pre-crRNA and Assembles into a Cascade-like Interference Complex in Subtype I-C/Dvulg CRISPR-Cas System

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Nam, Ki Hyun; Haitjema, Charles; Liu, Xueqi

    Clustered regularly interspaced short palindromic repeats (CRISPRs), together with an operon of CRISPR-associated (Cas) proteins, form an RNA-based prokaryotic immune system against exogenous genetic elements. Cas5 family proteins are found in several type I CRISPR-Cas systems. Here, we report the molecular function of subtype I-C/Dvulg Cas5d from Bacillus halodurans. We show that Cas5d cleaves pre-crRNA into unit length by recognizing both the hairpin structure and the 3 single stranded sequence in the CRISPR repeat region. Cas5d structure reveals a ferredoxin domain-based architecture and a catalytic triad formed by Y46, K116, and H117 residues. We further show that after pre-crRNA processing,more » Cas5d assembles with crRNA, Csd1, and Csd2 proteins to form a multi-sub-unit interference complex similar to Escherichia coli Cascade (CRISPR-associated complex for antiviral defense) in architecture. Our results suggest that formation of a crRNA-presenting Cascade-like complex is likely a common theme among type I CRISPR subtypes.« less

  14. FRASS: the web-server for RNA structural comparison

    PubMed Central

    2010-01-01

    Background The impressive increase of novel RNA structures, during the past few years, demands automated methods for structure comparison. While many algorithms handle only small motifs, few techniques, developed in recent years, (ARTS, DIAL, SARA, SARSA, and LaJolla) are available for the structural comparison of large and intact RNA molecules. Results The FRASS web-server represents a RNA chain with its Gauss integrals and allows one to compare structures of RNA chains and to find similar entries in a database derived from the Protein Data Bank. We observed that FRASS scores correlate well with the ARTS and LaJolla similarity scores. Moreover, the-web server can also reproduce satisfactorily the DARTS classification of RNA 3D structures and the classification of the SCOR functions that was obtained by the SARA method. Conclusions The FRASS web-server can be easily used to detect relationships among RNA molecules and to scan efficiently the rapidly enlarging structural databases. PMID:20553602

  15. Functional 5′ UTR mRNA structures in eukaryotic translation regulation and how to find them

    PubMed Central

    Leppek, Kathrin; Das, Rhiju; Barna, Maria

    2017-01-01

    RNA molecules can fold into intricate shapes that can provide an additional layer of control of gene expression beyond that of their sequence. In this Review, we discuss the current mechanistic understanding of structures in 5′ untranslated regions (UTRs) of eukaryotic mRNAs and the emerging methodologies used to explore them. These structures may regulate cap-dependent translation initiation through helicase-mediated remodelling of RNA structures and higher-order RNA interactions, as well as cap-independent translation initiation through internal ribosome entry sites (IRESs), mRNA modifications and other specialized translation pathways. We discuss known 5′ UTR RNA structures and how new structure probing technologies coupled with prospective validation, particularly compensatory mutagenesis, are likely to identify classes of structured RNA elements that shape post-transcriptional control of gene expression and the development of multicellular organisms. PMID:29165424

  16. Evolutionary plasticity of the NHL domain underlies distinct solutions to RNA recognition.

    PubMed

    Kumari, Pooja; Aeschimann, Florian; Gaidatzis, Dimos; Keusch, Jeremy J; Ghosh, Pritha; Neagu, Anca; Pachulska-Wieczorek, Katarzyna; Bujnicki, Janusz M; Gut, Heinz; Großhans, Helge; Ciosk, Rafal

    2018-04-19

    RNA-binding proteins regulate all aspects of RNA metabolism. Their association with RNA is mediated by RNA-binding domains, of which many remain uncharacterized. A recently reported example is the NHL domain, found in prominent regulators of cellular plasticity like the C. elegans LIN-41. Here we employ an integrative approach to dissect the RNA specificity of LIN-41. Using computational analysis, structural biology, and in vivo studies in worms and human cells, we find that a positively charged pocket, specific to the NHL domain of LIN-41 and its homologs (collectively LIN41), recognizes a stem-loop RNA element, whose shape determines the binding specificity. Surprisingly, the mechanism of RNA recognition by LIN41 is drastically different from that of its more distant relative, the fly Brat. Our phylogenetic analysis suggests that this reflects a rapid evolution of the domain, presenting an interesting example of a conserved protein fold that acquired completely different solutions to RNA recognition.

  17. An analytical platform for mass spectrometry-based identification and chemical analysis of RNA in ribonucleoprotein complexes.

    PubMed

    Taoka, Masato; Yamauchi, Yoshio; Nobe, Yuko; Masaki, Shunpei; Nakayama, Hiroshi; Ishikawa, Hideaki; Takahashi, Nobuhiro; Isobe, Toshiaki

    2009-11-01

    We describe here a mass spectrometry (MS)-based analytical platform of RNA, which combines direct nano-flow reversed-phase liquid chromatography (RPLC) on a spray tip column and a high-resolution LTQ-Orbitrap mass spectrometer. Operating RPLC under a very low flow rate with volatile solvents and MS in the negative mode, we could estimate highly accurate mass values sufficient to predict the nucleotide composition of a approximately 21-nucleotide small interfering RNA, detect post-transcriptional modifications in yeast tRNA, and perform collision-induced dissociation/tandem MS-based structural analysis of nucleolytic fragments of RNA at a sub-femtomole level. Importantly, the method allowed the identification and chemical analysis of small RNAs in ribonucleoprotein (RNP) complex, such as the pre-spliceosomal RNP complex, which was pulled down from cultured cells with a tagged protein cofactor as bait. We have recently developed a unique genome-oriented database search engine, Ariadne, which allows tandem MS-based identification of RNAs in biological samples. Thus, the method presented here has broad potential for automated analysis of RNA; it complements conventional molecular biology-based techniques and is particularly suited for simultaneous analysis of the composition, structure, interaction, and dynamics of RNA and protein components in various cellular RNP complexes.

  18. Interaction of zanamivir with DNA and RNA: Models for drug DNA and drug RNA bindings

    NASA Astrophysics Data System (ADS)

    Nafisi, Shohreh; Kahangi, Fatemeh Ghoreyshi; Azizi, Ebrahim; Zebarjad, Nader; Tajmir-Riahi, Heidar-Ali

    2007-03-01

    Zanamivir (ZAN) is the first of a new generation of influenza virus-specific drugs known as neuraminidase inhibitors, which acts by interfering with life cycles of influenza viruses A and B. It prevents the virus spreading infection to other cells by blocking the neuraminidase enzyme present on the surface of the virus. The aim of this study was to examine the stability and structural features of calf thymus DNA and yeast RNA complexes with zanamivir in aqueous solution, using constant DNA or RNA concentration (12.5 mM) and various zanamivir/polynucleotide ( P) ratios of 1/20, 1/10, 1/4, and 1/2. FTIR and UV-visible spectroscopy are used to determine the drug external binding modes, the binding constant and the stability of zanamivir-DNA and RNA complexes in aqueous solution. Structural analysis showed major interaction of zanamivir with G-C (major groove) and A-T (minor groove) base pairs and minor perturbations of the backbone PO 2 group with overall binding constants of Kzanamivir-DNA = 1.30 × 10 4 M -1 and Kzanamivir-RNA = 1.38 × 10 4 M -1. The drug interaction induces a partial B to A-DNA transition, while RNA remains in A-conformation.

  19. Hepatitis Delta Antigen Requires a Flexible Quasi-Double-Stranded RNA Structure To Bind and Condense Hepatitis Delta Virus RNA in a Ribonucleoprotein Complex

    PubMed Central

    Griffin, Brittany L.; Chasovskikh, Sergey; Dritschilo, Anatoly

    2014-01-01

    ABSTRACT The circular genome and antigenome RNAs of hepatitis delta virus (HDV) form characteristic unbranched, quasi-double-stranded RNA secondary structures in which short double-stranded helical segments are interspersed with internal loops and bulges. The ribonucleoprotein complexes (RNPs) formed by these RNAs with the virus-encoded protein hepatitis delta antigen (HDAg) perform essential roles in the viral life cycle, including viral replication and virion formation. Little is understood about the formation and structure of these complexes and how they function in these key processes. Here, the specific RNA features required for HDAg binding and the topology of the complexes formed were investigated. Selective 2′OH acylation analyzed by primer extension (SHAPE) applied to free and HDAg-bound HDV RNAs indicated that the characteristic secondary structure of the RNA is preserved when bound to HDAg. Notably, the analysis indicated that predicted unpaired positions in the RNA remained dynamic in the RNP. Analysis of the in vitro binding activity of RNAs in which internal loops and bulges were mutated and of synthetically designed RNAs demonstrated that the distinctive secondary structure, not the primary RNA sequence, is the major determinant of HDAg RNA binding specificity. Atomic force microscopy analysis of RNPs formed in vitro revealed complexes in which the HDV RNA is substantially condensed by bending or wrapping. Our results support a model in which the internal loops and bulges in HDV RNA contribute flexibility to the quasi-double-stranded structure that allows RNA bending and condensing by HDAg. IMPORTANCE RNA-protein complexes (RNPs) formed by the hepatitis delta virus RNAs and protein, HDAg, perform critical roles in virus replication. Neither the structures of these RNPs nor the RNA features required to form them have been characterized. HDV RNA is unusual in that it forms an unbranched quasi-double-stranded structure in which short base-paired segments are interspersed with internal loops and bulges. We analyzed the role of the HDV RNA sequence and secondary structure in the formation of a minimal RNP and visualized the structure of this RNP using atomic force microscopy. Our results indicate that HDAg does not recognize the primary sequence of the RNA; rather, the principle contribution of unpaired bases in HDV RNA to HDAg binding is to allow flexibility in the unbranched quasi-double-stranded RNA structure. Visualization of RNPs by atomic force microscopy indicated that the RNA is significantly bent or condensed in the complex. PMID:24741096

  20. RNomics in Archaea reveals a further link between splicing of archaeal introns and rRNA processing

    PubMed Central

    Tang, Thean Hock; Rozhdestvensky, Timofey S.; d’Orval, Béatrice Clouet; Bortolin, Marie-Line; Huber, Harald; Charpentier, Bruno; Branlant, Christiane; Bachellerie, Jean-Pierre; Brosius, Jürgen; Hüttenhofer, Alexander

    2002-01-01

    The bulge–helix–bulge (BHB) motif recognised by the archaeal splicing endonuclease is also found in the long processing stems of archaeal rRNA precursors in which it is cleaved to generate pre-16S and pre-23S rRNAs. We show that in two species, Archaeoglobus fulgidus and Sulfolobus solfataricus, representatives from the two major archaeal kingdoms Euryarchaeota and Crenarchaeota, respectively, the pre-rRNA spacers cleaved at the BHB motifs surrounding pre-16S and pre-23S rRNAs subsequently become ligated. In addition, we present evidence that this is accompanied by circularisation of ribosomal pre-16S and pre-23S rRNAs in both species. These data reveal a further link between intron splicing and pre-rRNA processing in Archaea, which might reflect a common evolutionary origin of the two processes. One spliced RNA species designated 16S-D RNA, resulting from religation at the BHB motif of 16S pre-rRNA, is a highly abundant and stable RNA which folds into a three-stem structure interrupted by two single-stranded regions as assessed by chemical probing. It spans a region of the pre-rRNA 5′ external transcribed spacer exhibiting a highly conserved folding pattern in Archaea. Surprisingly, 16S-D RNA contains structural motifs found in archaeal C/D box small RNAs and binds to the L7Ae protein, a core component of archaeal C/D box RNPs. This supports the notion that it might have an important but still unknown role in pre-rRNA biogenesis or might even target RNA molecules other than rRNA. PMID:11842103

  1. Computer-Aided Design of RNA Origami Structures.

    PubMed

    Sparvath, Steffen L; Geary, Cody W; Andersen, Ebbe S

    2017-01-01

    RNA nanostructures can be used as scaffolds to organize, combine, and control molecular functionalities, with great potential for applications in nanomedicine and synthetic biology. The single-stranded RNA origami method allows RNA nanostructures to be folded as they are transcribed by the RNA polymerase. RNA origami structures provide a stable framework that can be decorated with functional RNA elements such as riboswitches, ribozymes, interaction sites, and aptamers for binding small molecules or protein targets. The rich library of RNA structural and functional elements combined with the possibility to attach proteins through aptamer-based binding creates virtually limitless possibilities for constructing advanced RNA-based nanodevices.In this chapter we provide a detailed protocol for the single-stranded RNA origami design method using a simple 2-helix tall structure as an example. The first step involves 3D modeling of a double-crossover between two RNA double helices, followed by decoration with tertiary motifs. The second step deals with the construction of a 2D blueprint describing the secondary structure and sequence constraints that serves as the input for computer programs. In the third step, computer programs are used to design RNA sequences that are compatible with the structure, and the resulting outputs are evaluated and converted into DNA sequences to order.

  2. SARNAclust: Semi-automatic detection of RNA protein binding motifs from immunoprecipitation data

    PubMed Central

    Dotu, Ivan; Adamson, Scott I.; Coleman, Benjamin; Fournier, Cyril; Ricart-Altimiras, Emma; Eyras, Eduardo

    2018-01-01

    RNA-protein binding is critical to gene regulation, controlling fundamental processes including splicing, translation, localization and stability, and aberrant RNA-protein interactions are known to play a role in a wide variety of diseases. However, molecular understanding of RNA-protein interactions remains limited; in particular, identification of RNA motifs that bind proteins has long been challenging, especially when such motifs depend on both sequence and structure. Moreover, although RNA binding proteins (RBPs) often contain more than one binding domain, algorithms capable of identifying more than one binding motif simultaneously have not been developed. In this paper we present a novel pipeline to determine binding peaks in crosslinking immunoprecipitation (CLIP) data, to discover multiple possible RNA sequence/structure motifs among them, and to experimentally validate such motifs. At the core is a new semi-automatic algorithm SARNAclust, the first unsupervised method to identify and deconvolve multiple sequence/structure motifs simultaneously. SARNAclust computes similarity between sequence/structure objects using a graph kernel, providing the ability to isolate the impact of specific features through the bulge graph formalism. Application of SARNAclust to synthetic data shows its capability of clustering 5 motifs at once with a V-measure value of over 0.95, while GraphClust achieves only a V-measure of 0.083 and RNAcontext cannot detect any of the motifs. When applied to existing eCLIP sets, SARNAclust finds known motifs for SLBP and HNRNPC and novel motifs for several other RBPs such as AGGF1, AKAP8L and ILF3. We demonstrate an experimental validation protocol, a targeted Bind-n-Seq-like high-throughput sequencing approach that relies on RNA inverse folding for oligo pool design, that can validate the components within the SLBP motif. Finally, we use this protocol to experimentally interrogate the SARNAclust motif predictions for protein ILF3. Our results support a newly identified partially double-stranded UUUUUGAGA motif similar to that known for the splicing factor HNRNPC. PMID:29596423

  3. Impact of target mRNA structure on siRNA silencing efficiency: A large-scale study.

    PubMed

    Gredell, Joseph A; Berger, Angela K; Walton, S Patrick

    2008-07-01

    The selection of active siRNAs is generally based on identifying siRNAs with certain sequence and structural properties. However, the efficiency of RNA interference has also been shown to depend on the structure of the target mRNA, primarily through studies using exogenous transcripts with well-defined secondary structures in the vicinity of the target sequence. While these studies provide a means for examining the impact of target sequence and structure independently, the predicted secondary structures for these transcripts are often not reflective of structures that form in full-length, native mRNAs where interactions can occur between relatively remote segments of the mRNAs. Here, using a combination of experimental results and analysis of a large dataset, we demonstrate that the accessibility of certain local target structures on the mRNA is an important determinant in the gene silencing ability of siRNAs. siRNAs targeting the enhanced green fluorescent protein were chosen using a minimal siRNA selection algorithm followed by classification based on the predicted minimum free energy structures of the target transcripts. Transfection into HeLa and HepG2 cells revealed that siRNAs targeting regions of the mRNA predicted to have unpaired 5'- and 3'-ends resulted in greater gene silencing than regions predicted to have other types of secondary structure. These results were confirmed by analysis of gene silencing data from previously published siRNAs, which showed that mRNA target regions unpaired at either the 5'-end or 3'-end were silenced, on average, approximately 10% more strongly than target regions unpaired in the center or primarily paired throughout. We found this effect to be independent of the structure of the siRNA guide strand. Taken together, these results suggest minimal requirements for nucleation of hybridization between the siRNA guide strand and mRNA and that both mRNA and guide strand structure should be considered when choosing candidate siRNAs. (c) 2008 Wiley Periodicals, Inc.

  4. Impact of target mRNA structure on siRNA silencing efficiency: a large-scale study

    PubMed Central

    Gredell, Joseph A.; Berger, Angela K.; Walton, S. Patrick

    2009-01-01

    The selection of active siRNAs is generally based on identifying siRNAs with certain sequence and structural properties. However, the efficiency of RNA interference has also been shown to depend on the structure of the target mRNA, primarily through studies using exogenous transcripts with well-defined secondary structures in the vicinity of the target sequence. While these studies provide a means for examining the impact of target sequence and structure independently, the predicted secondary structures for these transcripts are often not reflective of structures that form in full-length, native mRNAs where interactions can occur between relatively remote segments of the mRNAs. Here, using a combination of experimental results and analysis of a large dataset, we demonstrate that the accessibility of certain local target structures on the mRNA is an important determinant in the gene silencing ability of siRNAs. siRNAs targeting the enhanced green fluorescent protein were chosen using a minimal siRNA selection algorithm followed by classification based on the predicted minimum free energy structures of the target transcripts. Transfection into HeLa and HepG2 cells revealed that siRNAs targeting regions of the mRNA predicted to have unpaired 5’- and 3’-ends resulted in greater gene silencing than regions predicted to have other types of secondary structure. These results were confirmed by analysis of gene silencing data from previously published siRNAs, which showed that mRNA target regions unpaired at either the 5’-end or 3’-end were silenced, on average, ~10% more strongly than target regions unpaired in the center or primarily paired throughout. We found this effect to be independent of the structure of the siRNA guide strand. Taken together, these results suggest minimal requirements for nucleation of hybridization between the siRNA guide strand and mRNA and that both mRNA and guide strand structure should be considered when choosing candidate siRNAs. PMID:18306428

  5. Superposition of two tRNA{sup Ser} acceptor stem crystal structures: Comparison of structure, ligands and hydration

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Eichert, Andre; Fuerste, Jens P.; Ulrich, Alexander

    2010-05-07

    We solved the X-ray structures of two Escherichia coli tRNA{sup Ser} acceptor stem microhelices. As both tRNAs are aminoacylated by the same seryl-tRNA-synthetase, we performed a comparative structure analysis of both duplexes to investigate the helical conformation, the hydration patterns and magnesium binding sites. It is well accepted, that the hydration of RNA plays an important role in RNA-protein interactions and that the extensive solvent content of the minor groove has a special function in RNA. The detailed comparison of both tRNA{sup Ser} microhelices provides insights into the structural arrangement of the isoacceptor tRNA aminoacyl stems with respect to themore » surrounding water molecules and may eventually help us to understand their biological function at atomic resolution.« less

  6. Thioflavin T as an efficient fluorescence sensor for selective recognition of RNA G-quadruplexes

    NASA Astrophysics Data System (ADS)

    Xu, Shujuan; Li, Qian; Xiang, Junfeng; Yang, Qianfan; Sun, Hongxia; Guan, Aijiao; Wang, Lixia; Liu, Yan; Yu, Lijia; Shi, Yunhua; Chen, Hongbo; Tang, Yalin

    2016-04-01

    RNA G-quadruplexes (G4s) play important roles in translational regulation, mRNA processing events and gene expression. Therefore, a fluorescent probe that is capable of efficiently recognizing RNA G-quadruplex structures among other RNA forms is highly desirable. In this study, a water-soluble fluorogenic dye (i.e., Thioflavin T (ThT)) was employed to recognize RNA G-quadruplex structures using UV-Vis absorption spectra, fluorescence spectra and emission lifetime experiments. By stacking on the G-tetrad, the ThT probe exhibited highly specific recognition of RNA G-quadruplex structures with striking fluorescence enhancement compared with other RNA forms. The specific binding demonstrates that ThT is an efficient fluorescence sensor that can distinguish G4 and non-G4 RNA structures.

  7. Two new mutations in the MT-TW gene leading to the disruption of the secondary structure of the tRNA(Trp) in patients with Leigh syndrome.

    PubMed

    Mkaouar-Rebai, Emna; Chamkha, Imen; Kammoun, Fatma; Kammoun, Thouraya; Aloulou, Hajer; Hachicha, Mongia; Triki, Chahnez; Fakhfakh, Faiza

    2009-07-01

    Leigh syndrome is a progressive neurodegenerative disorder occurring in infancy and childhood characterized in most cases by a psychomotor retardation, optic atrophy, ataxia, dystonia, failure to thrive, seizures and respiratory failure. In this study, we performed a systematic sequence analysis of mitochondrial genes associated with LS in Tunisian patients. We sequenced the encoded complex I units: ND2, ND3, ND4, ND5 and ND6 genes and the mitochondrial ATPase 6, tRNA(Val), tRNA(Leu(UUR)), tRNA(Trp) and tRNA(Lys) genes in 10 unrelated patients with Leigh syndrome. We revealed the presence of 34 reported polymorphisms, nine novel nucleotide variants and two new mutations (T5523G and A5559G) in the tested patients. These two mutations were localized in two conserved regions of the tRNA(Trp) and affect, respectively, the D-stem and the T-stem of the mitochondrial tRNA leading to a disruption of the secondary structure of this tRNA. SSP-PCR analysis showed that the T5523G and A5559G mutations were present with respective heteroplasmic rates of 66% and 43 %. We report here the first mutational screening of mitochondrial mutations in Tunisian patients with Leigh syndrome which described two novel mutations associated with this disorder.

  8. Kinetic analysis of the effects of target structure on siRNA efficiency

    NASA Astrophysics Data System (ADS)

    Chen, Jiawen; Zhang, Wenbing

    2012-12-01

    RNAi efficiency for target cleavage and protein expression is related to the target structure. Considering the RNA-induced silencing complex (RISC) as a multiple turnover enzyme, we investigated the effect of target mRNA structure on siRNA efficiency with kinetic analysis. The 4-step model was used to study the target cleavage kinetic process: hybridization nucleation at an accessible target site, RISC-mRNA hybrid elongation along with mRNA target structure melting, target cleavage, and enzyme reactivation. At this model, the terms accounting for the target accessibility, stability, and the seed and the nucleation site effects are all included. The results are in good agreement with that of experiments which show different arguments about the structure effects on siRNA efficiency. It shows that the siRNA efficiency is influenced by the integrated factors of target's accessibility, stability, and the seed effects. To study the off-target effects, a simple model of one siRNA binding to two mRNA targets was designed. By using this model, the possibility for diminishing the off-target effects by the concentration of siRNA was discussed.

  9. Crystal-Structure-Guided Design of Self-Assembling RNA Nanotriangles.

    PubMed

    Boerneke, Mark A; Dibrov, Sergey M; Hermann, Thomas

    2016-03-14

    RNA nanotechnology uses RNA structural motifs to build nanosized architectures that assemble through selective base-pair interactions. Herein, we report the crystal-structure-guided design of highly stable RNA nanotriangles that self-assemble cooperatively from short oligonucleotides. The crystal structure of an 81 nucleotide nanotriangle determined at 2.6 Å resolution reveals the so-far smallest circularly closed nanoobject made entirely of double-stranded RNA. The assembly of the nanotriangle architecture involved RNA corner motifs that were derived from ligand-responsive RNA switches, which offer the opportunity to control self-assembly and dissociation. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  10. On the importance of cotranscriptional RNA structure formation

    PubMed Central

    Lai, Daniel; Proctor, Jeff R.; Meyer, Irmtraud M.

    2013-01-01

    The expression of genes, both coding and noncoding, can be significantly influenced by RNA structural features of their corresponding transcripts. There is by now mounting experimental and some theoretical evidence that structure formation in vivo starts during transcription and that this cotranscriptional folding determines the functional RNA structural features that are being formed. Several decades of research in bioinformatics have resulted in a wide range of computational methods for predicting RNA secondary structures. Almost all state-of-the-art methods in terms of prediction accuracy, however, completely ignore the process of structure formation and focus exclusively on the final RNA structure. This review hopes to bridge this gap. We summarize the existing evidence for cotranscriptional folding and then review the different, currently used strategies for RNA secondary-structure prediction. Finally, we propose a range of ideas on how state-of-the-art methods could be potentially improved by explicitly capturing the process of cotranscriptional structure formation. PMID:24131802

  11. Functional Nanostructures for Effective Delivery of Small Interfering RNA Therapeutics

    PubMed Central

    Hong, Cheol Am; Nam, Yoon Sung

    2014-01-01

    Small interfering RNA (siRNA) has proved to be a powerful tool for target-specific gene silencing via RNA interference (RNAi). Its ability to control targeted gene expression gives new hope to gene therapy as a treatment for cancers and genetic diseases. However, siRNA shows poor pharmacological properties, such as low serum stability, off-targeting, and innate immune responses, which present a significant challenge for clinical applications. In addition, siRNA cannot cross the cell membrane for RNAi activity because of its anionic property and stiff structure. Therefore, the development of a safe, stable, and efficient system for the delivery of siRNA therapeutics into the cytoplasm of targeted cells is crucial. Several nanoparticle platforms for siRNA delivery have been developed to overcome the major hurdles facing the therapeutic uses of siRNA. This review covers a broad spectrum of non-viral siRNA delivery systems developed for enhanced cellular uptake and targeted gene silencing in vitro and in vivo and discusses their characteristics and opportunities for clinical applications of therapeutic siRNA. PMID:25285170

  12. Decreased A-to-I RNA editing as a source of keratinocytes' dsRNA in psoriasis.

    PubMed

    Shallev, Lea; Kopel, Eli; Feiglin, Ariel; Leichner, Gil S; Avni, Dror; Sidi, Yechezkel; Eisenberg, Eli; Barzilai, Aviv; Levanon, Erez Y; Greenberger, Shoshana

    2018-06-01

    Recognition of dsRNA molecules activates the MDA5-MAVS pathway and plays a critical role in stimulating type-I interferon responses in psoriasis. However, the source of the dsRNA accumulation in psoriatic keratinocytes remains largely unknown. A-to-I RNA editing is a common co- or post-transcriptional modification that diversifies adenosine in dsRNA, and leads to unwinding of dsRNA structures. Thus, impaired RNA editing activity can result in an increased load of endogenous dsRNAs. Here we provide a transcriptome-wide analysis of RNA editing across dozens of psoriasis patients, and we demonstrate a global editing reduction in psoriatic lesions. In addition to the global alteration, we also detect editing changes in functional recoding sites located in the IGFBP7 , COPA , and FLNA genes. Accretion of dsRNA activates autoimmune responses, and therefore the results presented here, linking for the first time an autoimmune disease to reduction in global editing level, are relevant to a wide range of autoimmune diseases. © 2018 Shallev et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  13. Structural Requirement in Clostridium perfringens Collagenase mRNA 5′ Leader Sequence for Translational Induction through Small RNA-mRNA Base Pairing

    PubMed Central

    Nomura, Nobuhiko; Nakamura, Kouji

    2013-01-01

    The Gram-positive anaerobic bacterium Clostridium perfringens is pathogenic to humans and animals, and the production of its toxins is strictly regulated during the exponential phase. We recently found that the 5′ leader sequence of the colA transcript encoding collagenase, which is a major toxin of this organism, is processed and stabilized in the presence of the small RNA VR-RNA. The primary colA 5′-untranslated region (5′UTR) forms a long stem-loop structure containing an internal bulge and masks its own ribosomal binding site. Here we found that VR-RNA directly regulates colA expression through base pairing with colA mRNA in vivo. However, when the internal bulge structure was closed by point mutations in colA mRNA, translation ceased despite the presence of VR-RNA. In addition, a mutation disrupting the colA stem-loop structure induced mRNA processing and ColA-FLAG translational activation in the absence of VR-RNA, indicating that the stem-loop and internal bulge structure of the colA 5′ leader sequence is important for regulation by VR-RNA. On the other hand, processing was required for maximal ColA expression but was not essential for VR-RNA-dependent colA regulation. Finally, colA processing and translational activation were induced at a high temperature without VR-RNA. These results suggest that inhibition of the colA 5′ leader structure through base pairing is the primary role of VR-RNA in colA regulation and that the colA 5′ leader structure is a possible thermosensor. PMID:23585542

  14. RNA chaperoning and intrinsic disorder in the core proteins of Flaviviridae.

    PubMed

    Ivanyi-Nagy, Roland; Lavergne, Jean-Pierre; Gabus, Caroline; Ficheux, Damien; Darlix, Jean-Luc

    2008-02-01

    RNA chaperone proteins are essential partners of RNA in living organisms and viruses. They are thought to assist in the correct folding and structural rearrangements of RNA molecules by resolving misfolded RNA species in an ATP-independent manner. RNA chaperoning is probably an entropy-driven process, mediated by the coupled binding and folding of intrinsically disordered protein regions and the kinetically trapped RNA. Previously, we have shown that the core protein of hepatitis C virus (HCV) is a potent RNA chaperone that can drive profound structural modifications of HCV RNA in vitro. We now examined the RNA chaperone activity and the disordered nature of core proteins from different Flaviviridae genera, namely that of HCV, GBV-B (GB virus B), WNV (West Nile virus) and BVDV (bovine viral diarrhoea virus). Despite low-sequence similarities, all four proteins demonstrated general nucleic acid annealing and RNA chaperone activities. Furthermore, heat resistance of core proteins, as well as far-UV circular dichroism spectroscopy suggested that a well-defined 3D protein structure is not necessary for core-induced RNA structural rearrangements. These data provide evidence that RNA chaperoning-possibly mediated by intrinsically disordered protein segments-is conserved in Flaviviridae core proteins. Thus, besides nucleocapsid formation, core proteins may function in RNA structural rearrangements taking place during virus replication.

  15. RNA chaperoning and intrinsic disorder in the core proteins of Flaviviridae

    PubMed Central

    Ivanyi-Nagy, Roland; Lavergne, Jean-Pierre; Gabus, Caroline; Ficheux, Damien; Darlix, Jean-Luc

    2008-01-01

    RNA chaperone proteins are essential partners of RNA in living organisms and viruses. They are thought to assist in the correct folding and structural rearrangements of RNA molecules by resolving misfolded RNA species in an ATP-independent manner. RNA chaperoning is probably an entropy-driven process, mediated by the coupled binding and folding of intrinsically disordered protein regions and the kinetically trapped RNA. Previously, we have shown that the core protein of hepatitis C virus (HCV) is a potent RNA chaperone that can drive profound structural modifications of HCV RNA in vitro. We now examined the RNA chaperone activity and the disordered nature of core proteins from different Flaviviridae genera, namely that of HCV, GBV-B (GB virus B), WNV (West Nile virus) and BVDV (bovine viral diarrhoea virus). Despite low-sequence similarities, all four proteins demonstrated general nucleic acid annealing and RNA chaperone activities. Furthermore, heat resistance of core proteins, as well as far-UV circular dichroism spectroscopy suggested that a well-defined 3D protein structure is not necessary for core-induced RNA structural rearrangements. These data provide evidence that RNA chaperoning—possibly mediated by intrinsically disordered protein segments—is conserved in Flaviviridae core proteins. Thus, besides nucleocapsid formation, core proteins may function in RNA structural rearrangements taking place during virus replication. PMID:18033802

  16. A Method to Predict the Structure and Stability of RNA/RNA Complexes.

    PubMed

    Xu, Xiaojun; Chen, Shi-Jie

    2016-01-01

    RNA/RNA interactions are essential for genomic RNA dimerization and regulation of gene expression. Intermolecular loop-loop base pairing is a widespread and functionally important tertiary structure motif in RNA machinery. However, computational prediction of intermolecular loop-loop base pairing is challenged by the entropy and free energy calculation due to the conformational constraint and the intermolecular interactions. In this chapter, we describe a recently developed statistical mechanics-based method for the prediction of RNA/RNA complex structures and stabilities. The method is based on the virtual bond RNA folding model (Vfold). The main emphasis in the method is placed on the evaluation of the entropy and free energy for the loops, especially tertiary kissing loops. The method also uses recursive partition function calculations and two-step screening algorithm for large, complicated structures of RNA/RNA complexes. As case studies, we use the HIV-1 Mal dimer and the siRNA/HIV-1 mutant (T4) to illustrate the method.

  17. Synthesizing topological structures containing RNA

    NASA Astrophysics Data System (ADS)

    Liu, Di; Shao, Yaming; Chen, Gang; Tse-Dinh, Yuk-Ching; Piccirilli, Joseph A.; Weizmann, Yossi

    2017-03-01

    Though knotting and entanglement have been observed in DNA and proteins, their existence in RNA remains an enigma. Synthetic RNA topological structures are significant for understanding the physical and biological properties pertaining to RNA topology, and these properties in turn could facilitate identifying naturally occurring topologically nontrivial RNA molecules. Here we show that topological structures containing single-stranded RNA (ssRNA) free of strong base pairing interactions can be created either by configuring RNA-DNA hybrid four-way junctions or by template-directed synthesis with a single-stranded DNA (ssDNA) topological structure. By using a constructed ssRNA knot as a highly sensitive topological probe, we find that Escherichia coli DNA topoisomerase I has low RNA topoisomerase activity and that the R173A point mutation abolishes the unknotting activity for ssRNA, but not for ssDNA. Furthermore, we discover the topological inhibition of reverse transcription (RT) and obtain different RT-PCR patterns for an ssRNA knot and circle of the same sequence.

  18. RNA Secondary Structure Prediction by Using Discrete Mathematics: An Interdisciplinary Research Experience for Undergraduate Students

    ERIC Educational Resources Information Center

    Ellington, Roni; Wachira, James; Nkwanta, Asamoah

    2010-01-01

    The focus of this Research Experience for Undergraduates (REU) project was on RNA secondary structure prediction by using a lattice walk approach. The lattice walk approach is a combinatorial and computational biology method used to enumerate possible secondary structures and predict RNA secondary structure from RNA sequences. The method uses…

  19. Domain structure of the ribozyme from eubacterial ribonuclease P.

    PubMed Central

    Loria, A; Pan, T

    1996-01-01

    Large RNAs can be composed of discrete domains that fold independently. One such "folding domain" has been identified previously in the ribozyme from Bacillus subtilis ribonuclease P (denoted P RNA). This domain contains roughly one-third of all residues. Folding of an RNA construct consisting of the remaining two-thirds of B. subtilis P RNA was examined by Fe(II)-EDTA hydroxyl radical protection. This molecule folds into the proper higher-order structure under identical conditions as the full-length P RNA, suggesting the presence of a second folding domain in B. subtilis P RNA. Folding analysis of the Escherichia coli P RNA by hydroxyl radical protection shows that this P RNA is completely folded at 5-6 mM Mg2+. In order to analyze the structural organization of folding domains in E. coli P RNA, constructs were designed based on the domain structure of B. subtilis P RNA. Fe(II)-EDTA protection indicates that E. coli P RNA also contains two folding domains. Despite the significant differences at the secondary structure level, both P RNAs appear to converge structurally at the folding domain level. The pre-tRNA substrate, localized in previous studies, may bind across the folding domains with the acceptor stem/3'CCA contacting the domain including the active site and the T stem-loop contacting the other. Because all eubacterial P RNAs share considerable homology in secondary structure to either B. subtilis or E. coli P RNA, these results suggest that this domain structure may be applicable for most, if not all, eubacterial P RNAs. Identification of folding domains should be valuable in dissecting structure-function relationship of large RNAs. PMID:8718684

  20. Computational Prediction of miRNA Genes from Small RNA Sequencing Data

    PubMed Central

    Kang, Wenjing; Friedländer, Marc R.

    2015-01-01

    Next-generation sequencing now for the first time allows researchers to gage the depth and variation of entire transcriptomes. However, now as rare transcripts can be detected that are present in cells at single copies, more advanced computational tools are needed to accurately annotate and profile them. microRNAs (miRNAs) are 22 nucleotide small RNAs (sRNAs) that post-transcriptionally reduce the output of protein coding genes. They have established roles in numerous biological processes, including cancers and other diseases. During miRNA biogenesis, the sRNAs are sequentially cleaved from precursor molecules that have a characteristic hairpin RNA structure. The vast majority of new miRNA genes that are discovered are mined from small RNA sequencing (sRNA-seq), which can detect more than a billion RNAs in a single run. However, given that many of the detected RNAs are degradation products from all types of transcripts, the accurate identification of miRNAs remain a non-trivial computational problem. Here, we review the tools available to predict animal miRNAs from sRNA sequencing data. We present tools for generalist and specialist use cases, including prediction from massively pooled data or in species without reference genome. We also present wet-lab methods used to validate predicted miRNAs, and approaches to computationally benchmark prediction accuracy. For each tool, we reference validation experiments and benchmarking efforts. Last, we discuss the future of the field. PMID:25674563

  1. A folded viral noncoding RNA blocks host cell exoribonucleases through a conformationally dynamic RNA structure.

    PubMed

    Steckelberg, Anna-Lena; Akiyama, Benjamin M; Costantino, David A; Sit, Tim L; Nix, Jay C; Kieft, Jeffrey S

    2018-06-19

    Folded RNA elements that block processive 5' → 3' cellular exoribonucleases (xrRNAs) to produce biologically active viral noncoding RNAs have been discovered in flaviviruses, potentially revealing a new mode of RNA maturation. However, whether this RNA structure-dependent mechanism exists elsewhere and, if so, whether a singular RNA fold is required, have been unclear. Here we demonstrate the existence of authentic RNA structure-dependent xrRNAs in dianthoviruses, plant-infecting viruses unrelated to animal-infecting flaviviruses. These xrRNAs have no sequence similarity to known xrRNAs; thus, we used a combination of biochemistry and virology to characterize their sequence requirements and mechanism of stopping exoribonucleases. By solving the structure of a dianthovirus xrRNA by X-ray crystallography, we reveal a complex fold that is very different from that of the flavivirus xrRNAs. However, both versions of xrRNAs contain a unique topological feature, a pseudoknot that creates a protective ring around the 5' end of the RNA structure; this may be a defining structural feature of xrRNAs. Single-molecule FRET experiments reveal that the dianthovirus xrRNAs undergo conformational changes and can use "codegradational remodeling," exploiting the exoribonucleases' degradation-linked helicase activity to help form their resistant structure; such a mechanism has not previously been reported. Convergent evolution has created RNA structure-dependent exoribonuclease resistance in different contexts, which establishes it as a general RNA maturation mechanism and defines xrRNAs as an authentic functional class of RNAs.

  2. R2R--software to speed the depiction of aesthetic consensus RNA secondary structures.

    PubMed

    Weinberg, Zasha; Breaker, Ronald R

    2011-01-04

    With continuing identification of novel structured noncoding RNAs, there is an increasing need to create schematic diagrams showing the consensus features of these molecules. RNA structural diagrams are typically made either with general-purpose drawing programs like Adobe Illustrator, or with automated or interactive programs specific to RNA. Unfortunately, the use of applications like Illustrator is extremely time consuming, while existing RNA-specific programs produce figures that are useful, but usually not of the same aesthetic quality as those produced at great cost in Illustrator. Additionally, most existing RNA-specific applications are designed for drawing single RNA molecules, not consensus diagrams. We created R2R, a computer program that facilitates the generation of aesthetic and readable drawings of RNA consensus diagrams in a fraction of the time required with general-purpose drawing programs. Since the inference of a consensus RNA structure typically requires a multiple-sequence alignment, the R2R user annotates the alignment with commands directing the layout and annotation of the RNA. R2R creates SVG or PDF output that can be imported into Adobe Illustrator, Inkscape or CorelDRAW. R2R can be used to create consensus sequence and secondary structure models for novel RNA structures or to revise models when new representatives for known RNA classes become available. Although R2R does not currently have a graphical user interface, it has proven useful in our efforts to create 100 schematic models of distinct noncoding RNA classes. R2R makes it possible to obtain high-quality drawings of the consensus sequence and structural models of many diverse RNA structures with a more practical amount of effort. R2R software is available at http://breaker.research.yale.edu/R2R and as an Additional file.

  3. Functions of the 3′ and 5′ genome RNA regions of members of the genus Flavivirus

    PubMed Central

    Brinton, Margo A.; Basu, Mausumi

    2015-01-01

    The positive sense genomes of members of the genus Flavivirus in the family Flaviviridae are ~11 kb nts in length and have a 5′ type I cap but no 3′ poly A. The 5′ and 3′ terminal regions contain short conserved sequences that are proposed to be repeated remnants of an ancient sequence. However, the functions of most of these conserved sequences have not yet been determined. The terminal regions of the genome also contain multiple conserved RNA structures. Functional data for many of these structures has been obtained. Three sets of complementary 3′ and 5′ terminal region sequences, some of which are located in conserved RNA structures, interact to form a panhandle structure that is required for initiation of minus strand RNA synthesis with the 5′ terminal structure functioning as the promoter. How the switch from the terminal RNA structure base pairing to the long distance RNA-RNA interaction is triggered and regulated is not well understood but evidence suggests involvement of a cell protein binding to three sites on the 3′ terminal RNA structures and a cis-acting metastable 3′ RNA element in the 3′ terminal structure. Cell proteins may also be involved in facilitating exponential replication of nascent genomic RNA within replication vesicles at later times of infection cycle. Other conserved RNA structures and/or sequences in the 5′ and 3′ terminal regions have been proposed to regulate genome translation. Additional functions of the 5′ and 3′ terminal sequences have also been reported. PMID:25683510

  4. Direct Duplex Detection: An Emerging Tool in the RNA Structure Analysis Toolbox.

    PubMed

    Weidmann, Chase A; Mustoe, Anthony M; Weeks, Kevin M

    2016-09-01

    While a variety of powerful tools exists for analyzing RNA structure, identifying long-range and intermolecular base-pairing interactions has remained challenging. Recently, three groups introduced a high-throughput strategy that uses psoralen-mediated crosslinking to directly identify RNA-RNA duplexes in cells. Initial application of these methods highlights the preponderance of long-range structures within and between RNA molecules and their widespread structural dynamics. Copyright © 2016 Elsevier Ltd. All rights reserved.

  5. Free energy minimization to predict RNA secondary structures and computational RNA design.

    PubMed

    Churkin, Alexander; Weinbrand, Lina; Barash, Danny

    2015-01-01

    Determining the RNA secondary structure from sequence data by computational predictions is a long-standing problem. Its solution has been approached in two distinctive ways. If a multiple sequence alignment of a collection of homologous sequences is available, the comparative method uses phylogeny to determine conserved base pairs that are more likely to form as a result of billions of years of evolution than by chance. In the case of single sequences, recursive algorithms that compute free energy structures by using empirically derived energy parameters have been developed. This latter approach of RNA folding prediction by energy minimization is widely used to predict RNA secondary structure from sequence. For a significant number of RNA molecules, the secondary structure of the RNA molecule is indicative of its function and its computational prediction by minimizing its free energy is important for its functional analysis. A general method for free energy minimization to predict RNA secondary structures is dynamic programming, although other optimization methods have been developed as well along with empirically derived energy parameters. In this chapter, we introduce and illustrate by examples the approach of free energy minimization to predict RNA secondary structures.

  6. MCTBI: a web server for predicting metal ion effects in RNA structures.

    PubMed

    Sun, Li-Zhen; Zhang, Jing-Xiang; Chen, Shi-Jie

    2017-08-01

    Metal ions play critical roles in RNA structure and function. However, web servers and software packages for predicting ion effects in RNA structures are notably scarce. Furthermore, the existing web servers and software packages mainly neglect ion correlation and fluctuation effects, which are potentially important for RNAs. We here report a new web server, the MCTBI server (http://rna.physics.missouri.edu/MCTBI), for the prediction of ion effects for RNA structures. This server is based on the recently developed MCTBI, a model that can account for ion correlation and fluctuation effects for nucleic acid structures and can provide improved predictions for the effects of metal ions, especially for multivalent ions such as Mg 2+ effects, as shown by extensive theory-experiment test results. The MCTBI web server predicts metal ion binding fractions, the most probable bound ion distribution, the electrostatic free energy of the system, and the free energy components. The results provide mechanistic insights into the role of metal ions in RNA structure formation and folding stability, which is important for understanding RNA functions and the rational design of RNA structures. © 2017 Sun et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  7. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mosley, Ralph T.; Edwards, Thomas E.; Murakami, Eisuke

    The replication of the hepatitis C viral (HCV) genome is accomplished by the NS5B RNA-dependent RNA polymerase (RdRp), for which mechanistic understanding and structure-guided drug design efforts have been hampered by its propensity to crystallize in a closed, polymerization-incompetent state. The removal of an autoinhibitory {beta}-hairpin loop from genotype 2a HCV NS5B increases de novo RNA synthesis by >100-fold, promotes RNA binding, and facilitated the determination of the first crystallographic structures of HCV polymerase in complex with RNA primer-template pairs. These crystal structures demonstrate the structural realignment required for primer-template recognition and elongation, provide new insights into HCV RNA synthesismore » at the molecular level, and may prove useful in the structure-based design of novel antiviral compounds. Additionally, our approach for obtaining the RNA primer-template-bound structure of HCV polymerase may be generally applicable to solving RNA-bound complexes for other viral RdRps that contain similar regulatory {beta}-hairpin loops, including bovine viral diarrhea virus, dengue virus, and West Nile virus.« less

  8. Predicting 3D structure and stability of RNA pseudoknots in monovalent and divalent ion solutions.

    PubMed

    Shi, Ya-Zhou; Jin, Lei; Feng, Chen-Jie; Tan, Ya-Lan; Tan, Zhi-Jie

    2018-06-01

    RNA pseudoknots are a kind of minimal RNA tertiary structural motifs, and their three-dimensional (3D) structures and stability play essential roles in a variety of biological functions. Therefore, to predict 3D structures and stability of RNA pseudoknots is essential for understanding their functions. In the work, we employed our previously developed coarse-grained model with implicit salt to make extensive predictions and comprehensive analyses on the 3D structures and stability for RNA pseudoknots in monovalent/divalent ion solutions. The comparisons with available experimental data show that our model can successfully predict the 3D structures of RNA pseudoknots from their sequences, and can also make reliable predictions for the stability of RNA pseudoknots with different lengths and sequences over a wide range of monovalent/divalent ion concentrations. Furthermore, we made comprehensive analyses on the unfolding pathway for various RNA pseudoknots in ion solutions. Our analyses for extensive pseudokonts and the wide range of monovalent/divalent ion concentrations verify that the unfolding pathway of RNA pseudoknots is mainly dependent on the relative stability of unfolded intermediate states, and show that the unfolding pathway of RNA pseudoknots can be significantly modulated by their sequences and solution ion conditions.

  9. Intracellular Virus-Specific Structures and RNAs in Oncornavirus-Producing Human Cells

    PubMed Central

    Bukrinskaya, A. G.; Miller, G. G.; Lebedeva, E. N.; Zhdanov, V. M.

    1974-01-01

    Two kinds of virus-specific structures were isolated from the cytoplasm of Detroit-6 and human amnion cells producing oncornavirus-like particles. These structures represented A particles with the diameter of 70 to 80 nm and aggregated strands of nucleocapsids with the diameter of 3 and 6 nm. The structures were separated from cellular contaminants by isopycnic banding in linear sucrose gradients and subsequently further purified by sedimentation in velocity sucrose gradients. Their sedimentation coefficient was 250 and 150S, respectively. Both structures contain 60, 45, and 35S RNA species, and 150S structures also contained 20S RNA. The 35 and 20S RNA from the 150S structure formed hybrids with DNA enzymatically synthesized on extracellular virions. The structures displayed endogeneous polymerase activity, DNA product of the reaction being predominantly associated with 60S RNA. No 70S RNA was found in the cell structures of various densities. Also, the virions purified from tissue culture fluid contained 70S RNA. These findings are consistent with those on extracellular maturation of oncornavirus RNA. Images PMID:4810779

  10. High-resolution structure of the Escherichia coli ribosome

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Noeske, Jonas; Wasserman, Michael R.; Terry, Daniel S.

    Protein synthesis by the ribosome is highly dependent on the ionic conditions in the cellular environment, but the roles of ribosome solvation remain poorly understood. Moreover, the function of modifications to ribosomal RNA and ribosomal proteins are unclear. Here we present the structure of the Escherichia coli 70S ribosome to 2.4 Å resolution. The structure reveals details of the ribosomal subunit interface that are conserved in all domains of life, and suggest how solvation contributes to ribosome integrity and function. The structure also suggests how the conformation of ribosomal protein uS12 likely impacts its contribution to messenger RNA decoding. Inmore » conclusion, this structure helps to explain the phylogenetic conservation of key elements of the ribosome, including posttranscriptional and posttranslational modifications and should serve as a basis for future antibiotic development.« less

  11. High-resolution structure of the Escherichia coli ribosome

    DOE PAGES

    Noeske, Jonas; Wasserman, Michael R.; Terry, Daniel S.; ...

    2015-03-16

    Protein synthesis by the ribosome is highly dependent on the ionic conditions in the cellular environment, but the roles of ribosome solvation remain poorly understood. Moreover, the function of modifications to ribosomal RNA and ribosomal proteins are unclear. Here we present the structure of the Escherichia coli 70S ribosome to 2.4 Å resolution. The structure reveals details of the ribosomal subunit interface that are conserved in all domains of life, and suggest how solvation contributes to ribosome integrity and function. The structure also suggests how the conformation of ribosomal protein uS12 likely impacts its contribution to messenger RNA decoding. Inmore » conclusion, this structure helps to explain the phylogenetic conservation of key elements of the ribosome, including posttranscriptional and posttranslational modifications and should serve as a basis for future antibiotic development.« less

  12. Quantitation of base substitutions in eukaryotic 5S rRNA: selection for the maintenance of RNA secondary structure.

    PubMed

    Curtiss, W C; Vournakis, J N

    1984-01-01

    Eukaryotic 5S rRNA sequences from 34 diverse species were compared by the following method: (1) The sequences were aligned; (2) the positions of substitutions were located by comparison of all possible pairs of sequences; (3) the substitution sites were mapped to an assumed general base pairing model; and (4) the R-Y model of base stacking was used to study stacking pattern relationships in the structure. An analysis of the sequence and structure variability in each region of the molecule is presented. It was found that the degree of base substitution varies over a wide range, from absolute conservation to occurrence of over 90% of the possible observable substitutions. The substitutions are located primarily in stem regions of the 5S rRNA secondary structure. More than 88% of the substitutions in helical regions maintain base pairing. The disruptive substitutions are primarily located at the edges of helical regions, resulting in shortening of the helical regions and lengthening of the adjacent nonpaired regions. Base stacking patterns determined by the R-Y model are mapped onto the general secondary structure. Intrastrand and interstrand stacking could stabilize alternative coaxial structures and limit the conformational flexibility of nonpaired regions. Two short contiguous regions are 100% conserved in all species. This may reflect evolutionary constraints imposed at the DNA level by the requirement for binding of a 5S gene transcription initiation factor during gene expression.

  13. Post-transcriptional inducible gene regulation by natural antisense RNA.

    PubMed

    Nishizawa, Mikio; Ikeya, Yukinobu; Okumura, Tadayoshi; Kimura, Tominori

    2015-01-01

    Accumulating data indicate the existence of natural antisense transcripts (asRNAs), frequently transcribed from eukaryotic genes and do not encode proteins in many cases. However, their importance has been overlooked due to their heterogeneity, low expression level, and unknown function. Genes induced in responses to various stimuli are transcriptionally regulated by the activation of a gene promoter and post-transcriptionally regulated by controlling mRNA stability and translatability. A low-copy-number asRNA may post-transcriptionally regulate gene expression with cis-controlling elements on the mRNA. The asRNA itself may act as regulatory RNA in concert with trans-acting factors, including various RNA-binding proteins that bind to cis-controlling elements, microRNAs, and drugs. A novel mechanism that regulates mRNA stability includes the interaction of asRNA with mRNA by hybridization to loops in secondary structures. Furthermore, recent studies have shown that the functional network of mRNAs, asRNAs, and microRNAs finely tunes the levels of mRNA expression. The post-transcriptional mechanisms via these RNA-RNA interactions may play pivotal roles to regulate inducible gene expression and present the possibility of the involvement of asRNAs in various diseases.

  14. The importance of mRNA structure in determining the pathogenicity of synonymous and non-synonymous mutations in haemophilia

    PubMed Central

    Hamasaki-Katagiri, Nobuko; Lin, Brian C.; Simon, Jonathan; Hunt, Ryan C.; Schiller, Tal; Russek-Cohen, Estelle; Komar, Anton A.; Bar, Haim; Kimchi-Sarfaty, Chava

    2016-01-01

    Introduction Mutational analysis is commonly used to support the diagnosis and management of haemophilia. This has allowed for the generation of large mutation databases which provide unparalleled insight into genotype-phenotype relationships. Haemophilia is associated with inversions, deletions, insertions, nonsense and missense mutations. Both synonymous and non-synonymous mutations influence the base pairing of messenger RNA (mRNA), which can alter mRNA structure, cellular half-life and ribosome processivity/elongation. However, the role of mRNA structure in determining the pathogenicity of point mutations in haemophilia has not been evaluated. Aim To evaluate mRNA thermodynamic stability and associated RNA prediction software as a means to distinguish between neutral and disease-associated mutations in haemophilia. Methods Five mRNA structure prediction software programs were used to assess the thermodynamic stability of mRNA fragments carrying neutral vs. disease-associated and synonymous vs. non-synonymous point mutations in F8, F9 and a third X-linked gene, DMD (dystrophin). Results In F8 and DMD, disease-associated mutations tend to occur in more structurally stable mRNA regions, represented by lower MFE (minimum free energy) levels. In comparing multiple software packages for mRNA structure prediction, a 101–151 nucleotide fragment length appears to be a feasible range for structuring future studies. Conclusion mRNA thermodynamic stability is one predictive characteristic, which when combined with other RNA and protein features, may offer significant insight when screening sequencing data for novel disease-associated mutations. Our results also suggest potential utility in evaluating the mRNA thermodynamic stability profile of a gene when determining the viability of interchanging codons for biological and therapeutic applications. PMID:27933712

  15. Structure of Escherichia coli Arginyl-tRNA Synthetase in Complex with tRNAArg: Pivotal Role of the D-loop.

    PubMed

    Stephen, Preyesh; Ye, Sheng; Zhou, Ming; Song, Jian; Zhang, Rongguang; Wang, En-Duo; Giegé, Richard; Lin, Sheng-Xiang

    2018-05-25

    Aminoacyl-tRNA synthetases are essential components in protein biosynthesis. Arginyl-tRNA synthetase (ArgRS) belongs to the small group of aminoacyl-tRNA synthetases requiring cognate tRNA for amino acid activation. The crystal structure of Escherichia coli (Eco) ArgRS has been solved in complex with tRNA Arg at 3.0-Å resolution. With this first bacterial tRNA complex, we are attempting to bridge the gap existing in structure-function understanding in prokaryotic tRNA Arg recognition. The structure shows a tight binding of tRNA on the synthetase through the identity determinant A20 from the D-loop, a tRNA recognition snapshot never elucidated structurally. This interaction of A20 involves 5 amino acids from the synthetase. Additional contacts via U20a and U16 from the D-loop reinforce the interaction. The importance of D-loop recognition in EcoArgRS functioning is supported by a mutagenesis analysis of critical amino acids that anchor tRNA Arg on the synthetase; in particular, mutations at amino acids interacting with A20 affect binding affinity to the tRNA and specificity of arginylation. Altogether the structural and functional data indicate that the unprecedented ArgRS crystal structure represents a snapshot during functioning and suggest that the recognition of the D-loop by ArgRS is an important trigger that anchors tRNA Arg on the synthetase. In this process, A20 plays a major role, together with prominent conformational changes in several ArgRS domains that may eventually lead to the mature ArgRS:tRNA complex and the arginine activation. Functional implications that could be idiosyncratic to the arginine identity of bacterial ArgRSs are discussed. Copyright © 2018 Elsevier Ltd. All rights reserved.

  16. Structural basis of reverse nucleotide polymerization

    PubMed Central

    Nakamura, Akiyoshi; Nemoto, Taiki; Heinemann, Ilka U.; Yamashita, Keitaro; Sonoda, Tomoyo; Komoda, Keisuke; Tanaka, Isao; Söll, Dieter; Yao, Min

    2013-01-01

    Nucleotide polymerization proceeds in the forward (5′-3′) direction. This tenet of the central dogma of molecular biology is found in diverse processes including transcription, reverse transcription, DNA replication, and even in lagging strand synthesis where reverse polymerization (3′-5′) would present a “simpler” solution. Interestingly, reverse (3′-5′) nucleotide addition is catalyzed by the tRNA maturation enzyme tRNAHis guanylyltransferase, a structural homolog of canonical forward polymerases. We present a Candida albicans tRNAHis guanylyltransferase-tRNAHis complex structure that reveals the structural basis of reverse polymerization. The directionality of nucleotide polymerization is determined by the orientation of approach of the nucleotide substrate. The tRNA substrate enters the enzyme’s active site from the opposite direction (180° flip) compared with similar nucleotide substrates of canonical 5′-3′ polymerases, and the finger domains are on opposing sides of the core palm domain. Structural, biochemical, and phylogenetic data indicate that reverse polymerization appeared early in evolution and resembles a mirror image of the forward process. PMID:24324136

  17. Searching RNA motifs and their intermolecular contacts with constraint networks.

    PubMed

    Thébault, P; de Givry, S; Schiex, T; Gaspin, C

    2006-09-01

    Searching RNA gene occurrences in genomic sequences is a task whose importance has been renewed by the recent discovery of numerous functional RNA, often interacting with other ligands. Even if several programs exist for RNA motif search, none exists that can represent and solve the problem of searching for occurrences of RNA motifs in interaction with other molecules. We present a constraint network formulation of this problem. RNA are represented as structured motifs that can occur on more than one sequence and which are related together by possible hybridization. The implemented tool MilPat is used to search for several sRNA families in genomic sequences. Results show that MilPat allows to efficiently search for interacting motifs in large genomic sequences and offers a simple and extensible framework to solve such problems. New and known sRNA are identified as H/ACA candidates in Methanocaldococcus jannaschii. http://carlit.toulouse.inra.fr/MilPaT/MilPat.pl.

  18. JNSViewer—A JavaScript-based Nucleotide Sequence Viewer for DNA/RNA secondary structures

    PubMed Central

    Dong, Min; Graham, Mitchell; Yadav, Nehul

    2017-01-01

    Many tools are available for visualizing RNA or DNA secondary structures, but there is scarce implementation in JavaScript that provides seamless integration with the increasingly popular web computational platforms. We have developed JNSViewer, a highly interactive web service, which is bundled with several popular tools for DNA/RNA secondary structure prediction and can provide precise and interactive correspondence among nucleotides, dot-bracket data, secondary structure graphs, and genic annotations. In JNSViewer, users can perform RNA secondary structure predictions with different programs and settings, add customized genic annotations in GFF format to structure graphs, search for specific linear motifs, and extract relevant structure graphs of sub-sequences. JNSViewer also allows users to choose a transcript or specific segment of Arabidopsis thaliana genome sequences and predict the corresponding secondary structure. Popular genome browsers (i.e., JBrowse and BrowserGenome) were integrated into JNSViewer to provide powerful visualizations of chromosomal locations, genic annotations, and secondary structures. In addition, we used StructureFold with default settings to predict some RNA structures for Arabidopsis by incorporating in vivo high-throughput RNA structure profiling data and stored the results in our web server, which might be a useful resource for RNA secondary structure studies in plants. JNSViewer is available at http://bioinfolab.miamioh.edu/jnsviewer/index.html. PMID:28582416

  19. Free Energy Landscape and Multiple Folding Pathways of an H-Type RNA Pseudoknot

    PubMed Central

    Bian, Yunqiang; Zhang, Jian; Wang, Jun; Wang, Jihua; Wang, Wei

    2015-01-01

    How RNA sequences fold to specific tertiary structures is one of the key problems for understanding their dynamics and functions. Here, we study the folding process of an H-type RNA pseudoknot by performing a large-scale all-atom MD simulation and bias-exchange metadynamics. The folding free energy landscapes are obtained and several folding intermediates are identified. It is suggested that the folding occurs via multiple mechanisms, including a step-wise mechanism starting either from the first helix or the second, and a cooperative mechanism with both helices forming simultaneously. Despite of the multiple mechanism nature, the ensemble folding kinetics estimated from a Markov state model is single-exponential. It is also found that the correlation between folding and binding of metal ions is significant, and the bound ions mediate long-range interactions in the intermediate structures. Non-native interactions are found to be dominant in the unfolded state and also present in some intermediates, possibly hinder the folding process of the RNA. PMID:26030098

  20. Cis- and trans-regulation of luteovirus gene expression by the 3’ end of the viral genome

    PubMed Central

    Miller, W. Allen; Jackson, Jacquelyn; Feng, Ying

    2016-01-01

    Translation of the 5.7 kb luteovirus genome is controlled by the 3’ untranslated region (UTR). Base pairing between regions of the 3’ UTR and sequences kilobases upstream is required for cap-independent translation and ribosomal frameshifting needed to synthesize the viral replicase. Luteoviruses produce subgenomic RNAs, which can serve as mRNA, but one sgRNA also regulates translation initiation in trans. As on all viruses, the 3’ and 5’ ends contain structures that are presumed to facilitate RNA synthesis. This review describes the structures and interactions of Barley yellow dwarf virus RNA that facilitate the complex interplay between the above events and result in a successful virus infection. We also present surprising results on the apparent lack of need for some subgenomic RNAs for the virus to infect cells or whole plants. In summary, the UTRs of luteoviruses are highly complex entities that control and fine-tune many key events of the virus replication cycle. PMID:25858272

  1. Probing RNA Native Conformational Ensembles with Structural Constraints.

    PubMed

    Fonseca, Rasmus; van den Bedem, Henry; Bernauer, Julie

    2016-05-01

    Noncoding ribonucleic acids (RNA) play a critical role in a wide variety of cellular processes, ranging from regulating gene expression to post-translational modification and protein synthesis. Their activity is modulated by highly dynamic exchanges between three-dimensional conformational substates, which are difficult to characterize experimentally and computationally. Here, we present an innovative, entirely kinematic computational procedure to efficiently explore the native ensemble of RNA molecules. Our procedure projects degrees of freedom onto a subspace of conformation space defined by distance constraints in the tertiary structure. The dimensionality reduction enables efficient exploration of conformational space. We show that the conformational distributions obtained with our method broadly sample the conformational landscape observed in NMR experiments. Compared to normal mode analysis-based exploration, our procedure diffuses faster through the experimental ensemble while also accessing conformational substates to greater precision. Our results suggest that conformational sampling with a highly reduced but fully atomistic representation of noncoding RNA expresses key features of their dynamic nature.

  2. Memory-efficient RNA energy landscape exploration

    PubMed Central

    Mann, Martin; Kucharík, Marcel; Flamm, Christoph; Wolfinger, Michael T.

    2014-01-01

    Motivation: Energy landscapes provide a valuable means for studying the folding dynamics of short RNA molecules in detail by modeling all possible structures and their transitions. Higher abstraction levels based on a macro-state decomposition of the landscape enable the study of larger systems; however, they are still restricted by huge memory requirements of exact approaches. Results: We present a highly parallelizable local enumeration scheme that enables the computation of exact macro-state transition models with highly reduced memory requirements. The approach is evaluated on RNA secondary structure landscapes using a gradient basin definition for macro-states. Furthermore, we demonstrate the need for exact transition models by comparing two barrier-based approaches, and perform a detailed investigation of gradient basins in RNA energy landscapes. Availability and implementation: Source code is part of the C++ Energy Landscape Library available at http://www.bioinf.uni-freiburg.de/Software/. Contact: mmann@informatik.uni-freiburg.de Supplementary information: Supplementary data are available at Bioinformatics online. PMID:24833804

  3. Structural Insights into the Polyphyletic Origins of Glycyl tRNA Synthetases*♦

    PubMed Central

    Valencia-Sánchez, Marco Igor; Rodríguez-Hernández, Annia; Ferreira, Ruben; Santamaría-Suárez, Hugo Aníbal; Arciniega, Marcelino; Dock-Bregeon, Anne-Catherine; Moras, Dino; Beinsteiner, Brice; Brieba, Luis G.; Grøtli, Morten

    2016-01-01

    Glycyl tRNA synthetase (GlyRS) provides a unique case among class II aminoacyl tRNA synthetases, with two clearly widespread types of enzymes: a dimeric (α2) species present in some bacteria, archaea, and eukaryotes; and a heterotetrameric form (α2β2) present in most bacteria. Although the differences between both types of GlyRS at the anticodon binding domain level are evident, the extent and implications of the variations in the catalytic domain have not been described, and it is unclear whether the mechanism of amino acid recognition is also dissimilar. Here, we show that the α-subunit of the α2β2 GlyRS from the bacterium Aquifex aeolicus is able to perform the first step of the aminoacylation reaction, which involves the activation of the amino acid with ATP. The crystal structure of the α-subunit in the complex with an analog of glycyl adenylate at 2.8 Å resolution presents a conformational arrangement that properly positions the cognate amino acid. This work shows that glycine is recognized by a subset of different residues in the two types of GlyRS. A structural and sequence analysis of class II catalytic domains shows that bacterial GlyRS is closely related to alanyl tRNA synthetase, which led us to define a new subclassification of these ancient enzymes and to propose an evolutionary path of α2β2 GlyRS, convergent with α2 GlyRS and divergent from AlaRS, thus providing a possible explanation for the puzzling existence of two proteins sharing the same fold and function but not a common ancestor. PMID:27226617

  4. Structural insights into the polyphyletic origins of glycyl tRNA synthetases

    DOE PAGES

    Valencia-Sánchez, Marco Igor; Rodríguez-Hernández, Annia; Ferreira, Ruben; ...

    2016-05-23

    Glycyl tRNA synthetase (GlyRS) provides a unique case among class II aminoacyl tRNA synthetases, with two clearly widespread types of enzymes: a dimeric (α 2) species present in some bacteria, archaea, and eukaryotes; and a heterotetrameric form (α 2β 2) present in most bacteria. Although the differences between both types of GlyRS at the anticodon binding domain level are evident, the extent and implications of the variations in the catalytic domain have not been described, and it is unclear whether the mechanism of amino acid recognition is also dissimilar. Here, we show that the α-subunit of the α 2β 2more » GlyRS from the bacterium Aquifex aeolicus is able to perform the first step of the aminoacylation reaction, which involves the activation of the amino acid with ATP. The crystal structure of the α-subunit in the complex with an analog of glycyl adenylate at 2.8 Å resolution presents a conformational arrangement that properly positions the cognate amino acid. This work shows that glycine is recognized by a subset of different residues in the two types of GlyRS. Furthermore, a structural and sequence analysis of class II catalytic domains shows that bacterial GlyRS is closely related to alanyl tRNA synthetase, which led us to define a new subclassification of these ancient enzymes and to propose an evolutionary path of α 2β 2 GlyRS, convergent with α 2 GlyRS and divergent from AlaRS, thus providing a possible explanation for the puzzling existence of two proteins sharing the same fold and function but not a common ancestor.« less

  5. The complete sequence of the mitochondrial genome of Arctic fox (Alopex lagopus).

    PubMed

    Yan, Shou-Qing; Guo, Peng-Cheng; Yue, Yuan; Li, Wan-Hong; Bai, Chun-Yan; Li, Yu-Mei; Sun, Jin-Hai; Zhao, Zhi-Hui

    2016-11-01

    In the present study, the complete mitochondrial genome sequence of Arctic fox (Alopex lagopus) was determined for the first time. It has a total length of 16,656 bp, and contains 13 protein-coding genes, 22 tRNA genes, 2 ribosome RNA genes and 1 control region. The nucleotide composition is 31.3% for A, 26.2% for C, 14.8% for G and 27.7% for T, respectively. The D-loop region located between tRNA Pro and tRNA Phe contains a (ACACGTACACGCAT) 18 tandem repeat array. The data will be useful for the investigation of the genetic structure and diversity in the natural and farmed population of Arctic foxes.

  6. In cell mutational interference mapping experiment (in cell MIME) identifies the 5' polyadenylation signal as a dual regulator of HIV-1 genomic RNA production and packaging.

    PubMed

    Smyth, Redmond P; Smith, Maureen R; Jousset, Anne-Caroline; Despons, Laurence; Laumond, Géraldine; Decoville, Thomas; Cattenoz, Pierre; Moog, Christiane; Jossinet, Fabrice; Mougel, Marylène; Paillart, Jean-Christophe; von Kleist, Max; Marquet, Roland

    2018-05-18

    Non-coding RNA regulatory elements are important for viral replication, making them promising targets for therapeutic intervention. However, regulatory RNA is challenging to detect and characterise using classical structure-function assays. Here, we present in cell Mutational Interference Mapping Experiment (in cell MIME) as a way to define RNA regulatory landscapes at single nucleotide resolution under native conditions. In cell MIME is based on (i) random mutation of an RNA target, (ii) expression of mutated RNA in cells, (iii) physical separation of RNA into functional and non-functional populations, and (iv) high-throughput sequencing to identify mutations affecting function. We used in cell MIME to define RNA elements within the 5' region of the HIV-1 genomic RNA (gRNA) that are important for viral replication in cells. We identified three distinct RNA motifs controlling intracellular gRNA production, and two distinct motifs required for gRNA packaging into virions. Our analysis reveals the 73AAUAAA78 polyadenylation motif within the 5' PolyA domain as a dual regulator of gRNA production and gRNA packaging, and demonstrates that a functional polyadenylation signal is required for viral packaging even though it negatively affects gRNA production.

  7. In cell mutational interference mapping experiment (in cell MIME) identifies the 5′ polyadenylation signal as a dual regulator of HIV-1 genomic RNA production and packaging

    PubMed Central

    Smith, Maureen R; Jousset, Anne-Caroline; Despons, Laurence; Laumond, Géraldine; Decoville, Thomas; Cattenoz, Pierre; Moog, Christiane; Jossinet, Fabrice; Mougel, Marylène; Paillart, Jean-Christophe

    2018-01-01

    Abstract Non-coding RNA regulatory elements are important for viral replication, making them promising targets for therapeutic intervention. However, regulatory RNA is challenging to detect and characterise using classical structure-function assays. Here, we present in cell Mutational Interference Mapping Experiment (in cell MIME) as a way to define RNA regulatory landscapes at single nucleotide resolution under native conditions. In cell MIME is based on (i) random mutation of an RNA target, (ii) expression of mutated RNA in cells, (iii) physical separation of RNA into functional and non-functional populations, and (iv) high-throughput sequencing to identify mutations affecting function. We used in cell MIME to define RNA elements within the 5′ region of the HIV-1 genomic RNA (gRNA) that are important for viral replication in cells. We identified three distinct RNA motifs controlling intracellular gRNA production, and two distinct motifs required for gRNA packaging into virions. Our analysis reveals the 73AAUAAA78 polyadenylation motif within the 5′ PolyA domain as a dual regulator of gRNA production and gRNA packaging, and demonstrates that a functional polyadenylation signal is required for viral packaging even though it negatively affects gRNA production. PMID:29514260

  8. Evaluating and learning from RNA pseudotorsional space: quantitative validation of a reduced representation for RNA structure.

    PubMed

    Wadley, Leven M; Keating, Kevin S; Duarte, Carlos M; Pyle, Anna Marie

    2007-09-28

    Quantitatively describing RNA structure and conformational elements remains a formidable problem. Seven standard torsion angles and the sugar pucker are necessary to characterize the conformation of an RNA nucleotide completely. Progress has been made toward understanding the discrete nature of RNA structure, but classifying simple and ubiquitous structural elements such as helices and motifs remains a difficult task. One approach for describing RNA structure in a simple, mathematically consistent, and computationally accessible manner involves the invocation of two pseudotorsions, eta (C4'(n-1), P(n), C4'(n), P(n+1)) and theta (P(n), C4'(n), P(n+1), C4'(n+1)), which can be used to describe RNA conformation in much the same way that varphi and psi are used to describe backbone configuration of proteins. Here, we conduct an exploration and statistical evaluation of pseudotorsional space and of the Ramachandran-like eta-theta plot. We show that, through the rigorous quantitative analysis of the eta-theta plot, the pseudotorsional descriptors eta and theta, together with sugar pucker, are sufficient to describe RNA backbone conformation fully in most cases. These descriptors are also shown to contain considerable information about nucleotide base conformation, revealing a previously uncharacterized interplay between backbone and base orientation. A window function analysis is used to discern statistically relevant regions of density in the eta-theta scatter plot and then nucleotides in colocalized clusters in the eta-theta plane are shown to have similar 3-D structures through RMSD analysis of the RNA structural constituents. We find that major clusters in the eta-theta plot are few, underscoring the discrete nature of RNA backbone conformation. Like the Ramachandran plot, the eta-theta plot is a valuable system for conceptualizing biomolecular conformation, it is a useful tool for analyzing RNA tertiary structures, and it is a vital component of new approaches for solving the 3-D structures of large RNA molecules and RNA assemblies.

  9. Mapping RNA Structure In Vitro with SHAPE Chemistry and Next-Generation Sequencing (SHAPE-Seq).

    PubMed

    Watters, Kyle E; Lucks, Julius B

    2016-01-01

    Mapping RNA structure with selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) chemistry has proven to be a versatile method for characterizing RNA structure in a variety of contexts. SHAPE reagents covalently modify RNAs in a structure-dependent manner to create adducts at the 2'-OH group of the ribose backbone at nucleotides that are structurally flexible. The positions of these adducts are detected using reverse transcriptase (RT) primer extension, which stops one nucleotide before the modification, to create a pool of cDNAs whose lengths reflect the location of SHAPE modification. Quantification of the cDNA pools is used to estimate the "reactivity" of each nucleotide in an RNA molecule to the SHAPE reagent. High reactivities indicate nucleotides that are structurally flexible, while low reactivities indicate nucleotides that are inflexible. These SHAPE reactivities can then be used to infer RNA structures by restraining RNA structure prediction algorithms. Here, we provide a state-of-the-art protocol describing how to perform in vitro RNA structure probing with SHAPE chemistry using next-generation sequencing to quantify cDNA pools and estimate reactivities (SHAPE-Seq). The use of next-generation sequencing allows for higher throughput, more consistent data analysis, and multiplexing capabilities. The technique described herein, SHAPE-Seq v2.0, uses a universal reverse transcription priming site that is ligated to the RNA after SHAPE modification. The introduced priming site allows for the structural analysis of an RNA independent of its sequence.

  10. Biochemical identification of Argonaute 2 as the sole protein required for RNA-induced silencing complex activity

    PubMed Central

    Rand, Tim A.; Ginalski, Krzysztof; Grishin, Nick V.; Wang, Xiaodong

    2004-01-01

    RNA interference is carried out by the small double-stranded RNA-induced silencing complex (RISC). The RISC-bound small RNA guides the RISC complex to identify and cleave mRNAs with complementary sequences. The proteins that make up the RISC complex and cleave mRNA have not been unequivocally defined. Here, we report the biochemical purification of RISC activity to homogeneity from Drosophila Schnieder 2 cell extracts. Argonaute 2 (Ago-2) is the sole protein component present in the purified, functional RISC. By using a bioinformatics method that combines sequence-profile analysis with predicted protein secondary structure, we found homology between the PIWI domain of Ago-2 and endonuclease V and identified potential active-site amino acid residues within the PIWI domain of Ago-2. PMID:15452342

  11. Biochemical identification of Argonaute 2 as the sole protein required for RNA-induced silencing complex activity.

    PubMed

    Rand, Tim A; Ginalski, Krzysztof; Grishin, Nick V; Wang, Xiaodong

    2004-10-05

    RNA interference is carried out by the small double-stranded RNA-induced silencing complex (RISC). The RISC-bound small RNA guides the RISC complex to identify and cleave mRNAs with complementary sequences. The proteins that make up the RISC complex and cleave mRNA have not been unequivocally defined. Here, we report the biochemical purification of RISC activity to homogeneity from Drosophila Schnieder 2 cell extracts. Argonaute 2 (Ago-2) is the sole protein component present in the purified, functional RISC. By using a bioinformatics method that combines sequence-profile analysis with predicted protein secondary structure, we found homology between the PIWI domain of Ago-2 and endonuclease V and identified potential active-site amino acid residues within the PIWI domain of Ago-2.

  12. Improving RNA nearest neighbor parameters for helices by going beyond the two-state model.

    PubMed

    Spasic, Aleksandar; Berger, Kyle D; Chen, Jonathan L; Seetin, Matthew G; Turner, Douglas H; Mathews, David H

    2018-06-01

    RNA folding free energy change nearest neighbor parameters are widely used to predict folding stabilities of secondary structures. They were determined by linear regression to datasets of optical melting experiments on small model systems. Traditionally, the optical melting experiments are analyzed assuming a two-state model, i.e. a structure is either complete or denatured. Experimental evidence, however, shows that structures exist in an ensemble of conformations. Partition functions calculated with existing nearest neighbor parameters predict that secondary structures can be partially denatured, which also directly conflicts with the two-state model. Here, a new approach for determining RNA nearest neighbor parameters is presented. Available optical melting data for 34 Watson-Crick helices were fit directly to a partition function model that allows an ensemble of conformations. Fitting parameters were the enthalpy and entropy changes for helix initiation, terminal AU pairs, stacks of Watson-Crick pairs and disordered internal loops. The resulting set of nearest neighbor parameters shows a 38.5% improvement in the sum of residuals in fitting the experimental melting curves compared to the current literature set.

  13. Structural Basis for Conserved Regulation and Adaptation of the Signal Recognition Particle Targeting Complex.

    PubMed

    Wild, Klemens; Bange, Gert; Motiejunas, Domantas; Kribelbauer, Judith; Hendricks, Astrid; Segnitz, Bernd; Wade, Rebecca C; Sinning, Irmgard

    2016-07-17

    The signal recognition particle (SRP) is a ribonucleoprotein complex with a key role in targeting and insertion of membrane proteins. The two SRP GTPases, SRP54 (Ffh in bacteria) and FtsY (SRα in eukaryotes), form the core of the targeting complex (TC) regulating the SRP cycle. The architecture of the TC and its stimulation by RNA has been described for the bacterial SRP system while this information is lacking for other domains of life. Here, we present the crystal structures of the GTPase heterodimers of archaeal (Sulfolobus solfataricus), eukaryotic (Homo sapiens), and chloroplast (Arabidopsis thaliana) SRP systems. The comprehensive structural comparison combined with Brownian dynamics simulations of TC formation allows for the description of the general blueprint and of specific adaptations of the quasi-symmetric heterodimer. Our work defines conserved external nucleotide-binding sites for SRP GTPase activation by RNA. Structural analyses of the GDP-bound, post-hydrolysis states reveal a conserved, magnesium-sensitive switch within the I-box. Overall, we provide a general model for SRP cycle regulation by RNA. Copyright © 2016 Elsevier Ltd. All rights reserved.

  14. Switch from translation to RNA replication in a positive-stranded RNA virus

    PubMed Central

    Gamarnik, Andrea V.; Andino, Raul

    1998-01-01

    In positive-stranded viruses, the genomic RNA serves as a template for both translation and RNA replication. Using poliovirus as a model, we examined the interaction between these two processes. We show that the RNA polymerase is unable to replicate RNA templates undergoing translation. We discovered that an RNA structure at the 5′ end of the viral genome, next to the internal ribosomal entry site, carries signals that control both viral translation and RNA synthesis. The interaction of this RNA structure with the cellular factor PCBP up-regulates viral translation, while the binding of the viral protein 3CD represses translation and promotes negative-strand RNA synthesis. We propose that the interaction of 3CD with this RNA structure controls whether the genomic RNA is used for translation or RNA replication. PMID:9694795

  15. Analysis of RNA structure using small-angle X-ray scattering

    PubMed Central

    Cantara, William A.; Olson, Erik D.; Musier-Forsyth, Karin

    2016-01-01

    In addition to their role in correctly attaching specific amino acids to cognate tRNAs, aminoacyl-tRNA synthetases (aaRS) have been found to possess many alternative functions and often bind to and act on other nucleic acids. In contrast to the well-defined 3D structure of tRNA, the structures of many of the other RNAs recognized by aaRSs have not been solved. Despite advances in the use of X-ray crystallography (XRC), nuclear magnetic resonance (NMR) spectroscopy and cryo-electron microscopy (cryo-EM) for structural characterization of biomolecules, significant challenges to solving RNA structures still exist. Recently, small-angle X-ray scattering (SAXS) has been increasingly employed to characterize the 3D structures of RNAs and RNA-protein complexes. SAXS is capable of providing low-resolution tertiary structure information under physiological conditions and with less intensive sample preparation and data analysis requirements than XRC, NMR and cryo-EM. In this article, we describe best practices involved in the process of RNA and RNA-protein sample preparation, SAXS data collection, data analysis, and structural model building. PMID:27777026

  16. A fast and robust iterative algorithm for prediction of RNA pseudoknotted secondary structures

    PubMed Central

    2014-01-01

    Background Improving accuracy and efficiency of computational methods that predict pseudoknotted RNA secondary structures is an ongoing challenge. Existing methods based on free energy minimization tend to be very slow and are limited in the types of pseudoknots that they can predict. Incorporating known structural information can improve prediction accuracy; however, there are not many methods for prediction of pseudoknotted structures that can incorporate structural information as input. There is even less understanding of the relative robustness of these methods with respect to partial information. Results We present a new method, Iterative HFold, for pseudoknotted RNA secondary structure prediction. Iterative HFold takes as input a pseudoknot-free structure, and produces a possibly pseudoknotted structure whose energy is at least as low as that of any (density-2) pseudoknotted structure containing the input structure. Iterative HFold leverages strengths of earlier methods, namely the fast running time of HFold, a method that is based on the hierarchical folding hypothesis, and the energy parameters of HotKnots V2.0. Our experimental evaluation on a large data set shows that Iterative HFold is robust with respect to partial information, with average accuracy on pseudoknotted structures steadily increasing from roughly 54% to 79% as the user provides up to 40% of the input structure. Iterative HFold is much faster than HotKnots V2.0, while having comparable accuracy. Iterative HFold also has significantly better accuracy than IPknot on our HK-PK and IP-pk168 data sets. Conclusions Iterative HFold is a robust method for prediction of pseudoknotted RNA secondary structures, whose accuracy with more than 5% information about true pseudoknot-free structures is better than that of IPknot, and with about 35% information about true pseudoknot-free structures compares well with that of HotKnots V2.0 while being significantly faster. Iterative HFold and all data used in this work are freely available at http://www.cs.ubc.ca/~hjabbari/software.php. PMID:24884954

  17. Self-assembly of free-standing RNA membranes

    NASA Astrophysics Data System (ADS)

    Han, Daehoon; Park, Yongkuk; Kim, Hyejin; Lee, Jong Bum

    2014-07-01

    RNA has emerged as a promising material for nanostructure and microstructure engineering. Although rare, some macroscopic RNA structures have also been constructed using lipid or polymer materials. Here, we report the first example of an enzymatically generated RNA membrane. This robust and free-standing RNA membrane has a macroscopic structure and is generated without any polymer support or complexation. Our RNA membrane is fabricated following two sequential processes, complementary rolling circle transcription and evaporation-induced self-assembly, and its structural and functional properties are rationally controlled by adjusting RNA base pairing. In this study, three types of RNA membranes are fabricated and are used to demonstrate potential applications.

  18. Structural basis of malaria parasite lysyl-tRNA synthetase inhibition by cladosporin.

    PubMed

    Khan, Sameena; Sharma, Arvind; Belrhali, Hassan; Yogavel, Manickam; Sharma, Amit

    2014-06-01

    Malaria parasites inevitably develop drug resistance to anti-malarials over time. Hence the immediacy for discovering new chemical scaffolds to include in combination malaria drug therapy. The desirable attributes of new chemotherapeutic agents currently include activity against both liver and blood stage malaria parasites. One such recently discovered compound called cladosporin abrogates parasite growth via inhibition of Plasmodium falciparum lysyl-tRNA synthetase (PfKRS), an enzyme central to protein translation. Here, we present crystal structure of ternary PfKRS-lysine-cladosporin (PfKRS-K-C) complex that reveals cladosporin's remarkable ability to mimic the natural substrate adenosine and thereby colonize PfKRS active site. The isocoumarin fragment of cladosporin sandwiches between critical adenine-recognizing residues while its pyran ring fits snugly in the ribose-recognizing cavity. PfKRS-K-C structure highlights ample space within PfKRS active site for further chemical derivatization of cladosporin. Such derivatives may be useful against additional human pathogens that retain high conservation in cladosporin chelating residues within their lysyl-tRNA synthetase.

  19. The bifurcated stem loop 4 (SL4) is crucial for efficient packaging of mouse mammary tumor virus (MMTV) genomic RNA.

    PubMed

    Mustafa, Farah; Vivet-Boudou, Valérie; Jabeen, Ayesha; Ali, Lizna M; Kalloush, Rawan M; Marquet, Roland; Rizvi, Tahir A

    2018-06-21

    Packaging the mouse mammary tumor virus (MMTV) genomic RNA (gRNA) requires the entire 5' untranslated region (UTR) in conjunction with the first 120 nucleotides of the gag gene. This region includes several palindromic (pal) sequence(s) and stable stem loops (SLs). Among these, stem loop 4 (SL4) adopts a bifurcated structure consisting of three stems, two apical loops, and an internal loop. Pal II, located in one of the apical loops, mediates gRNA dimerization, a process intricately linked to packaging. We thus hypothesized that the bifurcated SL4 structure could constitute the major gRNA packaging determinant. To test this hypothesis, the two apical loops and the flanking sequences forming the bifurcated SL4 were individually mutated. These mutations all had deleterious effects on gRNA packaging and propagation. Next, single and compensatory mutants were designed to destabilize then recreate the bifurcated SL4 structure. A structure-function analysis using bioinformatics predictions and RNA chemical probing revealed that mutations that led to the loss of the SL4 bifurcated structure abrogated RNA packaging and propagation, while compensatory mutations that recreated the native SL4 structure restored RNA packaging and propagation to wild type levels. Altogether, our results demonstrate that SL4 constitutes the principal packaging determinant of MMTV gRNA. Our findings further suggest that SL4 acts as a structural switch that can not only differentiate between RNA for translation versus packaging/dimerization, but its location also allows differentiation between spliced and unspliced RNAs during gRNA encapsidation.

  20. SiRNA Crosslinked Nanoparticles for the Treatment of Inflammation-induced Liver Injury.

    PubMed

    Tang, Yaqin; Zeng, Ziying; He, Xiao; Wang, Tingting; Ning, Xinghai; Feng, Xuli

    2017-02-01

    RNA interference mediated by small interfering RNA (siRNA) provides a powerful tool for gene regulation, and has a broad potential as a promising therapeutic strategy. However, therapeutics based on siRNA have had limited clinical success due to their undesirable pharmacokinetic properties. This study presents pH-sensitive nanoparticles-based siRNA delivery systems (PNSDS), which are positive-charge-free nanocarriers, composed of siRNA chemically crosslinked with multi-armed poly(ethylene glycol) carriers via acid-labile acetal linkers. The unique siRNA crosslinked structure of PNSDS allows it to have minimal cytotoxicity, high siRNA loading efficiency, and a stimulus-responsive property that enables the selective intracellular release of siRNA in response to pH conditions. This study demonstrates that PNSDS can deliver tumor necrosis factor alpha (TNF-α) siRNA into macrophages and induce the efficient down regulation of the targeted gene in complete cell culture media. Moreover, PNSDS with mannose targeting moieties can selectively accumulate in mice liver, induce specific inhibition of macrophage TNF-α expression in vivo, and consequently protect mice from inflammation-induced liver damages. Therefore, this novel siRNA delivering platform would greatly improve the therapeutic potential of RNAi based therapies.

  1. Topological constraints are major determinants of tRNA tertiary structure and dynamics and provide basis for tertiary folding cooperativity

    PubMed Central

    Mustoe, Anthony M.; Brooks, Charles L.; Al-Hashimi, Hashim M.

    2014-01-01

    Recent studies have shown that basic steric and connectivity constraints encoded at the secondary structure level are key determinants of 3D structure and dynamics in simple two-way RNA junctions. However, the role of these topological constraints in higher order RNA junctions remains poorly understood. Here, we use a specialized coarse-grained molecular dynamics model to directly probe the thermodynamic contributions of topological constraints in defining the 3D architecture and dynamics of transfer RNA (tRNA). Topological constraints alone restrict tRNA's allowed conformational space by over an order of magnitude and strongly discriminate against formation of non-native tertiary contacts, providing a sequence independent source of folding specificity. Topological constraints also give rise to long-range correlations between the relative orientation of tRNA's helices, which in turn provides a mechanism for encoding thermodynamic cooperativity between distinct tertiary interactions. These aspects of topological constraints make it such that only several tertiary interactions are needed to confine tRNA to its native global structure and specify functionally important 3D dynamics. We further show that topological constraints are conserved across tRNA's different naturally occurring secondary structures. Taken together, our results emphasize the central role of secondary-structure-encoded topological constraints in defining RNA 3D structure, dynamics and folding. PMID:25217593

  2. Optimizing sgRNA structure to improve CRISPR-Cas9 knockout efficiency.

    PubMed

    Dang, Ying; Jia, Gengxiang; Choi, Jennie; Ma, Hongming; Anaya, Edgar; Ye, Chunting; Shankar, Premlata; Wu, Haoquan

    2015-12-15

    Single-guide RNA (sgRNA) is one of the two key components of the clustered regularly interspaced short palindromic repeats (CRISPR)-Cas9 genome-editing system. The current commonly used sgRNA structure has a shortened duplex compared with the native bacterial CRISPR RNA (crRNA)-transactivating crRNA (tracrRNA) duplex and contains a continuous sequence of thymines, which is the pause signal for RNA polymerase III and thus could potentially reduce transcription efficiency. Here, we systematically investigate the effect of these two elements on knockout efficiency and showed that modifying the sgRNA structure by extending the duplex length and mutating the fourth thymine of the continuous sequence of thymines to cytosine or guanine significantly, and sometimes dramatically, improves knockout efficiency in cells. In addition, the optimized sgRNA structure also significantly increases the efficiency of more challenging genome-editing procedures, such as gene deletion, which is important for inducing a loss of function in non-coding genes. By a systematic investigation of sgRNA structure we find that extending the duplex by approximately 5 bp combined with mutating the continuous sequence of thymines at position 4 to cytosine or guanine significantly increases gene knockout efficiency in CRISPR-Cas9-based genome editing experiments.

  3. Assembly of RNA nanostructures on supported lipid bilayers

    NASA Astrophysics Data System (ADS)

    Dabkowska, Aleksandra P.; Michanek, Agnes; Jaeger, Luc; Rabe, Michael; Chworos, Arkadiusz; Höök, Fredrik; Nylander, Tommy; Sparr, Emma

    2014-12-01

    The assembly of nucleic acid nanostructures with controlled size and shape has large impact in the fields of nanotechnology, nanomedicine and synthetic biology. The directed arrangement of nano-structures at interfaces is important for many applications. In spite of this, the use of laterally mobile lipid bilayers to control RNA three-dimensional nanostructure formation on surfaces remains largely unexplored. Here, we direct the self-assembly of RNA building blocks into three-dimensional structures of RNA on fluid lipid bilayers composed of cationic 1,2-dioleoyl-3-trimethylammonium-propane (DOTAP) or mixtures of zwitterionic 1,2-dioleoyl-sn-glycero-3-phosphatidylcholine (DOPC) and cationic sphingosine. We demonstrate the stepwise supramolecular assembly of discrete building blocks through specific and selective RNA-RNA interactions, based on results from quartz crystal microbalance with dissipation (QCM-D), ellipsometry, fluorescence recovery after photobleaching (FRAP) and total internal reflection fluorescence microscopy (TIRF) experiments. The assembly can be controlled to give a densely packed single layer of RNA polyhedrons at the fluid lipid bilayer surface. We show that assembly of the 3D structure can be modulated by sequence specific interactions, surface charge and changes in the salt composition and concentration. In addition, the tertiary structure of the RNA polyhedron can be controllably switched from an extended structure to one that is dense and compact. The versatile approach to building up three-dimensional structures of RNA does not require modification of the surface or the RNA molecules, and can be used as a bottom-up means of nanofabrication of functionalized bio-mimicking surfaces.The assembly of nucleic acid nanostructures with controlled size and shape has large impact in the fields of nanotechnology, nanomedicine and synthetic biology. The directed arrangement of nano-structures at interfaces is important for many applications. In spite of this, the use of laterally mobile lipid bilayers to control RNA three-dimensional nanostructure formation on surfaces remains largely unexplored. Here, we direct the self-assembly of RNA building blocks into three-dimensional structures of RNA on fluid lipid bilayers composed of cationic 1,2-dioleoyl-3-trimethylammonium-propane (DOTAP) or mixtures of zwitterionic 1,2-dioleoyl-sn-glycero-3-phosphatidylcholine (DOPC) and cationic sphingosine. We demonstrate the stepwise supramolecular assembly of discrete building blocks through specific and selective RNA-RNA interactions, based on results from quartz crystal microbalance with dissipation (QCM-D), ellipsometry, fluorescence recovery after photobleaching (FRAP) and total internal reflection fluorescence microscopy (TIRF) experiments. The assembly can be controlled to give a densely packed single layer of RNA polyhedrons at the fluid lipid bilayer surface. We show that assembly of the 3D structure can be modulated by sequence specific interactions, surface charge and changes in the salt composition and concentration. In addition, the tertiary structure of the RNA polyhedron can be controllably switched from an extended structure to one that is dense and compact. The versatile approach to building up three-dimensional structures of RNA does not require modification of the surface or the RNA molecules, and can be used as a bottom-up means of nanofabrication of functionalized bio-mimicking surfaces. Electronic supplementary information (ESI) available: Table with sequences of tRNA units used in this study; schematic structures of the RNA polyhedron and its building blocks; gel electrophoresis characterization of the RNA polyhedron and squares; AFM characterization of RNA tectosquare; schematic structures of RNA-9 and RNA-10 and their association with lipid bilayers; QCM-D frequency and dissipation data (as function of time) for adsorption of RNA polyhedrons, RNA squares and RNA9-10 TIRF images of RNA with Gelstar after photobleaching with analysis; Correlation plot in change of shear viscosity for TS3 and TO3-4 models for the stoichiometry of TS; QCM-D dissipation data for the sequential experiment in Fig. 5a; QCM-D and for the assembly of building blocks at the bilayer scaffold at varying bulk concentrations; QCM-D of adsorption of TS3. See DOI: 10.1039/c4nr05968a

  4. The Valine Anticodon and Valylatability of Peanut Clump Virus RNAs Are Not Essential but Provide a Modest Competitive Advantage in Plants†

    PubMed Central

    Matsuda, Daiki; Dunoyer, Patrice; Hemmer, Odile; Fritsch, Christiane; Dreher, Theo W.

    2000-01-01

    The role of valine aminoacylation of the two genomic RNAs of Peanut clump virus (PCV) was studied by comparing the amplification in vivo of RNAs with GAC, GΔC, or CCA anticodons in the tRNA-like structure (TLS) present at the 3′ end of each viral RNA. The PCV RNA1 TLS of isolate PCV2 possesses a GAC anticodon and is capable of highly efficient valylation, whereas the RNA2 TLS has a GΔC anticodon that does not support valylation. The presence in RNA1 of GΔC or CCA anticodons that conferred nonvalylatability resulted in about 2- to 4-fold and a 14- to 24-fold reduction, respectively, in RNA accumulations in tobacco BY-2 protoplasts inoculated with the RNA1 variants together with wild-type RNA2(GΔC). No differences in RNA levels were observed among protoplasts inoculated with the three variant RNA2s in the presence of wild-type RNA1(GAC). All combinations of valylatable and nonvalylatable RNAs 1 and 2 were similarly infectious in Nicotiana benthamiana plants, and viral RNAs accumulated to similar levels; all input TLS sequences were present unchanged in apical leaves. In direct competition experiments in N. benthamiana plants, however, both RNA1 and RNA2 with GAC valylatable anticodons outcompeted the nonvalylatable variants. We conclude that valylation provides a small but significant replicational advantage to both PCV RNAs. Sequence analysis of the TLS from RNA2 of a second PCV isolate, PO2A, revealed the presence of an intact GAC valine anticodon, suggesting that the differential valylation of the genomic RNAs of isolate PCV2 is not a general characteristic of PCV. PMID:10954573

  5. Design of a Temperature-Responsive Transcription Terminator.

    PubMed

    Roßmanith, Johanna; Weskamp, Mareen; Narberhaus, Franz

    2018-02-16

    RNA structures regulate various steps in gene expression. Transcription in bacteria is typically terminated by stable hairpin structures. Translation initiation can be modulated by metabolite- or temperature-sensitive RNA structures, called riboswitches or RNA thermometers (RNATs), respectively. RNATs control translation initiation by occlusion of the ribosome binding site at low temperatures. Increasing temperatures destabilize the RNA structure and facilitate ribosome access. In this study, we exploited temperature-responsive RNAT structures to design regulatory elements that control transcription termination instead of translation initiation in Escherichia coli. In order to mimic the structure of factor-independent intrinsic terminators, naturally occurring RNAT hairpins were genetically engineered to be followed by a U-stretch. Functional temperature-responsive terminators (thermoterms) prevented mRNA synthesis at low temperatures but resumed transcription after a temperature upshift. The successful design of temperature-controlled terminators highlights the potential of RNA structures as versatile gene expression control elements.

  6. Computational analysis of conserved RNA secondary structure in transcriptomes and genomes.

    PubMed

    Eddy, Sean R

    2014-01-01

    Transcriptomics experiments and computational predictions both enable systematic discovery of new functional RNAs. However, many putative noncoding transcripts arise instead from artifacts and biological noise, and current computational prediction methods have high false positive rates. I discuss prospects for improving computational methods for analyzing and identifying functional RNAs, with a focus on detecting signatures of conserved RNA secondary structure. An interesting new front is the application of chemical and enzymatic experiments that probe RNA structure on a transcriptome-wide scale. I review several proposed approaches for incorporating structure probing data into the computational prediction of RNA secondary structure. Using probabilistic inference formalisms, I show how all these approaches can be unified in a well-principled framework, which in turn allows RNA probing data to be easily integrated into a wide range of analyses that depend on RNA secondary structure inference. Such analyses include homology search and genome-wide detection of new structural RNAs.

  7. Sequence to Structure (S2S): display, manipulate and interconnect RNA data from sequence to structure.

    PubMed

    Jossinet, Fabrice; Westhof, Eric

    2005-08-01

    Efficient RNA sequence manipulations (such as multiple alignments) need to be constrained by rules of RNA structure folding. The structural knowledge has increased dramatically in the last years with the accumulation of several large RNA structures similar to those of the bacterial ribosome subunits. However, no tool in the RNA community provides an easy way to link and integrate progress made at the sequence level using the available three-dimensional information. Sequence to Structure (S2S) proposes a framework in which an user can easily display, manipulate and interconnect heterogeneous RNA data, such as multiple sequence alignments, secondary and tertiary structures. S2S has been implemented using the Java language and has been developed and tested under UNIX systems, such as Linux and MacOSX. S2S is available at http://bioinformatics.org/S2S/.

  8. Crystal structure of RlmAI: Implications for understanding the 23S rRNA G745/G748-methylation at the macrolide antibiotic-binding site

    PubMed Central

    Das, Kalyan; Acton, Thomas; Chiang, Yiwen; Shih, Lydia; Arnold, Eddy; Montelione, Gaetano T.

    2004-01-01

    The RlmA class of enzymes (RlmAI and RlmAII) catalyzes N1-methylation of a guanine base (G745 in Gram-negative and G748 in Gram-positive bacteria) of hairpin 35 of 23S rRNA. We have determined the crystal structure of Escherichia coli RlmAI at 2.8-Å resolution, providing 3D structure information for the RlmA class of RNA methyltransferases. The dimeric protein structure exhibits features that provide new insights into its molecular function. Each RlmAI molecule has a Zn-binding domain, responsible for specific recognition and binding of its rRNA substrate, and a methyltransferase domain. The asymmetric RlmAI dimer observed in the crystal structure has a well defined W-shaped RNA-binding cleft. Two S-adenosyl-l-methionine substrate molecules are located at the two valleys of the W-shaped RNA-binding cleft. The unique shape of the RNA-binding cleft, different from that of known RNA-binding proteins, is highly specific and structurally complements the 3D structure of hairpin 35 of bacterial 23S rRNA. Apart from the hairpin 35, parts of hairpins 33 and 34 also interact with the RlmAI dimer. PMID:14999102

  9. Gene silencing efficiency and INF-β induction effects of splicing miRNA 155-based artificial miRNA with pre-miRNA stem-loop structures.

    PubMed

    Sin, Onsam; Mabiala, Prudence; Liu, Ye; Sun, Ying; Hu, Tao; Liu, Qingzhen; Guo, Deyin

    2012-02-01

    Artificial microRNA (miRNA) expression vectors have been developed and used for RNA interference. The secondary structure of artificial miRNA is important for RNA interference efficacy. We designed two groups of six artificial splicing miRNA 155-based miRNAs (SM155-based miRNAs) with the same target in the coding region or 3' UTR of a target gene and studied their RNA silencing efficiency and interferon β (IFN-β) induction effects. SM155-based miRNA with a mismatch at the +1 position and a bulge at the +11, +12 positions in a miRNA precursor stem-loop structure showed the highest gene silencing efficiency and lowest IFN-β induction effect (increased IFN-β mRNA level by 10% in both target cases), regardless of the specificity of the target sequence, suggesting that pSM155-based miRNA with this design could be a valuable miRNA expression vector.

  10. Complex community of nitrite-dependent anaerobic methane oxidation bacteria in coastal sediments of the Mai Po wetland by PCR amplification of both 16S rRNA and pmoA genes.

    PubMed

    Chen, Jing; Zhou, Zhichao; Gu, Ji-Dong

    2015-02-01

    In the present work, both 16S rRNA and pmoA gene-based PCR primers were employed successfully to study the diversity and distribution of n-damo bacteria in the surface and lower layer sediments at the coastal Mai Po wetland. The occurrence of n-damo bacteria in both the surface and subsurface sediments with high diversity was confirmed in this study. Unlike the two other known n-damo communities from coastal areas, the pmoA gene-amplified sequences in the present work clustered not only with some freshwater subclusters but also within three newly erected marine subclusters mostly, indicating the unique niche specificity of n-damo bacteria in this wetland. Results suggested vegetation affected the distribution and community structures of n-damo bacteria in the sediments and n-damo could coexist with sulfate-reducing methanotrophs in the coastal ecosystem. Community structures of the Mai Po n-damo bacteria based on 16S rRNA gene were different from those of either the freshwater or the marine. In contrast, structures of the Mai Po n-damo communities based on pmoA gene grouped with the marine ones and were clearly distinguished from the freshwater ones. The abundance of n-damo bacteria at this wetland was quantified using 16S rRNA gene PCR primers to be 2.65-6.71 × 10(5) copies/g dry sediment. Ammonium and nitrite strongly affected the community structures and distribution of n-damo bacteria in the coastal Mai Po wetland sediments.

  11. Multistep modeling of protein structure: application towards refinement of tyr-tRNA synthetase

    NASA Technical Reports Server (NTRS)

    Srinivasan, S.; Shibata, M.; Roychoudhury, M.; Rein, R.

    1987-01-01

    The scope of multistep modeling (MSM) is expanding by adding a least-squares minimization step in the procedure to fit backbone reconstruction consistent with a set of C-alpha coordinates. The analytical solution of Phi and Psi angles, that fits a C-alpha x-ray coordinate is used for tyr-tRNA synthetase. Phi and Psi angles for the region where the above mentioned method fails, are obtained by minimizing the difference in C-alpha distances between the computed model and the crystal structure in a least-squares sense. We present a stepwise application of this part of MSM to the determination of the complete backbone geometry of the 321 N terminal residues of tyrosine tRNA synthetase to a root mean square deviation of 0.47 angstroms from the crystallographic C-alpha coordinates.

  12. Characterizing RNA Dynamics at Atomic Resolution Using Solution-state NMR Spectroscopy

    PubMed Central

    Bothe, Jameson R.; Nikolova, Evgenia N.; Eichhorn, Catherine D.; Chugh, Jeetender; Hansen, Alexandar L.; Al-Hashimi, Hashim M.

    2012-01-01

    Many recently discovered non-coding RNAs do not fold into a single native conformation, but rather, sample many different conformations along their free energy landscape to carry out their biological function. Unprecedented insights into the RNA dynamic structure landscape are provided by solution-state NMR techniques that measure the structural, kinetic, and thermodynamic characteristics of motions spanning picosecond to second timescales at atomic resolution. From these studies a basic description of the RNA dynamic structure landscape is emerging, bringing new insights into how RNA structures change to carry out their function as well as applications in RNA-targeted drug discovery and RNA bioengineering. PMID:22036746

  13. Interaction of Sulforaphane with DNA and RNA

    PubMed Central

    Abassi Joozdani, Farzaneh; Yari, Faramarz; Abassi Joozdani, Parvaneh; Nafisi, Shohreh

    2015-01-01

    Sulforaphane (SFN) is an isothiocyanate found in cruciferous vegetables with anti-inflammatory, anti-oxidant and anti-cancer activities. However, the antioxidant and anticancer mechanism of sulforaphane is not well understood. In the present research, we reported binding modes, binding constants and stability of SFN–DNA and -RNA complexes by Fourier transform infrared (FTIR) and UV–Visible spectroscopic methods. Spectroscopic evidence showed DNA intercalation with some degree of groove binding. SFN binds minor and major grooves of DNA and backbone phosphate (PO2), while RNA binding is through G, U, A bases with some degree of SFN–phosphate (PO2) interaction. Overall binding constants were estimated to be K(SFN–DNA)=3.01 (± 0.035)×104 M-1 and K(SFN–RNA)= 6.63 (±0.042)×103 M-1. At high SFN concentration (SFN/RNA = 1/1), DNA conformation changed from B to A occurred, while RNA remained in A-family structure. PMID:26030290

  14. Transcriptome Wide Annotation of Eukaryotic RNase III Reactivity and Degradation Signals

    PubMed Central

    Gagnon, Jules; Lavoie, Mathieu; Catala, Mathieu; Malenfant, Francis; Elela, Sherif Abou

    2015-01-01

    Detection and validation of the RNA degradation signals controlling transcriptome stability are essential steps for understanding how cells regulate gene expression. Here we present complete genomic and biochemical annotations of the signals required for RNA degradation by the dsRNA specific ribonuclease III (Rnt1p) and examine its impact on transcriptome expression. Rnt1p cleavage signals are randomly distributed in the yeast genome, and encompass a wide variety of sequences, indicating that transcriptome stability is not determined by the recurrence of a fixed cleavage motif. Instead, RNA reactivity is defined by the sequence and structural context in which the cleavage sites are located. Reactive signals are often associated with transiently expressed genes, and their impact on RNA expression is linked to growth conditions. Together, the data suggest that Rnt1p reactivity is triggered by malleable RNA degradation signals that permit dynamic response to changes in growth conditions. PMID:25680180

  15. Light-Driven Nano-oscillators for Label-Free Single-Molecule Monitoring of MicroRNA.

    PubMed

    Chen, Zixuan; Peng, Yujiao; Cao, Yue; Wang, Hui; Zhang, Jian-Rong; Chen, Hong-Yuan; Zhu, Jun-Jie

    2018-06-13

    Here, we present a mapping tool based on individual light-driven nano-oscillators for label-free single-molecule monitoring of microRNA. This design uses microRNA as a single-molecule damper for nano-oscillators by forming a rigid dual-strand structure in the gap between nano-oscillators and the immobilized surface. The ultrasensitive detection is attributed to comparable dimensions of the gap and microRNA. A developed surface plasmon-coupled scattering imaging technology enables us to directly measure the real-time gap distance vibration of multiple nano-oscillators with high accuracy and fast dynamics. High-level and low-level states of the oscillation amplitude indicate melting and hybridization statuses of microRNA. Lifetimes of two states reveal that the hybridization rate of microRNA is determined by the three-dimensional diffusion. This imaging technique contributes application potentials in a single-molecule detection and nanomechanics study.

  16. RNA design using simulated SHAPE data.

    PubMed

    Lotfi, Mohadeseh; Zare-Mirakabad, Fatemeh; Montaseri, Soheila

    2018-05-03

    It has long been established that in addition to being involved in protein translation, RNA plays essential roles in numerous other cellular processes, including gene regulation and DNA replication. Such roles are known to be dictated by higher-order structures of RNA molecules. It is therefore of prime importance to find an RNA sequence that can fold to acquire a particular function that is desirable for use in pharmaceuticals and basic research. The challenge of finding an RNA sequence for a given structure is known as the RNA design problem. Although there are several algorithms to solve this problem, they mainly consider hard constraints, such as minimum free energy, to evaluate the predicted sequences. Recently, SHAPE data has emerged as a new soft constraint for RNA secondary structure prediction. To take advantage of this new experimental constraint, we report here a new method for accurate design of RNA sequences based on their secondary structures using SHAPE data as pseudo-free energy. We then compare our algorithm with four others: INFO-RNA, ERD, MODENA and RNAifold 2.0. Our algorithm precisely predicts 26 out of 29 new sequences for the structures extracted from the Rfam dataset, while the other four algorithms predict no more than 22 out of 29. The proposed algorithm is comparable to the above algorithms on RNA-SSD datasets, where they can predict up to 33 appropriate sequences for RNA secondary structures out of 34.

  17. Structures of human ADAR2 bound to dsRNA reveal base-flipping mechanism and basis for site selectivity

    DOE PAGES

    Matthews, Melissa M.; Thomas, Justin M.; Zheng, Yuxuan; ...

    2016-04-11

    Adenosine deaminases acting on RNA (ADARs) are editing enzymes that convert adenosine to inosine in duplex RNA, a modification reaction with wide-ranging consequences in RNA function. Understanding of the ADAR reaction mechanism, the origin of editing-site selectivity, and the effect of mutations is limited by the lack of high-resolution structural data for complexes of ADARs bound to substrate RNAs. In this paper, we describe four crystal structures of the human ADAR2 deaminase domain bound to RNA duplexes bearing a mimic of the deamination reaction intermediate. These structures, together with structure-guided mutagenesis and RNA-modification experiments, explain the basis of the ADARmore » deaminase domain's dsRNA specificity, its base-flipping mechanism, and its nearest-neighbor preferences. In addition, we identified an ADAR2-specific RNA-binding loop near the enzyme active site, thus rationalizing differences in selectivity observed between different ADARs. In conclusion, our results provide a structural framework for understanding the effects of ADAR mutations associated with human disease.« less

  18. A novel knowledge-based potential for RNA 3D structure evaluation

    NASA Astrophysics Data System (ADS)

    Yang, Yi; Gu, Qi; Zhang, Ben-Gong; Shi, Ya-Zhou; Shao, Zhi-Gang

    2018-03-01

    Ribonucleic acids (RNAs) play a vital role in biology, and knowledge of their three-dimensional (3D) structure is required to understand their biological functions. Recently structural prediction methods have been developed to address this issue, but a series of RNA 3D structures are generally predicted by most existing methods. Therefore, the evaluation of the predicted structures is generally indispensable. Although several methods have been proposed to assess RNA 3D structures, the existing methods are not precise enough. In this work, a new all-atom knowledge-based potential is developed for more accurately evaluating RNA 3D structures. The potential not only includes local and nonlocal interactions but also fully considers the specificity of each RNA by introducing a retraining mechanism. Based on extensive test sets generated from independent methods, the proposed potential correctly distinguished the native state and ranked near-native conformations to effectively select the best. Furthermore, the proposed potential precisely captured RNA structural features such as base-stacking and base-pairing. Comparisons with existing potential methods show that the proposed potential is very reliable and accurate in RNA 3D structure evaluation. Project supported by the National Science Foundation of China (Grants Nos. 11605125, 11105054, 11274124, and 11401448).

  19. SSMART: Sequence-structure motif identification for RNA-binding proteins.

    PubMed

    Munteanu, Alina; Mukherjee, Neelanjan; Ohler, Uwe

    2018-06-11

    RNA-binding proteins (RBPs) regulate every aspect of RNA metabolism and function. There are hundreds of RBPs encoded in the eukaryotic genomes, and each recognize its RNA targets through a specific mixture of RNA sequence and structure properties. For most RBPs, however, only a primary sequence motif has been determined, while the structure of the binding sites is uncharacterized. We developed SSMART, an RNA motif finder that simultaneously models the primary sequence and the structural properties of the RNA targets sites. The sequence-structure motifs are represented as consensus strings over a degenerate alphabet, extending the IUPAC codes for nucleotides to account for secondary structure preferences. Evaluation on synthetic data showed that SSMART is able to recover both sequence and structure motifs implanted into 3'UTR-like sequences, for various degrees of structured/unstructured binding sites. In addition, we successfully used SSMART on high-throughput in vivo and in vitro data, showing that we not only recover the known sequence motif, but also gain insight into the structural preferences of the RBP. Availability: SSMART is freely available at https://ohlerlab.mdc-berlin.de/software/SSMART_137/. Supplementary data are available at Bioinformatics online.

  20. The identification and functional annotation of RNA structures conserved in vertebrates

    PubMed Central

    Seemann, Stefan E.; Mirza, Aashiq H.; Hansen, Claus; Bang-Berthelsen, Claus H.; Garde, Christian; Christensen-Dalsgaard, Mikkel; Torarinsson, Elfar; Yao, Zizhen; Workman, Christopher T.; Pociot, Flemming; Nielsen, Henrik; Tommerup, Niels; Ruzzo, Walter L.; Gorodkin, Jan

    2017-01-01

    Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization, and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for conserved RNA structures (CRSs), leveraging structure-based, rather than sequence-based, alignments. After careful correction for sequence identity and GC content, we predict ∼516,000 human genomic regions containing CRSs. We find that a substantial fraction of human–mouse CRS regions (1) colocalize consistently with binding sites of the same RNA binding proteins (RBPs) or (2) are transcribed in corresponding tissues. Additionally, a CaptureSeq experiment revealed expression of many of our CRS regions in human fetal brain, including 662 novel ones. For selected human and mouse candidate pairs, qRT-PCR and in vitro RNA structure probing supported both shared expression and shared structure despite low abundance and low sequence identity. About 30,000 CRS regions are located near coding or long noncoding RNA genes or within enhancers. Structured (CRS overlapping) enhancer RNAs and extended 3′ ends have significantly increased expression levels over their nonstructured counterparts. Our findings of transcribed uncharacterized regulatory regions that contain CRSs support their RNA-mediated functionality. PMID:28487280

  1. RNA-dependent RNA polymerase: Addressing Zika outbreak by a phylogeny-based drug target study.

    PubMed

    Stephen, Preyesh; Lin, Sheng-Xiang

    2018-01-01

    Since the first major outbreak of Zika virus (ZIKV) in 2007, ZIKV is spreading explosively through South and Central America, and recent reports in highly populated developing countries alarm the possibility of a more catastrophic outbreak. ZIKV infection in pregnant women leads to embryonic microcephaly and Guillain-Barré syndrome in adults. At present, there is limited understanding of the infectious mechanism, and no approved therapy has been reported. Despite the withdrawal of public health emergency, the WHO still considers the ZIKV as a highly significant and long-term public health challenge that the situation has to be addressed rapidly. Non-structural protein 5 is essential for capping and replication of viral RNA and comprises a methyltransferase and RNA-dependent RNA polymerase (RdRp) domain. We used molecular modeling to obtain the structure of ZIKV RdRp, and by molecular docking and phylogeny analysis, we here demonstrate the potential sites for drug screening. Two metal binding sites and an NS3-interacting region in ZIKV RdRp are demonstrated as potential drug screening sites. The docked structures reveal a remarkable degree of conservation at the substrate binding site and the potential drug screening sites. A phylogeny-based approach is provided for an emergency preparedness, where similar class of ligands could target phylogenetically related proteins. © 2017 John Wiley & Sons A/S.

  2. Ribozyme-catalysed RNA synthesis using triplet building blocks.

    PubMed

    Attwater, James; Raguram, Aditya; Morgunov, Alexey S; Gianni, Edoardo; Holliger, Philipp

    2018-05-15

    RNA-catalyzed RNA replication is widely believed to have supported a primordial biology. However, RNA catalysis is dependent upon RNA folding, and this yields structures that can block replication of such RNAs. To address this apparent paradox we have re-examined the building blocks used for RNA replication. We report RNA-catalysed RNA synthesis on structured templates when using trinucleotide triphosphates (triplets) as substrates, catalysed by a general and accurate triplet polymerase ribozyme that emerged from in vitro evolution as a mutualistic RNA heterodimer. The triplets cooperatively invaded and unraveled even highly stable RNA secondary structures, and support non-canonical primer-free and bidirectional modes of RNA synthesis and replication. Triplet substrates thus resolve a central incongruity of RNA replication, and here allow the ribozyme to synthesise its own catalytic subunit '+' and '-' strands in segments and assemble them into a new active ribozyme. © 2018, Attwater et al.

  3. Functional RNA structures throughout the Hepatitis C Virus genome.

    PubMed

    Adams, Rebecca L; Pirakitikulr, Nathan; Pyle, Anna Marie

    2017-06-01

    The single-stranded Hepatitis C Virus (HCV) genome adopts a set of elaborate RNA structures that are involved in every stage of the viral lifecycle. Recent advances in chemical probing, sequencing, and structural biology have facilitated analysis of RNA folding on a genome-wide scale, revealing novel structures and networks of interactions. These studies have underscored the active role played by RNA in every function of HCV and they open the door to new types of RNA-targeted therapeutics. Copyright © 2017 Elsevier B.V. All rights reserved.

  4. RNA and RNP as Building Blocks for Nanotechnology and Synthetic Biology.

    PubMed

    Ohno, Hirohisa; Saito, Hirohide

    2016-01-01

    Recent technologies that aimed to elucidate cellular function have revealed essential roles for RNA molecules in living systems. Our knowledge concerning functional and structural information of naturally occurring RNA and RNA-protein (RNP) complexes is increasing rapidly. RNA and RNP interaction motifs are structural units that function as building blocks to constitute variety of complex structures. RNA-central synthetic biology and nanotechnology are constructive approaches that employ the accumulated information and build synthetic RNA (RNP)-based circuits and nanostructures. Here, we describe how to design and construct synthetic RNA (RNP)-based devices and structures at the nanometer-scale for biological and future therapeutic applications. RNA/RNP nanostructures can also be utilized as the molecular scaffold to control the localization or interactions of target molecule(s). Moreover, RNA motifs recognized by RNA-binding proteins can be applied to make protein-responsive translational "switches" that can turn gene expression "on" or "off" depending on the intracellular environment. This "synthetic RNA and RNP world" will expand tools for nanotechnology and synthetic biology. In addition, these reconstructive approaches would lead to a greater understanding of building principle in naturally occurring RNA/RNP molecules and systems. Copyright © 2016 Elsevier Inc. All rights reserved.

  5. [Genomic disorders in the mononuclear blood cells of those who worked in the cleanup of the accident at the Chernobyl Atomic Electric Power Station].

    PubMed

    Butenko, Z A; Smirnova, I A; Zak, K P; Mikhaĭlovskaia, E V; Ianok, E A; Kishinskaia, E G

    1998-01-01

    The results of molecular investigations of blood mononuclears from 120 clean-up workers after 7-9 years of Chernobyl accident with the total exposure radiation doses ranging from 5 to 76 cGr are presented. Structural polymorphism of the leukemia associated bcr and ribosomal RNA (rRNA) genes were studied using Southern blot hybridization. Allelic polymorphism of bcr gene with characteristic for leukemia allele distribution was detected in 16.6%. Rearrangements of rRNA genes were observed in 13% of Chernobyl accident clean-up workers.

  6. 5S ribosomal RNA database Y2K

    PubMed Central

    Szymanski, Maciej; Barciszewska, Miroslawa Z.; Barciszewski, Jan; Erdmann, Volker A.

    2000-01-01

    This paper presents the updated version (Y2K) of the database of ribosomal 5S ribonucleic acids (5S rRNA) and their genes (5S rDNA), http://rose.man/poznan. pl/5SData/index.html . This edition of the database contains 1985 primary structures of 5S rRNA and 5S rDNA. They include 60 archaebacterial, 470 eubacterial, 63 plastid, nine mitochondrial and 1383 eukaryotic sequences. The nucleotide sequences of the 5S rRNAs or 5S rDNAs are divided according to the taxonomic position of the source organisms. PMID:10592212

  7. 5S ribosomal RNA database Y2K.

    PubMed

    Szymanski, M; Barciszewska, M Z; Barciszewski, J; Erdmann, V A

    2000-01-01

    This paper presents the updated version (Y2K) of the database of ribosomal 5S ribonucleic acids (5S rRNA) and their genes (5S rDNA), http://rose.man/poznan.pl/5SData/index.html. This edition of the database contains 1985primary structures of 5S rRNA and 5S rDNA. They include 60 archaebacterial, 470 eubacterial, 63 plastid, nine mitochondrial and 1383 eukaryotic sequences. The nucleotide sequences of the 5S rRNAs or 5S rDNAs are divided according to the taxonomic position of the source organisms.

  8. RNA Mimicry by the Fap7 Adenylate Kinase in Ribosome Biogenesis

    PubMed Central

    Réty, Stéphane; Lebaron, Simon; Deschamps, Patrick; Bareille, Joseph; Jombart, Julie; Robert-Paganin, Julien; Delbos, Lila; Chardon, Florian; Zhang, Elodie; Charenton, Clément; Tollervey, David; Leulliot, Nicolas

    2014-01-01

    During biogenesis of the 40S and 60S ribosomal subunits, the pre-40S particles are exported to the cytoplasm prior to final cleavage of the 20S pre-rRNA to mature 18S rRNA. Amongst the factors involved in this maturation step, Fap7 is unusual, as it both interacts with ribosomal protein Rps14 and harbors adenylate kinase activity, a function not usually associated with ribonucleoprotein assembly. Human hFap7 also regulates Cajal body assembly and cell cycle progression via the p53–MDM2 pathway. This work presents the functional and structural characterization of the Fap7–Rps14 complex. We report that Fap7 association blocks the RNA binding surface of Rps14 and, conversely, Rps14 binding inhibits adenylate kinase activity of Fap7. In addition, the affinity of Fap7 for Rps14 is higher with bound ADP, whereas ATP hydrolysis dissociates the complex. These results suggest that Fap7 chaperones Rps14 assembly into pre-40S particles via RNA mimicry in an ATP-dependent manner. Incorporation of Rps14 by Fap7 leads to a structural rearrangement of the platform domain necessary for the pre-rRNA to acquire a cleavage competent conformation. PMID:24823650

  9. The impact of CRISPR repeat sequence on structures of a Cas6 protein-RNA complex

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wang, Ruiying; Zheng, Han; Preamplume, Gan

    The repeat-associated mysterious proteins (RAMPs) comprise the most abundant family of proteins involved in prokaryotic immunity against invading genetic elements conferred by the clustered regularly interspaced short palindromic repeat (CRISPR) system. Cas6 is one of the first characterized RAMP proteins and is a key enzyme required for CRISPR RNA maturation. Despite a strong structural homology with other RAMP proteins that bind hairpin RNA, Cas6 distinctly recognizes single-stranded RNA. Previous structural and biochemical studies show that Cas6 captures the 5' end while cleaving the 3' end of the CRISPR RNA. Here, we describe three structures and complementary biochemical analysis of amore » noncatalytic Cas6 homolog from Pyrococcus horikoshii bound to CRISPR repeat RNA of different sequences. Our study confirms the specificity of the Cas6 protein for single-stranded RNA and further reveals the importance of the bases at Positions 5-7 in Cas6-RNA interactions. Substitutions of these bases result in structural changes in the protein-RNA complex including its oligomerization state.« less

  10. R2R - software to speed the depiction of aesthetic consensus RNA secondary structures

    PubMed Central

    2011-01-01

    Background With continuing identification of novel structured noncoding RNAs, there is an increasing need to create schematic diagrams showing the consensus features of these molecules. RNA structural diagrams are typically made either with general-purpose drawing programs like Adobe Illustrator, or with automated or interactive programs specific to RNA. Unfortunately, the use of applications like Illustrator is extremely time consuming, while existing RNA-specific programs produce figures that are useful, but usually not of the same aesthetic quality as those produced at great cost in Illustrator. Additionally, most existing RNA-specific applications are designed for drawing single RNA molecules, not consensus diagrams. Results We created R2R, a computer program that facilitates the generation of aesthetic and readable drawings of RNA consensus diagrams in a fraction of the time required with general-purpose drawing programs. Since the inference of a consensus RNA structure typically requires a multiple-sequence alignment, the R2R user annotates the alignment with commands directing the layout and annotation of the RNA. R2R creates SVG or PDF output that can be imported into Adobe Illustrator, Inkscape or CorelDRAW. R2R can be used to create consensus sequence and secondary structure models for novel RNA structures or to revise models when new representatives for known RNA classes become available. Although R2R does not currently have a graphical user interface, it has proven useful in our efforts to create 100 schematic models of distinct noncoding RNA classes. Conclusions R2R makes it possible to obtain high-quality drawings of the consensus sequence and structural models of many diverse RNA structures with a more practical amount of effort. R2R software is available at http://breaker.research.yale.edu/R2R and as an Additional file. PMID:21205310

  11. RNA Structural Analysis by Evolving SHAPE Chemistry

    PubMed Central

    Spitale, Robert C.; Flynn, Ryan A.; Torre, Eduardo A.; Kool, Eric T.; Chang, Howard Y.

    2017-01-01

    RNA is central to the flow of biological information. From transcription to splicing, RNA localization, translation, and decay, RNA is intimately involved in regulating every step of the gene expression program, and is thus essential for health and understanding disease. RNA has the unique ability to base-pair with itself and other nucleic acids to form complex structures. Hence the information content in RNA is not simply its linear sequence of bases, but is also encoded in complex folding of RNA molecules. A general chemical functionality that all RNAs have is a 2’-hydroxyl group in the ribose ring, and the reactivity of the 2'-hydroxyl in RNA is gated by local nucleotide flexibility. In other words, the 2'-hydroxyl is reactive at single-stranded and conformationally flexible positions but is unreactive at nucleotides constrained by base pairing. Recent efforts have been focused on developing reagents that modify RNA as a function of RNA 2’ hydroxyl group flexibility. Such RNA structure probing techniques can be read out by primer extension in experiments termed RNA SHAPE (Selective 2’ Hydroxyl Acylation and Primer Extension). Herein we describe the efforts devoted to the design and utilization of SHAPE probes for characterizing RNA structure. We also describe current technological advances that are being used to utilize SHAPE chemistry with deep sequencing to probe many RNAs in parallel. The merger of chemistry with genomics is sure to open the door to genome-wide exploration of RNA structure and function. PMID:25132067

  12. Regulation of a maize HD-ZIP IV transcription factor by a non-conventional RDR2-dependent small RNA.

    PubMed

    Klein-Cosson, Catherine; Chambrier, Pierre; Rogowsky, Peter M; Vernoud, Vanessa

    2015-03-01

    Small non-coding RNAs are versatile riboregulators that control gene expression at the transcriptional or post-transcriptional level, governing many facets of plant development. Here we present evidence for the existence of a 24 nt small RNA (named small1) that is complementary to the 3' UTR of OCL1 (Outer Cell Layer1), the founding member of the maize HD-ZIP IV gene family encoding plant-specific transcription factors that are mainly involved in epidermis differentiation and specialization. The biogenesis of small1 depends on DICER-like 3 (DCL3), RNA-dependent RNA polymerase 2 (RDR2) and RNA polymerase IV, components that are usually required for RNA-dependent DNA-methylation. Unexpectedly, GFP sensor experiments in transient and stable transformation systems revealed that small1 may regulate its target at the post-transcriptional level, mainly through translational repression. This translational repression is attenuated in an rdr2 mutant background in which small1 does not accumulate. Our experiments further showed the possible involvement of a secondary stem-loop structure present in the 3' UTR of OCL1 for efficient target repression, suggesting the existence of several regulatory mechanisms affecting OCL1 mRNA stability and translation. © 2015 The Authors The Plant Journal © 2015 John Wiley & Sons Ltd.

  13. CRISPR-Cas9 Structures and Mechanisms.

    PubMed

    Jiang, Fuguo; Doudna, Jennifer A

    2017-05-22

    Many bacterial clustered regularly interspaced short palindromic repeats (CRISPR)-CRISPR-associated (Cas) systems employ the dual RNA-guided DNA endonuclease Cas9 to defend against invading phages and conjugative plasmids by introducing site-specific double-stranded breaks in target DNA. Target recognition strictly requires the presence of a short protospacer adjacent motif (PAM) flanking the target site, and subsequent R-loop formation and strand scission are driven by complementary base pairing between the guide RNA and target DNA, Cas9-DNA interactions, and associated conformational changes. The use of CRISPR-Cas9 as an RNA-programmable DNA targeting and editing platform is simplified by a synthetic single-guide RNA (sgRNA) mimicking the natural dual trans-activating CRISPR RNA (tracrRNA)-CRISPR RNA (crRNA) structure. This review aims to provide an in-depth mechanistic and structural understanding of Cas9-mediated RNA-guided DNA targeting and cleavage. Molecular insights from biochemical and structural studies provide a framework for rational engineering aimed at altering catalytic function, guide RNA specificity, and PAM requirements and reducing off-target activity for the development of Cas9-based therapies against genetic diseases.

  14. StralSV: assessment of sequence variability within similar 3D structures and application to polio RNA-dependent RNA polymerase.

    PubMed

    Zemla, Adam T; Lang, Dorothy M; Kostova, Tanya; Andino, Raul; Ecale Zhou, Carol L

    2011-06-02

    Most of the currently used methods for protein function prediction rely on sequence-based comparisons between a query protein and those for which a functional annotation is provided. A serious limitation of sequence similarity-based approaches for identifying residue conservation among proteins is the low confidence in assigning residue-residue correspondences among proteins when the level of sequence identity between the compared proteins is poor. Multiple sequence alignment methods are more satisfactory--still, they cannot provide reliable results at low levels of sequence identity. Our goal in the current work was to develop an algorithm that could help overcome these difficulties by facilitating the identification of structurally (and possibly functionally) relevant residue-residue correspondences between compared protein structures. Here we present StralSV (structure-alignment sequence variability), a new algorithm for detecting closely related structure fragments and quantifying residue frequency from tight local structure alignments. We apply StralSV in a study of the RNA-dependent RNA polymerase of poliovirus, and we demonstrate that the algorithm can be used to determine regions of the protein that are relatively unique, or that share structural similarity with proteins that would be considered distantly related. By quantifying residue frequencies among many residue-residue pairs extracted from local structural alignments, one can infer potential structural or functional importance of specific residues that are determined to be highly conserved or that deviate from a consensus. We further demonstrate that considerable detailed structural and phylogenetic information can be derived from StralSV analyses. StralSV is a new structure-based algorithm for identifying and aligning structure fragments that have similarity to a reference protein. StralSV analysis can be used to quantify residue-residue correspondences and identify residues that may be of particular structural or functional importance, as well as unusual or unexpected residues at a given sequence position. StralSV is provided as a web service at http://proteinmodel.org/AS2TS/STRALSV/.

  15. R3D Align web server for global nucleotide to nucleotide alignments of RNA 3D structures.

    PubMed

    Rahrig, Ryan R; Petrov, Anton I; Leontis, Neocles B; Zirbel, Craig L

    2013-07-01

    The R3D Align web server provides online access to 'RNA 3D Align' (R3D Align), a method for producing accurate nucleotide-level structural alignments of RNA 3D structures. The web server provides a streamlined and intuitive interface, input data validation and output that is more extensive and easier to read and interpret than related servers. The R3D Align web server offers a unique Gallery of Featured Alignments, providing immediate access to pre-computed alignments of large RNA 3D structures, including all ribosomal RNAs, as well as guidance on effective use of the server and interpretation of the output. By accessing the non-redundant lists of RNA 3D structures provided by the Bowling Green State University RNA group, R3D Align connects users to structure files in the same equivalence class and the best-modeled representative structure from each group. The R3D Align web server is freely accessible at http://rna.bgsu.edu/r3dalign/.

  16. Magnesium-binding architectures in RNA crystal structures: validation, binding preferences, classification and motif detection

    PubMed Central

    Zheng, Heping; Shabalin, Ivan G.; Handing, Katarzyna B.; Bujnicki, Janusz M.; Minor, Wladek

    2015-01-01

    The ubiquitous presence of magnesium ions in RNA has long been recognized as a key factor governing RNA folding, and is crucial for many diverse functions of RNA molecules. In this work, Mg2+-binding architectures in RNA were systematically studied using a database of RNA crystal structures from the Protein Data Bank (PDB). Due to the abundance of poorly modeled or incorrectly identified Mg2+ ions, the set of all sites was comprehensively validated and filtered to identify a benchmark dataset of 15 334 ‘reliable’ RNA-bound Mg2+ sites. The normalized frequencies by which specific RNA atoms coordinate Mg2+ were derived for both the inner and outer coordination spheres. A hierarchical classification system of Mg2+ sites in RNA structures was designed and applied to the benchmark dataset, yielding a set of 41 types of inner-sphere and 95 types of outer-sphere coordinating patterns. This classification system has also been applied to describe six previously reported Mg2+-binding motifs and detect them in new RNA structures. Investigation of the most populous site types resulted in the identification of seven novel Mg2+-binding motifs, and all RNA structures in the PDB were screened for the presence of these motifs. PMID:25800744

  17. A DEAD-box RNA helicase promotes thermodynamic equilibration of kinetically trapped RNA structures in vivo.

    PubMed

    Ruminski, Dana J; Watson, Peter Y; Mahen, Elisabeth M; Fedor, Martha J

    2016-03-01

    RNAs must assemble into specific structures in order to carry out their biological functions, but in vitro RNA folding reactions produce multiple misfolded structures that fail to exchange with functional structures on biological time scales. We used carefully designed self-cleaving mRNAs that assemble through well-defined folding pathways to identify factors that differentiate intracellular and in vitro folding reactions. Our previous work showed that simple base-paired RNA helices form and dissociate with the same rate and equilibrium constants in vivo and in vitro. However, exchange between adjacent secondary structures occurs much faster in vivo, enabling RNAs to quickly adopt structures with the lowest free energy. We have now used this approach to probe the effects of an extensively characterized DEAD-box RNA helicase, Mss116p, on a series of well-defined RNA folding steps in yeast. Mss116p overexpression had no detectable effect on helix formation or dissociation kinetics or on the stability of interdomain tertiary interactions, consistent with previous evidence that intracellular factors do not affect these folding parameters. However, Mss116p overexpression did accelerate exchange between adjacent helices. The nonprocessive nature of RNA duplex unwinding by DEAD-box RNA helicases is consistent with a branch migration mechanism in which Mss116p lowers barriers to exchange between otherwise stable helices by the melting and annealing of one or two base pairs at interhelical junctions. These results suggest that the helicase activity of DEAD-box proteins like Mss116p distinguish intracellular RNA folding pathways from nonproductive RNA folding reactions in vitro and allow RNA structures to overcome kinetic barriers to thermodynamic equilibration in vivo. © 2016 Ruminski et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  18. Molecular determinants of Cytochrome C oxidase IV mRNA axonal trafficking

    PubMed Central

    Kar, Amar N.; Vargas, Jose Norberto S.; Chen, Cai-Yun; Kowalak, Jeffrey A; Gioio, Anthony E.; Kaplan, Barry B.

    2017-01-01

    In previous studies, we identified a putative 38-nucleotide stem-loop structure (zipcode) in the 3′ untranslated region of the cytochrome c oxidase subunit IV (COXIV) mRNA that was necessary and sufficient for the axonal localization of the message in primary superior cervical ganglion (SCG) neurons. However, little is known about the proteins that interact with the COXIV-zipcode and regulate the axonal trafficking and local translation of the COXIV message. To identify proteins involved in the axonal transport of the COXIV mRNA, we used the biotinylated 38-nucleotide COXIV RNA zipcode as bait in the affinity purification of COXIV zipcode binding proteins. Gel-shift assays of the biotinylated COXIV zipcode indicated that the putative stem-loop structure functions as a nucleation site for the formation of ribonucleoprotein complexes. Mass spectrometric analysis of the COXIV zipcode ribonucleoprotein complex led to the identification of a large number RNA binding proteins, including fused in sarcoma/translated in liposarcoma (FUS/TLS), and Y-box protein 1 (YB-1). Validation experiments, using western analyses, confirmed the presence of the candidate proteins in the COXIV zipcode affinity purified complexes obtained from SCG axons. Immunohistochemical studies show that FUS, and YB-1 are present in SCG axons. Importantly, RNA immunoprecipitation studies show that FUS, and YB-1 interact with endogenous axonal COXIV transcripts. siRNA-mediated downregulation of the candidate proteins FUS and YB-1 expression in the cell-bodies diminishes the levels of COXIV mRNA in the axon, suggesting functional roles for these proteins in the axonal trafficking of COXIV mRNA. PMID:28161363

  19. Microfluidic chip based micro RNA detection through the combination of fluorescence and surface enhanced Raman scattering techniques

    NASA Astrophysics Data System (ADS)

    Wang, Zhile; Zong, Shenfei; Wang, Zhuyuan; Wu, Lei; Chen, Peng; Yun, Binfeng; Cui, Yiping

    2017-03-01

    We present a novel microfluidic chip based method for the detection of micro RNA (miRNA) via the combination of fluorescence and surface enhanced Raman scattering (SERS) spectroscopies. First, silver nanoparticles (Ag NPs) are immobilized onto a glass slide, forming a SERS enhancing substrate. Then a specificially designed molecular beacon (MB) is attached to the SERS substrate. The 3‧ end of the MB is decorated with a thiol group to facilitate the attachment of the MB, while the 5‧ end of the MB is labeled with an organic dye 6-FAM, which is used both as the fluorophore and SERS reporter. In the absence of target miRNA, the MB will form a hairpin structure, making 6-FAM close to the Ag NPs. Hence, the fluorescence of 6-FAM will be quenched and the Raman signal of 6-FAM will be enhanced. On the contrary, with target miRNA present, hybridization between the miRNA and MB will unfold the MB and increase the distance between 6-FAM and the Ag NPs. Thus the fluorescence of 6-FAM will recover and the SERS signal of 6-FAM will decrease. So the target miRNA will simultaneously introduce opposite changing trends in the intensities of the fluorescence and SERS signals. By combining the opposite changes in the two optical spectra, an improved sensitivity and linearity toward the target miRNA is achieved as compared with using solely fluorescence or SERS. Moreover, introducing the microfluidic chip can reduce the reaction time, reagent dosage and complexity of detection. With the improved sensitivity and simplicity, we anticipate that the presented method can have great potential in the investigation of miRNA related diseases.

  20. Genome-wide identification of long non-coding RNA and mRNA profiling using RNA sequencing in subjects with sensitive skin

    PubMed Central

    Tu, Ying; Xu, Dan; Feng, Jiaqi; He, Li

    2017-01-01

    Sensitive skin (SS) is a condition of subjective cutaneous hyper-reactivity. The role of long non-coding RNAs (lncRNAs) in subjects with SS is unclear. Therefore, the aim of the present study was to provide a comprehensive profile of the mRNAs and lncRNAs in subjects with SS. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis presented the characteristics of associated protein-coding genes. In addition, a co-expression network of lncRNA and mRNA was constructed to identify potential underlying regulation targets; the results were verified by quantitative real-time PCR (qRT-PCR) and RNA-seq analyses in patients with SS and normal samples. Compared with the normal skin group, 266 novel lncRNAs and 6750 annotated lncRNAs were identified in the SS group. A total of 71 lncRNA transcripts and 2615 mRNA transcripts were differentially expressed (P < 0.05). The heat signature of the SS samples could be distinguished from the normal skin samples, whereas the majority of the genes that were present in enriched pathways were those that participated in focal adhesion, PI3K-Akt signaling, and cancer-related pathways. Five transcripts were selected for qRT-PCR analysis and the results were consistent with RNA-seq. The results suggested that LNC_000265 may play a role in the epidermal barrier structure of patient with SS. The data suggest novel genes and pathways that may be involved in the pathogenesis of SS and highlight potential targets that could be used for individualized treatment applications. PMID:29383128

Top