rna structures including: Topics by Science.gov

Sample records for rna structures including

High-throughput determination of RNA structure by proximity ligation.

PubMed

Ramani, Vijay; Qiu, Ruolan; Shendure, Jay

2015-09-01

We present an unbiased method to globally resolve RNA structures through pairwise contact measurements between interacting regions. RNA proximity ligation (RPL) uses proximity ligation of native RNA followed by deep sequencing to yield chimeric reads with ligation junctions in the vicinity of structurally proximate bases. We apply RPL in both baker's yeast (Saccharomyces cerevisiae) and human cells and generate contact probability maps for ribosomal and other abundant RNAs, including yeast snoRNAs, the RNA subunit of the signal recognition particle and the yeast U2 spliceosomal RNA homolog. RPL measurements correlate with established secondary structures for these RNA molecules, including stem-loop structures and long-range pseudoknots. We anticipate that RPL will complement the current repertoire of computational and experimental approaches in enabling the high-throughput determination of secondary and tertiary RNA structures.
RNA folding: structure prediction, folding kinetics and ion electrostatics.

PubMed

Tan, Zhijie; Zhang, Wenbing; Shi, Yazhou; Wang, Fenghua

2015-01-01

Beyond the "traditional" functions such as gene storage, transport and protein synthesis, recent discoveries reveal that RNAs have important "new" biological functions including the RNA silence and gene regulation of riboswitch. Such functions of noncoding RNAs are strongly coupled to the RNA structures and proper structure change, which naturally leads to the RNA folding problem including structure prediction and folding kinetics. Due to the polyanionic nature of RNAs, RNA folding structure, stability and kinetics are strongly coupled to the ion condition of solution. The main focus of this chapter is to review the recent progress in the three major aspects in RNA folding problem: structure prediction, folding kinetics and ion electrostatics. This chapter will introduce both the recent experimental and theoretical progress, while emphasize the theoretical modelling on the three aspects in RNA folding.
Regulatory effects of cotranscriptional RNA structure formation and transitions.

PubMed

Liu, Sheng-Rui; Hu, Chun-Gen; Zhang, Jin-Zhi

2016-09-01

RNAs, which play significant roles in many fundamental biological processes of life, fold into sophisticated and precise structures. RNA folding is a dynamic and intricate process, which conformation transition of coding and noncoding RNAs form the primary elements of genetic regulation. The cellular environment contains various intrinsic and extrinsic factors that potentially affect RNA folding in vivo, and experimental and theoretical evidence increasingly indicates that the highly flexible features of the RNA structure are affected by these factors, which include the flanking sequence context, physiochemical conditions, cis RNA-RNA interactions, and RNA interactions with other molecules. Furthermore, distinct RNA structures have been identified that govern almost all steps of biological processes in cells, including transcriptional activation and termination, transcriptional mutagenesis, 5'-capping, splicing, 3'-polyadenylation, mRNA export and localization, and translation. Here, we briefly summarize the dynamic and complex features of RNA folding along with a wide variety of intrinsic and extrinsic factors that affect RNA folding. We then provide several examples to elaborate RNA structure-mediated regulation at the transcriptional and posttranscriptional levels. Finally, we illustrate the regulatory roles of RNA structure and discuss advances pertaining to RNA structure in plants. WIREs RNA 2016, 7:562-574. doi: 10.1002/wrna.1350 For further resources related to this article, please visit the WIREs website. © 2016 Wiley Periodicals, Inc.
Probing Xist RNA Structure in Cells Using Targeted Structure-Seq

PubMed Central

Rutenberg-Schoenberg, Michael; Simon, Matthew D.

2015-01-01

The long non-coding RNA (lncRNA) Xist is a master regulator of X-chromosome inactivation in mammalian cells. Models for how Xist and other lncRNAs function depend on thermodynamically stable secondary and higher-order structures that RNAs can form in the context of a cell. Probing accessible RNA bases can provide data to build models of RNA conformation that provide insight into RNA function, molecular evolution, and modularity. To study the structure of Xist in cells, we built upon recent advances in RNA secondary structure mapping and modeling to develop Targeted Structure-Seq, which combines chemical probing of RNA structure in cells with target-specific massively parallel sequencing. By enriching for signals from the RNA of interest, Targeted Structure-Seq achieves high coverage of the target RNA with relatively few sequencing reads, thus providing a targeted and scalable approach to analyze RNA conformation in cells. We use this approach to probe the full-length Xist lncRNA to develop new models for functional elements within Xist, including the repeat A element in the 5’-end of Xist. This analysis also identified new structural elements in Xist that are evolutionarily conserved, including a new element proximal to the C repeats that is important for Xist function. PMID:26646615
RNA-Puzzles Round III: 3D RNA structure prediction of five riboswitches and one ribozyme

PubMed Central

Biesiada, Marcin; Boniecki, Michał J.; Chou, Fang-Chieh; Ferré-D'Amaré, Adrian R.; Das, Rhiju; Dunin-Horkawicz, Stanisław; Geniesse, Caleb; Kappel, Kalli; Kladwang, Wipapat; Krokhotin, Andrey; Łach, Grzegorz E.; Major, François; Mann, Thomas H.; Pachulska-Wieczorek, Katarzyna; Patel, Dinshaw J.; Piccirilli, Joseph A.; Popenda, Mariusz; Purzycka, Katarzyna J.; Ren, Aiming; Rice, Greggory M.; Santalucia, John; Tandon, Arpit; Trausch, Jeremiah J.; Wang, Jian; Weeks, Kevin M.; Williams, Benfeard; Xiao, Yi; Zhang, Dong; Zok, Tomasz

2017-01-01

RNA-Puzzles is a collective experiment in blind 3D RNA structure prediction. We report here a third round of RNA-Puzzles. Five puzzles, 4, 8, 12, 13, 14, all structures of riboswitch aptamers and puzzle 7, a ribozyme structure, are included in this round of the experiment. The riboswitch structures include biological binding sites for small molecules (S-adenosyl methionine, cyclic diadenosine monophosphate, 5-amino 4-imidazole carboxamide riboside 5′-triphosphate, glutamine) and proteins (YbxF), and one set describes large conformational changes between ligand-free and ligand-bound states. The Varkud satellite ribozyme is the most recently solved structure of a known large ribozyme. All puzzles have established biological functions and require structural understanding to appreciate their molecular mechanisms. Through the use of fast-track experimental data, including multidimensional chemical mapping, and accurate prediction of RNA secondary structure, a large portion of the contacts in 3D have been predicted correctly leading to similar topologies for the top ranking predictions. Template-based and homology-derived predictions could predict structures to particularly high accuracies. However, achieving biological insights from de novo prediction of RNA 3D structures still depends on the size and complexity of the RNA. Blind computational predictions of RNA structures already appear to provide useful structural information in many cases. Similar to the previous RNA-Puzzles Round II experiment, the prediction of non-Watson–Crick interactions and the observed high atomic clash scores reveal a notable need for an algorithm of improvement. All prediction models and assessment results are available at http://ahsoka.u-strasbg.fr/rnapuzzles/. PMID:28138060
RNAstructure: software for RNA secondary structure prediction and analysis.

PubMed

Reuter, Jessica S; Mathews, David H

2010-03-15

To understand an RNA sequence's mechanism of action, the structure must be known. Furthermore, target RNA structure is an important consideration in the design of small interfering RNAs and antisense DNA oligonucleotides. RNA secondary structure prediction, using thermodynamics, can be used to develop hypotheses about the structure of an RNA sequence. RNAstructure is a software package for RNA secondary structure prediction and analysis. It uses thermodynamics and utilizes the most recent set of nearest neighbor parameters from the Turner group. It includes methods for secondary structure prediction (using several algorithms), prediction of base pair probabilities, bimolecular structure prediction, and prediction of a structure common to two sequences. This contribution describes new extensions to the package, including a library of C++ classes for incorporation into other programs, a user-friendly graphical user interface written in JAVA, and new Unix-style text interfaces. The original graphical user interface for Microsoft Windows is still maintained. The extensions to RNAstructure serve to make RNA secondary structure prediction user-friendly. The package is available for download from the Mathews lab homepage at http://rna.urmc.rochester.edu/RNAstructure.html.
SimRNA: a coarse-grained method for RNA folding simulations and 3D structure prediction.

PubMed

Boniecki, Michal J; Lach, Grzegorz; Dawson, Wayne K; Tomala, Konrad; Lukasz, Pawel; Soltysinski, Tomasz; Rother, Kristian M; Bujnicki, Janusz M

2016-04-20

RNA molecules play fundamental roles in cellular processes. Their function and interactions with other biomolecules are dependent on the ability to form complex three-dimensional (3D) structures. However, experimental determination of RNA 3D structures is laborious and challenging, and therefore, the majority of known RNAs remain structurally uncharacterized. Here, we present SimRNA: a new method for computational RNA 3D structure prediction, which uses a coarse-grained representation, relies on the Monte Carlo method for sampling the conformational space, and employs a statistical potential to approximate the energy and identify conformations that correspond to biologically relevant structures. SimRNA can fold RNA molecules using only sequence information, and, on established test sequences, it recapitulates secondary structure with high accuracy, including correct prediction of pseudoknots. For modeling of complex 3D structures, it can use additional restraints, derived from experimental or computational analyses, including information about secondary structure and/or long-range contacts. SimRNA also can be used to analyze conformational landscapes and identify potential alternative structures. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Vfold: a web server for RNA structure and folding thermodynamics prediction.

PubMed

Xu, Xiaojun; Zhao, Peinan; Chen, Shi-Jie

2014-01-01

The ever increasing discovery of non-coding RNAs leads to unprecedented demand for the accurate modeling of RNA folding, including the predictions of two-dimensional (base pair) and three-dimensional all-atom structures and folding stabilities. Accurate modeling of RNA structure and stability has far-reaching impact on our understanding of RNA functions in human health and our ability to design RNA-based therapeutic strategies. The Vfold server offers a web interface to predict (a) RNA two-dimensional structure from the nucleotide sequence, (b) three-dimensional structure from the two-dimensional structure and the sequence, and (c) folding thermodynamics (heat capacity melting curve) from the sequence. To predict the two-dimensional structure (base pairs), the server generates an ensemble of structures, including loop structures with the different intra-loop mismatches, and evaluates the free energies using the experimental parameters for the base stacks and the loop entropy parameters given by a coarse-grained RNA folding model (the Vfold model) for the loops. To predict the three-dimensional structure, the server assembles the motif scaffolds using structure templates extracted from the known PDB structures and refines the structure using all-atom energy minimization. The Vfold-based web server provides a user friendly tool for the prediction of RNA structure and stability. The web server and the source codes are freely accessible for public use at "http://rna.physics.missouri.edu".
RNApdbee 2.0: multifunctional tool for RNA structure annotation.

PubMed

Zok, Tomasz; Antczak, Maciej; Zurkowski, Michal; Popenda, Mariusz; Blazewicz, Jacek; Adamiak, Ryszard W; Szachniuk, Marta

2018-04-30

In the field of RNA structural biology and bioinformatics, an access to correctly annotated RNA structure is of crucial importance, especially in the secondary and 3D structure predictions. RNApdbee webserver, introduced in 2014, primarily aimed to address the problem of RNA secondary structure extraction from the PDB files. Its new version, RNApdbee 2.0, is a highly advanced multifunctional tool for RNA structure annotation, revealing the relationship between RNA secondary and 3D structure given in the PDB or PDBx/mmCIF format. The upgraded version incorporates new algorithms for recognition and classification of high-ordered pseudoknots in large RNA structures. It allows analysis of isolated base pairs impact on RNA structure. It can visualize RNA secondary structures-including that of quadruplexes-with depiction of non-canonical interactions. It also annotates motifs to ease identification of stems, loops and single-stranded fragments in the input RNA structure. RNApdbee 2.0 is implemented as a publicly available webserver with an intuitive interface and can be freely accessed at http://rnapdbee.cs.put.poznan.pl/.
New families of human regulatory RNA structures identified by comparative analysis of vertebrate genomes.

PubMed

Parker, Brian J; Moltke, Ida; Roth, Adam; Washietl, Stefan; Wen, Jiayu; Kellis, Manolis; Breaker, Ronald; Pedersen, Jakob Skou

2011-11-01

Regulatory RNA structures are often members of families with multiple paralogous instances across the genome. Family members share functional and structural properties, which allow them to be studied as a whole, facilitating both bioinformatic and experimental characterization. We have developed a comparative method, EvoFam, for genome-wide identification of families of regulatory RNA structures, based on primary sequence and secondary structure similarity. We apply EvoFam to a 41-way genomic vertebrate alignment. Genome-wide, we identify 220 human, high-confidence families outside protein-coding regions comprising 725 individual structures, including 48 families with known structural RNA elements. Known families identified include both noncoding RNAs, e.g., miRNAs and the recently identified MALAT1/MEN β lincRNA family; and cis-regulatory structures, e.g., iron-responsive elements. We also identify tens of new families supported by strong evolutionary evidence and other statistical evidence, such as GO term enrichments. For some of these, detailed analysis has led to the formulation of specific functional hypotheses. Examples include two hypothesized auto-regulatory feedback mechanisms: one involving six long hairpins in the 3'-UTR of MAT2A, a key metabolic gene that produces the primary human methyl donor S-adenosylmethionine; the other involving a tRNA-like structure in the intron of the tRNA maturation gene POP1. We experimentally validate the predicted MAT2A structures. Finally, we identify potential new regulatory networks, including large families of short hairpins enriched in immunity-related genes, e.g., TNF, FOS, and CTLA4, which include known transcript destabilizing elements. Our findings exemplify the diversity of post-transcriptional regulation and provide a resource for further characterization of new regulatory mechanisms and families of noncoding RNAs.
Methods to enable the design of bioactive small molecules targeting RNA

PubMed Central

Disney, Matthew D.; Yildirim, Ilyas; Childs-Disney, Jessica L.

2014-01-01

RNA is an immensely important target for small molecule therapeutics or chemical probes of function. However, methods that identify, annotate, and optimize RNA-small molecule interactions that could enable the design of compounds that modulate RNA function are in their infancies. This review describes recent approaches that have been developed to understand and optimize RNA motif-small molecule interactions, including Structure-Activity Relationships Through Sequencing (StARTS), quantitative structure-activity relationships (QSAR), chemical similarity searching, structure-based design and docking, and molecular dynamics (MD) simulations. Case studies described include the design of small molecules targeting RNA expansions, the bacterial A-site, viral RNAs, and telomerase RNA. These approaches can be combined to afford a synergistic method to exploit the myriad of RNA targets in the transcriptome. PMID:24357181
Methods to enable the design of bioactive small molecules targeting RNA.

PubMed

Disney, Matthew D; Yildirim, Ilyas; Childs-Disney, Jessica L

2014-02-21

RNA is an immensely important target for small molecule therapeutics or chemical probes of function. However, methods that identify, annotate, and optimize RNA-small molecule interactions that could enable the design of compounds that modulate RNA function are in their infancies. This review describes recent approaches that have been developed to understand and optimize RNA motif-small molecule interactions, including structure-activity relationships through sequencing (StARTS), quantitative structure-activity relationships (QSAR), chemical similarity searching, structure-based design and docking, and molecular dynamics (MD) simulations. Case studies described include the design of small molecules targeting RNA expansions, the bacterial A-site, viral RNAs, and telomerase RNA. These approaches can be combined to afford a synergistic method to exploit the myriad of RNA targets in the transcriptome.
ClaRNA: a classifier of contacts in RNA 3D structures based on a comparative analysis of various classification schemes

PubMed Central

Waleń, Tomasz; Chojnowski, Grzegorz; Gierski, Przemysław; Bujnicki, Janusz M.

2014-01-01

The understanding of folding and function of RNA molecules depends on the identification and classification of interactions between ribonucleotide residues. We developed a new method named ClaRNA for computational classification of contacts in RNA 3D structures. Unique features of the program are the ability to identify imperfect contacts and to process coarse-grained models. Each doublet of spatially close ribonucleotide residues in a query structure is compared to clusters of reference doublets obtained by analysis of a large number of experimentally determined RNA structures, and assigned a score that describes its similarity to one or more known types of contacts, including pairing, stacking, base–phosphate and base–ribose interactions. The accuracy of ClaRNA is 0.997 for canonical base pairs, 0.983 for non-canonical pairs and 0.961 for stacking interactions. The generalized squared correlation coefficient (GC2) for ClaRNA is 0.969 for canonical base pairs, 0.638 for non-canonical pairs and 0.824 for stacking interactions. The classifier can be easily extended to include new types of spatial relationships between pairs or larger assemblies of nucleotide residues. ClaRNA is freely available via a web server that includes an extensive set of tools for processing and visualizing structural information about RNA molecules. PMID:25159614
Freiburg RNA tools: a central online resource for RNA-focused research and teaching.

PubMed

Raden, Martin; Ali, Syed M; Alkhnbashi, Omer S; Busch, Anke; Costa, Fabrizio; Davis, Jason A; Eggenhofer, Florian; Gelhausen, Rick; Georg, Jens; Heyne, Steffen; Hiller, Michael; Kundu, Kousik; Kleinkauf, Robert; Lott, Steffen C; Mohamed, Mostafa M; Mattheis, Alexander; Miladi, Milad; Richter, Andreas S; Will, Sebastian; Wolff, Joachim; Wright, Patrick R; Backofen, Rolf

2018-05-21

The Freiburg RNA tools webserver is a well established online resource for RNA-focused research. It provides a unified user interface and comprehensive result visualization for efficient command line tools. The webserver includes RNA-RNA interaction prediction (IntaRNA, CopraRNA, metaMIR), sRNA homology search (GLASSgo), sequence-structure alignments (LocARNA, MARNA, CARNA, ExpaRNA), CRISPR repeat classification (CRISPRmap), sequence design (antaRNA, INFO-RNA, SECISDesign), structure aberration evaluation of point mutations (RaSE), and RNA/protein-family models visualization (CMV), and other methods. Open education resources offer interactive visualizations of RNA structure and RNA-RNA interaction prediction as well as basic and advanced sequence alignment algorithms. The services are freely available at http://rna.informatik.uni-freiburg.de.
Structural imprints in vivo decode RNA regulatory mechanisms.

PubMed

Spitale, Robert C; Flynn, Ryan A; Zhang, Qiangfeng Cliff; Crisalli, Pete; Lee, Byron; Jung, Jong-Wha; Kuchelmeister, Hannes Y; Batista, Pedro J; Torre, Eduardo A; Kool, Eric T; Chang, Howard Y

2015-03-26

Visualizing the physical basis for molecular behaviour inside living cells is a great challenge for biology. RNAs are central to biological regulation, and the ability of RNA to adopt specific structures intimately controls every step of the gene expression program. However, our understanding of physiological RNA structures is limited; current in vivo RNA structure profiles include only two of the four nucleotides that make up RNA. Here we present a novel biochemical approach, in vivo click selective 2'-hydroxyl acylation and profiling experiment (icSHAPE), which enables the first global view, to our knowledge, of RNA secondary structures in living cells for all four bases. icSHAPE of the mouse embryonic stem cell transcriptome versus purified RNA folded in vitro shows that the structural dynamics of RNA in the cellular environment distinguish different classes of RNAs and regulatory elements. Structural signatures at translational start sites and ribosome pause sites are conserved from in vitro conditions, suggesting that these RNA elements are programmed by sequence. In contrast, focal structural rearrangements in vivo reveal precise interfaces of RNA with RNA-binding proteins or RNA-modification sites that are consistent with atomic-resolution structural data. Such dynamic structural footprints enable accurate prediction of RNA-protein interactions and N(6)-methyladenosine (m(6)A) modification genome wide. These results open the door for structural genomics of RNA in living cells and reveal key physiological structures controlling gene expression.
Using RNA Sequence and Structure for the Prediction of Riboswitch Aptamer: A Comprehensive Review of Available Software and Tools

PubMed Central

Antunes, Deborah; Jorge, Natasha A. N.; Caffarena, Ernesto R.; Passetti, Fabio

2018-01-01

RNA molecules are essential players in many fundamental biological processes. Prokaryotes and eukaryotes have distinct RNA classes with specific structural features and functional roles. Computational prediction of protein structures is a research field in which high confidence three-dimensional protein models can be proposed based on the sequence alignment between target and templates. However, to date, only a few approaches have been developed for the computational prediction of RNA structures. Similar to proteins, RNA structures may be altered due to the interaction with various ligands, including proteins, other RNAs, and metabolites. A riboswitch is a molecular mechanism, found in the three kingdoms of life, in which the RNA structure is modified by the binding of a metabolite. It can regulate multiple gene expression mechanisms, such as transcription, translation initiation, and mRNA splicing and processing. Due to their nature, these entities also act on the regulation of gene expression and detection of small metabolites and have the potential to helping in the discovery of new classes of antimicrobial agents. In this review, we describe software and web servers currently available for riboswitch aptamer identification and secondary and tertiary structure prediction, including applications. PMID:29403526
bpRNA: large-scale automated annotation and analysis of RNA secondary structure.

PubMed

Danaee, Padideh; Rouches, Mason; Wiley, Michelle; Deng, Dezhong; Huang, Liang; Hendrix, David

2018-05-09

While RNA secondary structure prediction from sequence data has made remarkable progress, there is a need for improved strategies for annotating the features of RNA secondary structures. Here, we present bpRNA, a novel annotation tool capable of parsing RNA structures, including complex pseudoknot-containing RNAs, to yield an objective, precise, compact, unambiguous, easily-interpretable description of all loops, stems, and pseudoknots, along with the positions, sequence, and flanking base pairs of each such structural feature. We also introduce several new informative representations of RNA structure types to improve structure visualization and interpretation. We have further used bpRNA to generate a web-accessible meta-database, 'bpRNA-1m', of over 100 000 single-molecule, known secondary structures; this is both more fully and accurately annotated and over 20-times larger than existing databases. We use a subset of the database with highly similar (≥90% identical) sequences filtered out to report on statistical trends in sequence, flanking base pairs, and length. Both the bpRNA method and the bpRNA-1m database will be valuable resources both for specific analysis of individual RNA molecules and large-scale analyses such as are useful for updating RNA energy parameters for computational thermodynamic predictions, improving machine learning models for structure prediction, and for benchmarking structure-prediction algorithms.
Small-interfering RNA (siRNA)-based functional micro- and nanostructures for efficient and selective gene silencing.

PubMed

Lee, Soo Hyeon; Chung, Bong Hyun; Park, Tae Gwan; Nam, Yoon Sung; Mok, Hyejung

2012-07-17

Because of RNA's ability to encode structure and functional information, researchers have fabricated diverse geometric structures from this polymer at the micro- and nanoscale. With their tunable structures, rigidity, and biocompatibility, novel two-dimensional and three-dimensional RNA structures can serve as a fundamental platform for biomedical applications, including engineered tissues, biosensors, and drug delivery vehicles. The discovery of the potential of small-interfering RNA (siRNA) has underscored the applications of RNA-based micro- and nanostructures in medicine. Small-interfering RNA (siRNA), synthetic double-stranded RNA consisting of approximately 21 base pairs, suppresses problematic target genes in a sequence-specific manner via inherent RNA interference (RNAi) processing. As a result, siRNA offers a potential strategy for treatment of many human diseases. However, due to inefficient delivery to cells and off-target effects, the clinical application of therapeutic siRNA has been very challenging. To address these issues, researchers have studied a variety of nanocarrier systems for siRNA delivery. In this Account, we describe several strategies for efficient siRNA delivery and selective gene silencing. We took advantage of facile chemical conjugation and complementary hybridization to design novel siRNA-based micro- and nanostructures. Using chemical crosslinkers and hydrophobic/hydrophilic polymers at the end of siRNA, we produced various RNA-based structures, including siRNA block copolymers, micelles, linear siRNA homopolymers, and microhydrogels. Because of their increased charge density and flexibility compared with conventional siRNA, these micro- and nanostructures can form polyelectrolyte complexes with poorly charged and biocompatible cationic carriers that are both more condensed and more homogenous than the complexes formed in other carrier systems. In addition, the fabricated siRNA-based structures are linked by cleavable disulfide bonds for facile generation of original siRNA in the cytosol and for target-specific gene silencing. These newly developed siRNA-based structures greatly enhance intracellular uptake and gene silencing both in vitro and in vivo, making them promising biomaterials for siRNA therapeutics.
Inverted repeat Alu elements in the human lincRNA-p21 adopt a conserved secondary structure that regulates RNA function

PubMed Central

Chillón, Isabel; Pyle, Anna M.

2016-01-01

LincRNA-p21 is a long intergenic non-coding RNA (lincRNA) involved in the p53-mediated stress response. We sequenced the human lincRNA-p21 (hLincRNA-p21) and found that it has a single exon that includes inverted repeat Alu elements (IRAlus). Sense and antisense Alu elements fold independently of one another into a secondary structure that is conserved in lincRNA-p21 among primates. Moreover, the structures formed by IRAlus are involved in the localization of hLincRNA-p21 in the nucleus, where hLincRNA-p21 colocalizes with paraspeckles. Our results underscore the importance of IRAlus structures for the function of hLincRNA-p21 during the stress response. PMID:27378782
Visualization of RNA structure models within the Integrative Genomics Viewer.

PubMed

Busan, Steven; Weeks, Kevin M

2017-07-01

Analyses of the interrelationships between RNA structure and function are increasingly important components of genomic studies. The SHAPE-MaP strategy enables accurate RNA structure probing and realistic structure modeling of kilobase-length noncoding RNAs and mRNAs. Existing tools for visualizing RNA structure models are not suitable for efficient analysis of long, structurally heterogeneous RNAs. In addition, structure models are often advantageously interpreted in the context of other experimental data and gene annotation information, for which few tools currently exist. We have developed a module within the widely used and well supported open-source Integrative Genomics Viewer (IGV) that allows visualization of SHAPE and other chemical probing data, including raw reactivities, data-driven structural entropies, and data-constrained base-pair secondary structure models, in context with linear genomic data tracks. We illustrate the usefulness of visualizing RNA structure in the IGV by exploring structure models for a large viral RNA genome, comparing bacterial mRNA structure in cells with its structure under cell- and protein-free conditions, and comparing a noncoding RNA structure modeled using SHAPE data with a base-pairing model inferred through sequence covariation analysis. © 2017 Busan and Weeks; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

Improve the prediction of RNA-binding residues using structural neighbours.

PubMed

Li, Quan; Cao, Zanxia; Liu, Haiyan

2010-03-01

The interactions between RNA-binding proteins (RBPs) with RNA play key roles in managing some of the cell's basic functions. The identification and prediction of RNA binding sites is important for understanding the RNA-binding mechanism. Computational approaches are being developed to predict RNA-binding residues based on the sequence- or structure-derived features. To achieve higher prediction accuracy, improvements on current prediction methods are necessary. We identified that the structural neighbors of RNA-binding and non-RNA-binding residues have different amino acid compositions. Combining this structure-derived feature with evolutionary (PSSM) and other structural information (secondary structure and solvent accessibility) significantly improves the predictions over existing methods. Using a multiple linear regression approach and 6-fold cross validation, our best model can achieve an overall correct rate of 87.8% and MCC of 0.47, with a specificity of 93.4%, correctly predict 52.4% of the RNA-binding residues for a dataset containing 107 non-homologous RNA-binding proteins. Compared with existing methods, including the amino acid compositions of structure neighbors lead to clearly improvement. A web server was developed for predicting RNA binding residues in a protein sequence (or structure),which is available at http://mcgill.3322.org/RNA/.
Rose spring dwarf-associated virus has RNA structural and gene-expression features like those of Barley yellow dwarf virus

PubMed Central

Salem, Nida’ M.; Miller, W. Allen; Rowhani, Adib; Golino, Deborah A.; Moyne, Anne-Laure; Falk, Bryce W.

2015-01-01

We determined the complete nucleotide sequence of the Rose spring dwarf-associated virus (RSDaV) genomic RNA (GenBank accession no. EU024678) and compared its predicted RNA structural characteristics affecting gene expression. A cDNA library was derived from RSDaV double-stranded RNAs (dsRNAs) purified from infected tissue. Nucleotide sequence analysis of the cloned cDNAs, plus for clones generated by 5′- and 3′-RACE showed the RSDaV genomic RNA to be 5,808 nucleotides. The genomic RNA contains five major open reading frames (ORFs), and three small ORFs in the 3′-terminal 800 nucleotides, typical for viruses of genus Luteovirus in the family Luteoviridae. Northern blot hybridization analysis revealed the genomic RNA and two prominent subgenomic RNAs of approximately 3 kb and 1 kb. Putative 5′ ends of the sgRNAs were predicted by identification of conserved sequences and secondary structures which resembled the Barley yellow dwarf virus (BYDV) genomic RNA 5′ end and subgenomic RNA promoter sequences. Secondary structures of the BYDV-like ribosomal frameshift elements and cap-independent translation elements, including long-distance base pairing spanning four kb were identified. These contain similarities but also informative differences with the BYDV structures, including a strikingly different structure predicted for the 3′ cap-independent translation element. These analyses of the RSDaV genomic RNA show more complexity for the RNA structural elements for members of the Luteoviridae. PMID:18329064
Rose spring dwarf-associated virus has RNA structural and gene-expression features like those of Barley yellow dwarf virus.

PubMed

Salem, Nida' M; Miller, W Allen; Rowhani, Adib; Golino, Deborah A; Moyne, Anne-Laure; Falk, Bryce W

2008-06-05

We determined the complete nucleotide sequence of the Rose spring dwarf-associated virus (RSDaV) genomic RNA (GenBank accession no. EU024678) and compared its predicted RNA structural characteristics affecting gene expression. A cDNA library was derived from RSDaV double-stranded RNAs (dsRNAs) purified from infected tissue. Nucleotide sequence analysis of the cloned cDNAs, plus for clones generated by 5'- and 3'-RACE showed the RSDaV genomic RNA to be 5808 nucleotides. The genomic RNA contains five major open reading frames (ORFs), and three small ORFs in the 3'-terminal 800 nucleotides, typical for viruses of genus Luteovirus in the family Luteoviridae. Northern blot hybridization analysis revealed the genomic RNA and two prominent subgenomic RNAs of approximately 3 kb and 1 kb. Putative 5' ends of the sgRNAs were predicted by identification of conserved sequences and secondary structures which resembled the Barley yellow dwarf virus (BYDV) genomic RNA 5' end and subgenomic RNA promoter sequences. Secondary structures of the BYDV-like ribosomal frameshift elements and cap-independent translation elements, including long-distance base pairing spanning four kb were identified. These contain similarities but also informative differences with the BYDV structures, including a strikingly different structure predicted for the 3' cap-independent translation element. These analyses of the RSDaV genomic RNA show more complexity for the RNA structural elements for members of the Luteoviridae.
The RNA synthesis machinery of negative-stranded RNA viruses

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ortín, Juan, E-mail: jortin@cnb.csic.es; Martín-Benito, Jaime, E-mail: jmartinb@cnb.csic.es

The group of Negative-Stranded RNA Viruses (NSVs) includes many human pathogens, like the influenza, measles, mumps, respiratory syncytial or Ebola viruses, which produce frequent epidemics of disease and occasional, high mortality outbreaks by transmission from animal reservoirs. The genome of NSVs consists of one to several single-stranded, negative-polarity RNA molecules that are always assembled into mega Dalton-sized complexes by association to many nucleoprotein monomers. These RNA-protein complexes or ribonucleoproteins function as templates for transcription and replication by action of the viral RNA polymerase and accessory proteins. Here we review our knowledge on these large RNA-synthesis machines, including the structure ofmore » their components, the interactions among them and their enzymatic activities, and we discuss models showing how they perform the virus transcription and replication programmes. - Highlights: • Overall organisation of NSV RNA synthesis machines. • Structure and function of the ribonucleoprotein components: Atomic structure of the RNA polymerase complex. • Commonalities and differences between segmented- and non-segmented NSVs. • Transcription versus replication programmes.« less
Swellix: a computational tool to explore RNA conformational space.

PubMed

Sloat, Nathan; Liu, Jui-Wen; Schroeder, Susan J

2017-11-21

The sequence of nucleotides in an RNA determines the possible base pairs for an RNA fold and thus also determines the overall shape and function of an RNA. The Swellix program presented here combines a helix abstraction with a combinatorial approach to the RNA folding problem in order to compute all possible non-pseudoknotted RNA structures for RNA sequences. The Swellix program builds on the Crumple program and can include experimental constraints on global RNA structures such as the minimum number and lengths of helices from crystallography, cryoelectron microscopy, or in vivo crosslinking and chemical probing methods. The conceptual advance in Swellix is to count helices and generate all possible combinations of helices rather than counting and combining base pairs. Swellix bundles similar helices and includes improvements in memory use and efficient parallelization. Biological applications of Swellix are demonstrated by computing the reduction in conformational space and entropy due to naturally modified nucleotides in tRNA sequences and by motif searches in Human Endogenous Retroviral (HERV) RNA sequences. The Swellix motif search reveals occurrences of protein and drug binding motifs in the HERV RNA ensemble that do not occur in minimum free energy or centroid predicted structures. Swellix presents significant improvements over Crumple in terms of efficiency and memory use. The efficient parallelization of Swellix enables the computation of sequences as long as 418 nucleotides with sufficient experimental constraints. Thus, Swellix provides a practical alternative to free energy minimization tools when multiple structures, kinetically determined structures, or complex RNA-RNA and RNA-protein interactions are present in an RNA folding problem.
A range of complex probabilistic models for RNA secondary structure prediction that includes the nearest-neighbor model and more.

PubMed

Rivas, Elena; Lang, Raymond; Eddy, Sean R

2012-02-01

The standard approach for single-sequence RNA secondary structure prediction uses a nearest-neighbor thermodynamic model with several thousand experimentally determined energy parameters. An attractive alternative is to use statistical approaches with parameters estimated from growing databases of structural RNAs. Good results have been reported for discriminative statistical methods using complex nearest-neighbor models, including CONTRAfold, Simfold, and ContextFold. Little work has been reported on generative probabilistic models (stochastic context-free grammars [SCFGs]) of comparable complexity, although probabilistic models are generally easier to train and to use. To explore a range of probabilistic models of increasing complexity, and to directly compare probabilistic, thermodynamic, and discriminative approaches, we created TORNADO, a computational tool that can parse a wide spectrum of RNA grammar architectures (including the standard nearest-neighbor model and more) using a generalized super-grammar that can be parameterized with probabilities, energies, or arbitrary scores. By using TORNADO, we find that probabilistic nearest-neighbor models perform comparably to (but not significantly better than) discriminative methods. We find that complex statistical models are prone to overfitting RNA structure and that evaluations should use structurally nonhomologous training and test data sets. Overfitting has affected at least one published method (ContextFold). The most important barrier to improving statistical approaches for RNA secondary structure prediction is the lack of diversity of well-curated single-sequence RNA secondary structures in current RNA databases.
A range of complex probabilistic models for RNA secondary structure prediction that includes the nearest-neighbor model and more

PubMed Central

Rivas, Elena; Lang, Raymond; Eddy, Sean R.

2012-01-01

The standard approach for single-sequence RNA secondary structure prediction uses a nearest-neighbor thermodynamic model with several thousand experimentally determined energy parameters. An attractive alternative is to use statistical approaches with parameters estimated from growing databases of structural RNAs. Good results have been reported for discriminative statistical methods using complex nearest-neighbor models, including CONTRAfold, Simfold, and ContextFold. Little work has been reported on generative probabilistic models (stochastic context-free grammars [SCFGs]) of comparable complexity, although probabilistic models are generally easier to train and to use. To explore a range of probabilistic models of increasing complexity, and to directly compare probabilistic, thermodynamic, and discriminative approaches, we created TORNADO, a computational tool that can parse a wide spectrum of RNA grammar architectures (including the standard nearest-neighbor model and more) using a generalized super-grammar that can be parameterized with probabilities, energies, or arbitrary scores. By using TORNADO, we find that probabilistic nearest-neighbor models perform comparably to (but not significantly better than) discriminative methods. We find that complex statistical models are prone to overfitting RNA structure and that evaluations should use structurally nonhomologous training and test data sets. Overfitting has affected at least one published method (ContextFold). The most important barrier to improving statistical approaches for RNA secondary structure prediction is the lack of diversity of well-curated single-sequence RNA secondary structures in current RNA databases. PMID:22194308
Correlation of RNA secondary structure statistics with thermodynamic stability and applications to folding.

PubMed

Wu, Johnny C; Gardner, David P; Ozer, Stuart; Gutell, Robin R; Ren, Pengyu

2009-08-28

The accurate prediction of the secondary and tertiary structure of an RNA with different folding algorithms is dependent on several factors, including the energy functions. However, an RNA higher-order structure cannot be predicted accurately from its sequence based on a limited set of energy parameters. The inter- and intramolecular forces between this RNA and other small molecules and macromolecules, in addition to other factors in the cell such as pH, ionic strength, and temperature, influence the complex dynamics associated with transition of a single stranded RNA to its secondary and tertiary structure. Since all of the factors that affect the formation of an RNAs 3D structure cannot be determined experimentally, statistically derived potential energy has been used in the prediction of protein structure. In the current work, we evaluate the statistical free energy of various secondary structure motifs, including base-pair stacks, hairpin loops, and internal loops, using their statistical frequency obtained from the comparative analysis of more than 50,000 RNA sequences stored in the RNA Comparative Analysis Database (rCAD) at the Comparative RNA Web (CRW) Site. Statistical energy was computed from the structural statistics for several datasets. While the statistical energy for a base-pair stack correlates with experimentally derived free energy values, suggesting a Boltzmann-like distribution, variation is observed between different molecules and their location on the phylogenetic tree of life. Our statistical energy values calculated for several structural elements were utilized in the Mfold RNA-folding algorithm. The combined statistical energy values for base-pair stacks, hairpins and internal loop flanks result in a significant improvement in the accuracy of secondary structure prediction; the hairpin flanks contribute the most.
TRANSAT-- method for detecting the conserved helices of functional RNA structures, including transient, pseudo-knotted and alternative structures.

PubMed

Wiebe, Nicholas J P; Meyer, Irmtraud M

2010-06-24

The prediction of functional RNA structures has attracted increased interest, as it allows us to study the potential functional roles of many genes. RNA structure prediction methods, however, assume that there is a unique functional RNA structure and also do not predict functional features required for in vivo folding. In order to understand how functional RNA structures form in vivo, we require sophisticated experiments or reliable prediction methods. So far, there exist only a few, experimentally validated transient RNA structures. On the computational side, there exist several computer programs which aim to predict the co-transcriptional folding pathway in vivo, but these make a range of simplifying assumptions and do not capture all features known to influence RNA folding in vivo. We want to investigate if evolutionarily related RNA genes fold in a similar way in vivo. To this end, we have developed a new computational method, Transat, which detects conserved helices of high statistical significance. We introduce the method, present a comprehensive performance evaluation and show that Transat is able to predict the structural features of known reference structures including pseudo-knotted ones as well as those of known alternative structural configurations. Transat can also identify unstructured sub-sequences bound by other molecules and provides evidence for new helices which may define folding pathways, supporting the notion that homologous RNA sequence not only assume a similar reference RNA structure, but also fold similarly. Finally, we show that the structural features predicted by Transat differ from those assuming thermodynamic equilibrium. Unlike the existing methods for predicting folding pathways, our method works in a comparative way. This has the disadvantage of not being able to predict features as function of time, but has the considerable advantage of highlighting conserved features and of not requiring a detailed knowledge of the cellular environment.
Structure of Hepatitis C Virus Polymerase in Complex with Primer-Template RNA

PubMed Central

Murakami, Eisuke; Lam, Angela M.; Grice, Rena L.; Du, Jinfa; Sofia, Michael J.; Furman, Philip A.; Otto, Michael J.

2012-01-01

The replication of the hepatitis C viral (HCV) genome is accomplished by the NS5B RNA-dependent RNA polymerase (RdRp), for which mechanistic understanding and structure-guided drug design efforts have been hampered by its propensity to crystallize in a closed, polymerization-incompetent state. The removal of an autoinhibitory β-hairpin loop from genotype 2a HCV NS5B increases de novo RNA synthesis by >100-fold, promotes RNA binding, and facilitated the determination of the first crystallographic structures of HCV polymerase in complex with RNA primer-template pairs. These crystal structures demonstrate the structural realignment required for primer-template recognition and elongation, provide new insights into HCV RNA synthesis at the molecular level, and may prove useful in the structure-based design of novel antiviral compounds. Additionally, our approach for obtaining the RNA primer-template-bound structure of HCV polymerase may be generally applicable to solving RNA-bound complexes for other viral RdRps that contain similar regulatory β-hairpin loops, including bovine viral diarrhea virus, dengue virus, and West Nile virus. PMID:22496223
Atomic force microscopy of RNA: State of the art and recent advancements.

PubMed

Schön, Peter

2018-01-01

The atomic force microscope (AFM) has become a powerful tool for the visualization, probing and manipulation of RNA at the single molecule level. AFM measurements can be carried out in buffer solution in a physiological medium, which is crucial to study the structure and function of biomolecules, also allowing studying them at work. Imaging the specimen in its native state is a great advantage compared to other high resolution methods such as electron microscopy and X-ray diffraction. There is no need to stain, freeze or crystallize biological samples. Moreover, compared to NMR spectroscopy for instance, for AFM studies the size of the biomolecules is not limiting. Consequently the AFM allows one also to investigate larger RNA molecules. In particular, structural studies of nucleic acids and assemblies thereof, have been carried out by AFM routinely including ssRNA, dsRNA and nucleoprotein complexes thereof, as well as RNA aggregates and 2D RNA assemblies. These are becoming increasingly important as novel unique building blocks in the emerging field of RNA nanotechnology. In particular by AFM unique information can be obtained on these RNA based assemblies. Moreover, the AFM is of fundamental relevance to study biological relevant RNA interactions and dynamics. In this short review a brief overview will be given on structural studies that have been done related to AFM topographic imaging of RNA, RNA assemblies and aggregates. Finally, an overview on AFM beyond imaging will be provided. This includes force spectroscopy of RNA under physiological conditions in aqueous buffer to probe RNA interaction with proteins and ligands as well as other AFM tip based RNA probing. Important applications include the detection and quantification of RNA in biological samples. A selection of recent highlights and breakthroughs will be provided related to structural and functional studies by AFM. The main intention of this short review to provide the reader with a flavor of what AFM is able to contribute to RNA research and engineering. Copyright © 2017 Elsevier Ltd. All rights reserved.
RNA design rules from a massive open laboratory

PubMed Central

Lee, Jeehyung; Kladwang, Wipapat; Lee, Minjae; Cantu, Daniel; Azizyan, Martin; Kim, Hanjoo; Limpaecher, Alex; Gaikwad, Snehal; Yoon, Sungroh; Treuille, Adrien; Das, Rhiju

2014-01-01

Self-assembling RNA molecules present compelling substrates for the rational interrogation and control of living systems. However, imperfect in silico models—even at the secondary structure level—hinder the design of new RNAs that function properly when synthesized. Here, we present a unique and potentially general approach to such empirical problems: the Massive Open Laboratory. The EteRNA project connects 37,000 enthusiasts to RNA design puzzles through an online interface. Uniquely, EteRNA participants not only manipulate simulated molecules but also control a remote experimental pipeline for high-throughput RNA synthesis and structure mapping. We show herein that the EteRNA community leveraged dozens of cycles of continuous wet laboratory feedback to learn strategies for solving in vitro RNA design problems on which automated methods fail. The top strategies—including several previously unrecognized negative design rules—were distilled by machine learning into an algorithm, EteRNABot. Over a rigorous 1-y testing phase, both the EteRNA community and EteRNABot significantly outperformed prior algorithms in a dozen RNA secondary structure design tests, including the creation of dendrimer-like structures and scaffolds for small molecule sensors. These results show that an online community can carry out large-scale experiments, hypothesis generation, and algorithm design to create practical advances in empirical science. PMID:24469816
Identification and characterization of a class of MALAT1 -like genomic loci

DOE PAGES

Zhang, Bin; Mao, Yuntao S.; Diermeier, Sarah D.; ...

2017-05-23

The MALAT1 (Metastasis-Associated Lung Adenocarcinoma Transcript 1) gene encodes a noncoding RNA that is processed into a long nuclear retained transcript ( MALAT1) and a small cytoplasmic tRNA-like transcript (mascRNA). Using an RNA sequence- and structure-based covariance model, we identified more than 130 genomic loci in vertebrate genomes containing the MALAT1 3' end triple-helix structure and its immediate downstream tRNA-like structure, including 44 in the green lizard Anolis carolinensis. Structural and computational analyses revealed a co-occurrence of components of the 3' end module. MALAT1-like genes in Anolis carolinensis are highly expressed in adult testis, thus we named them testis-abundant longmore » noncoding RNAs (tancRNAs). MALAT1-like loci also produce multiple small RNA species, including PIWI-interacting RNAs (piRNAs), from the antisense strand. The 3' ends of tancRNAs serve as potential targets for the PIWI-piRNA complex. Furthermore, we have identified an evolutionarily conserved class of long noncoding RNAs (lncRNAs) with similar structural constraints, post-transcriptional processing, and subcellular localization and a distinct function in spermatocytes.« less
Identification and characterization of a class of MALAT1 -like genomic loci

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhang, Bin; Mao, Yuntao S.; Diermeier, Sarah D.

The MALAT1 (Metastasis-Associated Lung Adenocarcinoma Transcript 1) gene encodes a noncoding RNA that is processed into a long nuclear retained transcript ( MALAT1) and a small cytoplasmic tRNA-like transcript (mascRNA). Using an RNA sequence- and structure-based covariance model, we identified more than 130 genomic loci in vertebrate genomes containing the MALAT1 3' end triple-helix structure and its immediate downstream tRNA-like structure, including 44 in the green lizard Anolis carolinensis. Structural and computational analyses revealed a co-occurrence of components of the 3' end module. MALAT1-like genes in Anolis carolinensis are highly expressed in adult testis, thus we named them testis-abundant longmore » noncoding RNAs (tancRNAs). MALAT1-like loci also produce multiple small RNA species, including PIWI-interacting RNAs (piRNAs), from the antisense strand. The 3' ends of tancRNAs serve as potential targets for the PIWI-piRNA complex. Furthermore, we have identified an evolutionarily conserved class of long noncoding RNAs (lncRNAs) with similar structural constraints, post-transcriptional processing, and subcellular localization and a distinct function in spermatocytes.« less
Efficient RNA structure comparison algorithms.

PubMed

Arslan, Abdullah N; Anandan, Jithendar; Fry, Eric; Monschke, Keith; Ganneboina, Nitin; Bowerman, Jason

2017-12-01

Recently proposed relative addressing-based ([Formula: see text]) RNA secondary structure representation has important features by which an RNA structure database can be stored into a suffix array. A fast substructure search algorithm has been proposed based on binary search on this suffix array. Using this substructure search algorithm, we present a fast algorithm that finds the largest common substructure of given multiple RNA structures in [Formula: see text] format. The multiple RNA structure comparison problem is NP-hard in its general formulation. We introduced a new problem for comparing multiple RNA structures. This problem has more strict similarity definition and objective, and we propose an algorithm that solves this problem efficiently. We also develop another comparison algorithm that iteratively calls this algorithm to locate nonoverlapping large common substructures in compared RNAs. With the new resulting tools, we improved the RNASSAC website (linked from http://faculty.tamuc.edu/aarslan ). This website now also includes two drawing tools: one specialized for preparing RNA substructures that can be used as input by the search tool, and another one for automatically drawing the entire RNA structure from a given structure sequence.
RNA Bricks—a database of RNA 3D motifs and their interactions

PubMed Central

Chojnowski, Grzegorz; Waleń, Tomasz; Bujnicki, Janusz M.

2014-01-01

The RNA Bricks database (http://iimcb.genesilico.pl/rnabricks), stores information about recurrent RNA 3D motifs and their interactions, found in experimentally determined RNA structures and in RNA–protein complexes. In contrast to other similar tools (RNA 3D Motif Atlas, RNA Frabase, Rloom) RNA motifs, i.e. ‘RNA bricks’ are presented in the molecular environment, in which they were determined, including RNA, protein, metal ions, water molecules and ligands. All nucleotide residues in RNA bricks are annotated with structural quality scores that describe real-space correlation coefficients with the electron density data (if available), backbone geometry and possible steric conflicts, which can be used to identify poorly modeled residues. The database is also equipped with an algorithm for 3D motif search and comparison. The algorithm compares spatial positions of backbone atoms of the user-provided query structure and of stored RNA motifs, without relying on sequence or secondary structure information. This enables the identification of local structural similarities among evolutionarily related and unrelated RNA molecules. Besides, the search utility enables searching ‘RNA bricks’ according to sequence similarity, and makes it possible to identify motifs with modified ribonucleotide residues at specific positions. PMID:24220091
DOE Office of Scientific and Technical Information (OSTI.GOV)

Mosley, Ralph T.; Edwards, Thomas E.; Murakami, Eisuke

The replication of the hepatitis C viral (HCV) genome is accomplished by the NS5B RNA-dependent RNA polymerase (RdRp), for which mechanistic understanding and structure-guided drug design efforts have been hampered by its propensity to crystallize in a closed, polymerization-incompetent state. The removal of an autoinhibitory {beta}-hairpin loop from genotype 2a HCV NS5B increases de novo RNA synthesis by >100-fold, promotes RNA binding, and facilitated the determination of the first crystallographic structures of HCV polymerase in complex with RNA primer-template pairs. These crystal structures demonstrate the structural realignment required for primer-template recognition and elongation, provide new insights into HCV RNA synthesismore » at the molecular level, and may prove useful in the structure-based design of novel antiviral compounds. Additionally, our approach for obtaining the RNA primer-template-bound structure of HCV polymerase may be generally applicable to solving RNA-bound complexes for other viral RdRps that contain similar regulatory {beta}-hairpin loops, including bovine viral diarrhea virus, dengue virus, and West Nile virus.« less
Computational analysis of conserved RNA secondary structure in transcriptomes and genomes.

PubMed

Eddy, Sean R

2014-01-01

Transcriptomics experiments and computational predictions both enable systematic discovery of new functional RNAs. However, many putative noncoding transcripts arise instead from artifacts and biological noise, and current computational prediction methods have high false positive rates. I discuss prospects for improving computational methods for analyzing and identifying functional RNAs, with a focus on detecting signatures of conserved RNA secondary structure. An interesting new front is the application of chemical and enzymatic experiments that probe RNA structure on a transcriptome-wide scale. I review several proposed approaches for incorporating structure probing data into the computational prediction of RNA secondary structure. Using probabilistic inference formalisms, I show how all these approaches can be unified in a well-principled framework, which in turn allows RNA probing data to be easily integrated into a wide range of analyses that depend on RNA secondary structure inference. Such analyses include homology search and genome-wide detection of new structural RNAs.
Kinetic analysis of the effects of target structure on siRNA efficiency

NASA Astrophysics Data System (ADS)

Chen, Jiawen; Zhang, Wenbing

2012-12-01

RNAi efficiency for target cleavage and protein expression is related to the target structure. Considering the RNA-induced silencing complex (RISC) as a multiple turnover enzyme, we investigated the effect of target mRNA structure on siRNA efficiency with kinetic analysis. The 4-step model was used to study the target cleavage kinetic process: hybridization nucleation at an accessible target site, RISC-mRNA hybrid elongation along with mRNA target structure melting, target cleavage, and enzyme reactivation. At this model, the terms accounting for the target accessibility, stability, and the seed and the nucleation site effects are all included. The results are in good agreement with that of experiments which show different arguments about the structure effects on siRNA efficiency. It shows that the siRNA efficiency is influenced by the integrated factors of target's accessibility, stability, and the seed effects. To study the off-target effects, a simple model of one siRNA binding to two mRNA targets was designed. By using this model, the possibility for diminishing the off-target effects by the concentration of siRNA was discussed.
RNA FRABASE 2.0: an advanced web-accessible database with the capacity to search the three-dimensional fragments within RNA structures

PubMed Central

2010-01-01

Background Recent discoveries concerning novel functions of RNA, such as RNA interference, have contributed towards the growing importance of the field. In this respect, a deeper knowledge of complex three-dimensional RNA structures is essential to understand their new biological functions. A number of bioinformatic tools have been proposed to explore two major structural databases (PDB, NDB) in order to analyze various aspects of RNA tertiary structures. One of these tools is RNA FRABASE 1.0, the first web-accessible database with an engine for automatic search of 3D fragments within PDB-derived RNA structures. This search is based upon the user-defined RNA secondary structure pattern. In this paper, we present and discuss RNA FRABASE 2.0. This second version of the system represents a major extension of this tool in terms of providing new data and a wide spectrum of novel functionalities. An intuitionally operated web server platform enables very fast user-tailored search of three-dimensional RNA fragments, their multi-parameter conformational analysis and visualization. Description RNA FRABASE 2.0 has stored information on 1565 PDB-deposited RNA structures, including all NMR models. The RNA FRABASE 2.0 search engine algorithms operate on the database of the RNA sequences and the new library of RNA secondary structures, coded in the dot-bracket format extended to hold multi-stranded structures and to cover residues whose coordinates are missing in the PDB files. The library of RNA secondary structures (and their graphics) is made available. A high level of efficiency of the 3D search has been achieved by introducing novel tools to formulate advanced searching patterns and to screen highly populated tertiary structure elements. RNA FRABASE 2.0 also stores data and conformational parameters in order to provide "on the spot" structural filters to explore the three-dimensional RNA structures. An instant visualization of the 3D RNA structures is provided. RNA FRABASE 2.0 is freely available at http://rnafrabase.cs.put.poznan.pl. Conclusions RNA FRABASE 2.0 provides a novel database and powerful search engine which is equipped with new data and functionalities that are unavailable elsewhere. Our intention is that this advanced version of the RNA FRABASE will be of interest to all researchers working in the RNA field. PMID:20459631

RNA FRABASE 2.0: an advanced web-accessible database with the capacity to search the three-dimensional fragments within RNA structures.

PubMed

Popenda, Mariusz; Szachniuk, Marta; Blazewicz, Marek; Wasik, Szymon; Burke, Edmund K; Blazewicz, Jacek; Adamiak, Ryszard W

2010-05-06

Recent discoveries concerning novel functions of RNA, such as RNA interference, have contributed towards the growing importance of the field. In this respect, a deeper knowledge of complex three-dimensional RNA structures is essential to understand their new biological functions. A number of bioinformatic tools have been proposed to explore two major structural databases (PDB, NDB) in order to analyze various aspects of RNA tertiary structures. One of these tools is RNA FRABASE 1.0, the first web-accessible database with an engine for automatic search of 3D fragments within PDB-derived RNA structures. This search is based upon the user-defined RNA secondary structure pattern. In this paper, we present and discuss RNA FRABASE 2.0. This second version of the system represents a major extension of this tool in terms of providing new data and a wide spectrum of novel functionalities. An intuitionally operated web server platform enables very fast user-tailored search of three-dimensional RNA fragments, their multi-parameter conformational analysis and visualization. RNA FRABASE 2.0 has stored information on 1565 PDB-deposited RNA structures, including all NMR models. The RNA FRABASE 2.0 search engine algorithms operate on the database of the RNA sequences and the new library of RNA secondary structures, coded in the dot-bracket format extended to hold multi-stranded structures and to cover residues whose coordinates are missing in the PDB files. The library of RNA secondary structures (and their graphics) is made available. A high level of efficiency of the 3D search has been achieved by introducing novel tools to formulate advanced searching patterns and to screen highly populated tertiary structure elements. RNA FRABASE 2.0 also stores data and conformational parameters in order to provide "on the spot" structural filters to explore the three-dimensional RNA structures. An instant visualization of the 3D RNA structures is provided. RNA FRABASE 2.0 is freely available at http://rnafrabase.cs.put.poznan.pl. RNA FRABASE 2.0 provides a novel database and powerful search engine which is equipped with new data and functionalities that are unavailable elsewhere. Our intention is that this advanced version of the RNA FRABASE will be of interest to all researchers working in the RNA field.
Accelerated probabilistic inference of RNA structure evolution

PubMed Central

Holmes, Ian

2005-01-01

Background Pairwise stochastic context-free grammars (Pair SCFGs) are powerful tools for evolutionary analysis of RNA, including simultaneous RNA sequence alignment and secondary structure prediction, but the associated algorithms are intensive in both CPU and memory usage. The same problem is faced by other RNA alignment-and-folding algorithms based on Sankoff's 1985 algorithm. It is therefore desirable to constrain such algorithms, by pre-processing the sequences and using this first pass to limit the range of structures and/or alignments that can be considered. Results We demonstrate how flexible classes of constraint can be imposed, greatly reducing the computational costs while maintaining a high quality of structural homology prediction. Any score-attributed context-free grammar (e.g. energy-based scoring schemes, or conditionally normalized Pair SCFGs) is amenable to this treatment. It is now possible to combine independent structural and alignment constraints of unprecedented general flexibility in Pair SCFG alignment algorithms. We outline several applications to the bioinformatics of RNA sequence and structure, including Waterman-Eggert N-best alignments and progressive multiple alignment. We evaluate the performance of the algorithm on test examples from the RFAM database. Conclusion A program, Stemloc, that implements these algorithms for efficient RNA sequence alignment and structure prediction is available under the GNU General Public License. PMID:15790387
In silico methods for co-transcriptional RNA secondary structure prediction and for investigating alternative RNA structure expression.

PubMed

Meyer, Irmtraud M

2017-05-01

RNA transcripts are the primary products of active genes in any living organism, including many viruses. Their cellular destiny not only depends on primary sequence signals, but can also be determined by RNA structure. Recent experimental evidence shows that many transcripts can be assigned more than a single functional RNA structure throughout their cellular life and that structure formation happens co-transcriptionally, i.e. as the transcript is synthesised in the cell. Moreover, functional RNA structures are not limited to non-coding transcripts, but can also feature in coding transcripts. The picture that now emerges is that RNA structures constitute an additional layer of information that can be encoded in any RNA transcript (and on top of other layers of information such as protein-context) in order to exert a wide range of functional roles. Moreover, different encoded RNA structures can be expressed at different stages of a transcript's life in order to alter the transcript's behaviour depending on its actual cellular context. Similar to the concept of alternative splicing for protein-coding genes, where a single transcript can yield different proteins depending on cellular context, it is thus appropriate to propose the notion of alternative RNA structure expression for any given transcript. This review introduces several computational strategies that my group developed to detect different aspects of RNA structure expression in vivo. Two aspects are of particular interest to us: (1) RNA secondary structure features that emerge during co-transcriptional folding and (2) functional RNA structure features that are expressed at different times of a transcript's life and potentially mutually exclusive. Copyright © 2017. Published by Elsevier Inc.
Characterising RNA secondary structure space using information entropy

PubMed Central

2013-01-01

Comparative methods for RNA secondary structure prediction use evolutionary information from RNA alignments to increase prediction accuracy. The model is often described in terms of stochastic context-free grammars (SCFGs), which generate a probability distribution over secondary structures. It is, however, unclear how this probability distribution changes as a function of the input alignment. As prediction programs typically only return a single secondary structure, better characterisation of the underlying probability space of RNA secondary structures is of great interest. In this work, we show how to efficiently compute the information entropy of the probability distribution over RNA secondary structures produced for RNA alignments by a phylo-SCFG, and implement it for the PPfold model. We also discuss interpretations and applications of this quantity, including how it can clarify reasons for low prediction reliability scores. PPfold and its source code are available from http://birc.au.dk/software/ppfold/. PMID:23368905
Interspecific variation in mitochondrial serine transfer RNA (UCN) in Euptychiina butterflies (Lepidoptera: Satyrinae): structure and alignment.

PubMed

Marín, Mario Alejandro; López, Andrés; Uribe, Sandra Inés

2012-06-01

The nucleotide variation and structural patterns of mitochondrial RNA molecule have been proposed as useful tools in molecular systematics; however, their usefulness is always subject to a proper assessment of homology in the sequence alignment. The present study describes the secondary structure of mitochondrial tRNA for the amino acid serine (UCN) on 13 Euptychiina species and the evaluation of its potential use for evolutionary studies in this group of butterflies. The secondary structure of tRNAs showed variation among the included species except between Hermeuptychia sp1 and sp2. Variation was concentrated in the ribotimidina-pseudouridine-cystosine (TψC), dihydrouridine (DHU) and variable loops and in the DHU and TψC arms. These results suggest this region as a potential marker useful for taxonomic differentiation of species in this group and also confirm the importance of including information from the secondary structure of tRNA to optimize the alignments.
Site specific incorporation of heavy atom-containing unnatural amino acids into proteins for structure determination

DOEpatents

Xie, Jianming [San Diego, CA; Wang, Lei [San Diego, CA; Wu, Ning [Boston, MA; Schultz, Peter G [La Jolla, CA

2008-07-15

Translation systems and other compositions including orthogonal aminoacyl tRNA-synthetases that preferentially charge an orthogonal tRNA with an iodinated or brominated amino acid are provided. Nucleic acids encoding such synthetases are also described, as are methods and kits for producing proteins including heavy atom-containing amino acids, e.g., brominated or iodinated amino acids. Methods of determining the structure of a protein, e.g., a protein into which a heavy atom has been site-specifically incorporated through use of an orthogonal tRNA/aminoacyl tRNA-synthetase pair, are also described.
Retroviruses: Gaining an Understanding.

ERIC Educational Resources Information Center

DiSpezio, Michael A.

1990-01-01

Contrasted are DNA viruses, RNA viruses, and RNA retroviruses. The structure, genome, and replication of retroviruses are discussed. The discovery, structure, and action of the HIV virus are described. A list of 17 references is included. (CW)
Nanomanipulation of Single RNA Molecules by Optical Tweezers

PubMed Central

Stephenson, William; Wan, Gorby; Tenenbaum, Scott A.; Li, Pan T. X.

2014-01-01

A large portion of the human genome is transcribed but not translated. In this post genomic era, regulatory functions of RNA have been shown to be increasingly important. As RNA function often depends on its ability to adopt alternative structures, it is difficult to predict RNA three-dimensional structures directly from sequence. Single-molecule approaches show potentials to solve the problem of RNA structural polymorphism by monitoring molecular structures one molecule at a time. This work presents a method to precisely manipulate the folding and structure of single RNA molecules using optical tweezers. First, methods to synthesize molecules suitable for single-molecule mechanical work are described. Next, various calibration procedures to ensure the proper operations of the optical tweezers are discussed. Next, various experiments are explained. To demonstrate the utility of the technique, results of mechanically unfolding RNA hairpins and a single RNA kissing complex are used as evidence. In these examples, the nanomanipulation technique was used to study folding of each structural domain, including secondary and tertiary, independently. Lastly, the limitations and future applications of the method are discussed. PMID:25177917
The impact of CRISPR repeat sequence on structures of a Cas6 protein-RNA complex

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wang, Ruiying; Zheng, Han; Preamplume, Gan

The repeat-associated mysterious proteins (RAMPs) comprise the most abundant family of proteins involved in prokaryotic immunity against invading genetic elements conferred by the clustered regularly interspaced short palindromic repeat (CRISPR) system. Cas6 is one of the first characterized RAMP proteins and is a key enzyme required for CRISPR RNA maturation. Despite a strong structural homology with other RAMP proteins that bind hairpin RNA, Cas6 distinctly recognizes single-stranded RNA. Previous structural and biochemical studies show that Cas6 captures the 5' end while cleaving the 3' end of the CRISPR RNA. Here, we describe three structures and complementary biochemical analysis of amore » noncatalytic Cas6 homolog from Pyrococcus horikoshii bound to CRISPR repeat RNA of different sequences. Our study confirms the specificity of the Cas6 protein for single-stranded RNA and further reveals the importance of the bases at Positions 5-7 in Cas6-RNA interactions. Substitutions of these bases result in structural changes in the protein-RNA complex including its oligomerization state.« less
RACER a Coarse-Grained RNA Model for Capturing Folding Free Energy in Molecular Dynamics Simulations

NASA Astrophysics Data System (ADS)

Cheng, Sara; Bell, David; Ren, Pengyu

RACER is a coarse-grained RNA model that can be used in molecular dynamics simulations to predict native structures and sequence-specific variation of free energy of various RNA structures. RACER is capable of accurate prediction of native structures of duplexes and hairpins (average RMSD of 4.15 angstroms), and RACER can capture sequence-specific variation of free energy in excellent agreement with experimentally measured stabilities (r-squared =0.98). The RACER model implements a new effective non-bonded potential and re-parameterization of hydrogen bond and Debye-Huckel potentials. Insights from the RACER model include the importance of treating pairing and stacking interactions separately in order to distinguish folded an unfolded states and identification of hydrogen-bonding, base stacking, and electrostatic interactions as essential driving forces for RNA folding. Future applications of the RACER model include predicting free energy landscapes of more complex RNA structures and use of RACER for multiscale simulations.
A complex structure in the mRNA of Tf1 is recognized and cleaved to generate the primer of reverse transcription.

PubMed

Lin, J H; Levin, H L

1997-01-15

All retroviruses and LTR-containing retrotransposons are thought to require specific tRNA molecules to serve as primers of reverse transcription. An exception is the LTR-containing retrotransposon Tf1, isolated from Schizosaccharomyces pombe. Instead of requiring a tRNA, the reverse transcriptase of Tf1 uses the first 11 bases of the Tf1 transcript as the primer for reverse transcription. The primer is generated by a cleavage that occurs between bases 11 and 12 of the Tf1 mRNA. Sequence analysis of the 5' untranslated region of the Tf1 mRNA resulted in the identification of a region with the potential to form an RNA structure of 89 bases that included the primer binding site and the first 11 bases of the Tf1 mRNA. Systematic mutagenesis of this region revealed 34 single-point mutants in the structure that resulted in reduced transposition activity. The defects in transposition correlated with reduced level of Tf1 reverse transcripts as determined by DNA blot analysis. Evidence that the RNA structure did form in vivo included the result that strains with second site mutations that restored complementarity resulted in increased levels of reverse transcripts and Tf1 transposition. The majority of the mutants defective for reverse transcription were unable to cleave the Tf1 mRNA between bases 11 and 12. These data indicate that formation of an extensive RNA structure was required for the cleavage reaction that generated the primer for Tf1 reverse transcription.
T7-RNA Polymerase

NASA Technical Reports Server (NTRS)

1997-01-01

T7-RNA Polymerase grown on STS-81. Structure-Function Relationships of RNA Polymerase: DNA-dependent RNA polymerase is the key enzyme responsible for the biosynthesis of RNA, a process known as transcription. Principal Investigator's include Dr. Dan Carter, Dr. B.C. Wang, and Dr. John Rose of New Century Pharmaceuticals.
Thermodynamic heuristics with case-based reasoning: combined insights for RNA pseudoknot secondary structure.

PubMed

Al-Khatib, Ra'ed M; Rashid, Nur'Aini Abdul; Abdullah, Rosni

2011-08-01

The secondary structure of RNA pseudoknots has been extensively inferred and scrutinized by computational approaches. Experimental methods for determining RNA structure are time consuming and tedious; therefore, predictive computational approaches are required. Predicting the most accurate and energy-stable pseudoknot RNA secondary structure has been proven to be an NP-hard problem. In this paper, a new RNA folding approach, termed MSeeker, is presented; it includes KnotSeeker (a heuristic method) and Mfold (a thermodynamic algorithm). The global optimization of this thermodynamic heuristic approach was further enhanced by using a case-based reasoning technique as a local optimization method. MSeeker is a proposed algorithm for predicting RNA pseudoknot structure from individual sequences, especially long ones. This research demonstrates that MSeeker improves the sensitivity and specificity of existing RNA pseudoknot structure predictions. The performance and structural results from this proposed method were evaluated against seven other state-of-the-art pseudoknot prediction methods. The MSeeker method had better sensitivity than the DotKnot, FlexStem, HotKnots, pknotsRG, ILM, NUPACK and pknotsRE methods, with 79% of the predicted pseudoknot base-pairs being correct.
R3D Align web server for global nucleotide to nucleotide alignments of RNA 3D structures.

PubMed

Rahrig, Ryan R; Petrov, Anton I; Leontis, Neocles B; Zirbel, Craig L

2013-07-01

The R3D Align web server provides online access to 'RNA 3D Align' (R3D Align), a method for producing accurate nucleotide-level structural alignments of RNA 3D structures. The web server provides a streamlined and intuitive interface, input data validation and output that is more extensive and easier to read and interpret than related servers. The R3D Align web server offers a unique Gallery of Featured Alignments, providing immediate access to pre-computed alignments of large RNA 3D structures, including all ribosomal RNAs, as well as guidance on effective use of the server and interpretation of the output. By accessing the non-redundant lists of RNA 3D structures provided by the Bowling Green State University RNA group, R3D Align connects users to structure files in the same equivalence class and the best-modeled representative structure from each group. The R3D Align web server is freely accessible at http://rna.bgsu.edu/r3dalign/.
Crystal structure of Zika virus NS5 RNA-dependent RNA polymerase.

PubMed

Godoy, Andre S; Lima, Gustavo M A; Oliveira, Ketllyn I Z; Torres, Naiara U; Maluf, Fernando V; Guido, Rafael V C; Oliva, Glaucius

2017-03-27

The current Zika virus (ZIKV) outbreak became a global health threat of complex epidemiology and devastating neurological impacts, therefore requiring urgent efforts towards the development of novel efficacious and safe antiviral drugs. Due to its central role in RNA viral replication, the non-structural protein 5 (NS5) RNA-dependent RNA-polymerase (RdRp) is a prime target for drug discovery. Here we describe the crystal structure of the recombinant ZIKV NS5 RdRp domain at 1.9 Å resolution as a platform for structure-based drug design strategy. The overall structure is similar to other flaviviral homologues. However, the priming loop target site, which is suitable for non-nucleoside polymerase inhibitor design, shows significant differences in comparison with the dengue virus structures, including a tighter pocket and a modified local charge distribution.
RNA-TVcurve: a Web server for RNA secondary structure comparison based on a multi-scale similarity of its triple vector curve representation.

PubMed

Li, Ying; Shi, Xiaohu; Liang, Yanchun; Xie, Juan; Zhang, Yu; Ma, Qin

2017-01-21

RNAs have been found to carry diverse functionalities in nature. Inferring the similarity between two given RNAs is a fundamental step to understand and interpret their functional relationship. The majority of functional RNAs show conserved secondary structures, rather than sequence conservation. Those algorithms relying on sequence-based features usually have limitations in their prediction performance. Hence, integrating RNA structure features is very critical for RNA analysis. Existing algorithms mainly fall into two categories: alignment-based and alignment-free. The alignment-free algorithms of RNA comparison usually have lower time complexity than alignment-based algorithms. An alignment-free RNA comparison algorithm was proposed, in which novel numerical representations RNA-TVcurve (triple vector curve representation) of RNA sequence and corresponding secondary structure features are provided. Then a multi-scale similarity score of two given RNAs was designed based on wavelet decomposition of their numerical representation. In support of RNA mutation and phylogenetic analysis, a web server (RNA-TVcurve) was designed based on this alignment-free RNA comparison algorithm. It provides three functional modules: 1) visualization of numerical representation of RNA secondary structure; 2) detection of single-point mutation based on secondary structure; and 3) comparison of pairwise and multiple RNA secondary structures. The inputs of the web server require RNA primary sequences, while corresponding secondary structures are optional. For the primary sequences alone, the web server can compute the secondary structures using free energy minimization algorithm in terms of RNAfold tool from Vienna RNA package. RNA-TVcurve is the first integrated web server, based on an alignment-free method, to deliver a suite of RNA analysis functions, including visualization, mutation analysis and multiple RNAs structure comparison. The comparison results with two popular RNA comparison tools, RNApdist and RNAdistance, showcased that RNA-TVcurve can efficiently capture subtle relationships among RNAs for mutation detection and non-coding RNA classification. All the relevant results were shown in an intuitive graphical manner, and can be freely downloaded from this server. RNA-TVcurve, along with test examples and detailed documents, are available at: http://ml.jlu.edu.cn/tvcurve/ .
A novel knowledge-based potential for RNA 3D structure evaluation

NASA Astrophysics Data System (ADS)

Yang, Yi; Gu, Qi; Zhang, Ben-Gong; Shi, Ya-Zhou; Shao, Zhi-Gang

2018-03-01

Ribonucleic acids (RNAs) play a vital role in biology, and knowledge of their three-dimensional (3D) structure is required to understand their biological functions. Recently structural prediction methods have been developed to address this issue, but a series of RNA 3D structures are generally predicted by most existing methods. Therefore, the evaluation of the predicted structures is generally indispensable. Although several methods have been proposed to assess RNA 3D structures, the existing methods are not precise enough. In this work, a new all-atom knowledge-based potential is developed for more accurately evaluating RNA 3D structures. The potential not only includes local and nonlocal interactions but also fully considers the specificity of each RNA by introducing a retraining mechanism. Based on extensive test sets generated from independent methods, the proposed potential correctly distinguished the native state and ranked near-native conformations to effectively select the best. Furthermore, the proposed potential precisely captured RNA structural features such as base-stacking and base-pairing. Comparisons with existing potential methods show that the proposed potential is very reliable and accurate in RNA 3D structure evaluation. Project supported by the National Science Foundation of China (Grants Nos. 11605125, 11105054, 11274124, and 11401448).
The identification and functional annotation of RNA structures conserved in vertebrates

PubMed Central

Seemann, Stefan E.; Mirza, Aashiq H.; Hansen, Claus; Bang-Berthelsen, Claus H.; Garde, Christian; Christensen-Dalsgaard, Mikkel; Torarinsson, Elfar; Yao, Zizhen; Workman, Christopher T.; Pociot, Flemming; Nielsen, Henrik; Tommerup, Niels; Ruzzo, Walter L.; Gorodkin, Jan

2017-01-01

Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization, and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for conserved RNA structures (CRSs), leveraging structure-based, rather than sequence-based, alignments. After careful correction for sequence identity and GC content, we predict ∼516,000 human genomic regions containing CRSs. We find that a substantial fraction of human–mouse CRS regions (1) colocalize consistently with binding sites of the same RNA binding proteins (RBPs) or (2) are transcribed in corresponding tissues. Additionally, a CaptureSeq experiment revealed expression of many of our CRS regions in human fetal brain, including 662 novel ones. For selected human and mouse candidate pairs, qRT-PCR and in vitro RNA structure probing supported both shared expression and shared structure despite low abundance and low sequence identity. About 30,000 CRS regions are located near coding or long noncoding RNA genes or within enhancers. Structured (CRS overlapping) enhancer RNAs and extended 3′ ends have significantly increased expression levels over their nonstructured counterparts. Our findings of transcribed uncharacterized regulatory regions that contain CRSs support their RNA-mediated functionality. PMID:28487280
A parallel implementation of the Wuchty algorithm with additional experimental filters to more thoroughly explore RNA conformational space.

PubMed

Stone, Jonathan W; Bleckley, Samuel; Lavelle, Sean; Schroeder, Susan J

2015-01-01

We present new modifications to the Wuchty algorithm in order to better define and explore possible conformations for an RNA sequence. The new features, including parallelization, energy-independent lonely pair constraints, context-dependent chemical probing constraints, helix filters, and optional multibranch loops, provide useful tools for exploring the landscape of RNA folding. Chemical probing alone may not necessarily define a single unique structure. The helix filters and optional multibranch loops are global constraints on RNA structure that are an especially useful tool for generating models of encapsidated viral RNA for which cryoelectron microscopy or crystallography data may be available. The computations generate a combinatorially complete set of structures near a free energy minimum and thus provide data on the density and diversity of structures near the bottom of a folding funnel for an RNA sequence. The conformational landscapes for some RNA sequences may resemble a low, wide basin rather than a steep funnel that converges to a single structure.
TBI server: a web server for predicting ion effects in RNA folding.

PubMed

Zhu, Yuhong; He, Zhaojian; Chen, Shi-Jie

2015-01-01

Metal ions play a critical role in the stabilization of RNA structures. Therefore, accurate prediction of the ion effects in RNA folding can have a far-reaching impact on our understanding of RNA structure and function. Multivalent ions, especially Mg²⁺, are essential for RNA tertiary structure formation. These ions can possibly become strongly correlated in the close vicinity of RNA surface. Most of the currently available software packages, which have widespread success in predicting ion effects in biomolecular systems, however, do not explicitly account for the ion correlation effect. Therefore, it is important to develop a software package/web server for the prediction of ion electrostatics in RNA folding by including ion correlation effects. The TBI web server http://rna.physics.missouri.edu/tbi_index.html provides predictions for the total electrostatic free energy, the different free energy components, and the mean number and the most probable distributions of the bound ions. A novel feature of the TBI server is its ability to account for ion correlation and ion distribution fluctuation effects. By accounting for the ion correlation and fluctuation effects, the TBI server is a unique online tool for computing ion-mediated electrostatic properties for given RNA structures. The results can provide important data for in-depth analysis for ion effects in RNA folding including the ion-dependence of folding stability, ion uptake in the folding process, and the interplay between the different energetic components.

Structural landscape of base pairs containing post-transcriptional modifications in RNA

PubMed Central

Seelam, Preethi P.; Sharma, Purshotam

2017-01-01

Base pairs involving post-transcriptionally modified nucleobases are believed to play important roles in a wide variety of functional RNAs. Here we present our attempts toward understanding the structural and functional role of naturally occurring modified base pairs using a combination of X-ray crystal structure database analysis, sequence analysis, and advanced quantum chemical methods. Our bioinformatics analysis reveals that despite their presence in all major secondary structural elements, modified base pairs are most prevalent in tRNA crystal structures and most commonly involve guanine or uridine modifications. Further, analysis of tRNA sequences reveals additional examples of modified base pairs at structurally conserved tRNA regions and highlights the conservation patterns of these base pairs in three domains of life. Comparison of structures and binding energies of modified base pairs with their unmodified counterparts, using quantum chemical methods, allowed us to classify the base modifications in terms of the nature of their electronic structure effects on base-pairing. Analysis of specific structural contexts of modified base pairs in RNA crystal structures revealed several interesting scenarios, including those at the tRNA:rRNA interface, antibiotic-binding sites on the ribosome, and the three-way junctions within tRNA. These scenarios, when analyzed in the context of available experimental data, allowed us to correlate the occurrence and strength of modified base pairs with their specific functional roles. Overall, our study highlights the structural importance of modified base pairs in RNA and points toward the need for greater appreciation of the role of modified bases and their interactions, in the context of many biological processes involving RNA. PMID:28341704
Blind prediction of noncanonical RNA structure at atomic accuracy.

PubMed

Watkins, Andrew M; Geniesse, Caleb; Kladwang, Wipapat; Zakrevsky, Paul; Jaeger, Luc; Das, Rhiju

2018-05-01

Prediction of RNA structure from nucleotide sequence remains an unsolved grand challenge of biochemistry and requires distinct concepts from protein structure prediction. Despite extensive algorithmic development in recent years, modeling of noncanonical base pairs of new RNA structural motifs has not been achieved in blind challenges. We report a stepwise Monte Carlo (SWM) method with a unique add-and-delete move set that enables predictions of noncanonical base pairs of complex RNA structures. A benchmark of 82 diverse motifs establishes the method's general ability to recover noncanonical pairs ab initio, including multistrand motifs that have been refractory to prior approaches. In a blind challenge, SWM models predicted nucleotide-resolution chemical mapping and compensatory mutagenesis experiments for three in vitro selected tetraloop/receptors with previously unsolved structures (C7.2, C7.10, and R1). As a final test, SWM blindly and correctly predicted all noncanonical pairs of a Zika virus double pseudoknot during a recent community-wide RNA-Puzzle. Stepwise structure formation, as encoded in the SWM method, enables modeling of noncanonical RNA structure in a variety of previously intractable problems.
Principles for Predicting RNA Secondary Structure Design Difficulty.

PubMed

Anderson-Lee, Jeff; Fisker, Eli; Kosaraju, Vineet; Wu, Michelle; Kong, Justin; Lee, Jeehyung; Lee, Minjae; Zada, Mathew; Treuille, Adrien; Das, Rhiju

2016-02-27

Designing RNAs that form specific secondary structures is enabling better understanding and control of living systems through RNA-guided silencing, genome editing and protein organization. Little is known, however, about which RNA secondary structures might be tractable for downstream sequence design, increasing the time and expense of design efforts due to inefficient secondary structure choices. Here, we present insights into specific structural features that increase the difficulty of finding sequences that fold into a target RNA secondary structure, summarizing the design efforts of tens of thousands of human participants and three automated algorithms (RNAInverse, INFO-RNA and RNA-SSD) in the Eterna massive open laboratory. Subsequent tests through three independent RNA design algorithms (NUPACK, DSS-Opt and MODENA) confirmed the hypothesized importance of several features in determining design difficulty, including sequence length, mean stem length, symmetry and specific difficult-to-design motifs such as zigzags. Based on these results, we have compiled an Eterna100 benchmark of 100 secondary structure design challenges that span a large range in design difficulty to help test future efforts. Our in silico results suggest new routes for improving computational RNA design methods and for extending these insights to assess "designability" of single RNA structures, as well as of switches for in vitro and in vivo applications. Copyright © 2015 The Authors. Published by Elsevier Ltd.. All rights reserved.
RNA design using simulated SHAPE data.

PubMed

Lotfi, Mohadeseh; Zare-Mirakabad, Fatemeh; Montaseri, Soheila

2018-05-03

It has long been established that in addition to being involved in protein translation, RNA plays essential roles in numerous other cellular processes, including gene regulation and DNA replication. Such roles are known to be dictated by higher-order structures of RNA molecules. It is therefore of prime importance to find an RNA sequence that can fold to acquire a particular function that is desirable for use in pharmaceuticals and basic research. The challenge of finding an RNA sequence for a given structure is known as the RNA design problem. Although there are several algorithms to solve this problem, they mainly consider hard constraints, such as minimum free energy, to evaluate the predicted sequences. Recently, SHAPE data has emerged as a new soft constraint for RNA secondary structure prediction. To take advantage of this new experimental constraint, we report here a new method for accurate design of RNA sequences based on their secondary structures using SHAPE data as pseudo-free energy. We then compare our algorithm with four others: INFO-RNA, ERD, MODENA and RNAifold 2.0. Our algorithm precisely predicts 26 out of 29 new sequences for the structures extracted from the Rfam dataset, while the other four algorithms predict no more than 22 out of 29. The proposed algorithm is comparable to the above algorithms on RNA-SSD datasets, where they can predict up to 33 appropriate sequences for RNA secondary structures out of 34.
Domain structure of the ribozyme from eubacterial ribonuclease P.

PubMed Central

Loria, A; Pan, T

1996-01-01

Large RNAs can be composed of discrete domains that fold independently. One such "folding domain" has been identified previously in the ribozyme from Bacillus subtilis ribonuclease P (denoted P RNA). This domain contains roughly one-third of all residues. Folding of an RNA construct consisting of the remaining two-thirds of B. subtilis P RNA was examined by Fe(II)-EDTA hydroxyl radical protection. This molecule folds into the proper higher-order structure under identical conditions as the full-length P RNA, suggesting the presence of a second folding domain in B. subtilis P RNA. Folding analysis of the Escherichia coli P RNA by hydroxyl radical protection shows that this P RNA is completely folded at 5-6 mM Mg2+. In order to analyze the structural organization of folding domains in E. coli P RNA, constructs were designed based on the domain structure of B. subtilis P RNA. Fe(II)-EDTA protection indicates that E. coli P RNA also contains two folding domains. Despite the significant differences at the secondary structure level, both P RNAs appear to converge structurally at the folding domain level. The pre-tRNA substrate, localized in previous studies, may bind across the folding domains with the acceptor stem/3'CCA contacting the domain including the active site and the T stem-loop contacting the other. Because all eubacterial P RNAs share considerable homology in secondary structure to either B. subtilis or E. coli P RNA, these results suggest that this domain structure may be applicable for most, if not all, eubacterial P RNAs. Identification of folding domains should be valuable in dissecting structure-function relationship of large RNAs. PMID:8718684
RNA G-quadruplexes: emerging mechanisms in disease

PubMed Central

Cammas, Anne

2017-01-01

Abstract RNA G-quadruplexes (G4s) are formed by G-rich RNA sequences in protein-coding (mRNA) and non-coding (ncRNA) transcripts that fold into a four-stranded conformation. Experimental studies and bioinformatic predictions support the view that these structures are involved in different cellular functions associated to both DNA processes (telomere elongation, recombination and transcription) and RNA post-transcriptional mechanisms (including pre-mRNA processing, mRNA turnover, targeting and translation). An increasing number of different diseases have been associated with the inappropriate regulation of RNA G4s exemplifying the potential importance of these structures on human health. Here, we review the different molecular mechanisms underlying the link between RNA G4s and human diseases by proposing several overlapping models of deregulation emerging from recent research, including (i) sequestration of RNA-binding proteins, (ii) aberrant expression or localization of RNA G4-binding proteins, (iii) repeat associated non-AUG (RAN) translation, (iv) mRNA translational blockade and (v) disabling of protein–RNA G4 complexes. This review also provides a comprehensive survey of the functional RNA G4 and their mechanisms of action. Finally, we highlight future directions for research aimed at improving our understanding on RNA G4-mediated regulatory mechanisms linked to diseases. PMID:28013268
Guide-bound structures of an RNA-targeting A-cleaving CRISPR–Cas13a enzyme

DOE Office of Scientific and Technical Information (OSTI.GOV)

Knott, Gavin J.; East-Seletsky, Alexandra; Cofsky, Joshua C.

CRISPR adaptive immune systems protect bacteria from infections by deploying CRISPR RNA (crRNA)-guided enzymes to recognize and cut foreign nucleic acids. Type VI-A CRISPR–Cas systems include the Cas13a enzyme, an RNA-activated RNase capable of crRNA processing and single-stranded RNA degradation upon target-transcript binding. Here we present the 2.0-Å resolution crystal structure of a crRNA-bound Lachnospiraceae bacterium Cas13a (LbaCas13a), representing a recently discovered Cas13a enzyme subtype. This structure and accompanying biochemical experiments define the Cas13a catalytic residues that are directly responsible for crRNA maturation. In addition, the orientation of the foreign-derived target-RNA-specifying sequence in the protein interior explains the conformational gatingmore » of Cas13a nuclease activation. These results describe how Cas13a enzymes generate functional crRNAs and how catalytic activity is blocked before target-RNA recognition, with implications for both bacterial immunity and diagnostic applications.« less
Guide-bound structures of an RNA-targeting A-cleaving CRISPR–Cas13a enzyme

DOE PAGES

Knott, Gavin J.; East-Seletsky, Alexandra; Cofsky, Joshua C.; ...

2017-09-11

CRISPR adaptive immune systems protect bacteria from infections by deploying CRISPR RNA (crRNA)-guided enzymes to recognize and cut foreign nucleic acids. Type VI-A CRISPR–Cas systems include the Cas13a enzyme, an RNA-activated RNase capable of crRNA processing and single-stranded RNA degradation upon target-transcript binding. Here we present the 2.0-Å resolution crystal structure of a crRNA-bound Lachnospiraceae bacterium Cas13a (LbaCas13a), representing a recently discovered Cas13a enzyme subtype. This structure and accompanying biochemical experiments define the Cas13a catalytic residues that are directly responsible for crRNA maturation. In addition, the orientation of the foreign-derived target-RNA-specifying sequence in the protein interior explains the conformational gatingmore » of Cas13a nuclease activation. These results describe how Cas13a enzymes generate functional crRNAs and how catalytic activity is blocked before target-RNA recognition, with implications for both bacterial immunity and diagnostic applications.« less
CompaRNA: a server for continuous benchmarking of automated methods for RNA secondary structure prediction

PubMed Central

Puton, Tomasz; Kozlowski, Lukasz P.; Rother, Kristian M.; Bujnicki, Janusz M.

2013-01-01

We present a continuous benchmarking approach for the assessment of RNA secondary structure prediction methods implemented in the CompaRNA web server. As of 3 October 2012, the performance of 28 single-sequence and 13 comparative methods has been evaluated on RNA sequences/structures released weekly by the Protein Data Bank. We also provide a static benchmark generated on RNA 2D structures derived from the RNAstrand database. Benchmarks on both data sets offer insight into the relative performance of RNA secondary structure prediction methods on RNAs of different size and with respect to different types of structure. According to our tests, on the average, the most accurate predictions obtained by a comparative approach are generated by CentroidAlifold, MXScarna, RNAalifold and TurboFold. On the average, the most accurate predictions obtained by single-sequence analyses are generated by CentroidFold, ContextFold and IPknot. The best comparative methods typically outperform the best single-sequence methods if an alignment of homologous RNA sequences is available. This article presents the results of our benchmarks as of 3 October 2012, whereas the rankings presented online are continuously updated. We will gladly include new prediction methods and new measures of accuracy in the new editions of CompaRNA benchmarks. PMID:23435231
Towards the discovery of drug-like RNA ligands?

PubMed

Foloppe, Nicolas; Matassova, Natalia; Aboul-Ela, Fareed

2006-11-01

Targeting RNA with small molecule drugs is an area of great potential for therapeutic treatment of infections and possibly genetic and autoimmune diseases. However, a mature set of precedents and established methodology is lacking. The physicochemical properties of RNA raise specific issues and obstacles to development, and contribute to explain the distinct characteristics of natural RNA ligands, including antibiotics. Yet, RNA-targeting strategies are being implemented to reinvigorate antibacterial discovery by using the ribosomal X-ray structures to modify known antibiotics. To exploit further these structures, we suggest the use of existing protein kinase-directed libraries of drug-like compounds to target the A-site of the bacterial ribosome, on the basis of a specific structural hypothesis.
RNase MRP cleaves pre-tRNASer-Met in the tRNA maturation pathway.

PubMed

Saito, Yuichiro; Takeda, Jun; Adachi, Kousuke; Nobe, Yuko; Kobayashi, Junya; Hirota, Kouji; Oliveira, Douglas V; Taoka, Masato; Isobe, Toshiaki

2014-01-01

Ribonuclease mitochondrial RNA processing (RNase MRP) is a multifunctional ribonucleoprotein (RNP) complex that is involved in the maturation of various types of RNA including ribosomal RNA. RNase MRP consists of a potential catalytic RNA and several protein components, all of which are required for cell viability. We show here that the temperature-sensitive mutant of rmp1, the gene for a unique protein component of RNase MRP, accumulates the dimeric tRNA precursor, pre-tRNA(Ser-Met). To examine whether RNase MRP mediates tRNA maturation, we purified the RNase MRP holoenzyme from the fission yeast Schizosaccharomyces pombe and found that the enzyme directly and selectively cleaves pre-tRNA(Ser-Met), suggesting that RNase MRP participates in the maturation of specific tRNA in vivo. In addition, mass spectrometry-based ribonucleoproteomic analysis demonstrated that this RNase MRP consists of one RNA molecule and 11 protein components, including a previously unknown component Rpl701. Notably, limited nucleolysis of RNase MRP generated an active catalytic core consisting of partial mrp1 RNA fragments, which constitute "Domain 1" in the secondary structure of RNase MRP, and 8 proteins. Thus, the present study provides new insight into the structure and function of RNase MRP.
An Adaptive Defect Weighted Sampling Algorithm to Design Pseudoknotted RNA Secondary Structures

PubMed Central

Zandi, Kasra; Butler, Gregory; Kharma, Nawwaf

2016-01-01

Computational design of RNA sequences that fold into targeted secondary structures has many applications in biomedicine, nanotechnology and synthetic biology. An RNA molecule is made of different types of secondary structure elements and an important RNA element named pseudoknot plays a key role in stabilizing the functional form of the molecule. However, due to the computational complexities associated with characterizing pseudoknotted RNA structures, most of the existing RNA sequence designer algorithms generally ignore this important structural element and therefore limit their applications. In this paper we present a new algorithm to design RNA sequences for pseudoknotted secondary structures. We use NUPACK as the folding algorithm to compute the equilibrium characteristics of the pseudoknotted RNAs, and describe a new adaptive defect weighted sampling algorithm named Enzymer to design low ensemble defect RNA sequences for targeted secondary structures including pseudoknots. We used a biological data set of 201 pseudoknotted structures from the Pseudobase library to benchmark the performance of our algorithm. We compared the quality characteristics of the RNA sequences we designed by Enzymer with the results obtained from the state of the art MODENA and antaRNA. Our results show our method succeeds more frequently than MODENA and antaRNA do, and generates sequences that have lower ensemble defect, lower probability defect and higher thermostability. Finally by using Enzymer and by constraining the design to a naturally occurring and highly conserved Hammerhead motif, we designed 8 sequences for a pseudoknotted cis-acting Hammerhead ribozyme. Enzymer is available for download at https://bitbucket.org/casraz/enzymer. PMID:27499762
The identification and functional annotation of RNA structures conserved in vertebrates.

PubMed

Seemann, Stefan E; Mirza, Aashiq H; Hansen, Claus; Bang-Berthelsen, Claus H; Garde, Christian; Christensen-Dalsgaard, Mikkel; Torarinsson, Elfar; Yao, Zizhen; Workman, Christopher T; Pociot, Flemming; Nielsen, Henrik; Tommerup, Niels; Ruzzo, Walter L; Gorodkin, Jan

2017-08-01

Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization, and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for conserved RNA structures (CRSs), leveraging structure-based, rather than sequence-based, alignments. After careful correction for sequence identity and GC content, we predict ∼516,000 human genomic regions containing CRSs. We find that a substantial fraction of human-mouse CRS regions (1) colocalize consistently with binding sites of the same RNA binding proteins (RBPs) or (2) are transcribed in corresponding tissues. Additionally, a CaptureSeq experiment revealed expression of many of our CRS regions in human fetal brain, including 662 novel ones. For selected human and mouse candidate pairs, qRT-PCR and in vitro RNA structure probing supported both shared expression and shared structure despite low abundance and low sequence identity. About 30,000 CRS regions are located near coding or long noncoding RNA genes or within enhancers. Structured (CRS overlapping) enhancer RNAs and extended 3' ends have significantly increased expression levels over their nonstructured counterparts. Our findings of transcribed uncharacterized regulatory regions that contain CRSs support their RNA-mediated functionality. © 2017 Seemann et al.; Published by Cold Spring Harbor Laboratory Press.
The bifurcated stem loop 4 (SL4) is crucial for efficient packaging of mouse mammary tumor virus (MMTV) genomic RNA.

PubMed

Mustafa, Farah; Vivet-Boudou, Valérie; Jabeen, Ayesha; Ali, Lizna M; Kalloush, Rawan M; Marquet, Roland; Rizvi, Tahir A

2018-06-21

Packaging the mouse mammary tumor virus (MMTV) genomic RNA (gRNA) requires the entire 5' untranslated region (UTR) in conjunction with the first 120 nucleotides of the gag gene. This region includes several palindromic (pal) sequence(s) and stable stem loops (SLs). Among these, stem loop 4 (SL4) adopts a bifurcated structure consisting of three stems, two apical loops, and an internal loop. Pal II, located in one of the apical loops, mediates gRNA dimerization, a process intricately linked to packaging. We thus hypothesized that the bifurcated SL4 structure could constitute the major gRNA packaging determinant. To test this hypothesis, the two apical loops and the flanking sequences forming the bifurcated SL4 were individually mutated. These mutations all had deleterious effects on gRNA packaging and propagation. Next, single and compensatory mutants were designed to destabilize then recreate the bifurcated SL4 structure. A structure-function analysis using bioinformatics predictions and RNA chemical probing revealed that mutations that led to the loss of the SL4 bifurcated structure abrogated RNA packaging and propagation, while compensatory mutations that recreated the native SL4 structure restored RNA packaging and propagation to wild type levels. Altogether, our results demonstrate that SL4 constitutes the principal packaging determinant of MMTV gRNA. Our findings further suggest that SL4 acts as a structural switch that can not only differentiate between RNA for translation versus packaging/dimerization, but its location also allows differentiation between spliced and unspliced RNAs during gRNA encapsidation.
Thermodynamic stability of RNA structures formed by CNG trinucleotide repeats. Implication for prediction of RNA structure.

PubMed

Broda, Magdalena; Kierzek, Elzbieta; Gdaniec, Zofia; Kulinski, Tadeusz; Kierzek, Ryszard

2005-08-16

Trinucleotide repeat expansion diseases (TREDs) are correlated with elongation of CNG DNA and RNA repeats to pathological level. This paper shows, for the first time, complete data concerning thermodynamic stabilities of RNA with CNG trinucleotide repeats. Our studies include the stability of oligoribonucleotides composed of two to seven of CAG, CCG, CGG, and CUG repeats. The thermodynamic parameters of helix propagation correlated with the presence of multiple N-N mismatches within CNG RNA duplexes were also determined. Moreover, the total stability of CNG RNA hairpins, as well as the contribution of trinucleotide repeats placed only in the stem or loop regions, was evaluated. The improved thermodynamic parameters allow to predict much more accurately the thermodynamic stabilities and structures of CNG RNAs.
Accurate SHAPE-directed RNA secondary structure modeling, including pseudoknots.

PubMed

Hajdin, Christine E; Bellaousov, Stanislav; Huggins, Wayne; Leonard, Christopher W; Mathews, David H; Weeks, Kevin M

2013-04-02

A pseudoknot forms in an RNA when nucleotides in a loop pair with a region outside the helices that close the loop. Pseudoknots occur relatively rarely in RNA but are highly overrepresented in functionally critical motifs in large catalytic RNAs, in riboswitches, and in regulatory elements of viruses. Pseudoknots are usually excluded from RNA structure prediction algorithms. When included, these pairings are difficult to model accurately, especially in large RNAs, because allowing this structure dramatically increases the number of possible incorrect folds and because it is difficult to search the fold space for an optimal structure. We have developed a concise secondary structure modeling approach that combines SHAPE (selective 2'-hydroxyl acylation analyzed by primer extension) experimental chemical probing information and a simple, but robust, energy model for the entropic cost of single pseudoknot formation. Structures are predicted with iterative refinement, using a dynamic programming algorithm. This melded experimental and thermodynamic energy function predicted the secondary structures and the pseudoknots for a set of 21 challenging RNAs of known structure ranging in size from 34 to 530 nt. On average, 93% of known base pairs were predicted, and all pseudoknots in well-folded RNAs were identified.
Chemical Approaches for Structure and Function of RNA in Postgenomic Era

PubMed Central

Ro-Choi, Tae Suk; Choi, Yong Chun

2012-01-01

In the study of cellular RNA chemistry, a major thrust of research focused upon sequence determinations for decades. Structures of snRNAs (4.5S RNA I (Alu), U1, U2, U3, U4, U5, and U6) were determined at Baylor College of Medicine, Houston, Tex, in an earlier time of pregenomic era. They show novel modifications including base methylation, sugar methylation, 5′-cap structures (types 0–III) and sequence heterogeneity. This work offered an exciting problem of posttranscriptional modification and underwent numerous significant advances through technological revolutions during pregenomic, genomic, and postgenomic eras. Presently, snRNA research is making progresses involved in enzymology of snRNA modifications, molecular evolution, mechanism of spliceosome assembly, chemical mechanism of intron removal, high-order structure of snRNA in spliceosome, and pathology of splicing. These works are destined to reach final pathway of work “Function and Structure of Spliceosome” in addition to exciting new exploitation of other noncoding RNAs in all aspects of regulatory functions. PMID:22347623
A multi-step strategy to obtain crystals of the dengue virus RNA-dependent RNA polymerase that diffract to high resolution

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yap, Thai Leong; School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, Singapore 637551; Chen, Yen Liang

Crystals of the RNA-dependent RNA polymerase catalytic domain from the dengue virus NS5 protein have been obtained using a strategy that included expression screening of naturally occurring serotype variants of the protein, the addition of divalent metal ions and crystal dehydration. These crystals diffract to 1.85 Å resolution and are thus suitable for a structure-based drug-design program. Dengue virus, a member of the Flaviviridae genus, causes dengue fever, an important emerging disease with several million infections occurring annually for which no effective therapy exists. The viral RNA-dependent RNA polymerase NS5 plays an important role in virus replication and represents anmore » interesting target for the development of specific antiviral compounds. Crystals that diffract to 1.85 Å resolution that are suitable for three-dimensional structure determination and thus for a structure-based drug-design program have been obtained using a strategy that included expression screening of naturally occurring serotype variants of the protein, the addition of divalent metal ions and crystal dehydration.« less
Template-Based Modeling of Protein-RNA Interactions.

PubMed

Zheng, Jinfang; Kundrotas, Petras J; Vakser, Ilya A; Liu, Shiyong

2016-09-01

Protein-RNA complexes formed by specific recognition between RNA and RNA-binding proteins play an important role in biological processes. More than a thousand of such proteins in human are curated and many novel RNA-binding proteins are to be discovered. Due to limitations of experimental approaches, computational techniques are needed for characterization of protein-RNA interactions. Although much progress has been made, adequate methodologies reliably providing atomic resolution structural details are still lacking. Although protein-RNA free docking approaches proved to be useful, in general, the template-based approaches provide higher quality of predictions. Templates are key to building a high quality model. Sequence/structure relationships were studied based on a representative set of binary protein-RNA complexes from PDB. Several approaches were tested for pairwise target/template alignment. The analysis revealed a transition point between random and correct binding modes. The results showed that structural alignment is better than sequence alignment in identifying good templates, suitable for generating protein-RNA complexes close to the native structure, and outperforms free docking, successfully predicting complexes where the free docking fails, including cases of significant conformational change upon binding. A template-based protein-RNA interaction modeling protocol PRIME was developed and benchmarked on a representative set of complexes.
Assembly and analysis of eukaryotic Argonaute–RNA complexes in microRNA-target recognition

PubMed Central

Gan, Hin Hark; Gunsalus, Kristin C.

2015-01-01

Experimental studies have uncovered a variety of microRNA (miRNA)–target duplex structures that include perfect, imperfect and seedless duplexes. However, non-canonical binding modes from imperfect/seedless duplexes are not well predicted by computational approaches, which rely primarily on sequence and secondary structural features, nor have their tertiary structures been characterized because solved structures to date are limited to near perfect, straight duplexes in Argonautes (Agos). Here, we use structural modeling to examine the role of Ago dynamics in assembling viable eukaryotic miRNA-induced silencing complexes (miRISCs). We show that combinations of low-frequency, global modes of motion of Ago domains are required to accommodate RNA duplexes in model human and C. elegans Ago structures. Models of viable miRISCs imply that Ago adopts variable conformations at distinct target sites that generate distorted, imperfect miRNA-target duplexes. Ago's ability to accommodate a duplex is dependent on the region where structural distortions occur: distortions in solvent-exposed seed and 3′-end regions are less likely to produce steric clashes than those in the central duplex region. Energetic analyses of assembled miRISCs indicate that target recognition is also driven by favorable Ago-duplex interactions. Such structural insights into Ago loading and target recognition mechanisms may provide a more accurate assessment of miRNA function. PMID:26432829

Structural Features of a Picornavirus Polymerase Involved in the Polyadenylation of Viral RNA

PubMed Central

Kempf, Brian J.; Kelly, Michelle M.; Springer, Courtney L.; Peersen, Olve B.

2013-01-01

Picornaviruses have 3′ polyadenylated RNA genomes, but the mechanisms by which these genomes are polyadenylated during viral replication remain obscure. Based on prior studies, we proposed a model wherein the poliovirus RNA-dependent RNA polymerase (3Dpol) uses a reiterative transcription mechanism while replicating the poly(A) and poly(U) portions of viral RNA templates. To further test this model, we examined whether mutations in 3Dpol influenced the polyadenylation of virion RNA. We identified nine alanine substitution mutations in 3Dpol that resulted in shorter or longer 3′ poly(A) tails in virion RNA. These mutations could disrupt structural features of 3Dpol required for the recruitment of a cellular poly(A) polymerase; however, the structural orientation of these residues suggests a direct role of 3Dpol in the polyadenylation of RNA genomes. Reaction mixtures containing purified 3Dpol and a template RNA with a defined poly(U) sequence provided data consistent with a template-dependent reiterative transcription mechanism for polyadenylation. The phylogenetically conserved structural features of 3Dpol involved in the polyadenylation of virion RNA include a thumb domain alpha helix that is positioned in the minor groove of the double-stranded RNA product and lysine and arginine residues that interact with the phosphates of both the RNA template and product strands. PMID:23468507
Expression, crystallization and preliminary crystallographic analysis of RNA-binding protein Hfq (YmaH) from Bacillus subtilis in complex with an RNA aptamer.

PubMed

Baba, Seiki; Someya, Tatsuhiko; Kawai, Gota; Nakamura, Kouji; Kumasaka, Takashi

2010-05-01

The Hfq protein is a hexameric RNA-binding protein which regulates gene expression by binding to RNA under the influence of diverse environmental stresses. Its ring structure binds various types of RNA, including mRNA and sRNA. RNA-bound structures of Hfq from Escherichia coli and Staphylococcus aureus have been revealed to have poly(A) RNA at the distal site and U-rich RNA at the proximal site, respectively. Here, crystals of a complex of the Bacillus subtilis Hfq protein with an A/G-repeat 7-mer RNA (Hfq-RNA) that were prepared using the hanging-drop vapour-diffusion technique are reported. The type 1 Hfq-RNA crystals belonged to space group I422, with unit-cell parameters a = b = 123.70, c = 119.13 A, while the type 2 Hfq-RNA crystals belonged to space group F222, with unit-cell parameters a = 91.92, b = 92.50, c = 114.92 A. Diffraction data were collected to a resolution of 2.20 A from both crystal forms. The hexameric structure of the Hfq protein was clearly shown by self-rotation analysis.
Nuclear matrix and hnRNP share a common structural constituent associated with premessenger RNA.

PubMed Central

Gallinaro, H; Puvion, E; Kister, L; Jacob, M

1983-01-01

Nuclear matrix and heterogeneous nuclear ribonucleoprotein (hnRNP) were compared to establish whether premessenger RNA (premRNA) was associated with a same constituent in both structures. The isolation of nuclear matrix included the removal of chromatin and of 0.4 M KCl-soluble material. HnRNP, isolated by a standard method was also treated by 0.4 M KCl. Both isolation procedures caused the removal of DNA, histones, a fraction of small nuclear RNA and of nonhistone proteins including the hnRNP proteins in the 30 000-40 000 mol. wt. range. High resolution autoradiography showed that hnRNA remained associated with the residual fibrils in both structures. They both contained the same premRNA and maturation products as shown by the analysis of the transcripts of the early region 3 of adenovirus 2. In addition, the small nuclear RNA and protein of the salt-resistant complexes were also present in the matrix. The results are compatible with the idea that the salt-resistant complexes from hnRNP constitute the fibrils associated with premRNA in the nucleoplasmic matrix. The fibrils may be the basic unit of splicing and their organization in matrix might provide the spatial configuration necessary for regulation. Images Fig. 2. Fig. 3. Fig. 4. Fig. 5. Fig. 7. PMID:6557026
Web-Beagle: a web server for the alignment of RNA secondary structures.

PubMed

Mattei, Eugenio; Pietrosanto, Marco; Ferrè, Fabrizio; Helmer-Citterich, Manuela

2015-07-01

Web-Beagle (http://beagle.bio.uniroma2.it) is a web server for the pairwise global or local alignment of RNA secondary structures. The server exploits a new encoding for RNA secondary structure and a substitution matrix of RNA structural elements to perform RNA structural alignments. The web server allows the user to compute up to 10 000 alignments in a single run, taking as input sets of RNA sequences and structures or primary sequences alone. In the latter case, the server computes the secondary structure prediction for the RNAs on-the-fly using RNAfold (free energy minimization). The user can also compare a set of input RNAs to one of five pre-compiled RNA datasets including lncRNAs and 3' UTRs. All types of comparison produce in output the pairwise alignments along with structural similarity and statistical significance measures for each resulting alignment. A graphical color-coded representation of the alignments allows the user to easily identify structural similarities between RNAs. Web-Beagle can be used for finding structurally related regions in two or more RNAs, for the identification of homologous regions or for functional annotation. Benchmark tests show that Web-Beagle has lower computational complexity, running time and better performances than other available methods. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
RNA Nanotechnology: Engineering, Assembly and Applications in Detection, Gene Delivery and Therapy

PubMed Central

Guo, Peixuan

2010-01-01

Biological macromolecules including DNA, RNA, and proteins, have intrinsic features that make them potential building blocks for the bottom-up fabrication of nanodevices. RNA is unique in nanoscale fabrication due to its amazing diversity of function and structure. RNA molecules can be designed and manipulated with a level of simplicity characteristic of DNA while possessing versatility in structure and function similar to that of proteins. RNA molecules typically contain a large variety of single stranded loops suitable for inter- and intra-molecular interaction. These loops can serve as mounting dovetails obviating the need for external linking dowels in fabrication and assembly. The self-assembly of nanoparticles from RNA involves cooperative interaction of individual RNA molecules that spontaneously assemble in a predefined manner to form a larger two- or three-dimensional structure. Within the realm of self-assembly there are two main categories, namely template and non-template. Template assembly involves interaction of RNA molecules under the influence of specific external sequence, forces, or spatial constraints such as RNA transcription, hybridization, replication, annealing, molding, or replicas. In contrast, non-template assembly involves formation of a larger structure by individual components without the influence of external forces. Examples of non-template assembly are ligation, chemical conjugation, covalent linkage, and loop/loop interaction of RNA, especially the formation of RNA multimeric complexes. The best characterized RNA multiplier and the first to be described in RNA nanotechnological application is the motor pRNA of bacteriophage phi29 which form dimers, trimers, and hexamers, via hand-in-hand interaction. phi29 pRNA can be redesigned to form a variety of structures and shapes including twins, tetramers, rods, triangles, and 3D arrays several microns in size via interaction of programmed helical regions and loops. 3D RNA array formation requires a defined nucleotide number for twisting and a palindromic sequence. Such arrays are unusually stable and resistant to a wide range of temperatures, salt concentrations, and pH. Both the therapeutic siRNA or ribozyme and a receptor-binding RNA aptamer or other ligands have been engineered into individual pRNAs. Individual chimeric RNA building blocks harboring siRNA or other therapeutic molecules have been fabricated subsequently into a trimer through hand-in-hand interaction of the engineered right and left interlocking RNA loops. The incubation of these particles containing the receptor-binding aptamer or other ligands results in the binding and co-entry of trivalent therapeutic particles into cells. Such particles were subsequently shown to modulate the apoptosis of cancer cells in both cell cultures and animal trials. The use of such antigen-free 20–40 nm particles holds promise for the repeated long-term treatment of chronic diseases. Other potentially useful RNA molecules that form multimers include HIV RNA that contain kissing loop to form dimers, tecto-RNA that forms a “jigsaw puzzle,” and the Drosophila bicoid mRNA that forms multimers via “hand-by-arm” interactions. Applications of RNA molecules involving replication, molding, embossing, and other related techniques, have recently been described that allow the utilization of a variety of materials to enhance diversity and resolution of nanomaterials. It should eventually be possible to adapt RNA to facilitate construction of ordered, patterned, or pre-programmed arrays or superstructures. Given the potential for 3D fabrication, the chance to produce reversible self-assembly, and the ability of self-repair, editing and replication, RNA self-assembly will play an increasingly significant role in integrated biological nanofabrication. A random 100-nucleotide RNA library may exist in 1.6 × 1060 varieties with multifarious structure to serve as a vital system for efficient fabrication, with a complexity and diversity far exceeding that of any current nanoscale system. This review covers the basic concepts of RNA structure and function, certain methods for the study of RNA structure, the approaches for engineering or fabricating RNA into nanoparticles or arrays, and special features of RNA molecules that form multimers. The most recent development in exploration of RNA nanoparticles for pathogen detection, drug/gene delivery, and therapeutic application is also introduced in this review. PMID:16430131
RNA Tertiary Interactions in a Riboswitch Stabilize the Structure of a Kink Turn

PubMed Central

Schroeder, Kersten T.; Daldrop, Peter; Lilley, David M.J.

2011-01-01

Summary The kink turn is a widespread RNA motif that introduces an acute kink into the axis of duplex RNA, typically comprising a bulge followed by a G⋅A and A⋅G pairs. The kinked conformation is stabilized by metal ions, or the binding of proteins including L7Ae. We now demonstrate a third mechanism for the stabilization of k-turn structure, involving tertiary interactions within a larger RNA structure. The SAM-I riboswitch contains an essential standard k-turn sequence that kinks a helix so that its terminal loop can make a long-range interaction. We find that some sequence variations in the k-turn within the riboswitch do not prevent SAM binding, despite preventing the folding of the k-turn in isolation. Furthermore, two crystal structures show that the sequence-variant k-turns are conventionally folded within the riboswitch. This study shows that the folded structure of the k-turn can be stabilized by tertiary interactions within a larger RNA structure. PMID:21893284
Pseudoscorpion mitochondria show rearranged genes and genome-wide reductions of RNA gene sizes and inferred structures, yet typical nucleotide composition bias

PubMed Central

2012-01-01

Background Pseudoscorpions are chelicerates and have historically been viewed as being most closely related to solifuges, harvestmen, and scorpions. No mitochondrial genomes of pseudoscorpions have been published, but the mitochondrial genomes of some lineages of Chelicerata possess unusual features, including short rRNA genes and tRNA genes that lack sequence to encode arms of the canonical cloverleaf-shaped tRNA. Additionally, some chelicerates possess an atypical guanine-thymine nucleotide bias on the major coding strand of their mitochondrial genomes. Results We sequenced the mitochondrial genomes of two divergent taxa from the chelicerate order Pseudoscorpiones. We find that these genomes possess unusually short tRNA genes that do not encode cloverleaf-shaped tRNA structures. Indeed, in one genome, all 22 tRNA genes lack sequence to encode canonical cloverleaf structures. We also find that the large ribosomal RNA genes are substantially shorter than those of most arthropods. We inferred secondary structures of the LSU rRNAs from both pseudoscorpions, and find that they have lost multiple helices. Based on comparisons with the crystal structure of the bacterial ribosome, two of these helices were likely contact points with tRNA T-arms or D-arms as they pass through the ribosome during protein synthesis. The mitochondrial gene arrangements of both pseudoscorpions differ from the ancestral chelicerate gene arrangement. One genome is rearranged with respect to the location of protein-coding genes, the small rRNA gene, and at least 8 tRNA genes. The other genome contains 6 tRNA genes in novel locations. Most chelicerates with rearranged mitochondrial genes show a genome-wide reversal of the CA nucleotide bias typical for arthropods on their major coding strand, and instead possess a GT bias. Yet despite their extensive rearrangement, these pseudoscorpion mitochondrial genomes possess a CA bias on the major coding strand. Phylogenetic analyses of all 13 mitochondrial protein-coding gene sequences consistently yield trees that place pseudoscorpions as sister to acariform mites. Conclusion The well-supported phylogenetic placement of pseudoscorpions as sister to Acariformes differs from some previous analyses based on morphology. However, these two lineages share multiple molecular evolutionary traits, including substantial mitochondrial genome rearrangements, extensive nucleotide substitution, and loss of helices in their inferred tRNA and rRNA structures. PMID:22409411
Guide-bound structures of an RNA-targeting A-cleaving CRISPR-Cas13a enzyme

PubMed Central

Knott, Gavin J.; East-Seletsky, Alexandra; Cofsky, Joshua C.; Holton, James M.; Charles, Emeric; O’Connell, Mitchell R.; Doudna, Jennifer A.

2018-01-01

CRISPR adaptive immune systems protect bacteria from infections by deploying CRISPR RNA (crRNA)-guided enzymes to recognize and cut foreign nucleic acids. Type VI-A CRISPR-Cas systems include the Cas13a enzyme, an RNA-activated ribonuclease (RNase) capable of crRNA processing and single-stranded RNA degradation upon target transcript binding. Here we present the 2.0 Å resolution crystal structure of a crRNA-bound L. bacterium Cas13a (LbaCas13a), representing a recently discovered Cas13a enzyme subtype. This structure and accompanying biochemical experiments define for the first time the Cas13a catalytic residues that are directly responsible for crRNA maturation. In addition, the orientation of the foreign-derived target RNA-specifying sequence in the protein interior explains the conformational gating of Cas13a nuclease activation. These results describe how Cas13a enzymes generate functional crRNAs and how catalytic activity is blocked prior to target RNA recognition, with implications for both bacterial immunity and diagnostic applications. PMID:28892041
Analysis of sequencing data for probing RNA secondary structures and protein-RNA binding in studying posttranscriptional regulations.

PubMed

Hu, Xihao; Wu, Yang; Lu, Zhi John; Yip, Kevin Y

2016-11-01

High-throughput sequencing has been used to study posttranscriptional regulations, where the identification of protein-RNA binding is a major and fast-developing sub-area, which is in turn benefited by the sequencing methods for whole-transcriptome probing of RNA secondary structures. In the study of RNA secondary structures using high-throughput sequencing, bases are modified or cleaved according to their structural features, which alter the resulting composition of sequencing reads. In the study of protein-RNA binding, methods have been proposed to immuno-precipitate (IP) protein-bound RNA transcripts in vitro or in vivo By sequencing these transcripts, the protein-RNA interactions and the binding locations can be identified. For both types of data, read counts are affected by a combination of confounding factors, including expression levels of transcripts, sequence biases, mapping errors and the probing or IP efficiency of the experimental protocols. Careful processing of the sequencing data and proper extraction of important features are fundamentally important to a successful analysis. Here we review and compare different experimental methods for probing RNA secondary structures and binding sites of RNA-binding proteins (RBPs), and the computational methods proposed for analyzing the corresponding sequencing data. We suggest how these two types of data should be integrated to study the structural properties of RBP binding sites as a systematic way to better understand posttranscriptional regulations. © The Author 2015. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.
Structural organizations of yeast RNase P and RNase MRP holoenzymes as revealed by UV-crosslinking studies of RNA-protein interactions.

PubMed

Khanova, Elena; Esakova, Olga; Perederina, Anna; Berezin, Igor; Krasilnikov, Andrey S

2012-04-01

Eukaryotic ribonuclease (RNase) P and RNase MRP are closely related ribonucleoprotein complexes involved in the metabolism of various RNA molecules including tRNA, rRNA, and some mRNAs. While evolutionarily related to bacterial RNase P, eukaryotic enzymes of the RNase P/MRP family are much more complex. Saccharomyces cerevisiae RNase P consists of a catalytic RNA component and nine essential proteins; yeast RNase MRP has an RNA component resembling that in RNase P and 10 essential proteins, most of which are shared with RNase P. The structural organizations of eukaryotic RNases P/MRP are not clear. Here we present the results of RNA-protein UV crosslinking studies performed on RNase P and RNase MRP holoenzymes isolated from yeast. The results indicate locations of specific protein-binding sites in the RNA components of RNase P and RNase MRP and shed light on the structural organizations of these large ribonucleoprotein complexes.
Sequence, Structure, and Context Preferences of Human RNA Binding Proteins.

PubMed

Dominguez, Daniel; Freese, Peter; Alexis, Maria S; Su, Amanda; Hochman, Myles; Palden, Tsultrim; Bazile, Cassandra; Lambert, Nicole J; Van Nostrand, Eric L; Pratt, Gabriel A; Yeo, Gene W; Graveley, Brenton R; Burge, Christopher B

2018-06-07

RNA binding proteins (RBPs) orchestrate the production, processing, and function of mRNAs. Here, we present the affinity landscapes of 78 human RBPs using an unbiased assay that determines the sequence, structure, and context preferences of these proteins in vitro by deep sequencing of bound RNAs. These data enable construction of "RNA maps" of RBP activity without requiring crosslinking-based assays. We found an unexpectedly low diversity of RNA motifs, implying frequent convergence of binding specificity toward a relatively small set of RNA motifs, many with low compositional complexity. Offsetting this trend, however, we observed extensive preferences for contextual features distinct from short linear RNA motifs, including spaced "bipartite" motifs, biased flanking nucleotide composition, and bias away from or toward RNA structure. Our results emphasize the importance of contextual features in RNA recognition, which likely enable targeting of distinct subsets of transcripts by different RBPs that recognize the same linear motif. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Hepatitis Delta Antigen Requires a Flexible Quasi-Double-Stranded RNA Structure To Bind and Condense Hepatitis Delta Virus RNA in a Ribonucleoprotein Complex

PubMed Central

Griffin, Brittany L.; Chasovskikh, Sergey; Dritschilo, Anatoly

2014-01-01

ABSTRACT The circular genome and antigenome RNAs of hepatitis delta virus (HDV) form characteristic unbranched, quasi-double-stranded RNA secondary structures in which short double-stranded helical segments are interspersed with internal loops and bulges. The ribonucleoprotein complexes (RNPs) formed by these RNAs with the virus-encoded protein hepatitis delta antigen (HDAg) perform essential roles in the viral life cycle, including viral replication and virion formation. Little is understood about the formation and structure of these complexes and how they function in these key processes. Here, the specific RNA features required for HDAg binding and the topology of the complexes formed were investigated. Selective 2′OH acylation analyzed by primer extension (SHAPE) applied to free and HDAg-bound HDV RNAs indicated that the characteristic secondary structure of the RNA is preserved when bound to HDAg. Notably, the analysis indicated that predicted unpaired positions in the RNA remained dynamic in the RNP. Analysis of the in vitro binding activity of RNAs in which internal loops and bulges were mutated and of synthetically designed RNAs demonstrated that the distinctive secondary structure, not the primary RNA sequence, is the major determinant of HDAg RNA binding specificity. Atomic force microscopy analysis of RNPs formed in vitro revealed complexes in which the HDV RNA is substantially condensed by bending or wrapping. Our results support a model in which the internal loops and bulges in HDV RNA contribute flexibility to the quasi-double-stranded structure that allows RNA bending and condensing by HDAg. IMPORTANCE RNA-protein complexes (RNPs) formed by the hepatitis delta virus RNAs and protein, HDAg, perform critical roles in virus replication. Neither the structures of these RNPs nor the RNA features required to form them have been characterized. HDV RNA is unusual in that it forms an unbranched quasi-double-stranded structure in which short base-paired segments are interspersed with internal loops and bulges. We analyzed the role of the HDV RNA sequence and secondary structure in the formation of a minimal RNP and visualized the structure of this RNP using atomic force microscopy. Our results indicate that HDAg does not recognize the primary sequence of the RNA; rather, the principle contribution of unpaired bases in HDV RNA to HDAg binding is to allow flexibility in the unbranched quasi-double-stranded RNA structure. Visualization of RNPs by atomic force microscopy indicated that the RNA is significantly bent or condensed in the complex. PMID:24741096
Consistent global structures of complex RNA states through multidimensional chemical mapping

PubMed Central

Cheng, Clarence Yu; Chou, Fang-Chieh; Kladwang, Wipapat; Tian, Siqi; Cordero, Pablo; Das, Rhiju

2015-01-01

Accelerating discoveries of non-coding RNA (ncRNA) in myriad biological processes pose major challenges to structural and functional analysis. Despite progress in secondary structure modeling, high-throughput methods have generally failed to determine ncRNA tertiary structures, even at the 1-nm resolution that enables visualization of how helices and functional motifs are positioned in three dimensions. We report that integrating a new method called MOHCA-seq (Multiplexed •OH Cleavage Analysis with paired-end sequencing) with mutate-and-map secondary structure inference guides Rosetta 3D modeling to consistent 1-nm accuracy for intricately folded ncRNAs with lengths up to 188 nucleotides, including a blind RNA-puzzle challenge, the lariat-capping ribozyme. This multidimensional chemical mapping (MCM) pipeline resolves unexpected tertiary proximities for cyclic-di-GMP, glycine, and adenosylcobalamin riboswitch aptamers without their ligands and a loose structure for the recently discovered human HoxA9D internal ribosome entry site regulon. MCM offers a sequencing-based route to uncovering ncRNA 3D structure, applicable to functionally important but potentially heterogeneous states. DOI: http://dx.doi.org/10.7554/eLife.07600.001 PMID:26035425
RNA-DNA Triplex Formation by Long Noncoding RNAs.

PubMed

Li, Yue; Syed, Junetha; Sugiyama, Hiroshi

2016-11-17

Long noncoding RNAs (lncRNAs) play a pivotal role in the regulation of biological processes through various mechanisms that are not fully understood. Proposed mechanisms include regulation based on RNA-protein interactions, as well as RNA-RNA interactions and RNA-DNA interactions. Here, we focus on one possible mechanism that lncRNA might be using to impact biological function, the RNA-DNA triplex formation. We summarize currently available examples of lncRNA triplex formation and discuss the details surrounding orientation of triplex formation as one of the key properties guiding this process. We propose that symmetrical triplex-forming motifs, especially those in cis-acting lncRNAs, favor triplex formation. We also consider the effects of lncRNA structures, protein or ligand binding, and chromatin structures on the lncRNAs triplex formation. Copyright © 2016 Elsevier Ltd. All rights reserved.
Identification of 15 candidate structured noncoding RNA motifs in fungi by comparative genomics.

PubMed

Li, Sanshu; Breaker, Ronald R

2017-10-13

With the development of rapid and inexpensive DNA sequencing, the genome sequences of more than 100 fungal species have been made available. This dataset provides an excellent resource for comparative genomics analyses, which can be used to discover genetic elements, including noncoding RNAs (ncRNAs). Bioinformatics tools similar to those used to uncover novel ncRNAs in bacteria, likewise, should be useful for searching fungal genomic sequences, and the relative ease of genetic experiments with some model fungal species could facilitate experimental validation studies. We have adapted a bioinformatics pipeline for discovering bacterial ncRNAs to systematically analyze many fungal genomes. This comparative genomics pipeline integrates information on conserved RNA sequence and structural features with alternative splicing information to reveal fungal RNA motifs that are candidate regulatory domains, or that might have other possible functions. A total of 15 prominent classes of structured ncRNA candidates were identified, including variant HDV self-cleaving ribozyme representatives, atypical snoRNA candidates, and possible structured antisense RNA motifs. Candidate regulatory motifs were also found associated with genes for ribosomal proteins, S-adenosylmethionine decarboxylase (SDC), amidase, and HexA protein involved in Woronin body formation. We experimentally confirm that the variant HDV ribozymes undergo rapid self-cleavage, and we demonstrate that the SDC RNA motif reduces the expression of SAM decarboxylase by translational repression. Furthermore, we provide evidence that several other motifs discovered in this study are likely to be functional ncRNA elements. Systematic screening of fungal genomes using a computational discovery pipeline has revealed the existence of a variety of novel structured ncRNAs. Genome contexts and similarities to known ncRNA motifs provide strong evidence for the biological and biochemical functions of some newly found ncRNA motifs. Although initial examinations of several motifs provide evidence for their likely functions, other motifs will require more in-depth analysis to reveal their functions.
Similarities and Differences between RNA and DNA Double-Helical Structures in Circular Dichroism Spectroscopy: A SAC-CI Study.

PubMed

Miyahara, Tomoo; Nakatsuji, Hiroshi; Sugiyama, Hiroshi

2016-11-17

The helical structures of DNA and RNA are investigated experimentally using circular dichroism (CD) spectroscopy. The signs and the shapes of the CD spectra are much different between the right- and left-handed structures as well as between DNA and RNA. The main difference lies in the sign at around 295 nm of the CD spectra: it is positive for the right-handed B-DNA and the left-handed Z-RNA but is negative for the left-handed Z-DNA and the right-handed A-RNA. We calculated the SAC-CI CD spectra of DNA and RNA using the tetramer models, which include both hydrogen-bonding and stacking interactions that are important in both DNA and RNA. The SAC-CI results reproduced the features at around 295 nm of the experimental CD spectra of each DNA and RNA, and elucidated that the strong stacking interaction between the two base pairs is the origin of the negative peaks at 295 nm of the CD spectra for both DNA and RNA. On the basis of these facts, we discuss the similarities and differences between RNA and DNA double-helical structures in the CD spectroscopy based on the ChiraSac methodology.
RNA Polymerase III promoter screen uncovers a novel noncoding RNA family conserved in Caenorhabditis and other clade V nematodes.

PubMed

Gruber, Andreas R

2014-07-10

RNA Polymerase III is a highly specialized enzyme complex responsible for the transcription of a very distinct set of housekeeping noncoding RNAs including tRNAs, 7SK snRNA, Y RNAs, U6 snRNA, and the RNA components of RNaseP and RNaseMRP. In this work we have utilized the conserved promoter structure of known RNA Polymerase III transcripts consisting of characteristic sequence elements termed proximal sequence elements (PSE) A and B and a TATA-box to uncover a novel RNA Polymerase III-transcribed, noncoding RNA family found to be conserved in Caenorhabditis as well as other clade V nematode species. Homology search in combination with detailed sequence and secondary structure analysis revealed that members of this novel ncRNA family evolve rapidly, and only maintain a potentially functional small stem structure that links the 5' end to the very 3' end of the transcript and a small hairpin structure at the 3' end. This is most likely required for efficient transcription termination. In addition, our study revealed evidence that canonical C/D box snoRNAs are also transcribed from a PSE A-PSE B-TATA-box promoter in Caenorhabditis elegans. Copyright © 2014 Elsevier B.V. All rights reserved.
Template-Based Modeling of Protein-RNA Interactions

PubMed Central

Zheng, Jinfang; Kundrotas, Petras J.; Vakser, Ilya A.

2016-01-01

Protein-RNA complexes formed by specific recognition between RNA and RNA-binding proteins play an important role in biological processes. More than a thousand of such proteins in human are curated and many novel RNA-binding proteins are to be discovered. Due to limitations of experimental approaches, computational techniques are needed for characterization of protein-RNA interactions. Although much progress has been made, adequate methodologies reliably providing atomic resolution structural details are still lacking. Although protein-RNA free docking approaches proved to be useful, in general, the template-based approaches provide higher quality of predictions. Templates are key to building a high quality model. Sequence/structure relationships were studied based on a representative set of binary protein-RNA complexes from PDB. Several approaches were tested for pairwise target/template alignment. The analysis revealed a transition point between random and correct binding modes. The results showed that structural alignment is better than sequence alignment in identifying good templates, suitable for generating protein-RNA complexes close to the native structure, and outperforms free docking, successfully predicting complexes where the free docking fails, including cases of significant conformational change upon binding. A template-based protein-RNA interaction modeling protocol PRIME was developed and benchmarked on a representative set of complexes. PMID:27662342
RNA structural constraints in the evolution of the influenza A virus genome NP segment

PubMed Central

Gultyaev, Alexander P; Tsyganov-Bodounov, Anton; Spronken, Monique IJ; van der Kooij, Sander; Fouchier, Ron AM; Olsthoorn, René CL

2014-01-01

Conserved RNA secondary structures were predicted in the nucleoprotein (NP) segment of the influenza A virus genome using comparative sequence and structure analysis. A number of structural elements exhibiting nucleotide covariations were identified over the whole segment length, including protein-coding regions. Calculations of mutual information values at the paired nucleotide positions demonstrate that these structures impose considerable constraints on the virus genome evolution. Functional importance of a pseudoknot structure, predicted in the NP packaging signal region, was confirmed by plaque assays of the mutant viruses with disrupted structure and those with restored folding using compensatory substitutions. Possible functions of the conserved RNA folding patterns in the influenza A virus genome are discussed. PMID:25180940
Expression, crystallization and preliminary crystallographic analysis of RNA-binding protein Hfq (YmaH) from Bacillus subtilis in complex with an RNA aptamer

PubMed Central

Baba, Seiki; Someya, Tatsuhiko; Kawai, Gota; Nakamura, Kouji; Kumasaka, Takashi

2010-01-01

The Hfq protein is a hexameric RNA-binding protein which regulates gene expression by binding to RNA under the influence of diverse environmental stresses. Its ring structure binds various types of RNA, including mRNA and sRNA. RNA-bound structures of Hfq from Escherichia coli and Staphylococcus aureus have been revealed to have poly(A) RNA at the distal site and U-rich RNA at the proximal site, respectively. Here, crystals of a complex of the Bacillus subtilis Hfq protein with an A/G-repeat 7-mer RNA (Hfq–RNA) that were prepared using the hanging-drop vapour-diffusion technique are reported. The type 1 Hfq–RNA crystals belonged to space group I422, with unit-cell parameters a = b = 123.70, c = 119.13 Å, while the type 2 Hfq–RNA crystals belonged to space group F222, with unit-cell parameters a = 91.92, b = 92.50, c = 114.92 Å. Diffraction data were collected to a resolution of 2.20 Å from both crystal forms. The hexameric structure of the Hfq protein was clearly shown by self-rotation analysis. PMID:20445260

Picornaviral Polymerase Structure, Function, and Fidelity Modulation

PubMed Central

Peersen, Olve B.

2017-01-01

Like all positive strand RNA viruses, the picornaviruses replicate their genomes using a virally encoded RNA-dependent RNA polymerase enzyme known as 3Dpol. Over the past decade we have made tremendous advances in our understanding of 3Dpol structure and function, including the discovery of a novel mechanism for closing the active site that allows these viruses to easily fine tune replication fidelity and quasispecies distributions. This review summarizes current knowledge of picornaviral polymerase structure and how the enzyme interacts with RNA and other viral proteins to form stable and processive elongation complexes. The picornaviral RdRPs are among the smallest viral polymerases, but their fundamental molecular mechanism for catalysis appears to be generally applicable as a common feature of all positive strand RNA virus polymerases. PMID:28163093
Insights into the Structural Dynamics of Nucleocytoplasmic Transport of tRNA by Exportin-t

PubMed Central

Gupta, Asmita; Kailasam, Senthilkumar; Bansal, Manju

2016-01-01

Exportin-t (Xpot) transports mature 5′- and 3′-end processed tRNA from the nucleus to the cytoplasm by associating with a small G-protein Ran (RAs-related nuclear protein), in the nucleus. The release of tRNA in cytoplasm involves RanGTP hydrolysis. Despite the availability of crystal structures of nuclear and cytosolic forms of Xpot, the molecular details regarding the sequential events leading to tRNA release and subsequent conformational changes occurring in Xpot remain unknown. We have performed a combination of classical all-atom and accelerated molecular dynamics simulations on a set of complexes involving Xpot to study a range of features including conformational flexibility of free and cargo-bound Xpot and functionally critical contacts between Xpot and its cargo. The systems investigated include free Xpot and its different complexes, bound either to Ran (GTP/GDP) or tRNA or both. This approach provided a statistically reliable estimate of structural dynamics of Xpot after cargo release. The mechanistic basis for Xpot opening after cargo release has been explained in terms of dynamic structural hinges, about which neighboring region could be displaced to facilitate the nuclear to cytosolic state transition. Post-RanGTP hydrolysis, a cascade of events including local conformational change in RanGTP and loss of critical contacts at Xpot/tRNA interface suggest factors responsible for eventual release of tRNA. The level of flexibility in different Xpot complexes varied depending on the arrangement of individual HEAT repeats. Current study provides one of the most comprehensive and robust analysis carried out on this protein using molecular dynamics schemes. PMID:27028637
Regulation of Flavivirus RNA synthesis and replication

PubMed Central

Selisko, Barbara; Wang, Chunling; Harris, Eva; Canard, Bruno

2014-01-01

RNA synthesis and replication of the members of the Flavivirus genus (including dengue, West Nile and Japanese encephalitis viruses) is regulated by a wide variety of mechanisms and actors. These include the sequestration of the RNA-dependent RNA polymerase (RdRp) for functions other than RNA synthesis, regulatory interactions with other viral and host proteins within the replication complex (RC), and regulatory elements within the RNA genome itself. In this review, we discuss our current knowledge of the multiple levels at which Flavivirus RNA synthesis is controlled. We aim to bring together two active research fields: the structural and functional biology of individual proteins of the RC and the impressive wealth of knowledge acquired regarding the viral genomic RNA. PMID:25462437
Basis of altered RNA-binding specificity by PUF proteins revealed by crystal structures of yeast Puf4p

DOE Office of Scientific and Technical Information (OSTI.GOV)

Miller, Matthew T.; Higgin, Joshua J.; Hall, Traci M.Tanaka

2008-06-06

Pumilio/FBF (PUF) family proteins are found in eukaryotic organisms and regulate gene expression post-transcriptionally by binding to sequences in the 3' untranslated region of target transcripts. PUF proteins contain an RNA binding domain that typically comprises eight {alpha}-helical repeats, each of which recognizes one RNA base. Some PUF proteins, including yeast Puf4p, have altered RNA binding specificity and use their eight repeats to bind to RNA sequences with nine or ten bases. Here we report the crystal structures of Puf4p alone and in complex with a 9-nucleotide (nt) target RNA sequence, revealing that Puf4p accommodates an 'extra' nucleotide by modestmore » adaptations allowing one base to be turned away from the RNA binding surface. Using structural information and sequence comparisons, we created a mutant Puf4p protein that preferentially binds to an 8-nt target RNA sequence over a 9-nt sequence and restores binding of each protein repeat to one RNA base.« less
Regulatory RNAs in Bacillus subtilis: a Gram-Positive Perspective on Bacterial RNA-Mediated Regulation of Gene Expression.

PubMed

Mars, Ruben A T; Nicolas, Pierre; Denham, Emma L; van Dijl, Jan Maarten

2016-12-01

Bacteria can employ widely diverse RNA molecules to regulate their gene expression. Such molecules include trans-acting small regulatory RNAs, antisense RNAs, and a variety of transcriptional attenuation mechanisms in the 5' untranslated region. Thus far, most regulatory RNA research has focused on Gram-negative bacteria, such as Escherichia coli and Salmonella. Hence, there is uncertainty about whether the resulting insights can be extrapolated directly to other bacteria, such as the Gram-positive soil bacterium Bacillus subtilis. A recent study identified 1,583 putative regulatory RNAs in B. subtilis, whose expression was assessed across 104 conditions. Here, we review the current understanding of RNA-based regulation in B. subtilis, and we categorize the newly identified putative regulatory RNAs on the basis of their conservation in other bacilli and the stability of their predicted secondary structures. Our present evaluation of the publicly available data indicates that RNA-mediated gene regulation in B. subtilis mostly involves elements at the 5' ends of mRNA molecules. These can include 5' secondary structure elements and metabolite-, tRNA-, or protein-binding sites. Importantly, sense-independent segments are identified as the most conserved and structured potential regulatory RNAs in B. subtilis. Altogether, the present survey provides many leads for the identification of new regulatory RNA functions in B. subtilis. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Regulatory RNAs in Bacillus subtilis: a Gram-Positive Perspective on Bacterial RNA-Mediated Regulation of Gene Expression

PubMed Central

Mars, Ruben A. T.; Nicolas, Pierre; Denham, Emma L.

2016-01-01

SUMMARY Bacteria can employ widely diverse RNA molecules to regulate their gene expression. Such molecules include trans-acting small regulatory RNAs, antisense RNAs, and a variety of transcriptional attenuation mechanisms in the 5′ untranslated region. Thus far, most regulatory RNA research has focused on Gram-negative bacteria, such as Escherichia coli and Salmonella. Hence, there is uncertainty about whether the resulting insights can be extrapolated directly to other bacteria, such as the Gram-positive soil bacterium Bacillus subtilis. A recent study identified 1,583 putative regulatory RNAs in B. subtilis, whose expression was assessed across 104 conditions. Here, we review the current understanding of RNA-based regulation in B. subtilis, and we categorize the newly identified putative regulatory RNAs on the basis of their conservation in other bacilli and the stability of their predicted secondary structures. Our present evaluation of the publicly available data indicates that RNA-mediated gene regulation in B. subtilis mostly involves elements at the 5′ ends of mRNA molecules. These can include 5′ secondary structure elements and metabolite-, tRNA-, or protein-binding sites. Importantly, sense-independent segments are identified as the most conserved and structured potential regulatory RNAs in B. subtilis. Altogether, the present survey provides many leads for the identification of new regulatory RNA functions in B. subtilis. PMID:27784798
RNA Graph Partitioning for the Discovery of RNA Modularity: A Novel Application of Graph Partition Algorithm to Biology

PubMed Central

Elmetwaly, Shereef; Schlick, Tamar

2014-01-01

Graph representations have been widely used to analyze and design various economic, social, military, political, and biological networks. In systems biology, networks of cells and organs are useful for understanding disease and medical treatments and, in structural biology, structures of molecules can be described, including RNA structures. In our RNA-As-Graphs (RAG) framework, we represent RNA structures as tree graphs by translating unpaired regions into vertices and helices into edges. Here we explore the modularity of RNA structures by applying graph partitioning known in graph theory to divide an RNA graph into subgraphs. To our knowledge, this is the first application of graph partitioning to biology, and the results suggest a systematic approach for modular design in general. The graph partitioning algorithms utilize mathematical properties of the Laplacian eigenvector (µ2) corresponding to the second eigenvalues (λ2) associated with the topology matrix defining the graph: λ2 describes the overall topology, and the sum of µ2′s components is zero. The three types of algorithms, termed median, sign, and gap cuts, divide a graph by determining nodes of cut by median, zero, and largest gap of µ2′s components, respectively. We apply these algorithms to 45 graphs corresponding to all solved RNA structures up through 11 vertices (∼220 nucleotides). While we observe that the median cut divides a graph into two similar-sized subgraphs, the sign and gap cuts partition a graph into two topologically-distinct subgraphs. We find that the gap cut produces the best biologically-relevant partitioning for RNA because it divides RNAs at less stable connections while maintaining junctions intact. The iterative gap cuts suggest basic modules and assembly protocols to design large RNA structures. Our graph substructuring thus suggests a systematic approach to explore the modularity of biological networks. In our applications to RNA structures, subgraphs also suggest design strategies for novel RNA motifs. PMID:25188578
A general strategy to solve the phase problem in RNA crystallography

PubMed Central

Keel, Amanda Y.; Rambo, Robert P.; Batey, Robert T.; Kieft, Jeffrey S.

2007-01-01

SUMMARY X-ray crystallography of biologically important RNA molecules has been hampered by technical challenges, including finding a heavy-atom derivative to obtain high-quality experimental phase information. Existing techniques have drawbacks, severely limiting the rate at which important new structures are solved. To address this need, we have developed a reliable means to localize heavy atoms specifically to virtually any RNA. By solving the crystal structures of thirteen variants of the G·U wobble pair cation binding motif we have identified an optimal version that when inserted into an RNA helix introduces a high-occupancy cation binding site suitable for phasing. This “directed soaking” strategy can be integrated fully into existing RNA and crystallography methods, potentially increasing the rate at which important structures are solved and facilitating routine solving of structures using Cu-Kα radiation. The success of this method has been proven in that it has already been used to solve several novel crystal structures. PMID:17637337
Detailed analysis of RNA-protein interactions within the bacterial ribosomal protein L5/5S rRNA complex.

PubMed

Perederina, Anna; Nevskaya, Natalia; Nikonov, Oleg; Nikulin, Alexei; Dumas, Philippe; Yao, Min; Tanaka, Isao; Garber, Maria; Gongadze, George; Nikonov, Stanislav

2002-12-01

The crystal structure of ribosomal protein L5 from Thermus thermophilus complexed with a 34-nt fragment comprising helix III and loop C of Escherichia coli 5S rRNA has been determined at 2.5 A resolution. The protein specifically interacts with the bulged nucleotides at the top of loop C of 5S rRNA. The rRNA and protein contact surfaces are strongly stabilized by intramolecular interactions. Charged and polar atoms forming the network of conserved intermolecular hydrogen bonds are located in two narrow planar parallel layers belonging to the protein and rRNA, respectively. The regions, including these atoms conserved in Bacteria and Archaea, can be considered an RNA-protein recognition module. Comparison of the T. thermophilus L5 structure in the RNA-bound form with the isolated Bacillus stearothermophilus L5 structure shows that the RNA-recognition module on the protein surface does not undergo significant changes upon RNA binding. In the crystal of the complex, the protein interacts with another RNA molecule in the asymmetric unit through the beta-sheet concave surface. This protein/RNA interface simulates the interaction of L5 with 23S rRNA observed in the Haloarcula marismortui 50S ribosomal subunit.
Detailed analysis of RNA-protein interactions within the bacterial ribosomal protein L5/5S rRNA complex.

PubMed Central

Perederina, Anna; Nevskaya, Natalia; Nikonov, Oleg; Nikulin, Alexei; Dumas, Philippe; Yao, Min; Tanaka, Isao; Garber, Maria; Gongadze, George; Nikonov, Stanislav

2002-01-01

The crystal structure of ribosomal protein L5 from Thermus thermophilus complexed with a 34-nt fragment comprising helix III and loop C of Escherichia coli 5S rRNA has been determined at 2.5 A resolution. The protein specifically interacts with the bulged nucleotides at the top of loop C of 5S rRNA. The rRNA and protein contact surfaces are strongly stabilized by intramolecular interactions. Charged and polar atoms forming the network of conserved intermolecular hydrogen bonds are located in two narrow planar parallel layers belonging to the protein and rRNA, respectively. The regions, including these atoms conserved in Bacteria and Archaea, can be considered an RNA-protein recognition module. Comparison of the T. thermophilus L5 structure in the RNA-bound form with the isolated Bacillus stearothermophilus L5 structure shows that the RNA-recognition module on the protein surface does not undergo significant changes upon RNA binding. In the crystal of the complex, the protein interacts with another RNA molecule in the asymmetric unit through the beta-sheet concave surface. This protein/RNA interface simulates the interaction of L5 with 23S rRNA observed in the Haloarcula marismortui 50S ribosomal subunit. PMID:12515387
Molecular Phylogenetics and Systematics of the Bivalve Family Ostreidae Based on rRNA Sequence-Structure Models and Multilocus Species Tree

PubMed Central

Salvi, Daniele; Macali, Armando; Mariottini, Paolo

2014-01-01

The bivalve family Ostreidae has a worldwide distribution and includes species of high economic importance. Phylogenetics and systematic of oysters based on morphology have proved difficult because of their high phenotypic plasticity. In this study we explore the phylogenetic information of the DNA sequence and secondary structure of the nuclear, fast-evolving, ITS2 rRNA and the mitochondrial 16S rRNA genes from the Ostreidae and we implemented a multi-locus framework based on four loci for oyster phylogenetics and systematics. Sequence-structure rRNA models aid sequence alignment and improved accuracy and nodal support of phylogenetic trees. In agreement with previous molecular studies, our phylogenetic results indicate that none of the currently recognized subfamilies, Crassostreinae, Ostreinae, and Lophinae, is monophyletic. Single gene trees based on Maximum likelihood (ML) and Bayesian (BA) methods and on sequence-structure ML were congruent with multilocus trees based on a concatenated (ML and BA) and coalescent based (BA) approaches and consistently supported three main clades: (i) Crassostrea, (ii) Saccostrea, and (iii) an Ostreinae-Lophinae lineage. Therefore, the subfamily Crassotreinae (including Crassostrea), Saccostreinae subfam. nov. (including Saccostrea and tentatively Striostrea) and Ostreinae (including Ostreinae and Lophinae taxa) are recognized. Based on phylogenetic and biogeographical evidence the Asian species of Crassostrea from the Pacific Ocean are assigned to Magallana gen. nov., whereas an integrative taxonomic revision is required for the genera Ostrea and Dendostrea. This study pointed out the suitability of the ITS2 marker for DNA barcoding of oyster and the relevance of using sequence-structure rRNA models and features of the ITS2 folding in molecular phylogenetics and taxonomy. The multilocus approach allowed inferring a robust phylogeny of Ostreidae providing a broad molecular perspective on their systematics. PMID:25250663
Molecular phylogenetics and systematics of the bivalve family Ostreidae based on rRNA sequence-structure models and multilocus species tree.

PubMed

Salvi, Daniele; Macali, Armando; Mariottini, Paolo

2014-01-01

The bivalve family Ostreidae has a worldwide distribution and includes species of high economic importance. Phylogenetics and systematic of oysters based on morphology have proved difficult because of their high phenotypic plasticity. In this study we explore the phylogenetic information of the DNA sequence and secondary structure of the nuclear, fast-evolving, ITS2 rRNA and the mitochondrial 16S rRNA genes from the Ostreidae and we implemented a multi-locus framework based on four loci for oyster phylogenetics and systematics. Sequence-structure rRNA models aid sequence alignment and improved accuracy and nodal support of phylogenetic trees. In agreement with previous molecular studies, our phylogenetic results indicate that none of the currently recognized subfamilies, Crassostreinae, Ostreinae, and Lophinae, is monophyletic. Single gene trees based on Maximum likelihood (ML) and Bayesian (BA) methods and on sequence-structure ML were congruent with multilocus trees based on a concatenated (ML and BA) and coalescent based (BA) approaches and consistently supported three main clades: (i) Crassostrea, (ii) Saccostrea, and (iii) an Ostreinae-Lophinae lineage. Therefore, the subfamily Crassostreinae (including Crassostrea), Saccostreinae subfam. nov. (including Saccostrea and tentatively Striostrea) and Ostreinae (including Ostreinae and Lophinae taxa) are recognized [corrected]. Based on phylogenetic and biogeographical evidence the Asian species of Crassostrea from the Pacific Ocean are assigned to Magallana gen. nov., whereas an integrative taxonomic revision is required for the genera Ostrea and Dendostrea. This study pointed out the suitability of the ITS2 marker for DNA barcoding of oyster and the relevance of using sequence-structure rRNA models and features of the ITS2 folding in molecular phylogenetics and taxonomy. The multilocus approach allowed inferring a robust phylogeny of Ostreidae providing a broad molecular perspective on their systematics.
Distinct families of cis-acting RNA replication elements epsilon from hepatitis B viruses

PubMed Central

Chen, Augustine; Brown, Chris

2012-01-01

The hepadnavirus encapsidation signal, epsilon (ε), is an RNA structure located at the 5′ end of the viral pregenomic RNA. It is essential for viral replication and functions in polymerase protein binding and priming. This structure could also have potential regulatory roles in controlling the expression of viral replicative proteins. In addition to its structure, the primary sequence of this RNA element has crucial functional roles in the viral lifecycle. Although the ε elements in hepadnaviruses share common critical functions, there are some significant differences in mammalian and avian hepadnaviruses, which include both sequence and structural variations. Here we present several covariance models for ε elements from the Hepadnaviridae. The model building included experimentally determined data from previous studies using chemical probing and NMR analysis. These models have sufficient similarity to comprise a clan. The clan has in common a highly conserved overall structure consisting of a lower-stem, bulge, upper-stem and apical-loop. The models differ in functionally critical regions—notably the two types of avian ε elements have a tetra-loop (UGUU) including a non-canonical UU base pair, while the hepatitis B virus (HBV) epsilon has a tri-loop (UGU). The avian epsilon elements have a less stable dynamic structure in the upper stem. Comparisons between these models and all other Rfam models, and searches of genomes, showed these structures are specific to the Hepadnaviridae. Two family models and the clan are available from the Rfam database. PMID:22418844
RNase MRP Cleaves Pre-tRNASer-Met in the tRNA Maturation Pathway

PubMed Central

Adachi, Kousuke; Nobe, Yuko; Kobayashi, Junya; Hirota, Kouji; Oliveira, Douglas V.; Taoka, Masato; Isobe, Toshiaki

2014-01-01

Ribonuclease mitochondrial RNA processing (RNase MRP) is a multifunctional ribonucleoprotein (RNP) complex that is involved in the maturation of various types of RNA including ribosomal RNA. RNase MRP consists of a potential catalytic RNA and several protein components, all of which are required for cell viability. We show here that the temperature-sensitive mutant of rmp1, the gene for a unique protein component of RNase MRP, accumulates the dimeric tRNA precursor, pre-tRNASer-Met. To examine whether RNase MRP mediates tRNA maturation, we purified the RNase MRP holoenzyme from the fission yeast Schizosaccharomyces pombe and found that the enzyme directly and selectively cleaves pre-tRNASer-Met, suggesting that RNase MRP participates in the maturation of specific tRNA in vivo. In addition, mass spectrometry–based ribonucleoproteomic analysis demonstrated that this RNase MRP consists of one RNA molecule and 11 protein components, including a previously unknown component Rpl701. Notably, limited nucleolysis of RNase MRP generated an active catalytic core consisting of partial mrp1 RNA fragments, which constitute “Domain 1” in the secondary structure of RNase MRP, and 8 proteins. Thus, the present study provides new insight into the structure and function of RNase MRP. PMID:25401760
RNA self-assembly and RNA nanotechnology.

PubMed

Grabow, Wade W; Jaeger, Luc

2014-06-17

CONSPECTUS: Nanotechnology's central goal involves the direct control of matter at the molecular nanometer scale to build nanofactories, nanomachines, and other devices for potential applications including electronics, alternative fuels, and medicine. In this regard, the nascent use of nucleic acids as a material to coordinate the precise arrangements of specific molecules marked an important milestone in the relatively recent history of nanotechnology. While DNA served as the pioneer building material in nucleic acid nanotechnology, RNA continues to emerge as viable alternative material with its own distinct advantages for nanoconstruction. Several complementary assembly strategies have been used to build a diverse set of RNA nanostructures having unique structural attributes and the ability to self-assemble in a highly programmable and controlled manner. Of the different strategies, the architectonics approach uniquely endeavors to understand integrated structural RNA architectures through the arrangement of their characteristic structural building blocks. Viewed through this lens, it becomes apparent that nature routinely uses thermodynamically stable, recurrent modular motifs from natural RNA molecules to generate unique and more complex programmable structures. With the design principles found in natural structures, a number of synthetic RNAs have been constructed. The synthetic nanostructures constructed to date have provided, in addition to affording essential insights into RNA design, important platforms to characterize and validate the structural self-folding and assembly properties of RNA modules or building blocks. Furthermore, RNA nanoparticles have shown great promise for applications in nanomedicine and RNA-based therapeutics. Nevertheless, the synthetic RNA architectures achieved thus far consist largely of static, rigid particles that are still far from matching the structural and functional complexity of natural responsive structural elements such as the ribosome, large ribozymes, and riboswitches. Thus, the next step in synthetic RNA design will involve new ways to implement these same types of dynamic and responsive architectures into nanostructures functioning as real nanomachines in and outside the cell. RNA nanotechnology will likely garner broader utility and influence with a greater focus on the interplay between thermodynamic and kinetic influences on RNA self-assembly and using natural RNAs as guiding principles.
Brickworx builds recurrent RNA and DNA structural motifs into medium- and low-resolution electron-density maps

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chojnowski, Grzegorz, E-mail: gchojnowski@genesilico.pl; Waleń, Tomasz; University of Warsaw, Banacha 2, 02-097 Warsaw

2015-03-01

A computer program that builds crystal structure models of nucleic acid molecules is presented. Brickworx is a computer program that builds crystal structure models of nucleic acid molecules using recurrent motifs including double-stranded helices. In a first step, the program searches for electron-density peaks that may correspond to phosphate groups; it may also take into account phosphate-group positions provided by the user. Subsequently, comparing the three-dimensional patterns of the P atoms with a database of nucleic acid fragments, it finds the matching positions of the double-stranded helical motifs (A-RNA or B-DNA) in the unit cell. If the target structure ismore » RNA, the helical fragments are further extended with recurrent RNA motifs from a fragment library that contains single-stranded segments. Finally, the matched motifs are merged and refined in real space to find the most likely conformations, including a fit of the sequence to the electron-density map. The Brickworx program is available for download and as a web server at http://iimcb.genesilico.pl/brickworx.« less
Structure and Function of the N-Terminal Domain of the Vesicular Stomatitis Virus RNA Polymerase

PubMed Central

Qiu, Shihong; Ogino, Minako; Luo, Ming

2015-01-01

ABSTRACT Viruses have various mechanisms to duplicate their genomes and produce virus-specific mRNAs. Negative-strand RNA viruses encode their own polymerases to perform each of these processes. For the nonsegmented negative-strand RNA viruses, the polymerase is comprised of the large polymerase subunit (L) and the phosphoprotein (P). L proteins from members of the Rhabdoviridae, Paramyxoviridae, and Filoviridae share sequence and predicted secondary structure homology. Here, we present the structure of the N-terminal domain (conserved region I) of the L protein from a rhabdovirus, vesicular stomatitis virus, at 1.8-Å resolution. The strictly and strongly conserved residues in this domain cluster in a single area of the protein. Serial mutation of these residues shows that many of the amino acids are essential for viral transcription but not for mRNA capping. Three-dimensional alignments show that this domain shares structural homology with polymerases from other viral families, including segmented negative-strand RNA and double-stranded RNA (dsRNA) viruses. IMPORTANCE Negative-strand RNA viruses include a diverse set of viral families that infect animals and plants, causing serious illness and economic impact. The members of this group of viruses share a set of functionally conserved proteins that are essential to their replication cycle. Among this set of proteins is the viral polymerase, which performs a unique set of reactions to produce genome- and subgenome-length RNA transcripts. In this article, we study the polymerase of vesicular stomatitis virus, a member of the rhabdoviruses, which has served in the past as a model to study negative-strand RNA virus replication. We have identified a site in the N-terminal domain of the polymerase that is essential to viral transcription and that shares sequence homology with members of the paramyxoviruses and the filoviruses. Newly identified sites such as that described here could prove to be useful targets in the design of new therapeutics against negative-strand RNA viruses. PMID:26512087
Tree decomposition based fast search of RNA structures including pseudoknots in genomes.

PubMed

Song, Yinglei; Liu, Chunmei; Malmberg, Russell; Pan, Fangfang; Cai, Liming

2005-01-01

Searching genomes for RNA secondary structure with computational methods has become an important approach to the annotation of non-coding RNAs. However, due to the lack of efficient algorithms for accurate RNA structure-sequence alignment, computer programs capable of fast and effectively searching genomes for RNA secondary structures have not been available. In this paper, a novel RNA structure profiling model is introduced based on the notion of a conformational graph to specify the consensus structure of an RNA family. Tree decomposition yields a small tree width t for such conformation graphs (e.g., t = 2 for stem loops and only a slight increase for pseudo-knots). Within this modelling framework, the optimal alignment of a sequence to the structure model corresponds to finding a maximum valued isomorphic subgraph and consequently can be accomplished through dynamic programming on the tree decomposition of the conformational graph in time O(k(t)N(2)), where k is a small parameter; and N is the size of the projiled RNA structure. Experiments show that the application of the alignment algorithm to search in genomes yields the same search accuracy as methods based on a Covariance model with a significant reduction in computation time. In particular; very accurate searches of tmRNAs in bacteria genomes and of telomerase RNAs in yeast genomes can be accomplished in days, as opposed to months required by other methods. The tree decomposition based searching tool is free upon request and can be downloaded at our site h t t p ://w.uga.edu/RNA-informatics/software/index.php.
Structural RNAs of known and unknown function identified in malaria parasites by comparative genomics and RNA analysis

PubMed Central

Chakrabarti, Kausik; Pearson, Michael; Grate, Leslie; Sterne-Weiler, Timothy; Deans, Jonathan; Donohue, John Paul; Ares, Manuel

2007-01-01

As the genomes of more eukaryotic pathogens are sequenced, understanding how molecular differences between parasite and host might be exploited to provide new therapies has become a major focus. Central to cell function are RNA-containing complexes involved in gene expression, such as the ribosome, the spliceosome, snoRNAs, RNase P, and telomerase, among others. In this article we identify by comparative genomics and validate by RNA analysis numerous previously unknown structural RNAs encoded by the Plasmodium falciparum genome, including the telomerase RNA, U3, 31 snoRNAs, as well as previously predicted spliceosomal snRNAs, SRP RNA, MRP RNA, and RNAse P RNA. Furthermore, we identify six new RNA coding genes of unknown function. To investigate the relationships of the RNA coding genes to other genomic features in related parasites, we developed a genome browser for P. falciparum (http://areslab.ucsc.edu/cgi-bin/hgGateway). Additional experiments provide evidence supporting the prediction that snoRNAs guide methylation of a specific position on U4 snRNA, as well as predicting an snRNA promoter element particular to Plasmodium sp. These findings should allow detailed structural comparisons between the RNA components of the gene expression machinery of the parasite and its vertebrate hosts. PMID:17901154
Insights into RNA binding by the anticancer drug cisplatin from the crystal structure of cisplatin-modified ribosome

PubMed Central

Melnikov, Sergey V.; Söll, Dieter; Steitz, Thomas A.

2016-01-01

Abstract Cisplatin is a widely prescribed anticancer drug, which triggers cell death by covalent binding to a broad range of biological molecules. Among cisplatin targets, cellular RNAs remain the most poorly characterized molecules. Although cisplatin was shown to inactivate essential RNAs, including ribosomal, spliceosomal and telomeric RNAs, cisplatin binding sites in most RNA molecules are unknown, and therefore it remains challenging to study how modifications of RNA by cisplatin contributes to its toxicity. Here we report a 2.6Å-resolution X-ray structure of cisplatin-modified 70S ribosome, which describes cisplatin binding to the ribosome and provides the first nearly atomic model of cisplatin–RNA complex. We observe nine cisplatin molecules bound to the ribosome and reveal consensus structural features of the cisplatin-binding sites. Two of the cisplatin molecules modify conserved functional centers of the ribosome—the mRNA-channel and the GTPase center. In the mRNA-channel, cisplatin intercalates between the ribosome and the messenger RNA, suggesting that the observed inhibition of protein synthesis by cisplatin is caused by impaired mRNA-translocation. Our structure provides an insight into RNA targeting and inhibition by cisplatin, which can help predict cisplatin-binding sites in other cellular RNAs and design studies to elucidate a link between RNA modifications by cisplatin and cisplatin toxicity. PMID:27079977

Ab initio reconstruction of transcriptomes of pluripotent and lineage committed cells reveals gene structures of thousands of lincRNAs

PubMed Central

Guttman, Mitchell; Garber, Manuel; Levin, Joshua Z.; Donaghey, Julie; Robinson, James; Adiconis, Xian; Fan, Lin; Koziol, Magdalena J.; Gnirke, Andreas; Nusbaum, Chad; Rinn, John L.; Lander, Eric S.; Regev, Aviv

2010-01-01

RNA-Seq provides an unbiased way to study a transcriptome, including both coding and non-coding genes. To date, most RNA-Seq studies have critically depended on existing annotations, and thus focused on expression levels and variation in known transcripts. Here, we present Scripture, a method to reconstruct the transcriptome of a mammalian cell using only RNA-Seq reads and the genome sequence. We apply it to mouse embryonic stem cells, neuronal precursor cells, and lung fibroblasts to accurately reconstruct the full-length gene structures for the vast majority of known expressed genes. We identify substantial variation in protein-coding genes, including thousands of novel 5′-start sites, 3′-ends, and internal coding exons. We then determine the gene structures of over a thousand lincRNA and antisense loci. Our results open the way to direct experimental manipulation of thousands of non-coding RNAs, and demonstrate the power of ab initio reconstruction to render a comprehensive picture of mammalian transcriptomes. PMID:20436462
Probing a 2-aminobenzimidazole library for binding to RNA internal loops via two-dimensional combinatorial screening.

PubMed

Velagapudi, Sai Pradeep; Pushechnikov, Alexei; Labuda, Lucas P; French, Jonathan M; Disney, Matthew D

2012-11-16

There are many potential RNA drug targets in bacterial, viral, and human transcriptomes. However, there are few small molecules that modulate RNA function. This is due, in part, to a lack of fundamental understanding about RNA-ligand interactions including the types of small molecules that bind to RNA structural elements and the RNA structural elements that bind to small molecules. In an effort to better understand RNA-ligand interactions, we diversified the 2-aminobenzimidazole core (2AB) and probed the resulting library for binding to a library of RNA internal loops. We chose the 2AB core for these studies because it is a privileged scaffold for binding RNA based on previous reports. These studies identified that N-methyl pyrrolidine, imidazole, and propylamine diversity elements at the R1 position increase binding to internal loops; variability at the R2 position is well tolerated. The preferred RNA loop space was also determined for five ligands using a statistical approach and identified trends that lead to selective recognition.
Structural basis for dsRNA recognition and interferon antagonism by Ebola VP35

DOE Office of Scientific and Technical Information (OSTI.GOV)

Leung, Daisy W.; Prins, Kathleen C.; Borek, Dominika M.

2010-03-12

Ebola viral protein 35 (VP35), encoded by the highly pathogenic Ebola virus, facilitates host immune evasion by antagonizing antiviral signaling pathways, including those initiated by RIG-I-like receptors. Here we report the crystal structure of the Ebola VP35 interferon inhibitory domain (IID) bound to short double-stranded RNA (dsRNA), which together with in vivo results reveals how VP35-dsRNA interactions contribute to immune evasion. Conserved basic residues in VP35 IID recognize the dsRNA backbone, whereas the dsRNA blunt ends are 'end-capped' by a pocket of hydrophobic residues that mimic RIG-I-like receptor recognition of blunt-end dsRNA. Residues critical for RNA binding are also importantmore » for interferon inhibition in vivo but not for viral polymerase cofactor function of VP35. These results suggest that simultaneous recognition of dsRNA backbone and blunt ends provides a mechanism by which Ebola VP35 antagonizes host dsRNA sensors and immune responses.« less
The application of cluster analysis in the intercomparison of loop structures in RNA.

PubMed

Huang, Hung-Chung; Nagaswamy, Uma; Fox, George E

2005-04-01

We have developed a computational approach for the comparison and classification of RNA loop structures. Hairpin or interior loops identified in atomic resolution RNA structures were intercompared by conformational matching. The root-mean-square deviation (RMSD) values between all pairs of RNA fragments of interest, even if from different molecules, are calculated. Subsequently, cluster analysis is performed on the resulting matrix of RMSD distances using the unweighted pair group method with arithmetic mean (UPGMA). The cluster analysis objectively reveals groups of folds that resemble one another. To demonstrate the utility of the approach, a comprehensive analysis of all the terminal hairpin tetraloops that have been observed in 15 RNA structures that have been determined by X-ray crystallography was undertaken. The method found major clusters corresponding to the well-known GNRA and UNCG types. In addition, two tetraloops with the unusual primary sequence UMAC (M is A or C) were successfully assigned to the GNRA cluster. Larger loop structures were also examined and the clustering results confirmed the occurrence of variations of the GNRA and UNCG tetraloops in these loops and provided a systematic means for locating them. Nineteen examples of larger loops that closely resemble either the GNRA or UNCG tetraloop were found in the large ribosomal RNAs. When the clustering approach was extended to include all structures in the SCOR database, novel relationships were detected including one between the ANYA motif and a less common folding of the GAAA tetraloop sequence.
The application of cluster analysis in the intercomparison of loop structures in RNA

PubMed Central

HUANG, HUNG-CHUNG; NAGASWAMY, UMA; FOX, GEORGE E.

2005-01-01

We have developed a computational approach for the comparison and classification of RNA loop structures. Hairpin or interior loops identified in atomic resolution RNA structures were intercompared by conformational matching. The root-mean-square deviation (RMSD) values between all pairs of RNA fragments of interest, even if from different molecules, are calculated. Subsequently, cluster analysis is performed on the resulting matrix of RMSD distances using the unweighted pair group method with arithmetic mean (UPGMA). The cluster analysis objectively reveals groups of folds that resemble one another. To demonstrate the utility of the approach, a comprehensive analysis of all the terminal hairpin tetraloops that have been observed in 15 RNA structures that have been determined by X-ray crystallography was undertaken. The method found major clusters corresponding to the well-known GNRA and UNCG types. In addition, two tetraloops with the unusual primary sequence UMAC (M is A or C) were successfully assigned to the GNRA cluster. Larger loop structures were also examined and the clustering results confirmed the occurrence of variations of the GNRA and UNCG tetraloops in these loops and provided a systematic means for locating them. Nineteen examples of larger loops that closely resemble either the GNRA or UNCG tetraloop were found in the large ribosomal RNAs. When the clustering approach was extended to include all structures in the SCOR database, novel relationships were detected including one between the ANYA motif and a less common folding of the GAAA tetraloop sequence. PMID:15769871
DOE Office of Scientific and Technical Information (OSTI.GOV)

Januszyk, Kurt; Liu, Quansheng; Lima, Christopher D.

The eukaryotic RNA exosome is a highly conserved multi-subunit complex that catalyzes degradation and processing of coding and noncoding RNA. A noncatalytic nine-subunit exosome core interacts with Rrp44 and Rrp6, two subunits that possess processive and distributive 3'-to-5' exoribonuclease activity, respectively. While both Rrp6 and Rrp44 are responsible for RNA processing in budding yeast, Rrp6 may play a more prominent role in processing, as it has been demonstrated to be inhibited by stable RNA secondary structure in vitro and because the null allele in budding yeast leads to the buildup of specific structured RNA substrates. Human RRP6, otherwise known asmore » PM/SCL-100 or EXOSC10, shares sequence similarity to budding yeast Rrp6 and is proposed to catalyze 3'-to-5' exoribonuclease activity on a variety of nuclear transcripts including ribosomal RNA subunits, RNA that has been poly-adenylated by TRAMP, as well as other nuclear RNA transcripts destined for processing and/or destruction. To characterize human RRP6, we expressed the full-length enzyme as well as truncation mutants that retain catalytic activity, compared their activities to analogous constructs for Saccharomyces cerevisiae Rrp6, and determined the X-ray structure of a human construct containing the exoribonuclease and HRDC domains that retains catalytic activity. Structural data show that the human active site is more exposed when compared to the yeast structure, and biochemical data suggest that this feature may play a role in the ability of human RRP6 to productively engage and degrade structured RNA substrates more effectively than the analogous budding yeast enzyme.« less
Initiation of translation in bacteria by a structured eukaryotic IRES RNA.

PubMed

Colussi, Timothy M; Costantino, David A; Zhu, Jianyu; Donohue, John Paul; Korostelev, Andrei A; Jaafar, Zane A; Plank, Terra-Dawn M; Noller, Harry F; Kieft, Jeffrey S

2015-03-05

The central dogma of gene expression (DNA to RNA to protein) is universal, but in different domains of life there are fundamental mechanistic differences within this pathway. For example, the canonical molecular signals used to initiate protein synthesis in bacteria and eukaryotes are mutually exclusive. However, the core structures and conformational dynamics of ribosomes that are responsible for the translation steps that take place after initiation are ancient and conserved across the domains of life. We wanted to explore whether an undiscovered RNA-based signal might be able to use these conserved features, bypassing mechanisms specific to each domain of life, and initiate protein synthesis in both bacteria and eukaryotes. Although structured internal ribosome entry site (IRES) RNAs can manipulate ribosomes to initiate translation in eukaryotic cells, an analogous RNA structure-based mechanism has not been observed in bacteria. Here we report our discovery that a eukaryotic viral IRES can initiate translation in live bacteria. We solved the crystal structure of this IRES bound to a bacterial ribosome to 3.8 Å resolution, revealing that despite differences between bacterial and eukaryotic ribosomes this IRES binds directly to both and occupies the space normally used by transfer RNAs. Initiation in both bacteria and eukaryotes depends on the structure of the IRES RNA, but in bacteria this RNA uses a different mechanism that includes a form of ribosome repositioning after initial recruitment. This IRES RNA bridges billions of years of evolutionary divergence and provides an example of an RNA structure-based translation initiation signal capable of operating in two domains of life.
Selective 2′-hydroxyl acylation analyzed by primer extension and mutational profiling (SHAPE-MaP) for direct, versatile, and accurate RNA structure analysis

PubMed Central

Smola, Matthew J.; Rice, Greggory M.; Busan, Steven; Siegfried, Nathan A.; Weeks, Kevin M.

2016-01-01

SHAPE chemistries exploit small electrophilic reagents that react with the 2′-hydroxyl group to interrogate RNA structure at single-nucleotide resolution. Mutational profiling (MaP) identifies modified residues based on the ability of reverse transcriptase to misread a SHAPE-modified nucleotide and then counting the resulting mutations by massively parallel sequencing. The SHAPE-MaP approach measures the structure of large and transcriptome-wide systems as accurately as for simple model RNAs. This protocol describes the experimental steps, implemented over three days, required to perform SHAPE probing and construct multiplexed SHAPE-MaP libraries suitable for deep sequencing. These steps include RNA folding and SHAPE structure probing, mutational profiling by reverse transcription, library construction, and sequencing. Automated processing of MaP sequencing data is accomplished using two software packages. ShapeMapper converts raw sequencing files into mutational profiles, creates SHAPE reactivity plots, and provides useful troubleshooting information, often within an hour. SuperFold uses these data to model RNA secondary structures, identify regions with well-defined structures, and visualize probable and alternative helices, often in under a day. We illustrate these algorithms with the E. coli thiamine pyrophosphate riboswitch, E. coli 16S rRNA, and HIV-1 genomic RNAs. SHAPE-MaP can be used to make nucleotide-resolution biophysical measurements of individual RNA motifs, rare components of complex RNA ensembles, and entire transcriptomes. The straightforward MaP strategy greatly expands the number, length, and complexity of analyzable RNA structures. PMID:26426499
Structure and mechanism of the T-box riboswitches

PubMed Central

Zhang, Jinwei

2015-01-01

In most Gram-positive bacteria, including many clinically devastating pathogens from genera such as Bacillus, Clostridium, Listeria and Staphylococcus, T-box riboswitches sense and regulate intracellular availability of amino acids through a multipartite mRNA-tRNA interaction. The T-box mRNA leaders respond to nutrient starvation by specifically binding cognate tRNAs and sensing whether the bound tRNA is aminoacylated, as a proxy for amino acid availability. Based on this readout, T-boxes direct a transcriptional or translational switch to control the expression of downstream genes involved in various aspects of amino acid metabolism: biosynthesis, transport, aminoacylation, transamidation, etc. Two decades after its discovery, the structural and mechanistic underpinnings of the T-box riboswitch were recently elucidated, producing a wealth of insights into how two structured RNAs can recognize each other with robust affinity and exquisite selectivity. The T-box paradigm exemplifies how natural non-coding RNAs can interact not just through sequence complementarity, but can add molecular specificity by precisely juxtaposing RNA structural motifs, exploiting inherently flexible elements and the biophysical properties of post-transcriptional modifications, ultimately achieving a high degree of shape complementarity through mutually induced fit. The T-box also provides a proof-of-principle that compact RNA domains can recognize minute chemical changes (such as tRNA aminoacylation) on another RNA. The unveiling of the structure and mechanism of the T-box system thus expands our appreciation of the range of capabilities and modes of action of structured non-coding RNAs, and hints at the existence of networks of non-coding RNAs that communicate through both, structural and sequence specificity. PMID:25959893
Deciphering RNA-Recognition Patterns of Intrinsically Disordered Proteins.

PubMed

Srivastava, Ambuj; Ahmad, Shandar; Gromiha, M Michael

2018-05-29

Intrinsically disordered regions (IDRs) and protein (IDPs) are highly flexible owing to their lack of well-defined structures. A subset of such proteins interacts with various substrates; including RNA; frequently adopting regular structures in the final complex. In this work; we have analysed a dataset of protein⁻RNA complexes undergoing disorder-to-order transition (DOT) upon binding. We found that DOT regions are generally small in size (less than 3 residues) for RNA binding proteins. Like structured proteins; positively charged residues are found to interact with RNA molecules; indicating the dominance of electrostatic and cation-π interactions. However, a comparison of binding frequency shows that interface hydrophobic and aromatic residues have more interactions in only DOT regions than in a protein. Further; DOT regions have significantly higher exposure to water than their structured counterparts. Interactions of DOT regions with RNA increase the sheet formation with minor changes in helix forming residues. We have computed the interaction energy for amino acids⁻nucleotide pairs; which showed the preference of His⁻G; Asn⁻U and Ser⁻U at for the interface of DOT regions. This study provides insights to understand protein⁻RNA interactions and the results could also be used for developing a tool for identifying DOT regions in RNA binding proteins.
Global Organization of a Positive-strand RNA Virus Genome

PubMed Central

Wu, Baodong; Grigull, Jörg; Ore, Moriam O.; Morin, Sylvie; White, K. Andrew

2013-01-01

The genomes of plus-strand RNA viruses contain many regulatory sequences and structures that direct different viral processes. The traditional view of these RNA elements are as local structures present in non-coding regions. However, this view is changing due to the discovery of regulatory elements in coding regions and functional long-range intra-genomic base pairing interactions. The ∼4.8 kb long RNA genome of the tombusvirus tomato bushy stunt virus (TBSV) contains these types of structural features, including six different functional long-distance interactions. We hypothesized that to achieve these multiple interactions this viral genome must utilize a large-scale organizational strategy and, accordingly, we sought to assess the global conformation of the entire TBSV genome. Atomic force micrographs of the genome indicated a mostly condensed structure composed of interconnected protrusions extending from a central hub. This configuration was consistent with the genomic secondary structure model generated using high-throughput selective 2′-hydroxyl acylation analysed by primer extension (i.e. SHAPE), which predicted different sized RNA domains originating from a central region. Known RNA elements were identified in both domain and inter-domain regions, and novel structural features were predicted and functionally confirmed. Interestingly, only two of the six long-range interactions known to form were present in the structural model. However, for those interactions that did not form, complementary partner sequences were positioned relatively close to each other in the structure, suggesting that the secondary structure level of viral genome structure could provide a basic scaffold for the formation of different long-range interactions. The higher-order structural model for the TBSV RNA genome provides a snapshot of the complex framework that allows multiple functional components to operate in concert within a confined context. PMID:23717202
Latest development on RNA-based drugs and vaccines.

PubMed

Lundstrom, Kenneth

2018-06-01

Drugs and vaccines based on mRNA and RNA viruses show great potential and direct translation in the cytoplasm eliminates chromosomal integration. Limitations are associated with delivery and stability issues related to RNA degradation. Clinical trials on RNA-based drugs have been conducted in various disease areas. Likewise, RNA-based vaccines for viral infections and various cancers have been subjected to preclinical and clinical studies. RNA delivery and stability improvements include RNA structure modifications, targeting dendritic cells and employing self-amplifying RNA. Single-stranded RNA viruses possess self-amplifying RNA, which can provide extreme RNA replication in the cytoplasm to support RNA-based drug and vaccine development. Although oligonucleotide-based approaches have demonstrated potential, the focus here is on mRNA- and RNA virus-based methods.
Structural organizations of yeast RNase P and RNase MRP holoenzymes as revealed by UV-crosslinking studies of RNA–protein interactions

PubMed Central

Khanova, Elena; Esakova, Olga; Perederina, Anna; Berezin, Igor; Krasilnikov, Andrey S.

2012-01-01

Eukaryotic ribonuclease (RNase) P and RNase MRP are closely related ribonucleoprotein complexes involved in the metabolism of various RNA molecules including tRNA, rRNA, and some mRNAs. While evolutionarily related to bacterial RNase P, eukaryotic enzymes of the RNase P/MRP family are much more complex. Saccharomyces cerevisiae RNase P consists of a catalytic RNA component and nine essential proteins; yeast RNase MRP has an RNA component resembling that in RNase P and 10 essential proteins, most of which are shared with RNase P. The structural organizations of eukaryotic RNases P/MRP are not clear. Here we present the results of RNA–protein UV crosslinking studies performed on RNase P and RNase MRP holoenzymes isolated from yeast. The results indicate locations of specific protein-binding sites in the RNA components of RNase P and RNase MRP and shed light on the structural organizations of these large ribonucleoprotein complexes. PMID:22332141
A Polyamide Inhibits Replication of Vesicular Stomatitis Virus by Targeting RNA in the Nucleocapsid

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gumpper, Ryan H.; Li, Weike; Castañeda, Carlos H.

Polyamides have been shown to bind double-stranded DNA by complementing the curvature of the minor groove and forming various hydrogen bonds with DNA. Several polyamide molecules have been found to have potent antiviral activities against papillomavirus, a double-stranded DNA virus. By analogy, we reason that polyamides may also interact with the structured RNA bound in the nucleocapsid of a negative-strand RNA virus. Vesicular stomatitis virus (VSV) was selected as a prototype virus to test this possibility since its genomic RNA encapsidated in the nucleocapsid forms a structure resembling one strand of an A-form RNA duplex. One polyamide molecule, UMSL1011, wasmore » found to inhibit infection of VSV. To confirm that the polyamide targeted the nucleocapsid, a nucleocapsid-like particle (NLP) was incubated with UMSL1011. The encapsidated RNA in the polyamide-treated NLP was protected from thermo-release and digestion by RNase A. UMSL1011 also inhibits viral RNA synthesis in the intracellular activity assay for the viral RNA-dependent RNA polymerase. The crystal structure revealed that UMSL1011 binds the structured RNA in the nucleocapsid. The conclusion of our studies is that the RNA in the nucleocapsid is a viable antiviral target of polyamides. Since the RNA structure in the nucleocapsid is similar in all negative-strand RNA viruses, polyamides may be optimized to target the specific RNA genome of a negative-strand RNA virus, such as respiratory syncytial virus and Ebola virus. IMPORTANCENegative-strand RNA viruses (NSVs) include several life-threatening pathogens, such as rabies virus, respiratory syncytial virus, and Ebola virus. There are no effective antiviral drugs against these viruses. Polyamides offer an exceptional opportunity because they may be optimized to target each NSV. Our studies on vesicular stomatitis virus, an NSV, demonstrated that a polyamide molecule could specifically target the viral RNA in the nucleocapsid and inhibit viral growth. The target specificity of the polyamide molecule was proved by its inhibition of thermo-release and RNA nuclease digestion of the RNA bound in a model nucleocapsid, and a crystal structure of the polyamide inside the nucleocapsid. This encouraging observation provided the proof-of-concept rationale for designing polyamides as antiviral drugs against NSVs.« less
A Polyamide Inhibits Replication of Vesicular Stomatitis Virus by Targeting RNA in the Nucleocapsid.

PubMed

Gumpper, Ryan H; Li, Weike; Castañeda, Carlos H; Scuderi, M José; Bashkin, James K; Luo, Ming

2018-04-15

Polyamides have been shown to bind double-stranded DNA by complementing the curvature of the minor groove and forming various hydrogen bonds with DNA. Several polyamide molecules have been found to have potent antiviral activities against papillomavirus, a double-stranded DNA virus. By analogy, we reason that polyamides may also interact with the structured RNA bound in the nucleocapsid of a negative-strand RNA virus. Vesicular stomatitis virus (VSV) was selected as a prototype virus to test this possibility since its genomic RNA encapsidated in the nucleocapsid forms a structure resembling one strand of an A-form RNA duplex. One polyamide molecule, UMSL1011, was found to inhibit infection of VSV. To confirm that the polyamide targeted the nucleocapsid, a nucleocapsid-like particle (NLP) was incubated with UMSL1011. The encapsidated RNA in the polyamide-treated NLP was protected from thermo-release and digestion by RNase A. UMSL1011 also inhibits viral RNA synthesis in the intracellular activity assay for the viral RNA-dependent RNA polymerase. The crystal structure revealed that UMSL1011 binds the structured RNA in the nucleocapsid. The conclusion of our studies is that the RNA in the nucleocapsid is a viable antiviral target of polyamides. Since the RNA structure in the nucleocapsid is similar in all negative-strand RNA viruses, polyamides may be optimized to target the specific RNA genome of a negative-strand RNA virus, such as respiratory syncytial virus and Ebola virus. IMPORTANCE Negative-strand RNA viruses (NSVs) include several life-threatening pathogens, such as rabies virus, respiratory syncytial virus, and Ebola virus. There are no effective antiviral drugs against these viruses. Polyamides offer an exceptional opportunity because they may be optimized to target each NSV. Our studies on vesicular stomatitis virus, an NSV, demonstrated that a polyamide molecule could specifically target the viral RNA in the nucleocapsid and inhibit viral growth. The target specificity of the polyamide molecule was proved by its inhibition of thermo-release and RNA nuclease digestion of the RNA bound in a model nucleocapsid, and a crystal structure of the polyamide inside the nucleocapsid. This encouraging observation provided the proof-of-concept rationale for designing polyamides as antiviral drugs against NSVs. Copyright © 2018 American Society for Microbiology.
Structural and Functional Analyses of the Severe Acute Respiratory Syndrome Coronavirus Endoribonuclease Nsp15

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bhardwaj, Kanchan; Palaninathan, Satheesh; Alcantara, Joanna Maria Ortiz

2008-03-31

The severe acute respiratory syndrome (SARS) coronavirus encodes several RNA-processing enzymes that are unusual for RNA viruses, including Nsp15 (nonstructural protein 15), a hexameric endoribonuclease that preferentially cleaves 3' of uridines. We solved the structure of a catalytically inactive mutant version of Nsp15, which was crystallized as a hexamer. The structure contains unreported flexibility in the active site of each subunit. Substitutions in the active site residues serine 293 and proline 343 allowed Nsp15 to cleave at cytidylate, whereas mutation of leucine 345 rendered Nsp15 able to cleave at purines as well as pyrimidines. Mutations that targeted the residues involvedmore » in subunit interactions generally resulted in the formation of catalytically inactive monomers. The RNA-binding residues were mapped by a method linking reversible cross-linking, RNA affinity purification, and peptide fingerprinting. Alanine substitution of several residues in the RNA-contacting portion of Nsp15 did not affect hexamer formation but decreased the affinity of RNA binding and reduced endonuclease activity. This suggests a model for Nsp15 hexamer interaction with RNA.« less
Innate immune restriction and antagonism of viral RNA lacking 2'-O methylation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hyde, Jennifer L.; Diamond, Michael S., E-mail: diamond@borcim.wustl.edu; Molecular Microbiology, Washington University School of Medicine, St Louis., MO 63110

N-7 and 2′-O methylation of host cell mRNA occurs in the nucleus and results in the generation of cap structures (cap 0, m{sup 7}GpppN; cap 1, m{sup 7}GpppNm) that control gene expression by modulating nuclear export, splicing, turnover, and protein synthesis. Remarkably, RNA cap modification also contributes to mammalian cell host defense as viral RNA lacking 2′-O methylation is sensed and inhibited by IFIT1, an interferon (IFN) stimulated gene (ISG). Accordingly, pathogenic viruses that replicate in the cytoplasm have evolved mechanisms to circumvent IFIT1 restriction and facilitate infection of mammalian cells. These include: (a) generating cap 1 structures on theirmore » RNA through cap-snatching or virally-encoded 2′-O methyltransferases, (b) using cap-independent means of translation, or (c) using RNA secondary structural motifs to antagonize IFIT1 binding. This review will discuss new insights as to how specific modifications at the 5′-end of viral RNA modulate host pathogen recognition responses to promote infection and disease.« less
Footprinting analysis of interactions between the largest eukaryotic RNase P/MRP protein Pop1 and RNase P/MRP RNA components.

PubMed

Fagerlund, Robert D; Perederina, Anna; Berezin, Igor; Krasilnikov, Andrey S

2015-09-01

Ribonuclease (RNase) P and RNase MRP are closely related catalytic ribonucleoproteins involved in the metabolism of a wide range of RNA molecules, including tRNA, rRNA, and some mRNAs. The catalytic RNA component of eukaryotic RNase P retains the core elements of the bacterial RNase P ribozyme; however, the peripheral RNA elements responsible for the stabilization of the global architecture are largely absent in the eukaryotic enzyme. At the same time, the protein makeup of eukaryotic RNase P is considerably more complex than that of the bacterial RNase P. RNase MRP, an essential and ubiquitous eukaryotic enzyme, has a structural organization resembling that of eukaryotic RNase P, and the two enzymes share most of their protein components. Here, we present the results of the analysis of interactions between the largest protein component of yeast RNases P/MRP, Pop1, and the RNA moieties of the enzymes, discuss structural implications of the results, and suggest that Pop1 plays the role of a scaffold for the stabilization of the global architecture of eukaryotic RNase P RNA, substituting for the network of RNA-RNA tertiary interactions that maintain the global RNA structure in bacterial RNase P. © 2015 Fagerlund et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Footprinting analysis of interactions between the largest eukaryotic RNase P/MRP protein Pop1 and RNase P/MRP RNA components

PubMed Central

Fagerlund, Robert D.; Perederina, Anna; Berezin, Igor; Krasilnikov, Andrey S.

2015-01-01

Ribonuclease (RNase) P and RNase MRP are closely related catalytic ribonucleoproteins involved in the metabolism of a wide range of RNA molecules, including tRNA, rRNA, and some mRNAs. The catalytic RNA component of eukaryotic RNase P retains the core elements of the bacterial RNase P ribozyme; however, the peripheral RNA elements responsible for the stabilization of the global architecture are largely absent in the eukaryotic enzyme. At the same time, the protein makeup of eukaryotic RNase P is considerably more complex than that of the bacterial RNase P. RNase MRP, an essential and ubiquitous eukaryotic enzyme, has a structural organization resembling that of eukaryotic RNase P, and the two enzymes share most of their protein components. Here, we present the results of the analysis of interactions between the largest protein component of yeast RNases P/MRP, Pop1, and the RNA moieties of the enzymes, discuss structural implications of the results, and suggest that Pop1 plays the role of a scaffold for the stabilization of the global architecture of eukaryotic RNase P RNA, substituting for the network of RNA–RNA tertiary interactions that maintain the global RNA structure in bacterial RNase P. PMID:26135751
Complete mitochondrial genome of the giant African snail, Achatina fulica (Mollusca: Achatinidae): a novel location of putative control regions (CR) in the mitogenome within Pulmonate species.

PubMed

He, Zhang-Ping; Dai, Xia-Bin; Zhang, Shuai; Zhi, Ting-Ting; Lun, Zhao-Rong; Wu, Zhong-Dao; Yang, Ting-Bao

2016-01-01

The whole sequence (15,057 bp) of the mitochondrial DNA (mtDNA) of the terrestrial snail Achatina fulica (order Stylommatophora) was determined. The mitogenome, as the typical metazoan mtDNA, contains 13 protein-coding genes (PCG), 2 ribosomal RNA genes (rRNA) and 22 transfer RNA genes (tRNA). The tRNA genes include two trnS without standard secondary structure. Interestingly, among the known mitogenomes of Pulmonata species, we firstly characterized an unassigned lengthy sequence (551 bp) between the cox1 and the trnV which may be the CR for the sake of its AT bases usage bias (65.70%) and potential hairpin structure.

Structure of human IFIT1 with capped RNA reveals adaptable mRNA binding and mechanisms for sensing N1 and N2 ribose 2′-O methylations

PubMed Central

Laudenbach, Beatrice Theres; Martínez-Montero, Saúl; Cencic, Regina; Habjan, Matthias; Pichlmair, Andreas; Damha, Masad J.; Pelletier, Jerry; Nagar, Bhushan

2017-01-01

IFIT1 (IFN-induced protein with tetratricopeptide repeats-1) is an effector of the host innate immune antiviral response that prevents propagation of virus infection by selectively inhibiting translation of viral mRNA. It relies on its ability to compete with the translation initiation factor eIF4F to specifically recognize foreign capped mRNAs, while remaining inactive against host mRNAs marked by ribose 2′-O methylation at the first cap-proximal nucleotide (N1). We report here several crystal structures of RNA-bound human IFIT1, including a 1.6-Å complex with capped RNA. IFIT1 forms a water-filled, positively charged RNA-binding tunnel with a separate hydrophobic extension that unexpectedly engages the cap in multiple conformations (syn and anti) giving rise to a relatively plastic and nonspecific mode of binding, in stark contrast to eIF4E. Cap-proximal nucleotides encircled by the tunnel provide affinity to compete with eIF4F while allowing IFIT1 to select against N1 methylated mRNA. Gel-shift binding assays confirm that N1 methylation interferes with IFIT1 binding, but in an RNA-dependent manner, whereas translation assays reveal that N1 methylation alone is not sufficient to prevent mRNA recognition at high IFIT1 concentrations. Structural and functional analysis show that 2′-O methylation at N2, another abundant mRNA modification, is also detrimental for RNA binding, thus revealing a potentially synergistic role for it in self- versus nonself-mRNA discernment. Finally, structure-guided mutational analysis confirms the importance of RNA binding for IFIT1 restriction of a human coronavirus mutant lacking viral N1 methylation. Our structural and biochemical analysis sheds new light on the molecular basis for IFIT1 translational inhibition of capped viral RNA. PMID:28251928
A divergent Pumilio repeat protein family for pre-rRNA processing and mRNA localization

DOE Office of Scientific and Technical Information (OSTI.GOV)

Qiu, Chen; McCann, Kathleen L.; Wine, Robert N.

Pumilio/feminization of XX and XO animals (fem)-3 mRNA-binding factor (PUF) proteins bind sequence specifically to mRNA targets using a single-stranded RNA-binding domain comprising eight Pumilio (PUM) repeats. PUM repeats have now been identified in proteins that function in pre-rRNA processing, including human Puf-A and yeast Puf6. This is a role not previously ascribed to PUF proteins. In this paper we present crystal structures of human Puf-A that reveal a class of nucleic acid-binding proteins with 11 PUM repeats arranged in an “L”-like shape. In contrast to classical PUF proteins, Puf-A forms sequence-independent interactions with DNA or RNA, mediated by conservedmore » basic residues. We demonstrate that equivalent basic residues in yeast Puf6 are important for RNA binding, pre-rRNA processing, and mRNA localization. Finally, PUM repeats can be assembled into alternative folds that bind to structured nucleic acids in addition to forming canonical eight-repeat crescent-shaped RNA-binding domains found in classical PUF proteins.« less
A divergent Pumilio repeat protein family for pre-rRNA processing and mRNA localization

DOE PAGES

Qiu, Chen; McCann, Kathleen L.; Wine, Robert N.; ...

2014-12-15

Pumilio/feminization of XX and XO animals (fem)-3 mRNA-binding factor (PUF) proteins bind sequence specifically to mRNA targets using a single-stranded RNA-binding domain comprising eight Pumilio (PUM) repeats. PUM repeats have now been identified in proteins that function in pre-rRNA processing, including human Puf-A and yeast Puf6. This is a role not previously ascribed to PUF proteins. In this paper we present crystal structures of human Puf-A that reveal a class of nucleic acid-binding proteins with 11 PUM repeats arranged in an “L”-like shape. In contrast to classical PUF proteins, Puf-A forms sequence-independent interactions with DNA or RNA, mediated by conservedmore » basic residues. We demonstrate that equivalent basic residues in yeast Puf6 are important for RNA binding, pre-rRNA processing, and mRNA localization. Finally, PUM repeats can be assembled into alternative folds that bind to structured nucleic acids in addition to forming canonical eight-repeat crescent-shaped RNA-binding domains found in classical PUF proteins.« less
A Grammatical Approach to RNA-RNA Interaction Prediction

NASA Astrophysics Data System (ADS)

Kato, Yuki; Akutsu, Tatsuya; Seki, Hiroyuki

2007-11-01

Much attention has been paid to two interacting RNA molecules involved in post-transcriptional control of gene expression. Although there have been a few studies on RNA-RNA interaction prediction based on dynamic programming algorithm, no grammar-based approach has been proposed. The purpose of this paper is to provide a new modeling for RNA-RNA interaction based on multiple context-free grammar (MCFG). We present a polynomial time parsing algorithm for finding the most likely derivation tree for the stochastic version of MCFG, which is applicable to RNA joint secondary structure prediction including kissing hairpin loops. Also, elementary tests on RNA-RNA interaction prediction have shown that the proposed method is comparable to Alkan et al.'s method.
Structural basis of DNA folding and recognition in an AMP-DNA aptamer complex: distinct architectures but common recognition motifs for DNA and RNA aptamers complexed to AMP.

PubMed

Lin, C H; Patel, D J

1997-11-01

Structural studies by nuclear magnetic resonance (NMR) of RNA and DNA aptamer complexes identified through in vitro selection and amplification have provided a wealth of information on RNA and DNA tertiary structure and molecular recognition in solution. The RNA and DNA aptamers that target ATP (and AMP) with micromolar affinity exhibit distinct binding site sequences and secondary structures. We report below on the tertiary structure of the AMP-DNA aptamer complex in solution and compare it with the previously reported tertiary structure of the AMP-RNA aptamer complex in solution. The solution structure of the AMP-DNA aptamer complex shows, surprisingly, that two AMP molecules are intercalated at adjacent sites within a rectangular widened minor groove. Complex formation involves adaptive binding where the asymmetric internal bubble of the free DNA aptamer zippers up through formation of a continuous six-base mismatch segment which includes a pair of adjacent three-base platforms. The AMP molecules pair through their Watson-Crick edges with the minor groove edges of guanine residues. These recognition G.A mismatches are flanked by sheared G.A and reversed Hoogsteen G.G mismatch pairs. The AMP-DNA aptamer and AMP-RNA aptamer complexes have distinct tertiary structures and binding stoichiometries. Nevertheless, both complexes have similar structural features and recognition alignments in their binding pockets. Specifically, AMP targets both DNA and RNA aptamers by intercalating between purine bases and through identical G.A mismatch formation. The recognition G.A mismatch stacks with a reversed Hoogsteen G.G mismatch in one direction and with an adenine base in the other direction in both complexes. It is striking that DNA and RNA aptamers selected independently from libraries of 10(14) molecules in each case utilize identical mismatch alignments for molecular recognition with micromolar affinity within binding-site pockets containing common structural elements.
Structure, dynamics and RNA binding of the multi-domain splicing factor TIA-1

PubMed Central

Wang, Iren; Hennig, Janosch; Jagtap, Pravin Kumar Ankush; Sonntag, Miriam; Valcárcel, Juan; Sattler, Michael

2014-01-01

Alternative pre-messenger ribonucleic acid (pre-mRNA) splicing is an essential process in eukaryotic gene regulation. The T-cell intracellular antigen-1 (TIA-1) is an apoptosis-promoting factor that modulates alternative splicing of transcripts, including the pre-mRNA encoding the membrane receptor Fas. TIA-1 is a multi-domain ribonucleic acid (RNA) binding protein that recognizes poly-uridine tract RNA sequences to facilitate 5′ splice site recognition by the U1 small nuclear ribonucleoprotein (snRNP). Here, we characterize the RNA interaction and conformational dynamics of TIA-1 by nuclear magnetic resonance (NMR), isothermal titration calorimetry (ITC) and small angle X-ray scattering (SAXS). Our NMR-derived solution structure of TIA-1 RRM2–RRM3 (RRM2,3) reveals that RRM2 adopts a canonical RNA recognition motif (RRM) fold, while RRM3 is preceded by an non-canonical helix α0. NMR and SAXS data show that all three RRMs are largely independent structural modules in the absence of RNA, while RNA binding induces a compact arrangement. RRM2,3 binds to pyrimidine-rich FAS pre-mRNA or poly-uridine (U9) RNA with nanomolar affinities. RRM1 has little intrinsic RNA binding affinity and does not strongly contribute to RNA binding in the context of RRM1,2,3. Our data unravel the role of binding avidity and the contributions of the TIA-1 RRMs for recognition of pyrimidine-rich RNAs. PMID:24682828
Inforna 2.0: A Platform for the Sequence-Based Design of Small Molecules Targeting Structured RNAs.

PubMed

Disney, Matthew D; Winkelsas, Audrey M; Velagapudi, Sai Pradeep; Southern, Mark; Fallahi, Mohammad; Childs-Disney, Jessica L

2016-06-17

The development of small molecules that target RNA is challenging yet, if successful, could advance the development of chemical probes to study RNA function or precision therapeutics to treat RNA-mediated disease. Previously, we described Inforna, an approach that can mine motifs (secondary structures) within target RNAs, which is deduced from the RNA sequence, and compare them to a database of known RNA motif-small molecule binding partners. Output generated by Inforna includes the motif found in both the database and the desired RNA target, lead small molecules for that target, and other related meta-data. Lead small molecules can then be tested for binding and affecting cellular (dys)function. Herein, we describe Inforna 2.0, which incorporates all known RNA motif-small molecule binding partners reported in the scientific literature, a chemical similarity searching feature, and an improved user interface and is freely available via an online web server. By incorporation of interactions identified by other laboratories, the database has been doubled, containing 1936 RNA motif-small molecule interactions, including 244 unique small molecules and 1331 motifs. Interestingly, chemotype analysis of the compounds that bind RNA in the database reveals features in small molecule chemotypes that are privileged for binding. Further, this updated database expanded the number of cellular RNAs to which lead compounds can be identified.
New Era of Studying RNA Secondary Structure and Its Influence on Gene Regulation in Plants.

PubMed

Yang, Xiaofei; Yang, Minglei; Deng, Hongjing; Ding, Yiliang

2018-01-01

The dynamic structure of RNA plays a central role in post-transcriptional regulation of gene expression such as RNA maturation, degradation, and translation. With the rise of next-generation sequencing, the study of RNA structure has been transformed from in vitro low-throughput RNA structure probing methods to in vivo high-throughput RNA structure profiling. The development of these methods enables incremental studies on the function of RNA structure to be performed, revealing new insights of novel regulatory mechanisms of RNA structure in plants. Genome-wide scale RNA structure profiling allows us to investigate general RNA structural features over 10s of 1000s of mRNAs and to compare RNA structuromes between plant species. Here, we provide a comprehensive and up-to-date overview of: (i) RNA structure probing methods; (ii) the biological functions of RNA structure; (iii) genome-wide RNA structural features corresponding to their regulatory mechanisms; and (iv) RNA structurome evolution in plants.
Triplet repeat RNA structure and its role as pathogenic agent and therapeutic target

PubMed Central

Krzyzosiak, Wlodzimierz J.; Sobczak, Krzysztof; Wojciechowska, Marzena; Fiszer, Agnieszka; Mykowska, Agnieszka; Kozlowski, Piotr

2012-01-01

This review presents detailed information about the structure of triplet repeat RNA and addresses the simple sequence repeats of normal and expanded lengths in the context of the physiological and pathogenic roles played in human cells. First, we discuss the occurrence and frequency of various trinucleotide repeats in transcripts and classify them according to the propensity to form RNA structures of different architectures and stabilities. We show that repeats capable of forming hairpin structures are overrepresented in exons, which implies that they may have important functions. We further describe long triplet repeat RNA as a pathogenic agent by presenting human neurological diseases caused by triplet repeat expansions in which mutant RNA gains a toxic function. Prominent examples of these diseases include myotonic dystrophy type 1 and fragile X-associated tremor ataxia syndrome, which are triggered by mutant CUG and CGG repeats, respectively. In addition, we discuss RNA-mediated pathogenesis in polyglutamine disorders such as Huntington's disease and spinocerebellar ataxia type 3, in which expanded CAG repeats may act as an auxiliary toxic agent. Finally, triplet repeat RNA is presented as a therapeutic target. We describe various concepts and approaches aimed at the selective inhibition of mutant transcript activity in experimental therapies developed for repeat-associated diseases. PMID:21908410
Probing a 2-Aminobenzimidazole Library for Binding to RNA Internal Loops via Two-Dimensional Combinatorial Screening

PubMed Central

Velegapudi, Sai Pradeep; Pushechnikov, Alexei; Labuda, Lucas P.; French, Jonathan M.; Disney, Matthew D.

2012-01-01

There are many potential RNA drug targets in bacterial, viral, and the human transcriptomes. However, there are few small molecules that modulate RNA function. This is due, in part, to a lack of fundamental understanding about RNA-ligand interactions including the types of small molecules that bind to RNA structural elements and the RNA structural elements that bind to small molecules. In an effort to better understand RNA-ligand interactions, we diversified the 2-aminobenzimidazole core (2AB) and probed the resulting library for binding to a library of RNA internal loops. We chose the 2AB core for these studies because it is a privileged scaffold for binding RNA based on previous reports. These studies identified that N-methyl pyrrolidine, imidazole, and propylamine diversity elements at the R1 position increase binding to internal loops; variability at the R2 position is well tolerated. The preferred RNA loop space was also determined for five ligands using a statistical approach and identified trends that lead to selective recognition. PMID:22958065
Nuclear RNA Exosome at 3.1 Å Reveals Substrate Specificities, RNA Paths, and Allosteric Inhibition of Rrp44/Dis3.

PubMed

Zinder, John C; Wasmuth, Elizabeth V; Lima, Christopher D

2016-11-17

The eukaryotic RNA exosome is an essential and conserved 3'-to-5' exoribonuclease complex that degrades or processes nearly every class of cellular RNA. The nuclear RNA exosome includes a 9-subunit non-catalytic core that binds Rrp44 (Dis3) and Rrp6 subunits to modulate their processive and distributive 3'-to-5' exoribonuclease activities, respectively. Here we utilize an engineered RNA with two 3' ends to obtain a crystal structure of an 11-subunit nuclear exosome bound to RNA at 3.1 Å. The structure reveals an extended RNA path to Rrp6 that penetrates into the non-catalytic core; contacts between the non-catalytic core and Rrp44, which inhibit exoribonuclease activity; and features of the Rrp44 exoribonuclease site that support its ability to degrade 3' phosphate RNA substrates. Using reconstituted exosome complexes, we show that 3' phosphate RNA is not a substrate for Rrp6 but is readily degraded by Rrp44 in the nuclear exosome. Copyright © 2016 Elsevier Inc. All rights reserved.
When your cap matters: structural insights into self vs non-self recognition of 5' RNA by immunomodulatory host proteins

DOE Office of Scientific and Technical Information (OSTI.GOV)

Leung, Daisy W.; Amarasinghe, Gaya K.

Cytosolic recognition of viral RNA is important for host innate immune responses. Differential recognition of self vs non-self RNA is a considerable challenge as the inability to differentiate may trigger aberrant immune responses. Recent work identified the composition of the RNA 5', including the 5' cap and its methylation state, as an important determinant of recognition by the host. Recent studies have advanced our understanding of the modified 5' RNA recognition and viral antagonism of RNA receptors. Here, we will discuss RIG-I and IFIT proteins as examples of host proteins that detect dsRNA and ssRNA, respectively.
The organization and contribution of helicases to RNA splicing.

PubMed

De, Inessa; Schmitzová, Jana; Pena, Vladimir

2016-01-01

Splicing is an essential step of gene expression. It occurs in two consecutive chemical reactions catalyzed by a large protein-RNA complex named the spliceosome. Assembled on the pre-mRNA substrate from five small nuclear proteins, the spliceosome acts as a protein-controlled ribozyme to catalyze the two reactions and finally dissociates into its components, which are re-used for a new round of splicing. Upon following this cyclic pathway, the spliceosome undergoes numerous intermediate stages that differ in composition as well as in their internal RNA-RNA and RNA-protein contacts. The driving forces and control mechanisms of these remodeling processes are provided by specific molecular motors called RNA helicases. While eight spliceosomal helicases are present in all organisms, higher eukaryotes contain five additional ones potentially required to drive a more intricate splicing pathway and link it to an RNA metabolism of increasing complexity. Spliceosomal helicases exhibit a notable structural diversity in their accessory domains and overall architecture, in accordance with the diversity of their task-specific functions. This review summarizes structure-function knowledge about all spliceosomal helicases, including the latter five, which traditionally are treated separately from the conserved ones. The implications of the structural characteristics of helicases for their functions, as well as for their structural communication within the multi-subunits environment of the spliceosome, are pointed out. © 2016 Wiley Periodicals, Inc.
Tudor staphylococcal nuclease is a structure-specific ribonuclease that degrades RNA at unstructured regions during microRNA decay.

PubMed

Li, Chia-Lung; Yang, Wei-Zen; Shi, Zhonghao; Yuan, Hanna S

2018-05-01

Tudor staphylococcal nuclease (TSN) is an evolutionarily conserved ribonuclease in eukaryotes that is composed of five staphylococcal nuclease-like domains (SN1-SN5) and a Tudor domain. TSN degrades hyper-edited double-stranded RNA, including primary miRNA precursors containing multiple I•U and U•I pairs, and mature miRNA during miRNA decay. However, how TSN binds and degrades its RNA substrates remains unclear. Here, we show that the C. elegans TSN (cTSN) is a monomeric Ca 2+ -dependent ribonuclease, cleaving RNA chains at the 5'-side of the phosphodiester linkage to produce degraded fragments with 5'-hydroxyl and 3'-phosphate ends. cTSN degrades single-stranded RNA and double-stranded RNA containing mismatched base pairs, but is not restricted to those containing multiple I•U and U•I pairs. cTSN has at least two catalytic active sites located in the SN1 and SN3 domains, since mutations of the putative Ca 2+ -binding residues in these two domains strongly impaired its ribonuclease activity. We further show by small-angle X-ray scattering that rice osTSN has a flexible two-lobed structure with open to closed conformations, indicating that TSN may change its conformation upon RNA binding. We conclude that TSN is a structure-specific ribonuclease targeting not only single-stranded RNA, but also unstructured regions of double-stranded RNA. This study provides the molecular basis for how TSN cooperates with RNA editing to eliminate duplex RNA in cell defense, and how TSN selects and degrades RNA during microRNA decay. © 2018 Li et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Structure of the second RRM domain of Nrd1, a fission yeast MAPK target RNA binding protein, and implication for its RNA recognition and regulation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kobayashi, Ayaho; Kanaba, Teppei; Satoh, Ryosuke

Highlights: •Solution structure of the second RRM of Nrd1 was determined. •RNA binding site of the second RRM was estimated. •Regulatory mechanism of RNA binding by phosphorylation is discussed. -- Abstract: Negative regulator of differentiation 1 (Nrd1) is known as a negative regulator of sexual differentiation in fission yeast. Recently, it has been revealed that Nrd1 also regulates cytokinesis, in which physical separation of the cell is achieved by a contractile ring comprising many proteins including actin and myosin. Cdc4, a myosin II light chain, is known to be required for cytokinesis. Nrd1 binds and stabilizes Cdc4 mRNA, and therebymore » suppressing the cytokinesis defects of the cdc4 mutants. Interestingly, Pmk1 MAPK phosphorylates Nrd1, resulting in markedly reduced RNA binding activity. Furthermore, Nrd1 localizes to stress granules in response to various stresses, and Pmk1 phosphorylation enhances the localization. Nrd1 consists of four RRM domains, although the mechanism by which Pmk1 regulates the RNA binding activity of Nrd1 is unknown. In an effort to delineate the relationship between Nrd1 structure and function, we prepared each RNA binding domain of Nrd1 and examined RNA binding to chemically synthesized oligo RNA using NMR. The structure of the second RRM domain of Nrd1 was determined and the RNA binding site on the second RRM domain was mapped by NMR. A plausible mechanism pertaining to the regulation of RNA binding activity by phosphorylation is also discussed.« less
microRNAs Databases: Developmental Methodologies, Structural and Functional Annotations.

PubMed

Singh, Nagendra Kumar

2017-09-01

microRNA (miRNA) is an endogenous and evolutionary conserved non-coding RNA, involved in post-transcriptional process as gene repressor and mRNA cleavage through RNA-induced silencing complex (RISC) formation. In RISC, miRNA binds in complementary base pair with targeted mRNA along with Argonaut proteins complex, causes gene repression or endonucleolytic cleavage of mRNAs and results in many diseases and syndromes. After the discovery of miRNA lin-4 and let-7, subsequently large numbers of miRNAs were discovered by low-throughput and high-throughput experimental techniques along with computational process in various biological and metabolic processes. The miRNAs are important non-coding RNA for understanding the complex biological phenomena of organism because it controls the gene regulation. This paper reviews miRNA databases with structural and functional annotations developed by various researchers. These databases contain structural and functional information of animal, plant and virus miRNAs including miRNAs-associated diseases, stress resistance in plant, miRNAs take part in various biological processes, effect of miRNAs interaction on drugs and environment, effect of variance on miRNAs, miRNAs gene expression analysis, sequence of miRNAs, structure of miRNAs. This review focuses on the developmental methodology of miRNA databases such as computational tools and methods used for extraction of miRNAs annotation from different resources or through experiment. This study also discusses the efficiency of user interface design of every database along with current entry and annotations of miRNA (pathways, gene ontology, disease ontology, etc.). Here, an integrated schematic diagram of construction process for databases is also drawn along with tabular and graphical comparison of various types of entries in different databases. Aim of this paper is to present the importance of miRNAs-related resources at a single place.
Deformability in the cleavage site of primary microRNA is not sensed by the double-stranded RNA binding domains in the microprocessor component DGCR8.

PubMed

Quarles, Kaycee A; Chadalavada, Durga; Showalter, Scott A

2015-06-01

The prevalence of double-stranded RNA (dsRNA) in eukaryotic cells has only recently been appreciated. Of interest here, RNA silencing begins with dsRNA substrates that are bound by the dsRNA-binding domains (dsRBDs) of their processing proteins. Specifically, processing of microRNA (miRNA) in the nucleus minimally requires the enzyme Drosha and its dsRBD-containing cofactor protein, DGCR8. The smallest recombinant construct of DGCR8 that is sufficient for in vitro dsRNA binding, referred to as DGCR8-Core, consists of its two dsRBDs and a C-terminal tail. As dsRBDs rarely recognize the nucleotide sequence of dsRNA, it is reasonable to hypothesize that DGCR8 function is dependent on the recognition of specific structural features in the miRNA precursor. Previously, we demonstrated that noncanonical structural elements that promote RNA flexibility within the stem of miRNA precursors are necessary for efficient in vitro cleavage by reconstituted Microprocessor complexes. Here, we combine gel shift assays with in vitro processing assays to demonstrate that neither the N-terminal dsRBD of DGCR8 in isolation nor the DGCR8-Core construct is sensitive to the presence of noncanonical structural elements within the stem of miRNA precursors, or to single-stranded segments flanking the stem. Extending DGCR8-Core to include an N-terminal heme-binding region does not change our conclusions. Thus, our data suggest that although the DGCR8-Core region is necessary for dsRNA binding and recruitment to the Microprocessor, it is not sufficient to establish the previously observed connection between RNA flexibility and processing efficiency. © 2015 Wiley Periodicals, Inc.
[Analysis of the primary and secondary structure of the mitochondrial serine transfer RNA in seven species of Lutzomyia].

PubMed

Vivero, Rafael José; Contreras-Gutiérrez, Maria Angélica; Bejarano, Eduar Elías

2007-09-01

Lutzomyia sand flies are involved in the transmission of the parasite Leishmania spp. in America. The taxonomy of these vectors is traditionally based on morphological features of the adult stage, particularly the paired structures of the head and genitalia. Although these characters are useful to distinguish most species of Lutzomyia, morphological identification may be complicated by the similarities within subgenera and species group. To evaluate the utility of mitochondrial serine transfer RNA tRNA Ser for taxonomic identification of Lutzomyia. Seven sand fly species, each representing one of the 27 taxonomic subdivisions in genus Lutzomyia, were analyzed including L. trinidadensis (Oswaldoi group), L. (Psychodopygus) panamensis, L.(Micropygomyia) cayennensis cayennensis, L. dubitans (Migonei group), L. (Lutzomyia) gomezi, L. rangeliana (ungrouped) and L. evansi (Verrucarum group). The mitochondrial tRNA Ser gene, flanked by the cytochrome b and NAD dehydrogenase subunit one genes, was extracted, amplified and sequenced from each specimen. Secondary structure of the tRNA Ser was predicted by comparisons with previously described homologous structures from other dipteran species. The tRNA Ser gene ranged in size from 66 base pairs in L. gomezi to 69 base pairs in L. trinidadensis. Fourteen polymorphic sites, including four insertion-deletion events, were observed in the aligned 70 nucleotide positions. The majority of the substitutions were located in the dihydrouridine, ribothymidine-pseudouridine-cytosine and variable loops, as well as in the basal extreme of the anticodon arm. Changes of primary sequence of the tRNASer provided useful molecular characters for taxonomic identification of the sand fly species under consideration.
Structural basis for specific recognition of multiple mRNA targets by a PUF regulatory protein

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wang, Yeming; Opperman, Laura; Wickens, Marvin

2011-11-02

Caenorhabditis elegans fem-3 binding factor (FBF) is a founding member of the PUMILIO/FBF (PUF) family of mRNA regulatory proteins. It regulates multiple mRNAs critical for stem cell maintenance and germline development. Here, we report crystal structures of FBF in complex with 6 different 9-nt RNA sequences, including elements from 4 natural mRNAs. These structures reveal that FBF binds to conserved bases at positions 1-3 and 7-8. The key specificity determinant of FBF vs. other PUF proteins lies in positions 4-6. In FBF/RNA complexes, these bases stack directly with one another and turn away from the RNA-binding surface. A short regionmore » of FBF is sufficient to impart its unique specificity and lies directly opposite the flipped bases. We suggest that this region imposes a flattened curvature on the protein; hence, the requirement for the additional nucleotide. The principles of FBF/RNA recognition suggest a general mechanism by which PUF proteins recognize distinct families of RNAs yet exploit very nearly identical atomic contacts in doing so.« less
Structural basis for specific recognition of multiple mRNA targets by a PUF regulatory protein

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wang, Yeming; Opperman, Laura; Wickens, Marvin

2010-08-19

Caenorhabditis elegans fem-3 binding factor (FBF) is a founding member of the PUMILIO/FBF (PUF) family of mRNA regulatory proteins. It regulates multiple mRNAs critical for stem cell maintenance and germline development. Here, we report crystal structures of FBF in complex with 6 different 9-nt RNA sequences, including elements from 4 natural mRNAs. These structures reveal that FBF binds to conserved bases at positions 1-3 and 7-8. The key specificity determinant of FBF vs. other PUF proteins lies in positions 4-6. In FBF/RNA complexes, these bases stack directly with one another and turn away from the RNA-binding surface. A short regionmore » of FBF is sufficient to impart its unique specificity and lies directly opposite the flipped bases. We suggest that this region imposes a flattened curvature on the protein; hence, the requirement for the additional nucleotide. The principles of FBF/RNA recognition suggest a general mechanism by which PUF proteins recognize distinct families of RNAs yet exploit very nearly identical atomic contacts in doing so.« less

3′ Cap-Independent Translation Enhancers of Plant Viruses

PubMed Central

Simon, Anne E.; Miller, W. Allen

2014-01-01

In the absence of a 5′ cap, plant positive-strand RNA viruses have evolved a number of different elements in their 3′ untranslated region (UTR) to attract initiation factors and/or ribosomes to their templates. These 3′ cap-independent translational enhancers (3′ CITEs) take different forms, such as I-shaped, Y-shaped, T-shaped, or pseudoknotted structures, or radiate multiple helices from a central hub. Common features of most 3′ CITEs include the ability to bind a component of the translation initiation factor eIF4F complex and to engage in an RNA-RNA kissing-loop interaction with a hairpin loop located at the 5′ end of the RNA. The two T-shaped structures can bind to ribosomes and ribosomal subunits, with one structure also able to engage in a simultaneous long-distance RNA-RNA interaction. Several of these 3′ CITEs are interchangeable and there is evidence that natural recombination allows exchange of modular CITE units, which may overcome genetic resistance or extend the virus’s host range. PMID:23682606
High-resolution reversible folding of hyperstable RNA tetraloops using molecular dynamics simulations

PubMed Central

Chen, Alan A.; García, Angel E.

2013-01-01

We report the de novo folding of three hyperstable RNA tetraloops to 1–3 Å rmsd from their experimentally determined structures using molecular dynamics simulations initialized in the unfolded state. RNA tetraloops with loop sequences UUCG, GCAA, or CUUG are hyperstable because of the formation of noncanonical loop-stabilizing interactions, and they are all faithfully reproduced to angstrom-level accuracy in replica exchange molecular dynamics simulations, including explicit solvent and ion molecules. This accuracy is accomplished using unique RNA parameters, in which biases that favor rigid, highly stacked conformations are corrected to accurately capture the inherent flexibility of ssRNA loops, accurate base stacking energetics, and purine syn-anti interconversions. In a departure from traditional quantum chemistrycentric approaches to force field optimization, our parameters are calibrated directly from thermodynamic and kinetic measurements of intra- and internucleotide structural transitions. The ability to recapitulate the signature noncanonical interactions of the three most abundant hyperstable stem loop motifs represents a significant milestone to the accurate prediction of RNA tertiary structure using unbiased all-atom molecular dynamics simulations. PMID:24043821
Non-Structural Proteins of Arthropod-Borne Bunyaviruses: Roles and Functions

PubMed Central

Eifan, Saleh; Schnettler, Esther; Dietrich, Isabelle; Kohl, Alain; Blomström, Anne-Lie

2013-01-01

Viruses within the Bunyaviridae family are tri-segmented, negative-stranded RNA viruses. The family includes several emerging and re-emerging viruses of humans, animals and plants, such as Rift Valley fever virus, Crimean-Congo hemorrhagic fever virus, La Crosse virus, Schmallenberg virus and tomato spotted wilt virus. Many bunyaviruses are arthropod-borne, so-called arboviruses. Depending on the genus, bunyaviruses encode, in addition to the RNA-dependent RNA polymerase and the different structural proteins, one or several non-structural proteins. These non-structural proteins are not always essential for virus growth and replication but can play an important role in viral pathogenesis through their interaction with the host innate immune system. In this review, we will summarize current knowledge and understanding of insect-borne bunyavirus non-structural protein function(s) in vertebrate, plant and arthropod. PMID:24100888
TRFolder-W: a web server for telomerase RNA structure prediction in yeast genomes.

PubMed

Zhang, Dong; Xue, Xingran; Malmberg, Russell L; Cai, Liming

2012-10-15

TRFolder-W is a web server capable of predicting core structures of telomerase RNA (TR) in yeast genomes. TRFolder is a command-line Python toolkit for TR-specific structure prediction. We developed a web-version built on the django web framework, leveraging the work done previously, to include enhancements to increase flexibility of usage. To date, there are five core sub-structures commonly found in TR of fungal species, which are the template region, downstream pseudoknot, boundary element, core-closing stem and triple helix. The aim of TRFolder-W is to use the five core structures as fundamental units to predict potential TR genes for yeast, and to provide a user-friendly interface. Moreover, the application of TRFolder-W can be extended to predict the characteristic structure on species other than fungal species. The web server TRFolder-W is available at http://rna-informatics.uga.edu/?f=software&p=TRFolder-w.
Comparison of small molecules and oligonucleotides that target a toxic, non-coding RNA.

PubMed

Costales, Matthew G; Rzuczek, Suzanne G; Disney, Matthew D

2016-06-01

Potential RNA targets for chemical probes and therapeutic modalities are pervasive in the transcriptome. Oligonucleotide-based therapeutics are commonly used to target RNA sequence. Small molecules are emerging as a modality to target RNA structures selectively, but their development is still in its infancy. In this work, we compare the activity of oligonucleotides and several classes of small molecules that target the non-coding r(CCUG) repeat expansion (r(CCUG)(exp)) that causes myotonic dystrophy type 2 (DM2), an incurable disease that is the second-most common cause of adult onset muscular dystrophy. Small molecule types investigated include monomers, dimers, and multivalent compounds synthesized on-site by using RNA-templated click chemistry. Oligonucleotides investigated include phosphorothioates that cleave their target and vivo-morpholinos that modulate target RNA activity via binding. We show that compounds assembled on-site that recognize structure have the highest potencies amongst small molecules and are similar in potency to a vivo-morpholino modified oligonucleotide that targets sequence. These studies are likely to impact the design of therapeutic modalities targeting other repeats expansions that cause fragile X syndrome and amyotrophic lateral sclerosis, for example. Copyright © 2016. Published by Elsevier Ltd.
Methylated nucleosides in tRNA and tRNA methyltransferases

PubMed Central

Hori, Hiroyuki

2014-01-01

To date, more than 90 modified nucleosides have been found in tRNA and the biosynthetic pathways of the majority of tRNA modifications include a methylation step(s). Recent studies of the biosynthetic pathways have demonstrated that the availability of methyl group donors for the methylation in tRNA is important for correct and efficient protein synthesis. In this review, I focus on the methylated nucleosides and tRNA methyltransferases. The primary functions of tRNA methylations are linked to the different steps of protein synthesis, such as the stabilization of tRNA structure, reinforcement of the codon-anticodon interaction, regulation of wobble base pairing, and prevention of frameshift errors. However, beyond these basic functions, recent studies have demonstrated that tRNA methylations are also involved in the RNA quality control system and regulation of tRNA localization in the cell. In a thermophilic eubacterium, tRNA modifications and the modification enzymes form a network that responses to temperature changes. Furthermore, several modifications are involved in genetic diseases, infections, and the immune response. Moreover, structural, biochemical, and bioinformatics studies of tRNA methyltransferases have been clarifying the details of tRNA methyltransferases and have enabled these enzymes to be classified. In the final section, the evolution of modification enzymes is discussed. PMID:24904644
Recent advances in high-performance fluorescent and bioluminescent RNA imaging probes.

PubMed

Xia, Yuqiong; Zhang, Ruili; Wang, Zhongliang; Tian, Jie; Chen, Xiaoyuan

2017-05-22

RNA plays an important role in life processes. Imaging of messenger RNAs (mRNAs) and micro-RNAs (miRNAs) not only allows us to learn the formation and transcription of mRNAs and the biogenesis of miRNAs involved in various life processes, but also helps in detecting cancer. High-performance RNA imaging probes greatly expand our view of life processes and enhance the cancer detection accuracy. In this review, we summarize the state-of-the-art high-performance RNA imaging probes, including exogenous probes that can image RNA sequences with special modification and endogeneous probes that can directly image endogenous RNAs without special treatment. For each probe, we review its structure and imaging principle in detail. Finally, we summarize the application of mRNA and miRNA imaging probes in studying life processes as well as in detecting cancer. By correlating the structures and principles of various probes with their practical uses, we compare different RNA imaging probes and offer guidance for better utilization of the current imaging probes and the future design of higher-performance RNA imaging probes.
Fast prediction of RNA-RNA interaction using heuristic algorithm.

PubMed

Montaseri, Soheila

2015-01-01

Interaction between two RNA molecules plays a crucial role in many medical and biological processes such as gene expression regulation. In this process, an RNA molecule prohibits the translation of another RNA molecule by establishing stable interactions with it. Some algorithms have been formed to predict the structure of the RNA-RNA interaction. High computational time is a common challenge in most of the presented algorithms. In this context, a heuristic method is introduced to accurately predict the interaction between two RNAs based on minimum free energy (MFE). This algorithm uses a few dot matrices for finding the secondary structure of each RNA and binding sites between two RNAs. Furthermore, a parallel version of this method is presented. We describe the algorithm's concurrency and parallelism for a multicore chip. The proposed algorithm has been performed on some datasets including CopA-CopT, R1inv-R2inv, Tar-Tar*, DIS-DIS, and IncRNA54-RepZ in Escherichia coli bacteria. The method has high validity and efficiency, and it is run in low computational time in comparison to other approaches.
Targeting RNA in mammalian systems with small molecules.

PubMed

Donlic, Anita; Hargrove, Amanda E

2018-05-03

The recognition of RNA functions beyond canonical protein synthesis has challenged the central dogma of molecular biology. Indeed, RNA is now known to directly regulate many important cellular processes, including transcription, splicing, translation, and epigenetic modifications. The misregulation of these processes in disease has led to an appreciation of RNA as a therapeutic target. This potential was first recognized in bacteria and viruses, but discoveries of new RNA classes following the sequencing of the human genome have invigorated exploration of its disease-related functions in mammals. As stable structure formation is evolving as a hallmark of mammalian RNAs, the prospect of utilizing small molecules to specifically probe the function of RNA structural domains and their interactions is gaining increased recognition. To date, researchers have discovered bioactive small molecules that modulate phenotypes by binding to expanded repeats, microRNAs, G-quadruplex structures, and RNA splice sites in neurological disorders, cancers, and other diseases. The lessons learned from achieving these successes both call for additional studies and encourage exploration of the plethora of mammalian RNAs whose precise mechanisms of action remain to be elucidated. Efforts toward understanding fundamental principles of small molecule-RNA recognition combined with advances in methodology development should pave the way toward targeting emerging RNA classes such as long noncoding RNAs. Together, these endeavors can unlock the full potential of small molecule-based probing of RNA-regulated processes and enable us to discover new biology and underexplored avenues for therapeutic intervention in human disease. This article is categorized under: RNA Methods > RNA Analyses In Vitro and In Silico RNA Interactions with Proteins and Other Molecules > Small Molecule-RNA Interactions RNA in Disease and Development > RNA in Disease. © 2018 Wiley Periodicals, Inc.
Post-transcriptional control by bacteriophage T4: mRNA decay and inhibition of translation initiation

PubMed Central

2010-01-01

Over 50 years of biological research with bacteriophage T4 includes notable discoveries in post-transcriptional control, including the genetic code, mRNA, and tRNA; the very foundations of molecular biology. In this review we compile the past 10 - 15 year literature on RNA-protein interactions with T4 and some of its related phages, with particular focus on advances in mRNA decay and processing, and on translational repression. Binding of T4 proteins RegB, RegA, gp32 and gp43 to their cognate target RNAs has been characterized. For several of these, further study is needed for an atomic-level perspective, where resolved structures of RNA-protein complexes are awaiting investigation. Other features of post-transcriptional control are also summarized. These include: RNA structure at translation initiation regions that either inhibit or promote translation initiation; programmed translational bypassing, where T4 orchestrates ribosome bypass of a 50 nucleotide mRNA sequence; phage exclusion systems that involve T4-mediated activation of a latent endoribonuclease (PrrC) and cofactor-assisted activation of EF-Tu proteolysis (Gol-Lit); and potentially important findings on ADP-ribosylation (by Alt and Mod enzymes) of ribosome-associated proteins that might broadly impact protein synthesis in the infected cell. Many of these problems can continue to be addressed with T4, whereas the growing database of T4-related phage genome sequences provides new resources and potentially new phage-host systems to extend the work into a broader biological, evolutionary context. PMID:21129205
G-quadruplex in animal development: Contribution to gene expression and genomic heterogeneity.

PubMed

Armas, Pablo; Calcaterra, Nora Beatriz

2018-05-18

During animal development, gene expression is orchestrated by specific and highly evolutionarily conserved mechanisms that take place accurately, both at spatial and temporal levels. The last decades have provided compelling evidence showing that chromatin state plays essential roles in orchestrating most of the stages of development. The DNA molecule can adopt alternative structures different from the helical duplex architecture. G-rich DNA sequences can fold as intrastrand quadruple helix structures called G-quadruplexes or G4-DNA. G4 can also be formed in RNA molecules, such as mRNA, lncRNA and pre-miRNA. Emerging evidences suggest that G4s have crucial roles in a variety of biological processes, including transcription, recombination, replication, translation and chromosome stability. In this review, we have collected recent information gathered by various laboratories showing the important role of G4 DNA and RNA structures in several steps of animal development. Copyright © 2018 Elsevier B.V. All rights reserved.
The uncoupling of catalysis and translocation in the viral RNA-dependent RNA polymerase

PubMed Central

Shu, Bo; Gong, Peng

2017-01-01

ABSTRACT The nucleotide addition cycle of nucleic acid polymerases includes 2 major events: the pre-chemistry active site closure leading to the addition of one nucleotide to the product chain; the post-chemistry translocation step moving the polymerase active site one position downstream on its template. In viral RNA-dependent RNA polymerases (RdRPs), structural and biochemical evidences suggest that these 2 events are not tightly coupled, unlike the situation observed in A-family polymerases such as the bacteriophage T7 RNA polymerase. Recently, an RdRP translocation intermediate crystal structure of enterovirus 71 shed light on how translocation may be controlled by elements within RdRP catalytic motifs, and a series of poliovirus apo RdRP crystal structures explicitly suggest that a motif B loop may assist the movement of the template strand in late stages of transcription. Implications of RdRP catalysis-translocation uncoupling and the remaining challenges to further elucidate RdRP translocation mechanism are also discussed. PMID:28277928
Coordination of genomic structure and transcription by the main bacterial nucleoid-associated protein HU

PubMed Central

Berger, Michael; Farcas, Anca; Geertz, Marcel; Zhelyazkova, Petya; Brix, Klaudia; Travers, Andrew; Muskhelishvili, Georgi

2010-01-01

The histone-like protein HU is a highly abundant DNA architectural protein that is involved in compacting the DNA of the bacterial nucleoid and in regulating the main DNA transactions, including gene transcription. However, the coordination of the genomic structure and function by HU is poorly understood. Here, we address this question by comparing transcript patterns and spatial distributions of RNA polymerase in Escherichia coli wild-type and hupA/B mutant cells. We demonstrate that, in mutant cells, upregulated genes are preferentially clustered in a large chromosomal domain comprising the ribosomal RNA operons organized on both sides of OriC. Furthermore, we show that, in parallel to this transcription asymmetry, mutant cells are also impaired in forming the transcription foci—spatially confined aggregations of RNA polymerase molecules transcribing strong ribosomal RNA operons. Our data thus implicate HU in coordinating the global genomic structure and function by regulating the spatial distribution of RNA polymerase in the nucleoid. PMID:20010798
Modulation of RNA function by aminoglycoside antibiotics.

PubMed

Schroeder, R; Waldsich, C; Wank, H

2000-01-04

One of the most important families of antibiotics are the aminoglycosides, including drugs such as neomycin B, paromomycin, gentamicin and streptomycin. With the discovery of the catalytic potential of RNA, these antibiotics became very popular due to their RNA-binding capacity. They serve for the analysis of RNA function as well as for the study of RNA as a potential therapeutic target. Improvements in RNA structure determination recently provided first insights into the decoding site of the ribosome at high resolution and how aminoglycosides might induce misreading of the genetic code. In addition to inhibiting prokaryotic translation, aminoglycosides inhibit several catalytic RNAs such as self-splicing group I introns, RNase P and small ribozymes in vitro. Furthermore, these antibiotics interfere with human immunodeficiency virus (HIV) replication by disrupting essential RNA-protein contacts. Most exciting is the potential of many RNA-binding antibiotics to stimulate RNA activities, conceiving small-molecule partners for the hypothesis of an ancient RNA world. SELEX (systematic evolution of ligands by exponential enrichment) has been used in this evolutionary game leading to small synthetic RNAs, whose NMR structures gave valuable information on how aminoglycosides interact with RNA, which could possibly be used in applied science.
High-Throughput Genetic Identification of Functionally Important Regions of the Yeast DEAD-Box Protein Mss116p

DOE Office of Scientific and Technical Information (OSTI.GOV)

Mohr, Georg; Del Campo, Mark; Turner, Kathryn G.

The Saccharomyces cerevisiae DEAD-box protein Mss116p is a general RNA chaperone that functions in splicing mitochondrial group I and group II introns. Recent X-ray crystal structures of Mss116p in complex with ATP analogs and single-stranded RNA show that the helicase core induces a bend in the bound RNA, as in other DEAD-box proteins, while a C-terminal extension (CTE) induces a second bend, resulting in RNA crimping. Here, we illuminate these structures by using high-throughput genetic selections, unigenic evolution, and analyses of in vivo splicing activity to comprehensively identify functionally important regions and permissible amino acid substitutions throughout Mss116p. The functionallymore » important regions include those containing conserved sequence motifs involved in ATP and RNA binding or interdomain interactions, as well as previously unidentified regions, including surface loops that may function in protein-protein interactions. The genetic selections recapitulate major features of the conserved helicase motifs seen in other DEAD-box proteins but also show surprising variations, including multiple novel variants of motif III (SAT). Patterns of amino acid substitutions indicate that the RNA bend induced by the helicase core depends on ionic and hydrogen-bonding interactions with the bound RNA; identify a subset of critically interacting residues; and indicate that the bend induced by the CTE results primarily from a steric block. Finally, we identified two conserved regions - one the previously noted post II region in the helicase core and the other in the CTE - that may help displace or sequester the opposite RNA strand during RNA unwinding.« less
PRince: a web server for structural and physicochemical analysis of protein-RNA interface.

PubMed

Barik, Amita; Mishra, Abhishek; Bahadur, Ranjit Prasad

2012-07-01

We have developed a web server, PRince, which analyzes the structural features and physicochemical properties of the protein-RNA interface. Users need to submit a PDB file containing the atomic coordinates of both the protein and the RNA molecules in complex form (in '.pdb' format). They should also mention the chain identifiers of interacting protein and RNA molecules. The size of the protein-RNA interface is estimated by measuring the solvent accessible surface area buried in contact. For a given protein-RNA complex, PRince calculates structural, physicochemical and hydration properties of the interacting surfaces. All these parameters generated by the server are presented in a tabular format. The interacting surfaces can also be visualized with software plug-in like Jmol. In addition, the output files containing the list of the atomic coordinates of the interacting protein, RNA and interface water molecules can be downloaded. The parameters generated by PRince are novel, and users can correlate them with the experimentally determined biophysical and biochemical parameters for better understanding the specificity of the protein-RNA recognition process. This server will be continuously upgraded to include more parameters. PRince is publicly accessible and free for use. Available at http://www.facweb.iitkgp.ernet.in/~rbahadur/prince/home.html.
Analysis of secondary structural elements in human microRNA hairpin precursors.

PubMed

Liu, Biao; Childs-Disney, Jessica L; Znosko, Brent M; Wang, Dan; Fallahi, Mohammad; Gallo, Steven M; Disney, Matthew D

2016-03-01

MicroRNAs (miRNAs) regulate gene expression by targeting complementary mRNAs for destruction or translational repression. Aberrant expression of miRNAs has been associated with various diseases including cancer, thus making them interesting therapeutic targets. The composite of secondary structural elements that comprise miRNAs could aid the design of small molecules that modulate their function. We analyzed the secondary structural elements, or motifs, present in all human miRNA hairpin precursors and compared them to highly expressed human RNAs with known structures and other RNAs from various organisms. Amongst human miRNAs, there are 3808 are unique motifs, many residing in processing sites. Further, we identified motifs in miRNAs that are not present in other highly expressed human RNAs, desirable targets for small molecules. MiRNA motifs were incorporated into a searchable database that is freely available. We also analyzed the most frequently occurring bulges and internal loops for each RNA class and found that the smallest loops possible prevail. However, the distribution of loops and the preferred closing base pairs were unique to each class. Collectively, we have completed a broad survey of motifs found in human miRNA precursors, highly expressed human RNAs, and RNAs from other organisms. Interestingly, unique motifs were identified in human miRNA processing sites, binding to which could inhibit miRNA maturation and hence function.
First Mitochondrial Genome from Nemouridae (Plecoptera) Reveals Novel Features of the Elongated Control Region and Phylogenetic Implications

PubMed Central

Chen, Zhi-Teng; Du, Yu-Zhou

2017-01-01

The complete mitochondrial genome (mitogenome) of Nemoura nankinensis (Plecoptera: Nemouridae) was sequenced as the first reported mitogenome from the family Nemouridae. The N. nankinensis mitogenome was the longest (16,602 bp) among reported plecopteran mitogenomes, and it contains 37 genes including 13 protein-coding genes (PCGs), 22 transfer RNA (tRNA) genes and two ribosomal RNA (rRNA) genes. Most PCGs used standard ATN as start codons, and TAN as termination codons. All tRNA genes of N. nankinensis could fold into the cloverleaf secondary structures except for trnSer (AGN), whose dihydrouridine (DHU) arm was reduced to a small loop. There was also a large non-coding region (control region, CR) in the N. nankinensis mitogenome. The 1751 bp CR was the longest and had the highest A+T content (81.8%) among stoneflies. A large tandem repeat region, five potential stem-loop (SL) structures, four tRNA-like structures and four conserved sequence blocks (CSBs) were detected in the elongated CR. The presence of these tRNA-like structures in the CR has never been reported in other plecopteran mitogenomes. These novel features of the elongated CR in N. nankinensis may have functions associated with the process of replication and transcription. Finally, phylogenetic reconstruction suggested that Nemouridae was the sister-group of Capniidae. PMID:28475163
First Mitochondrial Genome from Nemouridae (Plecoptera) Reveals Novel Features of the Elongated Control Region and Phylogenetic Implications.

PubMed

Chen, Zhi-Teng; Du, Yu-Zhou

2017-05-05

The complete mitochondrial genome (mitogenome) of Nemoura nankinensis (Plecoptera: Nemouridae) was sequenced as the first reported mitogenome from the family Nemouridae. The N. nankinensis mitogenome was the longest (16,602 bp) among reported plecopteran mitogenomes, and it contains 37 genes including 13 protein-coding genes (PCGs), 22 transfer RNA (tRNA) genes and two ribosomal RNA (rRNA) genes. Most PCGs used standard ATN as start codons, and TAN as termination codons. All tRNA genes of N. nankinensis could fold into the cloverleaf secondary structures except for trnSer ( AGN ), whose dihydrouridine (DHU) arm was reduced to a small loop. There was also a large non-coding region (control region, CR) in the N. nankinensis mitogenome. The 1751 bp CR was the longest and had the highest A+T content (81.8%) among stoneflies. A large tandem repeat region, five potential stem-loop (SL) structures, four tRNA-like structures and four conserved sequence blocks (CSBs) were detected in the elongated CR. The presence of these tRNA-like structures in the CR has never been reported in other plecopteran mitogenomes. These novel features of the elongated CR in N. nankinensis may have functions associated with the process of replication and transcription. Finally, phylogenetic reconstruction suggested that Nemouridae was the sister-group of Capniidae.
Electrostatic interactions guide the active site face of a structure-specific ribonuclease to its RNA substrate.

PubMed

Plantinga, Matthew J; Korennykh, Alexei V; Piccirilli, Joseph A; Correll, Carl C

2008-08-26

Restrictocin, a member of the alpha-sarcin family of site-specific endoribonucleases, uses electrostatic interactions to bind to the ribosome and to RNA oligonucleotides, including the minimal specific substrate, the sarcin/ricin loop (SRL) of 23S-28S rRNA. Restrictocin binds to the SRL by forming a ground-state E:S complex that is stabilized predominantly by Coulomb interactions and depends on neither the sequence nor structure of the RNA, suggesting a nonspecific complex. The 22 cationic residues of restrictocin are dispersed throughout this protein surface, complicating a priori identification of a Coulomb interacting surface. Structural studies have identified an enzyme-substrate interface, which is expected to overlap with the electrostatic E:S interface. Here, we identified restrictocin residues that contribute to binding in the E:S complex by determining the salt dependence [partial differential log(k 2/ K 1/2)/ partial differential log[KCl

Post-transcriptional inducible gene regulation by natural antisense RNA.

PubMed

Nishizawa, Mikio; Ikeya, Yukinobu; Okumura, Tadayoshi; Kimura, Tominori

2015-01-01

Accumulating data indicate the existence of natural antisense transcripts (asRNAs), frequently transcribed from eukaryotic genes and do not encode proteins in many cases. However, their importance has been overlooked due to their heterogeneity, low expression level, and unknown function. Genes induced in responses to various stimuli are transcriptionally regulated by the activation of a gene promoter and post-transcriptionally regulated by controlling mRNA stability and translatability. A low-copy-number asRNA may post-transcriptionally regulate gene expression with cis-controlling elements on the mRNA. The asRNA itself may act as regulatory RNA in concert with trans-acting factors, including various RNA-binding proteins that bind to cis-controlling elements, microRNAs, and drugs. A novel mechanism that regulates mRNA stability includes the interaction of asRNA with mRNA by hybridization to loops in secondary structures. Furthermore, recent studies have shown that the functional network of mRNAs, asRNAs, and microRNAs finely tunes the levels of mRNA expression. The post-transcriptional mechanisms via these RNA-RNA interactions may play pivotal roles to regulate inducible gene expression and present the possibility of the involvement of asRNAs in various diseases.
Chemical and structural characterization of a model Post-Termination Complex (PoTC) for the ribosome recycling reaction: Evidence for the release of the mRNA by RRF and EF-G

PubMed Central

Iwakura, Nobuhiro; Yokoyama, Takeshi; Quaglia, Fabio; Mitsuoka, Kaoru; Mio, Kazuhiro; Shigematsu, Hideki; Shirouzu, Mikako; Kaji, Akira; Kaji, Hideko

2017-01-01

A model Post-Termination Complex (PoTC) used for the discovery of Ribosome Recycling Factor (RRF) was purified and characterized by cryo-electron microscopic analysis and biochemical methods. We established that the model PoTC has mostly one tRNA, at the P/E or P/P position, together with one mRNA. The structural studies were supported by the biochemical measurement of bound tRNA and mRNA. Using this substrate, we establish that the release of tRNA, release of mRNA and splitting of ribosomal subunits occur during the recycling reaction. Order of these events is tRNA release first followed by mRNA release and splitting almost simultaneously. Moreover, we demonstrate that IF3 is not involved in any of the recycling reactions but simply prevents the re-association of split ribosomal subunits. Our finding demonstrates that the important function of RRF includes the release of mRNA, which is often missed by the use of a short ORF with the Shine-Dalgarno sequence near the termination site. PMID:28542628
RNA-SSPT: RNA Secondary Structure Prediction Tools.

PubMed

Ahmad, Freed; Mahboob, Shahid; Gulzar, Tahsin; Din, Salah U; Hanif, Tanzeela; Ahmad, Hifza; Afzal, Muhammad

2013-01-01

The prediction of RNA structure is useful for understanding evolution for both in silico and in vitro studies. Physical methods like NMR studies to predict RNA secondary structure are expensive and difficult. Computational RNA secondary structure prediction is easier. Comparative sequence analysis provides the best solution. But secondary structure prediction of a single RNA sequence is challenging. RNA-SSPT is a tool that computationally predicts secondary structure of a single RNA sequence. Most of the RNA secondary structure prediction tools do not allow pseudoknots in the structure or are unable to locate them. Nussinov dynamic programming algorithm has been implemented in RNA-SSPT. The current studies shows only energetically most favorable secondary structure is required and the algorithm modification is also available that produces base pairs to lower the total free energy of the secondary structure. For visualization of RNA secondary structure, NAVIEW in C language is used and modified in C# for tool requirement. RNA-SSPT is built in C# using Dot Net 2.0 in Microsoft Visual Studio 2005 Professional edition. The accuracy of RNA-SSPT is tested in terms of Sensitivity and Positive Predicted Value. It is a tool which serves both secondary structure prediction and secondary structure visualization purposes.
RNA-SSPT: RNA Secondary Structure Prediction Tools

PubMed Central

Ahmad, Freed; Mahboob, Shahid; Gulzar, Tahsin; din, Salah U; Hanif, Tanzeela; Ahmad, Hifza; Afzal, Muhammad

2013-01-01

The prediction of RNA structure is useful for understanding evolution for both in silico and in vitro studies. Physical methods like NMR studies to predict RNA secondary structure are expensive and difficult. Computational RNA secondary structure prediction is easier. Comparative sequence analysis provides the best solution. But secondary structure prediction of a single RNA sequence is challenging. RNA-SSPT is a tool that computationally predicts secondary structure of a single RNA sequence. Most of the RNA secondary structure prediction tools do not allow pseudoknots in the structure or are unable to locate them. Nussinov dynamic programming algorithm has been implemented in RNA-SSPT. The current studies shows only energetically most favorable secondary structure is required and the algorithm modification is also available that produces base pairs to lower the total free energy of the secondary structure. For visualization of RNA secondary structure, NAVIEW in C language is used and modified in C# for tool requirement. RNA-SSPT is built in C# using Dot Net 2.0 in Microsoft Visual Studio 2005 Professional edition. The accuracy of RNA-SSPT is tested in terms of Sensitivity and Positive Predicted Value. It is a tool which serves both secondary structure prediction and secondary structure visualization purposes. PMID:24250115
Folding and unfolding single RNA molecules under tension

PubMed Central

Woodside, Michael T; García-García, Cuauhtémoc; Block, Steven M

2010-01-01

Single-molecule force spectroscopy constitutes a powerful method for probing RNA folding: it allows the kinetic, energetic, and structural properties of intermediate and transition states to be determined quantitatively, yielding new insights into folding pathways and energy landscapes. Recent advances in experimental and theoretical methods, including fluctuation theorems, kinetic theories, novel force clamps, and ultrastable instruments, have opened new avenues for study. These tools have been used to probe folding in simple model systems, for example, RNA and DNA hairpins. Knowledge gained from such systems is helping to build our understanding of more complex RNA structures composed of multiple elements, as well as how nucleic acids interact with proteins involved in key cellular activities, such as transcription and translation. PMID:18786653
A mutually exclusive stem–loop arrangement in roX2 RNA is essential for X-chromosome regulation in Drosophila

PubMed Central

Ilik, Ibrahim Avsar; Maticzka, Daniel; Georgiev, Plamen; Gutierrez, Noel Marie; Backofen, Rolf; Akhtar, Asifa

2017-01-01

The X chromosome provides an ideal model system to study the contribution of RNA–protein interactions in epigenetic regulation. In male flies, roX long noncoding RNAs (lncRNAs) harbor several redundant domains to interact with the ubiquitin ligase male-specific lethal 2 (MSL2) and the RNA helicase Maleless (MLE) for X-chromosomal regulation. However, how these interactions provide the mechanics of spreading remains unknown. By using the uvCLAP (UV cross-linking and affinity purification) methodology, which provides unprecedented information about RNA secondary structures in vivo, we identified the minimal functional unit of roX2 RNA. By using wild-type and various MLE mutant derivatives, including a catalytically inactive MLE derivative, MLEGET, we show that the minimal roX RNA contains two mutually exclusive stem–loops that exist in a peculiar structural arrangement: When one stem–loop is unwound by MLE, an alternate structure can form, likely trapping MLE in this perpetually structured region. We show that this functional unit is necessary for dosage compensation, as mutations that disrupt this formation lead to male lethality. Thus, we propose that roX2 lncRNA contains an MLE-dependent affinity switch to enable reversible interactions of the MSL complex to allow dosage compensation of the X chromosome. PMID:29066499
Fab Chaperone-Assisted RNA Crystallography (Fab CARC).

PubMed

Sherman, Eileen; Archer, Jennifer; Ye, Jing-Dong

2016-01-01

Recent discovery of structured RNAs such as ribozymes and riboswitches shows that there is still much to learn about the structure and function of RNAs. Knowledge learned can be employed in both biochemical research and clinical applications. X-ray crystallography gives unparalleled atomic-level structural detail from which functional inferences can be deduced. However, the difficulty in obtaining high-quality crystals and their phasing information make it a very challenging task. RNA crystallography is particularly arduous due to several factors such as RNA's paucity of surface chemical diversity, lability, repetitive anionic backbone, and flexibility, all of which are counterproductive to crystal packing. Here we describe Fab chaperone assisted RNA crystallography (CARC), a systematic technique to increase RNA crystallography success by facilitating crystal packing as well as expediting phase determination through molecular replacement of conserved Fab domains. Major steps described in this chapter include selection of a synthetic Fab library displayed on M13 phage against a structured RNA crystallization target, ELISA for initial choice of binding Fabs, Fab expression followed by protein A affinity then cation exchange chromatography purification, final choice of Fab by binding specificity and affinity as determined by a dot blot assay, and lastly gel filtration purification of a large quantity of chosen Fabs for crystallization.
Structural analysis of the human U3 ribonucleoprotein particle reveal a conserved sequence available for base pairing with pre-rRNA.

PubMed Central

Parker, K A; Steitz, J A

1987-01-01

The human U3 ribonucleoprotein (RNP) has been analyzed to determine its protein constituents, sites of protein-RNA interaction, and RNA secondary structure. By using anti-U3 RNP antibodies and extracts prepared from HeLa cells labeled in vivo, the RNP was found to contain four nonphosphorylated proteins of 36, 30, 13, and 12.5 kilodaltons and two phosphorylated proteins of 74 and 59 kilodaltons. U3 nucleotides 72-90, 106-121, 154-166, and 190-217 must contain sites that interact with proteins since these regions are immunoprecipitated after treatment of the RNP with RNase A or T1. The secondary structure was probed with specific nucleases and by chemical modification with single-strand-specific reagents that block subsequent reverse transcription. Regions that are single stranded (and therefore potentially able to interact with a substrate RNA) include an evolutionarily conserved sequence at nucleotides 104-112 and nonconserved sequences at nucleotides 65-74, 80-84, and 88-93. Nucleotides 159-168 do not appear to be highly accessible, thus making it unlikely that this U3 sequence base pairs with sequences near the 5.8S rRNA-internal transcribed spacer II junction, as previously proposed. Alternative functions of the U3 RNP are discussed, including the possibility that U3 may participate in a processing event near the 3' end of 28S rRNA. Images PMID:2959855
Single-molecule FRET-Rosetta reveals RNA structural rearrangements during human telomerase catalysis

PubMed Central

Parks, Joseph W.; Kappel, Kalli; Das, Rhiju; Stone, Michael D.

2017-01-01

Maintenance of telomeres by telomerase permits continuous proliferation of rapidly dividing cells, including the majority of human cancers. Despite its direct biomedical significance, the architecture of the human telomerase complex remains unknown. Generating homogeneous telomerase samples has presented a significant barrier to developing improved structural models. Here we pair single-molecule Förster resonance energy transfer (smFRET) measurements with Rosetta modeling to map the conformations of the essential telomerase RNA core domain within the active ribonucleoprotein. FRET-guided modeling places the essential pseudoknot fold distal to the active site on a protein surface comprising the C-terminal element, a domain that shares structural homology with canonical polymerase thumb domains. An independently solved medium-resolution structure of Tetrahymena telomerase provides a blind test of our modeling methodology and sheds light on the structural homology of this domain across diverse organisms. Our smFRET-Rosetta models reveal nanometer-scale rearrangements within the RNA core domain during catalysis. Taken together, our FRET data and pseudoatomic molecular models permit us to propose a possible mechanism for how RNA core domain rearrangement is coupled to template hybrid elongation. PMID:28096444
Structural basis of RNA recognition and activation by innate immune receptor RIG-I

DOE Office of Scientific and Technical Information (OSTI.GOV)

Jiang, Fuguo; Ramanathan, Anand; Miller, Matthew T.

Retinoic-acid-inducible gene-I (RIG-I; also known as DDX58) is a cytoplasmic pathogen recognition receptor that recognizes pathogen-associated molecular pattern (PAMP) motifs to differentiate between viral and cellular RNAs. RIG-I is activated by blunt-ended double-stranded (ds)RNA with or without a 5'-triphosphate (ppp), by single-stranded RNA marked by a 5'-ppp and by polyuridine sequences. Upon binding to such PAMP motifs, RIG-I initiates a signalling cascade that induces innate immune defences and inflammatory cytokines to establish an antiviral state. The RIG-I pathway is highly regulated and aberrant signalling leads to apoptosis, altered cell differentiation, inflammation, autoimmune diseases and cancer. The helicase and repressor domainsmore » (RD) of RIG-I recognize dsRNA and 5'-ppp RNA to activate the two amino-terminal caspase recruitment domains (CARDs) for signalling. Here, to understand the synergy between the helicase and the RD for RNA binding, and the contribution of ATP hydrolysis to RIG-I activation, we determined the structure of human RIG-I helicase-RD in complex with dsRNA and an ATP analogue. The helicase-RD organizes into a ring around dsRNA, capping one end, while contacting both strands using previously uncharacterized motifs to recognize dsRNA. Small-angle X-ray scattering, limited proteolysis and differential scanning fluorimetry indicate that RIG-I is in an extended and flexible conformation that compacts upon binding RNA. These results provide a detailed view of the role of helicase in dsRNA recognition, the synergy between the RD and the helicase for RNA binding and the organization of full-length RIG-I bound to dsRNA, and provide evidence of a conformational change upon RNA binding. The RIG-I helicase-RD structure is consistent with dsRNA translocation without unwinding and cooperative binding to RNA. The structure yields unprecedented insight into innate immunity and has a broader impact on other areas of biology, including RNA interference and DNA repair, which utilize homologous helicase domains within DICER and FANCM.« less
A novel RNA binding surface of the TAM domain of TIP5/BAZ2A mediates epigenetic regulation of rRNA genes.

PubMed

Anosova, Irina; Melnik, Svitlana; Tripsianes, Konstantinos; Kateb, Fatiha; Grummt, Ingrid; Sattler, Michael

2015-05-26

The chromatin remodeling complex NoRC, comprising the subunits SNF2h and TIP5/BAZ2A, mediates heterochromatin formation at major clusters of repetitive elements, including rRNA genes, centromeres and telomeres. Association with chromatin requires the interaction of the TAM (TIP5/ARBP/MBD) domain of TIP5 with noncoding RNA, which targets NoRC to specific genomic loci. Here, we show that the NMR structure of the TAM domain of TIP5 resembles the fold of the MBD domain, found in methyl-CpG binding proteins. However, the TAM domain exhibits an extended MBD fold with unique C-terminal extensions that constitute a novel surface for RNA binding. Mutation of critical amino acids within this surface abolishes RNA binding in vitro and in vivo. Our results explain the distinct binding specificities of TAM and MBD domains to RNA and methylated DNA, respectively, and reveal structural features for the interaction of NoRC with non-coding RNA. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Systematic discovery of Xist RNA binding proteins

PubMed Central

Chu, Ci; Zhang, Qiangfeng Cliff; da Rocha, Simão Teixeira; Flynn, Ryan A.; Bharadwaj, Maheetha; Calabrese, J. Mauro; Magnuson, Terry; Heard, Edith; Chang, Howard Y.

2015-01-01

Summary Noncoding RNAs (ncRNAs) function with associated proteins to effect complex structural and regulatory outcomes. To reveal the composition and dynamics of specific noncoding RNA- protein complexes (RNPs) in vivo, we developed comprehensive identification of RNA-binding proteins by mass spectrometry (ChIRP-MS). ChIRP-MS analysis of four ncRNAs captures key protein interactors, including a U1-specific link to the 3′ RNA processing machinery. Xist, an essential lncRNA for X-chromosome inactivation (XCI), interacts with 81 proteins from chromatin modification, nuclear matrix, and RNA remodeling pathways. The Xist RNA-protein particle assembles in two steps coupled with the transition from pluripotency to differentiation. Specific interactors include HnrnpK that participates in Xist-mediated gene silencing and histone modifications, but not Xist localization and Drosophila Split ends homolog Spen that interacts via the A-repeat domain of Xist and is required for gene silencing. Thus, Xist lncRNA engages with proteins in a modular and developmentally controlled manner to coordinate chromatin spreading and silencing. PMID:25843628
Single-stranded DNA and RNA origami.

PubMed

Han, Dongran; Qi, Xiaodong; Myhrvold, Cameron; Wang, Bei; Dai, Mingjie; Jiang, Shuoxing; Bates, Maxwell; Liu, Yan; An, Byoungkwon; Zhang, Fei; Yan, Hao; Yin, Peng

2017-12-15

Self-folding of an information-carrying polymer into a defined structure is foundational to biology and offers attractive potential as a synthetic strategy. Although multicomponent self-assembly has produced complex synthetic nanostructures, unimolecular folding has seen limited progress. We describe a framework to design and synthesize a single DNA or RNA strand to self-fold into a complex yet unknotted structure that approximates an arbitrary user-prescribed shape. We experimentally construct diverse multikilobase single-stranded structures, including a ~10,000-nucleotide (nt) DNA structure and a ~6000-nt RNA structure. We demonstrate facile replication of the strand in vitro and in living cells. The work here thus establishes unimolecular folding as a general strategy for constructing complex and replicable nucleic acid nanostructures, and expands the design space and material scalability for bottom-up nanotechnology. Copyright © 2017 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.
Assemble: an interactive graphical tool to analyze and build RNA architectures at the 2D and 3D levels.

PubMed

Jossinet, Fabrice; Ludwig, Thomas E; Westhof, Eric

2010-08-15

Assemble is an intuitive graphical interface to analyze, manipulate and build complex 3D RNA architectures. It provides several advanced and unique features within the framework of a semi-automated modeling process that can be performed by homology and ab initio with or without electron density maps. Those include the interactive editing of a secondary structure and a searchable, embedded library of annotated tertiary structures. Assemble helps users with performing recurrent and otherwise tedious tasks in structural RNA research. Assemble is released under an open-source license (MIT license) and is freely available at http://bioinformatics.org/assemble. It is implemented in the Java language and runs on MacOSX, Linux and Windows operating systems.
Optimized guide RNA structure for genome editing via Cas9

PubMed Central

Xu, Jianyong; Lian, Wei; Jia, Yuning; Li, Lingyun; Huang, Zhong

2017-01-01

The genome editing tool Cas9-gRNA (guide RNA) has been successfully applied in different cell types and organisms with high efficiency. However, more efforts need to be made to enhance both efficiency and specificity. In the current study, we optimized the guide RNA structure of Streptococcus pyogenes CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats)/Cas (CRISPR-associated) system to improve its genome editing efficiency. Comparing with the original functional structure of guide RNA, which is composed of crRNA and tracrRNA, the widely used chimeric gRNA has shorter crRNA and tracrRNA sequence. The deleted RNA sequence could form extra loop structure, which might enhance the stability of the guide RNA structure and subsequently the genome editing efficiency. Thus the genome editing efficiency of different forms of guide RNA was tested. And we found that the chimeric structure of gRNA with original full length of crRNA and tracrRNA showed higher genome editing efficiency than the conventional chimeric structure or other types of gRNA we tested. Therefore our data here uncovered the new type of gRNA structure with higher genome editing efficiency. PMID:29212218
Single-molecule FRET reveals a corkscrew RNA structure for the polymerase-bound influenza virus promoter.

PubMed

Tomescu, Alexandra I; Robb, Nicole C; Hengrung, Narin; Fodor, Ervin; Kapanidis, Achillefs N

2014-08-12

The influenza virus is a major human and animal pathogen responsible for seasonal epidemics and occasional pandemics. The genome of the influenza A virus comprises eight segments of single-stranded, negative-sense RNA with highly conserved 5' and 3' termini. These termini interact to form a double-stranded promoter structure that is recognized and bound by the viral RNA-dependent RNA polymerase (RNAP); however, no 3D structural information for the influenza polymerase-bound promoter exists. Functional studies have led to the proposal of several 2D models for the secondary structure of the bound promoter, including a corkscrew model in which the 5' and 3' termini form short hairpins. We have taken advantage of an insect-cell system to prepare large amounts of active recombinant influenza virus RNAP, and used this to develop a highly sensitive single-molecule FRET assay to measure distances between fluorescent dyes located on the promoter and map its structure both with and without the polymerase bound. These advances enabled the direct analysis of the influenza promoter structure in complex with the viral RNAP, and provided 3D structural information that is in agreement with the corkscrew model for the influenza virus promoter RNA. Our data provide insights into the mechanisms of promoter binding by the influenza RNAP and have implications for the understanding of the regulatory mechanisms involved in the transcription of viral genes and replication of the viral RNA genome. In addition, the simplicity of this system should translate readily to the study of any virus polymerase-promoter interaction.
Single-molecule FRET reveals a corkscrew RNA structure for the polymerase-bound influenza virus promoter

PubMed Central

Tomescu, Alexandra I.; Robb, Nicole C.; Hengrung, Narin; Fodor, Ervin; Kapanidis, Achillefs N.

2014-01-01

The influenza virus is a major human and animal pathogen responsible for seasonal epidemics and occasional pandemics. The genome of the influenza A virus comprises eight segments of single-stranded, negative-sense RNA with highly conserved 5′ and 3′ termini. These termini interact to form a double-stranded promoter structure that is recognized and bound by the viral RNA-dependent RNA polymerase (RNAP); however, no 3D structural information for the influenza polymerase-bound promoter exists. Functional studies have led to the proposal of several 2D models for the secondary structure of the bound promoter, including a corkscrew model in which the 5′ and 3′ termini form short hairpins. We have taken advantage of an insect-cell system to prepare large amounts of active recombinant influenza virus RNAP, and used this to develop a highly sensitive single-molecule FRET assay to measure distances between fluorescent dyes located on the promoter and map its structure both with and without the polymerase bound. These advances enabled the direct analysis of the influenza promoter structure in complex with the viral RNAP, and provided 3D structural information that is in agreement with the corkscrew model for the influenza virus promoter RNA. Our data provide insights into the mechanisms of promoter binding by the influenza RNAP and have implications for the understanding of the regulatory mechanisms involved in the transcription of viral genes and replication of the viral RNA genome. In addition, the simplicity of this system should translate readily to the study of any virus polymerase–promoter interaction. PMID:25071209
Oxidative damage of 18S and 5S ribosomal RNA in digestive gland of mussels exposed to trace metals.

PubMed

Kournoutou, Georgia G; Giannopoulou, Panagiota C; Sazakli, Eleni; Leotsinidis, Michel; Kalpaxis, Dimitrios L

2017-11-01

Numerous studies have shown the ability of trace metals to accumulate in marine organisms and cause oxidative stress that leads to perturbations in many important intracellular processes, including protein synthesis. This study is mainly focused on the exploration of structural changes, like base modifications, scissions, and conformational changes, caused in 18S and 5S ribosomal RNA (rRNA) isolated from the mussel Mytilus galloprovincialis exposed to 40μg/L Cu, 30μg/L Hg, or 100μg/L Cd, for 5 or 15days. 18S rRNA and 5S rRNA are components of the small and large ribosomal subunit, respectively, found in complex with ribosomal proteins, translation factors and other auxiliary components (metal ions, toxins etc). 18S rRNA plays crucial roles in all stages of protein synthesis, while 5S rRNA serves as a master signal transducer between several functional regions of 28S rRNA. Therefore, structural changes in these ribosomal constituents could affect the basic functions of ribosomes and hence the normal metabolism of cells. Especially, 18S rRNA along with ribosomal proteins forms the decoding centre that ensures the correct codon-anticodon pairing. As exemplified by ELISA, primer extension analysis and DMS footprinting analysis, each metal caused oxidative damage to rRNA, depending on the nature of metal ion and the duration of exposure. Interestingly, exposure of mussels to Cu or Hg caused structural alterations in 5S rRNA, localized in paired regions and within loops A, B, C, and E, leading to a continuous progressive loss of the 5S RNA structural integrity. In contrast, structural impairments of 5S rRNA in mussels exposed to Cd were accumulating for the initial 5days, and then progressively decreased to almost the normal level by day 15, probably due to the parallel elevation of metallothionein content that depletes the pools of free Cd. Regions of interest in 18S rRNA, such as the decoding centre, sites implicated in the binding of tRNAs (A- and P-sites) or translation factors, and areas related to translation fidelity, were found to undergo significant metal-induced conformational alterations, leading either to loosening of their structure or to more compact folding. These modifications were associated with parallel alterations in the translation process at multiple levels, a fact suggesting that structural perturbations in ribosomes, caused by metals, pose significant hurdles in translational efficiency and fidelity. Copyright © 2017 Elsevier B.V. All rights reserved.
Structural basis of malaria parasite lysyl-tRNA synthetase inhibition by cladosporin.

PubMed

Khan, Sameena; Sharma, Arvind; Belrhali, Hassan; Yogavel, Manickam; Sharma, Amit

2014-06-01

Malaria parasites inevitably develop drug resistance to anti-malarials over time. Hence the immediacy for discovering new chemical scaffolds to include in combination malaria drug therapy. The desirable attributes of new chemotherapeutic agents currently include activity against both liver and blood stage malaria parasites. One such recently discovered compound called cladosporin abrogates parasite growth via inhibition of Plasmodium falciparum lysyl-tRNA synthetase (PfKRS), an enzyme central to protein translation. Here, we present crystal structure of ternary PfKRS-lysine-cladosporin (PfKRS-K-C) complex that reveals cladosporin's remarkable ability to mimic the natural substrate adenosine and thereby colonize PfKRS active site. The isocoumarin fragment of cladosporin sandwiches between critical adenine-recognizing residues while its pyran ring fits snugly in the ribose-recognizing cavity. PfKRS-K-C structure highlights ample space within PfKRS active site for further chemical derivatization of cladosporin. Such derivatives may be useful against additional human pathogens that retain high conservation in cladosporin chelating residues within their lysyl-tRNA synthetase.
Conserved and divergent features of the structure and function of La and La-related proteins (LARPs)

PubMed Central

Bayfield, Mark A.; Yang, Ruiqing; Maraia, Richard J.

2010-01-01

Genuine La proteins contain two RNA binding motifs, a La motif (LAM) followed by a RNA recognition motif (RRM), arranged in a unique way to bind RNA. These proteins interact with an extensive variety of cellular RNAs and exhibit activities in two broad categories: i) to promote the metabolism of nascent pol III transcripts, including precursor-tRNAs, by binding to their common, UUU-3’OH containing ends, and ii) to modulate the translation of certain mRNAs involving an unknown binding mechanism. Characterization of several La-RNA crystal structures as well as biochemical studies reveal insight into their unique two-motif domain architecture and how the LAM recognizes UUU-3’OH while the RRM binds other parts of a pre-tRNA. Recent studies of members of distinct families of conserved La-related proteins (LARPs) indicate that some of these harbor activity related to genuine La proteins, suggesting that their UUU-3’OH binding mode has been appropriated for the assembly and regulation of a specific snRNP (e.g., 7SK snRNA assembly by hLARP7/PIP7S). Analyses of other LARP family members (i.e., hLARP4, hLARP6) suggest more diverged RNA binding modes and specialization for cytoplasmic mRNA-related functions. Thus it appears that while genuine La proteins exhibit broad general involvement in both snRNA-related and mRNA-related functions, different LARP families may have evolved specialized activities in either snRNA or mRNA related functions. In this review, we summarize recent progress that has led to greater understanding of the structure and function of La proteins and their roles in tRNA processing and RNP assembly dynamics, as well as progress on the different LARPs. PMID:20138158

Structural elements and organization of the ancestral translational machinery

NASA Technical Reports Server (NTRS)

Rein, R.; Srinivasan, S.; Mcdonald, J.; Raghunathan, G.; Shibata, M.

1987-01-01

The molecular mechanisms of the primitive translational apparatus are discussed in the framework of present-day protein biosynthesis. The structural necessities of an early adaptor and the multipoint recognition properties of such an adaptor are investigated on the basis of structure/function relationships found in a contemporary system and a molecular model of the contemporary transpeptidation complex. A model of the tRNA(Tyr)-tyrosyl tRNA synthetase complex including the positioning of the disordered region is proposed; the model is used to illustrate the required recognition properties of the ancestor aminoacyl synthetase.
Shaping tRNA

ERIC Educational Resources Information Center

Priano, Christine

2013-01-01

This model-building activity provides a quick, visual, hands-on tool that allows students to examine more carefully the cloverleaf structure of a typical tRNA molecule. When used as a supplement to lessons that involve gene expression, this exercise reinforces several concepts in molecular genetics, including nucleotide base-pairing rules, the…
The determinants of alternative RNA splicing in human cells.

PubMed

Ramanouskaya, Tatsiana V; Grinev, Vasily V

2017-12-01

Alternative splicing represents an important level of the regulation of gene function in eukaryotic organisms. It plays a critical role in virtually every biological process within an organism, including regulation of cell division and cell death, differentiation of tissues in the embryo and the adult organism, as well as in cellular response to diverse environmental factors. In turn, studies of the last decade have shown that alternative splicing itself is controlled by different mechanisms. Unfortunately, there is no clear understanding of how these diverse mechanisms, or determinants, regulate and constrain the set of alternative RNA species produced from any particular gene in every cell of the human body. Here, we provide a consolidated overview of alternative splicing determinants including RNA-protein interactions, epigenetic regulation via chromatin remodeling, coupling of transcription-to-alternative splicing, effect of secondary structures in pre-RNA, and function of the RNA quality control systems. We also extensively and critically discuss some mechanistic insights on coordinated inclusion/exclusion of exons during the formation of mature RNA molecules. We conclude that the final structure of RNA is pre-determined by a complex interplay between cis- and trans-acting factors. Altogether, currently available empirical data significantly expand our understanding of the functioning of the alternative splicing machinery of cells in normal and pathological conditions. On the other hand, there are still many blind spots that require further deep investigations.
Detection of 224 candidate structured RNAs by comparative analysis of specific subsets of intergenic regions

PubMed Central

Lünse, Christina E.; Corbino, Keith A.; Ames, Tyler D.; Nelson, James W.; Roth, Adam; Perkins, Kevin R.; Sherlock, Madeline E.

2017-01-01

Abstract The discovery of structured non-coding RNAs (ncRNAs) in bacteria can reveal new facets of biology and biochemistry. Comparative genomics analyses executed by powerful computer algorithms have successfully been used to uncover many novel bacterial ncRNA classes in recent years. However, this general search strategy favors the discovery of more common ncRNA classes, whereas progressively rarer classes are correspondingly more difficult to identify. In the current study, we confront this problem by devising several methods to select subsets of intergenic regions that can concentrate these rare RNA classes, thereby increasing the probability that comparative sequence analysis approaches will reveal their existence. By implementing these methods, we discovered 224 novel ncRNA classes, which include ROOL RNA, an RNA class averaging 581 nt and present in multiple phyla, several highly conserved and widespread ncRNA classes with properties that suggest sophisticated biochemical functions and a multitude of putative cis-regulatory RNA classes involved in a variety of biological processes. We expect that further research on these newly found RNA classes will reveal additional aspects of novel biology, and allow for greater insights into the biochemistry performed by ncRNAs. PMID:28977401
The gene coding for small ribosomal subunit RNA in the basidiomycete Ustilago maydis contains a group I intron.

PubMed Central

De Wachter, R; Neefs, J M; Goris, A; Van de Peer, Y

1992-01-01

The nucleotide sequence of the gene coding for small ribosomal subunit RNA in the basidiomycete Ustilago maydis was determined. It revealed the presence of a group I intron with a length of 411 nucleotides. This is the third occurrence of such an intron discovered in a small subunit rRNA gene encoded by a eukaryotic nuclear genome. The other two occurrences are in Pneumocystis carinii, a fungus of uncertain taxonomic status, and Ankistrodesmus stipitatus, a green alga. The nucleotides of the conserved core structure of 101 group I intron sequences present in different genes and genome types were aligned and their evolutionary relatedness was examined. This revealed a cluster including all group I introns hitherto found in eukaryotic nuclear genes coding for small and large subunit rRNAs. A secondary structure model was designed for the area of the Ustilago maydis small ribosomal subunit RNA precursor where the intron is situated. It shows that the internal guide sequence pairing with the intron boundaries fits between two helices of the small subunit rRNA, and that minimal rearrangement of base pairs suffices to achieve the definitive secondary structure of the 18S rRNA upon splicing. PMID:1561081
A semi-supervised learning approach for RNA secondary structure prediction.

PubMed

Yonemoto, Haruka; Asai, Kiyoshi; Hamada, Michiaki

2015-08-01

RNA secondary structure prediction is a key technology in RNA bioinformatics. Most algorithms for RNA secondary structure prediction use probabilistic models, in which the model parameters are trained with reliable RNA secondary structures. Because of the difficulty of determining RNA secondary structures by experimental procedures, such as NMR or X-ray crystal structural analyses, there are still many RNA sequences that could be useful for training whose secondary structures have not been experimentally determined. In this paper, we introduce a novel semi-supervised learning approach for training parameters in a probabilistic model of RNA secondary structures in which we employ not only RNA sequences with annotated secondary structures but also ones with unknown secondary structures. Our model is based on a hybrid of generative (stochastic context-free grammars) and discriminative models (conditional random fields) that has been successfully applied to natural language processing. Computational experiments indicate that the accuracy of secondary structure prediction is improved by incorporating RNA sequences with unknown secondary structures into training. To our knowledge, this is the first study of a semi-supervised learning approach for RNA secondary structure prediction. This technique will be useful when the number of reliable structures is limited. Copyright © 2015 Elsevier Ltd. All rights reserved.
Mitochondrial genome of the African lion Panthera leo leo.

PubMed

Ma, Yue-ping; Wang, Shuo

2015-01-01

In this study, the complete mitochondrial genome sequence of the African lion P. leo leo was reported. The total length of the mitogenome was 17,054 bp. It contained the typical mitochondrial structure, including 13 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes and 1 control region; 21 of the tRNA genes folded into typical cloverleaf secondary structure except for tRNASe. The overall composition of the mitogenome was A (32.0%), G (14.5%), C (26.5%) and T (27.0%). The new sequence will provide molecular genetic information for conservation genetics study of this important large carnivore.
Simulated rRNA/DNA Ratios Show Potential To Misclassify Active Populations as Dormant

DOE Office of Scientific and Technical Information (OSTI.GOV)

Steven, Blaire; Hesse, Cedar; Soghigian, John

The use of rRNA/DNA ratios derived from surveys of rRNA sequences in RNA and DNA extracts is an appealing but poorly validated approach to infer the activity status of environmental microbes. To improve the interpretation of rRNA/DNA ratios, we performed simulations to investigate the effects of community structure, rRNA amplification, and sampling depth on the accuracy of rRNA/DNA ratios in classifying bacterial populations as “active” or “dormant.” Community structure was an insignificant factor. In contrast, the extent of rRNA amplification that occurs as cells transition from dormant to growing had a significant effect (P < 0.0001) on classification accuracy, withmore » misclassification errors ranging from 16 to 28%, depending on the rRNA amplification model. The error rate increased to 47% when communities included a mixture of rRNA amplification models, but most of the inflated error was false negatives (i.e., active populations misclassified as dormant). Sampling depth also affected error rates (P < 0.001). Inadequate sampling depth produced various artifacts that are characteristic of rRNA/DNA ratios generated from real communities. These data show important constraints on the use of rRNA/DNA ratios to infer activity status. Whereas classification of populations as active based on rRNA/DNA ratios appears generally valid, classification of populations as dormant is potentially far less accurate.« less
Simulated rRNA/DNA Ratios Show Potential To Misclassify Active Populations as Dormant

DOE PAGES

Steven, Blaire; Hesse, Cedar; Soghigian, John; ...

2017-03-31

The use of rRNA/DNA ratios derived from surveys of rRNA sequences in RNA and DNA extracts is an appealing but poorly validated approach to infer the activity status of environmental microbes. To improve the interpretation of rRNA/DNA ratios, we performed simulations to investigate the effects of community structure, rRNA amplification, and sampling depth on the accuracy of rRNA/DNA ratios in classifying bacterial populations as “active” or “dormant.” Community structure was an insignificant factor. In contrast, the extent of rRNA amplification that occurs as cells transition from dormant to growing had a significant effect (P < 0.0001) on classification accuracy, withmore » misclassification errors ranging from 16 to 28%, depending on the rRNA amplification model. The error rate increased to 47% when communities included a mixture of rRNA amplification models, but most of the inflated error was false negatives (i.e., active populations misclassified as dormant). Sampling depth also affected error rates (P < 0.001). Inadequate sampling depth produced various artifacts that are characteristic of rRNA/DNA ratios generated from real communities. These data show important constraints on the use of rRNA/DNA ratios to infer activity status. Whereas classification of populations as active based on rRNA/DNA ratios appears generally valid, classification of populations as dormant is potentially far less accurate.« less
Modelling the structure of a ceRNA-theoretical, bipartite microRNA-mRNA interaction network regulating intestinal epithelial cellular pathways using R programming.

PubMed

Robinson, J M; Henderson, W A

2018-01-12

We report a method using functional-molecular databases and network modelling to identify hypothetical mRNA-miRNA interaction networks regulating intestinal epithelial barrier function. The model forms a data-analysis component of our cell culture experiments, which produce RNA expression data from Nanostring Technologies nCounter ® system. The epithelial tight-junction (TJ) and actin cytoskeleton interact as molecular components of the intestinal epithelial barrier. Upstream regulation of TJ-cytoskeleton interaction is effected by the Rac/Rock/Rho signaling pathway and other associated pathways which may be activated or suppressed by extracellular signaling from growth factors, hormones, and immune receptors. Pathway activations affect epithelial homeostasis, contributing to degradation of the epithelial barrier associated with osmotic dysregulation, inflammation, and tumor development. The complexity underlying miRNA-mRNA interaction networks represents a roadblock for prediction and validation of competing-endogenous RNA network function. We developed a network model to identify hypothetical co-regulatory motifs in a miRNA-mRNA interaction network related to epithelial function. A mRNA-miRNA interaction list was generated using KEGG and miRWalk2.0 databases. R-code was developed to quantify and visualize inherent network structures. We identified a sub-network with a high number of shared, targeting miRNAs, of genes associated with cellular proliferation and cancer, including c-MYC and Cyclin D.
RNA Thermodynamic Structural Entropy

PubMed Central

Garcia-Martin, Juan Antonio; Clote, Peter

2015-01-01

Conformational entropy for atomic-level, three dimensional biomolecules is known experimentally to play an important role in protein-ligand discrimination, yet reliable computation of entropy remains a difficult problem. Here we describe the first two accurate and efficient algorithms to compute the conformational entropy for RNA secondary structures, with respect to the Turner energy model, where free energy parameters are determined from UV absorption experiments. An algorithm to compute the derivational entropy for RNA secondary structures had previously been introduced, using stochastic context free grammars (SCFGs). However, the numerical value of derivational entropy depends heavily on the chosen context free grammar and on the training set used to estimate rule probabilities. Using data from the Rfam database, we determine that both of our thermodynamic methods, which agree in numerical value, are substantially faster than the SCFG method. Thermodynamic structural entropy is much smaller than derivational entropy, and the correlation between length-normalized thermodynamic entropy and derivational entropy is moderately weak to poor. In applications, we plot the structural entropy as a function of temperature for known thermoswitches, such as the repression of heat shock gene expression (ROSE) element, we determine that the correlation between hammerhead ribozyme cleavage activity and total free energy is improved by including an additional free energy term arising from conformational entropy, and we plot the structural entropy of windows of the HIV-1 genome. Our software RNAentropy can compute structural entropy for any user-specified temperature, and supports both the Turner’99 and Turner’04 energy parameters. It follows that RNAentropy is state-of-the-art software to compute RNA secondary structure conformational entropy. Source code is available at https://github.com/clotelab/RNAentropy/; a full web server is available at http://bioinformatics.bc.edu/clotelab/RNAentropy, including source code and ancillary programs. PMID:26555444
RNA Thermodynamic Structural Entropy.

PubMed

Garcia-Martin, Juan Antonio; Clote, Peter

2015-01-01

Conformational entropy for atomic-level, three dimensional biomolecules is known experimentally to play an important role in protein-ligand discrimination, yet reliable computation of entropy remains a difficult problem. Here we describe the first two accurate and efficient algorithms to compute the conformational entropy for RNA secondary structures, with respect to the Turner energy model, where free energy parameters are determined from UV absorption experiments. An algorithm to compute the derivational entropy for RNA secondary structures had previously been introduced, using stochastic context free grammars (SCFGs). However, the numerical value of derivational entropy depends heavily on the chosen context free grammar and on the training set used to estimate rule probabilities. Using data from the Rfam database, we determine that both of our thermodynamic methods, which agree in numerical value, are substantially faster than the SCFG method. Thermodynamic structural entropy is much smaller than derivational entropy, and the correlation between length-normalized thermodynamic entropy and derivational entropy is moderately weak to poor. In applications, we plot the structural entropy as a function of temperature for known thermoswitches, such as the repression of heat shock gene expression (ROSE) element, we determine that the correlation between hammerhead ribozyme cleavage activity and total free energy is improved by including an additional free energy term arising from conformational entropy, and we plot the structural entropy of windows of the HIV-1 genome. Our software RNAentropy can compute structural entropy for any user-specified temperature, and supports both the Turner'99 and Turner'04 energy parameters. It follows that RNAentropy is state-of-the-art software to compute RNA secondary structure conformational entropy. Source code is available at https://github.com/clotelab/RNAentropy/; a full web server is available at http://bioinformatics.bc.edu/clotelab/RNAentropy, including source code and ancillary programs.
Computational RNomics of Drosophilids

PubMed Central

Rose, Dominic; Hackermüller, Jörg; Washietl, Stefan; Reiche, Kristin; Hertel, Jana; Findeiß, Sven; Stadler, Peter F; Prohaska, Sonja J

2007-01-01

Background Recent experimental and computational studies have provided overwhelming evidence for a plethora of diverse transcripts that are unrelated to protein-coding genes. One subclass consists of those RNAs that require distinctive secondary structure motifs to exert their biological function and hence exhibit distinctive patterns of sequence conservation characteristic for positive selection on RNA secondary structure. The deep-sequencing of 12 drosophilid species coordinated by the NHGRI provides an ideal data set of comparative computational approaches to determine those genomic loci that code for evolutionarily conserved RNA motifs. This class of loci includes the majority of the known small ncRNAs as well as structured RNA motifs in mRNAs. We report here on a genome-wide survey using RNAz. Results We obtain 16 000 high quality predictions among which we recover the majority of the known ncRNAs. Taking a pessimistically estimated false discovery rate of 40% into account, this implies that at least some ten thousand loci in the Drosophila genome show the hallmarks of stabilizing selection action of RNA structure, and hence are most likely functional at the RNA level. A subset of RNAz predictions overlapping with TRF1 and BRF binding sites [Isogai et al., EMBO J. 26: 79–89 (2007)], which are plausible candidates of Pol III transcripts, have been studied in more detail. Among these sequences we identify several "clusters" of ncRNA candidates with striking structural similarities. Conclusion The statistical evaluation of the RNAz predictions in comparison with a similar analysis of vertebrate genomes [Washietl et al., Nat. Biotech. 23: 1383–1390 (2005)] shows that qualitatively similar fractions of structured RNAs are found in introns, UTRs, and intergenic regions. The intergenic RNA structures, however, are concentrated much more closely around known protein-coding loci, suggesting that flies have significantly smaller complement of independent structured ncRNAs compared to mammals. PMID:17996037
Comparative structural analysis of human DEAD-box RNA helicases.

PubMed

Schütz, Patrick; Karlberg, Tobias; van den Berg, Susanne; Collins, Ruairi; Lehtiö, Lari; Högbom, Martin; Holmberg-Schiavone, Lovisa; Tempel, Wolfram; Park, Hee-Won; Hammarström, Martin; Moche, Martin; Thorsell, Ann-Gerd; Schüler, Herwig

2010-09-30

DEAD-box RNA helicases play various, often critical, roles in all processes where RNAs are involved. Members of this family of proteins are linked to human disease, including cancer and viral infections. DEAD-box proteins contain two conserved domains that both contribute to RNA and ATP binding. Despite recent advances the molecular details of how these enzymes convert chemical energy into RNA remodeling is unknown. We present crystal structures of the isolated DEAD-domains of human DDX2A/eIF4A1, DDX2B/eIF4A2, DDX5, DDX10/DBP4, DDX18/myc-regulated DEAD-box protein, DDX20, DDX47, DDX52/ROK1, and DDX53/CAGE, and of the helicase domains of DDX25 and DDX41. Together with prior knowledge this enables a family-wide comparative structural analysis. We propose a general mechanism for opening of the RNA binding site. This analysis also provides insights into the diversity of DExD/H- proteins, with implications for understanding the functions of individual family members.
Comparative Structural Analysis of Human DEAD-Box RNA Helicases

PubMed Central

Schütz, Patrick; Karlberg, Tobias; van den Berg, Susanne; Collins, Ruairi; Lehtiö, Lari; Högbom, Martin; Holmberg-Schiavone, Lovisa; Tempel, Wolfram; Park, Hee-Won; Hammarström, Martin; Moche, Martin; Thorsell, Ann-Gerd; Schüler, Herwig

2010-01-01

DEAD-box RNA helicases play various, often critical, roles in all processes where RNAs are involved. Members of this family of proteins are linked to human disease, including cancer and viral infections. DEAD-box proteins contain two conserved domains that both contribute to RNA and ATP binding. Despite recent advances the molecular details of how these enzymes convert chemical energy into RNA remodeling is unknown. We present crystal structures of the isolated DEAD-domains of human DDX2A/eIF4A1, DDX2B/eIF4A2, DDX5, DDX10/DBP4, DDX18/myc-regulated DEAD-box protein, DDX20, DDX47, DDX52/ROK1, and DDX53/CAGE, and of the helicase domains of DDX25 and DDX41. Together with prior knowledge this enables a family-wide comparative structural analysis. We propose a general mechanism for opening of the RNA binding site. This analysis also provides insights into the diversity of DExD/H- proteins, with implications for understanding the functions of individual family members. PMID:20941364
Ricin - inhibitor design. Annual report, 15 April 1994-14 April 1995

DOE Office of Scientific and Technical Information (OSTI.GOV)

Schramm, V.L.

1995-05-14

Substrates for ricin A-chain include short RNA stem-loop structures which have been synthesized with radioactive labels for ease of catalytic assay and for kinetic isotope effects. Ricin A-chain from several sources is incapable of completing multiple catalytic cycles using these substrates. A family of ricin substrate analogue molecules have been synthesized and tested which are specific for transition states with oxycarbonium character or for enzymatic mechanisms involving protonation of the adenine leaving group. Formycin analogues were incorporated into RNA oligomeric structures and tested for binding to ricin A-chain or as inhibitors of the ricin-inactivation of in vitro translation using rabbitmore » reticulocyte lysates. Ribo-oxycarbonium ion analogues containing iminoribitol analogues of ribose were synthetically incorporated into RNA oligomeric structures. Neither formycin nor ribo-oxycarbonium analogues, either singly or in RNA oligomers caused significant inhibition of ricin A-chain when assayed in reticulocyte lysate translation assays. The results indicate a novel transition state mechanism for ricin A-chain, or a requirement for additional features of 28s rRNA to bind transition state analogues.« less
SU-E-T-338: Ultrastable PRNA 3WJ Nanoparticles as Potential I-125 and C-131 Carriers for Targeted Radiation Therapy

DOE Office of Scientific and Technical Information (OSTI.GOV)

Luo, W; Li, H; Guo, P

2014-06-01

Purpose: To study the feasibility of using the pRNA 3WJ nanoparticles to carry I-125 or Cs-131 to target and treat cancer. As the first step, we investigated the stabilities of pRNA 3WJ nanoparticles that are essential for cancer targeting and treatment in this study. Methods: The thermodynamic stability of assembled RNA 3WJ nanoparticles was studied using the TGGE system. The nanoparticles were irradiated with I-125 or Cs-131 radioactive sources that were immersed in the RNA nanoparticle/DNA structure sample liquid contained in a small vial. The irradiation of the RNA samples was performed for different time periods and doses. The purposemore » was to distinguish the effects of radiation on DNA and RNA structures. Unradiated samples were used as control. Results: RNA nanoparticles were formed by mixing three pieces of oligos, 3WJa, 3WJb, and 3WJc at 1:1:1 molar ratio. Figure 4 demonstrates that 2′-F modified 3WJ nanoparticles remained stable at temperatures as high as 66.8 ± 2°C, and exhibited melting temperatures of 71 ± 2°C. The radiation stability test was performed with I- 125 and Cs-131 irradiation. Several DNA structures including plasmids were included as control. The first test introduced I-125 and a low dose of 1 Gy to both RNA and DNA samples, but no change was observed. When the dose was increased to 30 Gy, DNA was damaged while RNA remained unchanged. Three tests were also conducted with Cs-131 with 7 Gy, 21 Gy, 30 Gy, and 89 Gy, and the results were similar to those with I-125. Conclusion: pRNA 3WJ nanoparticles are able to form efficiently by onepot self-assembly. They remained stable at high temperatures and high therapeutic doses over a long time. These unique features suggest that RNA 3WJ nanoparticles have the potential to be used for targeted radiation therapy for cancer treatment.« less
Type and Level of RMRP Functional Impairment Predicts Phenotype in the Cartilage Hair Hypoplasia–Anauxetic Dysplasia Spectrum

PubMed Central

Thiel, Christian T. ; Mortier, Geert ; Kaitila, Ilkka ; Reis, André ; Rauch, Anita

2007-01-01

Mutations in the RMRP gene lead to a wide spectrum of autosomal recessive skeletal dysplasias, ranging from the milder phenotypes metaphyseal dysplasia without hypotrichosis and cartilage hair hypoplasia (CHH) to the severe anauxetic dysplasia (AD). This clinical spectrum includes different degrees of short stature, hair hypoplasia, defective erythrogenesis, and immunodeficiency. The RMRP gene encodes the untranslated RNA component of the mitochondrial RNA–processing ribonuclease, RNase MRP. We recently demonstrated that mutations may affect both messenger RNA (mRNA) and ribosomal RNA (rRNA) cleavage and thus cell-cycle regulation and protein synthesis. To investigate the genotype-phenotype correlation, we analyzed the position and the functional effect of 13 mutations in patients with variable features of the CHH-AD spectrum. Those at the end of the spectrum include a novel patient with anauxetic dysplasia who was compound heterozygous for the null mutation g.254_263delCTCAGCGCGG and the mutation g.195C→T, which was previously described in patients with milder phenotypes. Mapping of nucleotide conservation to the two-dimensional structure of the RMRP gene revealed that disease-causing mutations either affect evolutionarily conserved nucleotides or are likely to alter secondary structure through mispairing in stem regions. In vitro testing of RNase MRP multiprotein-specific mRNA and rRNA cleavage of different mutations revealed a strong correlation between the decrease in rRNA cleavage in ribosomal assembly and the degree of bone dysplasia, whereas reduced mRNA cleavage, and thus cell-cycle impairment, predicts the presence of hair hypoplasia, immunodeficiency, and hematological abnormalities and thus increased cancer risk. PMID:17701897
Secondary Structure Predictions for Long RNA Sequences Based on Inversion Excursions and MapReduce.

PubMed

Yehdego, Daniel T; Zhang, Boyu; Kodimala, Vikram K R; Johnson, Kyle L; Taufer, Michela; Leung, Ming-Ying

2013-05-01

Secondary structures of ribonucleic acid (RNA) molecules play important roles in many biological processes including gene expression and regulation. Experimental observations and computing limitations suggest that we can approach the secondary structure prediction problem for long RNA sequences by segmenting them into shorter chunks, predicting the secondary structures of each chunk individually using existing prediction programs, and then assembling the results to give the structure of the original sequence. The selection of cutting points is a crucial component of the segmenting step. Noting that stem-loops and pseudoknots always contain an inversion, i.e., a stretch of nucleotides followed closely by its inverse complementary sequence, we developed two cutting methods for segmenting long RNA sequences based on inversion excursions: the centered and optimized method. Each step of searching for inversions, chunking, and predictions can be performed in parallel. In this paper we use a MapReduce framework, i.e., Hadoop, to extensively explore meaningful inversion stem lengths and gap sizes for the segmentation and identify correlations between chunking methods and prediction accuracy. We show that for a set of long RNA sequences in the RFAM database, whose secondary structures are known to contain pseudoknots, our approach predicts secondary structures more accurately than methods that do not segment the sequence, when the latter predictions are possible computationally. We also show that, as sequences exceed certain lengths, some programs cannot computationally predict pseudoknots while our chunking methods can. Overall, our predicted structures still retain the accuracy level of the original prediction programs when compared with known experimental secondary structure.
An empirical strategy to detect bacterial transcript structure from directional RNA-seq transcriptome data.

PubMed

Wang, Yejun; MacKenzie, Keith D; White, Aaron P

2015-05-07

As sequencing costs are being lowered continuously, RNA-seq has gradually been adopted as the first choice for comparative transcriptome studies with bacteria. Unlike microarrays, RNA-seq can directly detect cDNA derived from mRNA transcripts at a single nucleotide resolution. Not only does this allow researchers to determine the absolute expression level of genes, but it also conveys information about transcript structure. Few automatic software tools have yet been established to investigate large-scale RNA-seq data for bacterial transcript structure analysis. In this study, 54 directional RNA-seq libraries from Salmonella serovar Typhimurium (S. Typhimurium) 14028s were examined for potential relationships between read mapping patterns and transcript structure. We developed an empirical method, combined with statistical tests, to automatically detect key transcript features, including transcriptional start sites (TSSs), transcriptional termination sites (TTSs) and operon organization. Using our method, we obtained 2,764 TSSs and 1,467 TTSs for 1331 and 844 different genes, respectively. Identification of TSSs facilitated further discrimination of 215 putative sigma 38 regulons and 863 potential sigma 70 regulons. Combining the TSSs and TTSs with intergenic distance and co-expression information, we comprehensively annotated the operon organization in S. Typhimurium 14028s. Our results show that directional RNA-seq can be used to detect transcriptional borders at an acceptable resolution of ±10-20 nucleotides. Technical limitations of the RNA-seq procedure may prevent single nucleotide resolution. The automatic transcript border detection methods, statistical models and operon organization pipeline that we have described could be widely applied to RNA-seq studies in other bacteria. Furthermore, the TSSs, TTSs, operons, promoters and unstranslated regions that we have defined for S. Typhimurium 14028s may constitute valuable resources that can be used for comparative analyses with other Salmonella serotypes.

Multisubunit DNA-Dependent RNA Polymerases from Vaccinia Virus and Other Nucleocytoplasmic Large-DNA Viruses: Impressions from the Age of Structure.

PubMed

Mirzakhanyan, Yeva; Gershon, Paul D

2017-09-01

The past 17 years have been marked by a revolution in our understanding of cellular multisubunit DNA-dependent RNA polymerases (MSDDRPs) at the structural level. A parallel development over the past 15 years has been the emerging story of the giant viruses, which encode MSDDRPs. Here we link the two in an attempt to understand the specialization of multisubunit RNA polymerases in the domain of life encompassing the large nucleocytoplasmic DNA viruses (NCLDV), a superclade that includes the giant viruses and the biochemically well-characterized poxvirus vaccinia virus. The first half of this review surveys the recently determined structural biology of cellular RNA polymerases for a microbiology readership. The second half discusses a reannotation of MSDDRP subunits from NCLDV families and the apparent specialization of these enzymes by virus family and by subunit with regard to subunit or domain loss, subunit dissociability, endogenous control of polymerase arrest, and the elimination/customization of regulatory interactions that would confer higher-order cellular control. Some themes are apparent in linking subunit function to structure in the viral world: as with cellular RNA polymerases I and III and unlike cellular RNA polymerase II, the viral enzymes seem to opt for speed and processivity and seem to have eliminated domains associated with higher-order regulation. The adoption/loss of viral RNA polymerase proofreading functions may have played a part in matching intrinsic mutability to genome size. Copyright © 2017 American Society for Microbiology.
Molecular basis for the interaction between Integrator subunits IntS9 and IntS11 and its functional importance.

PubMed

Wu, Yixuan; Albrecht, Todd R; Baillat, David; Wagner, Eric J; Tong, Liang

2017-04-25

The metazoan Integrator complex (INT) has important functions in the 3'-end processing of noncoding RNAs, including the uridine-rich small nuclear RNA (UsnRNA) and enhancer RNA (eRNA), and in the transcription of coding genes by RNA polymerase II. The INT contains at least 14 subunits, but its molecular mechanism of action is poorly understood, because currently there is little structural information about its subunits. The endonuclease activity of INT is mediated by its subunit 11 (IntS11), which belongs to the metallo-β-lactamase superfamily and is a paralog of CPSF-73, the endonuclease for pre-mRNA 3'-end processing. IntS11 forms a stable complex with Integrator complex subunit 9 (IntS9) through their C-terminal domains (CTDs). Here, we report the crystal structure of the IntS9-IntS11 CTD complex at 2.1-Å resolution and detailed, structure-based biochemical and functional studies. The complex is composed of a continuous nine-stranded β-sheet with four strands from IntS9 and five from IntS11. Highly conserved residues are located in the extensive interface between the two CTDs. Yeast two-hybrid assays and coimmunoprecipitation experiments confirm the structural observations on the complex. Functional studies demonstrate that the IntS9-IntS11 interaction is crucial for the role of INT in snRNA 3'-end processing.
Ab initio RNA folding by discrete molecular dynamics: From structure prediction to folding mechanisms

PubMed Central

Ding, Feng; Sharma, Shantanu; Chalasani, Poornima; Demidov, Vadim V.; Broude, Natalia E.; Dokholyan, Nikolay V.

2008-01-01

RNA molecules with novel functions have revived interest in the accurate prediction of RNA three-dimensional (3D) structure and folding dynamics. However, existing methods are inefficient in automated 3D structure prediction. Here, we report a robust computational approach for rapid folding of RNA molecules. We develop a simplified RNA model for discrete molecular dynamics (DMD) simulations, incorporating base-pairing and base-stacking interactions. We demonstrate correct folding of 150 structurally diverse RNA sequences. The majority of DMD-predicted 3D structures have <4 Å deviations from experimental structures. The secondary structures corresponding to the predicted 3D structures consist of 94% native base-pair interactions. Folding thermodynamics and kinetics of tRNAPhe, pseudoknots, and mRNA fragments in DMD simulations are in agreement with previous experimental findings. Folding of RNA molecules features transient, non-native conformations, suggesting non-hierarchical RNA folding. Our method allows rapid conformational sampling of RNA folding, with computational time increasing linearly with RNA length. We envision this approach as a promising tool for RNA structural and functional analyses. PMID:18456842
Conserved and divergent features of the structure and function of La and La-related proteins (LARPs).

PubMed

Bayfield, Mark A; Yang, Ruiqing; Maraia, Richard J

2010-01-01

Genuine La proteins contain two RNA binding motifs, a La motif (LAM) followed by a RNA recognition motif (RRM), arranged in a unique way to bind RNA. These proteins interact with an extensive variety of cellular RNAs and exhibit activities in two broad categories: i) to promote the metabolism of nascent pol III transcripts, including precursor-tRNAs, by binding to their common, UUU-3'OH containing ends, and ii) to modulate the translation of certain mRNAs involving an unknown binding mechanism. Characterization of several La-RNA crystal structures as well as biochemical studies reveal insight into their unique two-motif domain architecture and how the LAM recognizes UUU-3'OH while the RRM binds other parts of a pre-tRNA. Recent studies of members of distinct families of conserved La-related proteins (LARPs) indicate that some of these harbor activity related to genuine La proteins, suggesting that their UUU-3'OH binding mode has been appropriated for the assembly and regulation of a specific snRNP (e.g., 7SK snRNP assembly by hLARP7/PIP7S). Analyses of other LARP family members suggest more diverged RNA binding modes and specialization for cytoplasmic mRNA-related functions. Thus it appears that while genuine La proteins exhibit broad general involvement in both snRNA-related and mRNA-related functions, different LARP families may have evolved specialized activities in either snRNA or mRNA-related functions. In this review, we summarize recent progress that has led to greater understanding of the structure and function of La proteins and their roles in tRNA processing and RNP assembly dynamics, as well as progress on the different LARPs.
Structure and reconstitution of yeast Mpp6-nuclear exosome complexes reveals that Mpp6 stimulates RNA decay and recruits the Mtr4 helicase.

PubMed

Wasmuth, Elizabeth V; Zinder, John C; Zattas, Dimitrios; Das, Mom; Lima, Christopher D

2017-07-25

Nuclear RNA exosomes catalyze a range of RNA processing and decay activities that are coordinated in part by cofactors, including Mpp6, Rrp47, and the Mtr4 RNA helicase. Mpp6 interacts with the nine-subunit exosome core, while Rrp47 stabilizes the exoribonuclease Rrp6 and recruits Mtr4, but it is less clear if these cofactors work together. Using biochemistry with Saccharomyces cerevisiae proteins, we show that Rrp47 and Mpp6 stimulate exosome-mediated RNA decay, albeit with unique dependencies on elements within the nuclear exosome. Mpp6-exosomes can recruit Mtr4, while Mpp6 and Rrp47 each contribute to Mtr4-dependent RNA decay, with maximal Mtr4-dependent decay observed with both cofactors. The 3.3 Å structure of a twelve-subunit nuclear Mpp6 exosome bound to RNA shows the central region of Mpp6 bound to the exosome core, positioning its Mtr4 recruitment domain next to Rrp6 and the exosome central channel. Genetic analysis reveals interactions that are largely consistent with our model.
Viral replication. Structural basis for RNA replication by the hepatitis C virus polymerase.

PubMed

Appleby, Todd C; Perry, Jason K; Murakami, Eisuke; Barauskas, Ona; Feng, Joy; Cho, Aesop; Fox, David; Wetmore, Diana R; McGrath, Mary E; Ray, Adrian S; Sofia, Michael J; Swaminathan, S; Edwards, Thomas E

2015-02-13

Nucleotide analog inhibitors have shown clinical success in the treatment of hepatitis C virus (HCV) infection, despite an incomplete mechanistic understanding of NS5B, the viral RNA-dependent RNA polymerase. Here we study the details of HCV RNA replication by determining crystal structures of stalled polymerase ternary complexes with enzymes, RNA templates, RNA primers, incoming nucleotides, and catalytic metal ions during both primed initiation and elongation of RNA synthesis. Our analysis revealed that highly conserved active-site residues in NS5B position the primer for in-line attack on the incoming nucleotide. A β loop and a C-terminal membrane-anchoring linker occlude the active-site cavity in the apo state, retract in the primed initiation assembly to enforce replication of the HCV genome from the 3' terminus, and vacate the active-site cavity during elongation. We investigated the incorporation of nucleotide analog inhibitors, including the clinically active metabolite formed by sofosbuvir, to elucidate key molecular interactions in the active site. Copyright © 2015, American Association for the Advancement of Science.
The True Story and Advantages of RNA Phage Capsids as Nanotools.

PubMed

Pumpens, Paul; Renhofa, Regina; Dishlers, Andris; Kozlovska, Tatjana; Ose, Velta; Pushko, Peter; Tars, Kaspars; Grens, Elmars; Bachmann, Martin F

2016-01-01

RNA phages are often used as prototypes for modern recombinant virus-like particle (VLP) technologies. Icosahedral RNA phage VLPs can be formed from coat proteins (CPs) and are efficiently produced in bacteria and yeast. Both genetic fusion and chemical coupling have been successfully used for the production of numerous chimeras based on RNA phage VLPs. In this review, we describe advances in RNA phage VLP technology along with the history of the Leviviridae family, including its taxonomical organization, genomic structure, and important role in the development of molecular biology. Comparative 3D structures of different RNA phage VLPs are used to explain the level of VLP tolerance to foreign elements displayed on VLP surfaces. We also summarize data that demonstrate the ability of CPs to tolerate different organic (peptides, oligonucleotides, and carbohydrates) and inorganic (metal ions) compounds either chemically coupled or noncovalently added to the outer and/or inner surfaces of VLPs. Finally, we present lists of nanotechnological RNA phage VLP applications, such as experimental vaccines constructed by genetic fusion and chemical coupling methodologies, nanocontainers for targeted drug delivery, and bioimaging tools. © 2016 S. Karger AG, Basel.
Structural imprints in vivo decode RNA regulatory mechanisms

PubMed Central

Spitale, Robert C.; Flynn, Ryan A.; Zhang, Qiangfeng Cliff; Crisalli, Pete; Lee, Byron; Jung, Jong-Wha; Kuchelmeister, Hannes Y.; Batista, Pedro J.; Torre, Eduardo A.; Kool, Eric T.; Chang, Howard Y.

2015-01-01

Visualizing the physical basis for molecular behavior inside living cells is a grand challenge in biology. RNAs are central to biological regulation, and RNA’s ability to adopt specific structures intimately controls every step of the gene expression program1. However, our understanding of physiological RNA structures is limited; current in vivo RNA structure profiles view only two of four nucleotides that make up RNA2,3. Here we present a novel biochemical approach, In Vivo Click SHAPE (icSHAPE), that enables the first global view of RNA secondary structures of all four bases in living cells. icSHAPE of mouse embryonic stem cell transcriptome versus purified RNA folded in vitro shows that the structural dynamics of RNA in the cellular environment distinguishes different classes of RNAs and regulatory elements. Structural signatures at translational start sites and ribosome pause sites are conserved from in vitro, suggesting that these RNA elements are programmed by sequence. In contrast, focal structural rearrangements in vivo reveal precise interfaces of RNA with RNA binding proteins or RNA modification sites that are consistent with atomic-resolution structural data. Such dynamic structural footprints enable accurate prediction of RNA-protein interactions and N6-methyladenosine (m6A) modification genome-wide. These results open the door for structural genomics of RNA in living cells and reveal key physiological structures controlling gene expression. PMID:25799993
RNA polymerase pausing and nascent RNA structure formation are linked through clamp domain movement

PubMed Central

Hein, Pyae P.; Kolb, Kellie E.; Windgassen, Tricia; Bellecourt, Michael J.; Darst, Seth A.; Mooney, Rachel A.; Landick, Robert

2014-01-01

The rates of RNA synthesis and nascent RNA folding into biologically active structures are linked via pausing by RNA polymerase (RNAP). Structures that form within the RNA exit channel can increase pausing by interacting with bacterial RNAP or decrease pausing by preventing backtracking. Conversely, pausing is required for proper folding of some RNAs. Opening of the RNAP clamp domain is proposed to mediate some effects of nascent RNA structures. However, the connections among RNA structure formation, clamp movement, and catalytic activity remain uncertain. We assayed exit-channel structure formation in Escherichia coli RNAP together with disulfide crosslinks that favor closed or open clamp conformations and found that clamp position directly influences RNA structure formation and catalytic activity. We report that exit-channel RNA structures slow pause escape by favoring clamp opening and through interactions with the flap that slow translocation. PMID:25108353
Improved Model for Predicting the Free Energy Contribution of Dinucleotide Bulges to RNA Duplex Stability.

PubMed

Tomcho, Jeremy C; Tillman, Magdalena R; Znosko, Brent M

2015-09-01

Predicting the secondary structure of RNA is an intermediate in predicting RNA three-dimensional structure. Commonly, determining RNA secondary structure from sequence uses free energy minimization and nearest neighbor parameters. Current algorithms utilize a sequence-independent model to predict free energy contributions of dinucleotide bulges. To determine if a sequence-dependent model would be more accurate, short RNA duplexes containing dinucleotide bulges with different sequences and nearest neighbor combinations were optically melted to derive thermodynamic parameters. These data suggested energy contributions of dinucleotide bulges were sequence-dependent, and a sequence-dependent model was derived. This model assigns free energy penalties based on the identity of nucleotides in the bulge (3.06 kcal/mol for two purines, 2.93 kcal/mol for two pyrimidines, 2.71 kcal/mol for 5'-purine-pyrimidine-3', and 2.41 kcal/mol for 5'-pyrimidine-purine-3'). The predictive model also includes a 0.45 kcal/mol penalty for an A-U pair adjacent to the bulge and a -0.28 kcal/mol bonus for a G-U pair adjacent to the bulge. The new sequence-dependent model results in predicted values within, on average, 0.17 kcal/mol of experimental values, a significant improvement over the sequence-independent model. This model and new experimental values can be incorporated into algorithms that predict RNA stability and secondary structure from sequence.
High-resolution NMR structures of the domains of Saccharomyces cerevisiae Tho1

DOE Office of Scientific and Technical Information (OSTI.GOV)

Jacobsen, Julian O. B.; Allen, Mark D.; Freund, Stefan M. V.

2016-05-23

In this study, high-resolution structures of both the N-terminal DNA-binding SAP domain and the C-terminal RNA-binding domain of S. cerevisiae Tho1 have been determined. THO is a multi-protein complex involved in the formation of messenger ribonuclear particles (mRNPs) by coupling transcription with mRNA processing and export. THO is thought to be formed from five subunits, Tho2p, Hpr1p, Tex1p, Mft1p and Thp2p, and recent work has determined a low-resolution structure of the complex [Poulsen et al. (2014 ▸), PLoS One, 9, e103470]. A number of additional proteins are thought to be involved in the formation of mRNP in yeast, including Tho1,more » which has been shown to bind RNA in vitro and is recruited to actively transcribed chromatin in vivo in a THO-complex and RNA-dependent manner. Tho1 is known to contain a SAP domain at the N-terminus, but the ability to suppress the expression defects of the hpr1Δ mutant of THO was shown to reside in the RNA-binding C-terminal region. In this study, high-resolution structures of both the N-terminal DNA-binding SAP domain and C-terminal RNA-binding domain have been determined.« less
Solenopsis invicta virus 3: mapping of structural proteins, ribosomal frameshifting, and similarities to Acyrthosiphon pisum virus and Kelp fly virus.

PubMed

Valles, Steven M; Bell, Susanne; Firth, Andrew E

2014-01-01

Solenopsis invicta virus 3 (SINV-3) is a positive-sense single-stranded RNA virus that infects the red imported fire ant, Solenopsis invicta. We show that the second open reading frame (ORF) of the dicistronic genome is expressed via a frameshifting mechanism and that the sequences encoding the structural proteins map to both ORF2 and the 3' end of ORF1, downstream of the sequence that encodes the RNA-dependent RNA polymerase. The genome organization and structural protein expression strategy resemble those of Acyrthosiphon pisum virus (APV), an aphid virus. The capsid protein that is encoded by the 3' end of ORF1 in SINV-3 and APV is predicted to have a jelly-roll fold similar to the capsid proteins of picornaviruses and caliciviruses. The capsid-extension protein that is produced by frameshifting, includes the jelly-roll fold domain encoded by ORF1 as its N-terminus, while the C-terminus encoded by the 5' half of ORF2 has no clear homology with other viral structural proteins. A third protein, encoded by the 3' half of ORF2, is associated with purified virions at sub-stoichiometric ratios. Although the structural proteins can be translated from the genomic RNA, we show that SINV-3 also produces a subgenomic RNA encoding the structural proteins. Circumstantial evidence suggests that APV may also produce such a subgenomic RNA. Both SINV-3 and APV are unclassified picorna-like viruses distantly related to members of the order Picornavirales and the family Caliciviridae. Within this grouping, features of the genome organization and capsid domain structure of SINV-3 and APV appear more similar to caliciviruses, perhaps suggesting the basis for a "Calicivirales" order.
RNA2DMut: a web tool for the design and analysis of RNA structure mutations.

PubMed

Moss, Walter N

2018-03-01

With the widespread application of high-throughput sequencing, novel RNA sequences are being discovered at an astonishing rate. The analysis of function, however, lags behind. In both the cis - and trans -regulatory functions of RNA, secondary structure (2D base-pairing) plays essential regulatory roles. In order to test RNA function, it is essential to be able to design and analyze mutations that can affect structure. This was the motivation for the creation of the RNA2DMut web tool. With RNA2DMut, users can enter in RNA sequences to analyze, constrain mutations to specific residues, or limit changes to purines/pyrimidines. The sequence is analyzed at each base to determine the effect of every possible point mutation on 2D structure. The metrics used in RNA2DMut rely on the calculation of the Boltzmann structure ensemble and do not require a robust 2D model of RNA structure for designing mutations. This tool can facilitate a wide array of uses involving RNA: for example, in designing and evaluating mutants for biological assays, interrogating RNA-protein interactions, identifying key regions to alter in SELEX experiments, and improving RNA folding and crystallization properties for structural biology. Additional tools are available to help users introduce other mutations (e.g., indels and substitutions) and evaluate their effects on RNA structure. Example calculations are shown for five RNAs that require 2D structure for their function: the MALAT1 mascRNA, an influenza virus splicing regulatory motif, the EBER2 viral noncoding RNA, the Xist lncRNA repA region, and human Y RNA 5. RNA2DMut can be accessed at https://rna2dmut.bb.iastate.edu/. © 2018 Moss; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Design, implementation and evaluation of a practical pseudoknot folding algorithm based on thermodynamics

PubMed Central

Reeder, Jens; Giegerich, Robert

2004-01-01

Background The general problem of RNA secondary structure prediction under the widely used thermodynamic model is known to be NP-complete when the structures considered include arbitrary pseudoknots. For restricted classes of pseudoknots, several polynomial time algorithms have been designed, where the O(n6)time and O(n4) space algorithm by Rivas and Eddy is currently the best available program. Results We introduce the class of canonical simple recursive pseudoknots and present an algorithm that requires O(n4) time and O(n2) space to predict the energetically optimal structure of an RNA sequence, possible containing such pseudoknots. Evaluation against a large collection of known pseudoknotted structures shows the adequacy of the canonization approach and our algorithm. Conclusions RNA pseudoknots of medium size can now be predicted reliably as well as efficiently by the new algorithm. PMID:15294028
The crystal structure of Zika virus NS5 reveals conserved drug targets.

PubMed

Duan, Wenqian; Song, Hao; Wang, Haiyuan; Chai, Yan; Su, Chao; Qi, Jianxun; Shi, Yi; Gao, George F

2017-04-03

Zika virus (ZIKV) has emerged as major health concern, as ZIKV infection has been shown to be associated with microcephaly, severe neurological disease and possibly male sterility. As the largest protein component within the ZIKV replication complex, NS5 plays key roles in the life cycle and survival of the virus through its N-terminal methyltransferase (MTase) and C-terminal RNA-dependent RNA polymerase (RdRp) domains. Here, we present the crystal structures of ZIKV NS5 MTase in complex with an RNA cap analogue ( m7 GpppA) and the free NS5 RdRp. We have identified the conserved features of ZIKV NS5 MTase and RdRp structures that could lead to development of current antiviral inhibitors being used against flaviviruses, including dengue virus and West Nile virus, to treat ZIKV infection. These results should inform and accelerate the structure-based design of antiviral compounds against ZIKV. © 2017 The Authors.
The Structure of the RNA-Dependent RNA Polymerase of a Permutotetravirus Suggests a Link between Primer-Dependent and Primer-Independent Polymerases

PubMed Central

Ferrero, Diego S.; Buxaderas, Mònica; Rodríguez, José F.; Verdaguer, Núria

2015-01-01

Thosea asigna virus (TaV), an insect virus belonging to the Permutatetraviridae family, has a positive-sense single-stranded RNA (ssRNA) genome with two overlapping open reading frames, encoding for the replicase and capsid proteins. The particular TaV replicase includes a structurally unique RNA-dependent RNA polymerase (RdRP) with a sequence permutation in the palm sub-domain, where the active site is anchored. This non-canonical arrangement of the RdRP palm is also found in double-stranded RNA viruses of the Birnaviridae family. Both virus families also share a conserved VPg sequence motif at the polymerase N-terminus which in birnaviruses appears to be used to covalently link a fraction of the replicase molecules to the 5’-end of the genomic segments. Birnavirus VPgs are presumed to be used as primers for replication initiation. Here we have solved the crystal structure of the TaV RdRP, the first non-canonical RdRP of a ssRNA virus, in its apo- form and bound to different substrates. The enzyme arranges as a stable dimer maintained by mutual interactions between the active site cleft of one molecule and the flexible N-terminal tail of the symmetrically related RdRP. The latter, partially mimicking the RNA template backbone, is involved in regulating the polymerization activity. As expected from previous sequence-based bioinformatics predictions, the overall architecture of the TaV enzyme shows important resemblances with birnavirus polymerases. In addition, structural comparisons and biochemical analyses reveal unexpected similarities between the TaV RdRP and those of Flaviviruses. In particular, a long loop protruding from the thumb domain towards the central enzyme cavity appears to act as a platform for de novo initiation of RNA replication. Our findings strongly suggest an unexpected evolutionary relationship between the RdRPs encoded by these distant ssRNA virus groups. PMID:26625123
Structural computational modeling of RNA aptamers.

PubMed

Xu, Xiaojun; Dickey, David D; Chen, Shi-Jie; Giangrande, Paloma H

2016-07-01

RNA aptamers represent an emerging class of biologics that can be easily adapted for personalized and precision medicine. Several therapeutic aptamers with desirable binding and functional properties have been developed and evaluated in preclinical studies over the past 25years. However, for the majority of these aptamers, their clinical potential has yet to be realized. A significant hurdle to the clinical adoption of this novel class of biologicals is the limited information on their secondary and tertiary structure. Knowledge of the RNA's structure would greatly facilitate and expedite the post-selection optimization steps required for translation, including truncation (to reduce costs of manufacturing), chemical modification (to enhance stability and improve safety) and chemical conjugation (to improve drug properties for combinatorial therapy). Here we describe a structural computational modeling methodology that when coupled to a standard functional assay, can be used to determine key sequence and structural motifs of an RNA aptamer. We applied this methodology to enable the truncation of an aptamer to prostate specific membrane antigen (PSMA) with great potential for targeted therapy that had failed previous truncation attempts. This methodology can be easily applied to optimize other aptamers with therapeutic potential. Copyright © 2016. Published by Elsevier Inc.
Structure of a bacterial RNA polymerase holoenzyme open promoter complex

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bae, Brian; Feklistov, Andrey; Lass-Napiorkowska, Agnieszka

2015-09-08

Initiation of transcription is a primary means for controlling gene expression. In bacteria, the RNA polymerase (RNAP) holoenzyme binds and unwinds promoter DNA, forming the transcription bubble of the open promoter complex (RPo). We have determined crystal structures, refined to 4.14 Å-resolution, of RPo containing Thermus aquaticus RNAP holoenzyme and promoter DNA that includes the full transcription bubble. The structures, combined with biochemical analyses, reveal key features supporting the formation and maintenance of the double-strand/single-strand DNA junction at the upstream edge of the -10 element where bubble formation initiates. The results also reveal RNAP interactions with duplex DNA just upstreammore » of the -10 element and potential protein/DNA interactions that direct the DNA template strand into the RNAP active site. Addition of an RNA primer to yield a 4 base-pair post-translocated RNA:DNA hybrid mimics an initially transcribing complex at the point where steric clash initiates abortive initiation and σA dissociation.« less
Structure of a bacterial RNA polymerase holoenzyme open promoter complex

DOE PAGES

Bae, Brian; Feklistov, Andrey; Lass-Napiorkowska, Agnieszka; ...

2015-09-08

Initiation of transcription is a primary means for controlling gene expression. In bacteria, the RNA polymerase (RNAP) holoenzyme binds and unwinds promoter DNA, forming the transcription bubble of the open promoter complex (RPo). We have determined crystal structures, refined to 4.14 Å-resolution, of RPo containing Thermus aquaticus RNAP holoenzyme and promoter DNA that includes the full transcription bubble. The structures, combined with biochemical analyses, reveal key features supporting the formation and maintenance of the double-strand/single-strand DNA junction at the upstream edge of the -10 element where bubble formation initiates. The results also reveal RNAP interactions with duplex DNA just upstreammore » of the -10 element and potential protein/DNA interactions that direct the DNA template strand into the RNAP active site. Additionally a RNA primer to yield a 4 base-pair post-translocated RNA:DNA hybrid mimics an initially transcribing complex at the point where steric clash initiates abortive initiation and σ A dissociation.« less
Computational study of RNA folding kinetics and thermodynamics

NASA Astrophysics Data System (ADS)

Morgan, Steven Robert

RNA in its many forms is involved in the processes of protein manufacture, gene splicing, catalysis and gene regulation. It is also the store of genetic information in some viruses. The function of the RNA is determined by its structure, and it is the purpose of this thesis to investigate kinetic and thermodynamic properties of RNA secondary structures in order to obtain a better understanding of their formation and function. Our main tenet is that kinetic formation of RNA structure is necessary to explain features found in natural RNA structures, as well as aspects of the biological function of RNA. Firstly we show that examination of the energies of fragments of RNA secondary structure provides evidence for kinetic formation of structure. Local regions of RNA of length less than about 100 nucleotides adopt a conformation with energy near or equal to the minimum possible for those regions, whilst the energies of larger domains are much further from the their respective minima. This is consistent with the patterns that would be expected if RNA structure is folded Idneticatic during transcription. A Monte-Carlo algorithm is then used to model the kinetic folding of RNA during transcriptional growth. The algorithm is capable of finding the correct structure of a natural RNA for which the minimum free energy approach is unsuccessful. In the viral phage MS2 Idneticatic formed RNA structure plays an important role in the regulation of gene expression. The folding algorithm can accurately model this by IdneticaUy controlling access to the gene initiation region. The algorithm is also successfully used to model the control of replication in the ColEl plasmid. Taking a different approach, we then use a simplified model of RNA secondary structure to investigate the size of energy barriers between degenerate minimum energy structures. This model has much in common with physical systems such as spin glasses, and in fact shows similar behaviour to these systems in that energy barriers between structures grow quickly with the length of the RNA sequence. These barriers will serve to trap RNA in non-optimal structures. Together these studies demonstrate the necessity of studying RNA secondary structure from a kinetic point of view, and provide clear directions in which further work may be taken. Kinetic models of RNA secondary structure should continue to prove useful in modelling the structure and function of RNA.

Analyses of the radiation of birnaviruses from diverse host phyla and of their evolutionary affinities with other double-stranded RNA and positive strand RNA viruses using robust structure-based multiple sequence alignments and advanced phylogenetic methods

PubMed Central

2013-01-01

Background Birnaviruses form a distinct family of double-stranded RNA viruses infecting animals as different as vertebrates, mollusks, insects and rotifers. With such a wide host range, they constitute a good model for studying the adaptation to the host. Additionally, several lines of evidence link birnaviruses to positive strand RNA viruses and suggest that phylogenetic analyses may provide clues about transition. Results We characterized the genome of a birnavirus from the rotifer Branchionus plicalitis. We used X-ray structures of RNA-dependent RNA polymerases and capsid proteins to obtain multiple structure alignments that allowed us to obtain reliable multiple sequence alignments and we employed “advanced” phylogenetic methods to study the evolutionary relationships between some positive strand and double-stranded RNA viruses. We showed that the rotifer birnavirus genome exhibited an organization remarkably similar to other birnaviruses. As this host was phylogenetically very distant from the other known species targeted by birnaviruses, we revisited the evolutionary pathways within the Birnaviridae family using phylogenetic reconstruction methods. We also applied a number of phylogenetic approaches based on structurally conserved domains/regions of the capsid and RNA-dependent RNA polymerase proteins to study the evolutionary relationships between birnaviruses, other double-stranded RNA viruses and positive strand RNA viruses. Conclusions We show that there is a good correlation between the phylogeny of the birnaviruses and that of their hosts at the phylum level using the RNA-dependent RNA polymerase (genomic segment B) on the one hand and a concatenation of the capsid protein, protease and ribonucleoprotein (genomic segment A) on the other hand. This correlation tends to vanish within phyla. The use of advanced phylogenetic methods and robust structure-based multiple sequence alignments allowed us to obtain a more accurate picture (in terms of probability of the tree topologies) of the evolutionary affinities between double-stranded RNA and positive strand RNA viruses. In particular, we were able to show that there exists a good statistical support for the claims that dsRNA viruses are not monophyletic and that viruses with permuted RdRps belong to a common evolution lineage as previously proposed by other groups. We also propose a tree topology with a good statistical support describing the evolutionary relationships between the Picornaviridae, Caliciviridae, Flaviviridae families and a group including the Alphatetraviridae, Nodaviridae, Permutotretraviridae, Birnaviridae, and Cystoviridae families. PMID:23865988
Quantifying the relationship between sequence and three-dimensional structure conservation in RNA

PubMed Central

2010-01-01

Background In recent years, the number of available RNA structures has rapidly grown reflecting the increased interest on RNA biology. Similarly to the studies carried out two decades ago for proteins, which gave the fundamental grounds for developing comparative protein structure prediction methods, we are now able to quantify the relationship between sequence and structure conservation in RNA. Results Here we introduce an all-against-all sequence- and three-dimensional (3D) structure-based comparison of a representative set of RNA structures, which have allowed us to quantitatively confirm that: (i) there is a measurable relationship between sequence and structure conservation that weakens for alignments resulting in below 60% sequence identity, (ii) evolution tends to conserve more RNA structure than sequence, and (iii) there is a twilight zone for RNA homology detection. Discussion The computational analysis here presented quantitatively describes the relationship between sequence and structure for RNA molecules and defines a twilight zone region for detecting RNA homology. Our work could represent the theoretical basis and limitations for future developments in comparative RNA 3D structure prediction. PMID:20550657
rCAD: A Novel Database Schema for the Comparative Analysis of RNA.

PubMed

Ozer, Stuart; Doshi, Kishore J; Xu, Weijia; Gutell, Robin R

2011-12-31

Beyond its direct involvement in protein synthesis with mRNA, tRNA, and rRNA, RNA is now being appreciated for its significance in the overall metabolism and regulation of the cell. Comparative analysis has been very effective in the identification and characterization of RNA molecules, including the accurate prediction of their secondary structure. We are developing an integrative scalable data management and analysis system, the RNA Comparative Analysis Database (rCAD), implemented with SQL Server to support RNA comparative analysis. The platformagnostic database schema of rCAD captures the essential relationships between the different dimensions of information for RNA comparative analysis datasets. The rCAD implementation enables a variety of comparative analysis manipulations with multiple integrated data dimensions for advanced RNA comparative analysis workflows. In this paper, we describe details of the rCAD schema design and illustrate its usefulness with two usage scenarios.
rCAD: A Novel Database Schema for the Comparative Analysis of RNA

PubMed Central

Ozer, Stuart; Doshi, Kishore J.; Xu, Weijia; Gutell, Robin R.

2013-01-01

Beyond its direct involvement in protein synthesis with mRNA, tRNA, and rRNA, RNA is now being appreciated for its significance in the overall metabolism and regulation of the cell. Comparative analysis has been very effective in the identification and characterization of RNA molecules, including the accurate prediction of their secondary structure. We are developing an integrative scalable data management and analysis system, the RNA Comparative Analysis Database (rCAD), implemented with SQL Server to support RNA comparative analysis. The platformagnostic database schema of rCAD captures the essential relationships between the different dimensions of information for RNA comparative analysis datasets. The rCAD implementation enables a variety of comparative analysis manipulations with multiple integrated data dimensions for advanced RNA comparative analysis workflows. In this paper, we describe details of the rCAD schema design and illustrate its usefulness with two usage scenarios. PMID:24772454
An efficient algorithm for planar drawing of RNA structures with pseudoknots of any type.

PubMed

Byun, Yanga; Han, Kyungsook

2016-06-01

An RNA pseudoknot is a tertiary structural element in which bases of a loop pair with complementary bases are outside the loop. A drawing of RNA secondary structures is a tree, but a drawing of RNA pseudoknots is a graph that has an inner cycle within a pseudoknot and possibly outer cycles formed between the pseudoknot and other structural elements. Visualizing a large-scale RNA structure with pseudoknots as a planar drawing is challenging because a planar drawing of an RNA structure requires both pseudoknots and an entire structure enclosing the pseudoknots to be embedded into a plane without overlapping or crossing. This paper presents an efficient heuristic algorithm for visualizing a pseudoknotted RNA structure as a planar drawing. The algorithm consists of several parts for finding crossing stems and page mapping the stems, for the layout of stem-loops and pseudoknots, and for overlap detection between structural elements and resolving it. Unlike previous algorithms, our algorithm generates a planar drawing for a large RNA structure with pseudoknots of any type and provides a bracket view of the structure. It generates a compact and aesthetic structure graph for a large pseudoknotted RNA structure in O([Formula: see text]) time, where n is the number of stems of the RNA structure.
Solution nuclear magnetic resonance analyses of the anticodon arms of proteinogenic and nonproteinogenic tRNA(Gly).

PubMed

Chang, Andrew T; Nikonowicz, Edward P

2012-05-01

Although the fate of most tRNA molecules in the cell is aminoacylation and delivery to the ribosome, some tRNAs are destined to fulfill other functional roles. In addition to their central role in translation, tRNA molecules participate in processes such as regulation of gene expression, bacterial cell wall biosynthesis, viral replication, antibiotic biosynthesis, and suppression of alternative splicing. In bacteria, glycyl-tRNA molecules with anticodon sequences GCC and UCC exhibit multiple extratranslational functions, including transcriptional regulation and cell wall biosynthesis. We have determined the high-resolution structures of three glycyl-tRNA anticodon arms with anticodon sequences GCC and UCC. Two of the tRNA molecules are proteinogenic (tRNA(Gly,GCC) and tRNA(Gly,UCC)), and the third is nonproteinogenic (np-tRNA(Gly,UCC)) and participates in cell wall biosynthesis. The UV-monitored thermal melting curves show that the anticodon arm of tRNA(Gly,UCC) with a loop-closing C-A(+) base pair melts at a temperature 10 °C lower than those of tRNA(Gly,GCC) and np-tRNA(Gly,UCC). U-A and C-G pairs close the loops of the latter two molecules and enhance stem stability. Mg(2+) stabilizes the tRNA(Gly,UCC) anticodon arm and reduces the T(m) differential. The structures of the three tRNA(Gly) anticodon arms exhibit small differences among one another, but none of them form the classical U-turn motif. The anticodon loop of tRNA(Gly,GCC) becomes more dynamic and disordered in the presence of multivalent cations, whereas metal ion coordination in the anticodon loops of tRNA(Gly,UCC) and np-tRNA(Gly,UCC) establishes conformational homogeneity. The conformational similarity of the molecules is greater than their functional differences might suggest. Because aminoacylation of full-length tRNA molecules is accomplished by one tRNA synthetase, the similar structural context of the loop may facilitate efficient recognition of each of the anticodon sequences.
HCV IRES domain IIb affects the configuration of coding RNA in the 40S subunit's decoding groove

PubMed Central

Filbin, Megan E.; Kieft, Jeffrey S.

2011-01-01

Hepatitis C virus (HCV) uses a structured internal ribosome entry site (IRES) RNA to recruit the translation machinery to the viral RNA and begin protein synthesis without the ribosomal scanning process required for canonical translation initiation. Different IRES structural domains are used in this process, which begins with direct binding of the 40S ribosomal subunit to the IRES RNA and involves specific manipulation of the translational machinery. We have found that upon initial 40S subunit binding, the stem–loop domain of the IRES that contains the start codon unwinds and adopts a stable configuration within the subunit's decoding groove. This configuration depends on the sequence and structure of a different stem–loop domain (domain IIb) located far from the start codon in sequence, but spatially proximal in the IRES•40S complex. Mutation of domain IIb results in misconfiguration of the HCV RNA in the decoding groove that includes changes in the placement of the AUG start codon, and a substantial decrease in the ability of the IRES to initiate translation. Our results show that two distal regions of the IRES are structurally communicating at the initial step of 40S subunit binding and suggest that this is an important step in driving protein synthesis. PMID:21606179
HCV IRES domain IIb affects the configuration of coding RNA in the 40S subunit's decoding groove.

PubMed

Filbin, Megan E; Kieft, Jeffrey S

2011-07-01

Hepatitis C virus (HCV) uses a structured internal ribosome entry site (IRES) RNA to recruit the translation machinery to the viral RNA and begin protein synthesis without the ribosomal scanning process required for canonical translation initiation. Different IRES structural domains are used in this process, which begins with direct binding of the 40S ribosomal subunit to the IRES RNA and involves specific manipulation of the translational machinery. We have found that upon initial 40S subunit binding, the stem-loop domain of the IRES that contains the start codon unwinds and adopts a stable configuration within the subunit's decoding groove. This configuration depends on the sequence and structure of a different stem-loop domain (domain IIb) located far from the start codon in sequence, but spatially proximal in the IRES•40S complex. Mutation of domain IIb results in misconfiguration of the HCV RNA in the decoding groove that includes changes in the placement of the AUG start codon, and a substantial decrease in the ability of the IRES to initiate translation. Our results show that two distal regions of the IRES are structurally communicating at the initial step of 40S subunit binding and suggest that this is an important step in driving protein synthesis.
Solution structure of a GAAA tetraloop receptor RNA.

PubMed Central

Butcher, S E; Dieckmann, T; Feigon, J

1997-01-01

The GAAA tetraloop receptor is an 11-nucleotide RNA sequence that participates in the tertiary folding of a variety of large catalytic RNAs by providing a specific binding site for GAAA tetraloops. Here we report the solution structure of the isolated tetraloop receptor as solved by multidimensional, heteronuclear magnetic resonance spectroscopy. The internal loop of the tetraloop receptor has three adenosines stacked in a cross-strand or zipper-like fashion. This arrangement produces a high degree of base stacking within the asymmetric internal loop without extrahelical bases or kinking the helix. Additional interactions within the internal loop include a U. U mismatch pair and a G.U wobble pair. A comparison with the crystal structure of the receptor RNA bound to its tetraloop shows that a conformational change has to occur upon tetraloop binding, which is in good agreement with previous biochemical data. A model for an alternative binding site within the receptor is proposed based on the NMR structure, phylogenetic data and previous crystallographic structures of tetraloop interactions. PMID:9405377
Enterovirus 71 induces anti-viral stress granule-like structures in RD cells.

PubMed

Zhu, Yuanmei; Wang, Bei; Huang, He; Zhao, Zhendong

2016-08-05

Stress granules (SGs) are dynamic cytoplasmic granules formed in response to a variety of stresses, including viral infection. Several viruses can modulate the formation of SG with different effects, but the relationship between SG formation and EV71 infection is poorly understood. In this study, we report that EV71 inhibits canonical SGs formation in infected cells and induces the formation of novel RNA granules that were distinguished from canonical SGs in composition and morphology, which we termed 'SG like structures'. Our results also demonstrated that EV71 triggered formation of SG-like structures is dependent on PKR and eIF2α phosphorylation and requires ongoing cellular mRNA synthesis. Finally, we found that SG-like structures are antiviral RNA granules that promote cellular apoptosis and suppress EV71 propagation. Taken together, our findings explain the formation mechanism of SG-like structures induced by EV71 and shed light on virus-host interaction and molecular mechanism underlying EV71 pathogenesis. Copyright © 2016. Published by Elsevier Inc.
Computational strategies for the automated design of RNA nanoscale structures from building blocks using NanoTiler.

PubMed

Bindewald, Eckart; Grunewald, Calvin; Boyle, Brett; O'Connor, Mary; Shapiro, Bruce A

2008-10-01

One approach to designing RNA nanoscale structures is to use known RNA structural motifs such as junctions, kissing loops or bulges and to construct a molecular model by connecting these building blocks with helical struts. We previously developed an algorithm for detecting internal loops, junctions and kissing loops in RNA structures. Here we present algorithms for automating or assisting many of the steps that are involved in creating RNA structures from building blocks: (1) assembling building blocks into nanostructures using either a combinatorial search or constraint satisfaction; (2) optimizing RNA 3D ring structures to improve ring closure; (3) sequence optimisation; (4) creating a unique non-degenerate RNA topology descriptor. This effectively creates a computational pipeline for generating molecular models of RNA nanostructures and more specifically RNA ring structures with optimized sequences from RNA building blocks. We show several examples of how the algorithms can be utilized to generate RNA tecto-shapes.
Computational strategies for the automated design of RNA nanoscale structures from building blocks using NanoTiler☆

PubMed Central

Bindewald, Eckart; Grunewald, Calvin; Boyle, Brett; O’Connor, Mary; Shapiro, Bruce A.

2013-01-01

One approach to designing RNA nanoscale structures is to use known RNA structural motifs such as junctions, kissing loops or bulges and to construct a molecular model by connecting these building blocks with helical struts. We previously developed an algorithm for detecting internal loops, junctions and kissing loops in RNA structures. Here we present algorithms for automating or assisting many of the steps that are involved in creating RNA structures from building blocks: (1) assembling building blocks into nanostructures using either a combinatorial search or constraint satisfaction; (2) optimizing RNA 3D ring structures to improve ring closure; (3) sequence optimisation; (4) creating a unique non-degenerate RNA topology descriptor. This effectively creates a computational pipeline for generating molecular models of RNA nanostructures and more specifically RNA ring structures with optimized sequences from RNA building blocks. We show several examples of how the algorithms can be utilized to generate RNA tecto-shapes. PMID:18838281
Chloroplast- or Mitochondria-Targeted DEAD-Box RNA Helicases Play Essential Roles in Organellar RNA Metabolism and Abiotic Stress Responses

PubMed Central

Nawaz, Ghazala; Kang, Hunseung

2017-01-01

The yields and productivity of crops are greatly diminished by various abiotic stresses, including drought, cold, heat, and high salinity. Chloroplasts and mitochondria are cellular organelles that can sense diverse environmental stimuli and alter gene expression to cope with adverse environmental stresses. Organellar gene expression is mainly regulated at posttranscriptional levels, including RNA processing, intron splicing, RNA editing, RNA turnover, and translational control, during which a variety of nucleus-encoded RNA-binding proteins (RBPs) are targeted to chloroplasts or mitochondria where they play essential roles in organellar RNA metabolism. DEAD-box RNA helicases (RHs) are enzymes that can alter RNA structures and affect RNA metabolism in all living organisms. Although a number of DEAD-box RHs have been found to play important roles in RNA metabolism in the nucleus and cytoplasm, our understanding on the roles of DEAD-box RHs in the regulation of RNA metabolism in chloroplasts and mitochondria is only at the beginning. Considering that organellar RNA metabolism and gene expression are tightly regulated by anterograde signaling from the nucleus, it is imperative to determine the functions of nucleus-encoded organellar RBPs. In this review, we summarize the emerging roles of nucleus-encoded chloroplast- or mitochondria-targeted DEAD-box RHs in organellar RNA metabolism and plant response to diverse abiotic stresses. PMID:28596782
Comparative study of topological indices of macro/supramolecular RNA complex networks.

PubMed

Agüero-Chapín, Guillermín; Antunes, Agostinho; Ubeira, Florencio M; Chou, Kuo-Chen; González-Díaz, Humberto

2008-11-01

RNA function annotation is often based on alignment to a previously studied template. In contrast to the study of proteins, there are not many alignment-free methods to predict RNA functions if alignment fails. The use of topological indices (TIs) of RNA complex networks (CNs) to find quantitative structure-activity relationships (QSAR) may be an alternative to incorporate secondary structure or sequence-to-sequence similarity. Here, we introduce new QSAR-like techniques using RNA macromolecular CNs (mmCNs), where nodes are nucleotides, or RNA supramolecular CNs (smCNs), where nodes are RNA sequences. We studied a data set of 198 sequences including 18S-rRNAs (important phylogenetic molecular biomarkers). We constructed three types of RNA mmCNs: sequence-linear (SL), Cartesian-lattice (CL), and sequence-folding CNs (SF-CNs) and two smCNs: sequence-sequence disagreement CN (SSD) and sequence-sequence similarity (SSS-smCN). We reported the first comparative QSAR study with all these CIs and CNs, which includes: (i) spectral moments ( ( i )micro d ( w)) of SL-mmCNs (accuracy = 75.3%), (ii) electrostatic CIs (xi d ) of CL-mmCNs (>90%), (iii) thermodynamic parameters (Delta G, Delta H, Delta S, and T m) of SF-mmCNs (64.7%), (iv) disagreement-distribution moments ( M k ) of the SSD-smCN (79.3%), and (v) node centralities of the SSD-smCN (78.0%). Furthermore, we reported the experimental isolation of a new RNA sequence from Psidum guajava leaf tissue and its QSAR and BLAST prediction to illustrate the practical use of these methods. We also investigated the use of these CNs to explore rRNA diversity on bacteria, plants, and parasites from the Dactylogyrus genus. The HPL-mmCNs model was the best of all found. All the CNs and TIs, except SF-mmCNs, were introduced here by the first time for the QSAR study of RNA, which allowed a comparative study for RNA classification.
Arabidopsis DRB4, AGO1, AGO7, and RDR6 participate in a DCL4-initiated antiviral RNA silencing pathway negatively regulated by DCL1.

PubMed

Qu, Feng; Ye, Xiaohong; Morris, T Jack

2008-09-23

Plant RNA silencing machinery enlists four primary classes of proteins to achieve sequence-specific regulation of gene expression and mount an antiviral defense. These include Dicer-like ribonucleases (DCLs), Argonaute proteins (AGOs), dsRNA-binding proteins (DRBs), and RNA-dependent RNA polymerases (RDRs). Although at least four distinct endogenous RNA silencing pathways have been thoroughly characterized, a detailed understanding of the antiviral RNA silencing pathway is just emerging. In this report, we have examined the role of four DCLs, two AGOs, one DRB, and one RDR in controlling viral RNA accumulation in infected Arabidopsis plants by using a mutant virus lacking its silencing suppressor. Our results show that all four DCLs contribute to antiviral RNA silencing. We confirm previous reports implicating both DCL4 and DCL2 in this process and establish a minor role for DCL3. Surprisingly, we found that DCL1 represses antiviral RNA silencing through negatively regulating the expression of DCL4 and DCL3. We also implicate DRB4 in antiviral RNA silencing. Finally, we show that both AGO1 and AGO7 function to ensure efficient clearance of viral RNAs and establish that AGO1 is capable of targeting viral RNAs with more compact structures, whereas AGO7 and RDR6 favor less structured RNA targets. Our results resolve several key steps in the antiviral RNA silencing pathway and provide a basis for further in-depth analysis.
Base pairing and structural insights into the 5-formylcytosine in RNA duplex

PubMed Central

Wang, Rui; Luo, Zhipu; He, Kaizhang; Delaney, Michael O.; Chen, Doris; Sheng, Jia

2016-01-01

Abstract 5-Formylcytidine (f5C), a previously discovered natural nucleotide in the mitochondrial tRNA of many species including human, has been recently detected as the oxidative product of 5-methylcytidine (m5C) through 5-hydroxymethylcytidine (hm5C) in total RNA of mammalian cells. The discovery indicated that these cytosine derivatives in RNA might also play important epigenetic roles similar as in DNA, which has been intensively investigated in the past few years. In this paper, we studied the base pairing specificity of f5C in different RNA duplex contexts. We found that the 5-formyl group could increase duplex thermal stability and enhance base pairing specificity. We present three high-resolution crystal structures of an octamer RNA duplex [5′-GUA(f5C)GUAC-3′]2 that have been solved under three crystallization conditions with different buffers and pH values. Our results showed that the 5-formyl group is located in the same plane as the cytosine base and forms an intra-residue hydrogen bond with the amino group in the N4 position. In addition, this modification increases the base stacking between the f5C and the neighboring bases while not causing significant global and local structure perturbations. This work provides insights into the effects of 5-formylcytosine on RNA duplex. PMID:27079978
Structure-seq2: sensitive and accurate genome-wide profiling of RNA structure in vivo

PubMed Central

Ritchey, Laura E.; Su, Zhao; Tang, Yin; Tack, David C.

2017-01-01

Abstract RNA serves many functions in biology such as splicing, temperature sensing, and innate immunity. These functions are often determined by the structure of RNA. There is thus a pressing need to understand RNA structure and how it changes during diverse biological processes both in vivo and genome-wide. Here, we present Structure-seq2, which provides nucleotide-resolution RNA structural information in vivo and genome-wide. This optimized version of our original Structure-seq method increases sensitivity by at least 4-fold and improves data quality by minimizing formation of a deleterious by-product, reducing ligation bias, and improving read coverage. We also present a variation of Structure-seq2 in which a biotinylated nucleotide is incorporated during reverse transcription, which greatly facilitates the protocol by eliminating two PAGE purification steps. We benchmark Structure-seq2 on both mRNA and rRNA structure in rice (Oryza sativa). We demonstrate that Structure-seq2 can lead to new biological insights. Our Structure-seq2 datasets uncover hidden breaks in chloroplast rRNA and identify a previously unreported N1-methyladenosine (m1A) in a nuclear-encoded Oryza sativa rRNA. Overall, Structure-seq2 is a rapid, sensitive, and unbiased method to probe RNA in vivo and genome-wide that facilitates new insights into RNA biology. PMID:28637286
Sequence-structure relationships in RNA loops: establishing the basis for loop homology modeling.

PubMed

Schudoma, Christian; May, Patrick; Nikiforova, Viktoria; Walther, Dirk

2010-01-01

The specific function of RNA molecules frequently resides in their seemingly unstructured loop regions. We performed a systematic analysis of RNA loops extracted from experimentally determined three-dimensional structures of RNA molecules. A comprehensive loop-structure data set was created and organized into distinct clusters based on structural and sequence similarity. We detected clear evidence of the hallmark of homology present in the sequence-structure relationships in loops. Loops differing by <25% in sequence identity fold into very similar structures. Thus, our results support the application of homology modeling for RNA loop model building. We established a threshold that may guide the sequence divergence-based selection of template structures for RNA loop homology modeling. Of all possible sequences that are, under the assumption of isosteric relationships, theoretically compatible with actual sequences observed in RNA structures, only a small fraction is contained in the Rfam database of RNA sequences and classes implying that the actual RNA loop space may consist of a limited number of unique loop structures and conserved sequences. The loop-structure data sets are made available via an online database, RLooM. RLooM also offers functionalities for the modeling of RNA loop structures in support of RNA engineering and design efforts.
Dawn of the in vivo RNA structurome and interactome.

PubMed

Kwok, Chun Kit

2016-10-15

RNA is one of the most fascinating biomolecules in living systems given its structural versatility to fold into elaborate architectures for important biological functions such as gene regulation, catalysis, and information storage. Knowledge of RNA structures and interactions can provide deep insights into their functional roles in vivo For decades, RNA structural studies have been conducted on a transcript-by-transcript basis. The advent of next-generation sequencing (NGS) has enabled the development of transcriptome-wide structural probing methods to profile the global landscape of RNA structures and interactions, also known as the RNA structurome and interactome, which transformed our understanding of the RNA structure-function relationship on a transcriptomic scale. In this review, molecular tools and NGS methods used for RNA structure probing are presented, novel insights uncovered by RNA structurome and interactome studies are highlighted, and perspectives on current challenges and potential future directions are discussed. A more complete understanding of the RNA structures and interactions in vivo will help illuminate the novel roles of RNA in gene regulation, development, and diseases. © 2016 The Author(s); published by Portland Press Limited on behalf of the Biochemical Society.
Nanoscale platforms for messenger RNA delivery.

PubMed

Li, Bin; Zhang, Xinfu; Dong, Yizhou

2018-05-04

Messenger RNA (mRNA) has become a promising class of drugs for diverse therapeutic applications in the past few years. A series of clinical trials are ongoing or will be initiated in the near future for the treatment of a variety of diseases. Currently, mRNA-based therapeutics mainly focuses on ex vivo transfection and local administration in clinical studies. Efficient and safe delivery of therapeutically relevant mRNAs remains one of the major challenges for their broad applications in humans. Thus, effective delivery systems are urgently needed to overcome this limitation. In recent years, numerous nanoscale biomaterials have been constructed for mRNA delivery in order to protect mRNA from extracellular degradation and facilitate endosomal escape after cellular uptake. Nanoscale platforms have expanded the feasibility of mRNA-based therapeutics, and enabled its potential applications to protein replacement therapy, cancer immunotherapy, therapeutic vaccines, regenerative medicine, and genome editing. This review focuses on recent advances, challenges, and future directions in nanoscale platforms designed for mRNA delivery, including lipid and lipid-derived nanoparticles, polymer-based nanoparticles, protein derivatives mRNA complexes, and other types of nanomaterials. This article is categorized under: Nanotechnology Approaches to Biology > Nanoscale Systems in Biology Biology-Inspired Nanomaterials > Lipid-Based Structures Biology-Inspired Nanomaterials > Nucleic Acid-Based Structures. © 2018 Wiley Periodicals, Inc.

Posttranscriptional modification of tRNA in thermophilic archaea (Archaebacteria).

PubMed Central

Edmonds, C G; Crain, P F; Gupta, R; Hashizume, T; Hocart, C H; Kowalak, J A; Pomerantz, S C; Stetter, K O; McCloskey, J A

1991-01-01

Nucleoside modification has been studied in unfractionated tRNA from 11 thermophilic archaea (archaebacteria), including phylogenetically diverse representatives of thermophilic methanogens and sulfur-metabolizing hyperthermophiles which grow optimally in the temperature range of 56 (Thermoplasma acidophilum) to 105 degrees C (Pyrodictium occultum), and for comparison from the most thermophilic bacterium (eubacterium) known, Thermotoga maritima (80 degrees C). Nine nucleosides are found to be unique to the archaea, six of which are structurally novel in being modified both in the base and by methylation in ribose and occur primarily in tRNA from the extreme thermophiles in the Crenarchaeota of the archaeal phylogenetic tree. 2-Thiothymine occurs in tRNA from Thermococcus sp., and constitutes the only known occurrence of the thymine moiety in archaeal RNA, in contrast to its near-ubiquitous presence in tRNA from bacteria and eukarya. A total of 33 modified nucleosides are rigorously characterized in archaeal tRNA in the present study, demonstrating that the structural range of posttranscriptional modifications in archaeal tRNA is more extensive than previously known. From a phylogenetic standpoint, certain tRNA modifications occur in the archaea which are otherwise unique to either the bacterial or eukaryal domain, although the overall patterns of modification are more typical of eukaryotes than bacteria. PMID:1708763
Diverse activities of viral cis-acting RNA regulatory elements revealed using multicolor, long-term, single-cell imaging

PubMed Central

Pocock, Ginger M.; Zimdars, Laraine L.; Yuan, Ming; Eliceiri, Kevin W.; Ahlquist, Paul; Sherer, Nathan M.

2017-01-01

Cis-acting RNA structural elements govern crucial aspects of viral gene expression. How these structures and other posttranscriptional signals affect RNA trafficking and translation in the context of single cells is poorly understood. Herein we describe a multicolor, long-term (>24 h) imaging strategy for measuring integrated aspects of viral RNA regulatory control in individual cells. We apply this strategy to demonstrate differential mRNA trafficking behaviors governed by RNA elements derived from three retroviruses (HIV-1, murine leukemia virus, and Mason-Pfizer monkey virus), two hepadnaviruses (hepatitis B virus and woodchuck hepatitis virus), and an intron-retaining transcript encoded by the cellular NXF1 gene. Striking behaviors include “burst” RNA nuclear export dynamics regulated by HIV-1’s Rev response element and the viral Rev protein; transient aggregations of RNAs into discrete foci at or near the nuclear membrane triggered by multiple elements; and a novel, pulsiform RNA export activity regulated by the hepadnaviral posttranscriptional regulatory element. We incorporate single-cell tracking and a data-mining algorithm into our approach to obtain RNA element–specific, high-resolution gene expression signatures. Together these imaging assays constitute a tractable, systems-based platform for studying otherwise difficult to access spatiotemporal features of viral and cellular gene regulation. PMID:27903772
nRC: non-coding RNA Classifier based on structural features.

PubMed

Fiannaca, Antonino; La Rosa, Massimo; La Paglia, Laura; Rizzo, Riccardo; Urso, Alfonso

2017-01-01

Non-coding RNA (ncRNA) are small non-coding sequences involved in gene expression regulation of many biological processes and diseases. The recent discovery of a large set of different ncRNAs with biologically relevant roles has opened the way to develop methods able to discriminate between the different ncRNA classes. Moreover, the lack of knowledge about the complete mechanisms in regulative processes, together with the development of high-throughput technologies, has required the help of bioinformatics tools in addressing biologists and clinicians with a deeper comprehension of the functional roles of ncRNAs. In this work, we introduce a new ncRNA classification tool, nRC (non-coding RNA Classifier). Our approach is based on features extraction from the ncRNA secondary structure together with a supervised classification algorithm implementing a deep learning architecture based on convolutional neural networks. We tested our approach for the classification of 13 different ncRNA classes. We obtained classification scores, using the most common statistical measures. In particular, we reach an accuracy and sensitivity score of about 74%. The proposed method outperforms other similar classification methods based on secondary structure features and machine learning algorithms, including the RNAcon tool that, to date, is the reference classifier. nRC tool is freely available as a docker image at https://hub.docker.com/r/tblab/nrc/. The source code of nRC tool is also available at https://github.com/IcarPA-TBlab/nrc.
Evolutionary Origin and Conserved Structural Building Blocks of Riboswitches and Ribosomal RNAs: Riboswitches as Probable Target Sites for Aminoglycosides Interaction.

PubMed

Mehdizadeh Aghdam, Elnaz; Barzegar, Abolfazl; Hejazi, Mohammad Saeid

2014-01-01

Riboswitches, as noncoding RNA sequences, control gene expression through direct ligand binding. Sporadic reports on the structural relation of riboswitches with ribosomal RNAs (rRNA), raises an interest in possible similarity between riboswitches and rRNAs evolutionary origins. Since aminoglycoside antibiotics affect microbial cells through binding to functional sites of the bacterial rRNA, finding any conformational and functional relation between riboswitches/rRNAs is utmost important in both of medicinal and basic research. Analysis of the riboswitches structures were carried out using bioinformatics and computational tools. The possible functional similarity of riboswitches with rRNAs was evaluated based on the affinity of paromomycin antibiotic (targeting "A site" of 16S rRNA) to riboswitches via docking method. There was high structural similarity between riboswitches and rRNAs, but not any particular sequence based similarity between them was found. The building blocks including "hairpin loop containing UUU", "peptidyl transferase center conserved hairpin A loop"," helix 45" and "S2 (G8) hairpin" as high identical rRNA motifs were detected in all kinds of riboswitches. Surprisingly, binding energies of paromomycin with different riboswitches are considerably better than the binding energy of paromomycin with "16S rRNA A site". Therefore the high affinity of paromomycin to bind riboswitches in comparison with rRNA "A site" suggests a new insight about riboswitches as possible targets for aminoglycoside antibiotics. These findings are considered as a possible supporting evidence for evolutionary origin of riboswitches/rRNAs and also their role in the exertion of antibiotics effects to design new drugs based on the concomitant effects via rRNA/riboswitches.
Prostate Cell Specific Regulation of Androgen Receptor Phosphorylation in Vivo

DTIC Science & Technology

2009-11-01

includes both Rpb5, a subunit shared by RNA polymerase (Pol) I, II , and III, and the corepressor, Unconventional prefoldin Rpb5-Interactor (URI/C19orf2...complex that contains RNA polymerase II subunit 5, a subunit shared by all three RNA polymerases; unconventional prefoldin RPB5-in- teractor (URI), which...sequence of ART-27 is conserved throughout evolution from worms to humans and its predicted protein structure is homologous to the prefoldin -a family of
RNA 3D Structural Motifs: Definition, Identification, Annotation, and Database Searching

NASA Astrophysics Data System (ADS)

Nasalean, Lorena; Stombaugh, Jesse; Zirbel, Craig L.; Leontis, Neocles B.

Structured RNA molecules resemble proteins in the hierarchical organization of their global structures, folding and broad range of functions. Structured RNAs are composed of recurrent modular motifs that play specific functional roles. Some motifs direct the folding of the RNA or stabilize the folded structure through tertiary interactions. Others bind ligands or proteins or catalyze chemical reactions. Therefore, it is desirable, starting from the RNA sequence, to be able to predict the locations of recurrent motifs in RNA molecules. Conversely, the potential occurrence of one or more known 3D RNA motifs may indicate that a genomic sequence codes for a structured RNA molecule. To identify known RNA structural motifs in new RNA sequences, precise structure-based definitions are needed that specify the core nucleotides of each motif and their conserved interactions. By comparing instances of each recurrent motif and applying base pair isosteriCity relations, one can identify neutral mutations that preserve its structure and function in the contexts in which it occurs.
The ancient history of the structure of ribonuclease P and the early origins of Archaea

PubMed Central

2010-01-01

Background Ribonuclease P is an ancient endonuclease that cleaves precursor tRNA and generally consists of a catalytic RNA subunit (RPR) and one or more proteins (RPPs). It represents an important macromolecular complex and model system that is universally distributed in life. Its putative origins have inspired fundamental hypotheses, including the proposal of an ancient RNA world. Results To study the evolution of this complex, we constructed rooted phylogenetic trees of RPR molecules and substructures and estimated RPP age using a cladistic method that embeds structure directly into phylogenetic analysis. The general approach was used previously to study the evolution of tRNA, SINE RNA and 5S rRNA, the origins of metabolism, and the evolution and complexity of the protein world, and revealed here remarkable evolutionary patterns. Trees of molecules uncovered the tripartite nature of life and the early origin of archaeal RPRs. Trees of substructures showed molecules originated in stem P12 and were accessorized with a catalytic P1-P4 core structure before the first substructure was lost in Archaea. This core currently interacts with RPPs and ancient segments of the tRNA molecule. Finally, a census of protein domain structure in hundreds of genomes established RPPs appeared after the rise of metabolic enzymes at the onset of the protein world. Conclusions The study provides a detailed account of the history and early diversification of a fundamental ribonucleoprotein and offers further evidence in support of the existence of a tripartite organismal world that originated by the segregation of archaeal lineages from an ancient community of primordial organisms. PMID:20334683
RNA Cap Methyltransferase Activity Assay

PubMed Central

Trotman, Jackson B.; Schoenberg, Daniel R.

2018-01-01

Methyltransferases that methylate the guanine-N7 position of the mRNA 5′ cap structure are ubiquitous among eukaryotes and commonly encoded by viruses. Here we provide a detailed protocol for the biochemical analysis of RNA cap methyltransferase activity of biological samples. This assay involves incubation of cap-methyltransferase-containing samples with a [32P]G-capped RNA substrate and S-adenosylmethionine (SAM) to produce RNAs with N7-methylated caps. The extent of cap methylation is then determined by P1 nuclease digestion, thin-layer chromatography (TLC), and phosphorimaging. The protocol described here includes additional steps for generating the [32P]G-capped RNA substrate and for preparing nuclear and cytoplasmic extracts from mammalian cells. This assay is also applicable to analyzing the cap methyltransferase activity of other biological samples, including recombinant protein preparations and fractions from analytical separations and immunoprecipitation/pulldown experiments. PMID:29644259
A Sponge-like Structure Involved in the Association and Transport of Maternal Products during Drosophila Oogenesis

PubMed Central

Wilsch-Bräuninger, Michaela; Schwarz, Heinz; Nüsslein-Volhard, Christiane

1997-01-01

Localization of maternally provided RNAs during oogenesis is required for formation of the antero–posterior axis of the Drosophila embryo. Here we describe a subcellular structure in nurse cells and oocytes which may function as an intracellular compartment for assembly and transport of maternal products involved in RNA localization. This structure, which we have termed “sponge body,” consists of ER-like cisternae, embedded in an amorphous electron-dense mass. It lacks a surrounding membrane and is frequently associated with mitochondria. The sponge bodies are not identical to the Golgi complexes. We suggest that the sponge bodies are homologous to the mitochondrial cloud in Xenopus oocytes, a granulo-fibrillar structure that contains RNAs involved in patterning of the embryo. Exuperantia protein, the earliest factor known to be required for the localization of bicoid mRNA to the anterior pole of the Drosophila oocyte, is highly enriched in the sponge bodies but not an essential structural component of these. RNA staining indicates that sponge bodies contain RNA. However, neither the intensity of this staining nor the accumulation of Exuperantia in the sponge bodies is dependent on the amount of bicoid mRNA present in the ovaries. Sponge bodies surround nuage, a possible polar granule precursor. Microtubules and microfilaments are not present in sponge bodies, although transport of the sponge bodies through the cells is implied by their presence in cytoplasmic bridges. We propose that the sponge bodies are structures that, by assembly and transport of included molecules or associated structures, are involved in localization of mRNAs in Drosophila oocytes. PMID:9348297
Protein-RNA interface residue prediction using machine learning: an assessment of the state of the art.

PubMed

Walia, Rasna R; Caragea, Cornelia; Lewis, Benjamin A; Towfic, Fadi; Terribilini, Michael; El-Manzalawy, Yasser; Dobbs, Drena; Honavar, Vasant

2012-05-10

RNA molecules play diverse functional and structural roles in cells. They function as messengers for transferring genetic information from DNA to proteins, as the primary genetic material in many viruses, as catalysts (ribozymes) important for protein synthesis and RNA processing, and as essential and ubiquitous regulators of gene expression in living organisms. Many of these functions depend on precisely orchestrated interactions between RNA molecules and specific proteins in cells. Understanding the molecular mechanisms by which proteins recognize and bind RNA is essential for comprehending the functional implications of these interactions, but the recognition 'code' that mediates interactions between proteins and RNA is not yet understood. Success in deciphering this code would dramatically impact the development of new therapeutic strategies for intervening in devastating diseases such as AIDS and cancer. Because of the high cost of experimental determination of protein-RNA interfaces, there is an increasing reliance on statistical machine learning methods for training predictors of RNA-binding residues in proteins. However, because of differences in the choice of datasets, performance measures, and data representations used, it has been difficult to obtain an accurate assessment of the current state of the art in protein-RNA interface prediction. We provide a review of published approaches for predicting RNA-binding residues in proteins and a systematic comparison and critical assessment of protein-RNA interface residue predictors trained using these approaches on three carefully curated non-redundant datasets. We directly compare two widely used machine learning algorithms (Naïve Bayes (NB) and Support Vector Machine (SVM)) using three different data representations in which features are encoded using either sequence- or structure-based windows. Our results show that (i) Sequence-based classifiers that use a position-specific scoring matrix (PSSM)-based representation (PSSMSeq) outperform those that use an amino acid identity based representation (IDSeq) or a smoothed PSSM (SmoPSSMSeq); (ii) Structure-based classifiers that use smoothed PSSM representation (SmoPSSMStr) outperform those that use PSSM (PSSMStr) as well as sequence identity based representation (IDStr). PSSMSeq classifiers, when tested on an independent test set of 44 proteins, achieve performance that is comparable to that of three state-of-the-art structure-based predictors (including those that exploit geometric features) in terms of Matthews Correlation Coefficient (MCC), although the structure-based methods achieve substantially higher Specificity (albeit at the expense of Sensitivity) compared to sequence-based methods. We also find that the expected performance of the classifiers on a residue level can be markedly different from that on a protein level. Our experiments show that the classifiers trained on three different non-redundant protein-RNA interface datasets achieve comparable cross-validation performance. However, we find that the results are significantly affected by differences in the distance threshold used to define interface residues. Our results demonstrate that protein-RNA interface residue predictors that use a PSSM-based encoding of sequence windows outperform classifiers that use other encodings of sequence windows. While structure-based methods that exploit geometric features can yield significant increases in the Specificity of protein-RNA interface residue predictions, such increases are offset by decreases in Sensitivity. These results underscore the importance of comparing alternative methods using rigorous statistical procedures, multiple performance measures, and datasets that are constructed based on several alternative definitions of interface residues and redundancy cutoffs as well as including evaluations on independent test sets into the comparisons.
Uniqueness, Advantages, Challenges, Solutions, and Perspectives in Therapeutics Applying RNA Nanotechnology

PubMed Central

Haque, Farzin; Hallahan, Brent; Reif, Randall; Li, Hui

2012-01-01

The field of RNA nanotechnology is rapidly emerging. RNA can be manipulated with the simplicity characteristic of DNA to produce nanoparticles with a diversity of quaternary structures by self-assembly. Additionally RNA is tremendously versatile in its function and some RNA molecules display catalytic activities much like proteins. Thus, RNA has the advantage of both worlds. However, the instability of RNA has made many scientists flinch away from RNA nanotechnology. Other concerns that have deterred the progress of RNA therapeutics include the induction of interferons, stimulation of cytokines, and activation of other immune systems, as well as short pharmacokinetic profiles in vivo. This review will provide some solutions and perspectives on the chemical and thermodynamic stability, in vivo half-life and biodistribution, yield and production cost, in vivo toxicity and side effect, specific delivery and targeting, as well as endosomal trapping and escape. PMID:22913595
Uniqueness, advantages, challenges, solutions, and perspectives in therapeutics applying RNA nanotechnology.

PubMed

Guo, Peixuan; Haque, Farzin; Hallahan, Brent; Reif, Randall; Li, Hui

2012-08-01

The field of RNA nanotechnology is rapidly emerging. RNA can be manipulated with the simplicity characteristic of DNA to produce nanoparticles with a diversity of quaternary structures by self-assembly. Additionally RNA is tremendously versatile in its function and some RNA molecules display catalytic activities much like proteins. Thus, RNA has the advantage of both worlds. However, the instability of RNA has made many scientists flinch away from RNA nanotechnology. Other concerns that have deterred the progress of RNA therapeutics include the induction of interferons, stimulation of cytokines, and activation of other immune systems, as well as short pharmacokinetic profiles in vivo. This review will provide some solutions and perspectives on the chemical and thermodynamic stability, in vivo half-life and biodistribution, yield and production cost, in vivo toxicity and side effect, specific delivery and targeting, as well as endosomal trapping and escape.
Functional Information Stored in the Conserved Structural RNA Domains of Flavivirus Genomes

PubMed Central

Fernández-Sanlés, Alba; Ríos-Marco, Pablo; Romero-López, Cristina; Berzal-Herranz, Alfredo

2017-01-01

The genus Flavivirus comprises a large number of small, positive-sense single-stranded, RNA viruses able to replicate in the cytoplasm of certain arthropod and/or vertebrate host cells. The genus, which has some 70 member species, includes a number of emerging and re-emerging pathogens responsible for outbreaks of human disease around the world, such as the West Nile, dengue, Zika, yellow fever, Japanese encephalitis, St. Louis encephalitis, and tick-borne encephalitis viruses. Like other RNA viruses, flaviviruses have a compact RNA genome that efficiently stores all the information required for the completion of the infectious cycle. The efficiency of this storage system is attributable to supracoding elements, i.e., discrete, structural units with essential functions. This information storage system overlaps and complements the protein coding sequence and is highly conserved across the genus. It therefore offers interesting potential targets for novel therapeutic strategies. This review summarizes our knowledge of the features of flavivirus genome functional RNA domains. It also provides a brief overview of the main achievements reported in the design of antiviral nucleic acid-based drugs targeting functional genomic RNA elements. PMID:28421048
Synthesis of aspartyl-tRNA(Asp) in Escherichia coli--a snapshot of the second step.

PubMed Central

Eiler, S; Dock-Bregeon, A; Moulinier, L; Thierry, J C; Moras, D

1999-01-01

The 2.4 A crystal structure of the Escherichia coli aspartyl-tRNA synthetase (AspRS)-tRNA(Asp)-aspartyl-adenylate complex shows the two substrates poised for the transfer of the aspartic acid moiety from the adenylate to the 3'-hydroxyl of the terminal adenosine of the tRNA. A general molecular mechanism is proposed for the second step of the aspartylation reaction that accounts for the observed conformational changes, notably in the active site pocket. The stabilization of the transition state is mediated essentially by two amino acids: the class II invariant arginine of motif 2 and the eubacterial-specific Gln231, which in eukaryotes and archaea is replaced by a structurally non-homologous serine. Two archetypal RNA-protein modes of interactions are observed: the anticodon stem-loop, including the wobble base Q, binds to the N-terminal beta-barrel domain through direct protein-RNA interactions, while the binding of the acceptor stem involves both direct and water-mediated hydrogen bonds in an original recognition scheme. PMID:10562565
Shared Sulfur Mobilization Routes for tRNA Thiolation and Molybdenum Cofactor Biosynthesis in Prokaryotes and Eukaryotes

PubMed Central

Leimkühler, Silke; Bühning, Martin; Beilschmidt, Lena

2017-01-01

Modifications of transfer RNA (tRNA) have been shown to play critical roles in the biogenesis, metabolism, structural stability and function of RNA molecules, and the specific modifications of nucleobases with sulfur atoms in tRNA are present in pro- and eukaryotes. Here, especially the thiomodifications xm5s2U at the wobble position 34 in tRNAs for Lys, Gln and Glu, were suggested to have an important role during the translation process by ensuring accurate deciphering of the genetic code and by stabilization of the tRNA structure. The trafficking and delivery of sulfur nucleosides is a complex process carried out by sulfur relay systems involving numerous proteins, which not only deliver sulfur to the specific tRNAs but also to other sulfur-containing molecules including iron–sulfur clusters, thiamin, biotin, lipoic acid and molybdopterin (MPT). Among the biosynthesis of these sulfur-containing molecules, the biosynthesis of the molybdenum cofactor (Moco) and the synthesis of thio-modified tRNAs in particular show a surprising link by sharing protein components for sulfur mobilization in pro- and eukaryotes. PMID:28098827
Towards Long-Range RNA Structure Prediction in Eukaryotic Genes.

PubMed

Pervouchine, Dmitri D

2018-06-15

The ability to form an intramolecular structure plays a fundamental role in eukaryotic RNA biogenesis. Proximate regions in the primary transcripts fold into a local secondary structure, which is then hierarchically assembled into a tertiary structure that is stabilized by RNA-binding proteins and long-range intramolecular base pairings. While the local RNA structure can be predicted reasonably well for short sequences, long-range structure at the scale of eukaryotic genes remains problematic from the computational standpoint. The aim of this review is to list functional examples of long-range RNA structures, to summarize current comparative methods of structure prediction, and to highlight their advances and limitations in the context of long-range RNA structures. Most comparative methods implement the “first-align-then-fold” principle, i.e., they operate on multiple sequence alignments, while functional RNA structures often reside in non-conserved parts of the primary transcripts. The opposite “first-fold-then-align” approach is currently explored to a much lesser extent. Developing novel methods in both directions will improve the performance of comparative RNA structure analysis and help discover novel long-range structures, their higher-order organization, and RNA⁻RNA interactions across the transcriptome.
Alignment of RNA molecules: Binding energy and statistical properties of random sequences

DOE Office of Scientific and Technical Information (OSTI.GOV)

Valba, O. V., E-mail: valbaolga@gmail.com; Nechaev, S. K., E-mail: sergei.nechaev@gmail.com; Tamm, M. V., E-mail: thumm.m@gmail.com

2012-02-15

A new statistical approach to the problem of pairwise alignment of RNA sequences is proposed. The problem is analyzed for a pair of interacting polymers forming an RNA-like hierarchical cloverleaf structures. An alignment is characterized by the numbers of matches, mismatches, and gaps. A weight function is assigned to each alignment; this function is interpreted as a free energy taking into account both direct monomer-monomer interactions and a combinatorial contribution due to formation of various cloverleaf secondary structures. The binding free energy is determined for a pair of RNA molecules. Statistical properties are discussed, including fluctuations of the binding energymore » between a pair of RNA molecules and loop length distribution in a complex. Based on an analysis of the free energy per nucleotide pair complexes of random RNAs as a function of the number of nucleotide types c, a hypothesis is put forward about the exclusivity of the alphabet c = 4 used by nature.« less
Fluctuations between multiple EF-G-induced chimeric tRNA states during translocation on the ribosome

NASA Astrophysics Data System (ADS)

Adio, Sarah; Senyushkina, Tamara; Peske, Frank; Fischer, Niels; Wintermeyer, Wolfgang; Rodnina, Marina V.

2015-06-01

The coupled translocation of transfer RNA and messenger RNA through the ribosome entails large-scale structural rearrangements, including step-wise movements of the tRNAs. Recent structural work has visualized intermediates of translocation induced by elongation factor G (EF-G) with tRNAs trapped in chimeric states with respect to 30S and 50S ribosomal subunits. The functional role of the chimeric states is not known. Here we follow the formation of translocation intermediates by single-molecule fluorescence resonance energy transfer. Using EF-G mutants, a non-hydrolysable GTP analogue, and fusidic acid, we interfere with either translocation or EF-G release from the ribosome and identify several rapidly interconverting chimeric tRNA states on the reaction pathway. EF-G engagement prevents backward transitions early in translocation and increases the fraction of ribosomes that rapidly fluctuate between hybrid, chimeric and posttranslocation states. Thus, the engagement of EF-G alters the energetics of translocation towards a flat energy landscape, thereby promoting forward tRNA movement.
Evaluation of sequence alignments and oligonucleotide probes with respect to three-dimensional structure of ribosomal RNA using ARB software package

PubMed Central

Kumar, Yadhu; Westram, Ralf; Kipfer, Peter; Meier, Harald; Ludwig, Wolfgang

2006-01-01

Background Availability of high-resolution RNA crystal structures for the 30S and 50S ribosomal subunits and the subsequent validation of comparative secondary structure models have prompted the biologists to use three-dimensional structure of ribosomal RNA (rRNA) for evaluating sequence alignments of rRNA genes. Furthermore, the secondary and tertiary structural features of rRNA are highly useful and successfully employed in designing rRNA targeted oligonucleotide probes intended for in situ hybridization experiments. RNA3D, a program to combine sequence alignment information with three-dimensional structure of rRNA was developed. Integration into ARB software package, which is used extensively by the scientific community for phylogenetic analysis and molecular probe designing, has substantially extended the functionality of ARB software suite with 3D environment. Results Three-dimensional structure of rRNA is visualized in OpenGL 3D environment with the abilities to change the display and overlay information onto the molecule, dynamically. Phylogenetic information derived from the multiple sequence alignments can be overlaid onto the molecule structure in a real time. Superimposition of both statistical and non-statistical sequence associated information onto the rRNA 3D structure can be done using customizable color scheme, which is also applied to a textual sequence alignment for reference. Oligonucleotide probes designed by ARB probe design tools can be mapped onto the 3D structure along with the probe accessibility models for evaluation with respect to secondary and tertiary structural conformations of rRNA. Conclusion Visualization of three-dimensional structure of rRNA in an intuitive display provides the biologists with the greater possibilities to carry out structure based phylogenetic analysis. Coupled with secondary structure models of rRNA, RNA3D program aids in validating the sequence alignments of rRNA genes and evaluating probe target sites. Superimposition of the information derived from the multiple sequence alignment onto the molecule dynamically allows the researchers to observe any sequence inherited characteristics (phylogenetic information) in real-time environment. The extended ARB software package is made freely available for the scientific community via . PMID:16672074
Annealing to sequences within the primer binding site loop promotes an HIV-1 RNA conformation favoring RNA dimerization and packaging

PubMed Central

Seif, Elias; Niu, Meijuan; Kleiman, Lawrence

2013-01-01

The 5′ untranslated region (5′ UTR) of HIV-1 genomic RNA (gRNA) includes structural elements that regulate reverse transcription, transcription, translation, tRNALys3 annealing to the gRNA, and gRNA dimerization and packaging into viruses. It has been reported that gRNA dimerization and packaging are regulated by changes in the conformation of the 5′-UTR RNA. In this study, we show that annealing of tRNALys3 or a DNA oligomer complementary to sequences within the primer binding site (PBS) loop of the 5′ UTR enhances its dimerization in vitro. Structural analysis of the 5′-UTR RNA using selective 2′-hydroxyl acylation analyzed by primer extension (SHAPE) shows that the annealing promotes a conformational change of the 5′ UTR that has been previously reported to favor gRNA dimerization and packaging into virus. The model predicted by SHAPE analysis is supported by antisense experiments designed to test which annealed sequences will promote or inhibit gRNA dimerization. Based on reports showing that the gRNA dimerization favors its incorporation into viruses, we tested the ability of a mutant gRNA unable to anneal to tRNALys3 to be incorporated into virions. We found a ∼60% decrease in mutant gRNA packaging compared with wild-type gRNA. Together, these data further support a model for viral assembly in which the initial annealing of tRNALys3 to gRNA is cytoplasmic, which in turn aids in the promotion of gRNA dimerization and its incorporation into virions. PMID:23960173

3D RNA and functional interactions from evolutionary couplings

PubMed Central

Weinreb, Caleb; Riesselman, Adam; Ingraham, John B.; Gross, Torsten; Sander, Chris; Marks, Debora S.

2016-01-01

Summary Non-coding RNAs are ubiquitous, but the discovery of new RNA gene sequences far outpaces research on their structure and functional interactions. We mine the evolutionary sequence record to derive precise information about function and structure of RNAs and RNA-protein complexes. As in protein structure prediction, we use maximum entropy global probability models of sequence co-variation to infer evolutionarily constrained nucleotide-nucleotide interactions within RNA molecules, and nucleotide-amino acid interactions in RNA-protein complexes. The predicted contacts allow all-atom blinded 3D structure prediction at good accuracy for several known RNA structures and RNA-protein complexes. For unknown structures, we predict contacts in 160 non-coding RNA families. Beyond 3D structure prediction, evolutionary couplings help identify important functional interactions, e.g., at switch points in riboswitches and at a complex nucleation site in HIV. Aided by accelerating sequence accumulation, evolutionary coupling analysis can accelerate the discovery of functional interactions and 3D structures involving RNA. PMID:27087444
Rclick: a web server for comparison of RNA 3D structures.

PubMed

Nguyen, Minh N; Verma, Chandra

2015-03-15

RNA molecules play important roles in key biological processes in the cell and are becoming attractive for developing therapeutic applications. Since the function of RNA depends on its structure and dynamics, comparing and classifying the RNA 3D structures is of crucial importance to molecular biology. In this study, we have developed Rclick, a web server that is capable of superimposing RNA 3D structures by using clique matching and 3D least-squares fitting. Our server Rclick has been benchmarked and compared with other popular servers and methods for RNA structural alignments. In most cases, Rclick alignments were better in terms of structure overlap. Our server also recognizes conformational changes between structures. For this purpose, the server produces complementary alignments to maximize the extent of detectable similarity. Various examples showcase the utility of our web server for comparison of RNA, RNA-protein complexes and RNA-ligand structures. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Mutations Abrogating VP35 Interaction with Double-Stranded RNA Render Ebola Virus Avirulent in Guinea Pigs

DOE Office of Scientific and Technical Information (OSTI.GOV)

Prins, Kathleen C.; Delpeut, Sebastien; Leung, Daisy W.

2010-10-11

Ebola virus (EBOV) protein VP35 is a double-stranded RNA (dsRNA) binding inhibitor of host interferon (IFN)-{alpha}/{beta} responses that also functions as a viral polymerase cofactor. Recent structural studies identified key features, including a central basic patch, required for VP35 dsRNA binding activity. To address the functional significance of these VP35 structural features for EBOV replication and pathogenesis, two point mutations, K319A/R322A, that abrogate VP35 dsRNA binding activity and severely impair its suppression of IFN-{alpha}/{beta} production were identified. Solution nuclear magnetic resonance (NMR) spectroscopy and X-ray crystallography reveal minimal structural perturbations in the K319A/R322A VP35 double mutant and suggest that lossmore » of basic charge leads to altered function. Recombinant EBOVs encoding the mutant VP35 exhibit, relative to wild-type VP35 viruses, minimal growth attenuation in IFN-defective Vero cells but severe impairment in IFN-competent cells. In guinea pigs, the VP35 mutant virus revealed a complete loss of virulence. Strikingly, the VP35 mutant virus effectively immunized animals against subsequent wild-type EBOV challenge. These in vivo studies, using recombinant EBOV viruses, combined with the accompanying biochemical and structural analyses directly correlate VP35 dsRNA binding and IFN inhibition functions with viral pathogenesis. Moreover, these studies provide a framework for the development of antivirals targeting this critical EBOV virulence factor.« less
Localization of mRNA in vertebrate axonal compartments by in situ hybridization.

PubMed

Sotelo-Silveira, José Roberto; Calliari, Aldo; Kun, Alejandra; Elizondo, Victoria; Canclini, Lucía; Sotelo, José Roberto

2011-01-01

The conclusive demonstration of RNA in vertebrate axons by in situ hybridization (ISH) has been elusive. We review the most important reasons for difficulties, including low concentration of axonal RNAs, localization in specific cortical domains, and the need to isolate axons. We demonstrate the importance of axon micro-dissection to obtain a whole mount perspective of mRNA distribution in the axonal territory. We describe a protocol to perform fluorescent ISH in isolated axons and guidelines for the preservation of structural and molecular integrity of cortical RNA-containing domains (e.g., Periaxoplasmic Ribosomal Plaques, or PARPs) in isolated axoplasm.
RNA structures as mediators of neurological diseases and as drug targets

PubMed Central

Bernat, Viachaslau; Disney, Matthew D.

2015-01-01

RNAs adopt diverse folded structures that are essential for function and thus play critical roles in cellular biology. A striking example of this is the ribosome, a complex, three-dimensionally folded macromolecular machine that orchestrates protein synthesis. Advances in RNA biochemistry, structural and molecular biology, and bioinformatics have revealed other non-coding RNAs whose functions are dictated by their structure. It is not surprising that aberrantly folded RNA structures contribute to disease. In this review, we provide a brief introduction into RNA structural biology and then describe how RNA structures function in cells and cause or contribute to neurological disease. Finally, we highlight successful applications of rational design principles to provide chemical probes and lead compounds targeting structured RNAs. Based on several examples of well-characterized RNA-driven neurological disorders, we demonstrate how designed small molecules can facilitate study of RNA dysfunction, elucidating previously unknown roles for RNA in disease, and provide lead therapeutics. PMID:26139368
The conservation and function of RNA secondary structure in plants

PubMed Central

Vandivier, Lee E.; Anderson, Stephen J.; Foley, Shawn W.; Gregory, Brian D.

2016-01-01

RNA transcripts fold into secondary structures via intricate patterns of base pairing. These secondary structures impart catalytic, ligand binding, and scaffolding functions to a wide array of RNAs, forming a critical node of biological regulation. Among their many functions, RNA structural elements modulate epigenetic marks, alter mRNA stability and translation, regulate alternative splicing, transduce signals, and scaffold large macromolecular complexes. Thus, the study of RNA secondary structure is critical to understanding the function and regulation of RNA transcripts. Here, we review the origins, form, and function of RNA secondary structure, focusing on plants. We then provide an overview of methods for probing secondary structure, from physical methods such as X-ray crystallography and nuclear magnetic resonance imaging (NMR) to chemical and nuclease probing methods. Marriage with high-throughput sequencing has enabled these latter methods to scale across whole transcriptomes, yielding tremendous new insights into the form and function of RNA secondary structure. PMID:26865341
Fine-grained parallelism accelerating for RNA secondary structure prediction with pseudoknots based on FPGA.

PubMed

Xia, Fei; Jin, Guoqing

2014-06-01

PKNOTS is a most famous benchmark program and has been widely used to predict RNA secondary structure including pseudoknots. It adopts the standard four-dimensional (4D) dynamic programming (DP) method and is the basis of many variants and improved algorithms. Unfortunately, the O(N(6)) computing requirements and complicated data dependency greatly limits the usefulness of PKNOTS package with the explosion in gene database size. In this paper, we present a fine-grained parallel PKNOTS package and prototype system for accelerating RNA folding application based on FPGA chip. We adopted a series of storage optimization strategies to resolve the "Memory Wall" problem. We aggressively exploit parallel computing strategies to improve computational efficiency. We also propose several methods that collectively reduce the storage requirements for FPGA on-chip memory. To the best of our knowledge, our design is the first FPGA implementation for accelerating 4D DP problem for RNA folding application including pseudoknots. The experimental results show a factor of more than 50x average speedup over the PKNOTS-1.08 software running on a PC platform with Intel Core2 Q9400 Quad CPU for input RNA sequences. However, the power consumption of our FPGA accelerator is only about 50% of the general-purpose micro-processors.
Noncanoncial signal recognition particle RNAs in a major eukaryotic phylum revealed by purification of SRP from the human pathogen Cryptococcus neoformans

PubMed Central

Dumesic, Phillip A.; Rosenblad, Magnus A.; Samuelsson, Tore; Nguyen, Tiffany; Moresco, James J.; Yates, John R.; Madhani, Hiten D.

2015-01-01

Despite conservation of the signal recognition particle (SRP) from bacteria to man, computational approaches have failed to identify SRP components from genomes of many lower eukaryotes, raising the possibility that they have been lost or altered in those lineages. We report purification and analysis of SRP in the human pathogen Cryptococcus neoformans, providing the first description of SRP in basidiomycetous yeast. The C. neoformans SRP RNA displays a predicted structure in which the universally conserved helix 8 contains an unprecedented stem-loop insertion. Guided by this sequence, we computationally identified 152 SRP RNAs throughout the phylum Basidiomycota. This analysis revealed additional helix 8 alterations including single and double stem-loop insertions as well as loop diminutions affecting RNA structural elements that are otherwise conserved from bacteria to man. Strikingly, these SRP RNA features in Basidiomycota are accompanied by phylum-specific alterations in the RNA-binding domain of Srp54, the SRP protein subunit that directly interacts with helix 8. Our findings reveal unexpected fungal SRP diversity and suggest coevolution of the two most conserved SRP features—SRP RNA helix 8 and Srp54—in basidiomycetes. Because members of this phylum include important human and plant pathogens, these noncanonical features provide new targets for antifungal compound development. PMID:26275773
R-chie: a web server and R package for visualizing RNA secondary structures

PubMed Central

Lai, Daniel; Proctor, Jeff R.; Zhu, Jing Yun A.; Meyer, Irmtraud M.

2012-01-01

Visually examining RNA structures can greatly aid in understanding their potential functional roles and in evaluating the performance of structure prediction algorithms. As many functional roles of RNA structures can already be studied given the secondary structure of the RNA, various methods have been devised for visualizing RNA secondary structures. Most of these methods depict a given RNA secondary structure as a planar graph consisting of base-paired stems interconnected by roundish loops. In this article, we present an alternative method of depicting RNA secondary structure as arc diagrams. This is well suited for structures that are difficult or impossible to represent as planar stem-loop diagrams. Arc diagrams can intuitively display pseudo-knotted structures, as well as transient and alternative structural features. In addition, they facilitate the comparison of known and predicted RNA secondary structures. An added benefit is that structure information can be displayed in conjunction with a corresponding multiple sequence alignments, thereby highlighting structure and primary sequence conservation and variation. We have implemented the visualization algorithm as a web server R-chie as well as a corresponding R package called R4RNA, which allows users to run the software locally and across a range of common operating systems. PMID:22434875
Structure and reconstitution of yeast Mpp6-nuclear exosome complexes reveals that Mpp6 stimulates RNA decay and recruits the Mtr4 helicase

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wasmuth, Elizabeth V.; Zinder, John C.; Zattas, Dimitrios

Nuclear RNA exosomes catalyze a range of RNA processing and decay activities that are coordinated in part by cofactors, including Mpp6, Rrp47, and the Mtr4 RNA helicase. Mpp6 interacts with the nine-subunit exosome core, while Rrp47 stabilizes the exoribonuclease Rrp6 and recruits Mtr4, but it is less clear if these cofactors work together. Using biochemistry with Saccharomyces cerevisiae proteins, we show that Rrp47 and Mpp6 stimulate exosome-mediated RNA decay, albeit with unique dependencies on elements within the nuclear exosome. Mpp6-exosomes can recruit Mtr4, while Mpp6 and Rrp47 each contribute to Mtr4-dependent RNA decay, with maximal Mtr4-dependent decay observed with bothmore » cofactors. The 3.3 Å structure of a twelve-subunit nuclear Mpp6 exosome bound to RNA shows the central region of Mpp6 bound to the exosome core, positioning its Mtr4 recruitment domain next to Rrp6 and the exosome central channel. Genetic analysis reveals interactions that are largely consistent with our model.« less
The complete mitochondrial genome of the gall-forming fly, Fergusonina taylori Nelson and Yeates (Diptera: Fergusoninidae).

PubMed

Nelson, Leigh A; Cameron, Stephen L; Yeates, David K

2011-10-01

The monogeneric family Fergusoninidae consists of gall-forming flies that, together with Fergusobia (Tylenchida: Neotylenchidae) nematodes, form the only known mutualistic association between insects and nematodes. In this study, the entire 16,000 bp mitochondrial genome of Fergusonina taylori Nelson and Yeates was sequenced. The circular genome contains one encoding region including 27 genes and one non-coding A+T-rich region. The arrangement of the protein-coding, ribosomal RNA (rRNA) and transfer RNA (tRNA) genes was the same as that found in the ancestral insect. Nucleotide composition is highly A+T biased. All of the protein initiation codons are ATN, except for nad1 which begins with TTT. All 22 tRNA anticodons of F. taylori match those observed in Drosophila yakuba, and all form the typical cloverleaf structure except for tRNA-Ser((AGN)) which lacks a dihydrouridine (DHU) arm. Secondary structural features of the rRNA genes of Fergusonina are similar to those proposed for other insects, with minor modifications. The mitochondrial genome of Fergusonina presented here may prove valuable for resolving the sister group to the Fergusoninidae, and expands the available mtDNA data sources for acalyptrates overall.
RAG-3D: A search tool for RNA 3D substructures

DOE PAGES

Zahran, Mai; Sevim Bayrak, Cigdem; Elmetwaly, Shereef; ...

2015-08-24

In this study, to address many challenges in RNA structure/function prediction, the characterization of RNA's modular architectural units is required. Using the RNA-As-Graphs (RAG) database, we have previously explored the existence of secondary structure (2D) submotifs within larger RNA structures. Here we present RAG-3D—a dataset of RNA tertiary (3D) structures and substructures plus a web-based search tool—designed to exploit graph representations of RNAs for the goal of searching for similar 3D structural fragments. The objects in RAG-3D consist of 3D structures translated into 3D graphs, cataloged based on the connectivity between their secondary structure elements. Each graph is additionally describedmore » in terms of its subgraph building blocks. The RAG-3D search tool then compares a query RNA 3D structure to those in the database to obtain structurally similar structures and substructures. This comparison reveals conserved 3D RNA features and thus may suggest functional connections. Though RNA search programs based on similarity in sequence, 2D, and/or 3D structural elements are available, our graph-based search tool may be advantageous for illuminating similarities that are not obvious; using motifs rather than sequence space also reduces search times considerably. Ultimately, such substructuring could be useful for RNA 3D structure prediction, structure/function inference and inverse folding.« less
RAG-3D: a search tool for RNA 3D substructures

PubMed Central

Zahran, Mai; Sevim Bayrak, Cigdem; Elmetwaly, Shereef; Schlick, Tamar

2015-01-01

To address many challenges in RNA structure/function prediction, the characterization of RNA's modular architectural units is required. Using the RNA-As-Graphs (RAG) database, we have previously explored the existence of secondary structure (2D) submotifs within larger RNA structures. Here we present RAG-3D—a dataset of RNA tertiary (3D) structures and substructures plus a web-based search tool—designed to exploit graph representations of RNAs for the goal of searching for similar 3D structural fragments. The objects in RAG-3D consist of 3D structures translated into 3D graphs, cataloged based on the connectivity between their secondary structure elements. Each graph is additionally described in terms of its subgraph building blocks. The RAG-3D search tool then compares a query RNA 3D structure to those in the database to obtain structurally similar structures and substructures. This comparison reveals conserved 3D RNA features and thus may suggest functional connections. Though RNA search programs based on similarity in sequence, 2D, and/or 3D structural elements are available, our graph-based search tool may be advantageous for illuminating similarities that are not obvious; using motifs rather than sequence space also reduces search times considerably. Ultimately, such substructuring could be useful for RNA 3D structure prediction, structure/function inference and inverse folding. PMID:26304547
RAG-3D: A search tool for RNA 3D substructures

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zahran, Mai; Sevim Bayrak, Cigdem; Elmetwaly, Shereef

In this study, to address many challenges in RNA structure/function prediction, the characterization of RNA's modular architectural units is required. Using the RNA-As-Graphs (RAG) database, we have previously explored the existence of secondary structure (2D) submotifs within larger RNA structures. Here we present RAG-3D—a dataset of RNA tertiary (3D) structures and substructures plus a web-based search tool—designed to exploit graph representations of RNAs for the goal of searching for similar 3D structural fragments. The objects in RAG-3D consist of 3D structures translated into 3D graphs, cataloged based on the connectivity between their secondary structure elements. Each graph is additionally describedmore » in terms of its subgraph building blocks. The RAG-3D search tool then compares a query RNA 3D structure to those in the database to obtain structurally similar structures and substructures. This comparison reveals conserved 3D RNA features and thus may suggest functional connections. Though RNA search programs based on similarity in sequence, 2D, and/or 3D structural elements are available, our graph-based search tool may be advantageous for illuminating similarities that are not obvious; using motifs rather than sequence space also reduces search times considerably. Ultimately, such substructuring could be useful for RNA 3D structure prediction, structure/function inference and inverse folding.« less
ModeRNA server: an online tool for modeling RNA 3D structures.

PubMed

Rother, Magdalena; Milanowska, Kaja; Puton, Tomasz; Jeleniewicz, Jaroslaw; Rother, Kristian; Bujnicki, Janusz M

2011-09-01

The diverse functional roles of non-coding RNA molecules are determined by their underlying structure. ModeRNA server is an online tool for RNA 3D structure modeling by the comparative approach, based on a template RNA structure and a user-defined target-template sequence alignment. It offers an option to search for potential templates, given the target sequence. The server also provides tools for analyzing, editing and formatting of RNA structure files. It facilitates the use of the ModeRNA software and offers new options in comparison to the standalone program. ModeRNA server was implemented using the Python language and the Django web framework. It is freely available at http://iimcb.genesilico.pl/modernaserver. iamb@genesilico.pl.
Structure, recognition and adaptive binding in RNA aptamer complexes.

PubMed

Patel, D J; Suri, A K; Jiang, F; Jiang, L; Fan, P; Kumar, R A; Nonin, S

1997-10-10

Novel features of RNA structure, recognition and discrimination have been recently elucidated through the solution structural characterization of RNA aptamers that bind cofactors, aminoglycoside antibiotics, amino acids and peptides with high affinity and specificity. This review presents the solution structures of RNA aptamer complexes with adenosine monophosphate, flavin mononucleotide, arginine/citrulline and tobramycin together with an example of hydrogen exchange measurements of the base-pair kinetics for the AMP-RNA aptamer complex. A comparative analysis of the structures of these RNA aptamer complexes yields the principles, patterns and diversity associated with RNA architecture, molecular recognition and adaptive binding associated with complex formation.
Structural architecture of the human long non-coding RNA, steroid receptor RNA activator

PubMed Central

Novikova, Irina V.; Hennelly, Scott P.; Sanbonmatsu, Karissa Y.

2012-01-01

While functional roles of several long non-coding RNAs (lncRNAs) have been determined, the molecular mechanisms are not well understood. Here, we report the first experimentally derived secondary structure of a human lncRNA, the steroid receptor RNA activator (SRA), 0.87 kB in size. The SRA RNA is a non-coding RNA that coactivates several human sex hormone receptors and is strongly associated with breast cancer. Coding isoforms of SRA are also expressed to produce proteins, making the SRA gene a unique bifunctional system. Our experimental findings (SHAPE, in-line, DMS and RNase V1 probing) reveal that this lncRNA has a complex structural organization, consisting of four domains, with a variety of secondary structure elements. We examine the coevolution of the SRA gene at the RNA structure and protein structure levels using comparative sequence analysis across vertebrates. Rapid evolutionary stabilization of RNA structure, combined with frame-disrupting mutations in conserved regions, suggests that evolutionary pressure preserves the RNA structural core rather than its translational product. We perform similar experiments on alternatively spliced SRA isoforms to assess their structural features. PMID:22362738
RNA 3D Structure Modeling by Combination of Template-Based Method ModeRNA, Template-Free Folding with SimRNA, and Refinement with QRNAS.

PubMed

Piatkowski, Pawel; Kasprzak, Joanna M; Kumar, Deepak; Magnus, Marcin; Chojnowski, Grzegorz; Bujnicki, Janusz M

2016-01-01

RNA encompasses an essential part of all known forms of life. The functions of many RNA molecules are dependent on their ability to form complex three-dimensional (3D) structures. However, experimental determination of RNA 3D structures is laborious and challenging, and therefore, the majority of known RNAs remain structurally uncharacterized. To address this problem, computational structure prediction methods were developed that either utilize information derived from known structures of other RNA molecules (by way of template-based modeling) or attempt to simulate the physical process of RNA structure formation (by way of template-free modeling). All computational methods suffer from various limitations that make theoretical models less reliable than high-resolution experimentally determined structures. This chapter provides a protocol for computational modeling of RNA 3D structure that overcomes major limitations by combining two complementary approaches: template-based modeling that is capable of predicting global architectures based on similarity to other molecules but often fails to predict local unique features, and template-free modeling that can predict the local folding, but is limited to modeling the structure of relatively small molecules. Here, we combine the use of a template-based method ModeRNA with a template-free method SimRNA. ModeRNA requires a sequence alignment of the target RNA sequence to be modeled with a template of the known structure; it generates a model that predicts the structure of a conserved core and provides a starting point for modeling of variable regions. SimRNA can be used to fold small RNAs (<80 nt) without any additional structural information, and to refold parts of models for larger RNAs that have a correctly modeled core. ModeRNA can be either downloaded, compiled and run locally or run through a web interface at http://genesilico.pl/modernaserver/ . SimRNA is currently available to download for local use as a precompiled software package at http://genesilico.pl/software/stand-alone/simrna and as a web server at http://genesilico.pl/SimRNAweb . For model optimization we use QRNAS, available at http://genesilico.pl/qrnas .
Dnmt2 mediates intergenerational transmission of paternally acquired metabolic disorders through sperm small non-coding RNAs.

PubMed

Zhang, Yunfang; Zhang, Xudong; Shi, Junchao; Tuorto, Francesca; Li, Xin; Liu, Yusheng; Liebers, Reinhard; Zhang, Liwen; Qu, Yongcun; Qian, Jingjing; Pahima, Maya; Liu, Ying; Yan, Menghong; Cao, Zhonghong; Lei, Xiaohua; Cao, Yujing; Peng, Hongying; Liu, Shichao; Wang, Yue; Zheng, Huili; Woolsey, Rebekah; Quilici, David; Zhai, Qiwei; Li, Lei; Zhou, Tong; Yan, Wei; Lyko, Frank; Zhang, Ying; Zhou, Qi; Duan, Enkui; Chen, Qi

2018-05-01

The discovery of RNAs (for example, messenger RNAs, non-coding RNAs) in sperm has opened the possibility that sperm may function by delivering additional paternal information aside from solely providing the DNA 1 . Increasing evidence now suggests that sperm small non-coding RNAs (sncRNAs) can mediate intergenerational transmission of paternally acquired phenotypes, including mental stress 2,3 and metabolic disorders 4-6 . How sperm sncRNAs encode paternal information remains unclear, but the mechanism may involve RNA modifications. Here we show that deletion of a mouse tRNA methyltransferase, DNMT2, abolished sperm sncRNA-mediated transmission of high-fat-diet-induced metabolic disorders to offspring. Dnmt2 deletion prevented the elevation of RNA modifications (m 5 C, m 2 G) in sperm 30-40 nt RNA fractions that are induced by a high-fat diet. Also, Dnmt2 deletion altered the sperm small RNA expression profile, including levels of tRNA-derived small RNAs and rRNA-derived small RNAs, which might be essential in composing a sperm RNA 'coding signature' that is needed for paternal epigenetic memory. Finally, we show that Dnmt2-mediated m 5 C contributes to the secondary structure and biological properties of sncRNAs, implicating sperm RNA modifications as an additional layer of paternal hereditary information.
RNA secondary structure prediction with pseudoknots: Contribution of algorithm versus energy model.

PubMed

Jabbari, Hosna; Wark, Ian; Montemagno, Carlo

2018-01-01

RNA is a biopolymer with various applications inside the cell and in biotechnology. Structure of an RNA molecule mainly determines its function and is essential to guide nanostructure design. Since experimental structure determination is time-consuming and expensive, accurate computational prediction of RNA structure is of great importance. Prediction of RNA secondary structure is relatively simpler than its tertiary structure and provides information about its tertiary structure, therefore, RNA secondary structure prediction has received attention in the past decades. Numerous methods with different folding approaches have been developed for RNA secondary structure prediction. While methods for prediction of RNA pseudoknot-free structure (structures with no crossing base pairs) have greatly improved in terms of their accuracy, methods for prediction of RNA pseudoknotted secondary structure (structures with crossing base pairs) still have room for improvement. A long-standing question for improving the prediction accuracy of RNA pseudoknotted secondary structure is whether to focus on the prediction algorithm or the underlying energy model, as there is a trade-off on computational cost of the prediction algorithm versus the generality of the method. The aim of this work is to argue when comparing different methods for RNA pseudoknotted structure prediction, the combination of algorithm and energy model should be considered and a method should not be considered superior or inferior to others if they do not use the same scoring model. We demonstrate that while the folding approach is important in structure prediction, it is not the only important factor in prediction accuracy of a given method as the underlying energy model is also as of great value. Therefore we encourage researchers to pay particular attention in comparing methods with different energy models.

Methylation guide RNA evolution in archaea: structure, function and genomic organization of 110 C/D box sRNA families across six Pyrobaculum species.

PubMed

Lui, Lauren M; Uzilov, Andrew V; Bernick, David L; Corredor, Andrea; Lowe, Todd M; Dennis, Patrick P

2018-05-16

Archaeal homologs of eukaryotic C/D box small nucleolar RNAs (C/D box sRNAs) guide precise 2'-O-methyl modification of ribosomal and transfer RNAs. Although C/D box sRNA genes constitute one of the largest RNA gene families in archaeal thermophiles, most genomes have incomplete sRNA gene annotation because reliable, fully automated detection methods are not available. We expanded and curated a comprehensive gene set across six species of the crenarchaeal genus Pyrobaculum, particularly rich in C/D box sRNA genes. Using high-throughput small RNA sequencing, specialized computational searches and comparative genomics, we analyzed 526 Pyrobaculum C/D box sRNAs, organizing them into 110 families based on synteny and conservation of guide sequences which determine methylation targets. We examined gene duplications and rearrangements, including one family that has expanded in a pattern similar to retrotransposed repetitive elements in eukaryotes. New training data and inclusion of kink-turn secondary structural features enabled creation of an improved search model. Our analyses provide the most comprehensive, dynamic view of C/D box sRNA evolutionary history within a genus, in terms of modification function, feature plasticity, and gene mobility.
RNACompress: Grammar-based compression and informational complexity measurement of RNA secondary structure.

PubMed

Liu, Qi; Yang, Yu; Chen, Chun; Bu, Jiajun; Zhang, Yin; Ye, Xiuzi

2008-03-31

With the rapid emergence of RNA databases and newly identified non-coding RNAs, an efficient compression algorithm for RNA sequence and structural information is needed for the storage and analysis of such data. Although several algorithms for compressing DNA sequences have been proposed, none of them are suitable for the compression of RNA sequences with their secondary structures simultaneously. This kind of compression not only facilitates the maintenance of RNA data, but also supplies a novel way to measure the informational complexity of RNA structural data, raising the possibility of studying the relationship between the functional activities of RNA structures and their complexities, as well as various structural properties of RNA based on compression. RNACompress employs an efficient grammar-based model to compress RNA sequences and their secondary structures. The main goals of this algorithm are two fold: (1) present a robust and effective way for RNA structural data compression; (2) design a suitable model to represent RNA secondary structure as well as derive the informational complexity of the structural data based on compression. Our extensive tests have shown that RNACompress achieves a universally better compression ratio compared with other sequence-specific or common text-specific compression algorithms, such as Gencompress, winrar and gzip. Moreover, a test of the activities of distinct GTP-binding RNAs (aptamers) compared with their structural complexity shows that our defined informational complexity can be used to describe how complexity varies with activity. These results lead to an objective means of comparing the functional properties of heteropolymers from the information perspective. A universal algorithm for the compression of RNA secondary structure as well as the evaluation of its informational complexity is discussed in this paper. We have developed RNACompress, as a useful tool for academic users. Extensive tests have shown that RNACompress is a universally efficient algorithm for the compression of RNA sequences with their secondary structures. RNACompress also serves as a good measurement of the informational complexity of RNA secondary structure, which can be used to study the functional activities of RNA molecules.
RNACompress: Grammar-based compression and informational complexity measurement of RNA secondary structure

PubMed Central

Liu, Qi; Yang, Yu; Chen, Chun; Bu, Jiajun; Zhang, Yin; Ye, Xiuzi

2008-01-01

Background With the rapid emergence of RNA databases and newly identified non-coding RNAs, an efficient compression algorithm for RNA sequence and structural information is needed for the storage and analysis of such data. Although several algorithms for compressing DNA sequences have been proposed, none of them are suitable for the compression of RNA sequences with their secondary structures simultaneously. This kind of compression not only facilitates the maintenance of RNA data, but also supplies a novel way to measure the informational complexity of RNA structural data, raising the possibility of studying the relationship between the functional activities of RNA structures and their complexities, as well as various structural properties of RNA based on compression. Results RNACompress employs an efficient grammar-based model to compress RNA sequences and their secondary structures. The main goals of this algorithm are two fold: (1) present a robust and effective way for RNA structural data compression; (2) design a suitable model to represent RNA secondary structure as well as derive the informational complexity of the structural data based on compression. Our extensive tests have shown that RNACompress achieves a universally better compression ratio compared with other sequence-specific or common text-specific compression algorithms, such as Gencompress, winrar and gzip. Moreover, a test of the activities of distinct GTP-binding RNAs (aptamers) compared with their structural complexity shows that our defined informational complexity can be used to describe how complexity varies with activity. These results lead to an objective means of comparing the functional properties of heteropolymers from the information perspective. Conclusion A universal algorithm for the compression of RNA secondary structure as well as the evaluation of its informational complexity is discussed in this paper. We have developed RNACompress, as a useful tool for academic users. Extensive tests have shown that RNACompress is a universally efficient algorithm for the compression of RNA sequences with their secondary structures. RNACompress also serves as a good measurement of the informational complexity of RNA secondary structure, which can be used to study the functional activities of RNA molecules. PMID:18373878
Amino acid repeats avert mRNA folding through conservative substitutions and synonymous codons, regardless of codon bias.

PubMed

Barik, Sailen

2017-12-01

A significant number of proteins in all living species contains amino acid repeats (AARs) of various lengths and compositions, many of which play important roles in protein structure and function. Here, I have surveyed select homopolymeric single [(A)n] and double [(AB)n] AARs in the human proteome. A close examination of their codon pattern and analysis of RNA structure propensity led to the following set of empirical rules: (1) One class of amino acid repeats (Class I) uses a mixture of synonymous codons, some of which approximate the codon bias ratio in the overall human proteome; (2) The second class (Class II) disregards the codon bias ratio, and appears to have originated by simple repetition of the same codon (or just a few codons); and finally, (3) In all AARs (including Class I, Class II, and the in-betweens), the codons are chosen in a manner that precludes the formation of RNA secondary structure. It appears that the AAR genes have evolved by orchestrating a balance between codon usage and mRNA secondary structure. The insights gained here should provide a better understanding of AAR evolution and may assist in designing synthetic genes.
Computational Assessment of Potassium and Magnesium Ion Binding to a Buried Pocket in GTPase-Associating Center RNA

PubMed Central

2016-01-01

An experimentally well-studied model of RNA tertiary structures is a 58mer rRNA fragment, known as GTPase-associating center (GAC) RNA, in which a highly negative pocket walled by phosphate oxygen atoms is stabilized by a chelated cation. Although such deep pockets with more than one direct phosphate to ion chelation site normally include magnesium, as shown in one GAC crystal structure, another GAC crystal structure and solution experiments suggest potassium at this site. Both crystal structures also depict two magnesium ions directly bound to the phosphate groups comprising this controversial pocket. Here, we used classical molecular dynamics simulations as well as umbrella sampling to investigate the possibility of binding of potassium versus magnesium inside the pocket and to better characterize the chelation of one of the binding magnesium ions outside the pocket. The results support the preference of the pocket to accommodate potassium rather than magnesium and suggest that one of the closely binding magnesium ions can only bind at high magnesium concentrations, such as might be present during crystallization. This work illustrates the complementary utility of molecular modeling approaches with atomic-level detail in resolving discrepancies between conflicting experimental results. PMID:27983843
Computational Assessment of Potassium and Magnesium Ion Binding to a Buried Pocket in GTPase-Associating Center RNA.

PubMed

Hayatshahi, Hamed S; Roe, Daniel R; Galindo-Murillo, Rodrigo; Hall, Kathleen B; Cheatham, Thomas E

2017-01-26

An experimentally well-studied model of RNA tertiary structures is a 58mer rRNA fragment, known as GTPase-associating center (GAC) RNA, in which a highly negative pocket walled by phosphate oxygen atoms is stabilized by a chelated cation. Although such deep pockets with more than one direct phosphate to ion chelation site normally include magnesium, as shown in one GAC crystal structure, another GAC crystal structure and solution experiments suggest potassium at this site. Both crystal structures also depict two magnesium ions directly bound to the phosphate groups comprising this controversial pocket. Here, we used classical molecular dynamics simulations as well as umbrella sampling to investigate the possibility of binding of potassium versus magnesium inside the pocket and to better characterize the chelation of one of the binding magnesium ions outside the pocket. The results support the preference of the pocket to accommodate potassium rather than magnesium and suggest that one of the closely binding magnesium ions can only bind at high magnesium concentrations, such as might be present during crystallization. This work illustrates the complementary utility of molecular modeling approaches with atomic-level detail in resolving discrepancies between conflicting experimental results.
Multiperspective smFRET reveals rate-determining late intermediates of ribosomal translocation.

PubMed

Wasserman, Michael R; Alejo, Jose L; Altman, Roger B; Blanchard, Scott C

2016-04-01

Directional translocation of the ribosome through the mRNA open reading frame is a critical determinant of translational fidelity. This process entails a complex interplay of large-scale conformational changes within the actively translating particle, which together coordinate the movement of tRNA and mRNA substrates with respect to the large and small ribosomal subunits. Using pre-steady state, single-molecule fluorescence resonance energy transfer imaging, we tracked the nature and timing of these conformational events within the Escherichia coli ribosome from five structural perspectives. Our investigations revealed direct evidence of structurally and kinetically distinct late intermediates during substrate movement, whose resolution determines the rate of translocation. These steps involve intramolecular events within the EF-G-GDP-bound ribosome, including exaggerated, reversible fluctuations of the small-subunit head domain, which ultimately facilitate peptidyl-tRNA's movement into its final post-translocation position.
Modular architecture of eukaryotic RNase P and RNase MRP revealed by electron microscopy.

PubMed

Hipp, Katharina; Galani, Kyriaki; Batisse, Claire; Prinz, Simone; Böttcher, Bettina

2012-04-01

Ribonuclease P (RNase P) and RNase MRP are closely related ribonucleoprotein enzymes, which process RNA substrates including tRNA precursors for RNase P and 5.8 S rRNA precursors, as well as some mRNAs, for RNase MRP. The structures of RNase P and RNase MRP have not yet been solved, so it is unclear how the proteins contribute to the structure of the complexes and how substrate specificity is determined. Using electron microscopy and image processing we show that eukaryotic RNase P and RNase MRP have a modular architecture, where proteins stabilize the RNA fold and contribute to cavities, channels and chambers between the modules. Such features are located at strategic positions for substrate recognition by shape and coordination of the cleaved-off sequence. These are also the sites of greatest difference between RNase P and RNase MRP, highlighting the importance of the adaptation of this region to the different substrates.
Stem-Loop V of Varkud Satellite RNA Exhibits Characteristics of the Mg2+ Bound Structure in the Presence of Monovalent Ions

PubMed Central

2015-01-01

The Varkud Satellite RNA contains a self-cleaving ribozyme that has been shown to function independently of its surroundings. This 160 nucleotide ribozyme adopts a catalytically active tertiary structure that includes a kissing hairpin complex formed by stem-loop I and stem-loop V (SLV). The five-nucleotide 5′-rUGACU loop of the isolated SLV has been shown to adopt a Mg2+-dependent U-turn structure by solution NMR. This U-turn hairpin is examined here by molecular dynamics simulations in the presence of monovalent and divalent ions. Simulations confirm on an all-atom level the hypotheses for the role of the Mg2+ ions in stabilizing the loop, as well as the role of the solvent exposed U700 base. Additionally, these simulations suggest the Mg2+-free stem-loop adopts a wide range of structures, including energetically favorable structures similar to the Mg2+-bound loop structure. We propose this structure is a “gatekeeper” or precursor to Mg2+ binding when those ions are present. PMID:26328924
Accurate Classification of RNA Structures Using Topological Fingerprints

PubMed Central

Li, Kejie; Gribskov, Michael

2016-01-01

While RNAs are well known to possess complex structures, functionally similar RNAs often have little sequence similarity. While the exact size and spacing of base-paired regions vary, functionally similar RNAs have pronounced similarity in the arrangement, or topology, of base-paired stems. Furthermore, predicted RNA structures often lack pseudoknots (a crucial aspect of biological activity), and are only partially correct, or incomplete. A topological approach addresses all of these difficulties. In this work we describe each RNA structure as a graph that can be converted to a topological spectrum (RNA fingerprint). The set of subgraphs in an RNA structure, its RNA fingerprint, can be compared with the fingerprints of other RNA structures to identify and correctly classify functionally related RNAs. Topologically similar RNAs can be identified even when a large fraction, up to 30%, of the stems are omitted, indicating that highly accurate structures are not necessary. We investigate the performance of the RNA fingerprint approach on a set of eight highly curated RNA families, with diverse sizes and functions, containing pseudoknots, and with little sequence similarity–an especially difficult test set. In spite of the difficult test set, the RNA fingerprint approach is very successful (ROC AUC > 0.95). Due to the inclusion of pseudoknots, the RNA fingerprint approach both covers a wider range of possible structures than methods based only on secondary structure, and its tolerance for incomplete structures suggests that it can be applied even to predicted structures. Source code is freely available at https://github.rcac.purdue.edu/mgribsko/XIOS_RNA_fingerprint. PMID:27755571
A Boost for the Emerging Field of RNA Nanotechnology

PubMed Central

2011-01-01

This Nano Focus article highlights recent advances in RNA nanotechnology as presented at the First International Conference of RNA Nanotechnology and Therapeutics, which took place in Cleveland, OH, USA (October 23–25, 2010) (http://www.eng.uc.edu/nanomedicine/RNA2010/), chaired by Peixuan Guo and co-chaired by David Rueda and Scott Tenenbaum. The conference was the first of its kind to bring together more than 30 invited speakers in the frontier of RNA nanotechnology from France, Sweden, South Korea, China, and throughout the United States to discuss RNA nanotechnology and its applications. It provided a platform for researchers from academia, government, and the pharmaceutical industry to share existing knowledge, vision, technology, and challenges in the field and promoted collaborations among researchers interested in advancing this emerging scientific discipline. The meeting covered a range of topics, including biophysical and single-molecule approaches for characterization of RNA nanostructures; structure studies on RNA nanoparticles by chemical or biochemical approaches, computation, prediction, and modeling of RNA nanoparticle structures; methods for the assembly of RNA nanoparticles; chemistry for RNA synthesis, conjugation, and labeling; and application of RNA nanoparticles in therapeutics. A special invited talk on the well-established principles of DNA nanotechnology was arranged to provide models for RNA nanotechnology. An Administrator from National Institutes of Health (NIH) National Cancer Institute (NCI) Alliance for Nanotechnology in Cancer discussed the current nanocancer research directions and future funding opportunities at NCI. As indicated by the feedback received from the invited speakers and the meeting participants, this meeting was extremely successful, exciting, and informative, covering many groundbreaking findings, pioneering ideas, and novel discoveries. PMID:21604810
ModeRNA: a tool for comparative modeling of RNA 3D structure

PubMed Central

Rother, Magdalena; Rother, Kristian; Puton, Tomasz; Bujnicki, Janusz M.

2011-01-01

RNA is a large group of functionally important biomacromolecules. In striking analogy to proteins, the function of RNA depends on its structure and dynamics, which in turn is encoded in the linear sequence. However, while there are numerous methods for computational prediction of protein three-dimensional (3D) structure from sequence, with comparative modeling being the most reliable approach, there are very few such methods for RNA. Here, we present ModeRNA, a software tool for comparative modeling of RNA 3D structures. As an input, ModeRNA requires a 3D structure of a template RNA molecule, and a sequence alignment between the target to be modeled and the template. It must be emphasized that a good alignment is required for successful modeling, and for large and complex RNA molecules the development of a good alignment usually requires manual adjustments of the input data based on previous expertise of the respective RNA family. ModeRNA can model post-transcriptional modifications, a functionally important feature analogous to post-translational modifications in proteins. ModeRNA can also model DNA structures or use them as templates. It is equipped with many functions for merging fragments of different nucleic acid structures into a single model and analyzing their geometry. Windows and UNIX implementations of ModeRNA with comprehensive documentation and a tutorial are freely available. PMID:21300639
RNA Characterization by Solid-State NMR Spectroscopy.

PubMed

Yang, Yufei; Wang, Shenlin

2018-06-21

The structures of RNAs, which play critical roles in various biological processes, provide important clues and insights into the biological functions of these molecules. However, RNA structure determination remains a challenging topic. In recent years, magic-angle-spinning solid-state NMR (MAS SSNMR) has emerged as an alternative technique for structural and dynamic characterization of RNA. MAS SSNMR has been successfully applied to provide atomic-level structural information about several RNA molecules and RNA-protein complexes. In this Minireview, we give an overview of recent progress in the field of MAS SSNMR based RNA structural characterization, and introduce sample preparation strategies and SSNMR spectroscopic techniques that have been incorporated to identify RNA structural elements. We also highlight a few impressive examples of RNAs that have been investigated extensively by SSNMR. Finally, we briefly discuss future technical trends in the use of MAS SSNMR to facilitate RNA structure determination. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Structural features of microRNA (miRNA) precursors and their relevance to miRNA biogenesis and small interfering RNA/short hairpin RNA design.

PubMed

Krol, Jacek; Sobczak, Krzysztof; Wilczynska, Urszula; Drath, Maria; Jasinska, Anna; Kaczynska, Danuta; Krzyzosiak, Wlodzimierz J

2004-10-01

We have established the structures of 10 human microRNA (miRNA) precursors using biochemical methods. Eight of these structures turned out to be different from those that were computer-predicted. The differences localized in the terminal loop region and at the opposite side of the precursor hairpin stem. We have analyzed the features of these structures from the perspectives of miRNA biogenesis and active strand selection. We demonstrated the different thermodynamic stability profiles for pre-miRNA hairpins harboring miRNAs at their 5'- and 3'-sides and discussed their functional implications. Our results showed that miRNA prediction based on predicted precursor structures may give ambiguous results, and the success rate is significantly higher for the experimentally determined structures. On the other hand, the differences between the predicted and experimentally determined structures did not affect the stability of termini produced through "conceptual dicing." This result confirms the value of thermodynamic analysis based on mfold as a predictor of strand section by RNAi-induced silencing complex (RISC).
Stem-Loop RNA Hairpins in Giant Viruses: Invading rRNA-Like Repeats and a Template Free RNA

PubMed Central

Seligmann, Hervé; Raoult, Didier

2018-01-01

We examine the hypothesis that de novo template-free RNAs still form spontaneously, as they did at the origins of life, invade modern genomes, contribute new genetic material. Previously, analyses of RNA secondary structures suggested that some RNAs resembling ancestral (t)RNAs formed recently de novo, other parasitic sequences cluster with rRNAs. Here positive control analyses of additional RNA secondary structures confirm ancestral and de novo statuses of RNA grouped according to secondary structure. Viroids with branched stems resemble de novo RNAs, rod-shaped viroids resemble rRNA secondary structures, independently of GC contents. 5′ UTR leading regions of West Nile and Dengue flavivirid viruses resemble de novo and rRNA structures, respectively. An RNA homologous with Megavirus, Dengue and West Nile genomes, copperhead snake microsatellites and levant cotton repeats, not templated by Mimivirus' genome, persists throughout Mimivirus' infection. Its secondary structure clusters with candidate de novo RNAs. The saltatory phyletic distribution and secondary structure of Mimivirus' peculiar RNA suggest occasional template-free polymerization of this sequence, rather than noncanonical transcriptions (swinger polymerization, posttranscriptional editing). PMID:29449833
Neuronal RNA granules: a link between RNA localization and stimulation-dependent translation

NASA Technical Reports Server (NTRS)

Krichevsky, A. M.; Kosik, K. S.

2001-01-01

RNA granules are a macromolecular structure observed in neurons, where they serve as motile units that translocate mRNAs. Isolated RNA granules are highly enriched in Staufen protein and ultrastructurally contain densely packed clusters of ribosomes. With depolarization, many mRNAs, including those involved in plasticity, rapidly shift from the RNA granule fraction to polysomes. Depolarization reorganizes granules and induces a less compact organization of their ribosomes. RNA granules are not translationally competent, as indicated by the failure to incorporate radioactive amino acids and the absence of eIF4E, 4G, and tRNAs. We concluded that RNA granules are a local storage compartment for mRNAs under translational arrest but are poised for release to actively translated pools. Local release of mRNAs and ribosomes from granules may serve as a macromolecular mechanism linking RNA localization to translation and synaptic plasticity.
Free-energy landscape of a hyperstable RNA tetraloop.

PubMed

Miner, Jacob C; Chen, Alan A; García, Angel E

2016-06-14

We report the characterization of the energy landscape and the folding/unfolding thermodynamics of a hyperstable RNA tetraloop obtained through high-performance molecular dynamics simulations at microsecond timescales. Sampling of the configurational landscape is conducted using temperature replica exchange molecular dynamics over three isochores at high, ambient, and negative pressures to determine the thermodynamic stability and the free-energy landscape of the tetraloop. The simulations reveal reversible folding/unfolding transitions of the tetraloop into the canonical A-RNA conformation and the presence of two alternative configurations, including a left-handed Z-RNA conformation and a compact purine Triplet. Increasing hydrostatic pressure shows a stabilizing effect on the A-RNA conformation and a destabilization of the left-handed Z-RNA. Our results provide a comprehensive description of the folded free-energy landscape of a hyperstable RNA tetraloop and highlight the significant advances of all-atom molecular dynamics in describing the unbiased folding of a simple RNA secondary structure motif.
PGL germ granule assembly protein is a base-specific, single-stranded RNase

PubMed Central

Aoki, Scott T.; Kershner, Aaron M.; Bingman, Craig A.; Wickens, Marvin; Kimble, Judith

2016-01-01

Cellular RNA-protein (RNP) granules are ubiquitous and have fundamental roles in biology and RNA metabolism, but the molecular basis of their structure, assembly, and function is poorly understood. Using nematode “P-granules” as a paradigm, we focus on the PGL granule scaffold protein to gain molecular insights into RNP granule structure and assembly. We first identify a PGL dimerization domain (DD) and determine its crystal structure. PGL-1 DD has a novel 13 α-helix fold that creates a positively charged channel as a homodimer. We investigate its capacity to bind RNA and discover unexpectedly that PGL-1 DD is a guanosine-specific, single-stranded endonuclease. Discovery of the PGL homodimer, together with previous results, suggests a model in which the PGL DD dimer forms a fundamental building block for P-granule assembly. Discovery of the PGL RNase activity expands the role of RNP granule assembly proteins to include enzymatic activity in addition to their job as structural scaffolds. PMID:26787882
Fluorescence probing of T box antiterminator RNA: Insights into riboswitch discernment of the tRNA discriminator base

PubMed Central

Means, John A.; Simson, Crystal M.; Zhou, Shu; Rachford, Aaron A.; Rack, Jeffrey J.; Hines, Jennifer V.

2009-01-01

The T box transcription antitermination riboswitch is one of the main regulatory mechanisms utilized by Gram-positive bacteria to regulate genes that are involved in amino acid metabolism. The details of the antitermination event, including the role that Mg2+ plays, in this riboswitch have not been completely elucidated. In these studies, details of the antitermination event were investigated utilizing 2-aminopurine to monitor structural changes of a model antiterminator RNA when it was bound to model tRNA. Based on the results of these fluorescence studies, the model tRNA binds the model antiterminator RNA via an induced fit. This binding is enhanced by the presence of Mg2+, facilitating the complete base pairing of the model tRNA acceptor end with the complementary bases in the model antiterminator bulge. PMID:19755116
5SRNAdb: an information resource for 5S ribosomal RNAs.

PubMed

Szymanski, Maciej; Zielezinski, Andrzej; Barciszewski, Jan; Erdmann, Volker A; Karlowski, Wojciech M

2016-01-04

Ribosomal 5S RNA (5S rRNA) is the ubiquitous RNA component found in the large subunit of ribosomes in all known organisms. Due to its small size, abundance and evolutionary conservation 5S rRNA for many years now is used as a model molecule in studies on RNA structure, RNA-protein interactions and molecular phylogeny. 5SRNAdb (http://combio.pl/5srnadb/) is the first database that provides a high quality reference set of ribosomal 5S RNAs (5S rRNA) across three domains of life. Here, we give an overview of new developments in the database and associated web tools since 2002, including updates to database content, curation processes and user web interfaces. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

Structure of a group II intron in complex with its reverse transcriptase.

PubMed

Qu, Guosheng; Kaushal, Prem Singh; Wang, Jia; Shigematsu, Hideki; Piazza, Carol Lyn; Agrawal, Rajendra Kumar; Belfort, Marlene; Wang, Hong-Wei

2016-06-01

Bacterial group II introns are large catalytic RNAs related to nuclear spliceosomal introns and eukaryotic retrotransposons. They self-splice, yielding mature RNA, and integrate into DNA as retroelements. A fully active group II intron forms a ribonucleoprotein complex comprising the intron ribozyme and an intron-encoded protein that performs multiple activities including reverse transcription, in which intron RNA is copied into the DNA target. Here we report cryo-EM structures of an endogenously spliced Lactococcus lactis group IIA intron in its ribonucleoprotein complex form at 3.8-Å resolution and in its protein-depleted form at 4.5-Å resolution, revealing functional coordination of the intron RNA with the protein. Remarkably, the protein structure reveals a close relationship between the reverse transcriptase catalytic domain and telomerase, whereas the active splicing center resembles the spliceosomal Prp8 protein. These extraordinary similarities hint at intricate ancestral relationships and provide new insights into splicing and retromobility.
Packaging signals in two single-stranded RNA viruses imply a conserved assembly mechanism and geometry of the packaged genome.

PubMed

Dykeman, Eric C; Stockley, Peter G; Twarock, Reidun

2013-09-09

The current paradigm for assembly of single-stranded RNA viruses is based on a mechanism involving non-sequence-specific packaging of genomic RNA driven by electrostatic interactions. Recent experiments, however, provide compelling evidence for sequence specificity in this process both in vitro and in vivo. The existence of multiple RNA packaging signals (PSs) within viral genomes has been proposed, which facilitates assembly by binding coat proteins in such a way that they promote the protein-protein contacts needed to build the capsid. The binding energy from these interactions enables the confinement or compaction of the genomic RNAs. Identifying the nature of such PSs is crucial for a full understanding of assembly, which is an as yet untapped potential drug target for this important class of pathogens. Here, for two related bacterial viruses, we determine the sequences and locations of their PSs using Hamiltonian paths, a concept from graph theory, in combination with bioinformatics and structural studies. Their PSs have a common secondary structure motif but distinct consensus sequences and positions within the respective genomes. Despite these differences, the distributions of PSs in both viruses imply defined conformations for the packaged RNA genomes in contact with the protein shell in the capsid, consistent with a recent asymmetric structure determination of the MS2 virion. The PS distributions identified moreover imply a preferred, evolutionarily conserved assembly pathway with respect to the RNA sequence with potentially profound implications for other single-stranded RNA viruses known to have RNA PSs, including many animal and human pathogens. Copyright © 2013 Elsevier Ltd. All rights reserved.
Tracking the Fragile X Mental Retardation Protein in a Highly Ordered Neuronal RiboNucleoParticles Population: A Link between Stalled Polyribosomes and RNA Granules.

PubMed

El Fatimy, Rachid; Davidovic, Laetitia; Tremblay, Sandra; Jaglin, Xavier; Dury, Alain; Robert, Claude; De Koninck, Paul; Khandjian, Edouard W

2016-07-01

Local translation at the synapse plays key roles in neuron development and activity-dependent synaptic plasticity. mRNAs are translocated from the neuronal soma to the distant synapses as compacted ribonucleoparticles referred to as RNA granules. These contain many RNA-binding proteins, including the Fragile X Mental Retardation Protein (FMRP), the absence of which results in Fragile X Syndrome, the most common inherited form of intellectual disability and the leading genetic cause of autism. Using FMRP as a tracer, we purified a specific population of RNA granules from mouse brain homogenates. Protein composition analyses revealed a strong relationship between polyribosomes and RNA granules. However, the latter have distinct architectural and structural properties, since they are detected as close compact structures as observed by electron microscopy, and converging evidence point to the possibility that these structures emerge from stalled polyribosomes. Time-lapse video microscopy indicated that single granules merge to form cargoes that are transported from the soma to distal locations. Transcriptomic analyses showed that a subset of mRNAs involved in cytoskeleton remodelling and neural development is selectively enriched in RNA granules. One third of the putative mRNA targets described for FMRP appear to be transported in granules and FMRP is more abundant in granules than in polyribosomes. This observation supports a primary role for FMRP in granules biology. Our findings open new avenues for the study of RNA granule dysfunctions in animal models of nervous system disorders, such as Fragile X syndrome.
Solution structure and thermodynamics of 2',5' RNA intercalation.

PubMed

Horowitz, Eric D; Lilavivat, Seth; Holladay, Benjamin W; Germann, Markus W; Hud, Nicholas V

2009-04-29

As a means to explore the influence of the nucleic acid backbone on the intercalative binding of ligands to DNA and RNA, we have determined the solution structure of a proflavine-bound 2',5'-linked octamer duplex with the sequence GCCGCGGC. This structure represents the first NMR structure of an intercalated RNA duplex, of either backbone structural isomer. By comparison with X-ray crystal structures, we have identified similarities and differences between intercalated 3',5' and 2',5'-linked RNA duplexes. First, the two forms of RNA have different sugar pucker geometries at the intercalated nucleotide steps, yet have the same interphosphate distances. Second, as in intercalated 3',5' RNA, the phosphate backbone angle zeta at the 2',5' RNA intercalation site prefers to be in the trans conformation, whereas unintercalated 2',5' and 3',5' RNA prefer the -gauche conformation. These observations provide new insights regarding the transitions required for intercalation of a phosphodiester-ribose backbone and suggest a possible contribution of the backbone to the origin of the nearest-neighbor exclusion principle. Thermodynamic studies presented for intercalation of both structural RNA isomers also reveal a surprising sensitivity of intercalator binding enthalpy and entropy to the details of RNA backbone structure.
Computational biology of RNA interactions.

PubMed

Dieterich, Christoph; Stadler, Peter F

2013-01-01

The biodiversity of the RNA world has been underestimated for decades. RNA molecules are key building blocks, sensors, and regulators of modern cells. The biological function of RNA molecules cannot be separated from their ability to bind to and interact with a wide space of chemical species, including small molecules, nucleic acids, and proteins. Computational chemists, physicists, and biologists have developed a rich tool set for modeling and predicting RNA interactions. These interactions are to some extent determined by the binding conformation of the RNA molecule. RNA binding conformations are approximated with often acceptable accuracy by sequence and secondary structure motifs. Secondary structure ensembles of a given RNA molecule can be efficiently computed in many relevant situations by employing a standard energy model for base pair interactions and dynamic programming techniques. The case of bi-molecular RNA-RNA interactions can be seen as an extension of this approach. However, unbiased transcriptome-wide scans for local RNA-RNA interactions are computationally challenging yet become efficient if the binding motif/mode is known and other external information can be used to confine the search space. Computational methods are less developed for proteins and small molecules, which bind to RNA with very high specificity. Binding descriptors of proteins are usually determined by in vitro high-throughput assays (e.g., microarrays or sequencing). Intriguingly, recent experimental advances, which are mostly based on light-induced cross-linking of binding partners, render in vivo binding patterns accessible yet require new computational methods for careful data interpretation. The grand challenge is to model the in vivo situation where a complex interplay of RNA binders competes for the same target RNA molecule. Evidently, bioinformaticians are just catching up with the impressive pace of these developments. Copyright © 2012 John Wiley & Sons, Ltd.
The phylogeny of archaebacteria, including novel anaerobic thermoacidophiles in the light of RNA polymerase structure

NASA Astrophysics Data System (ADS)

Zillig, Wolfram; Schnabel, Ralf; Tu, Jenn; Stetter, Karl Otto

1982-05-01

DNA-dependent RNA polymerases of archaebacteria are distinct from those of eubacteria both in structure and in function. They show similarities to those of the eukaryotic cytoplasm. Extremely thermophilic anaerobic sulfur-respiring archaebacteria isolated from solfataric waters represent four different families, the Thermoproteaceae, the “stiff filaments”, the Desulfurococcaceae and the Thermococcaceae, of a novel order, Thermoproteales. Together with the Sulfolobales, they form the second branch of the urkingdom of the archaebacteria besides that of the methanogens and extreme halophiles. Thermoplasma appears isolated.
A computational proposal for designing structured RNA pools for in vitro selection of RNAs.

PubMed

Kim, Namhee; Gan, Hin Hark; Schlick, Tamar

2007-04-01

Although in vitro selection technology is a versatile experimental tool for discovering novel synthetic RNA molecules, finding complex RNA molecules is difficult because most RNAs identified from random sequence pools are simple motifs, consistent with recent computational analysis of such sequence pools. Thus, enriching in vitro selection pools with complex structures could increase the probability of discovering novel RNAs. Here we develop an approach for engineering sequence pools that links RNA sequence space regions with corresponding structural distributions via a "mixing matrix" approach combined with a graph theory analysis. We define five classes of mixing matrices motivated by covariance mutations in RNA; these constructs define nucleotide transition rates and are applied to chosen starting sequences to yield specific nonrandom pools. We examine the coverage of sequence space as a function of the mixing matrix and starting sequence via clustering analysis. We show that, in contrast to random sequences, which are associated only with a local region of sequence space, our designed pools, including a structured pool for GTP aptamers, can target specific motifs. It follows that experimental synthesis of designed pools can benefit from using optimized starting sequences, mixing matrices, and pool fractions associated with each of our constructed pools as a guide. Automation of our approach could provide practical tools for pool design applications for in vitro selection of RNAs and related problems.
Removal of Covalent Heterogeneity Reveals Simple Folding Behavior for P4-P6 RNA*

PubMed Central

Greenfeld, Max; Solomatin, Sergey V.; Herschlag, Daniel

2011-01-01

RNA folding landscapes have been described alternately as simple and as complex. The limited diversity of RNA residues and the ability of RNA to form stable secondary structures prior to adoption of a tertiary structure would appear to simplify folding relative to proteins. Nevertheless, there is considerable evidence for long-lived misfolded RNA states, and these observations have suggested rugged energy landscapes. Recently, single molecule fluorescence resonance energy transfer (smFRET) studies have exposed heterogeneity in many RNAs, consistent with deeply furrowed rugged landscapes. We turned to an RNA of intermediate complexity, the P4-P6 domain from the Tetrahymena group I intron, to address basic questions in RNA folding. P4-P6 exhibited long-lived heterogeneity in smFRET experiments, but the inability to observe exchange in the behavior of individual molecules led us to probe whether there was a non-conformational origin to this heterogeneity. We determined that routine protocols in RNA preparation and purification, including UV shadowing and heat annealing, cause covalent modifications that alter folding behavior. By taking measures to avoid these treatments and by purifying away damaged P4-P6 molecules, we obtained a population of P4-P6 that gave near-uniform behavior in single molecule studies. Thus, the folding landscape of P4-P6 lacks multiple deep furrows that would trap different P4-P6 molecules in different conformations and contrasts with the molecular heterogeneity that has been seen in many smFRET studies of structured RNAs. The simplicity of P4-P6 allowed us to reliably determine the thermodynamic and kinetic effects of metal ions on folding and to now begin to build more detailed models for RNA folding behavior. PMID:21478155
RNA Structures as Mediators of Neurological Diseases and as Drug Targets.

PubMed

Bernat, Viachaslau; Disney, Matthew D

2015-07-01

RNAs adopt diverse folded structures that are essential for function and thus play critical roles in cellular biology. A striking example of this is the ribosome, a complex, three-dimensionally folded macromolecular machine that orchestrates protein synthesis. Advances in RNA biochemistry, structural and molecular biology, and bioinformatics have revealed other non-coding RNAs whose functions are dictated by their structure. It is not surprising that aberrantly folded RNA structures contribute to disease. In this Review, we provide a brief introduction into RNA structural biology and then describe how RNA structures function in cells and cause or contribute to neurological disease. Finally, we highlight successful applications of rational design principles to provide chemical probes and lead compounds targeting structured RNAs. Based on several examples of well-characterized RNA-driven neurological disorders, we demonstrate how designed small molecules can facilitate the study of RNA dysfunction, elucidating previously unknown roles for RNA in disease, and provide lead therapeutics. Copyright © 2015 Elsevier Inc. All rights reserved.
Trans-acting RNAs as molecular probes for monitoring time-dependent structural change of an RNA complex adapting two structures.

PubMed

Maeda, Yuri; Furuta, Hiroyuki; Ikawa, Yoshiya

2011-03-01

As dynamic structural changes are pivotal for the functions of some classes of RNA molecule, it is important to develop methods to monitor structural changes in RNA in a time-dependent manner without chemical modification. Based on previous reports that trans-acting RNAs can be used as probes for analysis and control of 3D structures of target RNAs, we applied this method to monitor time-dependent structural changes in RNA. We designed and performed a proof-of-principle study using a simple model RNA complex that adopts two different structures as a target. The time-dependent structural changes in the target RNA were successfully monitored using two trans-acting RNAs, which stably form a ternary complex with the bimolecular target RNA and act as a catalyst to join two RNA fragments of the target complex, respectively. Copyright © 2010 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.
Structural insights into RISC assembly facilitated by dsRNA-binding domains of human RNA helicase A (DHX9)

PubMed Central

Fu, Qinqin; Yuan, Y. Adam

2013-01-01

Intensive research interest has focused on small RNA-processing machinery and the RNA-induced silencing complex (RISC), key cellular machines in RNAi pathways. However, the structural mechanism regarding RISC assembly, the primary step linking small RNA processing and RNA-mediated gene silencing, is largely unknown. Human RNA helicase A (DHX9) was reported to function as an RISC-loading factor, and such function is mediated mainly by its dsRNA-binding domains (dsRBDs). Here, we report the crystal structures of human RNA helicase A (RHA) dsRBD1 and dsRBD2 domains in complex with dsRNAs, respectively. Structural analysis not only reveals higher siRNA duplex-binding affinity displayed by dsRBD1, but also identifies a crystallographic dsRBD1 pair of physiological significance in cooperatively recognizing dsRNAs. Structural observations are further validated by isothermal titration calorimetric (ITC) assay. Moreover, co-immunoprecipitation (co-IP) assay coupled with mutagenesis demonstrated that both dsRBDs are required for RISC association, and such association is mediated by dsRNA. Hence, our structural and functional efforts have revealed a potential working model for siRNA recognition by RHA tandem dsRBDs, and together they provide direct structural insights into RISC assembly facilitated by RHA. PMID:23361462
Structural insights into RISC assembly facilitated by dsRNA-binding domains of human RNA helicase A (DHX9).

PubMed

Fu, Qinqin; Yuan, Y Adam

2013-03-01

Intensive research interest has focused on small RNA-processing machinery and the RNA-induced silencing complex (RISC), key cellular machines in RNAi pathways. However, the structural mechanism regarding RISC assembly, the primary step linking small RNA processing and RNA-mediated gene silencing, is largely unknown. Human RNA helicase A (DHX9) was reported to function as an RISC-loading factor, and such function is mediated mainly by its dsRNA-binding domains (dsRBDs). Here, we report the crystal structures of human RNA helicase A (RHA) dsRBD1 and dsRBD2 domains in complex with dsRNAs, respectively. Structural analysis not only reveals higher siRNA duplex-binding affinity displayed by dsRBD1, but also identifies a crystallographic dsRBD1 pair of physiological significance in cooperatively recognizing dsRNAs. Structural observations are further validated by isothermal titration calorimetric (ITC) assay. Moreover, co-immunoprecipitation (co-IP) assay coupled with mutagenesis demonstrated that both dsRBDs are required for RISC association, and such association is mediated by dsRNA. Hence, our structural and functional efforts have revealed a potential working model for siRNA recognition by RHA tandem dsRBDs, and together they provide direct structural insights into RISC assembly facilitated by RHA.
OST-HTH: a novel predicted RNA-binding domain

PubMed Central

2010-01-01

Background The mechanism by which the arthropod Oskar and vertebrate TDRD5/TDRD7 proteins nucleate or organize structurally related ribonucleoprotein (RNP) complexes, the polar granule and nuage, is poorly understood. Using sequence profile searches we identify a novel domain in these proteins that is widely conserved across eukaryotes and bacteria. Results Using contextual information from domain architectures, sequence-structure superpositions and available functional information we predict that this domain is likely to adopt the winged helix-turn-helix fold and bind RNA with a potential specificity for dsRNA. We show that in eukaryotes this domain is often combined in the same polypeptide with protein-protein- or lipid- interaction domains that might play a role in anchoring these proteins to specific cytoskeletal structures. Conclusions Thus, proteins with this domain might have a key role in the recognition and localization of dsRNA, including miRNAs, rasiRNAs and piRNAs hybridized to their targets. In other cases, this domain is fused to ubiquitin-binding, E3 ligase and ubiquitin-like domains indicating a previously under-appreciated role for ubiquitination in regulating the assembly and stability of nuage-like RNP complexes. Both bacteria and eukaryotes encode a conserved family of proteins that combines this predicted RNA-binding domain with a previously uncharacterized domain (DUF88). We present evidence that it is an RNAse belonging to the superfamily that includes the 5'->3' nucleases, PIN and NYN domains and might be recruited to degrade certain RNAs. Reviewers This article was reviewed by Sandor Pongor and Arcady Mushegian. PMID:20302647
Structural control of caspase-generated glutamyl-tRNA synthetase by appended noncatalytic WHEP domains.

PubMed

Halawani, Dalia; Gogonea, Valentin; DiDonato, Joseph A; Pipich, Vitaliy; Yao, Peng; China, Arnab; Topbas, Celalettin; Vasu, Kommireddy; Arif, Abul; Hazen, Stanley L; Fox, Paul L

2018-06-08

Aminoacyl-tRNA synthetases are ubiquitous, evolutionarily conserved enzymes catalyzing the conjugation of amino acids onto cognate tRNAs. During eukaryotic evolution, tRNA synthetases have been the targets of persistent structural modifications. These modifications can be additive, as in the evolutionary acquisition of noncatalytic domains, or subtractive, as in the generation of truncated variants through regulated mechanisms such as proteolytic processing, alternative splicing, or coding region polyadenylation. A unique variant is the human glutamyl-prolyl-tRNA synthetase (EPRS) consisting of two fused synthetases joined by a linker containing three copies of the WHEP domain (termed by its presence in tryptophanyl-, histidyl-, and glutamyl-prolyl-tRNA synthetases). Here, we identify site-selective proteolysis as a mechanism that severs the linkage between the EPRS synthetases in vitro and in vivo Caspase action targeted Asp-929 in the third WHEP domain, thereby separating the two synthetases. Using a neoepitope antibody directed against the newly exposed C terminus, we demonstrate EPRS cleavage at Asp-929 in vitro and in vivo Biochemical and biophysical characterizations of the N-terminally generated EPRS proteoform containing the glutamyl-tRNA synthetase and most of the linker, including two WHEP domains, combined with structural analysis by small-angle neutron scattering, revealed a role for the WHEP domains in modulating conformations of the catalytic core and GSH- S -transferase-C-terminal-like (GST-C) domain. WHEP-driven conformational rearrangement altered GST-C domain interactions and conferred distinct oligomeric states in solution. Collectively, our results reveal long-range conformational changes imposed by the WHEP domains and illustrate how noncatalytic domains can modulate the global structure of tRNA synthetases in complex eukaryotic systems. © 2018 by The American Society for Biochemistry and Molecular Biology, Inc.
Tracking the Dynamic Folding and Unfolding of RNA G-Quadruplexes in Live Cells.

PubMed

Chen, Xiu-Cai; Chen, Shuo-Bin; Dai, Jing; Yuan, Jia-Hao; Ou, Tian-Miao; Huang, Zhi-Shu; Tan, Jia-Heng

2018-04-16

Because of the absence of methods for tracking RNA G-quadruplex dynamics, especially the folding and unfolding of this attractive structure in live cells, understanding of the biological roles of RNA G-quadruplexes is so far limited. Herein, we report a new red-emitting fluorescent probe, QUMA-1, for the selective, continuous, and real-time visualization of RNA G-quadruplexes in live cells. The applications of QUMA-1 in several previously intractable applications, including live-cell imaging of the dynamic folding, unfolding, and movement of RNA G-quadruplexes and the visualization of the unwinding of RNA G-quadruplexes by RNA helicase have been demonstrated. Notably, our real-time results revealed the complexity of the dynamics of RNA G-quadruplexes in live cells. We anticipate that the further application of QUMA-1 in combination with appropriate biological and imaging methods to explore the dynamics of RNA G-quadruplexes will uncover more information about the biological roles of RNA G-quadruplexes. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
RNA-Puzzles Round II: assessment of RNA structure prediction programs applied to three large RNA structures

PubMed Central

Miao, Zhichao; Adamiak, Ryszard W.; Blanchet, Marc-Frédérick; Boniecki, Michal; Bujnicki, Janusz M.; Chen, Shi-Jie; Cheng, Clarence; Chojnowski, Grzegorz; Chou, Fang-Chieh; Cordero, Pablo; Cruz, José Almeida; Ferré-D'Amaré, Adrian R.; Das, Rhiju; Ding, Feng; Dokholyan, Nikolay V.; Dunin-Horkawicz, Stanislaw; Kladwang, Wipapat; Krokhotin, Andrey; Lach, Grzegorz; Magnus, Marcin; Major, François; Mann, Thomas H.; Masquida, Benoît; Matelska, Dorota; Meyer, Mélanie; Peselis, Alla; Popenda, Mariusz; Purzycka, Katarzyna J.; Serganov, Alexander; Stasiewicz, Juliusz; Szachniuk, Marta; Tandon, Arpit; Tian, Siqi; Wang, Jian; Xiao, Yi; Xu, Xiaojun; Zhang, Jinwei; Zhao, Peinan; Zok, Tomasz; Westhof, Eric

2015-01-01

This paper is a report of a second round of RNA-Puzzles, a collective and blind experiment in three-dimensional (3D) RNA structure prediction. Three puzzles, Puzzles 5, 6, and 10, represented sequences of three large RNA structures with limited or no homology with previously solved RNA molecules. A lariat-capping ribozyme, as well as riboswitches complexed to adenosylcobalamin and tRNA, were predicted by seven groups using RNAComposer, ModeRNA/SimRNA, Vfold, Rosetta, DMD, MC-Fold, 3dRNA, and AMBER refinement. Some groups derived models using data from state-of-the-art chemical-mapping methods (SHAPE, DMS, CMCT, and mutate-and-map). The comparisons between the predictions and the three subsequently released crystallographic structures, solved at diffraction resolutions of 2.5–3.2 Å, were carried out automatically using various sets of quality indicators. The comparisons clearly demonstrate the state of present-day de novo prediction abilities as well as the limitations of these state-of-the-art methods. All of the best prediction models have similar topologies to the native structures, which suggests that computational methods for RNA structure prediction can already provide useful structural information for biological problems. However, the prediction accuracy for non-Watson–Crick interactions, key to proper folding of RNAs, is low and some predicted models had high Clash Scores. These two difficulties point to some of the continuing bottlenecks in RNA structure prediction. All submitted models are available for download at http://ahsoka.u-strasbg.fr/rnapuzzles/. PMID:25883046
A widespread class of reverse transcriptase-related cellular genes.

PubMed

Gladyshev, Eugene A; Arkhipova, Irina R

2011-12-20

Reverse transcriptases (RTs) polymerize DNA on RNA templates. They fall into several structurally related but distinct classes and form an assemblage of RT-like enzymes that, in addition to RTs, also includes certain viral RNA-dependent RNA polymerases (RdRP) synthesizing RNA on RNA templates. It is generally believed that most RT-like enzymes originate from retrotransposons or viruses and have no specific function in the host cell, with telomerases being the only notable exception. Here we report on the discovery and properties of a unique class of RT-related cellular genes collectively named rvt. We present evidence that rvts are not components of retrotransposons or viruses, but single-copy genes with a characteristic domain structure that may contain introns in evolutionarily conserved positions, occur in syntenic regions, and evolve under purifying selection. These genes can be found in all major taxonomic groups including protists, fungi, animals, plants, and even bacteria, although they exhibit patchy phylogenetic distribution in each kingdom. We also show that the RVT protein purified from one of its natural hosts, Neurospora crassa, exists in a multimeric form and has the ability to polymerize NTPs as well as dNTPs in vitro, with a strong preference for NTPs, using Mn(2+) as a cofactor. The existence of a previously unknown class of single-copy RT-related genes calls for reevaluation of the current views on evolution and functional roles of RNA-dependent polymerases in living cells.
The paramyxovirus polymerase complex as a target for next-generation anti-paramyxovirus therapeutics

PubMed Central

Cox, Robert; Plemper, Richard K.

2015-01-01

The paramyxovirus family includes major human and animal pathogens, including measles virus, mumps virus, and human respiratory syncytial virus (RSV), as well as the emerging zoonotic Hendra and Nipah viruses. In the U.S., RSV is the leading cause of infant hospitalizations due to viral infectious disease. Despite their clinical significance, effective drugs for the improved management of paramyxovirus disease are lacking. The development of novel anti-paramyxovirus therapeutics is therefore urgently needed. Paramyxoviruses contain RNA genomes of negative polarity, necessitating a virus-encoded RNA-dependent RNA polymerase (RdRp) complex for replication and transcription. Since an equivalent enzymatic activity is absent in host cells, the RdRp complex represents an attractive druggable target, although structure-guided drug development campaigns are hampered by the lack of high-resolution RdRp crystal structures. Here, we review the current structural and functional insight into the paramyxovirus polymerase complex in conjunction with an evaluation of the mechanism of activity and developmental status of available experimental RdRp inhibitors. Our assessment spotlights the importance of the RdRp complex as a premier target for therapeutic intervention and examines how high-resolution insight into the organization of the complex will pave the path toward the structure-guided design and optimization of much-needed next-generation paramyxovirus RdRp blockers. PMID:26029193
The paramyxovirus polymerase complex as a target for next-generation anti-paramyxovirus therapeutics.

PubMed

Cox, Robert; Plemper, Richard K

2015-01-01

The paramyxovirus family includes major human and animal pathogens, including measles virus, mumps virus, and human respiratory syncytial virus (RSV), as well as the emerging zoonotic Hendra and Nipah viruses. In the U.S., RSV is the leading cause of infant hospitalizations due to viral infectious disease. Despite their clinical significance, effective drugs for the improved management of paramyxovirus disease are lacking. The development of novel anti-paramyxovirus therapeutics is therefore urgently needed. Paramyxoviruses contain RNA genomes of negative polarity, necessitating a virus-encoded RNA-dependent RNA polymerase (RdRp) complex for replication and transcription. Since an equivalent enzymatic activity is absent in host cells, the RdRp complex represents an attractive druggable target, although structure-guided drug development campaigns are hampered by the lack of high-resolution RdRp crystal structures. Here, we review the current structural and functional insight into the paramyxovirus polymerase complex in conjunction with an evaluation of the mechanism of activity and developmental status of available experimental RdRp inhibitors. Our assessment spotlights the importance of the RdRp complex as a premier target for therapeutic intervention and examines how high-resolution insight into the organization of the complex will pave the path toward the structure-guided design and optimization of much-needed next-generation paramyxovirus RdRp blockers.
Structure of RNA polymerase complex and genome within a dsRNA virus provides insights into the mechanisms of transcription and assembly.

PubMed

Wang, Xurong; Zhang, Fuxian; Su, Rui; Li, Xiaowu; Chen, Wenyuan; Chen, Qingxiu; Yang, Tao; Wang, Jiawei; Liu, Hongrong; Fang, Qin; Cheng, Lingpeng

2018-06-25

Most double-stranded RNA (dsRNA) viruses transcribe RNA plus strands within a common innermost capsid shell. This process requires coordinated efforts by RNA-dependent RNA polymerase (RdRp) together with other capsid proteins and genomic RNA. Here we report the near-atomic resolution structure of the RdRp protein VP2 in complex with its cofactor protein VP4 and genomic RNA within an aquareovirus capsid using 200-kV cryoelectron microscopy and symmetry-mismatch reconstruction. The structure of these capsid proteins enabled us to observe the elaborate nonicosahedral structure within the double-layered icosahedral capsid. Our structure shows that the RdRp complex is anchored at the inner surface of the capsid shell and interacts with genomic dsRNA and four of the five asymmetrically arranged N termini of the capsid shell proteins under the fivefold axis, implying roles for these N termini in virus assembly. The binding site of the RNA end at VP2 is different from the RNA cap binding site identified in the crystal structure of orthoreovirus RdRp λ3, although the structures of VP2 and λ3 are almost identical. A loop, which was thought to separate the RNA template and transcript, interacts with an apical domain of the capsid shell protein, suggesting a mechanism for regulating RdRp replication and transcription. A conserved nucleoside triphosphate binding site was localized in our RdRp cofactor protein VP4 structure, and interactions between the VP4 and the genomic RNA were identified.

Functional 5' UTR mRNA structures in eukaryotic translation regulation and how to find them.

PubMed

Leppek, Kathrin; Das, Rhiju; Barna, Maria

2018-03-01

RNA molecules can fold into intricate shapes that can provide an additional layer of control of gene expression beyond that of their sequence. In this Review, we discuss the current mechanistic understanding of structures in 5' untranslated regions (UTRs) of eukaryotic mRNAs and the emerging methodologies used to explore them. These structures may regulate cap-dependent translation initiation through helicase-mediated remodelling of RNA structures and higher-order RNA interactions, as well as cap-independent translation initiation through internal ribosome entry sites (IRESs), mRNA modifications and other specialized translation pathways. We discuss known 5' UTR RNA structures and how new structure probing technologies coupled with prospective validation, particularly compensatory mutagenesis, are likely to identify classes of structured RNA elements that shape post-transcriptional control of gene expression and the development of multicellular organisms.
Archaeal RNA polymerase and transcription regulation

PubMed Central

Jun, Sung-Hoon; Reichlen, Matthew J.; Tajiri, Momoko; Murakami, Katsuhiko S.

2010-01-01

To elucidate the mechanism of transcription by cellular RNA polymerases (RNAPs), high resolution X-ray crystal structures together with structure-guided biochemical, biophysical and genetics studies are essential. The recently-solved X-ray crystal structures of archaeal RNA polymerase (RNAP) allow a structural comparison of the transcription machinery among all three domains of life. The archaea were once thought of closely related to bacteria, but they are now considered to be more closely related to the eukaryote at the molecular level than bacteria. According to these structures, the archaeal transcription apparatus, which includes RNAP and general transcription factors, is similar to the eukaryotic transcription machinery. Yet, the transcription regulators, activators and repressors, encoded by archaeal genomes are closely related to bacterial factors. Therefore, archaeal transcription appears to possess an intriguing hybrid of eukaryotic-type transcription apparatus and bacterial-like regulatory mechanisms. Elucidating the transcription mechanism in archaea, which possesses a combination of bacterial and eukaryotic transcription mechanisms that are commonly regarded as separate and mutually exclusive, can provide data that will bring basic transcription mechanisms across all three domains of life. PMID:21250781
On topological RNA interaction structures.

PubMed

Qin, Jing; Reidys, Christian M

2013-07-01

Recently a folding algorithm of topological RNA pseudoknot structures was presented in Reidys et al. (2011). This algorithm folds single-stranded γ-structures, that is, RNA structures composed by distinct motifs of bounded topological genus. In this article, we set the theoretical foundations for the folding of the two backbone analogues of γ structures: the RNA γ-interaction structures. These are RNA-RNA interaction structures that are constructed by a finite number of building blocks over two backbones having genus at most γ. Combinatorial properties of γ-interaction structures are of practical interest since they have direct implications for the folding of topological interaction structures. We compute the generating function of γ-interaction structures and show that it is algebraic, which implies that the numbers of interaction structures can be computed recursively. We obtain simple asymptotic formulas for 0- and 1-interaction structures. The simplest class of interaction structures are the 0-interaction structures, which represent the two backbone analogues of secondary structures.
An evolutionary conserved pattern of 18S rRNA sequence complementarity to mRNA 5′ UTRs and its implications for eukaryotic gene translation regulation

PubMed Central

Pánek, Josef; Kolář, Michal; Vohradský, Jiří; Shivaya Valášek, Leoš

2013-01-01

There are several key mechanisms regulating eukaryotic gene expression at the level of protein synthesis. Interestingly, the least explored mechanisms of translational control are those that involve the translating ribosome per se, mediated for example via predicted interactions between the ribosomal RNAs (rRNAs) and mRNAs. Here, we took advantage of robustly growing large-scale data sets of mRNA sequences for numerous organisms, solved ribosomal structures and computational power to computationally explore the mRNA–rRNA complementarity that is statistically significant across the species. Our predictions reveal highly specific sequence complementarity of 18S rRNA sequences with mRNA 5′ untranslated regions (UTRs) forming a well-defined 3D pattern on the rRNA sequence of the 40S subunit. Broader evolutionary conservation of this pattern may imply that 5′ UTRs of eukaryotic mRNAs, which have already emerged from the mRNA-binding channel, may contact several complementary spots on 18S rRNA situated near the exit of the mRNA binding channel and on the middle-to-lower body of the solvent-exposed 40S ribosome including its left foot. We discuss physiological significance of this structurally conserved pattern and, in the context of previously published experimental results, propose that it modulates scanning of the 40S subunit through 5′ UTRs of mRNAs. PMID:23804757
2-D Structure of the A Region of Xist RNA and Its Implication for PRC2 Association

PubMed Central

Maenner, Sylvain; Blaud, Magali; Fouillen, Laetitia; Savoye, Anne; Marchand, Virginie; Dubois, Agnès; Sanglier-Cianférani, Sarah; Van Dorsselaer, Alain; Clerc, Philippe; Avner, Philip; Visvikis, Athanase; Branlant, Christiane

2010-01-01

In placental mammals, inactivation of one of the X chromosomes in female cells ensures sex chromosome dosage compensation. The 17 kb non-coding Xist RNA is crucial to this process and accumulates on the future inactive X chromosome. The most conserved Xist RNA region, the A region, contains eight or nine repeats separated by U-rich spacers. It is implicated in the recruitment of late inactivated X genes to the silencing compartment and likely in the recruitment of complex PRC2. Little is known about the structure of the A region and more generally about Xist RNA structure. Knowledge of its structure is restricted to an NMR study of a single A repeat element. Our study is the first experimental analysis of the structure of the entire A region in solution. By the use of chemical and enzymatic probes and FRET experiments, using oligonucleotides carrying fluorescent dyes, we resolved problems linked to sequence redundancies and established a 2-D structure for the A region that contains two long stem-loop structures each including four repeats. Interactions formed between repeats and between repeats and spacers stabilize these structures. Conservation of the spacer terminal sequences allows formation of such structures in all sequenced Xist RNAs. By combination of RNP affinity chromatography, immunoprecipitation assays, mass spectrometry, and Western blot analysis, we demonstrate that the A region can associate with components of the PRC2 complex in mouse ES cell nuclear extracts. Whilst a single four-repeat motif is able to associate with components of this complex, recruitment of Suz12 is clearly more efficient when the entire A region is present. Our data with their emphasis on the importance of inter-repeat pairing change fundamentally our conception of the 2-D structure of the A region of Xist RNA and support its possible implication in recruitment of the PRC2 complex. PMID:20052282
Designing and Testing Functional RNA Nanoparticles | Center for Cancer Research

Cancer.gov

Recent advances in nanotechnology have generated excitement that nanomaterials may provide novel approaches for the diagnosis and treatment of deadly diseases, such as cancer. However, the use of synthetic materials to generate nanoparticles can present challenges with endotoxin content, sterility, or biocompatibility. Employing biological materials may overcome these issues with RNA being particularly attractive given the clinical applications of RNA interference and the abundance of functional RNAs, including aptamers and ribozymes. RNA can form stable three-dimensional nanoparticle structures that can be decorated with other nucleic acids, small molecules, or proteins, potentially increasing local concentrations of therapeutic agents and acting synergistically when combined.
Deciphering the role of the Gag-Pol ribosomal frameshift signal in HIV-1 RNA genome packaging.

PubMed

Nikolaitchik, Olga A; Hu, Wei-Shau

2014-04-01

A key step of retroviral replication is packaging of the viral RNA genome during virus assembly. Specific packaging is mediated by interactions between the viral protein Gag and elements in the viral RNA genome. In HIV-1, similar to most retroviruses, the packaging signal is located within the 5' untranslated region and extends into the gag-coding region. A recent study reported that a region including the Gag-Pol ribosomal frameshift signal plays an important role in HIV-1 RNA packaging; deletions or mutations that affect the RNA structure of this signal lead to drastic decreases (10- to 50-fold) in viral RNA packaging and virus titer. We examined here the role of the ribosomal frameshift signal in HIV-1 RNA packaging by studying the RNA packaging and virus titer in the context of proviruses. Three mutants with altered ribosomal frameshift signal, either through direct deletion of the signal, mutation of the 6U slippery sequence, or alterations of the secondary structure were examined. We found that RNAs from all three mutants were packaged efficiently, and they generate titers similar to that of a virus containing the wild-type ribosomal frameshift signal. We conclude that although the ribosomal frameshift signal plays an important role in regulating the replication cycle, this RNA element is not directly involved in regulating RNA encapsidation. To generate infectious viruses, HIV-1 must package viral RNA genome during virus assembly. The specific HIV-1 genome packaging is mediated by interactions between the structural protein Gag and elements near the 5' end of the viral RNA known as packaging signal. In this study, we examined whether the Gag-Pol ribosomal frameshift signal is important for HIV-1 RNA packaging as recently reported. Our results demonstrated that when Gag/Gag-Pol is supplied in trans, none of the tested ribosomal frameshift signal mutants has defects in RNA packaging or virus titer. These studies provide important information on how HIV-1 regulates its genome packaging and generate infectious viruses necessary for transmission to new hosts.
Deciphering the Role of the Gag-Pol Ribosomal Frameshift Signal in HIV-1 RNA Genome Packaging

PubMed Central

Nikolaitchik, Olga A.

2014-01-01

ABSTRACT A key step of retroviral replication is packaging of the viral RNA genome during virus assembly. Specific packaging is mediated by interactions between the viral protein Gag and elements in the viral RNA genome. In HIV-1, similar to most retroviruses, the packaging signal is located within the 5′ untranslated region and extends into the gag-coding region. A recent study reported that a region including the Gag-Pol ribosomal frameshift signal plays an important role in HIV-1 RNA packaging; deletions or mutations that affect the RNA structure of this signal lead to drastic decreases (10- to 50-fold) in viral RNA packaging and virus titer. We examined here the role of the ribosomal frameshift signal in HIV-1 RNA packaging by studying the RNA packaging and virus titer in the context of proviruses. Three mutants with altered ribosomal frameshift signal, either through direct deletion of the signal, mutation of the 6U slippery sequence, or alterations of the secondary structure were examined. We found that RNAs from all three mutants were packaged efficiently, and they generate titers similar to that of a virus containing the wild-type ribosomal frameshift signal. We conclude that although the ribosomal frameshift signal plays an important role in regulating the replication cycle, this RNA element is not directly involved in regulating RNA encapsidation. IMPORTANCE To generate infectious viruses, HIV-1 must package viral RNA genome during virus assembly. The specific HIV-1 genome packaging is mediated by interactions between the structural protein Gag and elements near the 5′ end of the viral RNA known as packaging signal. In this study, we examined whether the Gag-Pol ribosomal frameshift signal is important for HIV-1 RNA packaging as recently reported. Our results demonstrated that when Gag/Gag-Pol is supplied in trans, none of the tested ribosomal frameshift signal mutants has defects in RNA packaging or virus titer. These studies provide important information on how HIV-1 regulates its genome packaging and generate infectious viruses necessary for transmission to new hosts. PMID:24453371
Exploration of RNA structure spaces

NASA Technical Reports Server (NTRS)

Fox, G. E.

1991-01-01

In order to understand the structure of real structure spaces, we are studying the 5S rRNA structure space experimentally. A plasmid containing a synthetic 5S rRNA gene, two rRNA promoters, and transcription terminators has been assembled. Assays are conducted to determine if the foreign 5S rRNA is expressed, and to see whether or not it is incorporated into ribosomes. Evolutionary competition is used to determine the relative fitness of strains containing the foreign 5S rRNA and a control 5S rRNA. By using site directed mutagenesis, a number of mutants can be made in order to study the boundaries of the structure space and how sharply defined they are. By making similar studies in the vicinity of structure space, it will be possible to determine how homogeneous the 5S rRNA structure space is. Useable experimental protocols have been developed, and a number of mutants have already been studied. Initial results suggest an explanation of why single stranded regions of the RNA are less subject to mutation than double stranded regions.
INFO-RNA--a fast approach to inverse RNA folding.

PubMed

Busch, Anke; Backofen, Rolf

2006-08-01

The structure of RNA molecules is often crucial for their function. Therefore, secondary structure prediction has gained much interest. Here, we consider the inverse RNA folding problem, which means designing RNA sequences that fold into a given structure. We introduce a new algorithm for the inverse folding problem (INFO-RNA) that consists of two parts; a dynamic programming method for good initial sequences and a following improved stochastic local search that uses an effective neighbor selection method. During the initialization, we design a sequence that among all sequences adopts the given structure with the lowest possible energy. For the selection of neighbors during the search, we use a kind of look-ahead of one selection step applying an additional energy-based criterion. Afterwards, the pre-ordered neighbors are tested using the actual optimization criterion of minimizing the structure distance between the target structure and the mfe structure of the considered neighbor. We compared our algorithm to RNAinverse and RNA-SSD for artificial and biological test sets. Using INFO-RNA, we performed better than RNAinverse and in most cases, we gained better results than RNA-SSD, the probably best inverse RNA folding tool on the market. www.bioinf.uni-freiburg.de?Subpages/software.html.
NoFold: RNA structure clustering without folding or alignment.

PubMed

Middleton, Sarah A; Kim, Junhyong

2014-11-01

Structures that recur across multiple different transcripts, called structure motifs, often perform a similar function-for example, recruiting a specific RNA-binding protein that then regulates translation, splicing, or subcellular localization. Identifying common motifs between coregulated transcripts may therefore yield significant insight into their binding partners and mechanism of regulation. However, as most methods for clustering structures are based on folding individual sequences or doing many pairwise alignments, this results in a tradeoff between speed and accuracy that can be problematic for large-scale data sets. Here we describe a novel method for comparing and characterizing RNA secondary structures that does not require folding or pairwise alignment of the input sequences. Our method uses the idea of constructing a distance function between two objects by their respective distances to a collection of empirical examples or models, which in our case consists of 1973 Rfam family covariance models. Using this as a basis for measuring structural similarity, we developed a clustering pipeline called NoFold to automatically identify and annotate structure motifs within large sequence data sets. We demonstrate that NoFold can simultaneously identify multiple structure motifs with an average sensitivity of 0.80 and precision of 0.98 and generally exceeds the performance of existing methods. We also perform a cross-validation analysis of the entire set of Rfam families, achieving an average sensitivity of 0.57. We apply NoFold to identify motifs enriched in dendritically localized transcripts and report 213 enriched motifs, including both known and novel structures. © 2014 Middleton and Kim; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Automated and fast building of three-dimensional RNA structures.

PubMed

Zhao, Yunjie; Huang, Yangyu; Gong, Zhou; Wang, Yanjie; Man, Jianfen; Xiao, Yi

2012-01-01

Building tertiary structures of non-coding RNA is required to understand their functions and design new molecules. Current algorithms of RNA tertiary structure prediction give satisfactory accuracy only for small size and simple topology and many of them need manual manipulation. Here, we present an automated and fast program, 3dRNA, for RNA tertiary structure prediction with reasonable accuracy for RNAs of larger size and complex topology.
Fine-grained parallel RNAalifold algorithm for RNA secondary structure prediction on FPGA

PubMed Central

Xia, Fei; Dou, Yong; Zhou, Xingming; Yang, Xuejun; Xu, Jiaqing; Zhang, Yang

2009-01-01

Background In the field of RNA secondary structure prediction, the RNAalifold algorithm is one of the most popular methods using free energy minimization. However, general-purpose computers including parallel computers or multi-core computers exhibit parallel efficiency of no more than 50%. Field Programmable Gate-Array (FPGA) chips provide a new approach to accelerate RNAalifold by exploiting fine-grained custom design. Results RNAalifold shows complicated data dependences, in which the dependence distance is variable, and the dependence direction is also across two dimensions. We propose a systolic array structure including one master Processing Element (PE) and multiple slave PEs for fine grain hardware implementation on FPGA. We exploit data reuse schemes to reduce the need to load energy matrices from external memory. We also propose several methods to reduce energy table parameter size by 80%. Conclusion To our knowledge, our implementation with 16 PEs is the only FPGA accelerator implementing the complete RNAalifold algorithm. The experimental results show a factor of 12.2 speedup over the RNAalifold (ViennaPackage – 1.6.5) software for a group of aligned RNA sequences with 2981-residue running on a Personal Computer (PC) platform with Pentium 4 2.6 GHz CPU. PMID:19208138
RNA-Puzzles: A CASP-like evaluation of RNA three-dimensional structure prediction

PubMed Central

Cruz, José Almeida; Blanchet, Marc-Frédérick; Boniecki, Michal; Bujnicki, Janusz M.; Chen, Shi-Jie; Cao, Song; Das, Rhiju; Ding, Feng; Dokholyan, Nikolay V.; Flores, Samuel Coulbourn; Huang, Lili; Lavender, Christopher A.; Lisi, Véronique; Major, François; Mikolajczak, Katarzyna; Patel, Dinshaw J.; Philips, Anna; Puton, Tomasz; Santalucia, John; Sijenyi, Fredrick; Hermann, Thomas; Rother, Kristian; Rother, Magdalena; Serganov, Alexander; Skorupski, Marcin; Soltysinski, Tomasz; Sripakdeevong, Parin; Tuszynska, Irina; Weeks, Kevin M.; Waldsich, Christina; Wildauer, Michael; Leontis, Neocles B.; Westhof, Eric

2012-01-01

We report the results of a first, collective, blind experiment in RNA three-dimensional (3D) structure prediction, encompassing three prediction puzzles. The goals are to assess the leading edge of RNA structure prediction techniques; compare existing methods and tools; and evaluate their relative strengths, weaknesses, and limitations in terms of sequence length and structural complexity. The results should give potential users insight into the suitability of available methods for different applications and facilitate efforts in the RNA structure prediction community in ongoing efforts to improve prediction tools. We also report the creation of an automated evaluation pipeline to facilitate the analysis of future RNA structure prediction exercises. PMID:22361291
SimRNAweb: a web server for RNA 3D structure modeling with optional restraints.

PubMed

Magnus, Marcin; Boniecki, Michał J; Dawson, Wayne; Bujnicki, Janusz M

2016-07-08

RNA function in many biological processes depends on the formation of three-dimensional (3D) structures. However, RNA structure is difficult to determine experimentally, which has prompted the development of predictive computational methods. Here, we introduce a user-friendly online interface for modeling RNA 3D structures using SimRNA, a method that uses a coarse-grained representation of RNA molecules, utilizes the Monte Carlo method to sample the conformational space, and relies on a statistical potential to describe the interactions in the folding process. SimRNAweb makes SimRNA accessible to users who do not normally use high performance computational facilities or are unfamiliar with using the command line tools. The simplest input consists of an RNA sequence to fold RNA de novo. Alternatively, a user can provide a 3D structure in the PDB format, for instance a preliminary model built with some other technique, to jump-start the modeling close to the expected final outcome. The user can optionally provide secondary structure and distance restraints, and can freeze a part of the starting 3D structure. SimRNAweb can be used to model single RNA sequences and RNA-RNA complexes (up to 52 chains). The webserver is available at http://genesilico.pl/SimRNAweb. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
On structural transitions, thermodynamic equilibrium, and the phase diagram of DNA and RNA duplexes under torque and tension.

PubMed

Wereszczynski, Jeff; Andricioaei, Ioan

2006-10-31

A precise understanding of the flexibility of double stranded nucleic acids and the nature of their deformed conformations induced by external forces is important for a wide range of biological processes including transcriptional regulation, supercoil and catenane removal, and site-specific recombination. We present, at atomic resolution, a simulation of the dynamics involved in the transitions from B-DNA and A-RNA to Pauling (P) forms and to denatured states driven by application of external torque and tension. We then calculate the free energy profile along a B- to P-transition coordinate and from it, compute a reversible pathway, i.e., an isotherm of tension and torque pairs required to maintain P-DNA in equilibrium. The reversible isotherm maps correctly onto a phase diagram derived from single molecule experiments, and yields values of elongation, twist, and twist-stretch coupling in agreement with measured values. We also show that configurational entropy compensates significantly for the large electrostatic energy increase due to closer-packed P backbones. A similar set of simulations applied to RNA are used to predict a novel structure, P-RNA, with its associated free energy, equilibrium tension, torque and structural parameters, and to assign the location, on the phase-diagram, of a putative force-torque-dependent RNA "triple point."
The cell's nucleolus: an emerging target for chemotherapeutic intervention.

PubMed

Pickard, Amanda J; Bierbach, Ulrich

2013-09-01

The transient nucleolus plays a central role in the up-regulated synthesis of ribosomal RNA (rRNA) to sustain ribosome biogenesis, a hallmark of aberrant cell growth. This function, in conjunction with its unique pathohistological features in malignant cells and its ability to mediate apoptosis, renders this sub-nuclear structure a potential target for chemotherapeutic agents. In this Minireview, structurally and functionally diverse small molecules are discussed that have been reported to either interact with the nucleolus directly or perturb its function indirectly by acting on its dynamic components. These molecules include all major classes of nucleic-acid-targeted agents, antimetabolites, kinase inhibitors, anti-inflammatory drugs, natural product antibiotics, oligopeptides, as well as nanoparticles. Together, these molecules are invaluable probes of structure and function of the nucleolus. They also provide a unique opportunity to develop novel strategies for more selective and therefore better-tolerated chemotherapeutic intervention. In this regard, inhibition of RNA polymerase-I-mediated rRNA synthesis appears to be a promising mechanism for killing cancer cells. The recent development of molecules targeted at G-quadruplex-forming rRNA gene sequences, which are currently undergoing clinical trials, seems to attest to the success of this approach. Copyright © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
RNA connectivity requirements between conserved elements in the core of the yeast telomerase RNP

PubMed Central

Mefford, Melissa A; Rafiq, Qundeel; Zappulla, David C

2013-01-01

Telomerase is a specialized chromosome end-replicating enzyme required for genome duplication in many eukaryotes. An RNA and reverse transcriptase protein subunit comprise its enzymatic core. Telomerase is evolving rapidly, particularly its RNA component. Nevertheless, nearly all telomerase RNAs, including those of H. sapiens and S. cerevisiae, share four conserved structural elements: a core-enclosing helix (CEH), template-boundary element, template, and pseudoknot, in this order along the RNA. It is not clear how these elements coordinate telomerase activity. We find that although rearranging the order of the four conserved elements in the yeast telomerase RNA subunit, TLC1, disrupts activity, the RNA ends can be moved between the template and pseudoknot in vitro and in vivo. However, the ends disrupt activity when inserted between the other structured elements, defining an Area of Required Connectivity (ARC). Within the ARC, we find that only the junction nucleotides between the pseudoknot and CEH are essential. Integrating all of our findings provides a basic map of functional connections in the core of the yeast telomerase RNP and a framework to understand conserved element coordination in telomerase mechanism. PMID:24129512
Using in-cell SHAPE-Seq and simulations to probe structure-function design principles of RNA transcriptional regulators.

PubMed

Takahashi, Melissa K; Watters, Kyle E; Gasper, Paul M; Abbott, Timothy R; Carlson, Paul D; Chen, Alan A; Lucks, Julius B

2016-06-01

Antisense RNA-mediated transcriptional regulators are powerful tools for controlling gene expression and creating synthetic gene networks. RNA transcriptional repressors derived from natural mechanisms called attenuators are particularly versatile, though their mechanistic complexity has made them difficult to engineer. Here we identify a new structure-function design principle for attenuators that enables the forward engineering of new RNA transcriptional repressors. Using in-cell SHAPE-Seq to characterize the structures of attenuator variants within Escherichia coli, we show that attenuator hairpins that facilitate interaction with antisense RNAs require interior loops for proper function. Molecular dynamics simulations of these attenuator variants suggest these interior loops impart structural flexibility. We further observe hairpin flexibility in the cellular structures of natural RNA mechanisms that use antisense RNA interactions to repress translation, confirming earlier results from in vitro studies. Finally, we design new transcriptional attenuators in silico using an interior loop as a structural requirement and show that they function as desired in vivo. This work establishes interior loops as an important structural element for designing synthetic RNA gene regulators. We anticipate that the coupling of experimental measurement of cellular RNA structure and function with computational modeling will enable rapid discovery of structure-function design principles for a diverse array of natural and synthetic RNA regulators. © 2016 Takahashi et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
A new way to see RNA

PubMed Central

Keating, Kevin S.; Humphris, Elisabeth L.; Pyle, Anna Marie

2015-01-01

Unlike proteins, the RNA backbone has numerous degrees of freedom (eight, if one counts the sugar pucker), making RNA modeling, structure building and prediction a multidimensional problem of exceptionally high complexity. And yet RNA tertiary structures are not infinite in their structural morphology; rather, they are built from a limited set of discrete units. In order to reduce the dimensionality of the RNA backbone in a physically reasonable way, a shorthand notation was created that reduced the RNA backbone torsion angles to two (η and θ, analogous to ϕ and ψ in proteins). When these torsion angles are calculated for nucleotides in a crystallographic database and plotted against one another, one obtains a plot analogous to a Ramachandran plot (the η/θ plot), with highly populated and unpopulated regions. Nucleotides that occupy proximal positions on the plot have identical structures and are found in the same units of tertiary structure. In this review, we describe the statistical validation of the η/θ formalism and the exploration of features within the η/θ plot. We also describe the application of the η/θ formalism in RNA motif discovery, structural comparison, RNA structure building and tertiary structure prediction. More than a tool, however, the η/θ formalism has provided new insights into RNA structure itself, revealing its fundamental components and the factors underlying RNA architectural form. PMID:21729350

SONAR Discovers RNA-Binding Proteins from Analysis of Large-Scale Protein-Protein Interactomes.

PubMed

Brannan, Kristopher W; Jin, Wenhao; Huelga, Stephanie C; Banks, Charles A S; Gilmore, Joshua M; Florens, Laurence; Washburn, Michael P; Van Nostrand, Eric L; Pratt, Gabriel A; Schwinn, Marie K; Daniels, Danette L; Yeo, Gene W

2016-10-20

RNA metabolism is controlled by an expanding, yet incomplete, catalog of RNA-binding proteins (RBPs), many of which lack characterized RNA binding domains. Approaches to expand the RBP repertoire to discover non-canonical RBPs are currently needed. Here, HaloTag fusion pull down of 12 nuclear and cytoplasmic RBPs followed by quantitative mass spectrometry (MS) demonstrates that proteins interacting with multiple RBPs in an RNA-dependent manner are enriched for RBPs. This motivated SONAR, a computational approach that predicts RNA binding activity by analyzing large-scale affinity precipitation-MS protein-protein interactomes. Without relying on sequence or structure information, SONAR identifies 1,923 human, 489 fly, and 745 yeast RBPs, including over 100 human candidate RBPs that contain zinc finger domains. Enhanced CLIP confirms RNA binding activity and identifies transcriptome-wide RNA binding sites for SONAR-predicted RBPs, revealing unexpected RNA binding activity for disease-relevant proteins and DNA binding proteins. Copyright © 2016 Elsevier Inc. All rights reserved.
Global RNA association with the transcriptionally active chromosome of chloroplasts.

PubMed

Lehniger, Marie-Kristin; Finster, Sabrina; Melonek, Joanna; Oetke, Svenja; Krupinska, Karin; Schmitz-Linneweber, Christian

2017-10-01

Processed chloroplast RNAs are co-enriched with preparations of the chloroplast transcriptionally active chromosome. Chloroplast genomes are organized as a polyploid DNA-protein structure called the nucleoid. Transcriptionally active chloroplast DNA together with tightly bound protein factors can be purified by gel filtration as a functional entity called the transcriptionally active chromosome (TAC). Previous proteomics analyses of nucleoids and of TACs demonstrated a considerable overlap in protein composition including RNA binding proteins. Therefore the RNA content of TAC preparations from Nicotiana tabacum was determined using whole genome tiling arrays. A large number of chloroplast RNAs was found to be associated with the TAC. The pattern of RNAs attached to the TAC consists of RNAs produced by different chloroplast RNA polymerases and differs from the pattern of RNA found in input controls. An analysis of RNA splicing and RNA editing of selected RNA species demonstrated that TAC-associated RNAs are processed to a similar extent as the RNA in input controls. Thus, TAC fractions contain a specific subset of the processed chloroplast transcriptome.
Identification of nucleolus-associated chromatin domains reveals the role of the nucleolus in the 3D organisation of the A. thaliana genome

PubMed Central

Pontvianne, Frédéric; Carpentier, Marie-Christine; Durut, Nathalie; Pavlištová, Veronika; Jaške, Karin; Schořová, Šárka; Parrinello, Hugues; Rohmer, Marine; Pikaard, Craig S; Fojtová, Miloslava; Fajkus, Jiří; Saez-Vasquez, Julio

2017-01-01

The nucleolus is the site of ribosomal RNA (rRNA) gene transcription, rRNA processing and ribosome biogenesis. However, the nucleolus also plays additional roles in the cell. We isolated nucleoli by Fluorescence Activated Cell Sorting (FACS) and identified Nucleolus-Associated Chromatin Domains (NADs) by deep sequencing, comparing wild-type plants and null mutants for the nucleolar protein, NUCLEOLIN 1 (NUC1). NADs are primarily genomic regions with heterochromatic signatures and include transposable elements (TEs), sub-telomeric regions and mostly inactive protein-coding genes. However, NADs also include active ribosomal RNA genes, and the entire short arm of chromosome 4 adjacent to them. In nuc1 null mutants, which alter rRNA gene expression and overall nucleolar structure, NADs are altered, telomere association with the nucleolus is decreased and telomeres become shorter. Collectively, our studies reveal roles for NUC1 and the nucleolus in the spatial organization of chromosomes as well as telomere maintenance. PMID:27477271
Four RNA families with functional transient structures

PubMed Central

Zhu, Jing Yun A; Meyer, Irmtraud M

2015-01-01

Protein-coding and non-coding RNA transcripts perform a wide variety of cellular functions in diverse organisms. Several of their functional roles are expressed and modulated via RNA structure. A given transcript, however, can have more than a single functional RNA structure throughout its life, a fact which has been previously overlooked. Transient RNA structures, for example, are only present during specific time intervals and cellular conditions. We here introduce four RNA families with transient RNA structures that play distinct and diverse functional roles. Moreover, we show that these transient RNA structures are structurally well-defined and evolutionarily conserved. Since Rfam annotates one structure for each family, there is either no annotation for these transient structures or no such family. Thus, our alignments either significantly update and extend the existing Rfam families or introduce a new RNA family to Rfam. For each of the four RNA families, we compile a multiple-sequence alignment based on experimentally verified transient and dominant (dominant in terms of either the thermodynamic stability and/or attention received so far) RNA secondary structures using a combination of automated search via covariance model and manual curation. The first alignment is the Trp operon leader which regulates the operon transcription in response to tryptophan abundance through alternative structures. The second alignment is the HDV ribozyme which we extend to the 5′ flanking sequence. This flanking sequence is involved in the regulation of the transcript's self-cleavage activity. The third alignment is the 5′ UTR of the maturation protein from Levivirus which contains a transient structure that temporarily postpones the formation of the final inhibitory structure to allow translation of maturation protein. The fourth and last alignment is the SAM riboswitch which regulates the downstream gene expression by assuming alternative structures upon binding of SAM. All transient and dominant structures are mapped to our new alignments introduced here. PMID:25751035
Four RNA families with functional transient structures.

PubMed

Zhu, Jing Yun A; Meyer, Irmtraud M

2015-01-01

Protein-coding and non-coding RNA transcripts perform a wide variety of cellular functions in diverse organisms. Several of their functional roles are expressed and modulated via RNA structure. A given transcript, however, can have more than a single functional RNA structure throughout its life, a fact which has been previously overlooked. Transient RNA structures, for example, are only present during specific time intervals and cellular conditions. We here introduce four RNA families with transient RNA structures that play distinct and diverse functional roles. Moreover, we show that these transient RNA structures are structurally well-defined and evolutionarily conserved. Since Rfam annotates one structure for each family, there is either no annotation for these transient structures or no such family. Thus, our alignments either significantly update and extend the existing Rfam families or introduce a new RNA family to Rfam. For each of the four RNA families, we compile a multiple-sequence alignment based on experimentally verified transient and dominant (dominant in terms of either the thermodynamic stability and/or attention received so far) RNA secondary structures using a combination of automated search via covariance model and manual curation. The first alignment is the Trp operon leader which regulates the operon transcription in response to tryptophan abundance through alternative structures. The second alignment is the HDV ribozyme which we extend to the 5' flanking sequence. This flanking sequence is involved in the regulation of the transcript's self-cleavage activity. The third alignment is the 5' UTR of the maturation protein from Levivirus which contains a transient structure that temporarily postpones the formation of the final inhibitory structure to allow translation of maturation protein. The fourth and last alignment is the SAM riboswitch which regulates the downstream gene expression by assuming alternative structures upon binding of SAM. All transient and dominant structures are mapped to our new alignments introduced here.
FRASS: the web-server for RNA structural comparison

PubMed Central

2010-01-01

Background The impressive increase of novel RNA structures, during the past few years, demands automated methods for structure comparison. While many algorithms handle only small motifs, few techniques, developed in recent years, (ARTS, DIAL, SARA, SARSA, and LaJolla) are available for the structural comparison of large and intact RNA molecules. Results The FRASS web-server represents a RNA chain with its Gauss integrals and allows one to compare structures of RNA chains and to find similar entries in a database derived from the Protein Data Bank. We observed that FRASS scores correlate well with the ARTS and LaJolla similarity scores. Moreover, the-web server can also reproduce satisfactorily the DARTS classification of RNA 3D structures and the classification of the SCOR functions that was obtained by the SARA method. Conclusions The FRASS web-server can be easily used to detect relationships among RNA molecules and to scan efficiently the rapidly enlarging structural databases. PMID:20553602
Functional 5′ UTR mRNA structures in eukaryotic translation regulation and how to find them

PubMed Central

Leppek, Kathrin; Das, Rhiju; Barna, Maria

2017-01-01

RNA molecules can fold into intricate shapes that can provide an additional layer of control of gene expression beyond that of their sequence. In this Review, we discuss the current mechanistic understanding of structures in 5′ untranslated regions (UTRs) of eukaryotic mRNAs and the emerging methodologies used to explore them. These structures may regulate cap-dependent translation initiation through helicase-mediated remodelling of RNA structures and higher-order RNA interactions, as well as cap-independent translation initiation through internal ribosome entry sites (IRESs), mRNA modifications and other specialized translation pathways. We discuss known 5′ UTR RNA structures and how new structure probing technologies coupled with prospective validation, particularly compensatory mutagenesis, are likely to identify classes of structured RNA elements that shape post-transcriptional control of gene expression and the development of multicellular organisms. PMID:29165424
RNA therapeutics: RNAi and antisense mechanisms and clinical applications.

PubMed

Chery, Jessica

2016-07-01

RNA therapeutics refers to the use of oligonucleotides to target primarily ribonucleic acids (RNA) for therapeutic efforts or in research studies to elucidate functions of genes. Oligonucleotides are distinct from other pharmacological modalities, such as small molecules and antibodies that target mainly proteins, due to their mechanisms of action and chemical properties. Nucleic acids come in two forms: deoxyribonucleic acids (DNA) and ribonucleic acids (RNA). Although DNA is more stable, RNA offers more structural variety ranging from messenger RNA (mRNA) that codes for protein to non-coding RNAs, microRNA (miRNA), transfer RNA (tRNA), short interfering RNAs (siRNAs), ribosomal RNA (rRNA), and long-noncoding RNAs (lncRNAs). As our understanding of the wide variety of RNAs deepens, researchers have sought to target RNA since >80% of the genome is estimated to be transcribed. These transcripts include non-coding RNAs such as miRNAs and siRNAs that function in gene regulation by playing key roles in the transfer of genetic information from DNA to protein, the final product of the central dogma in biology 1 . Currently there are two main approaches used to target RNA: double stranded RNA-mediated interference (RNAi) and antisense oligonucleotides (ASO). Both approaches are currently in clinical trials for targeting of RNAs involved in various diseases, such as cancer and neurodegeneration. In fact, ASOs targeting spinal muscular atrophy and amyotrophic lateral sclerosis have shown positive results in clinical trials 2 . Advantages of ASOs include higher affinity due to the development of chemical modifications that increase affinity, selectivity while decreasing toxicity due to off-target effects. This review will highlight the major therapeutic approaches of RNA medicine currently being applied with a focus on RNAi and ASOs.
Poly(A) RNA a new component of Cajal bodies.

PubMed

Kołowerzo, Agnieszka; Smoliński, Dariusz Jan; Bednarska, Elzbieta

2009-07-01

In European larch microsporocytes, spherical structures 0.5 to 6 microm in diameter are present in which poly(A) RNA accumulates. There were one to several bodies per cell and they were often present in the vicinity of the nucleolus. No nascent transcripts were observed within them. Splicing factors of the SR family, including protein SC35, which participates in bringing the 3' and 5' sites closer in the splicing reaction, were also not observed. The absence of the above-mentioned elements within bodies containing poly(A) RNA disqualifies them as sites of synthesis and preliminary stages of primary transcript maturation. However, they contained abundant elements of the splicing machinery commonly occurring in Cajal bodies, i.e., Sm proteins or small nuclear RNA (snRNA). The molecular composition as well as the characteristic ultrastructure of bodies containing poly(A) RNA proves that these were Cajal bodies. This is the first report of such poly(A) RNA localization.
13Check_RNA: A tool to evaluate 13C chemical shifts assignments of RNA.

PubMed

Icazatti, A A; Martin, O A; Villegas, M; Szleifer, I; Vila, J A

2018-06-19

Chemical shifts (CS) are an important source of structural information of macromolecules such as RNA. In addition to the scarce availability of CS for RNA, the observed values are prone to errors due to a wrong re-calibration or miss assignments. Different groups have dedicated their efforts to correct CS systematic errors on RNA. Despite this, there are not automated and freely available algorithms for correct assignments of RNA 13C CS before their deposition to the BMRB or re-reference already deposited CS with systematic errors. Based on an existent method we have implemented an open source python module to correct 13C CS (from here on 13Cexp) systematic errors of RNAs and then return the results in 3 formats including the nmrstar one. This software is available on GitHub at https://github.com/BIOS-IMASL/13Check_RNA under a MIT license. Supplementary data are available at Bioinformatics online.
K-Partite RNA Secondary Structures

NASA Astrophysics Data System (ADS)

Jiang, Minghui; Tejada, Pedro J.; Lasisi, Ramoni O.; Cheng, Shanhong; Fechser, D. Scott

RNA secondary structure prediction is a fundamental problem in structural bioinformatics. The prediction problem is difficult because RNA secondary structures may contain pseudoknots formed by crossing base pairs. We introduce k-partite secondary structures as a simple classification of RNA secondary structures with pseudoknots. An RNA secondary structure is k-partite if it is the union of k pseudoknot-free sub-structures. Most known RNA secondary structures are either bipartite or tripartite. We show that there exists a constant number k such that any secondary structure can be modified into a k-partite secondary structure with approximately the same free energy. This offers a partial explanation of the prevalence of k-partite secondary structures with small k. We give a complete characterization of the computational complexities of recognizing k-partite secondary structures for all k ≥ 2, and show that this recognition problem is essentially the same as the k-colorability problem on circle graphs. We present two simple heuristics, iterated peeling and first-fit packing, for finding k-partite RNA secondary structures. For maximizing the number of base pair stackings, our iterated peeling heuristic achieves a constant approximation ratio of at most k for 2 ≤ k ≤ 5, and at most frac6{1-(1-6/k)^k} le frac6{1-e^{-6}} < 6.01491 for k ≥ 6. Experiment on sequences from PseudoBase shows that our first-fit packing heuristic outperforms the leading method HotKnots in predicting RNA secondary structures with pseudoknots. Source code, data set, and experimental results are available at http://www.cs.usu.edu/ mjiang/rna/kpartite/.
Computer-Aided Design of RNA Origami Structures.

PubMed

Sparvath, Steffen L; Geary, Cody W; Andersen, Ebbe S

2017-01-01

RNA nanostructures can be used as scaffolds to organize, combine, and control molecular functionalities, with great potential for applications in nanomedicine and synthetic biology. The single-stranded RNA origami method allows RNA nanostructures to be folded as they are transcribed by the RNA polymerase. RNA origami structures provide a stable framework that can be decorated with functional RNA elements such as riboswitches, ribozymes, interaction sites, and aptamers for binding small molecules or protein targets. The rich library of RNA structural and functional elements combined with the possibility to attach proteins through aptamer-based binding creates virtually limitless possibilities for constructing advanced RNA-based nanodevices.In this chapter we provide a detailed protocol for the single-stranded RNA origami design method using a simple 2-helix tall structure as an example. The first step involves 3D modeling of a double-crossover between two RNA double helices, followed by decoration with tertiary motifs. The second step deals with the construction of a 2D blueprint describing the secondary structure and sequence constraints that serves as the input for computer programs. In the third step, computer programs are used to design RNA sequences that are compatible with the structure, and the resulting outputs are evaluated and converted into DNA sequences to order.
ARMOUR - A Rice miRNA: mRNA Interaction Resource.

PubMed

Sanan-Mishra, Neeti; Tripathi, Anita; Goswami, Kavita; Shukla, Rohit N; Vasudevan, Madavan; Goswami, Hitesh

2018-01-01

ARMOUR was developed as A Rice miRNA:mRNA interaction resource. This informative and interactive database includes the experimentally validated expression profiles of miRNAs under different developmental and abiotic stress conditions across seven Indian rice cultivars. This comprehensive database covers 689 known and 1664 predicted novel miRNAs and their expression profiles in more than 38 different tissues or conditions along with their predicted/known target transcripts. The understanding of miRNA:mRNA interactome in regulation of functional cellular machinery is supported by the sequence information of the mature and hairpin structures. ARMOUR provides flexibility to users in querying the database using multiple ways like known gene identifiers, gene ontology identifiers, KEGG identifiers and also allows on the fly fold change analysis and sequence search query with inbuilt BLAST algorithm. ARMOUR database provides a cohesive platform for novel and mature miRNAs and their expression in different experimental conditions and allows searching for their interacting mRNA targets, GO annotation and their involvement in various biological pathways. The ARMOUR database includes a provision for adding more experimental data from users, with an aim to develop it as a platform for sharing and comparing experimental data contributed by research groups working on rice.
Fabrication of pRNA nanoparticles to deliver therapeutic RNAs and bioactive compounds into tumor cells

PubMed Central

Shu, Yi; Shu, Dan; Haque, Farzin; Guo, Peixuan

2013-01-01

RNA nanotechnology is a term that refers to the design, fabrication, and utilization of nanoparticles mainly composed of ribonucleic acids via bottom-up self-assembly. The packaging RNA (pRNA) of the bacteriophage phi29 DNA packaging motor has been developed into a nano-delivery platform. This protocol describes the synthesis, assembly, and functionalization of pRNA nanoparticles based on three ‘toolkits’ derived from pRNA structural features: interlocking loops for hand-in-hand interactions, palindrome sequences for foot-to-foot interactions, and an RNA three-way junction for branch-extension. siRNAs, ribozymes, aptamers, chemical ligands, fluorophores, and other functionalities can also be fused to the pRNA prior to the assembly of the nanoparticles, so as to ensure the production of homogeneous nanoparticles and the retention of appropriate folding and function of the incorporated modules. The resulting self-assembled multivalent pRNA nanoparticles are thermodynamically and chemically stable, and they remain intact at ultra-low concentrations. Gene silencing effects are progressively enhanced with increasing number of siRNA in each pRNA nanoparticle. Systemic injection of the pRNA nanoparticles into xenograft-bearing mice has revealed strong binding to tumors without accumulation in vital organs or tissues. The pRNA-based nano-delivery scaffold paves a new way towards nanotechnological application of pRNA-based nanoparticles for disease detection and treatment. The time required for completing one round of this protocol is 3–4 weeks, including in vitro functional assays, or 2–3 months including in vivo studies. PMID:23928498
Small molecules targeting viral RNA.

PubMed

Hermann, Thomas

2016-11-01

Highly conserved noncoding RNA (ncRNA) elements in viral genomes and transcripts offer new opportunities to expand the repertoire of drug targets for the development of antiinfective therapy. Ligands binding to ncRNA architectures are able to affect interactions, structural stability or conformational changes and thereby block processes essential for viral replication. Proof of concept for targeting functional RNA by small molecule inhibitors has been demonstrated for multiple viruses with RNA genomes. Strategies to identify antiviral compounds as inhibitors of ncRNA are increasingly emphasizing consideration of drug-like properties of candidate molecules emerging from screening and ligand design. Recent efforts of antiviral lead discovery for RNA targets have provided drug-like small molecules that inhibit viral replication and include inhibitors of human immunodeficiency virus (HIV), hepatitis C virus (HCV), severe respiratory syndrome coronavirus (SARS CoV), and influenza A virus. While target selectivity remains a challenge for the discovery of useful RNA-binding compounds, a better understanding is emerging of properties that define RNA targets amenable for inhibition by small molecule ligands. Insight from successful approaches of targeting viral ncRNA in HIV, HCV, SARS CoV, and influenza A will provide a basis for the future exploration of RNA targets for therapeutic intervention in other viral pathogens which create urgent, unmet medical needs. Viruses for which targeting ncRNA components in the genome or transcripts may be promising include insect-borne flaviviruses (Dengue, Zika, and West Nile) and filoviruses (Ebola and Marburg). WIREs RNA 2016, 7:726-743. doi: 10.1002/wrna.1373 For further resources related to this article, please visit the WIREs website. © 2016 Wiley Periodicals, Inc.
Automated 3D structure composition for large RNAs

PubMed Central

Popenda, Mariusz; Szachniuk, Marta; Antczak, Maciej; Purzycka, Katarzyna J.; Lukasiak, Piotr; Bartol, Natalia; Blazewicz, Jacek; Adamiak, Ryszard W.

2012-01-01

Understanding the numerous functions that RNAs play in living cells depends critically on knowledge of their three-dimensional structure. Due to the difficulties in experimentally assessing structures of large RNAs, there is currently great demand for new high-resolution structure prediction methods. We present the novel method for the fully automated prediction of RNA 3D structures from a user-defined secondary structure. The concept is founded on the machine translation system. The translation engine operates on the RNA FRABASE database tailored to the dictionary relating the RNA secondary structure and tertiary structure elements. The translation algorithm is very fast. Initial 3D structure is composed in a range of seconds on a single processor. The method assures the prediction of large RNA 3D structures of high quality. Our approach needs neither structural templates nor RNA sequence alignment, required for comparative methods. This enables the building of unresolved yet native and artificial RNA structures. The method is implemented in a publicly available, user-friendly server RNAComposer. It works in an interactive mode and a batch mode. The batch mode is designed for large-scale modelling and accepts atomic distance restraints. Presently, the server is set to build RNA structures of up to 500 residues. PMID:22539264
SARNAclust: Semi-automatic detection of RNA protein binding motifs from immunoprecipitation data

PubMed Central

Dotu, Ivan; Adamson, Scott I.; Coleman, Benjamin; Fournier, Cyril; Ricart-Altimiras, Emma; Eyras, Eduardo

2018-01-01

RNA-protein binding is critical to gene regulation, controlling fundamental processes including splicing, translation, localization and stability, and aberrant RNA-protein interactions are known to play a role in a wide variety of diseases. However, molecular understanding of RNA-protein interactions remains limited; in particular, identification of RNA motifs that bind proteins has long been challenging, especially when such motifs depend on both sequence and structure. Moreover, although RNA binding proteins (RBPs) often contain more than one binding domain, algorithms capable of identifying more than one binding motif simultaneously have not been developed. In this paper we present a novel pipeline to determine binding peaks in crosslinking immunoprecipitation (CLIP) data, to discover multiple possible RNA sequence/structure motifs among them, and to experimentally validate such motifs. At the core is a new semi-automatic algorithm SARNAclust, the first unsupervised method to identify and deconvolve multiple sequence/structure motifs simultaneously. SARNAclust computes similarity between sequence/structure objects using a graph kernel, providing the ability to isolate the impact of specific features through the bulge graph formalism. Application of SARNAclust to synthetic data shows its capability of clustering 5 motifs at once with a V-measure value of over 0.95, while GraphClust achieves only a V-measure of 0.083 and RNAcontext cannot detect any of the motifs. When applied to existing eCLIP sets, SARNAclust finds known motifs for SLBP and HNRNPC and novel motifs for several other RBPs such as AGGF1, AKAP8L and ILF3. We demonstrate an experimental validation protocol, a targeted Bind-n-Seq-like high-throughput sequencing approach that relies on RNA inverse folding for oligo pool design, that can validate the components within the SLBP motif. Finally, we use this protocol to experimentally interrogate the SARNAclust motif predictions for protein ILF3. Our results support a newly identified partially double-stranded UUUUUGAGA motif similar to that known for the splicing factor HNRNPC. PMID:29596423
An Accurate Scalable Template-based Alignment Algorithm

PubMed Central

Gardner, David P.; Xu, Weijia; Miranker, Daniel P.; Ozer, Stuart; Cannone, Jamie J.; Gutell, Robin R.

2013-01-01

The rapid determination of nucleic acid sequences is increasing the number of sequences that are available. Inherent in a template or seed alignment is the culmination of structural and functional constraints that are selecting those mutations that are viable during the evolution of the RNA. While we might not understand these structural and functional, template-based alignment programs utilize the patterns of sequence conservation to encapsulate the characteristics of viable RNA sequences that are aligned properly. We have developed a program that utilizes the different dimensions of information in rCAD, a large RNA informatics resource, to establish a profile for each position in an alignment. The most significant include sequence identity and column composition in different phylogenetic taxa. We have compared our methods with a maximum of eight alternative alignment methods on different sets of 16S and 23S rRNA sequences with sequence percent identities ranging from 50% to 100%. The results showed that CRWAlign outperformed the other alignment methods in both speed and accuracy. A web-based alignment server is available at http://www.rna.ccbb.utexas.edu/SAE/2F/CRWAlign. PMID:24772376
High-Density Proximity Mapping Reveals the Subcellular Organization of mRNA-Associated Granules and Bodies.

PubMed

Youn, Ji-Young; Dunham, Wade H; Hong, Seo Jung; Knight, James D R; Bashkurov, Mikhail; Chen, Ginny I; Bagci, Halil; Rathod, Bhavisha; MacLeod, Graham; Eng, Simon W M; Angers, Stéphane; Morris, Quaid; Fabian, Marc; Côté, Jean-François; Gingras, Anne-Claude

2018-02-01

mRNA processing, transport, translation, and ultimately degradation involve a series of dedicated protein complexes that often assemble into large membraneless structures such as stress granules (SGs) and processing bodies (PBs). Here, systematic in vivo proximity-dependent biotinylation (BioID) analysis of 119 human proteins associated with different aspects of mRNA biology uncovers 7424 unique proximity interactions with 1,792 proteins. Classical bait-prey analysis reveals connections of hundreds of proteins to distinct mRNA-associated processes or complexes, including the splicing and transcriptional elongation machineries (protein phosphatase 4) and the CCR4-NOT deadenylase complex (CEP85, RNF219, and KIAA0355). Analysis of correlated patterns between endogenous preys uncovers the spatial organization of RNA regulatory structures and enables the definition of 144 core components of SGs and PBs. We report preexisting contacts between most core SG proteins under normal growth conditions and demonstrate that several core SG proteins (UBAP2L, CSDE1, and PRRC2C) are critical for the formation of microscopically visible SGs. Copyright © 2017 Elsevier Inc. All rights reserved.
High resolution atomic force microscopy of double-stranded RNA.

PubMed

Ares, Pablo; Fuentes-Perez, Maria Eugenia; Herrero-Galán, Elías; Valpuesta, José M; Gil, Adriana; Gomez-Herrero, Julio; Moreno-Herrero, Fernando

2016-06-09

Double-stranded (ds) RNA mediates the suppression of specific gene expression, it is the genetic material of a number of viruses, and a key activator of the innate immune response against viral infections. The ever increasing list of roles played by dsRNA in the cell and its potential biotechnological applications over the last decade has raised an interest for the characterization of its mechanical properties and structure, and that includes approaches using Atomic Force Microscopy (AFM) and other single-molecule techniques. Recent reports have resolved the structure of dsDNA with AFM at unprecedented resolution. However, an equivalent study with dsRNA is still lacking. Here, we have visualized the double helix of dsRNA under near-physiological conditions and at sufficient resolution to resolve the A-form sub-helical pitch periodicity. We have employed different high-sensitive force-detection methods and obtained images with similar spatial resolution. Therefore, we show here that the limiting factors for high-resolution AFM imaging of soft materials in liquid medium are, rather than the imaging mode, the force between the tip and the sample and the sharpness of the tip apex.

Impact of target mRNA structure on siRNA silencing efficiency: A large-scale study.

PubMed

Gredell, Joseph A; Berger, Angela K; Walton, S Patrick

2008-07-01

The selection of active siRNAs is generally based on identifying siRNAs with certain sequence and structural properties. However, the efficiency of RNA interference has also been shown to depend on the structure of the target mRNA, primarily through studies using exogenous transcripts with well-defined secondary structures in the vicinity of the target sequence. While these studies provide a means for examining the impact of target sequence and structure independently, the predicted secondary structures for these transcripts are often not reflective of structures that form in full-length, native mRNAs where interactions can occur between relatively remote segments of the mRNAs. Here, using a combination of experimental results and analysis of a large dataset, we demonstrate that the accessibility of certain local target structures on the mRNA is an important determinant in the gene silencing ability of siRNAs. siRNAs targeting the enhanced green fluorescent protein were chosen using a minimal siRNA selection algorithm followed by classification based on the predicted minimum free energy structures of the target transcripts. Transfection into HeLa and HepG2 cells revealed that siRNAs targeting regions of the mRNA predicted to have unpaired 5'- and 3'-ends resulted in greater gene silencing than regions predicted to have other types of secondary structure. These results were confirmed by analysis of gene silencing data from previously published siRNAs, which showed that mRNA target regions unpaired at either the 5'-end or 3'-end were silenced, on average, approximately 10% more strongly than target regions unpaired in the center or primarily paired throughout. We found this effect to be independent of the structure of the siRNA guide strand. Taken together, these results suggest minimal requirements for nucleation of hybridization between the siRNA guide strand and mRNA and that both mRNA and guide strand structure should be considered when choosing candidate siRNAs. (c) 2008 Wiley Periodicals, Inc.
Impact of target mRNA structure on siRNA silencing efficiency: a large-scale study

PubMed Central

Gredell, Joseph A.; Berger, Angela K.; Walton, S. Patrick

2009-01-01

The selection of active siRNAs is generally based on identifying siRNAs with certain sequence and structural properties. However, the efficiency of RNA interference has also been shown to depend on the structure of the target mRNA, primarily through studies using exogenous transcripts with well-defined secondary structures in the vicinity of the target sequence. While these studies provide a means for examining the impact of target sequence and structure independently, the predicted secondary structures for these transcripts are often not reflective of structures that form in full-length, native mRNAs where interactions can occur between relatively remote segments of the mRNAs. Here, using a combination of experimental results and analysis of a large dataset, we demonstrate that the accessibility of certain local target structures on the mRNA is an important determinant in the gene silencing ability of siRNAs. siRNAs targeting the enhanced green fluorescent protein were chosen using a minimal siRNA selection algorithm followed by classification based on the predicted minimum free energy structures of the target transcripts. Transfection into HeLa and HepG2 cells revealed that siRNAs targeting regions of the mRNA predicted to have unpaired 5’- and 3’-ends resulted in greater gene silencing than regions predicted to have other types of secondary structure. These results were confirmed by analysis of gene silencing data from previously published siRNAs, which showed that mRNA target regions unpaired at either the 5’-end or 3’-end were silenced, on average, ~10% more strongly than target regions unpaired in the center or primarily paired throughout. We found this effect to be independent of the structure of the siRNA guide strand. Taken together, these results suggest minimal requirements for nucleation of hybridization between the siRNA guide strand and mRNA and that both mRNA and guide strand structure should be considered when choosing candidate siRNAs. PMID:18306428
Superposition of two tRNA{sup Ser} acceptor stem crystal structures: Comparison of structure, ligands and hydration

DOE Office of Scientific and Technical Information (OSTI.GOV)

Eichert, Andre; Fuerste, Jens P.; Ulrich, Alexander

2010-05-07

We solved the X-ray structures of two Escherichia coli tRNA{sup Ser} acceptor stem microhelices. As both tRNAs are aminoacylated by the same seryl-tRNA-synthetase, we performed a comparative structure analysis of both duplexes to investigate the helical conformation, the hydration patterns and magnesium binding sites. It is well accepted, that the hydration of RNA plays an important role in RNA-protein interactions and that the extensive solvent content of the minor groove has a special function in RNA. The detailed comparison of both tRNA{sup Ser} microhelices provides insights into the structural arrangement of the isoacceptor tRNA aminoacyl stems with respect to themore » surrounding water molecules and may eventually help us to understand their biological function at atomic resolution.« less
Structure, Biology, and Therapeutic Application of Toxin-Antitoxin Systems in Pathogenic Bacteria.

PubMed

Lee, Ki-Young; Lee, Bong-Jin

2016-10-22

Bacterial toxin-antitoxin (TA) systems have received increasing attention for their diverse identities, structures, and functional implications in cell cycle arrest and survival against environmental stresses such as nutrient deficiency, antibiotic treatments, and immune system attacks. In this review, we describe the biological functions and the auto-regulatory mechanisms of six different types of TA systems, among which the type II TA system has been most extensively studied. The functions of type II toxins include mRNA/tRNA cleavage, gyrase/ribosome poison, and protein phosphorylation, which can be neutralized by their cognate antitoxins. We mainly explore the similar but divergent structures of type II TA proteins from 12 important pathogenic bacteria, including various aspects of protein-protein interactions. Accumulating knowledge about the structure-function correlation of TA systems from pathogenic bacteria has facilitated a novel strategy to develop antibiotic drugs that target specific pathogens. These molecules could increase the intrinsic activity of the toxin by artificially interfering with the intermolecular network of the TA systems.
Thioflavin T as an efficient fluorescence sensor for selective recognition of RNA G-quadruplexes

NASA Astrophysics Data System (ADS)

Xu, Shujuan; Li, Qian; Xiang, Junfeng; Yang, Qianfan; Sun, Hongxia; Guan, Aijiao; Wang, Lixia; Liu, Yan; Yu, Lijia; Shi, Yunhua; Chen, Hongbo; Tang, Yalin

2016-04-01

RNA G-quadruplexes (G4s) play important roles in translational regulation, mRNA processing events and gene expression. Therefore, a fluorescent probe that is capable of efficiently recognizing RNA G-quadruplex structures among other RNA forms is highly desirable. In this study, a water-soluble fluorogenic dye (i.e., Thioflavin T (ThT)) was employed to recognize RNA G-quadruplex structures using UV-Vis absorption spectra, fluorescence spectra and emission lifetime experiments. By stacking on the G-tetrad, the ThT probe exhibited highly specific recognition of RNA G-quadruplex structures with striking fluorescence enhancement compared with other RNA forms. The specific binding demonstrates that ThT is an efficient fluorescence sensor that can distinguish G4 and non-G4 RNA structures.
Crystal-Structure-Guided Design of Self-Assembling RNA Nanotriangles.

PubMed

Boerneke, Mark A; Dibrov, Sergey M; Hermann, Thomas

2016-03-14

RNA nanotechnology uses RNA structural motifs to build nanosized architectures that assemble through selective base-pair interactions. Herein, we report the crystal-structure-guided design of highly stable RNA nanotriangles that self-assemble cooperatively from short oligonucleotides. The crystal structure of an 81 nucleotide nanotriangle determined at 2.6 Å resolution reveals the so-far smallest circularly closed nanoobject made entirely of double-stranded RNA. The assembly of the nanotriangle architecture involved RNA corner motifs that were derived from ligand-responsive RNA switches, which offer the opportunity to control self-assembly and dissociation. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
On the importance of cotranscriptional RNA structure formation

PubMed Central

Lai, Daniel; Proctor, Jeff R.; Meyer, Irmtraud M.

2013-01-01

The expression of genes, both coding and noncoding, can be significantly influenced by RNA structural features of their corresponding transcripts. There is by now mounting experimental and some theoretical evidence that structure formation in vivo starts during transcription and that this cotranscriptional folding determines the functional RNA structural features that are being formed. Several decades of research in bioinformatics have resulted in a wide range of computational methods for predicting RNA secondary structures. Almost all state-of-the-art methods in terms of prediction accuracy, however, completely ignore the process of structure formation and focus exclusively on the final RNA structure. This review hopes to bridge this gap. We summarize the existing evidence for cotranscriptional folding and then review the different, currently used strategies for RNA secondary-structure prediction. Finally, we propose a range of ideas on how state-of-the-art methods could be potentially improved by explicitly capturing the process of cotranscriptional structure formation. PMID:24131802
Structural Requirement in Clostridium perfringens Collagenase mRNA 5′ Leader Sequence for Translational Induction through Small RNA-mRNA Base Pairing

PubMed Central

Nomura, Nobuhiko; Nakamura, Kouji

2013-01-01

The Gram-positive anaerobic bacterium Clostridium perfringens is pathogenic to humans and animals, and the production of its toxins is strictly regulated during the exponential phase. We recently found that the 5′ leader sequence of the colA transcript encoding collagenase, which is a major toxin of this organism, is processed and stabilized in the presence of the small RNA VR-RNA. The primary colA 5′-untranslated region (5′UTR) forms a long stem-loop structure containing an internal bulge and masks its own ribosomal binding site. Here we found that VR-RNA directly regulates colA expression through base pairing with colA mRNA in vivo. However, when the internal bulge structure was closed by point mutations in colA mRNA, translation ceased despite the presence of VR-RNA. In addition, a mutation disrupting the colA stem-loop structure induced mRNA processing and ColA-FLAG translational activation in the absence of VR-RNA, indicating that the stem-loop and internal bulge structure of the colA 5′ leader sequence is important for regulation by VR-RNA. On the other hand, processing was required for maximal ColA expression but was not essential for VR-RNA-dependent colA regulation. Finally, colA processing and translational activation were induced at a high temperature without VR-RNA. These results suggest that inhibition of the colA 5′ leader structure through base pairing is the primary role of VR-RNA in colA regulation and that the colA 5′ leader structure is a possible thermosensor. PMID:23585542
A computational method for predicting regulation of human microRNAs on the influenza virus genome

PubMed Central

2013-01-01

Background While it has been suggested that host microRNAs (miRNAs) may downregulate viral gene expression as an antiviral defense mechanism, such a mechanism has not been explored in the influenza virus for human flu studies. As it is difficult to conduct related experiments on humans, computational studies can provide some insight. Although many computational tools have been designed for miRNA target prediction, there is a need for cross-species prediction, especially for predicting viral targets of human miRNAs. However, finding putative human miRNAs targeting influenza virus genome is still challenging. Results We developed machine-learning features and conducted comprehensive data training for predicting interactions between H1N1 genome segments and host miRNA. We defined our seed region as the first ten nucleotides from the 5' end of the miRNA to the 3' end of the miRNA and integrated various features including the number of consecutive matching bases in the seed region of 10 bases, a triplet feature in seed regions, thermodynamic energy, penalty of bulges and wobbles at binding sites, and the secondary structure of viral RNA for the prediction. Conclusions Compared to general predictive models, our model fully takes into account the conservation patterns and features of viral RNA secondary structures, and greatly improves the prediction accuracy. Our model identified some key miRNAs including hsa-miR-489, hsa-miR-325, hsa-miR-876-3p and hsa-miR-2117, which target HA, PB2, MP and NS of H1N1, respectively. Our study provided an interesting hypothesis concerning the miRNA-based antiviral defense mechanism against influenza virus in human, i.e., the binding between human miRNA and viral RNAs may not result in gene silencing but rather may block the viral RNA replication. PMID:24565017
RNA chaperoning and intrinsic disorder in the core proteins of Flaviviridae.

PubMed

Ivanyi-Nagy, Roland; Lavergne, Jean-Pierre; Gabus, Caroline; Ficheux, Damien; Darlix, Jean-Luc

2008-02-01

RNA chaperone proteins are essential partners of RNA in living organisms and viruses. They are thought to assist in the correct folding and structural rearrangements of RNA molecules by resolving misfolded RNA species in an ATP-independent manner. RNA chaperoning is probably an entropy-driven process, mediated by the coupled binding and folding of intrinsically disordered protein regions and the kinetically trapped RNA. Previously, we have shown that the core protein of hepatitis C virus (HCV) is a potent RNA chaperone that can drive profound structural modifications of HCV RNA in vitro. We now examined the RNA chaperone activity and the disordered nature of core proteins from different Flaviviridae genera, namely that of HCV, GBV-B (GB virus B), WNV (West Nile virus) and BVDV (bovine viral diarrhoea virus). Despite low-sequence similarities, all four proteins demonstrated general nucleic acid annealing and RNA chaperone activities. Furthermore, heat resistance of core proteins, as well as far-UV circular dichroism spectroscopy suggested that a well-defined 3D protein structure is not necessary for core-induced RNA structural rearrangements. These data provide evidence that RNA chaperoning-possibly mediated by intrinsically disordered protein segments-is conserved in Flaviviridae core proteins. Thus, besides nucleocapsid formation, core proteins may function in RNA structural rearrangements taking place during virus replication.
RNA chaperoning and intrinsic disorder in the core proteins of Flaviviridae

PubMed Central

Ivanyi-Nagy, Roland; Lavergne, Jean-Pierre; Gabus, Caroline; Ficheux, Damien; Darlix, Jean-Luc

2008-01-01

RNA chaperone proteins are essential partners of RNA in living organisms and viruses. They are thought to assist in the correct folding and structural rearrangements of RNA molecules by resolving misfolded RNA species in an ATP-independent manner. RNA chaperoning is probably an entropy-driven process, mediated by the coupled binding and folding of intrinsically disordered protein regions and the kinetically trapped RNA. Previously, we have shown that the core protein of hepatitis C virus (HCV) is a potent RNA chaperone that can drive profound structural modifications of HCV RNA in vitro. We now examined the RNA chaperone activity and the disordered nature of core proteins from different Flaviviridae genera, namely that of HCV, GBV-B (GB virus B), WNV (West Nile virus) and BVDV (bovine viral diarrhoea virus). Despite low-sequence similarities, all four proteins demonstrated general nucleic acid annealing and RNA chaperone activities. Furthermore, heat resistance of core proteins, as well as far-UV circular dichroism spectroscopy suggested that a well-defined 3D protein structure is not necessary for core-induced RNA structural rearrangements. These data provide evidence that RNA chaperoning—possibly mediated by intrinsically disordered protein segments—is conserved in Flaviviridae core proteins. Thus, besides nucleocapsid formation, core proteins may function in RNA structural rearrangements taking place during virus replication. PMID:18033802
Recovery of Infectious Pariacoto Virus from cDNA Clones and Identification of Susceptible Cell Lines

PubMed Central

Johnson, Karyn N.; Ball, L. Andrew

2001-01-01

Pariacoto virus (PaV) is a nodavirus that was recently isolated in Peru from the Southern armyworm, Spodoptera eridania. Virus particles are non enveloped and about 30 nm in diameter and have T=3 icosahedral symmetry. The 3.0-Å crystal structure shows that about 35% of the genomic RNA is icosahedrally ordered, with the RNA forming a dodecahedral cage of 25-nucleotide (nt) duplexes that underlie the inner surface of the capsid. The PaV genome comprises two single-stranded, positive-sense RNAs: RNA1 (3,011 nt), which encodes the 108-kDa catalytic subunit of the RNA-dependent RNA polymerase, and RNA2 (1,311 nt), which encodes the 43-kDa capsid protein precursor α. In order to apply molecular genetics to the structure and assembly of PaV, we identified susceptible cell lines and developed a reverse genetic system for this virus. Cell lines that were susceptible to infection by PaV included those from Spodoptera exigua, Helicoverpa zea and Aedes albopictus, whereas cells from Drosophila melanogaster and Spodoptera frugiperda were refractory to infection. To recover virus from molecular clones, full-length cDNAs of PaV RNAs 1 and 2 were cotranscribed by T7 RNA polymerase in baby hamster kidney cells that expressed T7 RNA polymerase. Lysates of these cells were infectious both for cultured cells from Helicoverpa zea (corn earworm) and for larvae of Galleria mellonella (greater wax moth). The combination of infectious cDNA clones, cell culture infectivity, and the ability to produce milligram amounts of virus allows the application of DNA-based genetic methods to the study of PaV structure and assembly. PMID:11711613
Initiation, extension, and termination of RNA synthesis by a paramyxovirus polymerase.

PubMed

Jordan, Paul C; Liu, Cheng; Raynaud, Pauline; Lo, Michael K; Spiropoulou, Christina F; Symons, Julian A; Beigelman, Leo; Deval, Jerome

2018-02-01

Paramyxoviruses represent a family of RNA viruses causing significant human diseases. These include measles virus, the most infectious virus ever reported, in addition to parainfluenza virus, and other emerging viruses. Paramyxoviruses likely share common replication machinery but their mechanisms of RNA biosynthesis activities and details of their complex polymerase structures are unknown. Mechanistic and functional details of a paramyxovirus polymerase would have sweeping implications for understanding RNA virus replication and for the development of new antiviral medicines. To study paramyxovirus polymerase structure and function, we expressed an active recombinant Nipah virus (NiV) polymerase complex assembled from the multifunctional NiV L protein bound to its phosphoprotein cofactor. NiV is an emerging highly pathogenic virus that causes severe encephalitis and has been declared a global public health concern due to its high mortality rate. Using negative-stain electron microscopy, we demonstrated NiV polymerase forms ring-like particles resembling related RNA polymerases. We identified conserved sequence elements driving recognition of the 3'-terminal genomic promoter by NiV polymerase, and leading to initiation of RNA synthesis, primer extension, and transition to elongation mode. Polyadenylation resulting from NiV polymerase stuttering provides a mechanistic basis for transcription termination. It also suggests a divergent adaptation in promoter recognition between pneumo- and paramyxoviruses. The lack of available antiviral therapy for NiV prompted us to identify the triphosphate forms of R1479 and GS-5734, two clinically relevant nucleotide analogs, as substrates and inhibitors of NiV polymerase activity by delayed chain termination. Overall, these findings provide low-resolution structural details and the mechanism of an RNA polymerase from a previously uncharacterized virus family. This work illustrates important functional differences yet remarkable similarities between the polymerases of nonsegmented negative-strand RNA viruses.
Recovery of infectious pariacoto virus from cDNA clones and identification of susceptible cell lines.

PubMed

Johnson, K N; Ball, L A

2001-12-01

Pariacoto virus (PaV) is a nodavirus that was recently isolated in Peru from the Southern armyworm, Spodoptera eridania. Virus particles are non enveloped and about 30 nm in diameter and have T=3 icosahedral symmetry. The 3.0-A crystal structure shows that about 35% of the genomic RNA is icosahedrally ordered, with the RNA forming a dodecahedral cage of 25-nucleotide (nt) duplexes that underlie the inner surface of the capsid. The PaV genome comprises two single-stranded, positive-sense RNAs: RNA1 (3,011 nt), which encodes the 108-kDa catalytic subunit of the RNA-dependent RNA polymerase, and RNA2 (1,311 nt), which encodes the 43-kDa capsid protein precursor alpha. In order to apply molecular genetics to the structure and assembly of PaV, we identified susceptible cell lines and developed a reverse genetic system for this virus. Cell lines that were susceptible to infection by PaV included those from Spodoptera exigua, Helicoverpa zea and Aedes albopictus, whereas cells from Drosophila melanogaster and Spodoptera frugiperda were refractory to infection. To recover virus from molecular clones, full-length cDNAs of PaV RNAs 1 and 2 were cotranscribed by T7 RNA polymerase in baby hamster kidney cells that expressed T7 RNA polymerase. Lysates of these cells were infectious both for cultured cells from Helicoverpa zea (corn earworm) and for larvae of Galleria mellonella (greater wax moth). The combination of infectious cDNA clones, cell culture infectivity, and the ability to produce milligram amounts of virus allows the application of DNA-based genetic methods to the study of PaV structure and assembly.
A Method to Predict the Structure and Stability of RNA/RNA Complexes.

PubMed

Xu, Xiaojun; Chen, Shi-Jie

2016-01-01

RNA/RNA interactions are essential for genomic RNA dimerization and regulation of gene expression. Intermolecular loop-loop base pairing is a widespread and functionally important tertiary structure motif in RNA machinery. However, computational prediction of intermolecular loop-loop base pairing is challenged by the entropy and free energy calculation due to the conformational constraint and the intermolecular interactions. In this chapter, we describe a recently developed statistical mechanics-based method for the prediction of RNA/RNA complex structures and stabilities. The method is based on the virtual bond RNA folding model (Vfold). The main emphasis in the method is placed on the evaluation of the entropy and free energy for the loops, especially tertiary kissing loops. The method also uses recursive partition function calculations and two-step screening algorithm for large, complicated structures of RNA/RNA complexes. As case studies, we use the HIV-1 Mal dimer and the siRNA/HIV-1 mutant (T4) to illustrate the method.
Synthesizing topological structures containing RNA

NASA Astrophysics Data System (ADS)

Liu, Di; Shao, Yaming; Chen, Gang; Tse-Dinh, Yuk-Ching; Piccirilli, Joseph A.; Weizmann, Yossi

2017-03-01

Though knotting and entanglement have been observed in DNA and proteins, their existence in RNA remains an enigma. Synthetic RNA topological structures are significant for understanding the physical and biological properties pertaining to RNA topology, and these properties in turn could facilitate identifying naturally occurring topologically nontrivial RNA molecules. Here we show that topological structures containing single-stranded RNA (ssRNA) free of strong base pairing interactions can be created either by configuring RNA-DNA hybrid four-way junctions or by template-directed synthesis with a single-stranded DNA (ssDNA) topological structure. By using a constructed ssRNA knot as a highly sensitive topological probe, we find that Escherichia coli DNA topoisomerase I has low RNA topoisomerase activity and that the R173A point mutation abolishes the unknotting activity for ssRNA, but not for ssDNA. Furthermore, we discover the topological inhibition of reverse transcription (RT) and obtain different RT-PCR patterns for an ssRNA knot and circle of the same sequence.
The DEAD-Box Protein CYT-19 Uses Arginine Residues in Its C-Tail To Tether RNA Substrates.

PubMed

Busa, Veronica F; Rector, Maxwell J; Russell, Rick

2017-07-18

DEAD-box proteins are nonprocessive RNA helicases that play diverse roles in cellular processes. The Neurospora crassa DEAD-box protein CYT-19 promotes mitochondrial group I intron splicing and functions as a general RNA chaperone. CYT-19 includes a disordered, arginine-rich "C-tail" that binds RNA, positioning the helicase core to capture and unwind nearby RNA helices. Here we probed the C-tail further by varying the number and positions of arginines within it. We found that removing sets of as few as four of the 11 arginines reduced RNA unwinding activity (k cat /K M ) to a degree equivalent to that seen upon removal of the C-tail, suggesting that a minimum or "threshold" number of arginines is required. In addition, a mutant with 16 arginines displayed RNA unwinding activity greater than that of wild-type CYT-19. The C-tail modifications impacted unwinding only of RNA helices within constructs that included an adjacent helix or structured RNA element that would allow C-tail binding, indicating that the helicase core remained active in the mutants. In addition, changes in RNA unwinding efficiency of the mutants were mirrored by changes in functional RNA affinity, as determined from the RNA concentration dependence of ATPase activity, suggesting that the C-tail functions primarily to increase RNA affinity. Interestingly, the salt concentration dependence of RNA unwinding activity is unaffected by C-tail composition, suggesting that the C-tail uses primarily hydrogen bonding, not electrostatic interactions, to bind double-stranded RNA. Our results provide insights into how an unstructured C-tail contributes to DEAD-box protein activity and suggest parallels with other families of RNA- and DNA-binding proteins.
RNA Secondary Structure Prediction by Using Discrete Mathematics: An Interdisciplinary Research Experience for Undergraduate Students

ERIC Educational Resources Information Center

Ellington, Roni; Wachira, James; Nkwanta, Asamoah

2010-01-01

The focus of this Research Experience for Undergraduates (REU) project was on RNA secondary structure prediction by using a lattice walk approach. The lattice walk approach is a combinatorial and computational biology method used to enumerate possible secondary structures and predict RNA secondary structure from RNA sequences. The method uses…
Peptides Used in the Delivery of Small Noncoding RNA

PubMed Central

2015-01-01

RNA interference (RNAi) is an endogenous process in which small noncoding RNAs, including small interfering RNAs (siRNAs) and microRNAs (miRNAs), post-transcriptionally regulate gene expressions. In general, siRNA and miRNA/miRNA mimics are similar in nature and activity except their origin and specificity. Although both siRNAs and miRNAs have been extensively studied as novel therapeutics for a wide range of diseases, the large molecular weight, anionic surface charges, instability in blood circulation, and intracellular trafficking to the RISC after cellular uptake have hindered the translation of these RNAs from bench to clinic. As a result, a great variety of delivery systems have been investigated for safe and effective delivery of small noncoding RNAs. Among these systems, peptides, especially cationic peptides, have emerged as a promising type of carrier due to their inherent ability to condense negatively charged RNAs, ease of synthesis, controllable size, and tunable structure. In this review, we will focus on three major types of cationic peptides, including poly(l-lysine) (PLL), protamine, and cell penetrating peptides (CPP), as well as peptide targeting ligands that have been extensively used in RNA delivery. The delivery strategies, applications, and limitations of these cationic peptides in siRNA/miRNA delivery will be discussed. PMID:25157701
A new version of the RDP (Ribosomal Database Project)

NASA Technical Reports Server (NTRS)

Maidak, B. L.; Cole, J. R.; Parker, C. T. Jr; Garrity, G. M.; Larsen, N.; Li, B.; Lilburn, T. G.; McCaughey, M. J.; Olsen, G. J.; Overbeek, R.;

1999-01-01

The Ribosomal Database Project (RDP-II), previously described by Maidak et al. [ Nucleic Acids Res. (1997), 25, 109-111], is now hosted by the Center for Microbial Ecology at Michigan State University. RDP-II is a curated database that offers ribosomal RNA (rRNA) nucleotide sequence data in aligned and unaligned forms, analysis services, and associated computer programs. During the past two years, data alignments have been updated and now include >9700 small subunit rRNA sequences. The recent development of an ObjectStore database will provide more rapid updating of data, better data accuracy and increased user access. RDP-II includes phylogenetically ordered alignments of rRNA sequences, derived phylogenetic trees, rRNA secondary structure diagrams, and various software programs for handling, analyzing and displaying alignments and trees. The data are available via anonymous ftp (ftp.cme.msu. edu) and WWW (http://www.cme.msu.edu/RDP). The WWW server provides ribosomal probe checking, approximate phylogenetic placement of user-submitted sequences, screening for possible chimeric rRNA sequences, automated alignment, and a suggested placement of an unknown sequence on an existing phylogenetic tree. Additional utilities also exist at RDP-II, including distance matrix, T-RFLP, and a Java-based viewer of the phylogenetic trees that can be used to create subtrees.

Exosomes as nanocarriers for siRNA delivery: paradigms and challenges.

PubMed

Shahabipour, Fahimeh; Banach, Maciej; Sahebkar, Amirhossein

2016-12-01

Exosomes are nano-sized vesicles that facilitate intercellular communications through carrying genetic materials and functional biomolecules. Owing to their unique size and structure, exosomes have emerged as a useful tool to overcome the limitations of siRNA delivery. The use of exosomes as siRNA delivery vehicles lacks certain disadvantages of the existing foreign delivery systems such as viruses, polycationic polymers and liposomes, and introduces several advantages including inherent capacity to pass through biological barriers and escape from phagocytosis by the reticuloendothelial system, as well as being biocompatible, non-toxic, and immunologically inert. Different strategies have been employed to harness exosome-based delivery systems, including surface modification with targeting ligands, and using exosome-display technology, virus-modified exosomes, and exosome-mimetic vesicles. The present review provides a capsule summary of the recent advances and current challenges in the field of exosome-mediated siRNA delivery.
Selective Degradation of Host RNA Polymerase II Transcripts by Influenza A Virus PA-X Host Shutoff Protein

PubMed Central

Larkins-Ford, Jonah; McCormick, Craig; Gaglia, Marta M.

2016-01-01

Influenza A viruses (IAVs) inhibit host gene expression by a process known as host shutoff. Host shutoff limits host innate immune responses and may also redirect the translation apparatus to the production of viral proteins. Multiple IAV proteins regulate host shutoff, including PA-X, a ribonuclease that remains incompletely characterized. We report that PA-X selectively targets host RNA polymerase II (Pol II) transcribed mRNAs, while sparing products of Pol I and Pol III. Interestingly, we show that PA-X can also target Pol II-transcribed RNAs in the nucleus, including non-coding RNAs that are not destined to be translated, and reporter transcripts with RNA hairpin structures that block ribosome loading. Transcript degradation likely occurs in the nucleus, as PA-X is enriched in the nucleus and its nuclear localization correlates with reduction in target RNA levels. Complete degradation of host mRNAs following PA-X-mediated endonucleolytic cleavage is dependent on the host 5’->3’-exonuclease Xrn1. IAV mRNAs are structurally similar to host mRNAs, but are synthesized and modified at the 3’ end by the action of the viral RNA-dependent RNA polymerase complex. Infection of cells with wild-type IAV or a recombinant PA-X-deficient virus revealed that IAV mRNAs resist PA-X-mediated degradation during infection. At the same time, loss of PA-X resulted in changes in the synthesis of select viral mRNAs and a decrease in viral protein accumulation. Collectively, these results significantly advance our understanding of IAV host shutoff, and suggest that the PA-X causes selective degradation of host mRNAs by discriminating some aspect of Pol II-dependent RNA biogenesis in the nucleus. PMID:26849127
A folded viral noncoding RNA blocks host cell exoribonucleases through a conformationally dynamic RNA structure.

PubMed

Steckelberg, Anna-Lena; Akiyama, Benjamin M; Costantino, David A; Sit, Tim L; Nix, Jay C; Kieft, Jeffrey S

2018-06-19

Folded RNA elements that block processive 5' → 3' cellular exoribonucleases (xrRNAs) to produce biologically active viral noncoding RNAs have been discovered in flaviviruses, potentially revealing a new mode of RNA maturation. However, whether this RNA structure-dependent mechanism exists elsewhere and, if so, whether a singular RNA fold is required, have been unclear. Here we demonstrate the existence of authentic RNA structure-dependent xrRNAs in dianthoviruses, plant-infecting viruses unrelated to animal-infecting flaviviruses. These xrRNAs have no sequence similarity to known xrRNAs; thus, we used a combination of biochemistry and virology to characterize their sequence requirements and mechanism of stopping exoribonucleases. By solving the structure of a dianthovirus xrRNA by X-ray crystallography, we reveal a complex fold that is very different from that of the flavivirus xrRNAs. However, both versions of xrRNAs contain a unique topological feature, a pseudoknot that creates a protective ring around the 5' end of the RNA structure; this may be a defining structural feature of xrRNAs. Single-molecule FRET experiments reveal that the dianthovirus xrRNAs undergo conformational changes and can use "codegradational remodeling," exploiting the exoribonucleases' degradation-linked helicase activity to help form their resistant structure; such a mechanism has not previously been reported. Convergent evolution has created RNA structure-dependent exoribonuclease resistance in different contexts, which establishes it as a general RNA maturation mechanism and defines xrRNAs as an authentic functional class of RNAs.
R2R--software to speed the depiction of aesthetic consensus RNA secondary structures.

PubMed

Weinberg, Zasha; Breaker, Ronald R

2011-01-04

With continuing identification of novel structured noncoding RNAs, there is an increasing need to create schematic diagrams showing the consensus features of these molecules. RNA structural diagrams are typically made either with general-purpose drawing programs like Adobe Illustrator, or with automated or interactive programs specific to RNA. Unfortunately, the use of applications like Illustrator is extremely time consuming, while existing RNA-specific programs produce figures that are useful, but usually not of the same aesthetic quality as those produced at great cost in Illustrator. Additionally, most existing RNA-specific applications are designed for drawing single RNA molecules, not consensus diagrams. We created R2R, a computer program that facilitates the generation of aesthetic and readable drawings of RNA consensus diagrams in a fraction of the time required with general-purpose drawing programs. Since the inference of a consensus RNA structure typically requires a multiple-sequence alignment, the R2R user annotates the alignment with commands directing the layout and annotation of the RNA. R2R creates SVG or PDF output that can be imported into Adobe Illustrator, Inkscape or CorelDRAW. R2R can be used to create consensus sequence and secondary structure models for novel RNA structures or to revise models when new representatives for known RNA classes become available. Although R2R does not currently have a graphical user interface, it has proven useful in our efforts to create 100 schematic models of distinct noncoding RNA classes. R2R makes it possible to obtain high-quality drawings of the consensus sequence and structural models of many diverse RNA structures with a more practical amount of effort. R2R software is available at http://breaker.research.yale.edu/R2R and as an Additional file.
Functions of the 3′ and 5′ genome RNA regions of members of the genus Flavivirus

PubMed Central

Brinton, Margo A.; Basu, Mausumi

2015-01-01

The positive sense genomes of members of the genus Flavivirus in the family Flaviviridae are ~11 kb nts in length and have a 5′ type I cap but no 3′ poly A. The 5′ and 3′ terminal regions contain short conserved sequences that are proposed to be repeated remnants of an ancient sequence. However, the functions of most of these conserved sequences have not yet been determined. The terminal regions of the genome also contain multiple conserved RNA structures. Functional data for many of these structures has been obtained. Three sets of complementary 3′ and 5′ terminal region sequences, some of which are located in conserved RNA structures, interact to form a panhandle structure that is required for initiation of minus strand RNA synthesis with the 5′ terminal structure functioning as the promoter. How the switch from the terminal RNA structure base pairing to the long distance RNA-RNA interaction is triggered and regulated is not well understood but evidence suggests involvement of a cell protein binding to three sites on the 3′ terminal RNA structures and a cis-acting metastable 3′ RNA element in the 3′ terminal structure. Cell proteins may also be involved in facilitating exponential replication of nascent genomic RNA within replication vesicles at later times of infection cycle. Other conserved RNA structures and/or sequences in the 5′ and 3′ terminal regions have been proposed to regulate genome translation. Additional functions of the 5′ and 3′ terminal sequences have also been reported. PMID:25683510
Direct Duplex Detection: An Emerging Tool in the RNA Structure Analysis Toolbox.

PubMed

Weidmann, Chase A; Mustoe, Anthony M; Weeks, Kevin M

2016-09-01

While a variety of powerful tools exists for analyzing RNA structure, identifying long-range and intermolecular base-pairing interactions has remained challenging. Recently, three groups introduced a high-throughput strategy that uses psoralen-mediated crosslinking to directly identify RNA-RNA duplexes in cells. Initial application of these methods highlights the preponderance of long-range structures within and between RNA molecules and their widespread structural dynamics. Copyright © 2016 Elsevier Ltd. All rights reserved.
Structures of riboswitch RNA reaction states by mix-and-inject XFEL serial crystallography

NASA Astrophysics Data System (ADS)

Stagno, J. R.; Liu, Y.; Bhandari, Y. R.; Conrad, C. E.; Panja, S.; Swain, M.; Fan, L.; Nelson, G.; Li, C.; Wendel, D. R.; White, T. A.; Coe, J. D.; Wiedorn, M. O.; Knoska, J.; Oberthuer, D.; Tuckey, R. A.; Yu, P.; Dyba, M.; Tarasov, S. G.; Weierstall, U.; Grant, T. D.; Schwieters, C. D.; Zhang, J.; Ferré-D'Amaré, A. R.; Fromme, P.; Draper, D. E.; Liang, M.; Hunter, M. S.; Boutet, S.; Tan, K.; Zuo, X.; Ji, X.; Barty, A.; Zatsepin, N. A.; Chapman, H. N.; Spence, J. C. H.; Woodson, S. A.; Wang, Y.-X.

2017-01-01

Riboswitches are structural RNA elements that are generally located in the 5‧ untranslated region of messenger RNA. During regulation of gene expression, ligand binding to the aptamer domain of a riboswitch triggers a signal to the downstream expression platform. A complete understanding of the structural basis of this mechanism requires the ability to study structural changes over time. Here we use femtosecond X-ray free electron laser (XFEL) pulses to obtain structural measurements from crystals so small that diffusion of a ligand can be timed to initiate a reaction before diffraction. We demonstrate this approach by determining four structures of the adenine riboswitch aptamer domain during the course of a reaction, involving two unbound apo structures, one ligand-bound intermediate, and the final ligand-bound conformation. These structures support a reaction mechanism model with at least four states and illustrate the structural basis of signal transmission. The three-way junction and the P1 switch helix of the two apo conformers are notably different from those in the ligand-bound conformation. Our time-resolved crystallographic measurements with a 10-second delay captured the structure of an intermediate with changes in the binding pocket that accommodate the ligand. With at least a 10-minute delay, the RNA molecules were fully converted to the ligand-bound state, in which the substantial conformational changes resulted in conversion of the space group. Such notable changes in crystallo highlight the important opportunities that micro- and nanocrystals may offer in these and similar time-resolved diffraction studies. Together, these results demonstrate the potential of ‘mix-and-inject’ time-resolved serial crystallography to study biochemically important interactions between biomacromolecules and ligands, including those that involve large conformational changes.
Structures of riboswitch RNA reaction states by mix-and-inject XFEL serial crystallography

PubMed Central

Stagno, J. R.; Liu, Y.; Bhandari, Y. R.; Conrad, C. E.; Panja, S.; Swain, M.; Fan, L.; Nelson, G.; Li, C.; Wendel, D. R.; White, T. A.; Coe, J. D.; Wiedorn, M. O.; Knoska, J.; Oberthuer, D.; Tuckey, R. A.; Yu, P.; Dyba, M.; Tarasov, S. G.; Weierstall, U.; Grant, T. D.; Schwieters, C. D.; Zhang, J.; Ferré-D’Amaré, A. R.; Fromme, P.; Draper, D. E.; Liang, M.; Hunter, M. S.; Boutet, S.; Tan, K.; Zuo, X.; Ji, X.; Barty, A.; Zatsepin, N. A.; Chapman, H. N.; Spence, J. C. H.; Woodson, S. A.; Wang, Y.-X.

2017-01-01

Riboswitches are structural RNA elements that are generally located in the 5′ untranslated region of messenger RNA. During regulation of gene expression, ligand binding to the aptamer domain of a riboswitch triggers a signal to the downstream expression platform1–3. A complete understanding of the structural basis of this mechanism requires the ability to study structural changes over time4. Here we use femtosecond X-ray free electron laser (XFEL) pulses5,6 to obtain structural measurements from crystals so small that diffusion of a ligand can be timed to initiate a reaction before diffraction. We demonstrate this approach by determining four structures of the adenine riboswitch aptamer domain during the course of a reaction, involving two unbound apo structures, one ligand-bound intermediate, and the final ligand-bound conformation. These structures support a reaction mechanism model with at least four states and illustrate the structural basis of signal transmission. The three-way junction and the P1 switch helix of the two apo conformers are notably different from those in the ligand-bound conformation. Our time-resolved crystallographic measurements with a 10-second delay captured the structure of an intermediate with changes in the binding pocket that accommodate the ligand. With at least a 10-minute delay, the RNA molecules were fully converted to the ligand-bound state, in which the substantial conformational changes resulted in conversion of the space group. Such notable changes in crystallo highlight the important opportunities that micro- and nanocrystals may offer in these and similar time-resolved diffraction studies. Together, these results demonstrate the potential of ‘mix-and-inject’ time-resolved serial crystallography to study biochemically important interactions between biomacromolecules and ligands, including those that involve large conformational changes. PMID:27841871
Cryptic tRNAs in chaetognath mitochondrial genomes.

PubMed

Barthélémy, Roxane-Marie; Seligmann, Hervé

2016-06-01

The chaetognaths constitute a small and enigmatic phylum of little marine invertebrates. Both nuclear and mitochondrial genomes have numerous originalities, some phylum-specific. Until recently, their mitogenomes seemed containing only one tRNA gene (trnMet), but a recent study found in two chaetognath mitogenomes two and four tRNA genes. Moreover, apparently two conspecific mitogenomes have different tRNA gene numbers (one and two). Reanalyses by tRNAscan-SE and ARWEN softwares of the five available complete chaetognath mitogenomes suggest numerous additional tRNA genes from different types. Their total number never reaches the 22 found in most other invertebrates using that genetic code. Predicted error compensation between codon-anticodon mismatch and tRNA misacylation suggests translational activity by tRNAs predicted solely according to secondary structure for tRNAs predicted by tRNAscan-SE, not ARWEN. Numbers of predicted stop-suppressor (antitermination) tRNAs coevolve with predicted overlapping, frameshifted protein coding genes including stop codons. Sequence alignments in secondary structure prediction with non-chaetognath tRNAs suggest that the most likely functional tRNAs are in intergenic regions, as regular mt-tRNAs. Due to usually short intergenic regions, generally tRNA sequences partially overlap with flanking genes. Some tRNA pairs seem templated by sense-antisense strands. Moreover, 16S rRNA genes, but not 12S rRNAs, appear as tRNA nurseries, as previously suggested for multifunctional ribosomal-like protogenomes. Copyright © 2016 Elsevier Ltd. All rights reserved.
Free energy minimization to predict RNA secondary structures and computational RNA design.

PubMed

Churkin, Alexander; Weinbrand, Lina; Barash, Danny

2015-01-01

Determining the RNA secondary structure from sequence data by computational predictions is a long-standing problem. Its solution has been approached in two distinctive ways. If a multiple sequence alignment of a collection of homologous sequences is available, the comparative method uses phylogeny to determine conserved base pairs that are more likely to form as a result of billions of years of evolution than by chance. In the case of single sequences, recursive algorithms that compute free energy structures by using empirically derived energy parameters have been developed. This latter approach of RNA folding prediction by energy minimization is widely used to predict RNA secondary structure from sequence. For a significant number of RNA molecules, the secondary structure of the RNA molecule is indicative of its function and its computational prediction by minimizing its free energy is important for its functional analysis. A general method for free energy minimization to predict RNA secondary structures is dynamic programming, although other optimization methods have been developed as well along with empirically derived energy parameters. In this chapter, we introduce and illustrate by examples the approach of free energy minimization to predict RNA secondary structures.
De novo discovery of structural motifs in RNA 3D structures through clustering.

PubMed

Ge, Ping; Islam, Shahidul; Zhong, Cuncong; Zhang, Shaojie

2018-05-18

As functional components in three-dimensional (3D) conformation of an RNA, the RNA structural motifs provide an easy way to associate the molecular architectures with their biological mechanisms. In the past years, many computational tools have been developed to search motif instances by using the existing knowledge of well-studied families. Recently, with the rapidly increasing number of resolved RNA 3D structures, there is an urgent need to discover novel motifs with the newly presented information. In this work, we classify all the loops in non-redundant RNA 3D structures to detect plausible RNA structural motif families by using a clustering pipeline. Compared with other clustering approaches, our method has two benefits: first, the underlying alignment algorithm is tolerant to the variations in 3D structures. Second, sophisticated downstream analysis has been performed to ensure the clusters are valid and easily applied to further research. The final clustering results contain many interesting new variants of known motif families, such as GNAA tetraloop, kink-turn, sarcin-ricin and T-loop. We have also discovered potential novel functional motifs conserved in ribosomal RNA, sgRNA, SRP RNA, riboswitch and ribozyme.
MCTBI: a web server for predicting metal ion effects in RNA structures.

PubMed

Sun, Li-Zhen; Zhang, Jing-Xiang; Chen, Shi-Jie

2017-08-01

Metal ions play critical roles in RNA structure and function. However, web servers and software packages for predicting ion effects in RNA structures are notably scarce. Furthermore, the existing web servers and software packages mainly neglect ion correlation and fluctuation effects, which are potentially important for RNAs. We here report a new web server, the MCTBI server (http://rna.physics.missouri.edu/MCTBI), for the prediction of ion effects for RNA structures. This server is based on the recently developed MCTBI, a model that can account for ion correlation and fluctuation effects for nucleic acid structures and can provide improved predictions for the effects of metal ions, especially for multivalent ions such as Mg 2+ effects, as shown by extensive theory-experiment test results. The MCTBI web server predicts metal ion binding fractions, the most probable bound ion distribution, the electrostatic free energy of the system, and the free energy components. The results provide mechanistic insights into the role of metal ions in RNA structure formation and folding stability, which is important for understanding RNA functions and the rational design of RNA structures. © 2017 Sun et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Small-angle X-ray solution scattering study of the multi-aminoacyl-tRNA synthetase complex reveals an elongated and multi-armed particle.

PubMed

Dias, José; Renault, Louis; Pérez, Javier; Mirande, Marc

2013-08-16

In animal cells, nine aminoacyl-tRNA synthetases are associated with the three auxiliary proteins p18, p38, and p43 to form a stable and conserved large multi-aminoacyl-tRNA synthetase complex (MARS), whose molecular mass has been proposed to be between 1.0 and 1.5 MDa. The complex acts as a molecular hub for coordinating protein synthesis and diverse regulatory signal pathways. Electron microscopy studies defined its low resolution molecular envelope as an overall rather compact, asymmetric triangular shape. Here, we have analyzed the composition and homogeneity of the native mammalian MARS isolated from rabbit liver and characterized its overall internal structure, size, and shape at low resolution by hydrodynamic methods and small-angle x-ray scattering in solution. Our data reveal that the MARS exhibits a much more elongated and multi-armed shape than expected from previous reports. The hydrodynamic and structural features of the MARS are large compared with other supramolecular assemblies involved in translation, including ribosome. The large dimensions and non-compact structural organization of MARS favor a large protein surface accessibility for all its components. This may be essential to allow structural rearrangements between the catalytic and cis-acting tRNA binding domains of the synthetases required for binding the bulky tRNA substrates. This non-compact architecture may also contribute to the spatiotemporal controlled release of some of its components, which participate in non-canonical functions after dissociation from the complex.
Predicting 3D structure and stability of RNA pseudoknots in monovalent and divalent ion solutions.

PubMed

Shi, Ya-Zhou; Jin, Lei; Feng, Chen-Jie; Tan, Ya-Lan; Tan, Zhi-Jie

2018-06-01

RNA pseudoknots are a kind of minimal RNA tertiary structural motifs, and their three-dimensional (3D) structures and stability play essential roles in a variety of biological functions. Therefore, to predict 3D structures and stability of RNA pseudoknots is essential for understanding their functions. In the work, we employed our previously developed coarse-grained model with implicit salt to make extensive predictions and comprehensive analyses on the 3D structures and stability for RNA pseudoknots in monovalent/divalent ion solutions. The comparisons with available experimental data show that our model can successfully predict the 3D structures of RNA pseudoknots from their sequences, and can also make reliable predictions for the stability of RNA pseudoknots with different lengths and sequences over a wide range of monovalent/divalent ion concentrations. Furthermore, we made comprehensive analyses on the unfolding pathway for various RNA pseudoknots in ion solutions. Our analyses for extensive pseudokonts and the wide range of monovalent/divalent ion concentrations verify that the unfolding pathway of RNA pseudoknots is mainly dependent on the relative stability of unfolded intermediate states, and show that the unfolding pathway of RNA pseudoknots can be significantly modulated by their sequences and solution ion conditions.
Ribosomal incorporation of backbone modified amino acids via an editing-deficient aminoacyl-tRNA synthetase.

PubMed

Iqbal, Emil S; Dods, Kara K; Hartman, Matthew C T

2018-02-14

The ability to incorporate non-canonical amino acids (ncAA) using translation offers researchers the ability to extend the functionality of proteins and peptides for many applications including synthetic biology, biophysical and structural studies, and discovery of novel ligands. Here we describe the high promiscuity of an editing-deficient valine-tRNA synthetase (ValRS T222P). Using this enzyme, we demonstrate ribosomal translation of 11 ncAAs including those with novel side chains, α,α-disubstitutions, and cyclic β-amino acids.
Structure of T7 RNA polymerase complexed to the transcriptional inhibitor T7 lysozyme.

PubMed Central

Jeruzalmi, D; Steitz, T A

1998-01-01

The T7 RNA polymerase-T7 lysozyme complex regulates phage gene expression during infection of Escherichia coli. The 2.8 A crystal structure of the complex reveals that lysozyme binds at a site remote from the polymerase active site, suggesting an indirect mechanism of inhibition. Comparison of the T7 RNA polymerase structure with that of the homologous pol I family of DNA polymerases reveals identities in the catalytic site but also differences specific to RNA polymerase function. The structure of T7 RNA polymerase presented here differs significantly from a previously published structure. Sequence similarities between phage RNA polymerases and those from mitochondria and chloroplasts, when interpreted in the context of our revised model of T7 RNA polymerase, suggest a conserved fold. PMID:9670025
Intracellular Virus-Specific Structures and RNAs in Oncornavirus-Producing Human Cells

PubMed Central

Bukrinskaya, A. G.; Miller, G. G.; Lebedeva, E. N.; Zhdanov, V. M.

1974-01-01

Two kinds of virus-specific structures were isolated from the cytoplasm of Detroit-6 and human amnion cells producing oncornavirus-like particles. These structures represented A particles with the diameter of 70 to 80 nm and aggregated strands of nucleocapsids with the diameter of 3 and 6 nm. The structures were separated from cellular contaminants by isopycnic banding in linear sucrose gradients and subsequently further purified by sedimentation in velocity sucrose gradients. Their sedimentation coefficient was 250 and 150S, respectively. Both structures contain 60, 45, and 35S RNA species, and 150S structures also contained 20S RNA. The 35 and 20S RNA from the 150S structure formed hybrids with DNA enzymatically synthesized on extracellular virions. The structures displayed endogeneous polymerase activity, DNA product of the reaction being predominantly associated with 60S RNA. No 70S RNA was found in the cell structures of various densities. Also, the virions purified from tissue culture fluid contained 70S RNA. These findings are consistent with those on extracellular maturation of oncornavirus RNA. Images PMID:4810779
Hepatitis E: Molecular Virology and Pathogenesis

PubMed Central

Panda, Subrat K.; Varma, Satya P.K.

2013-01-01

Hepatitis E virus is a single, positive-sense, capped and poly A tailed RNA virus classified under the family Hepeviridae. Enteric transmission, acute self-limiting hepatitis, frequent epidemic and sporadic occurrence, high mortality in affected pregnants are hallmarks of hepatitis E infection. Lack of an efficient culture system and resulting reductionist approaches for the study of replication and pathogenesis of HEV made it to be a less understood agent. Early studies on animal models, sub-genomic expression of open reading frames (ORF) and infectious cDNA clones have helped in elucidating the genome organization, important stages in HEV replication and pathogenesis. The genome contains three ORF's and three untranslated regions (UTR). The 5′ distal ORF, ORF1 is translated by host ribosomes in a cap dependent manner to form the non-structural polyprotein including the viral replicase. HEV replicates via a negative-sense RNA intermediate which helps in the formation of the positive-sense genomic RNA and a single bi-cistronic sub-genomic RNA. The 3′ distal ORF's including the major structural protein pORF2 and the multifunctional host interacting protein pORF3 are translated from the sub-genomic RNA. Pathogenesis in HEV infections is not well articulated, and remains a concern due to the many aspects like host dependent and genotype specific variations. Animal HEV, zoonosis, chronicity in immunosuppressed patients, and rapid decompensation in affected chronic liver diseased patients warrants detailed investigation of the underlying pathogenesis. Recent advances about structure, entry, egress and functional characterization of ORF1 domains has furthered our understanding about HEV. This article is an effort to review our present understanding about molecular biology and pathogenesis of HEV. PMID:25755485
High-resolution structure of the Escherichia coli ribosome

DOE Office of Scientific and Technical Information (OSTI.GOV)

Noeske, Jonas; Wasserman, Michael R.; Terry, Daniel S.

Protein synthesis by the ribosome is highly dependent on the ionic conditions in the cellular environment, but the roles of ribosome solvation remain poorly understood. Moreover, the function of modifications to ribosomal RNA and ribosomal proteins are unclear. Here we present the structure of the Escherichia coli 70S ribosome to 2.4 Å resolution. The structure reveals details of the ribosomal subunit interface that are conserved in all domains of life, and suggest how solvation contributes to ribosome integrity and function. The structure also suggests how the conformation of ribosomal protein uS12 likely impacts its contribution to messenger RNA decoding. Inmore » conclusion, this structure helps to explain the phylogenetic conservation of key elements of the ribosome, including posttranscriptional and posttranslational modifications and should serve as a basis for future antibiotic development.« less
High-resolution structure of the Escherichia coli ribosome

DOE PAGES

Noeske, Jonas; Wasserman, Michael R.; Terry, Daniel S.; ...

2015-03-16

Protein synthesis by the ribosome is highly dependent on the ionic conditions in the cellular environment, but the roles of ribosome solvation remain poorly understood. Moreover, the function of modifications to ribosomal RNA and ribosomal proteins are unclear. Here we present the structure of the Escherichia coli 70S ribosome to 2.4 Å resolution. The structure reveals details of the ribosomal subunit interface that are conserved in all domains of life, and suggest how solvation contributes to ribosome integrity and function. The structure also suggests how the conformation of ribosomal protein uS12 likely impacts its contribution to messenger RNA decoding. Inmore » conclusion, this structure helps to explain the phylogenetic conservation of key elements of the ribosome, including posttranscriptional and posttranslational modifications and should serve as a basis for future antibiotic development.« less

The importance of mRNA structure in determining the pathogenicity of synonymous and non-synonymous mutations in haemophilia

PubMed Central

Hamasaki-Katagiri, Nobuko; Lin, Brian C.; Simon, Jonathan; Hunt, Ryan C.; Schiller, Tal; Russek-Cohen, Estelle; Komar, Anton A.; Bar, Haim; Kimchi-Sarfaty, Chava

2016-01-01

Introduction Mutational analysis is commonly used to support the diagnosis and management of haemophilia. This has allowed for the generation of large mutation databases which provide unparalleled insight into genotype-phenotype relationships. Haemophilia is associated with inversions, deletions, insertions, nonsense and missense mutations. Both synonymous and non-synonymous mutations influence the base pairing of messenger RNA (mRNA), which can alter mRNA structure, cellular half-life and ribosome processivity/elongation. However, the role of mRNA structure in determining the pathogenicity of point mutations in haemophilia has not been evaluated. Aim To evaluate mRNA thermodynamic stability and associated RNA prediction software as a means to distinguish between neutral and disease-associated mutations in haemophilia. Methods Five mRNA structure prediction software programs were used to assess the thermodynamic stability of mRNA fragments carrying neutral vs. disease-associated and synonymous vs. non-synonymous point mutations in F8, F9 and a third X-linked gene, DMD (dystrophin). Results In F8 and DMD, disease-associated mutations tend to occur in more structurally stable mRNA regions, represented by lower MFE (minimum free energy) levels. In comparing multiple software packages for mRNA structure prediction, a 101–151 nucleotide fragment length appears to be a feasible range for structuring future studies. Conclusion mRNA thermodynamic stability is one predictive characteristic, which when combined with other RNA and protein features, may offer significant insight when screening sequencing data for novel disease-associated mutations. Our results also suggest potential utility in evaluating the mRNA thermodynamic stability profile of a gene when determining the viability of interchanging codons for biological and therapeutic applications. PMID:27933712
Structure of Escherichia coli Arginyl-tRNA Synthetase in Complex with tRNAArg: Pivotal Role of the D-loop.

PubMed

Stephen, Preyesh; Ye, Sheng; Zhou, Ming; Song, Jian; Zhang, Rongguang; Wang, En-Duo; Giegé, Richard; Lin, Sheng-Xiang

2018-05-25

Aminoacyl-tRNA synthetases are essential components in protein biosynthesis. Arginyl-tRNA synthetase (ArgRS) belongs to the small group of aminoacyl-tRNA synthetases requiring cognate tRNA for amino acid activation. The crystal structure of Escherichia coli (Eco) ArgRS has been solved in complex with tRNA Arg at 3.0-Å resolution. With this first bacterial tRNA complex, we are attempting to bridge the gap existing in structure-function understanding in prokaryotic tRNA Arg recognition. The structure shows a tight binding of tRNA on the synthetase through the identity determinant A20 from the D-loop, a tRNA recognition snapshot never elucidated structurally. This interaction of A20 involves 5 amino acids from the synthetase. Additional contacts via U20a and U16 from the D-loop reinforce the interaction. The importance of D-loop recognition in EcoArgRS functioning is supported by a mutagenesis analysis of critical amino acids that anchor tRNA Arg on the synthetase; in particular, mutations at amino acids interacting with A20 affect binding affinity to the tRNA and specificity of arginylation. Altogether the structural and functional data indicate that the unprecedented ArgRS crystal structure represents a snapshot during functioning and suggest that the recognition of the D-loop by ArgRS is an important trigger that anchors tRNA Arg on the synthetase. In this process, A20 plays a major role, together with prominent conformational changes in several ArgRS domains that may eventually lead to the mature ArgRS:tRNA complex and the arginine activation. Functional implications that could be idiosyncratic to the arginine identity of bacterial ArgRSs are discussed. Copyright © 2018 Elsevier Ltd. All rights reserved.
Rapid functional diversification in the structurally conserved ELAV family of neuronal RNA binding proteins

PubMed Central

Samson, Marie-Laure

2008-01-01

Background The Drosophila gene embryonic lethal abnormal visual system (elav) is the prototype of a gene family present in all metazoans. Its members encode structurally conserved neuronal proteins with three RNA Recognition Motifs (RRM) but they paradoxically act at diverse levels of post-transcriptional regulation. In an attempt to understand the history of this family, we searched for orthologs in eleven completely sequenced genomes, including those of humans, D. melanogaster and C. elegans, for which cDNAs are available. Results We analyzed 23 orthologs/paralogs of elav, and found evidence of gain/loss of gene copy number. For one set of genes, including elav itself, the coding sequences are free of introns and their products most resemble ELAV. The remaining genes show remarkable conservation of their exon organization, and their products most resemble FNE and RBP9, proteins encoded by the two elav paralogs of Drosophila. Remarkably, three of the conserved exon junctions are both close to structural elements, involved respectively in protein-RNA interactions and in the regulation of sub-cellular localization, and in the vicinity of diverse sequence variations. Conclusion The data indicate that the essential elav gene of Drosophila is newly emerged, restricted to dipterans and of retrotransposed origin. We propose that the conserved exon junctions constitute potential sites for sequence/function modifications, and that RRM binding proteins, whose function relies upon plastic RNA-protein interactions, may have played an important role in brain evolution. PMID:18715504
G-Quadruplexes influence pri-microRNA processing.

PubMed

Rouleau, Samuel G; Garant, Jean-Michel; Bolduc, François; Bisaillon, Martin; Perreault, Jean-Pierre

2018-02-01

RNA G-Quadruplexes (G4) have been shown to possess many biological functions, including the regulation of microRNA (miRNA) biogenesis and function. However, their impact on pri-miRNA processing remains unknown. We identified G4 located near the Drosha cleavage site in three distinct pri-miRNAs: pri-mir200c, pri-mir451a, and pri-mir497. The folding of the potential G4 motifs was determined in solution. Subsequently, mutations disrupting G4 folding led to important changes in the mature miRNAs levels in cells. Moreover, using small antisense oligonucleotides binding to the pri-miRNA, it was possible to modulate, either positively or negatively, the mature miRNA levels. Together, these data demonstrate that G4 motifs could contribute to the regulation of pri-mRNA processing, a novel role for G4. Considering that bio-informatics screening indicates that between 9% and 50% of all pri-miRNAs contain a putative G4, these structures possess interesting potential as future therapeutic targets.
Design of a bioactive small molecule that targets r(AUUCU) repeats in spinocerebellar ataxia 10.

PubMed

Yang, Wang-Yong; Gao, Rui; Southern, Mark; Sarkar, Partha S; Disney, Matthew D

2016-06-01

RNA is an important target for chemical probes of function and lead therapeutics; however, it is difficult to target with small molecules. One approach to tackle this problem is to identify compounds that target RNA structures and utilize them to multivalently target RNA. Here we show that small molecules can be identified to selectively bind RNA base pairs by probing a library of RNA-focused small molecules. A small molecule that selectively binds AU base pairs informed design of a dimeric compound (2AU-2) that targets the pathogenic RNA, expanded r(AUUCU) repeats, that causes spinocerebellar ataxia type 10 (SCA10) in patient-derived cells. Indeed, 2AU-2 (50 nM) ameliorates various aspects of SCA10 pathology including improvement of mitochondrial dysfunction, reduced activation of caspase 3, and reduction of nuclear foci. These studies provide a first-in-class chemical probe to study SCA10 RNA toxicity and potentially define broadly applicable compounds targeting RNA AU base pairs in cells.
Free-energy landscape of a hyperstable RNA tetraloop

PubMed Central

Miner, Jacob C.; Chen, Alan A.; García, Angel E.

2016-01-01

We report the characterization of the energy landscape and the folding/unfolding thermodynamics of a hyperstable RNA tetraloop obtained through high-performance molecular dynamics simulations at microsecond timescales. Sampling of the configurational landscape is conducted using temperature replica exchange molecular dynamics over three isochores at high, ambient, and negative pressures to determine the thermodynamic stability and the free-energy landscape of the tetraloop. The simulations reveal reversible folding/unfolding transitions of the tetraloop into the canonical A-RNA conformation and the presence of two alternative configurations, including a left-handed Z-RNA conformation and a compact purine Triplet. Increasing hydrostatic pressure shows a stabilizing effect on the A-RNA conformation and a destabilization of the left-handed Z-RNA. Our results provide a comprehensive description of the folded free-energy landscape of a hyperstable RNA tetraloop and highlight the significant advances of all-atom molecular dynamics in describing the unbiased folding of a simple RNA secondary structure motif. PMID:27233937
Viperin mRNA is a novel target for the human RNase MRP/RNase P endoribonuclease.

PubMed

Mattijssen, Sandy; Hinson, Ella R; Onnekink, Carla; Hermanns, Pia; Zabel, Bernhard; Cresswell, Peter; Pruijn, Ger J M

2011-07-01

RNase MRP is a conserved endoribonuclease, in humans consisting of a 267-nucleotide RNA associated with 7-10 proteins. Mutations in its RNA component lead to several autosomal recessive skeletal dysplasias, including cartilage-hair hypoplasia (CHH). Because the known substrates of mammalian RNase MRP, pre-ribosomal RNA, and RNA involved in mitochondrial DNA replication are not likely involved in CHH, we analyzed the effects of RNase MRP (and the structurally related RNase P) depletion on mRNAs using DNA microarrays. We confirmed the upregulation of the interferon-inducible viperin mRNA by RNAi experiments and this appeared to be independent of the interferon response. We detected two cleavage sites for RNase MRP/RNase P in the coding sequence of viperin mRNA. This is the first study providing direct evidence for the cleavage of a mRNA by RNase MRP/RNase P in human cells. Implications for the involvement in the pathophysiology of CHH are discussed.
Developing a Fluorescent Toolbox To Shed Light on the Mysteries of RNA.

PubMed

Alexander, Seth C; Devaraj, Neal K

2017-10-03

Technologies that detect and image RNA have illuminated the complex roles played by RNA, redefining the traditional and superficial role first outlined by the central dogma of biology. Because there is such a wide diversity of RNA structure arising from an assortment of functions within biology, a toolbox of approaches have emerged for investigation of this important class of biomolecules. These methods are necessary to detect and elucidate the localization and dynamics of specific RNAs and in doing so unlock our understanding of how RNA dysregulation leads to disease. Current methods for detecting and imaging RNA include in situ hybridization techniques, fluorescent aptamers, RNA binding proteins fused to fluorescent reporters, and covalent labeling strategies. Because of the inherent diversity of these methods, each approach comes with a set of strengths and limitations that leave room for future improvement. This perspective seeks to highlight the most recent advances and remaining challenges for the wide-ranging toolbox of technologies that illuminate RNA's contribution to cellular complexity.
Mutually Exclusive Splicing of the Insect Dscam Pre-mRNA Directed by Competing Intronic RNA Secondary Structures

PubMed Central

Graveley, Brenton R.

2008-01-01

Summary Drosophila Dscam encodes 38,016 distinct axon guidance receptors through the mutually exclusive alternative splicing of 95 variable exons. Importantly, known mechanisms that ensure the mutually exclusive splicing of pairs of exons cannot explain this phenomenon in Dscam. I have identified two classes of conserved elements in the Dscam exon 6 cluster, which contains 48 alternative exons—the docking site, located in the intron downstream of constitutive exon 5, and the selector sequences, which are located upstream of each exon 6 variant. Strikingly, each selector sequence is complementary to a portion of the docking site, and this pairing juxtaposes one, and only one, alternative exon to the upstream constitutive exon. The mutually exclusive nature of the docking site:selector sequence interactions suggests that the formation of these competing RNA structures is a central component of the mechanism guaranteeing that only one exon 6 variant is included in each Dscam mRNA. PMID:16213213
Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures

PubMed Central

Stark, Alexander; Lin, Michael F.; Kheradpour, Pouya; Pedersen, Jakob S.; Parts, Leopold; Carlson, Joseph W.; Crosby, Madeline A.; Rasmussen, Matthew D.; Roy, Sushmita; Deoras, Ameya N.; Ruby, J. Graham; Brennecke, Julius; Hodges, Emily; Hinrichs, Angie S.; Caspi, Anat; Paten, Benedict; Park, Seung-Won; Han, Mira V.; Maeder, Morgan L.; Polansky, Benjamin J.; Robson, Bryanne E.; Aerts, Stein; van Helden, Jacques; Hassan, Bassem; Gilbert, Donald G.; Eastman, Deborah A.; Rice, Michael; Weir, Michael; Hahn, Matthew W.; Park, Yongkyu; Dewey, Colin N.; Pachter, Lior; Kent, W. James; Haussler, David; Lai, Eric C.; Bartel, David P.; Hannon, Gregory J.; Kaufman, Thomas C.; Eisen, Michael B.; Clark, Andrew G.; Smith, Douglas; Celniker, Susan E.; Gelbart, William M.; Kellis, Manolis

2008-01-01

Sequencing of multiple related species followed by comparative genomics analysis constitutes a powerful approach for the systematic understanding of any genome. Here, we use the genomes of 12 Drosophila species for the de novo discovery of functional elements in the fly. Each type of functional element shows characteristic patterns of change, or ‘evolutionary signatures’, dictated by its precise selective constraints. Such signatures enable recognition of new protein-coding genes and exons, spurious and incorrect gene annotations, and numerous unusual gene structures, including abundant stop-codon readthrough. Similarly, we predict non-protein-coding RNA genes and structures, and new microRNA (miRNA) genes. We provide evidence of miRNA processing and functionality from both hairpin arms and both DNA strands. We identify several classes of pre- and post-transcriptional regulatory motifs, and predict individual motif instances with high confidence. We also study how discovery power scales with the divergence and number of species compared, and we provide general guidelines for comparative studies. PMID:17994088
Free Energy Landscape and Multiple Folding Pathways of an H-Type RNA Pseudoknot

PubMed Central

Bian, Yunqiang; Zhang, Jian; Wang, Jun; Wang, Jihua; Wang, Wei

2015-01-01

How RNA sequences fold to specific tertiary structures is one of the key problems for understanding their dynamics and functions. Here, we study the folding process of an H-type RNA pseudoknot by performing a large-scale all-atom MD simulation and bias-exchange metadynamics. The folding free energy landscapes are obtained and several folding intermediates are identified. It is suggested that the folding occurs via multiple mechanisms, including a step-wise mechanism starting either from the first helix or the second, and a cooperative mechanism with both helices forming simultaneously. Despite of the multiple mechanism nature, the ensemble folding kinetics estimated from a Markov state model is single-exponential. It is also found that the correlation between folding and binding of metal ions is significant, and the bound ions mediate long-range interactions in the intermediate structures. Non-native interactions are found to be dominant in the unfolded state and also present in some intermediates, possibly hinder the folding process of the RNA. PMID:26030098
Modular architecture of eukaryotic RNase P and RNase MRP revealed by electron microscopy

PubMed Central

Hipp, Katharina; Galani, Kyriaki; Batisse, Claire; Prinz, Simone; Böttcher, Bettina

2012-01-01

Ribonuclease P (RNase P) and RNase MRP are closely related ribonucleoprotein enzymes, which process RNA substrates including tRNA precursors for RNase P and 5.8 S rRNA precursors, as well as some mRNAs, for RNase MRP. The structures of RNase P and RNase MRP have not yet been solved, so it is unclear how the proteins contribute to the structure of the complexes and how substrate specificity is determined. Using electron microscopy and image processing we show that eukaryotic RNase P and RNase MRP have a modular architecture, where proteins stabilize the RNA fold and contribute to cavities, channels and chambers between the modules. Such features are located at strategic positions for substrate recognition by shape and coordination of the cleaved-off sequence. These are also the sites of greatest difference between RNase P and RNase MRP, highlighting the importance of the adaptation of this region to the different substrates. PMID:22167472
In vitro selection of functional nucleic acids

NASA Technical Reports Server (NTRS)

Wilson, D. S.; Szostak, J. W.

1999-01-01

In vitro selection allows rare functional RNA or DNA molecules to be isolated from pools of over 10(15) different sequences. This approach has been used to identify RNA and DNA ligands for numerous small molecules, and recent three-dimensional structure solutions have revealed the basis for ligand recognition in several cases. By selecting high-affinity and -specificity nucleic acid ligands for proteins, promising new therapeutic and diagnostic reagents have been identified. Selection experiments have also been carried out to identify ribozymes that catalyze a variety of chemical transformations, including RNA cleavage, ligation, and synthesis, as well as alkylation and acyl-transfer reactions and N-glycosidic and peptide bond formation. The existence of such RNA enzymes supports the notion that ribozymes could have directed a primitive metabolism before the evolution of protein synthesis. New in vitro protein selection techniques should allow for a direct comparison of the frequency of ligand binding and catalytic structures in pools of random sequence polynucleotides versus polypeptides.
JNSViewer—A JavaScript-based Nucleotide Sequence Viewer for DNA/RNA secondary structures

PubMed Central

Dong, Min; Graham, Mitchell; Yadav, Nehul

2017-01-01

Many tools are available for visualizing RNA or DNA secondary structures, but there is scarce implementation in JavaScript that provides seamless integration with the increasingly popular web computational platforms. We have developed JNSViewer, a highly interactive web service, which is bundled with several popular tools for DNA/RNA secondary structure prediction and can provide precise and interactive correspondence among nucleotides, dot-bracket data, secondary structure graphs, and genic annotations. In JNSViewer, users can perform RNA secondary structure predictions with different programs and settings, add customized genic annotations in GFF format to structure graphs, search for specific linear motifs, and extract relevant structure graphs of sub-sequences. JNSViewer also allows users to choose a transcript or specific segment of Arabidopsis thaliana genome sequences and predict the corresponding secondary structure. Popular genome browsers (i.e., JBrowse and BrowserGenome) were integrated into JNSViewer to provide powerful visualizations of chromosomal locations, genic annotations, and secondary structures. In addition, we used StructureFold with default settings to predict some RNA structures for Arabidopsis by incorporating in vivo high-throughput RNA structure profiling data and stored the results in our web server, which might be a useful resource for RNA secondary structure studies in plants. JNSViewer is available at http://bioinfolab.miamioh.edu/jnsviewer/index.html. PMID:28582416
Topology of RNA–protein nucleobase–amino acid π–π interactions and comparison to analogous DNA–protein π–π contacts

PubMed Central

Wilson, Katie A.; Holland, Devany J.; Wetmore, Stacey D.

2016-01-01

The present work analyzed 120 high-resolution X-ray crystal structures and identified 335 RNA–protein π-interactions (154 nonredundant) between a nucleobase and aromatic (W, H, F, or Y) or acyclic (R, E, or D) π-containing amino acid. Each contact was critically analyzed (including using a visual inspection protocol) to determine the most prevalent composition, structure, and strength of π-interactions at RNA–protein interfaces. These contacts most commonly involve F and U, with U:F interactions comprising one-fifth of the total number of contacts found. Furthermore, the RNA and protein π-systems adopt many different relative orientations, although there is a preference for more parallel (stacked) arrangements. Due to the variation in structure, the strength of the intermolecular forces between the RNA and protein components (as determined from accurate quantum chemical calculations) exhibits a significant range, with most of the contacts providing significant stability to the associated RNA–protein complex (up to −65 kJ mol−1). Comparison to the analogous DNA–protein π-interactions emphasizes differences in RNA– and DNA–protein π-interactions at the molecular level, including the greater abundance of RNA contacts and the involvement of different nucleobase/amino acid residues. Overall, our results provide a clearer picture of the molecular basis of nucleic acid–protein binding and underscore the important role of these contacts in biology, including the significant contribution of π–π interactions to the stability of nucleic acid–protein complexes. Nevertheless, more work is still needed in this area in order to further appreciate the properties and roles of RNA nucleobase–amino acid π-interactions in nature. PMID:26979279
The DEAH-box helicase Dhr1 dissociates U3 from the pre-rRNA to promote formation of the central pseudoknot.

PubMed

Sardana, Richa; Liu, Xin; Granneman, Sander; Zhu, Jieyi; Gill, Michael; Papoulas, Ophelia; Marcotte, Edward M; Tollervey, David; Correll, Carl C; Johnson, Arlen W

2015-02-01

In eukaryotes, the highly conserved U3 small nucleolar RNA (snoRNA) base-pairs to multiple sites in the pre-ribosomal RNA (pre-rRNA) to promote early cleavage and folding events. Binding of the U3 box A region to the pre-rRNA is mutually exclusive with folding of the central pseudoknot (CPK), a universally conserved rRNA structure of the small ribosomal subunit essential for protein synthesis. Here, we report that the DEAH-box helicase Dhr1 (Ecm16) is responsible for displacing U3. An active site mutant of Dhr1 blocked release of U3 from the pre-ribosome, thereby trapping a pre-40S particle. This particle had not yet achieved its mature structure because it contained U3, pre-rRNA, and a number of early-acting ribosome synthesis factors but noticeably lacked ribosomal proteins (r-proteins) that surround the CPK. Dhr1 was cross-linked in vivo to the pre-rRNA and to U3 sequences flanking regions that base-pair to the pre-rRNA including those that form the CPK. Point mutations in the box A region of U3 suppressed a cold-sensitive mutation of Dhr1, strongly indicating that U3 is an in vivo substrate of Dhr1. To support the conclusions derived from in vivo analysis we showed that Dhr1 unwinds U3-18S duplexes in vitro by using a mechanism reminiscent of DEAD box proteins.
The DEAH-box Helicase Dhr1 Dissociates U3 from the Pre-rRNA to Promote Formation of the Central Pseudoknot

PubMed Central

Granneman, Sander; Zhu, Jieyi; Gill, Michael; Papoulas, Ophelia; Marcotte, Edward M.; Tollervey, David; Correll, Carl C.; Johnson, Arlen W.

2015-01-01

In eukaryotes, the highly conserved U3 small nucleolar RNA (snoRNA) base-pairs to multiple sites in the pre-ribosomal RNA (pre-rRNA) to promote early cleavage and folding events. Binding of the U3 box A region to the pre-rRNA is mutually exclusive with folding of the central pseudoknot (CPK), a universally conserved rRNA structure of the small ribosomal subunit essential for protein synthesis. Here, we report that the DEAH-box helicase Dhr1 (Ecm16) is responsible for displacing U3. An active site mutant of Dhr1 blocked release of U3 from the pre-ribosome, thereby trapping a pre-40S particle. This particle had not yet achieved its mature structure because it contained U3, pre-rRNA, and a number of early-acting ribosome synthesis factors but noticeably lacked ribosomal proteins (r-proteins) that surround the CPK. Dhr1 was cross-linked in vivo to the pre-rRNA and to U3 sequences flanking regions that base-pair to the pre-rRNA including those that form the CPK. Point mutations in the box A region of U3 suppressed a cold-sensitive mutation of Dhr1, strongly indicating that U3 is an in vivo substrate of Dhr1. To support the conclusions derived from in vivo analysis we showed that Dhr1 unwinds U3-18S duplexes in vitro by using a mechanism reminiscent of DEAD box proteins. PMID:25710520
Dinucleotide controlled null models for comparative RNA gene prediction.

PubMed

Gesell, Tanja; Washietl, Stefan

2008-05-27

Comparative prediction of RNA structures can be used to identify functional noncoding RNAs in genomic screens. It was shown recently by Babak et al. [BMC Bioinformatics. 8:33] that RNA gene prediction programs can be biased by the genomic dinucleotide content, in particular those programs using a thermodynamic folding model including stacking energies. As a consequence, there is need for dinucleotide-preserving control strategies to assess the significance of such predictions. While there have been randomization algorithms for single sequences for many years, the problem has remained challenging for multiple alignments and there is currently no algorithm available. We present a program called SISSIz that simulates multiple alignments of a given average dinucleotide content. Meeting additional requirements of an accurate null model, the randomized alignments are on average of the same sequence diversity and preserve local conservation and gap patterns. We make use of a phylogenetic substitution model that includes overlapping dependencies and site-specific rates. Using fast heuristics and a distance based approach, a tree is estimated under this model which is used to guide the simulations. The new algorithm is tested on vertebrate genomic alignments and the effect on RNA structure predictions is studied. In addition, we directly combined the new null model with the RNAalifold consensus folding algorithm giving a new variant of a thermodynamic structure based RNA gene finding program that is not biased by the dinucleotide content. SISSIz implements an efficient algorithm to randomize multiple alignments preserving dinucleotide content. It can be used to get more accurate estimates of false positive rates of existing programs, to produce negative controls for the training of machine learning based programs, or as standalone RNA gene finding program. Other applications in comparative genomics that require randomization of multiple alignments can be considered. SISSIz is available as open source C code that can be compiled for every major platform and downloaded here: http://sourceforge.net/projects/sissiz.
RNAdualPF: software to compute the dual partition function with sample applications in molecular evolution theory.

PubMed

Garcia-Martin, Juan Antonio; Bayegan, Amir H; Dotu, Ivan; Clote, Peter

2016-10-19

RNA inverse folding is the problem of finding one or more sequences that fold into a user-specified target structure s 0 , i.e. whose minimum free energy secondary structure is identical to the target s 0 . Here we consider the ensemble of all RNA sequences that have low free energy with respect to a given target s 0 . We introduce the program RNAdualPF, which computes the dual partition function Z ∗ , defined as the sum of Boltzmann factors exp(-E(a,s 0 )/RT) of all RNA nucleotide sequences a compatible with target structure s 0 . Using RNAdualPF, we efficiently sample RNA sequences that approximately fold into s 0 , where additionally the user can specify IUPAC sequence constraints at certain positions, and whether to include dangles (energy terms for stacked, single-stranded nucleotides). Moreover, since we also compute the dual partition function Z ∗ (k) over all sequences having GC-content k, the user can require that all sampled sequences have a precise, specified GC-content. Using Z ∗ , we compute the dual expected energy 〈E ∗ 〉, and use it to show that natural RNAs from the Rfam 12.0 database have higher minimum free energy than expected, thus suggesting that functional RNAs are under evolutionary pressure to be only marginally thermodynamically stable. We show that C. elegans precursor microRNA (pre-miRNA) is significantly non-robust with respect to mutations, by comparing the robustness of each wild type pre-miRNA sequence with 2000 [resp. 500] sequences of the same GC-content generated by RNAdualPF, which approximately [resp. exactly] fold into the wild type target structure. We confirm and strengthen earlier findings that precursor microRNAs and bacterial small noncoding RNAs display plasticity, a measure of structural diversity. We describe RNAdualPF, which rapidly computes the dual partition function Z ∗ and samples sequences having low energy with respect to a target structure, allowing sequence constraints and specified GC-content. Using different inverse folding software, another group had earlier shown that pre-miRNA is mutationally robust, even controlling for compositional bias. Our opposite conclusion suggests a cautionary note that computationally based insights into molecular evolution may heavily depend on the software used. C/C++-software for RNAdualPF is available at http://bioinformatics.bc.edu/clotelab/RNAdualPF .
Evaluating and learning from RNA pseudotorsional space: quantitative validation of a reduced representation for RNA structure.

PubMed

Wadley, Leven M; Keating, Kevin S; Duarte, Carlos M; Pyle, Anna Marie

2007-09-28

Quantitatively describing RNA structure and conformational elements remains a formidable problem. Seven standard torsion angles and the sugar pucker are necessary to characterize the conformation of an RNA nucleotide completely. Progress has been made toward understanding the discrete nature of RNA structure, but classifying simple and ubiquitous structural elements such as helices and motifs remains a difficult task. One approach for describing RNA structure in a simple, mathematically consistent, and computationally accessible manner involves the invocation of two pseudotorsions, eta (C4'(n-1), P(n), C4'(n), P(n+1)) and theta (P(n), C4'(n), P(n+1), C4'(n+1)), which can be used to describe RNA conformation in much the same way that varphi and psi are used to describe backbone configuration of proteins. Here, we conduct an exploration and statistical evaluation of pseudotorsional space and of the Ramachandran-like eta-theta plot. We show that, through the rigorous quantitative analysis of the eta-theta plot, the pseudotorsional descriptors eta and theta, together with sugar pucker, are sufficient to describe RNA backbone conformation fully in most cases. These descriptors are also shown to contain considerable information about nucleotide base conformation, revealing a previously uncharacterized interplay between backbone and base orientation. A window function analysis is used to discern statistically relevant regions of density in the eta-theta scatter plot and then nucleotides in colocalized clusters in the eta-theta plane are shown to have similar 3-D structures through RMSD analysis of the RNA structural constituents. We find that major clusters in the eta-theta plot are few, underscoring the discrete nature of RNA backbone conformation. Like the Ramachandran plot, the eta-theta plot is a valuable system for conceptualizing biomolecular conformation, it is a useful tool for analyzing RNA tertiary structures, and it is a vital component of new approaches for solving the 3-D structures of large RNA molecules and RNA assemblies.

Comparative Structural and Functional Analysis of Bunyavirus and Arenavirus Cap-Snatching Endonucleases

PubMed Central

Reguera, Juan; Gerlach, Piotr; Rosenthal, Maria; Gaudon, Stephanie; Coscia, Francesca; Günther, Stephan; Cusack, Stephen

2016-01-01

Segmented negative strand RNA viruses of the arena-, bunya- and orthomyxovirus families uniquely carry out viral mRNA transcription by the cap-snatching mechanism. This involves cleavage of host mRNAs close to their capped 5′ end by an endonuclease (EN) domain located in the N-terminal region of the viral polymerase. We present the structure of the cap-snatching EN of Hantaan virus, a bunyavirus belonging to hantavirus genus. Hantaan EN has an active site configuration, including a metal co-ordinating histidine, and nuclease activity similar to the previously reported La Crosse virus and Influenza virus ENs (orthobunyavirus and orthomyxovirus respectively), but is more active in cleaving a double stranded RNA substrate. In contrast, Lassa arenavirus EN has only acidic metal co-ordinating residues. We present three high resolution structures of Lassa virus EN with different bound ion configurations and show in comparative biophysical and biochemical experiments with Hantaan, La Crosse and influenza ENs that the isolated Lassa EN is essentially inactive. The results are discussed in the light of EN activation mechanisms revealed by recent structures of full-length influenza virus polymerase. PMID:27304209
When transcription goes on Holliday: Double Holliday junctions block RNA polymerase II transcription in vitro.

PubMed

Pipathsouk, Anne; Belotserkovskii, Boris P; Hanawalt, Philip C

2017-02-01

Non-canonical DNA structures can obstruct transcription. This transcription blockage could have various biological consequences, including genomic instability and gratuitous transcription-coupled repair. Among potential structures causing transcription blockage are Holliday junctions (HJs), which can be generated as intermediates in homologous recombination or during processing of stalled replication forks. Of particular interest is the double Holliday junction (DHJ), which contains two HJs. Topological considerations impose the constraint that the total number of helical turns in the DNA duplexes between the junctions cannot be altered as long as the flanking DNA duplexes are intact. Thus, the DHJ structure should strongly resist transient unwinding during transcription; consequently, it is predicted to cause significantly stronger blockage than single HJ structures. The patterns of transcription blockage obtained for RNA polymerase II transcription in HeLa cell nuclear extracts were in accordance with this prediction. However, we did not detect transcription blockage with purified T7 phage RNA polymerase; we discuss a possible explanation for this difference. In general, our findings implicate naturally occurring Holliday junctions in transcription arrest. Copyright © 2016 Elsevier B.V. All rights reserved.
Mapping RNA Structure In Vitro with SHAPE Chemistry and Next-Generation Sequencing (SHAPE-Seq).

PubMed

Watters, Kyle E; Lucks, Julius B

2016-01-01

Mapping RNA structure with selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) chemistry has proven to be a versatile method for characterizing RNA structure in a variety of contexts. SHAPE reagents covalently modify RNAs in a structure-dependent manner to create adducts at the 2'-OH group of the ribose backbone at nucleotides that are structurally flexible. The positions of these adducts are detected using reverse transcriptase (RT) primer extension, which stops one nucleotide before the modification, to create a pool of cDNAs whose lengths reflect the location of SHAPE modification. Quantification of the cDNA pools is used to estimate the "reactivity" of each nucleotide in an RNA molecule to the SHAPE reagent. High reactivities indicate nucleotides that are structurally flexible, while low reactivities indicate nucleotides that are inflexible. These SHAPE reactivities can then be used to infer RNA structures by restraining RNA structure prediction algorithms. Here, we provide a state-of-the-art protocol describing how to perform in vitro RNA structure probing with SHAPE chemistry using next-generation sequencing to quantify cDNA pools and estimate reactivities (SHAPE-Seq). The use of next-generation sequencing allows for higher throughput, more consistent data analysis, and multiplexing capabilities. The technique described herein, SHAPE-Seq v2.0, uses a universal reverse transcription priming site that is ligated to the RNA after SHAPE modification. The introduced priming site allows for the structural analysis of an RNA independent of its sequence.
Switch from translation to RNA replication in a positive-stranded RNA virus

PubMed Central

Gamarnik, Andrea V.; Andino, Raul

1998-01-01

In positive-stranded viruses, the genomic RNA serves as a template for both translation and RNA replication. Using poliovirus as a model, we examined the interaction between these two processes. We show that the RNA polymerase is unable to replicate RNA templates undergoing translation. We discovered that an RNA structure at the 5′ end of the viral genome, next to the internal ribosomal entry site, carries signals that control both viral translation and RNA synthesis. The interaction of this RNA structure with the cellular factor PCBP up-regulates viral translation, while the binding of the viral protein 3CD represses translation and promotes negative-strand RNA synthesis. We propose that the interaction of 3CD with this RNA structure controls whether the genomic RNA is used for translation or RNA replication. PMID:9694795
Analysis of RNA structure using small-angle X-ray scattering

PubMed Central

Cantara, William A.; Olson, Erik D.; Musier-Forsyth, Karin

2016-01-01

In addition to their role in correctly attaching specific amino acids to cognate tRNAs, aminoacyl-tRNA synthetases (aaRS) have been found to possess many alternative functions and often bind to and act on other nucleic acids. In contrast to the well-defined 3D structure of tRNA, the structures of many of the other RNAs recognized by aaRSs have not been solved. Despite advances in the use of X-ray crystallography (XRC), nuclear magnetic resonance (NMR) spectroscopy and cryo-electron microscopy (cryo-EM) for structural characterization of biomolecules, significant challenges to solving RNA structures still exist. Recently, small-angle X-ray scattering (SAXS) has been increasingly employed to characterize the 3D structures of RNAs and RNA-protein complexes. SAXS is capable of providing low-resolution tertiary structure information under physiological conditions and with less intensive sample preparation and data analysis requirements than XRC, NMR and cryo-EM. In this article, we describe best practices involved in the process of RNA and RNA-protein sample preparation, SAXS data collection, data analysis, and structural model building. PMID:27777026
Structure and mechanism of a molecular rheostat, an RNA thermometer that modulates immune evasion by Neisseria meningitidis

PubMed Central

Barnwal, Ravi Pratap; Loh, Edmund; Godin, Katherine S.; Yip, Jordan; Lavender, Hayley; Tang, Christoph M.; Varani, Gabriele

2016-01-01

Neisseria meningitidis causes bacterial meningitis and septicemia. It evades the host complement system by upregulating expression of immune evasion factors in response to changes in temperature. RNA thermometers within mRNAs control expression of bacterial immune evasion factors, including CssA, in the 5′-untranslated region of the operon for capsule biosynthesis. We dissect the molecular mechanisms of thermoregulation and report the structure of the CssA thermometer. We show that the RNA thermometer acts as a rheostat, whose stability is optimized to respond in a small temperature range around 37°C as occur within the upper airways during infection. Small increases in temperature gradually open up the structure to allow progressively increased access to the ribosome binding site. Even small changes in stability induced by mutations of imperfect base pairs, as in naturally occurring polymorphisms, shift the thermometer response outside of the desired temperature range, suggesting that its activity could be modulated by pharmacological intervention. PMID:27369378
Self-assembly of free-standing RNA membranes

NASA Astrophysics Data System (ADS)

Han, Daehoon; Park, Yongkuk; Kim, Hyejin; Lee, Jong Bum

2014-07-01

RNA has emerged as a promising material for nanostructure and microstructure engineering. Although rare, some macroscopic RNA structures have also been constructed using lipid or polymer materials. Here, we report the first example of an enzymatically generated RNA membrane. This robust and free-standing RNA membrane has a macroscopic structure and is generated without any polymer support or complexation. Our RNA membrane is fabricated following two sequential processes, complementary rolling circle transcription and evaporation-induced self-assembly, and its structural and functional properties are rationally controlled by adjusting RNA base pairing. In this study, three types of RNA membranes are fabricated and are used to demonstrate potential applications.
Aggregation and folding phase transitions of RNA molecules

NASA Astrophysics Data System (ADS)

Bundschuh, Ralf

2007-03-01

RNA is a biomolecule that is involved in nearly all aspects of cellular functions. In order to perform many of these functions, RNA molecules have to fold into specific secondary structures. This folding is driven by the tendency of the bases to form Watson-Crick base pairs. Beyond the biological importance of RNA, the relatively simple rules for structure formation of RNA make it a very interesting system from the statistical physics point of view. We will present examples of phase transitions in RNA secondary structure formation that are amenable to analytical descriptions. A special focus will be on aggregation between several RNA molecules which is important for some regulatory circuits based on RNA structure, triplet repeat diseases like Huntington's, and as a model for prion diseases. We show that depending on the relative strength of the intramolecular and the intermolecular base pairing, RNA molecules undergo a transition into an aggregated phase and quantitatively characterize this transition.
The Catalytic Domain of Topological Knot tRNA Methyltransferase (TrmH) Discriminates between Substrate tRNA and Nonsubstrate tRNA via an Induced-fit Process*

PubMed Central

Ochi, Anna; Makabe, Koki; Yamagami, Ryota; Hirata, Akira; Sakaguchi, Reiko; Hou, Ya-Ming; Watanabe, Kazunori; Nureki, Osamu; Kuwajima, Kunihiro; Hori, Hiroyuki

2013-01-01

A conserved guanosine at position 18 (G18) in the D-loop of tRNAs is often modified to 2′-O-methylguanosine (Gm). Formation of Gm18 in eubacterial tRNA is catalyzed by tRNA (Gm18) methyltransferase (TrmH). TrmH enzymes can be divided into two types based on their substrate tRNA specificity. Type I TrmH, including Thermus thermophilus TrmH, can modify all tRNA species, whereas type II TrmH, for example Escherichia coli TrmH, modifies only a subset of tRNA species. Our previous crystal study showed that T. thermophilus TrmH is a class IV S-adenosyl-l-methionine-dependent methyltransferase, which maintains a topological knot structure in the catalytic domain. Because TrmH enzymes have short stretches at the N and C termini instead of a clear RNA binding domain, these stretches are believed to be involved in tRNA recognition. In this study, we demonstrate by site-directed mutagenesis that both N- and C-terminal regions function in tRNA binding. However, in vitro and in vivo chimera protein studies, in which four chimeric proteins of type I and II TrmHs were used, demonstrated that the catalytic domain discriminates substrate tRNAs from nonsubstrate tRNAs. Thus, the N- and C-terminal regions do not function in the substrate tRNA discrimination process. Pre-steady state analysis of complex formation between mutant TrmH proteins and tRNA by stopped-flow fluorescence measurement revealed that the C-terminal region works in the initial binding process, in which nonsubstrate tRNA is not excluded, and that structural movement of the motif 2 region of the catalytic domain in an induced-fit process is involved in substrate tRNA discrimination. PMID:23867454
Automated classification of RNA 3D motifs and the RNA 3D Motif Atlas

PubMed Central

Petrov, Anton I.; Zirbel, Craig L.; Leontis, Neocles B.

2013-01-01

The analysis of atomic-resolution RNA three-dimensional (3D) structures reveals that many internal and hairpin loops are modular, recurrent, and structured by conserved non-Watson–Crick base pairs. Structurally similar loops define RNA 3D motifs that are conserved in homologous RNA molecules, but can also occur at nonhomologous sites in diverse RNAs, and which often vary in sequence. To further our understanding of RNA motif structure and sequence variability and to provide a useful resource for structure modeling and prediction, we present a new method for automated classification of internal and hairpin loop RNA 3D motifs and a new online database called the RNA 3D Motif Atlas. To classify the motif instances, a representative set of internal and hairpin loops is automatically extracted from a nonredundant list of RNA-containing PDB files. Their structures are compared geometrically, all-against-all, using the FR3D program suite. The loops are clustered into motif groups, taking into account geometric similarity and structural annotations and making allowance for a variable number of bulged bases. The automated procedure that we have implemented identifies all hairpin and internal loop motifs previously described in the literature. All motif instances and motif groups are assigned unique and stable identifiers and are made available in the RNA 3D Motif Atlas (http://rna.bgsu.edu/motifs), which is automatically updated every four weeks. The RNA 3D Motif Atlas provides an interactive user interface for exploring motif diversity and tools for programmatic data access. PMID:23970545
Topological constraints are major determinants of tRNA tertiary structure and dynamics and provide basis for tertiary folding cooperativity

PubMed Central

Mustoe, Anthony M.; Brooks, Charles L.; Al-Hashimi, Hashim M.

2014-01-01

Recent studies have shown that basic steric and connectivity constraints encoded at the secondary structure level are key determinants of 3D structure and dynamics in simple two-way RNA junctions. However, the role of these topological constraints in higher order RNA junctions remains poorly understood. Here, we use a specialized coarse-grained molecular dynamics model to directly probe the thermodynamic contributions of topological constraints in defining the 3D architecture and dynamics of transfer RNA (tRNA). Topological constraints alone restrict tRNA's allowed conformational space by over an order of magnitude and strongly discriminate against formation of non-native tertiary contacts, providing a sequence independent source of folding specificity. Topological constraints also give rise to long-range correlations between the relative orientation of tRNA's helices, which in turn provides a mechanism for encoding thermodynamic cooperativity between distinct tertiary interactions. These aspects of topological constraints make it such that only several tertiary interactions are needed to confine tRNA to its native global structure and specify functionally important 3D dynamics. We further show that topological constraints are conserved across tRNA's different naturally occurring secondary structures. Taken together, our results emphasize the central role of secondary-structure-encoded topological constraints in defining RNA 3D structure, dynamics and folding. PMID:25217593
Assembly of Q{beta} viral RNA polymerase with host translational elongation factors EF-Tu and -Ts.

PubMed

Takeshita, Daijiro; Tomita, Kozo

2010-09-07

Replication and transcription of viral RNA genomes rely on host-donated proteins. Qbeta virus infects Escherichia coli and replicates and transcribes its own genomic RNA by Qbeta replicase. Qbeta replicase requires the virus-encoded RNA-dependent RNA polymerase (beta-subunit), and the host-donated translational elongation factors EF-Tu and -Ts, as active core subunits for its RNA polymerization activity. Here, we present the crystal structure of the core Qbeta replicase, comprising the beta-subunit, EF-Tu and -Ts. The beta-subunit has a right-handed structure, and the EF-Tu:Ts binary complex maintains the structure of the catalytic core crevasse of the beta-subunit through hydrophobic interactions, between the finger and thumb domains of the beta-subunit and domain-2 of EF-Tu and the coiled-coil motif of EF-Ts, respectively. These hydrophobic interactions are required for the expression and assembly of the Qbeta replicase complex. Thus, EF-Tu and -Ts have chaperone-like functions in the maintenance of the structure of the active Qbeta replicase. Modeling of the template RNA and the growing RNA in the catalytic site of the Qbeta replicase structure also suggests that structural changes of the RNAs and EF-Tu:Ts should accompany processive RNA polymerization and that EF-Tu:Ts in the Qbeta replicase could function to modulate the RNA folding and structure.
Optimizing sgRNA structure to improve CRISPR-Cas9 knockout efficiency.

PubMed

Dang, Ying; Jia, Gengxiang; Choi, Jennie; Ma, Hongming; Anaya, Edgar; Ye, Chunting; Shankar, Premlata; Wu, Haoquan

2015-12-15

Single-guide RNA (sgRNA) is one of the two key components of the clustered regularly interspaced short palindromic repeats (CRISPR)-Cas9 genome-editing system. The current commonly used sgRNA structure has a shortened duplex compared with the native bacterial CRISPR RNA (crRNA)-transactivating crRNA (tracrRNA) duplex and contains a continuous sequence of thymines, which is the pause signal for RNA polymerase III and thus could potentially reduce transcription efficiency. Here, we systematically investigate the effect of these two elements on knockout efficiency and showed that modifying the sgRNA structure by extending the duplex length and mutating the fourth thymine of the continuous sequence of thymines to cytosine or guanine significantly, and sometimes dramatically, improves knockout efficiency in cells. In addition, the optimized sgRNA structure also significantly increases the efficiency of more challenging genome-editing procedures, such as gene deletion, which is important for inducing a loss of function in non-coding genes. By a systematic investigation of sgRNA structure we find that extending the duplex by approximately 5 bp combined with mutating the continuous sequence of thymines at position 4 to cytosine or guanine significantly increases gene knockout efficiency in CRISPR-Cas9-based genome editing experiments.
Assembly of RNA nanostructures on supported lipid bilayers

NASA Astrophysics Data System (ADS)

Dabkowska, Aleksandra P.; Michanek, Agnes; Jaeger, Luc; Rabe, Michael; Chworos, Arkadiusz; Höök, Fredrik; Nylander, Tommy; Sparr, Emma

2014-12-01

The assembly of nucleic acid nanostructures with controlled size and shape has large impact in the fields of nanotechnology, nanomedicine and synthetic biology. The directed arrangement of nano-structures at interfaces is important for many applications. In spite of this, the use of laterally mobile lipid bilayers to control RNA three-dimensional nanostructure formation on surfaces remains largely unexplored. Here, we direct the self-assembly of RNA building blocks into three-dimensional structures of RNA on fluid lipid bilayers composed of cationic 1,2-dioleoyl-3-trimethylammonium-propane (DOTAP) or mixtures of zwitterionic 1,2-dioleoyl-sn-glycero-3-phosphatidylcholine (DOPC) and cationic sphingosine. We demonstrate the stepwise supramolecular assembly of discrete building blocks through specific and selective RNA-RNA interactions, based on results from quartz crystal microbalance with dissipation (QCM-D), ellipsometry, fluorescence recovery after photobleaching (FRAP) and total internal reflection fluorescence microscopy (TIRF) experiments. The assembly can be controlled to give a densely packed single layer of RNA polyhedrons at the fluid lipid bilayer surface. We show that assembly of the 3D structure can be modulated by sequence specific interactions, surface charge and changes in the salt composition and concentration. In addition, the tertiary structure of the RNA polyhedron can be controllably switched from an extended structure to one that is dense and compact. The versatile approach to building up three-dimensional structures of RNA does not require modification of the surface or the RNA molecules, and can be used as a bottom-up means of nanofabrication of functionalized bio-mimicking surfaces.The assembly of nucleic acid nanostructures with controlled size and shape has large impact in the fields of nanotechnology, nanomedicine and synthetic biology. The directed arrangement of nano-structures at interfaces is important for many applications. In spite of this, the use of laterally mobile lipid bilayers to control RNA three-dimensional nanostructure formation on surfaces remains largely unexplored. Here, we direct the self-assembly of RNA building blocks into three-dimensional structures of RNA on fluid lipid bilayers composed of cationic 1,2-dioleoyl-3-trimethylammonium-propane (DOTAP) or mixtures of zwitterionic 1,2-dioleoyl-sn-glycero-3-phosphatidylcholine (DOPC) and cationic sphingosine. We demonstrate the stepwise supramolecular assembly of discrete building blocks through specific and selective RNA-RNA interactions, based on results from quartz crystal microbalance with dissipation (QCM-D), ellipsometry, fluorescence recovery after photobleaching (FRAP) and total internal reflection fluorescence microscopy (TIRF) experiments. The assembly can be controlled to give a densely packed single layer of RNA polyhedrons at the fluid lipid bilayer surface. We show that assembly of the 3D structure can be modulated by sequence specific interactions, surface charge and changes in the salt composition and concentration. In addition, the tertiary structure of the RNA polyhedron can be controllably switched from an extended structure to one that is dense and compact. The versatile approach to building up three-dimensional structures of RNA does not require modification of the surface or the RNA molecules, and can be used as a bottom-up means of nanofabrication of functionalized bio-mimicking surfaces. Electronic supplementary information (ESI) available: Table with sequences of tRNA units used in this study; schematic structures of the RNA polyhedron and its building blocks; gel electrophoresis characterization of the RNA polyhedron and squares; AFM characterization of RNA tectosquare; schematic structures of RNA-9 and RNA-10 and their association with lipid bilayers; QCM-D frequency and dissipation data (as function of time) for adsorption of RNA polyhedrons, RNA squares and RNA9-10 TIRF images of RNA with Gelstar after photobleaching with analysis; Correlation plot in change of shear viscosity for TS3 and TO3-4 models for the stoichiometry of TS; QCM-D dissipation data for the sequential experiment in Fig. 5a; QCM-D and for the assembly of building blocks at the bilayer scaffold at varying bulk concentrations; QCM-D of adsorption of TS3. See DOI: 10.1039/c4nr05968a
Pseudouridine and N6-methyladenosine modifications weaken PUF protein/RNA interactions

PubMed Central

AlSadhan, Ishraq; Merriman, Dawn K.; Al-Hashimi, Hashim M.; Herschlag, Daniel

2017-01-01

RNA modifications are ubiquitous in biology, with over 100 distinct modifications. While the vast majority were identified and characterized on abundant noncoding RNA such as tRNA and rRNA, the advent of sensitive sequencing-based approaches has led to the discovery of extensive and regulated modification of eukaryotic messenger RNAs as well. The two most abundant mRNA modifications—pseudouridine (Ψ) and N6-methyladenosine (m6A)—affect diverse cellular processes including mRNA splicing, localization, translation, and decay and modulate RNA structure. Here, we test the hypothesis that RNA modifications directly affect interactions between RNA-binding proteins and target RNA. We show that Ψ and m6A weaken the binding of the human single-stranded RNA binding protein Pumilio 2 (hPUM2) to its consensus motif, with individual modifications having effects up to approximately threefold and multiple modifications giving larger effects. While there are likely to be some cases where RNA modifications essentially fully ablate protein binding, here we see modest responses that may be more common. Such modest effects could nevertheless profoundly alter the complex landscape of RNA:protein interactions, and the quantitative rather than qualitative nature of these effects underscores the need for quantitative, systems-level accounting of RNA:protein interactions to understand post-transcriptional regulation. PMID:28138061
From "Cellular" RNA to "Smart" RNA: Multiple Roles of RNA in Genome Stability and Beyond.

PubMed

Michelini, Flavia; Jalihal, Ameya P; Francia, Sofia; Meers, Chance; Neeb, Zachary T; Rossiello, Francesca; Gioia, Ubaldo; Aguado, Julio; Jones-Weinert, Corey; Luke, Brian; Biamonti, Giuseppe; Nowacki, Mariusz; Storici, Francesca; Carninci, Piero; Walter, Nils G; Fagagna, Fabrizio d'Adda di

2018-04-25

Coding for proteins has been considered the main function of RNA since the "central dogma" of biology was proposed. The discovery of noncoding transcripts shed light on additional roles of RNA, ranging from the support of polypeptide synthesis, to the assembly of subnuclear structures, to gene expression modulation. Cellular RNA has therefore been recognized as a central player in often unanticipated biological processes, including genomic stability. This ever-expanding list of functions inspired us to think of RNA as a "smart" phone, which has replaced the older obsolete "cellular" phone. In this review, we summarize the last two decades of advances in research on the interface between RNA biology and genome stability. We start with an account of the emergence of noncoding RNA, and then we discuss the involvement of RNA in DNA damage signaling and repair, telomere maintenance, and genomic rearrangements. We continue with the depiction of single-molecule RNA detection techniques, and we conclude by illustrating the possibilities of RNA modulation in hopes of creating or improving new therapies. The widespread biological functions of RNA have made this molecule a reoccurring theme in basic and translational research, warranting it the transcendence from classically studied "cellular" RNA to "smart" RNA.
Design of a Temperature-Responsive Transcription Terminator.

PubMed

Roßmanith, Johanna; Weskamp, Mareen; Narberhaus, Franz

2018-02-16

RNA structures regulate various steps in gene expression. Transcription in bacteria is typically terminated by stable hairpin structures. Translation initiation can be modulated by metabolite- or temperature-sensitive RNA structures, called riboswitches or RNA thermometers (RNATs), respectively. RNATs control translation initiation by occlusion of the ribosome binding site at low temperatures. Increasing temperatures destabilize the RNA structure and facilitate ribosome access. In this study, we exploited temperature-responsive RNAT structures to design regulatory elements that control transcription termination instead of translation initiation in Escherichia coli. In order to mimic the structure of factor-independent intrinsic terminators, naturally occurring RNAT hairpins were genetically engineered to be followed by a U-stretch. Functional temperature-responsive terminators (thermoterms) prevented mRNA synthesis at low temperatures but resumed transcription after a temperature upshift. The successful design of temperature-controlled terminators highlights the potential of RNA structures as versatile gene expression control elements.
Sequence to Structure (S2S): display, manipulate and interconnect RNA data from sequence to structure.

PubMed

Jossinet, Fabrice; Westhof, Eric

2005-08-01

Efficient RNA sequence manipulations (such as multiple alignments) need to be constrained by rules of RNA structure folding. The structural knowledge has increased dramatically in the last years with the accumulation of several large RNA structures similar to those of the bacterial ribosome subunits. However, no tool in the RNA community provides an easy way to link and integrate progress made at the sequence level using the available three-dimensional information. Sequence to Structure (S2S) proposes a framework in which an user can easily display, manipulate and interconnect heterogeneous RNA data, such as multiple sequence alignments, secondary and tertiary structures. S2S has been implemented using the Java language and has been developed and tested under UNIX systems, such as Linux and MacOSX. S2S is available at http://bioinformatics.org/S2S/.
Structure and assembly of the Ebola virus nucleocapsid

PubMed Central

Wan, William; Kolesnikova, Larissa; Clarke, Mairi; Koehler, Alexander; Noda, Takeshi; Becker, Stephan; Briggs, John A. G.

2017-01-01

Ebola and Marburg viruses are filoviruses: filamentous, enveloped viruses that cause hemorrhagic fever1. Filoviruses are within the order Mononegavirales2 which also includes rabies virus, measles virus, and respiratory syncytial virus. Mononegaviruses have non-segmented, single-stranded negative-sense RNA genomes that are encapsidated by nucleoprotein (NP) and other viral proteins to form a helical nucleocapsid (NC). NC acts as a scaffold for virus assembly and as a template for genome transcription and replication. Insights into NP-NP interactions have been derived from structural studies of oligomerized, RNA-encapsidating NP3–6 and cryo-electron microscopy (cryo-EM) of NC7–12 or NC-like structures11–13. There have been no high-resolution reconstructions of complete mononegavirus NCs. Here, we have applied cryo-electron tomography and subtomogram averaging to determine the structure of Ebola virus NC within intact viruses and recombinant NC-like assemblies. These structures reveal the identity and arrangement of the NC components, and suggest that the formation of an extended alpha-helix from the disordered C-terminal region of NP-core links NP oligomerization, NC condensation, RNA encapsidation, and accessory protein recruitment. PMID:29144446
New algorithms to represent complex pseudoknotted RNA structures in dot-bracket notation.

PubMed

Antczak, Maciej; Popenda, Mariusz; Zok, Tomasz; Zurkowski, Michal; Adamiak, Ryszard W; Szachniuk, Marta

2018-04-15

Understanding the formation, architecture and roles of pseudoknots in RNA structures are one of the most difficult challenges in RNA computational biology and structural bioinformatics. Methods predicting pseudoknots typically perform this with poor accuracy, often despite experimental data incorporation. Existing bioinformatic approaches differ in terms of pseudoknots' recognition and revealing their nature. A few ways of pseudoknot classification exist, most common ones refer to a genus or order. Following the latter one, we propose new algorithms that identify pseudoknots in RNA structure provided in BPSEQ format, determine their order and encode in dot-bracket-letter notation. The proposed encoding aims to illustrate the hierarchy of RNA folding. New algorithms are based on dynamic programming and hybrid (combining exhaustive search and random walk) approaches. They evolved from elementary algorithm implemented within the workflow of RNA FRABASE 1.0, our database of RNA structure fragments. They use different scoring functions to rank dissimilar dot-bracket representations of RNA structure. Computational experiments show an advantage of new methods over the others, especially for large RNA structures. Presented algorithms have been implemented as new functionality of RNApdbee webserver and are ready to use at http://rnapdbee.cs.put.poznan.pl. mszachniuk@cs.put.poznan.pl. Supplementary data are available at Bioinformatics online.

Crystal structure of RlmAI: Implications for understanding the 23S rRNA G745/G748-methylation at the macrolide antibiotic-binding site

PubMed Central

Das, Kalyan; Acton, Thomas; Chiang, Yiwen; Shih, Lydia; Arnold, Eddy; Montelione, Gaetano T.

2004-01-01

The RlmA class of enzymes (RlmAI and RlmAII) catalyzes N1-methylation of a guanine base (G745 in Gram-negative and G748 in Gram-positive bacteria) of hairpin 35 of 23S rRNA. We have determined the crystal structure of Escherichia coli RlmAI at 2.8-Å resolution, providing 3D structure information for the RlmA class of RNA methyltransferases. The dimeric protein structure exhibits features that provide new insights into its molecular function. Each RlmAI molecule has a Zn-binding domain, responsible for specific recognition and binding of its rRNA substrate, and a methyltransferase domain. The asymmetric RlmAI dimer observed in the crystal structure has a well defined W-shaped RNA-binding cleft. Two S-adenosyl-l-methionine substrate molecules are located at the two valleys of the W-shaped RNA-binding cleft. The unique shape of the RNA-binding cleft, different from that of known RNA-binding proteins, is highly specific and structurally complements the 3D structure of hairpin 35 of bacterial 23S rRNA. Apart from the hairpin 35, parts of hairpins 33 and 34 also interact with the RlmAI dimer. PMID:14999102
Gene silencing efficiency and INF-β induction effects of splicing miRNA 155-based artificial miRNA with pre-miRNA stem-loop structures.

PubMed

Sin, Onsam; Mabiala, Prudence; Liu, Ye; Sun, Ying; Hu, Tao; Liu, Qingzhen; Guo, Deyin

2012-02-01

Artificial microRNA (miRNA) expression vectors have been developed and used for RNA interference. The secondary structure of artificial miRNA is important for RNA interference efficacy. We designed two groups of six artificial splicing miRNA 155-based miRNAs (SM155-based miRNAs) with the same target in the coding region or 3' UTR of a target gene and studied their RNA silencing efficiency and interferon β (IFN-β) induction effects. SM155-based miRNA with a mismatch at the +1 position and a bulge at the +11, +12 positions in a miRNA precursor stem-loop structure showed the highest gene silencing efficiency and lowest IFN-β induction effect (increased IFN-β mRNA level by 10% in both target cases), regardless of the specificity of the target sequence, suggesting that pSM155-based miRNA with this design could be a valuable miRNA expression vector.
Characterizing RNA Dynamics at Atomic Resolution Using Solution-state NMR Spectroscopy

PubMed Central

Bothe, Jameson R.; Nikolova, Evgenia N.; Eichhorn, Catherine D.; Chugh, Jeetender; Hansen, Alexandar L.; Al-Hashimi, Hashim M.

2012-01-01

Many recently discovered non-coding RNAs do not fold into a single native conformation, but rather, sample many different conformations along their free energy landscape to carry out their biological function. Unprecedented insights into the RNA dynamic structure landscape are provided by solution-state NMR techniques that measure the structural, kinetic, and thermodynamic characteristics of motions spanning picosecond to second timescales at atomic resolution. From these studies a basic description of the RNA dynamic structure landscape is emerging, bringing new insights into how RNA structures change to carry out their function as well as applications in RNA-targeted drug discovery and RNA bioengineering. PMID:22036746
A new model for approximating RNA folding trajectories and population kinetics

NASA Astrophysics Data System (ADS)

Kirkpatrick, Bonnie; Hajiaghayi, Monir; Condon, Anne

2013-01-01

RNA participates both in functional aspects of the cell and in gene regulation. The interactions of these molecules are mediated by their secondary structure which can be viewed as a planar circle graph with arcs for all the chemical bonds between pairs of bases in the RNA sequence. The problem of predicting RNA secondary structure, specifically the chemically most probable structure, has many useful and efficient algorithms. This leaves RNA folding, the problem of predicting the dynamic behavior of RNA structure over time, as the main open problem. RNA folding is important for functional understanding because some RNA molecules change secondary structure in response to interactions with the environment. The full RNA folding model on at most O(3n) secondary structures is the gold standard. We present a new subset approximation model for the full model, give methods to analyze its accuracy and discuss the relative merits of our model as compared with a pre-existing subset approximation. The main advantage of our model is that it generates Monte Carlo folding pathways with the same probabilities with which they are generated under the full model. The pre-existing subset approximation does not have this property.
Optimal packaging of FIV genomic RNA depends upon a conserved long-range interaction and a palindromic sequence within gag.

PubMed

Rizvi, Tahir A; Kenyon, Julia C; Ali, Jahabar; Aktar, Suriya J; Phillip, Pretty S; Ghazawi, Akela; Mustafa, Farah; Lever, Andrew M L

2010-10-15

The feline immunodeficiency virus (FIV) is a lentivirus that is related to human immunodeficiency virus (HIV), causing a similar pathology in cats. It is a potential small animal model for AIDS and the FIV-based vectors are also being pursued for human gene therapy. Previous studies have mapped the FIV packaging signal (ψ) to two or more discontinuous regions within the 5' 511 nt of the genomic RNA and structural analyses have determined its secondary structure. The 5' and 3' sequences within ψ region interact through extensive long-range interactions (LRIs), including a conserved heptanucleotide interaction between R/U5 and gag. Other secondary structural elements identified include a conserved 150 nt stem-loop (SL2) and a small palindromic stem-loop within gag open reading frame that might act as a viral dimerization initiation site. We have performed extensive mutational analysis of these sequences and structures and ascertained their importance in FIV packaging using a trans-complementation assay. Disrupting the conserved heptanucleotide LRI to prevent base pairing between R/U5 and gag reduced packaging by 2.8-5.5 fold. Restoration of pairing using an alternative, non-wild type (wt) LRI sequence restored RNA packaging and propagation to wt levels, suggesting that it is the structure of the LRI, rather than its sequence, that is important for FIV packaging. Disrupting the palindrome within gag reduced packaging by 1.5-3-fold, but substitution with a different palindromic sequence did not restore packaging completely, suggesting that the sequence of this region as well as its palindromic nature is important. Mutation of individual regions of SL2 did not have a pronounced effect on FIV packaging, suggesting that either it is the structure of SL2 as a whole that is necessary for optimal packaging, or that there is redundancy within this structure. The mutational analysis presented here has further validated the previously predicted RNA secondary structure of FIV ψ. Copyright © 2010 Elsevier Ltd. All rights reserved.
Biochemical and Structural Studies of RNA Modification and Repair

ERIC Educational Resources Information Center

Chan, Chio Mui

2009-01-01

RNA modification, RNA interference, and RNA repair are important events in the cell. This thesis presents three projects related to these three fields. By using both biochemical and structural methods, we characterized enzymatic activities of pseudouridine synthase TruD, solved the structure of "A. aeolicus" GidA, and reconstituted a novel…
Structures of human ADAR2 bound to dsRNA reveal base-flipping mechanism and basis for site selectivity

DOE PAGES

Matthews, Melissa M.; Thomas, Justin M.; Zheng, Yuxuan; ...

2016-04-11

Adenosine deaminases acting on RNA (ADARs) are editing enzymes that convert adenosine to inosine in duplex RNA, a modification reaction with wide-ranging consequences in RNA function. Understanding of the ADAR reaction mechanism, the origin of editing-site selectivity, and the effect of mutations is limited by the lack of high-resolution structural data for complexes of ADARs bound to substrate RNAs. In this paper, we describe four crystal structures of the human ADAR2 deaminase domain bound to RNA duplexes bearing a mimic of the deamination reaction intermediate. These structures, together with structure-guided mutagenesis and RNA-modification experiments, explain the basis of the ADARmore » deaminase domain's dsRNA specificity, its base-flipping mechanism, and its nearest-neighbor preferences. In addition, we identified an ADAR2-specific RNA-binding loop near the enzyme active site, thus rationalizing differences in selectivity observed between different ADARs. In conclusion, our results provide a structural framework for understanding the effects of ADAR mutations associated with human disease.« less
The Crystal Structure of a Cardiovirus RNA-Dependent RNA Polymerase Reveals an Unusual Conformation of the Polymerase Active Site

PubMed Central

Vives-Adrian, Laia; Lujan, Celia; Oliva, Baldo; van der Linden, Lonneke; Selisko, Barbara; Coutard, Bruno; Canard, Bruno; van Kuppeveld, Frank J. M.

2014-01-01

ABSTRACT Encephalomyocarditis virus (EMCV) is a member of the Cardiovirus genus within the large Picornaviridae family, which includes a number of important human and animal pathogens. The RNA-dependent RNA polymerase (RdRp) 3Dpol is a key enzyme for viral genome replication. In this study, we report the X-ray structures of two different crystal forms of the EMCV RdRp determined at 2.8- and 2.15-Å resolution. The in vitro elongation and VPg uridylylation activities of the purified enzyme have also been demonstrated. Although the overall structure of EMCV 3Dpol is shown to be similar to that of the known RdRps of other members of the Picornaviridae family, structural comparisons show a large reorganization of the active-site cavity in one of the crystal forms. The rearrangement affects mainly motif A, where the conserved residue Asp240, involved in ribonucleoside triphosphate (rNTP) selection, and its neighbor residue, Phe239, move about 10 Å from their expected positions within the ribose binding pocket toward the entrance of the rNTP tunnel. This altered conformation of motif A is stabilized by a cation-π interaction established between the aromatic ring of Phe239 and the side chain of Lys56 within the finger domain. Other contacts, involving Phe239 and different residues of motif F, are also observed. The movement of motif A is connected with important conformational changes in the finger region flanked by residues 54 to 63, harboring Lys56, and in the polymerase N terminus. The structures determined in this work provide essential information for studies on the cardiovirus RNA replication process and may have important implications for the development of new antivirals targeting the altered conformation of motif A. IMPORTANCE The Picornaviridae family is one of the largest virus families known, including many important human and animal pathogens. The RNA-dependent RNA polymerase (RdRp) 3Dpol is a key enzyme for picornavirus genome replication and a validated target for the development of antiviral therapies. Solving the X-ray structure of the first cardiovirus RdRp, EMCV 3Dpol, we captured an altered conformation of a conserved motif in the polymerase active site (motif A) containing the aspartic acid residue involved in rNTP selection and binding. This altered conformation of motif A, which interferes with the correct positioning of the rNTP substrate in the active site, is stabilized by a number of residues strictly conserved among picornaviruses. The rearrangements observed suggest that this motif A segment is a dynamic element that can be modulated by external effectors, either activating or inhibiting enzyme activity, and this type of modulation appears to be general to all picornaviruses. PMID:24600002
The crystal structure of a cardiovirus RNA-dependent RNA polymerase reveals an unusual conformation of the polymerase active site.

PubMed

Vives-Adrian, Laia; Lujan, Celia; Oliva, Baldo; van der Linden, Lonneke; Selisko, Barbara; Coutard, Bruno; Canard, Bruno; van Kuppeveld, Frank J M; Ferrer-Orta, Cristina; Verdaguer, Núria

2014-05-01

Encephalomyocarditis virus (EMCV) is a member of the Cardiovirus genus within the large Picornaviridae family, which includes a number of important human and animal pathogens. The RNA-dependent RNA polymerase (RdRp) 3Dpol is a key enzyme for viral genome replication. In this study, we report the X-ray structures of two different crystal forms of the EMCV RdRp determined at 2.8- and 2.15-Å resolution. The in vitro elongation and VPg uridylylation activities of the purified enzyme have also been demonstrated. Although the overall structure of EMCV 3Dpol is shown to be similar to that of the known RdRps of other members of the Picornaviridae family, structural comparisons show a large reorganization of the active-site cavity in one of the crystal forms. The rearrangement affects mainly motif A, where the conserved residue Asp240, involved in ribonucleoside triphosphate (rNTP) selection, and its neighbor residue, Phe239, move about 10 Å from their expected positions within the ribose binding pocket toward the entrance of the rNTP tunnel. This altered conformation of motif A is stabilized by a cation-π interaction established between the aromatic ring of Phe239 and the side chain of Lys56 within the finger domain. Other contacts, involving Phe239 and different residues of motif F, are also observed. The movement of motif A is connected with important conformational changes in the finger region flanked by residues 54 to 63, harboring Lys56, and in the polymerase N terminus. The structures determined in this work provide essential information for studies on the cardiovirus RNA replication process and may have important implications for the development of new antivirals targeting the altered conformation of motif A. The Picornaviridae family is one of the largest virus families known, including many important human and animal pathogens. The RNA-dependent RNA polymerase (RdRp) 3Dpol is a key enzyme for picornavirus genome replication and a validated target for the development of antiviral therapies. Solving the X-ray structure of the first cardiovirus RdRp, EMCV 3Dpol, we captured an altered conformation of a conserved motif in the polymerase active site (motif A) containing the aspartic acid residue involved in rNTP selection and binding. This altered conformation of motif A, which interferes with the correct positioning of the rNTP substrate in the active site, is stabilized by a number of residues strictly conserved among picornaviruses. The rearrangements observed suggest that this motif A segment is a dynamic element that can be modulated by external effectors, either activating or inhibiting enzyme activity, and this type of modulation appears to be general to all picornaviruses.
Translational induction of heat shock transcription factor σ32: evidence for a built-in RNA thermosensor

PubMed Central

Morita, Miyo Terao; Tanaka, Yoshiyuki; Kodama, Takashi S.; Kyogoku, Yoshimasa; Yanagi, Hideki; Yura, Takashi

1999-01-01

Induction of heat shock proteins in Escherichia coli is primarily caused by increased cellular levels of the heat shock σ-factor σ32 encoded by the rpoH gene. Increased σ32 levels result from both enhanced synthesis and stabilization. Previous work indicated that σ32 synthesis is induced at the translational level and is mediated by the mRNA secondary structure formed within the 5′-coding sequence of rpoH, including the translation initiation region. To understand the mechanism of heat induction of σ32 synthesis further, we analyzed expression of rpoH–lacZ gene fusions with altered stability of mRNA structure before and after heat shock. A clear correlation was found between the stability and expression or the extent of heat induction. Temperature-melting profiles of mRNAs with or without mutations correlated well with the expression patterns of fusion genes carrying the corresponding mutations in vivo. Furthermore, temperature dependence of mRNA–30S ribosome–tRNAfMet complex formation with wild-type or mutant mRNAs in vitro agreed well with that of the expression of gene fusions in vivo. Our results support a novel mechanism in which partial melting of mRNA secondary structure at high temperature enhances ribosome entry and translational initiation without involvement of other cellular components, that is, intrinsic mRNA stability controls synthesis of a transcriptional regulator. PMID:10090722
SSMART: Sequence-structure motif identification for RNA-binding proteins.

PubMed

Munteanu, Alina; Mukherjee, Neelanjan; Ohler, Uwe

2018-06-11

RNA-binding proteins (RBPs) regulate every aspect of RNA metabolism and function. There are hundreds of RBPs encoded in the eukaryotic genomes, and each recognize its RNA targets through a specific mixture of RNA sequence and structure properties. For most RBPs, however, only a primary sequence motif has been determined, while the structure of the binding sites is uncharacterized. We developed SSMART, an RNA motif finder that simultaneously models the primary sequence and the structural properties of the RNA targets sites. The sequence-structure motifs are represented as consensus strings over a degenerate alphabet, extending the IUPAC codes for nucleotides to account for secondary structure preferences. Evaluation on synthetic data showed that SSMART is able to recover both sequence and structure motifs implanted into 3'UTR-like sequences, for various degrees of structured/unstructured binding sites. In addition, we successfully used SSMART on high-throughput in vivo and in vitro data, showing that we not only recover the known sequence motif, but also gain insight into the structural preferences of the RBP. Availability: SSMART is freely available at https://ohlerlab.mdc-berlin.de/software/SSMART_137/. Supplementary data are available at Bioinformatics online.
Insights into Structural and Mechanistic Features of Viral IRES Elements

PubMed Central

Martinez-Salas, Encarnacion; Francisco-Velilla, Rosario; Fernandez-Chamorro, Javier; Embarek, Azman M.

2018-01-01

Internal ribosome entry site (IRES) elements are cis-acting RNA regions that promote internal initiation of protein synthesis using cap-independent mechanisms. However, distinct types of IRES elements present in the genome of various RNA viruses perform the same function despite lacking conservation of sequence and secondary RNA structure. Likewise, IRES elements differ in host factor requirement to recruit the ribosomal subunits. In spite of this diversity, evolutionarily conserved motifs in each family of RNA viruses preserve sequences impacting on RNA structure and RNA–protein interactions important for IRES activity. Indeed, IRES elements adopting remarkable different structural organizations contain RNA structural motifs that play an essential role in recruiting ribosomes, initiation factors and/or RNA-binding proteins using different mechanisms. Therefore, given that a universal IRES motif remains elusive, it is critical to understand how diverse structural motifs deliver functions relevant for IRES activity. This will be useful for understanding the molecular mechanisms beyond cap-independent translation, as well as the evolutionary history of these regulatory elements. Moreover, it could improve the accuracy to predict IRES-like motifs hidden in genome sequences. This review summarizes recent advances on the diversity and biological relevance of RNA structural motifs for viral IRES elements. PMID:29354113
Ensemble-based prediction of RNA secondary structures.

PubMed

Aghaeepour, Nima; Hoos, Holger H

2013-04-24

Accurate structure prediction methods play an important role for the understanding of RNA function. Energy-based, pseudoknot-free secondary structure prediction is one of the most widely used and versatile approaches, and improved methods for this task have received much attention over the past five years. Despite the impressive progress that as been achieved in this area, existing evaluations of the prediction accuracy achieved by various algorithms do not provide a comprehensive, statistically sound assessment. Furthermore, while there is increasing evidence that no prediction algorithm consistently outperforms all others, no work has been done to exploit the complementary strengths of multiple approaches. In this work, we present two contributions to the area of RNA secondary structure prediction. Firstly, we use state-of-the-art, resampling-based statistical methods together with a previously published and increasingly widely used dataset of high-quality RNA structures to conduct a comprehensive evaluation of existing RNA secondary structure prediction procedures. The results from this evaluation clarify the performance relationship between ten well-known existing energy-based pseudoknot-free RNA secondary structure prediction methods and clearly demonstrate the progress that has been achieved in recent years. Secondly, we introduce AveRNA, a generic and powerful method for combining a set of existing secondary structure prediction procedures into an ensemble-based method that achieves significantly higher prediction accuracies than obtained from any of its component procedures. Our new, ensemble-based method, AveRNA, improves the state of the art for energy-based, pseudoknot-free RNA secondary structure prediction by exploiting the complementary strengths of multiple existing prediction procedures, as demonstrated using a state-of-the-art statistical resampling approach. In addition, AveRNA allows an intuitive and effective control of the trade-off between false negative and false positive base pair predictions. Finally, AveRNA can make use of arbitrary sets of secondary structure prediction procedures and can therefore be used to leverage improvements in prediction accuracy offered by algorithms and energy models developed in the future. Our data, MATLAB software and a web-based version of AveRNA are publicly available at http://www.cs.ubc.ca/labs/beta/Software/AveRNA.
Ribozyme-catalysed RNA synthesis using triplet building blocks.

PubMed

Attwater, James; Raguram, Aditya; Morgunov, Alexey S; Gianni, Edoardo; Holliger, Philipp

2018-05-15

RNA-catalyzed RNA replication is widely believed to have supported a primordial biology. However, RNA catalysis is dependent upon RNA folding, and this yields structures that can block replication of such RNAs. To address this apparent paradox we have re-examined the building blocks used for RNA replication. We report RNA-catalysed RNA synthesis on structured templates when using trinucleotide triphosphates (triplets) as substrates, catalysed by a general and accurate triplet polymerase ribozyme that emerged from in vitro evolution as a mutualistic RNA heterodimer. The triplets cooperatively invaded and unraveled even highly stable RNA secondary structures, and support non-canonical primer-free and bidirectional modes of RNA synthesis and replication. Triplet substrates thus resolve a central incongruity of RNA replication, and here allow the ribozyme to synthesise its own catalytic subunit '+' and '-' strands in segments and assemble them into a new active ribozyme. © 2018, Attwater et al.
5S ribosomal RNA database Y2K

PubMed Central

Szymanski, Maciej; Barciszewska, Miroslawa Z.; Barciszewski, Jan; Erdmann, Volker A.

2000-01-01

This paper presents the updated version (Y2K) of the database of ribosomal 5S ribonucleic acids (5S rRNA) and their genes (5S rDNA), http://rose.man/poznan. pl/5SData/index.html . This edition of the database contains 1985 primary structures of 5S rRNA and 5S rDNA. They include 60 archaebacterial, 470 eubacterial, 63 plastid, nine mitochondrial and 1383 eukaryotic sequences. The nucleotide sequences of the 5S rRNAs or 5S rDNAs are divided according to the taxonomic position of the source organisms. PMID:10592212
5S ribosomal RNA database Y2K.

PubMed

Szymanski, M; Barciszewska, M Z; Barciszewski, J; Erdmann, V A

2000-01-01

This paper presents the updated version (Y2K) of the database of ribosomal 5S ribonucleic acids (5S rRNA) and their genes (5S rDNA), http://rose.man/poznan.pl/5SData/index.html. This edition of the database contains 1985primary structures of 5S rRNA and 5S rDNA. They include 60 archaebacterial, 470 eubacterial, 63 plastid, nine mitochondrial and 1383 eukaryotic sequences. The nucleotide sequences of the 5S rRNAs or 5S rDNAs are divided according to the taxonomic position of the source organisms.
Functional RNA structures throughout the Hepatitis C Virus genome.

PubMed

Adams, Rebecca L; Pirakitikulr, Nathan; Pyle, Anna Marie

2017-06-01

The single-stranded Hepatitis C Virus (HCV) genome adopts a set of elaborate RNA structures that are involved in every stage of the viral lifecycle. Recent advances in chemical probing, sequencing, and structural biology have facilitated analysis of RNA folding on a genome-wide scale, revealing novel structures and networks of interactions. These studies have underscored the active role played by RNA in every function of HCV and they open the door to new types of RNA-targeted therapeutics. Copyright © 2017 Elsevier B.V. All rights reserved.
RNA and RNP as Building Blocks for Nanotechnology and Synthetic Biology.

PubMed

Ohno, Hirohisa; Saito, Hirohide

2016-01-01

Recent technologies that aimed to elucidate cellular function have revealed essential roles for RNA molecules in living systems. Our knowledge concerning functional and structural information of naturally occurring RNA and RNA-protein (RNP) complexes is increasing rapidly. RNA and RNP interaction motifs are structural units that function as building blocks to constitute variety of complex structures. RNA-central synthetic biology and nanotechnology are constructive approaches that employ the accumulated information and build synthetic RNA (RNP)-based circuits and nanostructures. Here, we describe how to design and construct synthetic RNA (RNP)-based devices and structures at the nanometer-scale for biological and future therapeutic applications. RNA/RNP nanostructures can also be utilized as the molecular scaffold to control the localization or interactions of target molecule(s). Moreover, RNA motifs recognized by RNA-binding proteins can be applied to make protein-responsive translational "switches" that can turn gene expression "on" or "off" depending on the intracellular environment. This "synthetic RNA and RNP world" will expand tools for nanotechnology and synthetic biology. In addition, these reconstructive approaches would lead to a greater understanding of building principle in naturally occurring RNA/RNP molecules and systems. Copyright © 2016 Elsevier Inc. All rights reserved.
Efficient trans-cleavage by the Schistosoma mansoni SMα1 hammerhead ribozyme in the extreme thermophile Thermus thermophilus

PubMed Central

Vazquez-Tello, Alejandro; Castán, Pablo; Moreno, Renata; Smith, James M.; Berenguer, José; Cedergren, Robert

2002-01-01

The catalytic hammerhead structure has been found in association with repetitive DNA from several animals, including salamanders, crickets and schistosomes, and functions to process in cis the long multimer transcripts into monomer RNA in vivo. The cellular role of these repetitive elements and their transcripts is unknown. Moreover, none of these natural hammerheads have been shown to trans-cleave a host mRNA in vivo. We analyzed the cis- and trans-cleavage properties of the hammerhead ribozyme associated with the SMα DNA family from the human parasite Schistosoma mansoni. The efficiency of trans-cleavage of a target RNA in vitro was affected mainly by both the temperature-dependent chemical step and the ribozyme–product dissociation step. The optimal temperature for trans-cleavage was 70°C. This result was confirmed when both the SMα1 ribozyme and the target RNA were expressed in the extreme thermophile Thermus thermophilus. Moreover, SMα1 RNA showed a remarkable thermostability, equal or superior to that of the most stable RNAs in this species, suggesting that SMα1 RNA has been selected for stability. Computer analysis predicts that the monomer and multimer transcripts fold into highly compact secondary structures, which may explain their exceptional stability in vivo. PMID:11917021
Multimodal RNA-seq using single-strand, double-strand, and CircLigase-based capture yields a refined and extended description of the C. elegans transcriptome.

PubMed

Lamm, Ayelet T; Stadler, Michael R; Zhang, Huibin; Gent, Jonathan I; Fire, Andrew Z

2011-02-01

We have used a combination of three high-throughput RNA capture and sequencing methods to refine and augment the transcriptome map of a well-studied genetic model, Caenorhabditis elegans. The three methods include a standard (non-directional) library preparation protocol relying on cDNA priming and foldback that has been used in several previous studies for transcriptome characterization in this species, and two directional protocols, one involving direct capture of single-stranded RNA fragments and one involving circular-template PCR (CircLigase). We find that each RNA-seq approach shows specific limitations and biases, with the application of multiple methods providing a more complete map than was obtained from any single method. Of particular note in the analysis were substantial advantages of CircLigase-based and ssRNA-based capture for defining sequences and structures of the precise 5' ends (which were lost using the double-strand cDNA capture method). Of the three methods, ssRNA capture was most effective in defining sequences to the poly(A) junction. Using data sets from a spectrum of C. elegans strains and stages and the UCSC Genome Browser, we provide a series of tools, which facilitate rapid visualization and assignment of gene structures.

In vivo genome-wide profiling of RNA secondary structure reveals novel regulatory features.

PubMed

Ding, Yiliang; Tang, Yin; Kwok, Chun Kit; Zhang, Yu; Bevilacqua, Philip C; Assmann, Sarah M

2014-01-30

RNA structure has critical roles in processes ranging from ligand sensing to the regulation of translation, polyadenylation and splicing. However, a lack of genome-wide in vivo RNA structural data has limited our understanding of how RNA structure regulates gene expression in living cells. Here we present a high-throughput, genome-wide in vivo RNA structure probing method, structure-seq, in which dimethyl sulphate methylation of unprotected adenines and cytosines is identified by next-generation sequencing. Application of this method to Arabidopsis thaliana seedlings yielded the first in vivo genome-wide RNA structure map at nucleotide resolution for any organism, with quantitative structural information across more than 10,000 transcripts. Our analysis reveals a three-nucleotide periodic repeat pattern in the structure of coding regions, as well as a less-structured region immediately upstream of the start codon, and shows that these features are strongly correlated with translation efficiency. We also find patterns of strong and weak secondary structure at sites of alternative polyadenylation, as well as strong secondary structure at 5' splice sites that correlates with unspliced events. Notably, in vivo structures of messenger RNAs annotated for stress responses are poorly predicted in silico, whereas mRNA structures of genes related to cell function maintenance are well predicted. Global comparison of several structural features between these two categories shows that the mRNAs associated with stress responses tend to have more single-strandedness, longer maximal loop length and higher free energy per nucleotide, features that may allow these RNAs to undergo conformational changes in response to environmental conditions. Structure-seq allows the RNA structurome and its biological roles to be interrogated on a genome-wide scale and should be applicable to any organism.
Temperature-responsive in vitro RNA structurome of Yersinia pseudotuberculosis.

PubMed

Righetti, Francesco; Nuss, Aaron M; Twittenhoff, Christian; Beele, Sascha; Urban, Kristina; Will, Sebastian; Bernhart, Stephan H; Stadler, Peter F; Dersch, Petra; Narberhaus, Franz

2016-06-28

RNA structures are fundamentally important for RNA function. Dynamic, condition-dependent structural changes are able to modulate gene expression as shown for riboswitches and RNA thermometers. By parallel analysis of RNA structures, we mapped the RNA structurome of Yersinia pseudotuberculosis at three different temperatures. This human pathogen is exquisitely responsive to host body temperature (37 °C), which induces a major metabolic transition. Our analysis profiles the structure of more than 1,750 RNAs at 25 °C, 37 °C, and 42 °C. Average mRNAs tend to be unstructured around the ribosome binding site. We searched for 5'-UTRs that are folded at low temperature and identified novel thermoresponsive RNA structures from diverse gene categories. The regulatory potential of 16 candidates was validated. In summary, we present a dynamic bacterial RNA structurome and find that the expression of virulence-relevant functions in Y. pseudotuberculosis and reprogramming of its metabolism in response to temperature is associated with a restructuring of numerous mRNAs.
The mode of inhibitor binding to peptidyl-tRNA hydrolase: binding studies and structure determination of unbound and bound peptidyl-tRNA hydrolase from Acinetobacter baumannii.

PubMed

Kaushik, Sanket; Singh, Nagendra; Yamini, Shavait; Singh, Avinash; Sinha, Mau; Arora, Ashish; Kaur, Punit; Sharma, Sujata; Singh, Tej P

2013-01-01

The incidences of infections caused by an aerobic Gram-negative bacterium, Acinetobacter baumannii are very common in hospital environments. It usually causes soft tissue infections including urinary tract infections and pneumonia. It is difficult to treat due to acquired resistance to available antibiotics is well known. In order to design specific inhibitors against one of the important enzymes, peptidyl-tRNA hydrolase from Acinetobacter baumannii, we have determined its three-dimensional structure. Peptidyl-tRNA hydrolase (AbPth) is involved in recycling of peptidyl-tRNAs which are produced in the cell as a result of premature termination of translation process. We have also determined the structures of two complexes of AbPth with cytidine and uridine. AbPth was cloned, expressed and crystallized in unbound and in two bound states with cytidine and uridine. The binding studies carried out using fluorescence spectroscopic and surface plasmon resonance techniques revealed that both cytidine and uridine bound to AbPth at nanomolar concentrations. The structure determinations of the complexes revealed that both ligands were located in the active site cleft of AbPth. The introduction of ligands to AbPth caused a significant widening of the entrance gate to the active site region and in the process of binding, it expelled several water molecules from the active site. As a result of interactions with protein atoms, the ligands caused conformational changes in several residues to attain the induced tight fittings. Such a binding capability of this protein makes it a versatile molecule for hydrolysis of peptidyl-tRNAs having variable peptide sequences. These are the first studies that revealed the mode of inhibitor binding in Peptidyl-tRNA hydrolases which will facilitate the structure based ligand design.
Exact calculation of loop formation probability identifies folding motifs in RNA secondary structures.

PubMed

Sloma, Michael F; Mathews, David H

2016-12-01

RNA secondary structure prediction is widely used to analyze RNA sequences. In an RNA partition function calculation, free energy nearest neighbor parameters are used in a dynamic programming algorithm to estimate statistical properties of the secondary structure ensemble. Previously, partition functions have largely been used to estimate the probability that a given pair of nucleotides form a base pair, the conditional stacking probability, the accessibility to binding of a continuous stretch of nucleotides, or a representative sample of RNA structures. Here it is demonstrated that an RNA partition function can also be used to calculate the exact probability of formation of hairpin loops, internal loops, bulge loops, or multibranch loops at a given position. This calculation can also be used to estimate the probability of formation of specific helices. Benchmarking on a set of RNA sequences with known secondary structures indicated that loops that were calculated to be more probable were more likely to be present in the known structure than less probable loops. Furthermore, highly probable loops are more likely to be in the known structure than the set of loops predicted in the lowest free energy structures. © 2016 Sloma and Mathews; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
RNA secondary structure prediction using soft computing.

PubMed

Ray, Shubhra Sankar; Pal, Sankar K

2013-01-01

Prediction of RNA structure is invaluable in creating new drugs and understanding genetic diseases. Several deterministic algorithms and soft computing-based techniques have been developed for more than a decade to determine the structure from a known RNA sequence. Soft computing gained importance with the need to get approximate solutions for RNA sequences by considering the issues related with kinetic effects, cotranscriptional folding, and estimation of certain energy parameters. A brief description of some of the soft computing-based techniques, developed for RNA secondary structure prediction, is presented along with their relevance. The basic concepts of RNA and its different structural elements like helix, bulge, hairpin loop, internal loop, and multiloop are described. These are followed by different methodologies, employing genetic algorithms, artificial neural networks, and fuzzy logic. The role of various metaheuristics, like simulated annealing, particle swarm optimization, ant colony optimization, and tabu search is also discussed. A relative comparison among different techniques, in predicting 12 known RNA secondary structures, is presented, as an example. Future challenging issues are then mentioned.
R2R - software to speed the depiction of aesthetic consensus RNA secondary structures

PubMed Central

2011-01-01

Background With continuing identification of novel structured noncoding RNAs, there is an increasing need to create schematic diagrams showing the consensus features of these molecules. RNA structural diagrams are typically made either with general-purpose drawing programs like Adobe Illustrator, or with automated or interactive programs specific to RNA. Unfortunately, the use of applications like Illustrator is extremely time consuming, while existing RNA-specific programs produce figures that are useful, but usually not of the same aesthetic quality as those produced at great cost in Illustrator. Additionally, most existing RNA-specific applications are designed for drawing single RNA molecules, not consensus diagrams. Results We created R2R, a computer program that facilitates the generation of aesthetic and readable drawings of RNA consensus diagrams in a fraction of the time required with general-purpose drawing programs. Since the inference of a consensus RNA structure typically requires a multiple-sequence alignment, the R2R user annotates the alignment with commands directing the layout and annotation of the RNA. R2R creates SVG or PDF output that can be imported into Adobe Illustrator, Inkscape or CorelDRAW. R2R can be used to create consensus sequence and secondary structure models for novel RNA structures or to revise models when new representatives for known RNA classes become available. Although R2R does not currently have a graphical user interface, it has proven useful in our efforts to create 100 schematic models of distinct noncoding RNA classes. Conclusions R2R makes it possible to obtain high-quality drawings of the consensus sequence and structural models of many diverse RNA structures with a more practical amount of effort. R2R software is available at http://breaker.research.yale.edu/R2R and as an Additional file. PMID:21205310
GARN: Sampling RNA 3D Structure Space with Game Theory and Knowledge-Based Scoring Strategies.

PubMed

Boudard, Mélanie; Bernauer, Julie; Barth, Dominique; Cohen, Johanne; Denise, Alain

2015-01-01

Cellular processes involve large numbers of RNA molecules. The functions of these RNA molecules and their binding to molecular machines are highly dependent on their 3D structures. One of the key challenges in RNA structure prediction and modeling is predicting the spatial arrangement of the various structural elements of RNA. As RNA folding is generally hierarchical, methods involving coarse-grained models hold great promise for this purpose. We present here a novel coarse-grained method for sampling, based on game theory and knowledge-based potentials. This strategy, GARN (Game Algorithm for RNa sampling), is often much faster than previously described techniques and generates large sets of solutions closely resembling the native structure. GARN is thus a suitable starting point for the molecular modeling of large RNAs, particularly those with experimental constraints. GARN is available from: http://garn.lri.fr/.
RNA Structural Analysis by Evolving SHAPE Chemistry

PubMed Central

Spitale, Robert C.; Flynn, Ryan A.; Torre, Eduardo A.; Kool, Eric T.; Chang, Howard Y.

2017-01-01

RNA is central to the flow of biological information. From transcription to splicing, RNA localization, translation, and decay, RNA is intimately involved in regulating every step of the gene expression program, and is thus essential for health and understanding disease. RNA has the unique ability to base-pair with itself and other nucleic acids to form complex structures. Hence the information content in RNA is not simply its linear sequence of bases, but is also encoded in complex folding of RNA molecules. A general chemical functionality that all RNAs have is a 2’-hydroxyl group in the ribose ring, and the reactivity of the 2'-hydroxyl in RNA is gated by local nucleotide flexibility. In other words, the 2'-hydroxyl is reactive at single-stranded and conformationally flexible positions but is unreactive at nucleotides constrained by base pairing. Recent efforts have been focused on developing reagents that modify RNA as a function of RNA 2’ hydroxyl group flexibility. Such RNA structure probing techniques can be read out by primer extension in experiments termed RNA SHAPE (Selective 2’ Hydroxyl Acylation and Primer Extension). Herein we describe the efforts devoted to the design and utilization of SHAPE probes for characterizing RNA structure. We also describe current technological advances that are being used to utilize SHAPE chemistry with deep sequencing to probe many RNAs in parallel. The merger of chemistry with genomics is sure to open the door to genome-wide exploration of RNA structure and function. PMID:25132067
The highly efficient T7 RNA polymerase: A wonder macromolecule in biological realm.

PubMed

Borkotoky, Subhomoi; Murali, Ayaluru

2018-05-27

The study of bacteriophage has always been of keen interest for biologists to understand the fundamentals of biology. Bacteriophage T7 was first isolated in 1945 and its first comprehensive genetic map of was published in 1969. Since then, it gained immense attention of researchers and became a prime model system for experimental biologists. The major gene product of T7 phage, T7 RNA polymerase (T7RNAP), continues to attract researchers since a long time due to its high and specific processivity with a single subunit structure and its capability of transcribing a complete gene without additional proteins. Since the first review article in 1993 there has been around nine reviews on this polymerase till year 2009, most of which focussed on particular aspects of T7RNAP such as structure and function. However, this review encapsulates a broad view on T7RNAP, one of the simplest macromolecule catalyzing RNA synthesis including recent updates on its applications, structure, activators and inhibitors. Thus this brief review bridges the huge gap on the recent updates on this polymerase and will help the biologists in their endeavours that include the use of T7RNAP. Copyright © 2017. Published by Elsevier B.V.
CRISPR-Cas9 Structures and Mechanisms.

PubMed

Jiang, Fuguo; Doudna, Jennifer A

2017-05-22

Many bacterial clustered regularly interspaced short palindromic repeats (CRISPR)-CRISPR-associated (Cas) systems employ the dual RNA-guided DNA endonuclease Cas9 to defend against invading phages and conjugative plasmids by introducing site-specific double-stranded breaks in target DNA. Target recognition strictly requires the presence of a short protospacer adjacent motif (PAM) flanking the target site, and subsequent R-loop formation and strand scission are driven by complementary base pairing between the guide RNA and target DNA, Cas9-DNA interactions, and associated conformational changes. The use of CRISPR-Cas9 as an RNA-programmable DNA targeting and editing platform is simplified by a synthetic single-guide RNA (sgRNA) mimicking the natural dual trans-activating CRISPR RNA (tracrRNA)-CRISPR RNA (crRNA) structure. This review aims to provide an in-depth mechanistic and structural understanding of Cas9-mediated RNA-guided DNA targeting and cleavage. Molecular insights from biochemical and structural studies provide a framework for rational engineering aimed at altering catalytic function, guide RNA specificity, and PAM requirements and reducing off-target activity for the development of Cas9-based therapies against genetic diseases.
Reducing the worst case running times of a family of RNA and CFG problems, using Valiant's approach.

PubMed

Zakov, Shay; Tsur, Dekel; Ziv-Ukelson, Michal

2011-08-18

RNA secondary structure prediction is a mainstream bioinformatic domain, and is key to computational analysis of functional RNA. In more than 30 years, much research has been devoted to defining different variants of RNA structure prediction problems, and to developing techniques for improving prediction quality. Nevertheless, most of the algorithms in this field follow a similar dynamic programming approach as that presented by Nussinov and Jacobson in the late 70's, which typically yields cubic worst case running time algorithms. Recently, some algorithmic approaches were applied to improve the complexity of these algorithms, motivated by new discoveries in the RNA domain and by the need to efficiently analyze the increasing amount of accumulated genome-wide data. We study Valiant's classical algorithm for Context Free Grammar recognition in sub-cubic time, and extract features that are common to problems on which Valiant's approach can be applied. Based on this, we describe several problem templates, and formulate generic algorithms that use Valiant's technique and can be applied to all problems which abide by these templates, including many problems within the world of RNA Secondary Structures and Context Free Grammars. The algorithms presented in this paper improve the theoretical asymptotic worst case running time bounds for a large family of important problems. It is also possible that the suggested techniques could be applied to yield a practical speedup for these problems. For some of the problems (such as computing the RNA partition function and base-pair binding probabilities), the presented techniques are the only ones which are currently known for reducing the asymptotic running time bounds of the standard algorithms.
Reducing the worst case running times of a family of RNA and CFG problems, using Valiant's approach

PubMed Central

2011-01-01

Background RNA secondary structure prediction is a mainstream bioinformatic domain, and is key to computational analysis of functional RNA. In more than 30 years, much research has been devoted to defining different variants of RNA structure prediction problems, and to developing techniques for improving prediction quality. Nevertheless, most of the algorithms in this field follow a similar dynamic programming approach as that presented by Nussinov and Jacobson in the late 70's, which typically yields cubic worst case running time algorithms. Recently, some algorithmic approaches were applied to improve the complexity of these algorithms, motivated by new discoveries in the RNA domain and by the need to efficiently analyze the increasing amount of accumulated genome-wide data. Results We study Valiant's classical algorithm for Context Free Grammar recognition in sub-cubic time, and extract features that are common to problems on which Valiant's approach can be applied. Based on this, we describe several problem templates, and formulate generic algorithms that use Valiant's technique and can be applied to all problems which abide by these templates, including many problems within the world of RNA Secondary Structures and Context Free Grammars. Conclusions The algorithms presented in this paper improve the theoretical asymptotic worst case running time bounds for a large family of important problems. It is also possible that the suggested techniques could be applied to yield a practical speedup for these problems. For some of the problems (such as computing the RNA partition function and base-pair binding probabilities), the presented techniques are the only ones which are currently known for reducing the asymptotic running time bounds of the standard algorithms. PMID:21851589
Integrative structure and functional anatomy of a nuclear pore complex

NASA Astrophysics Data System (ADS)

Kim, Seung Joong; Fernandez-Martinez, Javier; Nudelman, Ilona; Shi, Yi; Zhang, Wenzhu; Raveh, Barak; Herricks, Thurston; Slaughter, Brian D.; Hogan, Joanna A.; Upla, Paula; Chemmama, Ilan E.; Pellarin, Riccardo; Echeverria, Ignacia; Shivaraju, Manjunatha; Chaudhury, Azraa S.; Wang, Junjie; Williams, Rosemary; Unruh, Jay R.; Greenberg, Charles H.; Jacobs, Erica Y.; Yu, Zhiheng; de La Cruz, M. Jason; Mironska, Roxana; Stokes, David L.; Aitchison, John D.; Jarrold, Martin F.; Gerton, Jennifer L.; Ludtke, Steven J.; Akey, Christopher W.; Chait, Brian T.; Sali, Andrej; Rout, Michael P.

2018-03-01

Nuclear pore complexes play central roles as gatekeepers of RNA and protein transport between the cytoplasm and nucleoplasm. However, their large size and dynamic nature have impeded a full structural and functional elucidation. Here we determined the structure of the entire 552-protein nuclear pore complex of the yeast Saccharomyces cerevisiae at sub-nanometre precision by satisfying a wide range of data relating to the molecular arrangement of its constituents. The nuclear pore complex incorporates sturdy diagonal columns and connector cables attached to these columns, imbuing the structure with strength and flexibility. These cables also tie together all other elements of the nuclear pore complex, including membrane-interacting regions, outer rings and RNA-processing platforms. Inwardly directed anchors create a high density of transport factor-docking Phe-Gly repeats in the central channel, organized into distinct functional units. This integrative structure enables us to rationalize the architecture, transport mechanism and evolutionary origins of the nuclear pore complex.
Integrative structure and functional anatomy of a nuclear pore complex.

PubMed

Kim, Seung Joong; Fernandez-Martinez, Javier; Nudelman, Ilona; Shi, Yi; Zhang, Wenzhu; Raveh, Barak; Herricks, Thurston; Slaughter, Brian D; Hogan, Joanna A; Upla, Paula; Chemmama, Ilan E; Pellarin, Riccardo; Echeverria, Ignacia; Shivaraju, Manjunatha; Chaudhury, Azraa S; Wang, Junjie; Williams, Rosemary; Unruh, Jay R; Greenberg, Charles H; Jacobs, Erica Y; Yu, Zhiheng; de la Cruz, M Jason; Mironska, Roxana; Stokes, David L; Aitchison, John D; Jarrold, Martin F; Gerton, Jennifer L; Ludtke, Steven J; Akey, Christopher W; Chait, Brian T; Sali, Andrej; Rout, Michael P

2018-03-22

Nuclear pore complexes play central roles as gatekeepers of RNA and protein transport between the cytoplasm and nucleoplasm. However, their large size and dynamic nature have impeded a full structural and functional elucidation. Here we determined the structure of the entire 552-protein nuclear pore complex of the yeast Saccharomyces cerevisiae at sub-nanometre precision by satisfying a wide range of data relating to the molecular arrangement of its constituents. The nuclear pore complex incorporates sturdy diagonal columns and connector cables attached to these columns, imbuing the structure with strength and flexibility. These cables also tie together all other elements of the nuclear pore complex, including membrane-interacting regions, outer rings and RNA-processing platforms. Inwardly directed anchors create a high density of transport factor-docking Phe-Gly repeats in the central channel, organized into distinct functional units. This integrative structure enables us to rationalize the architecture, transport mechanism and evolutionary origins of the nuclear pore complex.
DOMMINO 2.0: integrating structurally resolved protein-, RNA-, and DNA-mediated macromolecular interactions

PubMed Central

Kuang, Xingyan; Dhroso, Andi; Han, Jing Ginger; Shyu, Chi-Ren; Korkin, Dmitry

2016-01-01

Macromolecular interactions are formed between proteins, DNA and RNA molecules. Being a principle building block in macromolecular assemblies and pathways, the interactions underlie most of cellular functions. Malfunctioning of macromolecular interactions is also linked to a number of diseases. Structural knowledge of the macromolecular interaction allows one to understand the interaction’s mechanism, determine its functional implications and characterize the effects of genetic variations, such as single nucleotide polymorphisms, on the interaction. Unfortunately, until now the interactions mediated by different types of macromolecules, e.g. protein–protein interactions or protein–DNA interactions, are collected into individual and unrelated structural databases. This presents a significant obstacle in the analysis of macromolecular interactions. For instance, the homogeneous structural interaction databases prevent scientists from studying structural interactions of different types but occurring in the same macromolecular complex. Here, we introduce DOMMINO 2.0, a structural Database Of Macro-Molecular INteractiOns. Compared to DOMMINO 1.0, a comprehensive database on protein-protein interactions, DOMMINO 2.0 includes the interactions between all three basic types of macromolecules extracted from PDB files. DOMMINO 2.0 is automatically updated on a weekly basis. It currently includes ∼1 040 000 interactions between two polypeptide subunits (e.g. domains, peptides, termini and interdomain linkers), ∼43 000 RNA-mediated interactions, and ∼12 000 DNA-mediated interactions. All protein structures in the database are annotated using SCOP and SUPERFAMILY family annotation. As a result, protein-mediated interactions involving protein domains, interdomain linkers, C- and N- termini, and peptides are identified. Our database provides an intuitive web interface, allowing one to investigate interactions at three different resolution levels: whole subunit network, binary interaction and interaction interface. Database URL: http://dommino.org PMID:26827237
Structural Dynamics of the GW182 Silencing Domain Including its RNA Recognition motif (RRM) Revealed by Hydrogen-Deuterium Exchange Mass Spectrometry

NASA Astrophysics Data System (ADS)

Cieplak-Rotowska, Maja K.; Tarnowski, Krzysztof; Rubin, Marcin; Fabian, Marc R.; Sonenberg, Nahum; Dadlez, Michal; Niedzwiecka, Anna

2018-01-01

The human GW182 protein plays an essential role in micro(mi)RNA-dependent gene silencing. miRNA silencing is mediated, in part, by a GW182 C-terminal region called the silencing domain, which interacts with the poly(A) binding protein and the CCR4-NOT deadenylase complex to repress protein synthesis. Structural studies of this GW182 fragment are challenging due to its predicted intrinsically disordered character, except for its RRM domain. However, detailed insights into the properties of proteins containing disordered regions can be provided by hydrogen-deuterium exchange mass spectrometry (HDX/MS). In this work, we applied HDX/MS to define the structural state of the GW182 silencing domain. HDX/MS analysis revealed that this domain is clearly divided into a natively unstructured part, including the CCR4-NOT interacting motif 1, and a distinct RRM domain. The GW182 RRM has a very dynamic structure, since water molecules can penetrate the whole domain in 2 h. The finding of this high structural dynamics sheds new light on the RRM structure. Though this domain is one of the most frequently occurring canonical protein domains in eukaryotes, these results are - to our knowledge - the first HDX/MS characteristics of an RRM. The HDX/MS studies show also that the α2 helix of the RRM can display EX1 behavior after a freezing-thawing cycle. This means that the RRM structure is sensitive to environmental conditions and can change its conformation, which suggests that the state of the RRM containing proteins should be checked by HDX/MS in regard of the conformational uniformity. [Figure not available: see fulltext.
The DEAD-box RNA helicase Ddx39ab is essential for myocyte and lens development in zebrafish.

PubMed

Zhang, Linlin; Yang, Yuxi; Li, Beibei; Scott, Ian C; Lou, Xin

2018-04-23

RNA helicases from the DEAD-box family are found in almost all organisms and have important roles in RNA metabolism, including RNA synthesis, processing and degradation. The function and mechanism of action of most of these helicases in animal development and human disease remain largely unexplored. In a zebrafish mutagenesis screen to identify genes essential for heart development we identified a mutant that disrupts the gene encoding the RNA helicase DEAD-box 39ab ( ddx39ab ). Homozygous ddx39ab mutant embryos exhibit profound cardiac and trunk muscle dystrophy, along with lens abnormalities, caused by abrupt terminal differentiation of cardiomyocyte, myoblast and lens fiber cells. Loss of ddx39ab hindered splicing of mRNAs encoding epigenetic regulatory factors, including members of the KMT2 gene family, leading to misregulation of structural gene expression in cardiomyocyte, myoblast and lens fiber cells. Taken together, these results show that Ddx39ab plays an essential role in establishment of the proper epigenetic status during differentiation of multiple cell lineages. © 2018. Published by The Company of Biologists Ltd.
Mining for recurrent long-range interactions in RNA structures reveals embedded hierarchies in network families.

PubMed

Reinharz, Vladimir; Soulé, Antoine; Westhof, Eric; Waldispühl, Jérôme; Denise, Alain

2018-05-04

The wealth of the combinatorics of nucleotide base pairs enables RNA molecules to assemble into sophisticated interaction networks, which are used to create complex 3D substructures. These interaction networks are essential to shape the 3D architecture of the molecule, and also to provide the key elements to carry molecular functions such as protein or ligand binding. They are made of organised sets of long-range tertiary interactions which connect distinct secondary structure elements in 3D structures. Here, we present a de novo data-driven approach to extract automatically from large data sets of full RNA 3D structures the recurrent interaction networks (RINs). Our methodology enables us for the first time to detect the interaction networks connecting distinct components of the RNA structure, highlighting their diversity and conservation through non-related functional RNAs. We use a graphical model to perform pairwise comparisons of all RNA structures available and to extract RINs and modules. Our analysis yields a complete catalog of RNA 3D structures available in the Protein Data Bank and reveals the intricate hierarchical organization of the RNA interaction networks and modules. We assembled our results in an online database (http://carnaval.lri.fr) which will be regularly updated. Within the site, a tool allows users with a novel RNA structure to detect automatically whether the novel structure contains previously observed RINs.
ssHMM: extracting intuitive sequence-structure motifs from high-throughput RNA-binding protein data

PubMed Central

Krestel, Ralf; Ohler, Uwe; Vingron, Martin; Marsico, Annalisa

2017-01-01

Abstract RNA-binding proteins (RBPs) play an important role in RNA post-transcriptional regulation and recognize target RNAs via sequence-structure motifs. The extent to which RNA structure influences protein binding in the presence or absence of a sequence motif is still poorly understood. Existing RNA motif finders either take the structure of the RNA only partially into account, or employ models which are not directly interpretable as sequence-structure motifs. We developed ssHMM, an RNA motif finder based on a hidden Markov model (HMM) and Gibbs sampling which fully captures the relationship between RNA sequence and secondary structure preference of a given RBP. Compared to previous methods which output separate logos for sequence and structure, it directly produces a combined sequence-structure motif when trained on a large set of sequences. ssHMM’s model is visualized intuitively as a graph and facilitates biological interpretation. ssHMM can be used to find novel bona fide sequence-structure motifs of uncharacterized RBPs, such as the one presented here for the YY1 protein. ssHMM reaches a high motif recovery rate on synthetic data, it recovers known RBP motifs from CLIP-Seq data, and scales linearly on the input size, being considerably faster than MEMERIS and RNAcontext on large datasets while being on par with GraphProt. It is freely available on Github and as a Docker image. PMID:28977546
Parallel computation of genome-scale RNA secondary structure to detect structural constraints on human genome.

PubMed

Kawaguchi, Risa; Kiryu, Hisanori

2016-05-06

RNA secondary structure around splice sites is known to assist normal splicing by promoting spliceosome recognition. However, analyzing the structural properties of entire intronic regions or pre-mRNA sequences has been difficult hitherto, owing to serious experimental and computational limitations, such as low read coverage and numerical problems. Our novel software, "ParasoR", is designed to run on a computer cluster and enables the exact computation of various structural features of long RNA sequences under the constraint of maximal base-pairing distance. ParasoR divides dynamic programming (DP) matrices into smaller pieces, such that each piece can be computed by a separate computer node without losing the connectivity information between the pieces. ParasoR directly computes the ratios of DP variables to avoid the reduction of numerical precision caused by the cancellation of a large number of Boltzmann factors. The structural preferences of mRNAs computed by ParasoR shows a high concordance with those determined by high-throughput sequencing analyses. Using ParasoR, we investigated the global structural preferences of transcribed regions in the human genome. A genome-wide folding simulation indicated that transcribed regions are significantly more structural than intergenic regions after removing repeat sequences and k-mer frequency bias. In particular, we observed a highly significant preference for base pairing over entire intronic regions as compared to their antisense sequences, as well as to intergenic regions. A comparison between pre-mRNAs and mRNAs showed that coding regions become more accessible after splicing, indicating constraints for translational efficiency. Such changes are correlated with gene expression levels, as well as GC content, and are enriched among genes associated with cytoskeleton and kinase functions. We have shown that ParasoR is very useful for analyzing the structural properties of long RNA sequences such as mRNAs, pre-mRNAs, and long non-coding RNAs whose lengths can be more than a million bases in the human genome. In our analyses, transcribed regions including introns are indicated to be subject to various types of structural constraints that cannot be explained from simple sequence composition biases. ParasoR is freely available at https://github.com/carushi/ParasoR .

Some links on this page may take you to non-federal websites. Their policies may differ from this site.