Sample records for extended rna code

  1. On the Evolution of the Standard Genetic Code: Vestiges of Critical Scale Invariance from the RNA World in Current Prokaryote Genomes

    PubMed Central

    José, Marco V.; Govezensky, Tzipe; García, José A.; Bobadilla, Juan R.

    2009-01-01

    Herein two genetic codes from which the primeval RNA code could have originated the standard genetic code (SGC) are derived. One of them, called extended RNA code type I, consists of all codons of the type RNY (purine-any base-pyrimidine) plus codons obtained by considering the RNA code but in the second (NYR type) and third (YRN type) reading frames. The extended RNA code type II, comprises all codons of the type RNY plus codons that arise from transversions of the RNA code in the first (YNY type) and third (RNR) nucleotide bases. In order to test if putative nucleotide sequences in the RNA World and in both extended RNA codes, share the same scaling and statistical properties to those encountered in current prokaryotes, we used the genomes of four Eubacteria and three Archaeas. For each prokaryote, we obtained their respective genomes obeying the RNA code or the extended RNA codes types I and II. In each case, we estimated the scaling properties of triplet sequences via a renormalization group approach, and we calculated the frequency distributions of distances for each codon. Remarkably, the scaling properties of the distance series of some codons from the RNA code and most codons from both extended RNA codes turned out to be identical or very close to the scaling properties of codons of the SGC. To test for the robustness of these results, we show, via computer simulation experiments, that random mutations of current genomes, at the rates of 10−10 per site per year during three billions of years, were not enough for destroying the observed patterns. Therefore, we conclude that most current prokaryotes may still contain relics of the primeval RNA World and that both extended RNA codes may well represent two plausible evolutionary paths between the RNA code and the current SGC. PMID:19183813

  2. Genetic hotels for the standard genetic code: evolutionary analysis based upon novel three-dimensional algebraic models.

    PubMed

    José, Marco V; Morgado, Eberto R; Govezensky, Tzipe

    2011-07-01

    Herein, we rigorously develop novel 3-dimensional algebraic models called Genetic Hotels of the Standard Genetic Code (SGC). We start by considering the primeval RNA genetic code which consists of the 16 codons of type RNY (purine-any base-pyrimidine). Using simple algebraic operations, we show how the RNA code could have evolved toward the current SGC via two different intermediate evolutionary stages called Extended RNA code type I and II. By rotations or translations of the subset RNY, we arrive at the SGC via the former (type I) or via the latter (type II), respectively. Biologically, the Extended RNA code type I, consists of all codons of the type RNY plus codons obtained by considering the RNA code but in the second (NYR type) and third (YRN type) reading frames. The Extended RNA code type II, comprises all codons of the type RNY plus codons that arise from transversions of the RNA code in the first (YNY type) and third (RNR) nucleotide bases. Since the dimensions of remarkable subsets of the Genetic Hotels are not necessarily integer numbers, we also introduce the concept of algebraic fractal dimension. A general decoding function which maps each codon to its corresponding amino acid or the stop signals is also derived. The Phenotypic Hotel of amino acids is also illustrated. The proposed evolutionary paths are discussed in terms of the existing theories of the evolution of the SGC. The adoption of 3-dimensional models of the Genetic and Phenotypic Hotels will facilitate the understanding of the biological properties of the SGC.

  3. A human haploid gene trap collection to study lncRNAs with unusual RNA biology.

    PubMed

    Kornienko, Aleksandra E; Vlatkovic, Irena; Neesen, Jürgen; Barlow, Denise P; Pauler, Florian M

    2016-01-01

    Many thousand long non-coding (lnc) RNAs are mapped in the human genome. Time consuming studies using reverse genetic approaches by post-transcriptional knock-down or genetic modification of the locus demonstrated diverse biological functions for a few of these transcripts. The Human Gene Trap Mutant Collection in haploid KBM7 cells is a ready-to-use tool for studying protein-coding gene function. As lncRNAs show remarkable differences in RNA biology compared to protein-coding genes, it is unclear if this gene trap collection is useful for functional analysis of lncRNAs. Here we use the uncharacterized LOC100288798 lncRNA as a model to answer this question. Using public RNA-seq data we show that LOC100288798 is ubiquitously expressed, but inefficiently spliced. The minor spliced LOC100288798 isoforms are exported to the cytoplasm, whereas the major unspliced isoform is nuclear localized. This shows that LOC100288798 RNA biology differs markedly from typical mRNAs. De novo assembly from RNA-seq data suggests that LOC100288798 extends 289kb beyond its annotated 3' end and overlaps the downstream SLC38A4 gene. Three cell lines with independent gene trap insertions in LOC100288798 were available from the KBM7 gene trap collection. RT-qPCR and RNA-seq confirmed successful lncRNA truncation and its extended length. Expression analysis from RNA-seq data shows significant deregulation of 41 protein-coding genes upon LOC100288798 truncation. Our data shows that gene trap collections in human haploid cell lines are useful tools to study lncRNAs, and identifies the previously uncharacterized LOC100288798 as a potential gene regulator.

  4. Essential RNA-Based Technologies and Their Applications in Plant Functional Genomics.

    PubMed

    Teotia, Sachin; Singh, Deepali; Tang, Xiaoqing; Tang, Guiliang

    2016-02-01

    Genome sequencing has not only extended our understanding of the blueprints of many plant species but has also revealed the secrets of coding and non-coding genes. We present here a brief introduction to and personal account of key RNA-based technologies, as well as their development and applications for functional genomics of plant coding and non-coding genes, with a focus on short tandem target mimics (STTMs), artificial microRNAs (amiRNAs), and CRISPR/Cas9. In addition, their use in multiplex technologies for the functional dissection of gene networks is discussed. Copyright © 2015 Elsevier Ltd. All rights reserved.

  5. Fragile X mental retardation protein participates in non-coding RNA pathways.

    PubMed

    Li, En-Hui; Zhao, Xin; Zhang, Ce; Liu, Wei

    2018-02-20

    Fragile X syndrome is one of the most common forms of inherited intellectual disability. It is caused by mutations of the Fragile X mental retardation 1(FMR1) gene, resulting in either the loss or abnormal expression of the Fragile X mental retardation protein (FMRP). Recent research showed that FMRP participates in non-coding RNA pathways and plays various important roles in physiology, thereby extending our knowledge of the pathogenesis of the Fragile X syndrome. Initial studies showed that the Drosophila FMRP participates in siRNA and miRNA pathways by interacting with Dicer, Ago1 and Ago2, involved in neural activity and the fate determination of the germline stem cells. Subsequent studies showed that the Drosophila FMRP participates in piRNA pathway by interacting with Aub, Ago1 and Piwi in the maintenance of normal chromatin structures and genomic stability. More recent studies showed that FMRP is associated with lncRNA pathway, suggesting a potential role for the involvement in the clinical manifestations. In this review, we summarize the novel findings and explore the relationship between FMRP and non-coding RNA pathways, particularly the piRNA pathway, thereby providing critical insights on the molecular pathogenesis of Fragile X syndrome, and potential translational applications in clinical management of the disease.

  6. Mediator directs co-transcriptional heterochromatin assembly by RNA interference-dependent and -independent pathways.

    PubMed

    Oya, Eriko; Kato, Hiroaki; Chikashige, Yuji; Tsutsumi, Chihiro; Hiraoka, Yasushi; Murakami, Yota

    2013-01-01

    Heterochromatin at the pericentromeric repeats in fission yeast is assembled and spread by an RNAi-dependent mechanism, which is coupled with the transcription of non-coding RNA from the repeats by RNA polymerase II. In addition, Rrp6, a component of the nuclear exosome, also contributes to heterochromatin assembly and is coupled with non-coding RNA transcription. The multi-subunit complex Mediator, which directs initiation of RNA polymerase II-dependent transcription, has recently been suggested to function after initiation in processes such as elongation of transcription and splicing. However, the role of Mediator in the regulation of chromatin structure is not well understood. We investigated the role of Mediator in pericentromeric heterochromatin formation and found that deletion of specific subunits of the head domain of Mediator compromised heterochromatin structure. The Mediator head domain was required for Rrp6-dependent heterochromatin nucleation at the pericentromere and for RNAi-dependent spreading of heterochromatin into the neighboring region. In the latter process, Mediator appeared to contribute to efficient processing of siRNA from transcribed non-coding RNA, which was required for efficient spreading of heterochromatin. Furthermore, the head domain directed efficient transcription in heterochromatin. These results reveal a pivotal role for Mediator in multiple steps of transcription-coupled formation of pericentromeric heterochromatin. This observation further extends the role of Mediator to co-transcriptional chromatin regulation.

  7. Optimizing sgRNA structure to improve CRISPR-Cas9 knockout efficiency.

    PubMed

    Dang, Ying; Jia, Gengxiang; Choi, Jennie; Ma, Hongming; Anaya, Edgar; Ye, Chunting; Shankar, Premlata; Wu, Haoquan

    2015-12-15

    Single-guide RNA (sgRNA) is one of the two key components of the clustered regularly interspaced short palindromic repeats (CRISPR)-Cas9 genome-editing system. The current commonly used sgRNA structure has a shortened duplex compared with the native bacterial CRISPR RNA (crRNA)-transactivating crRNA (tracrRNA) duplex and contains a continuous sequence of thymines, which is the pause signal for RNA polymerase III and thus could potentially reduce transcription efficiency. Here, we systematically investigate the effect of these two elements on knockout efficiency and showed that modifying the sgRNA structure by extending the duplex length and mutating the fourth thymine of the continuous sequence of thymines to cytosine or guanine significantly, and sometimes dramatically, improves knockout efficiency in cells. In addition, the optimized sgRNA structure also significantly increases the efficiency of more challenging genome-editing procedures, such as gene deletion, which is important for inducing a loss of function in non-coding genes. By a systematic investigation of sgRNA structure we find that extending the duplex by approximately 5 bp combined with mutating the continuous sequence of thymines at position 4 to cytosine or guanine significantly increases gene knockout efficiency in CRISPR-Cas9-based genome editing experiments.

  8. Four RNA families with functional transient structures

    PubMed Central

    Zhu, Jing Yun A; Meyer, Irmtraud M

    2015-01-01

    Protein-coding and non-coding RNA transcripts perform a wide variety of cellular functions in diverse organisms. Several of their functional roles are expressed and modulated via RNA structure. A given transcript, however, can have more than a single functional RNA structure throughout its life, a fact which has been previously overlooked. Transient RNA structures, for example, are only present during specific time intervals and cellular conditions. We here introduce four RNA families with transient RNA structures that play distinct and diverse functional roles. Moreover, we show that these transient RNA structures are structurally well-defined and evolutionarily conserved. Since Rfam annotates one structure for each family, there is either no annotation for these transient structures or no such family. Thus, our alignments either significantly update and extend the existing Rfam families or introduce a new RNA family to Rfam. For each of the four RNA families, we compile a multiple-sequence alignment based on experimentally verified transient and dominant (dominant in terms of either the thermodynamic stability and/or attention received so far) RNA secondary structures using a combination of automated search via covariance model and manual curation. The first alignment is the Trp operon leader which regulates the operon transcription in response to tryptophan abundance through alternative structures. The second alignment is the HDV ribozyme which we extend to the 5′ flanking sequence. This flanking sequence is involved in the regulation of the transcript's self-cleavage activity. The third alignment is the 5′ UTR of the maturation protein from Levivirus which contains a transient structure that temporarily postpones the formation of the final inhibitory structure to allow translation of maturation protein. The fourth and last alignment is the SAM riboswitch which regulates the downstream gene expression by assuming alternative structures upon binding of SAM. All transient and dominant structures are mapped to our new alignments introduced here. PMID:25751035

  9. Four RNA families with functional transient structures.

    PubMed

    Zhu, Jing Yun A; Meyer, Irmtraud M

    2015-01-01

    Protein-coding and non-coding RNA transcripts perform a wide variety of cellular functions in diverse organisms. Several of their functional roles are expressed and modulated via RNA structure. A given transcript, however, can have more than a single functional RNA structure throughout its life, a fact which has been previously overlooked. Transient RNA structures, for example, are only present during specific time intervals and cellular conditions. We here introduce four RNA families with transient RNA structures that play distinct and diverse functional roles. Moreover, we show that these transient RNA structures are structurally well-defined and evolutionarily conserved. Since Rfam annotates one structure for each family, there is either no annotation for these transient structures or no such family. Thus, our alignments either significantly update and extend the existing Rfam families or introduce a new RNA family to Rfam. For each of the four RNA families, we compile a multiple-sequence alignment based on experimentally verified transient and dominant (dominant in terms of either the thermodynamic stability and/or attention received so far) RNA secondary structures using a combination of automated search via covariance model and manual curation. The first alignment is the Trp operon leader which regulates the operon transcription in response to tryptophan abundance through alternative structures. The second alignment is the HDV ribozyme which we extend to the 5' flanking sequence. This flanking sequence is involved in the regulation of the transcript's self-cleavage activity. The third alignment is the 5' UTR of the maturation protein from Levivirus which contains a transient structure that temporarily postpones the formation of the final inhibitory structure to allow translation of maturation protein. The fourth and last alignment is the SAM riboswitch which regulates the downstream gene expression by assuming alternative structures upon binding of SAM. All transient and dominant structures are mapped to our new alignments introduced here.

  10. A global view of the nonprotein-coding transcriptome in Plasmodium falciparum

    PubMed Central

    Raabe, Carsten A.; Sanchez, Cecilia P.; Randau, Gerrit; Robeck, Thomas; Skryabin, Boris V.; Chinni, Suresh V.; Kube, Michael; Reinhardt, Richard; Ng, Guey Hooi; Manickam, Ravichandran; Kuryshev, Vladimir Y.; Lanzer, Michael; Brosius, Juergen; Tang, Thean Hock; Rozhdestvensky, Timofey S.

    2010-01-01

    Nonprotein-coding RNAs (npcRNAs) represent an important class of regulatory molecules that act in many cellular pathways. Here, we describe the experimental identification and validation of the small npcRNA transcriptome of the human malaria parasite Plasmodium falciparum. We identified 630 novel npcRNA candidates. Based on sequence and structural motifs, 43 of them belong to the C/D and H/ACA-box subclasses of small nucleolar RNAs (snoRNAs) and small Cajal body-specific RNAs (scaRNAs). We further observed the exonization of a functional H/ACA snoRNA gene, which might contribute to the regulation of ribosomal protein L7a gene expression. Some of the small npcRNA candidates are from telomeric and subtelomeric repetitive regions, suggesting their potential involvement in maintaining telomeric integrity and subtelomeric gene silencing. We also detected 328 cis-encoded antisense npcRNAs (asRNAs) complementary to P. falciparum protein-coding genes of a wide range of biochemical pathways, including determinants of virulence and pathology. All cis-encoded asRNA genes tested exhibit lifecycle-specific expression profiles. For all but one of the respective sense–antisense pairs, we deduced concordant patterns of expression. Our findings have important implications for a better understanding of gene regulatory mechanisms in P. falciparum, revealing an extended and sophisticated npcRNA network that may control the expression of housekeeping genes and virulence factors. PMID:19864253

  11. A global view of the nonprotein-coding transcriptome in Plasmodium falciparum.

    PubMed

    Raabe, Carsten A; Sanchez, Cecilia P; Randau, Gerrit; Robeck, Thomas; Skryabin, Boris V; Chinni, Suresh V; Kube, Michael; Reinhardt, Richard; Ng, Guey Hooi; Manickam, Ravichandran; Kuryshev, Vladimir Y; Lanzer, Michael; Brosius, Juergen; Tang, Thean Hock; Rozhdestvensky, Timofey S

    2010-01-01

    Nonprotein-coding RNAs (npcRNAs) represent an important class of regulatory molecules that act in many cellular pathways. Here, we describe the experimental identification and validation of the small npcRNA transcriptome of the human malaria parasite Plasmodium falciparum. We identified 630 novel npcRNA candidates. Based on sequence and structural motifs, 43 of them belong to the C/D and H/ACA-box subclasses of small nucleolar RNAs (snoRNAs) and small Cajal body-specific RNAs (scaRNAs). We further observed the exonization of a functional H/ACA snoRNA gene, which might contribute to the regulation of ribosomal protein L7a gene expression. Some of the small npcRNA candidates are from telomeric and subtelomeric repetitive regions, suggesting their potential involvement in maintaining telomeric integrity and subtelomeric gene silencing. We also detected 328 cis-encoded antisense npcRNAs (asRNAs) complementary to P. falciparum protein-coding genes of a wide range of biochemical pathways, including determinants of virulence and pathology. All cis-encoded asRNA genes tested exhibit lifecycle-specific expression profiles. For all but one of the respective sense-antisense pairs, we deduced concordant patterns of expression. Our findings have important implications for a better understanding of gene regulatory mechanisms in P. falciparum, revealing an extended and sophisticated npcRNA network that may control the expression of housekeeping genes and virulence factors.

  12. An RNA tool kit to study the status of mouse ES cells: sex determination and stemness.

    PubMed

    Jay, F; Ciaudo, C

    2013-09-01

    Mouse embryonic stem cells (mESCs) are pluripotent stem cells derived from the inner cell mass of the blastocyst. They can be maintained under controlled culture conditions in a pluripotent state, or be induced to differentiate into all derivatives of the three primary germ layers: ectoderm, endoderm and mesoderm. Several studies have characterised the coding and non-coding (nc) RNA repertoires of mESCs, uncovering highly dynamic variations during the process of differentiation, but also qualitative differences pertaining to sex. For example, up-regulation of the long non-coding RNA Xist on the X chromosome induces gene silencing and X inactivation exclusively during female mESC differentiation. In contrast, specific small RNAs have been shown to be up-regulated during male mESC differentiation. Here, we illustrate how a small set of key coding and ncRNAs can be exploited as dynamic and sensitive markers of the stemness and/or the differentiation status of male or female mESC lines. We describe adapted techniques for the extended characterization and analysis of mESCs from as little material as that cultured in a single 75cm(2) flask. Copyright © 2013 Elsevier Inc. All rights reserved.

  13. Roles of Non-Coding RNA in Sugarcane-Microbe Interaction.

    PubMed

    Thiebaut, Flávia; Rojas, Cristian A; Grativol, Clícia; Calixto, Edmundo P da R; Motta, Mariana R; Ballesteros, Helkin G F; Peixoto, Barbara; de Lima, Berenice N S; Vieira, Lucas M; Walter, Maria Emilia; de Armas, Elvismary M; Entenza, Júlio O P; Lifschitz, Sergio; Farinelli, Laurent; Hemerly, Adriana S; Ferreira, Paulo C G

    2017-12-20

    Studies have highlighted the importance of non-coding RNA regulation in plant-microbe interaction. However, the roles of sugarcane microRNAs (miRNAs) in the regulation of disease responses have not been investigated. Firstly, we screened the sRNA transcriptome of sugarcane infected with Acidovorax avenae . Conserved and novel miRNAs were identified. Additionally, small interfering RNAs (siRNAs) were aligned to differentially expressed sequences from the sugarcane transcriptome. Interestingly, many siRNAs aligned to a transcript encoding a copper-transporter gene whose expression was induced in the presence of A. avenae , while the siRNAs were repressed in the presence of A. avenae . Moreover, a long intergenic non-coding RNA was identified as a potential target or decoy of miR408. To extend the bioinformatics analysis, we carried out independent inoculations and the expression patterns of six miRNAs were validated by quantitative reverse transcription-PCR (qRT-PCR). Among these miRNAs, miR408-a copper-microRNA-was downregulated. The cleavage of a putative miR408 target, a laccase, was confirmed by a modified 5'RACE (rapid amplification of cDNA ends) assay. MiR408 was also downregulated in samples infected with other pathogens, but it was upregulated in the presence of a beneficial diazotrophic bacteria. Our results suggest that regulation by miR408 is important in sugarcane sensing whether microorganisms are either pathogenic or beneficial, triggering specific miRNA-mediated regulatory mechanisms accordingly.

  14. Current Research on Non-Coding Ribonucleic Acid (RNA).

    PubMed

    Wang, Jing; Samuels, David C; Zhao, Shilin; Xiang, Yu; Zhao, Ying-Yong; Guo, Yan

    2017-12-05

    Non-coding ribonucleic acid (RNA) has without a doubt captured the interest of biomedical researchers. The ability to screen the entire human genome with high-throughput sequencing technology has greatly enhanced the identification, annotation and prediction of the functionality of non-coding RNAs. In this review, we discuss the current landscape of non-coding RNA research and quantitative analysis. Non-coding RNA will be categorized into two major groups by size: long non-coding RNAs and small RNAs. In long non-coding RNA, we discuss regular long non-coding RNA, pseudogenes and circular RNA. In small RNA, we discuss miRNA, transfer RNA, piwi-interacting RNA, small nucleolar RNA, small nuclear RNA, Y RNA, single recognition particle RNA, and 7SK RNA. We elaborate on the origin, detection method, and potential association with disease, putative functional mechanisms, and public resources for these non-coding RNAs. We aim to provide readers with a complete overview of non-coding RNAs and incite additional interest in non-coding RNA research.

  15. RNAiFold 2.0: a web server and software to design custom and Rfam-based RNA molecules.

    PubMed

    Garcia-Martin, Juan Antonio; Dotu, Ivan; Clote, Peter

    2015-07-01

    Several algorithms for RNA inverse folding have been used to design synthetic riboswitches, ribozymes and thermoswitches, whose activity has been experimentally validated. The RNAiFold software is unique among approaches for inverse folding in that (exhaustive) constraint programming is used instead of heuristic methods. For that reason, RNAiFold can generate all sequences that fold into the target structure or determine that there is no solution. RNAiFold 2.0 is a complete overhaul of RNAiFold 1.0, rewritten from the now defunct COMET language to C++. The new code properly extends the capabilities of its predecessor by providing a user-friendly pipeline to design synthetic constructs having the functionality of given Rfam families. In addition, the new software supports amino acid constraints, even for proteins translated in different reading frames from overlapping coding sequences; moreover, structure compatibility/incompatibility constraints have been expanded. With these features, RNAiFold 2.0 allows the user to design single RNA molecules as well as hybridization complexes of two RNA molecules. the web server, source code and linux binaries are publicly accessible at http://bioinformatics.bc.edu/clotelab/RNAiFold2.0. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  16. Using hidden Markov models and observed evolution to annotate viral genomes.

    PubMed

    McCauley, Stephen; Hein, Jotun

    2006-06-01

    ssRNA (single stranded) viral genomes are generally constrained in length and utilize overlapping reading frames to maximally exploit the coding potential within the genome length restrictions. This overlapping coding phenomenon leads to complex evolutionary constraints operating on the genome. In regions which code for more than one protein, silent mutations in one reading frame generally have a protein coding effect in another. To maximize coding flexibility in all reading frames, overlapping regions are often compositionally biased towards amino acids which are 6-fold degenerate with respect to the 64 codon alphabet. Previous methodologies have used this fact in an ad hoc manner to look for overlapping genes by motif matching. In this paper differentiated nucleotide compositional patterns in overlapping regions are incorporated into a probabilistic hidden Markov model (HMM) framework which is used to annotate ssRNA viral genomes. This work focuses on single sequence annotation and applies an HMM framework to ssRNA viral annotation. A description of how the HMM is parameterized, whilst annotating within a missing data framework is given. A Phylogenetic HMM (Phylo-HMM) extension, as applied to 14 aligned HIV2 sequences is also presented. This evolutionary extension serves as an illustration of the potential of the Phylo-HMM framework for ssRNA viral genomic annotation. The single sequence annotation procedure (SSA) is applied to 14 different strains of the HIV2 virus. Further results on alternative ssRNA viral genomes are presented to illustrate more generally the performance of the method. The results of the SSA method are encouraging however there is still room for improvement, and since there is overwhelming evidence to indicate that comparative methods can improve coding sequence (CDS) annotation, the SSA method is extended to a Phylo-HMM to incorporate evolutionary information. The Phylo-HMM extension is applied to the same set of 14 HIV2 sequences which are pre-aligned. The performance improvement that results from including the evolutionary information in the analysis is illustrated.

  17. Transterm—extended search facilities and improved integration with other databases

    PubMed Central

    Jacobs, Grant H.; Stockwell, Peter A.; Tate, Warren P.; Brown, Chris M.

    2006-01-01

    Transterm has now been publicly available for >10 years. Major changes have been made since its last description in this database issue in 2002. The current database provides data for key regions of mRNA sequences, a curated database of mRNA motifs and tools to allow users to investigate their own motifs or mRNA sequences. The key mRNA regions database is derived computationally from Genbank. It contains 3′ and 5′ flanking regions, the initiation and termination signal context and coding sequence for annotated CDS features from Genbank and RefSeq. The database is non-redundant, enabling summary files and statistics to be prepared for each species. Advances include providing extended search facilities, the database may now be searched by BLAST in addition to regular expressions (patterns) allowing users to search for motifs such as known miRNA sequences, and the inclusion of RefSeq data. The database contains >40 motifs or structural patterns important for translational control. In this release, patterns from UTRsite and Rfam are also incorporated with cross-referencing. Users may search their sequence data with Transterm or user-defined patterns. The system is accessible at . PMID:16381889

  18. Roles of Non-Coding RNA in Sugarcane-Microbe Interaction

    PubMed Central

    Grativol, Clícia; Motta, Mariana R.; Ballesteros, Helkin G. F.; Peixoto, Barbara; Vieira, Lucas M.; Walter, Maria Emilia; de Armas, Elvismary M.; Entenza, Júlio O. P.; Lifschitz, Sergio; Farinelli, Laurent; Hemerly, Adriana S.

    2017-01-01

    Studies have highlighted the importance of non-coding RNA regulation in plant-microbe interaction. However, the roles of sugarcane microRNAs (miRNAs) in the regulation of disease responses have not been investigated. Firstly, we screened the sRNA transcriptome of sugarcane infected with Acidovorax avenae. Conserved and novel miRNAs were identified. Additionally, small interfering RNAs (siRNAs) were aligned to differentially expressed sequences from the sugarcane transcriptome. Interestingly, many siRNAs aligned to a transcript encoding a copper-transporter gene whose expression was induced in the presence of A. avenae, while the siRNAs were repressed in the presence of A. avenae. Moreover, a long intergenic non-coding RNA was identified as a potential target or decoy of miR408. To extend the bioinformatics analysis, we carried out independent inoculations and the expression patterns of six miRNAs were validated by quantitative reverse transcription-PCR (qRT-PCR). Among these miRNAs, miR408—a copper-microRNA—was downregulated. The cleavage of a putative miR408 target, a laccase, was confirmed by a modified 5′RACE (rapid amplification of cDNA ends) assay. MiR408 was also downregulated in samples infected with other pathogens, but it was upregulated in the presence of a beneficial diazotrophic bacteria. Our results suggest that regulation by miR408 is important in sugarcane sensing whether microorganisms are either pathogenic or beneficial, triggering specific miRNA-mediated regulatory mechanisms accordingly. PMID:29657296

  19. Molecular mimicry between protein and tRNA.

    PubMed

    Nakamura, Y

    2001-01-01

    Mimicry is a sophisticated development in animals, fish, and plants that allows them to fool others by imitating a shape or color for diverse purposes, such as to prey, evade, lure, pollinate, or threaten. This is not restricted to the macro-world, but extends to the micro-world as molecular mimicry. Recent advances in structural and molecular biology uncovered a set of translation factors that resembles a tRNA shape and, in one case, even mimics a tRNA function for deciphering the genetic code. Nature must have evolved this art of molecular mimicry between protein and ribonucleic acid by using different protein structures until the translation factors sat in the cockpit of a ribosome machine, on behalf of tRNA, and achieved diverse actions. Structural, functional, and evolutionary aspects of molecular mimicry will be discussed.

  20. Biological significance of long non-coding RNA FTX expression in human colorectal cancer.

    PubMed

    Guo, Xiao-Bo; Hua, Zhu; Li, Chen; Peng, Li-Pan; Wang, Jing-Shen; Wang, Bo; Zhi, Qiao-Ming

    2015-01-01

    The purpose of this study was to determine the expression of long non-coding RNA (lncRNA) FTX and analyze its prognostic and biological significance in colorectal cancer (CRC). A quantitative reverse transcription PCR was performed to detect the expression of long non-coding RNA FTX in 35 pairs of colorectal cancer and corresponding noncancerous tissues. The expression of long non-coding RNA FTX was detected in 187 colorectal cancer tissues and its correlations with clinicopathological factors of patients were examined. Univariate and multivariate analyses were performed to analyze the prognostic significance of Long Non-coding RNA FTX expression. The effects of long non-coding RNA FTX expression on malignant phenotypes of colorectal cancer cells and its possible biological significances were further determined. Long non-coding RNA FTX was significantly upregulated in colorectal cancer tissues, and low long non-coding RNA FTX expression was significantly correlated with differentiation grade, lymph vascular invasion, and clinical stage. Patients with high long non-coding RNA FTX showed poorer overall survival than those with low long non-coding RNA FTX. Multivariate analyses indicated that status of long non-coding RNA FTX was an independent prognostic factor for patients. Functional analyses showed that upregulation of long non-coding RNA FTX significantly promoted growth, migration, invasion, and increased colony formation in colorectal cancer cells. Therefore, long non-coding RNA FTX may be a potential biomarker for predicting the survival of colorectal cancer patients and might be a molecular target for treatment of human colorectal cancer.

  1. Biological significance of long non-coding RNA FTX expression in human colorectal cancer

    PubMed Central

    Guo, Xiao-Bo; Hua, Zhu; Li, Chen; Peng, Li-Pan; Wang, Jing-Shen; Wang, Bo; Zhi, Qiao-Ming

    2015-01-01

    The purpose of this study was to determine the expression of long non-coding RNA (lncRNA) FTX and analyze its prognostic and biological significance in colorectal cancer (CRC). A quantitative reverse transcription PCR was performed to detect the expression of long non-coding RNA FTX in 35 pairs of colorectal cancer and corresponding noncancerous tissues. The expression of long non-coding RNA FTX was detected in 187 colorectal cancer tissues and its correlations with clinicopathological factors of patients were examined. Univariate and multivariate analyses were performed to analyze the prognostic significance of Long Non-coding RNA FTX expression. The effects of long non-coding RNA FTX expression on malignant phenotypes of colorectal cancer cells and its possible biological significances were further determined. Long non-coding RNA FTX was significantly upregulated in colorectal cancer tissues, and low long non-coding RNA FTX expression was significantly correlated with differentiation grade, lymph vascular invasion, and clinical stage. Patients with high long non-coding RNA FTX showed poorer overall survival than those with low long non-coding RNA FTX. Multivariate analyses indicated that status of long non-coding RNA FTX was an independent prognostic factor for patients. Functional analyses showed that upregulation of long non-coding RNA FTX significantly promoted growth, migration, invasion, and increased colony formation in colorectal cancer cells. Therefore, long non-coding RNA FTX may be a potential biomarker for predicting the survival of colorectal cancer patients and might be a molecular target for treatment of human colorectal cancer. PMID:26629053

  2. The Evolution and Expression Pattern of Human Overlapping lncRNA and Protein-coding Gene Pairs.

    PubMed

    Ning, Qianqian; Li, Yixue; Wang, Zhen; Zhou, Songwen; Sun, Hong; Yu, Guangjun

    2017-03-27

    Long non-coding RNA overlapping with protein-coding gene (lncRNA-coding pair) is a special type of overlapping genes. Protein-coding overlapping genes have been well studied and increasing attention has been paid to lncRNAs. By studying lncRNA-coding pairs in human genome, we showed that lncRNA-coding pairs were more likely to be generated by overprinting and retaining genes in lncRNA-coding pairs were given higher priority than non-overlapping genes. Besides, the preference of overlapping configurations preserved during evolution was based on the origin of lncRNA-coding pairs. Further investigations showed that lncRNAs promoting the splicing of their embedded protein-coding partners was a unilateral interaction, but the existence of overlapping partners improving the gene expression was bidirectional and the effect was decreased with the increased evolutionary age of genes. Additionally, the expression of lncRNA-coding pairs showed an overall positive correlation and the expression correlation was associated with their overlapping configurations, local genomic environment and evolutionary age of genes. Comparison of the expression correlation of lncRNA-coding pairs between normal and cancer samples found that the lineage-specific pairs including old protein-coding genes may play an important role in tumorigenesis. This work presents a systematically comprehensive understanding of the evolution and the expression pattern of human lncRNA-coding pairs.

  3. A novel RNA binding surface of the TAM domain of TIP5/BAZ2A mediates epigenetic regulation of rRNA genes.

    PubMed

    Anosova, Irina; Melnik, Svitlana; Tripsianes, Konstantinos; Kateb, Fatiha; Grummt, Ingrid; Sattler, Michael

    2015-05-26

    The chromatin remodeling complex NoRC, comprising the subunits SNF2h and TIP5/BAZ2A, mediates heterochromatin formation at major clusters of repetitive elements, including rRNA genes, centromeres and telomeres. Association with chromatin requires the interaction of the TAM (TIP5/ARBP/MBD) domain of TIP5 with noncoding RNA, which targets NoRC to specific genomic loci. Here, we show that the NMR structure of the TAM domain of TIP5 resembles the fold of the MBD domain, found in methyl-CpG binding proteins. However, the TAM domain exhibits an extended MBD fold with unique C-terminal extensions that constitute a novel surface for RNA binding. Mutation of critical amino acids within this surface abolishes RNA binding in vitro and in vivo. Our results explain the distinct binding specificities of TAM and MBD domains to RNA and methylated DNA, respectively, and reveal structural features for the interaction of NoRC with non-coding RNA. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  4. In human pseudouridine synthase 1 (hPus1), a C-terminal helical insert blocks tRNA from binding in the same orientation as in the Pus1 bacterial homologue TruA, consistent with their different target selectivities.

    PubMed

    Czudnochowski, Nadine; Wang, Amy Liya; Finer-Moore, Janet; Stroud, Robert M

    2013-10-23

    Human pseudouridine (Ψ) synthase Pus1 (hPus1) modifies specific uridine residues in several non-coding RNAs: tRNA, U2 spliceosomal RNA, and steroid receptor activator RNA. We report three structures of the catalytic core domain of hPus1 from two crystal forms, at 1.8Å resolution. The structures are the first of a mammalian Ψ synthase from the set of five Ψ synthase families common to all kingdoms of life. hPus1 adopts a fold similar to bacterial Ψ synthases, with a central antiparallel β-sheet flanked by helices and loops. A flexible hinge at the base of the sheet allows the enzyme to open and close around an electropositive active-site cleft. In one crystal form, a molecule of Mes [2-(N-morpholino)ethane sulfonic acid] mimics the target uridine of an RNA substrate. A positively charged electrostatic surface extends from the active site towards the N-terminus of the catalytic domain, suggesting an extensive binding site specific for target RNAs. Two α-helices C-terminal to the core domain, but unique to hPus1, extend along the back and top of the central β-sheet and form the walls of the RNA binding surface. Docking of tRNA to hPus1 in a productive orientation requires only minor conformational changes to enzyme and tRNA. The docked tRNA is bound by the electropositive surface of the protein employing a completely different binding mode than that seen for the tRNA complex of the Escherichia coli homologue TruA. Copyright © 2013 Elsevier Ltd. All rights reserved.

  5. LncRNA mediated regulation of aging pathways in Drosophila melanogaster during dietary restriction.

    PubMed

    Yang, Deying; Lian, Ting; Tu, Jianbo; Gaur, Uma; Mao, Xueping; Fan, Xiaolan; Li, Diyan; Li, Ying; Yang, Mingyao

    2016-09-27

    Dietary restriction (DR) extends lifespan in many species which is a well-known phenomenon. Long non-coding RNAs (lncRNAs) play an important role in regulation of cell senescence and important age-related signaling pathways. Here, we profiled the lncRNA and mRNA transcriptome of fruit flies at 7 day and 42 day during DR and fully-fed conditions, respectively. In general, 102 differentially expressed lncRNAs and 1406 differentially expressed coding genes were identified. Most informatively we found a large number of differentially expressed lncRNAs and their targets enriched in GO and KEGG analysis. We discovered some new aging related signaling pathways during DR, such as hippo signaling pathway-fly, phototransduction-fly and protein processing in endoplasmic reticulum etc. Novel lncRNAs XLOC_092363 and XLOC_166557 are found to be located in 10 kb upstream sequences of hairy and ems promoters, respectively. Furthermore, tissue specificity of some novel lncRNAs had been analyzed at 7 day of DR in fly head, gut and fat body. Also the silencing of lncRNA XLOC_076307 resulted in altered expression level of its targets including Gadd45 (involved in FoxO signaling pathway). Together, the results implicated many lncRNAs closely associated with dietary restriction, which could provide a resource for lncRNA in aging and age-related disease field.

  6. Human coding RNA editing is generally nonadaptive

    PubMed Central

    Xu, Guixia; Zhang, Jianzhi

    2014-01-01

    Impairment of RNA editing at a handful of coding sites causes severe disorders, prompting the view that coding RNA editing is highly advantageous. Recent genomic studies have expanded the list of human coding RNA editing sites by more than 100 times, raising the question of how common advantageous RNA editing is. Analyzing 1,783 human coding A-to-G editing sites, we show that both the frequency and level of RNA editing decrease as the importance of a site or gene increases; that during evolution, edited As are more likely than unedited As to be replaced with Gs but not with Ts or Cs; and that among nonsynonymously edited As, those that are evolutionarily least conserved exhibit the highest editing levels. These and other observations reveal the overall nonadaptive nature of coding RNA editing, despite the presence of a few sites in which editing is clearly beneficial. We propose that most observed coding RNA editing results from tolerable promiscuous targeting by RNA editing enzymes, the original physiological functions of which remain elusive. PMID:24567376

  7. Internal control regions for transcription of eukaryotic tRNA genes.

    PubMed Central

    Sharp, S; DeFranco, D; Dingermann, T; Farrell, P; Söll, D

    1981-01-01

    We have identified the region within a eukaryotic tRNA gene required for initiation of transcription. These results were obtained by systematically constructing deletions extending from the 5' or the 3' flanking regions into a cloned Drosophila tRNAArg gene by using nuclease BAL 31. The ability of the newly generated deletion clones to direct the in vitro synthesis of tRNA precursors was measured in transcription systems from Xenopus laevis oocytes, Drosophila Kc cells, and HeLa cells. Two control regions within the coding sequence were identified. The first was essential for transcription and was contained between nucleotides 8 and 25 of the mature tRNA sequence. Genes devoid of the second control region, which was contained between nucleotides 50 and 58 of the mature tRNA sequence, could be transcribed but with reduced efficiency. Thus, the promoter regions within a tRNA gene encode the tRNA sequences of the D stem and D loop, the invariant uridine at position 8, and the semi-invariant G-T-psi-C sequence. Images PMID:6947245

  8. Complex organisation and structure of the ghrelin antisense strand gene GHRLOS, a candidate non-coding RNA gene

    PubMed Central

    Seim, Inge; Carter, Shea L; Herington, Adrian C; Chopin, Lisa K

    2008-01-01

    Background The peptide hormone ghrelin has many important physiological and pathophysiological roles, including the stimulation of growth hormone (GH) release, appetite regulation, gut motility and proliferation of cancer cells. We previously identified a gene on the opposite strand of the ghrelin gene, ghrelinOS (GHRLOS), which spans the promoter and untranslated regions of the ghrelin gene (GHRL). Here we further characterise GHRLOS. Results We have described GHRLOS mRNA isoforms that extend over 1.4 kb of the promoter region and 106 nucleotides of exon 4 of the ghrelin gene, GHRL. These GHRLOS transcripts initiate 4.8 kb downstream of the terminal exon 4 of GHRL and are present in the 3' untranslated exon of the adjacent gene TATDN2 (TatD DNase domain containing 2). Interestingly, we have also identified a putative non-coding TATDN2-GHRLOS chimaeric transcript, indicating that GHRLOS RNA biogenesis is extremely complex. Moreover, we have discovered that the 3' region of GHRLOS is also antisense, in a tail-to-tail fashion to a novel terminal exon of the neighbouring SEC13 gene, which is important in protein transport. Sequence analyses revealed that GHRLOS is riddled with stop codons, and that there is little nucleotide and amino-acid sequence conservation of the GHRLOS gene between vertebrates. The gene spans 44 kb on 3p25.3, is extensively spliced and harbours multiple variable exons. We have also investigated the expression of GHRLOS and found evidence of differential tissue expression. It is highly expressed in tissues which are emerging as major sites of non-coding RNA expression (the thymus, brain, and testis), as well as in the ovary and uterus. In contrast, very low levels were found in the stomach where sense, GHRL derived RNAs are highly expressed. Conclusion GHRLOS RNA transcripts display several distinctive features of non-coding (ncRNA) genes, including 5' capping, polyadenylation, extensive splicing and short open reading frames. The gene is also non-conserved, with differential and tissue-restricted expression. The overlapping genomic arrangement of GHRLOS with the ghrelin gene indicates that it is likely to have interesting regulatory and functional roles in the ghrelin axis. PMID:18954468

  9. Complex organisation and structure of the ghrelin antisense strand gene GHRLOS, a candidate non-coding RNA gene.

    PubMed

    Seim, Inge; Carter, Shea L; Herington, Adrian C; Chopin, Lisa K

    2008-10-28

    The peptide hormone ghrelin has many important physiological and pathophysiological roles, including the stimulation of growth hormone (GH) release, appetite regulation, gut motility and proliferation of cancer cells. We previously identified a gene on the opposite strand of the ghrelin gene, ghrelinOS (GHRLOS), which spans the promoter and untranslated regions of the ghrelin gene (GHRL). Here we further characterise GHRLOS. We have described GHRLOS mRNA isoforms that extend over 1.4 kb of the promoter region and 106 nucleotides of exon 4 of the ghrelin gene, GHRL. These GHRLOS transcripts initiate 4.8 kb downstream of the terminal exon 4 of GHRL and are present in the 3' untranslated exon of the adjacent gene TATDN2 (TatD DNase domain containing 2). Interestingly, we have also identified a putative non-coding TATDN2-GHRLOS chimaeric transcript, indicating that GHRLOS RNA biogenesis is extremely complex. Moreover, we have discovered that the 3' region of GHRLOS is also antisense, in a tail-to-tail fashion to a novel terminal exon of the neighbouring SEC13 gene, which is important in protein transport. Sequence analyses revealed that GHRLOS is riddled with stop codons, and that there is little nucleotide and amino-acid sequence conservation of the GHRLOS gene between vertebrates. The gene spans 44 kb on 3p25.3, is extensively spliced and harbours multiple variable exons. We have also investigated the expression of GHRLOS and found evidence of differential tissue expression. It is highly expressed in tissues which are emerging as major sites of non-coding RNA expression (the thymus, brain, and testis), as well as in the ovary and uterus. In contrast, very low levels were found in the stomach where sense, GHRL derived RNAs are highly expressed. GHRLOS RNA transcripts display several distinctive features of non-coding (ncRNA) genes, including 5' capping, polyadenylation, extensive splicing and short open reading frames. The gene is also non-conserved, with differential and tissue-restricted expression. The overlapping genomic arrangement of GHRLOS with the ghrelin gene indicates that it is likely to have interesting regulatory and functional roles in the ghrelin axis.

  10. Molecular, Cellular, and Structural Mechanisms of Cocaine Addiction: A Key Role for MicroRNAs

    PubMed Central

    Jonkman, Sietse; Kenny, Paul J

    2013-01-01

    The rewarding properties of cocaine play a key role in establishing and maintaining the drug-taking habit. However, as exposure to cocaine increases, drug use can transition from controlled to compulsive. Importantly, very little is known about the neurobiological mechanisms that control this switch in drug use that defines addiction. MicroRNAs (miRNAs) are small non-protein coding RNA transcripts that can regulate the expression of messenger RNAs that code for proteins. Because of their highly pleiotropic nature, each miRNA has the potential to regulate hundreds or even thousands of protein-coding RNA transcripts. This property of miRNAs has generated considerable interest in their potential involvement in complex psychiatric disorders such as addiction, as each miRNA could potentially influence the many different molecular and cellular adaptations that arise in response to drug use that are hypothesized to drive the emergence of addiction. Here, we review recent evidence supporting a key role for miRNAs in the ventral striatum in regulating the rewarding and reinforcing properties of cocaine in animals with limited exposure to the drug. Moreover, we discuss evidence suggesting that miRNAs in the dorsal striatum control the escalation of drug intake in rats with extended cocaine access. These findings highlight the central role for miRNAs in drug-induced neuroplasticity in brain reward systems that drive the emergence of compulsive-like drug use in animals, and suggest that a better understanding of how miRNAs control drug intake will provide new insights into the neurobiology of drug addiction. PMID:22968819

  11. External Guide Sequences Targeting the aac(6′)-Ib mRNA Induce Inhibition of Amikacin Resistance▿

    PubMed Central

    Bistué, Alfonso J. C. Soler; Ha, Hongphuc; Sarno, Renee; Don, Michelle; Zorreguieta, Angeles; Tolmasky, Marcelo E.

    2007-01-01

    The dissemination of AAC(6′)-I-type acetyltransferases have rendered amikacin and other aminoglycosides all but useless in some parts of the world. Antisense technologies could be an alternative to extend the life of these antibiotics. External guide sequences are short antisense oligoribonucleotides that induce RNase P-mediated cleavage of a target RNA by forming a precursor tRNA-like complex. Thirteen-nucleotide external guide sequences complementary to locations within five regions accessible for interaction with antisense oligonucleotides in the mRNA that encodes AAC(6′)-Ib were analyzed. While small variations in the location targeted by different external guide sequences resulted in big changes in efficiency of binding to native aac(6′)-Ib mRNA, most of them induced high levels of RNase P-mediated cleavage in vitro. Recombinant plasmids coding for selected external guide sequences were introduced into Escherichia coli harboring aac(6′)-Ib, and the transformant strains were tested to determine their resistance to amikacin. The two external guide sequences that showed the strongest binding efficiency to the mRNA in vitro, EGSC3 and EGSA2, interfered with expression of the resistance phenotype at different degrees. Growth curve experiments showed that E. coli cells harboring a plasmid coding for EGSC3, the external guide sequence with the highest mRNA binding affinity in vitro, did not grow for at least 300 min in the presence of 15 μg of amikacin/ml. EGSA2, which had a lower mRNA-binding affinity in vitro than EGSC3, inhibited the expression of amikacin resistance at a lesser level; growth of E. coli harboring a plasmid coding for EGSA2, in the presence of 15 μg of amikacin/ml was undetectable for 200 min but reached an optical density at 600 nm of 0.5 after 5 h of incubation. Our results indicate that the use of external guide sequences could be a viable strategy to preserve the efficacy of amikacin. PMID:17387154

  12. The identification and functional annotation of RNA structures conserved in vertebrates

    PubMed Central

    Seemann, Stefan E.; Mirza, Aashiq H.; Hansen, Claus; Bang-Berthelsen, Claus H.; Garde, Christian; Christensen-Dalsgaard, Mikkel; Torarinsson, Elfar; Yao, Zizhen; Workman, Christopher T.; Pociot, Flemming; Nielsen, Henrik; Tommerup, Niels; Ruzzo, Walter L.; Gorodkin, Jan

    2017-01-01

    Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization, and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for conserved RNA structures (CRSs), leveraging structure-based, rather than sequence-based, alignments. After careful correction for sequence identity and GC content, we predict ∼516,000 human genomic regions containing CRSs. We find that a substantial fraction of human–mouse CRS regions (1) colocalize consistently with binding sites of the same RNA binding proteins (RBPs) or (2) are transcribed in corresponding tissues. Additionally, a CaptureSeq experiment revealed expression of many of our CRS regions in human fetal brain, including 662 novel ones. For selected human and mouse candidate pairs, qRT-PCR and in vitro RNA structure probing supported both shared expression and shared structure despite low abundance and low sequence identity. About 30,000 CRS regions are located near coding or long noncoding RNA genes or within enhancers. Structured (CRS overlapping) enhancer RNAs and extended 3′ ends have significantly increased expression levels over their nonstructured counterparts. Our findings of transcribed uncharacterized regulatory regions that contain CRSs support their RNA-mediated functionality. PMID:28487280

  13. Rooted tRNAomes and evolution of the genetic code

    PubMed Central

    Pak, Daewoo; Du, Nan; Kim, Yunsoo; Sun, Yanni

    2018-01-01

    ABSTRACT We advocate for a tRNA- rather than an mRNA-centric model for evolution of the genetic code. The mechanism for evolution of cloverleaf tRNA provides a root sequence for radiation of tRNAs and suggests a simplified understanding of code evolution. To analyze code sectoring, rooted tRNAomes were compared for several archaeal and one bacterial species. Rooting of tRNAome trees reveals conserved structures, indicating how the code was shaped during evolution and suggesting a model for evolution of a LUCA tRNAome tree. We propose the polyglycine hypothesis that the initial product of the genetic code may have been short chain polyglycine to stabilize protocells. In order to describe how anticodons were allotted in evolution, the sectoring-degeneracy hypothesis is proposed. Based on sectoring, a simple stepwise model is developed, in which the code sectors from a 1→4→8→∼16 letter code. At initial stages of code evolution, we posit strong positive selection for wobble base ambiguity, supporting convergence to 4-codon sectors and ∼16 letters. In a later stage, ∼5–6 letters, including stops, were added through innovating at the anticodon wobble position. In archaea and bacteria, tRNA wobble adenine is negatively selected, shrinking the maximum size of the primordial genetic code to 48 anticodons. Because 64 codons are recognized in mRNA, tRNA-mRNA coevolution requires tRNA wobble position ambiguity leading to degeneracy of the code. PMID:29372672

  14. Global Organization of a Positive-strand RNA Virus Genome

    PubMed Central

    Wu, Baodong; Grigull, Jörg; Ore, Moriam O.; Morin, Sylvie; White, K. Andrew

    2013-01-01

    The genomes of plus-strand RNA viruses contain many regulatory sequences and structures that direct different viral processes. The traditional view of these RNA elements are as local structures present in non-coding regions. However, this view is changing due to the discovery of regulatory elements in coding regions and functional long-range intra-genomic base pairing interactions. The ∼4.8 kb long RNA genome of the tombusvirus tomato bushy stunt virus (TBSV) contains these types of structural features, including six different functional long-distance interactions. We hypothesized that to achieve these multiple interactions this viral genome must utilize a large-scale organizational strategy and, accordingly, we sought to assess the global conformation of the entire TBSV genome. Atomic force micrographs of the genome indicated a mostly condensed structure composed of interconnected protrusions extending from a central hub. This configuration was consistent with the genomic secondary structure model generated using high-throughput selective 2′-hydroxyl acylation analysed by primer extension (i.e. SHAPE), which predicted different sized RNA domains originating from a central region. Known RNA elements were identified in both domain and inter-domain regions, and novel structural features were predicted and functionally confirmed. Interestingly, only two of the six long-range interactions known to form were present in the structural model. However, for those interactions that did not form, complementary partner sequences were positioned relatively close to each other in the structure, suggesting that the secondary structure level of viral genome structure could provide a basic scaffold for the formation of different long-range interactions. The higher-order structural model for the TBSV RNA genome provides a snapshot of the complex framework that allows multiple functional components to operate in concert within a confined context. PMID:23717202

  15. A-to-I editing of coding and non-coding RNAs by ADARs

    PubMed Central

    Nishikura, Kazuko

    2016-01-01

    Adenosine deaminases acting on RNA (ADARs) convert adenosine to inosine in double-stranded RNA. This A-to-I editing occurs not only in protein-coding regions of mRNAs, but also frequently in non-coding regions that contain inverted Alu repeats. Editing of coding sequences can result in the expression of functionally altered proteins that are not encoded in the genome, whereas the significance of Alu editing remains largely unknown. Certain microRNA (miRNA) precursors are also edited, leading to reduced expression or altered function of mature miRNAs. Conversely, recent studies indicate that ADAR1 forms a complex with Dicer to promote miRNA processing, revealing a new function of ADAR1 in the regulation of RNA interference. PMID:26648264

  16. SSMART: Sequence-structure motif identification for RNA-binding proteins.

    PubMed

    Munteanu, Alina; Mukherjee, Neelanjan; Ohler, Uwe

    2018-06-11

    RNA-binding proteins (RBPs) regulate every aspect of RNA metabolism and function. There are hundreds of RBPs encoded in the eukaryotic genomes, and each recognize its RNA targets through a specific mixture of RNA sequence and structure properties. For most RBPs, however, only a primary sequence motif has been determined, while the structure of the binding sites is uncharacterized. We developed SSMART, an RNA motif finder that simultaneously models the primary sequence and the structural properties of the RNA targets sites. The sequence-structure motifs are represented as consensus strings over a degenerate alphabet, extending the IUPAC codes for nucleotides to account for secondary structure preferences. Evaluation on synthetic data showed that SSMART is able to recover both sequence and structure motifs implanted into 3'UTR-like sequences, for various degrees of structured/unstructured binding sites. In addition, we successfully used SSMART on high-throughput in vivo and in vitro data, showing that we not only recover the known sequence motif, but also gain insight into the structural preferences of the RBP. Availability: SSMART is freely available at https://ohlerlab.mdc-berlin.de/software/SSMART_137/. Supplementary data are available at Bioinformatics online.

  17. Non-coding, mRNA-like RNAs database Y2K.

    PubMed

    Erdmann, V A; Szymanski, M; Hochberg, A; Groot, N; Barciszewski, J

    2000-01-01

    In last few years much data has accumulated on various non-translatable RNA transcripts that are synthesised in different cells. They are lacking in protein coding capacity and it seems that they work mainly or exclusively at the RNA level. All known non-coding RNA transcripts are collected in the database: http://www. man.poznan.pl/5SData/ncRNA/index.html

  18. Non-coding, mRNA-like RNAs database Y2K

    PubMed Central

    Erdmann, Volker A.; Szymanski, Maciej; Hochberg, Abraham; Groot, Nathan de; Barciszewski, Jan

    2000-01-01

    In last few years much data has accumulated on various non-translatable RNA transcripts that are synthesised in different cells. They are lacking in protein coding capacity and it seems that they work mainly or exclusively at the RNA level. All known non-coding RNA transcripts are collected in the database: http://www.man.poznan.pl/5SData/ncRNA/index.html PMID:10592224

  19. Problem-Based Test: An "In Vitro" Experiment to Analyze the Genetic Code

    ERIC Educational Resources Information Center

    Szeberenyi, Jozsef

    2010-01-01

    Terms to be familiar with before you start to solve the test: genetic code, translation, synthetic polynucleotide, leucine, serine, filter precipitation, radioactivity measurement, template, mRNA, tRNA, rRNA, aminoacyl-tRNA synthesis, ribosomes, degeneration of the code, wobble, initiation, and elongation of protein synthesis, initiation codon.…

  20. Coevolution Theory of the Genetic Code at Age Forty: Pathway to Translation and Synthetic Life

    PubMed Central

    Wong, J. Tze-Fei; Ng, Siu-Kin; Mat, Wai-Kin; Hu, Taobo; Xue, Hong

    2016-01-01

    The origins of the components of genetic coding are examined in the present study. Genetic information arose from replicator induction by metabolite in accordance with the metabolic expansion law. Messenger RNA and transfer RNA stemmed from a template for binding the aminoacyl-RNA synthetase ribozymes employed to synthesize peptide prosthetic groups on RNAs in the Peptidated RNA World. Coevolution of the genetic code with amino acid biosynthesis generated tRNA paralogs that identify a last universal common ancestor (LUCA) of extant life close to Methanopyrus, which in turn points to archaeal tRNA introns as the most primitive introns and the anticodon usage of Methanopyrus as an ancient mode of wobble. The prediction of the coevolution theory of the genetic code that the code should be a mutable code has led to the isolation of optional and mandatory synthetic life forms with altered protein alphabets. PMID:26999216

  1. Metformin-Induced Changes of the Coding Transcriptome and Non-Coding RNAs in the Livers of Non-Alcoholic Fatty Liver Disease Mice.

    PubMed

    Guo, Jun; Zhou, Yuan; Cheng, Yafen; Fang, Weiwei; Hu, Gang; Wei, Jie; Lin, Yajun; Man, Yong; Guo, Lixin; Sun, Mingxiao; Cui, Qinghua; Li, Jian

    2018-01-01

    Recent studies have suggested that changes in non-coding mRNA play a key role in the progression of non-alcoholic fatty liver disease (NAFLD). Metformin is now recommended and effective for the treatment of NAFLD. We hope the current analyses of the non-coding mRNA transcriptome will provide a better presentation of the potential roles of mRNAs and long non-coding RNAs (lncRNAs) that underlie NAFLD and metformin intervention. The present study mainly analysed changes in the coding transcriptome and non-coding RNAs after the application of a five-week metformin intervention. Liver samples from three groups of mice were harvested for transcriptome profiling, which covered mRNA, lncRNA, microRNA (miRNA) and circular RNA (circRNA), using a microarray technique. A systematic alleviation of high-fat diet (HFD)-induced transcriptome alterations by metformin was observed. The metformin treatment largely reversed the correlations with diabetes-related pathways. Our analysis also suggested interaction networks between differentially expressed lncRNAs and known hepatic disease genes and interactions between circRNA and their disease-related miRNA partners. Eight HFD-responsive lncRNAs and three metformin-responsive lncRNAs were noted due to their widespread associations with disease genes. Moreover, seven miRNAs that interacted with multiple differentially expressed circRNAs were highlighted because they were likely to be associated with metabolic or liver diseases. The present study identified novel changes in the coding transcriptome and non-coding RNAs in the livers of NAFLD mice after metformin treatment that might shed light on the underlying mechanism by which metformin impedes the progression of NAFLD. © 2018 The Author(s). Published by S. Karger AG, Basel.

  2. Bio—Cryptography: A Possible Coding Role for RNA Redundancy

    NASA Astrophysics Data System (ADS)

    Regoli, M.

    2009-03-01

    The RNA-Crypto System (shortly RCS) is a symmetric key algorithm to cipher data. The idea for this new algorithm starts from the observation of nature. In particular from the observation of RNA behavior and some of its properties. The RNA sequences have some sections called Introns. Introns, derived from the term "intragenic regions," are non-coding sections of precursor mRNA (pre-mRNA) or other RNAs, that are removed (spliced out of the RNA) before the mature RNA is formed. Once the introns have been spliced out of a pre-mRNA, the resulting mRNA sequence is ready to be translated into a protein. The corresponding parts of a gene are known as introns as well. The nature and the role of Introns in the pre-mRNA is not clear and it is under ponderous researches by biologists but, in our case, we will use the presence of Introns in the RNA-Crypto System output as a strong method to add chaotic non coding information and an unnecessary behavior in the access to the secret key to code the messages. In the RNA-Crypto System algorithm the introns are sections of the ciphered message with non-coding information as well as in the precursor mRNA.

  3. Promoter analysis reveals globally differential regulation of human long non-coding RNA and protein-coding genes

    DOE PAGES

    Alam, Tanvir; Medvedeva, Yulia A.; Jia, Hui; ...

    2014-10-02

    Transcriptional regulation of protein-coding genes is increasingly well-understood on a global scale, yet no comparable information exists for long non-coding RNA (lncRNA) genes, which were recently recognized to be as numerous as protein-coding genes in mammalian genomes. We performed a genome-wide comparative analysis of the promoters of human lncRNA and protein-coding genes, finding global differences in specific genetic and epigenetic features relevant to transcriptional regulation. These two groups of genes are hence subject to separate transcriptional regulatory programs, including distinct transcription factor (TF) proteins that significantly favor lncRNA, rather than coding-gene, promoters. We report a specific signature of promoter-proximal transcriptionalmore » regulation of lncRNA genes, including several distinct transcription factor binding sites (TFBS). Experimental DNase I hypersensitive site profiles are consistent with active configurations of these lncRNA TFBS sets in diverse human cell types. TFBS ChIP-seq datasets confirm the binding events that we predicted using computational approaches for a subset of factors. For several TFs known to be directly regulated by lncRNAs, we find that their putative TFBSs are enriched at lncRNA promoters, suggesting that the TFs and the lncRNAs may participate in a bidirectional feedback loop regulatory network. Accordingly, cells may be able to modulate lncRNA expression levels independently of mRNA levels via distinct regulatory pathways. Our results also raise the possibility that, given the historical reliance on protein-coding gene catalogs to define the chromatin states of active promoters, a revision of these chromatin signature profiles to incorporate expressed lncRNA genes is warranted in the future.« less

  4. Promoter analysis reveals globally differential regulation of human long non-coding RNA and protein-coding genes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Alam, Tanvir; Medvedeva, Yulia A.; Jia, Hui

    Transcriptional regulation of protein-coding genes is increasingly well-understood on a global scale, yet no comparable information exists for long non-coding RNA (lncRNA) genes, which were recently recognized to be as numerous as protein-coding genes in mammalian genomes. We performed a genome-wide comparative analysis of the promoters of human lncRNA and protein-coding genes, finding global differences in specific genetic and epigenetic features relevant to transcriptional regulation. These two groups of genes are hence subject to separate transcriptional regulatory programs, including distinct transcription factor (TF) proteins that significantly favor lncRNA, rather than coding-gene, promoters. We report a specific signature of promoter-proximal transcriptionalmore » regulation of lncRNA genes, including several distinct transcription factor binding sites (TFBS). Experimental DNase I hypersensitive site profiles are consistent with active configurations of these lncRNA TFBS sets in diverse human cell types. TFBS ChIP-seq datasets confirm the binding events that we predicted using computational approaches for a subset of factors. For several TFs known to be directly regulated by lncRNAs, we find that their putative TFBSs are enriched at lncRNA promoters, suggesting that the TFs and the lncRNAs may participate in a bidirectional feedback loop regulatory network. Accordingly, cells may be able to modulate lncRNA expression levels independently of mRNA levels via distinct regulatory pathways. Our results also raise the possibility that, given the historical reliance on protein-coding gene catalogs to define the chromatin states of active promoters, a revision of these chromatin signature profiles to incorporate expressed lncRNA genes is warranted in the future.« less

  5. Revisiting the operational RNA code for amino acids: Ensemble attributes and their implications.

    PubMed

    Shaul, Shaul; Berel, Dror; Benjamini, Yoav; Graur, Dan

    2010-01-01

    It has been suggested that tRNA acceptor stems specify an operational RNA code for amino acids. In the last 20 years several attributes of the putative code have been elucidated for a small number of model organisms. To gain insight about the ensemble attributes of the code, we analyzed 4925 tRNA sequences from 102 bacterial and 21 archaeal species. Here, we used a classification and regression tree (CART) methodology, and we found that the degrees of degeneracy or specificity of the RNA codes in both Archaea and Bacteria differ from those of the genetic code. We found instances of taxon-specific alternative codes, i.e., identical acceptor stem determinants encrypting different amino acids in different species, as well as instances of ambiguity, i.e., identical acceptor stem determinants encrypting two or more amino acids in the same species. When partitioning the data by class of synthetase, the degree of code ambiguity was significantly reduced. In cryptographic terms, a plausible interpretation of this result is that the class distinction in synthetases is an essential part of the decryption rules for resolving the subset of RNA code ambiguities enciphered by identical acceptor stem determinants of tRNAs acylated by enzymes belonging to the two classes. In evolutionary terms, our findings lend support to the notion that in the pre-DNA world, interactions between tRNA acceptor stems and synthetases formed the basis for the distinction between the two classes; hence, ambiguities in the ancient RNA code were pivotal for the fixation of these enzymes in the genomes of ancestral prokaryotes.

  6. Revisiting the operational RNA code for amino acids: Ensemble attributes and their implications

    PubMed Central

    Shaul, Shaul; Berel, Dror; Benjamini, Yoav; Graur, Dan

    2010-01-01

    It has been suggested that tRNA acceptor stems specify an operational RNA code for amino acids. In the last 20 years several attributes of the putative code have been elucidated for a small number of model organisms. To gain insight about the ensemble attributes of the code, we analyzed 4925 tRNA sequences from 102 bacterial and 21 archaeal species. Here, we used a classification and regression tree (CART) methodology, and we found that the degrees of degeneracy or specificity of the RNA codes in both Archaea and Bacteria differ from those of the genetic code. We found instances of taxon-specific alternative codes, i.e., identical acceptor stem determinants encrypting different amino acids in different species, as well as instances of ambiguity, i.e., identical acceptor stem determinants encrypting two or more amino acids in the same species. When partitioning the data by class of synthetase, the degree of code ambiguity was significantly reduced. In cryptographic terms, a plausible interpretation of this result is that the class distinction in synthetases is an essential part of the decryption rules for resolving the subset of RNA code ambiguities enciphered by identical acceptor stem determinants of tRNAs acylated by enzymes belonging to the two classes. In evolutionary terms, our findings lend support to the notion that in the pre-DNA world, interactions between tRNA acceptor stems and synthetases formed the basis for the distinction between the two classes; hence, ambiguities in the ancient RNA code were pivotal for the fixation of these enzymes in the genomes of ancestral prokaryotes. PMID:19952117

  7. The identification and functional annotation of RNA structures conserved in vertebrates.

    PubMed

    Seemann, Stefan E; Mirza, Aashiq H; Hansen, Claus; Bang-Berthelsen, Claus H; Garde, Christian; Christensen-Dalsgaard, Mikkel; Torarinsson, Elfar; Yao, Zizhen; Workman, Christopher T; Pociot, Flemming; Nielsen, Henrik; Tommerup, Niels; Ruzzo, Walter L; Gorodkin, Jan

    2017-08-01

    Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization, and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for conserved RNA structures (CRSs), leveraging structure-based, rather than sequence-based, alignments. After careful correction for sequence identity and GC content, we predict ∼516,000 human genomic regions containing CRSs. We find that a substantial fraction of human-mouse CRS regions (1) colocalize consistently with binding sites of the same RNA binding proteins (RBPs) or (2) are transcribed in corresponding tissues. Additionally, a CaptureSeq experiment revealed expression of many of our CRS regions in human fetal brain, including 662 novel ones. For selected human and mouse candidate pairs, qRT-PCR and in vitro RNA structure probing supported both shared expression and shared structure despite low abundance and low sequence identity. About 30,000 CRS regions are located near coding or long noncoding RNA genes or within enhancers. Structured (CRS overlapping) enhancer RNAs and extended 3' ends have significantly increased expression levels over their nonstructured counterparts. Our findings of transcribed uncharacterized regulatory regions that contain CRSs support their RNA-mediated functionality. © 2017 Seemann et al.; Published by Cold Spring Harbor Laboratory Press.

  8. Effects of RNA integrity on transcript quantification by total RNA sequencing of clinically collected human placental samples.

    PubMed

    Reiman, Mario; Laan, Maris; Rull, Kristiina; Sõber, Siim

    2017-08-01

    RNA degradation is a ubiquitous process that occurs in living and dead cells, as well as during handling and storage of extracted RNA. Reduced RNA quality caused by degradation is an established source of uncertainty for all RNA-based gene expression quantification techniques. RNA sequencing is an increasingly preferred method for transcriptome analyses, and dependence of its results on input RNA integrity is of significant practical importance. This study aimed to characterize the effects of varying input RNA integrity [estimated as RNA integrity number (RIN)] on transcript level estimates and delineate the characteristic differences between transcripts that differ in degradation rate. The study used ribodepleted total RNA sequencing data from a real-life clinically collected set ( n = 32) of human solid tissue (placenta) samples. RIN-dependent alterations in gene expression profiles were quantified by using DESeq2 software. Our results indicate that small differences in RNA integrity affect gene expression quantification by introducing a moderate and pervasive bias in expression level estimates that significantly affected 8.1% of studied genes. The rapidly degrading transcript pool was enriched in pseudogenes, short noncoding RNAs, and transcripts with extended 3' untranslated regions. Typical slowly degrading transcripts (median length, 2389 nt) represented protein coding genes with 4-10 exons and high guanine-cytosine content.-Reiman, M., Laan, M., Rull, K., Sõber, S. Effects of RNA integrity on transcript quantification by total RNA sequencing of clinically collected human placental samples. © FASEB.

  9. The non-coding RNA landscape of human hematopoiesis and leukemia.

    PubMed

    Schwarzer, Adrian; Emmrich, Stephan; Schmidt, Franziska; Beck, Dominik; Ng, Michelle; Reimer, Christina; Adams, Felix Ferdinand; Grasedieck, Sarah; Witte, Damian; Käbler, Sebastian; Wong, Jason W H; Shah, Anushi; Huang, Yizhou; Jammal, Razan; Maroz, Aliaksandra; Jongen-Lavrencic, Mojca; Schambach, Axel; Kuchenbauer, Florian; Pimanda, John E; Reinhardt, Dirk; Heckl, Dirk; Klusmann, Jan-Henning

    2017-08-09

    Non-coding RNAs have emerged as crucial regulators of gene expression and cell fate decisions. However, their expression patterns and regulatory functions during normal and malignant human hematopoiesis are incompletely understood. Here we present a comprehensive resource defining the non-coding RNA landscape of the human hematopoietic system. Based on highly specific non-coding RNA expression portraits per blood cell population, we identify unique fingerprint non-coding RNAs-such as LINC00173 in granulocytes-and assign these to critical regulatory circuits involved in blood homeostasis. Following the incorporation of acute myeloid leukemia samples into the landscape, we further uncover prognostically relevant non-coding RNA stem cell signatures shared between acute myeloid leukemia blasts and healthy hematopoietic stem cells. Our findings highlight the importance of the non-coding transcriptome in the formation and maintenance of the human blood hierarchy.While micro-RNAs are known regulators of haematopoiesis and leukemogenesis, the role of long non-coding RNAs is less clear. Here the authors provide a non-coding RNA expression landscape of the human hematopoietic system, highlighting their role in the formation and maintenance of the human blood hierarchy.

  10. Co-LncRNA: investigating the lncRNA combinatorial effects in GO annotations and KEGG pathways based on human RNA-Seq data

    PubMed Central

    Zhao, Zheng; Bai, Jing; Wu, Aiwei; Wang, Yuan; Zhang, Jinwen; Wang, Zishan; Li, Yongsheng; Xu, Juan; Li, Xia

    2015-01-01

    Long non-coding RNAs (lncRNAs) are emerging as key regulators of diverse biological processes and diseases. However, the combinatorial effects of these molecules in a specific biological function are poorly understood. Identifying co-expressed protein-coding genes of lncRNAs would provide ample insight into lncRNA functions. To facilitate such an effort, we have developed Co-LncRNA, which is a web-based computational tool that allows users to identify GO annotations and KEGG pathways that may be affected by co-expressed protein-coding genes of a single or multiple lncRNAs. LncRNA co-expressed protein-coding genes were first identified in publicly available human RNA-Seq datasets, including 241 datasets across 6560 total individuals representing 28 tissue types/cell lines. Then, the lncRNA combinatorial effects in a given GO annotations or KEGG pathways are taken into account by the simultaneous analysis of multiple lncRNAs in user-selected individual or multiple datasets, which is realized by enrichment analysis. In addition, this software provides a graphical overview of pathways that are modulated by lncRNAs, as well as a specific tool to display the relevant networks between lncRNAs and their co-expressed protein-coding genes. Co-LncRNA also supports users in uploading their own lncRNA and protein-coding gene expression profiles to investigate the lncRNA combinatorial effects. It will be continuously updated with more human RNA-Seq datasets on an annual basis. Taken together, Co-LncRNA provides a web-based application for investigating lncRNA combinatorial effects, which could shed light on their biological roles and could be a valuable resource for this community. Database URL: http://www.bio-bigdata.com/Co-LncRNA/ PMID:26363020

  11. Transcriptome interrogation of human myometrium identifies differentially expressed sense-antisense pairs of protein-coding and long non-coding RNA genes in spontaneous labor at term.

    PubMed

    Romero, Roberto; Tarca, Adi L; Chaemsaithong, Piya; Miranda, Jezid; Chaiworapongsa, Tinnakorn; Jia, Hui; Hassan, Sonia S; Kalita, Cynthia A; Cai, Juan; Yeo, Lami; Lipovich, Leonard

    2014-09-01

    To identify differentially expressed long non-coding RNA (lncRNA) genes in human myometrium in women with spontaneous labor at term. Myometrium was obtained from women undergoing cesarean deliveries who were not in labor (n = 19) and women in spontaneous labor at term (n = 20). RNA was extracted and profiled using an Illumina® microarray platform. We have used computational approaches to bound the extent of long non-coding RNA representation on this platform, and to identify co-differentially expressed and correlated pairs of long non-coding RNA genes and protein-coding genes sharing the same genomic loci. We identified co-differential expression and correlation at two genomic loci that contain coding-lncRNA gene pairs: SOCS2-AK054607 and LMCD1-NR_024065 in women in spontaneous labor at term. This co-differential expression and correlation was validated by qRT-PCR, an experimental method completely independent of the microarray analysis. Intriguingly, one of the two lncRNA genes differentially expressed in term labor had a key genomic structure element, a splice site, that lacked evolutionary conservation beyond primates. We provide, for the first time, evidence for coordinated differential expression and correlation of cis-encoded antisense lncRNAs and protein-coding genes with known as well as novel roles in pregnancy in the myometrium of women in spontaneous labor at term.

  12. Conserved small mRNA with an unique, extended Shine-Dalgarno sequence

    PubMed Central

    Hahn, Julia; Migur, Anzhela; von Boeselager, Raphael Freiherr; Kubatova, Nina; Kubareva, Elena; Schwalbe, Harald

    2017-01-01

    ABSTRACT Up to now, very small protein-coding genes have remained unrecognized in sequenced genomes. We identified an mRNA of 165 nucleotides (nt), which is conserved in Bradyrhizobiaceae and encodes a polypeptide with 14 amino acid residues (aa). The small mRNA harboring a unique Shine-Dalgarno sequence (SD) with a length of 17 nt was localized predominantly in the ribosome-containing P100 fraction of Bradyrhizobium japonicum USDA 110. Strong interaction between the mRNA and 30S ribosomal subunits was demonstrated by their co-sedimentation in sucrose density gradient. Using translational fusions with egfp, we detected weak translation and found that it is impeded by both the extended SD and the GTG start codon (instead of ATG). Biophysical characterization (CD- and NMR-spectroscopy) showed that synthesized polypeptide remained unstructured in physiological puffer. Replacement of the start codon by a stop codon increased the stability of the transcript, strongly suggesting additional posttranscriptional regulation at the ribosome. Therefore, the small gene was named rreB (ribosome-regulated expression in Bradyrhizobiaceae). Assuming that the unique ribosome binding site (RBS) is a hallmark of rreB homologs or similarly regulated genes, we looked for similar putative RBS in bacterial genomes and detected regions with at least 16 nt complementarity to the 3′-end of 16S rRNA upstream of sORFs in Caulobacterales, Rhizobiales, Rhodobacterales and Rhodospirillales. In the Rhodobacter/Roseobacter lineage of α-proteobacteria the corresponding gene (rreR) is conserved and encodes an 18 aa protein. This shows how specific RBS features can be used to identify new genes with presumably similar control of expression at the RNA level. PMID:27834614

  13. Three-Dimensional Algebraic Models of the tRNA Code and 12 Graphs for Representing the Amino Acids.

    PubMed

    José, Marco V; Morgado, Eberto R; Guimarães, Romeu Cardoso; Zamudio, Gabriel S; de Farías, Sávio Torres; Bobadilla, Juan R; Sosa, Daniela

    2014-08-11

    Three-dimensional algebraic models, also called Genetic Hotels, are developed to represent the Standard Genetic Code, the Standard tRNA Code (S-tRNA-C), and the Human tRNA code (H-tRNA-C). New algebraic concepts are introduced to be able to describe these models, to wit, the generalization of the 2n-Klein Group and the concept of a subgroup coset with a tail. We found that the H-tRNA-C displayed broken symmetries in regard to the S-tRNA-C, which is highly symmetric. We also show that there are only 12 ways to represent each of the corresponding phenotypic graphs of amino acids. The averages of statistical centrality measures of the 12 graphs for each of the three codes are carried out and they are statistically compared. The phenotypic graphs of the S-tRNA-C display a common triangular prism of amino acids in 10 out of the 12 graphs, whilst the corresponding graphs for the H-tRNA-C display only two triangular prisms. The graphs exhibit disjoint clusters of amino acids when their polar requirement values are used. We contend that the S-tRNA-C is in a frozen-like state, whereas the H-tRNA-C may be in an evolving state.

  14. Long Non-Coding RNAs (lncRNAs) of Sea Cucumber: Large-Scale Prediction, Expression Profiling, Non-Coding Network Construction, and lncRNA-microRNA-Gene Interaction Analysis of lncRNAs in Apostichopus japonicus and Holothuria glaberrima During LPS Challenge and Radial Organ Complex Regeneration.

    PubMed

    Mu, Chuang; Wang, Ruijia; Li, Tianqi; Li, Yuqiang; Tian, Meilin; Jiao, Wenqian; Huang, Xiaoting; Zhang, Lingling; Hu, Xiaoli; Wang, Shi; Bao, Zhenmin

    2016-08-01

    Long non-coding RNA (lncRNA) structurally resembles mRNA but cannot be translated into protein. Although the systematic identification and characterization of lncRNAs have been increasingly reported in model species, information concerning non-model species is still lacking. Here, we report the first systematic identification and characterization of lncRNAs in two sea cucumber species: (1) Apostichopus japonicus during lipopolysaccharide (LPS) challenge and in heathy tissues and (2) Holothuria glaberrima during radial organ complex regeneration, using RNA-seq datasets and bioinformatics analysis. We identified A. japonicus and H. glaberrima lncRNAs that were differentially expressed during LPS challenge and radial organ complex regeneration, respectively. Notably, the predicted lncRNA-microRNA-gene trinities revealed that, in addition to targeting protein-coding transcripts, miRNAs might also target lncRNAs, thereby participating in a potential novel layer of regulatory interactions among non-coding RNA classes in echinoderms. Furthermore, the constructed coding-non-coding network implied the potential involvement of lncRNA-gene interactions during the regulation of several important genes (e.g., Toll-like receptor 1 [TLR1] and transglutaminase-1 [TGM1]) in response to LPS challenge and radial organ complex regeneration in sea cucumbers. Overall, this pioneer systematic identification, annotation, and characterization of lncRNAs in echinoderm pave the way for similar studies and future genetic, genomic, and evolutionary research in non-model species.

  15. [Long non-coding RNAs in the pathophysiology of atherosclerosis].

    PubMed

    Novak, Jan; Vašků, Julie Bienertová; Souček, Miroslav

    2018-01-01

    The human genome contains about 22 000 protein-coding genes that are transcribed to an even larger amount of messenger RNAs (mRNA). Interestingly, the results of the project ENCODE from 2012 show, that despite up to 90 % of our genome being actively transcribed, protein-coding mRNAs make up only 2-3 % of the total amount of the transcribed RNA. The rest of RNA transcripts is not translated to proteins and that is why they are referred to as "non-coding RNAs". Earlier the non-coding RNA was considered "the dark matter of genome", or "the junk", whose genes has accumulated in our DNA during the course of evolution. Today we already know that non-coding RNAs fulfil a variety of regulatory functions in our body - they intervene into epigenetic processes from chromatin remodelling to histone methylation, or into the transcription process itself, or even post-transcription processes. Long non-coding RNAs (lncRNA) are one of the classes of non-coding RNAs that have more than 200 nucleotides in length (non-coding RNAs with less than 200 nucleotides in length are called small non-coding RNAs). lncRNAs represent a widely varied and large group of molecules with diverse regulatory functions. We can identify them in all thinkable cell types or tissues, or even in an extracellular space, which includes blood, specifically plasma. Their levels change during the course of organogenesis, they are specific to different tissues and their changes also occur along with the development of different illnesses, including atherosclerosis. This review article aims to present lncRNAs problematics in general and then focuses on some of their specific representatives in relation to the process of atherosclerosis (i.e. we describe lncRNA involvement in the biology of endothelial cells, vascular smooth muscle cells or immune cells), and we further describe possible clinical potential of lncRNA, whether in diagnostics or therapy of atherosclerosis and its clinical manifestations.Key words: atherosclerosis - lincRNA - lncRNA - MALAT - MIAT.

  16. sRNAdb: A small non-coding RNA database for gram-positive bacteria

    PubMed Central

    2012-01-01

    Background The class of small non-coding RNA molecules (sRNA) regulates gene expression by different mechanisms and enables bacteria to mount a physiological response due to adaptation to the environment or infection. Over the last decades the number of sRNAs has been increasing rapidly. Several databases like Rfam or fRNAdb were extended to include sRNAs as a class of its own. Furthermore new specialized databases like sRNAMap (gram-negative bacteria only) and sRNATarBase (target prediction) were established. To the best of the authors’ knowledge no database focusing on sRNAs from gram-positive bacteria is publicly available so far. Description In order to understand sRNA’s functional and phylogenetic relationships we have developed sRNAdb and provide tools for data analysis and visualization. The data compiled in our database is assembled from experiments as well as from bioinformatics analyses. The software enables comparison and visualization of gene loci surrounding the sRNAs of interest. To accomplish this, we use a client–server based approach. Offline versions of the database including analyses and visualization tools can easily be installed locally on the user’s computer. This feature facilitates customized local addition of unpublished sRNA candidates and related information such as promoters or terminators using tab-delimited files. Conclusion sRNAdb allows a user-friendly and comprehensive comparative analysis of sRNAs from available sequenced gram-positive prokaryotic replicons. Offline versions including analysis and visualization tools facilitate complex user specific bioinformatics analyses. PMID:22883983

  17. Pertussis toxin export genes are regulated by the ptx promoter and may be required for efficient translation of ptx mRNA in Bordetella pertussis.

    PubMed Central

    Baker, S M; Masi, A; Liu, D F; Novitsky, B K; Deich, R A

    1995-01-01

    The gene products from an 8-kb region adjacent to the 3' end of the ptx operon are required by Bordetella pertussis for the export of pertussis holotoxin. At least one of these gene products (PtlC) is specifically required for the export of assembled holotoxin from the periplasmic space. ptlC mutants exhibit a 20-fold reduction in the amount of holotoxin present in the culture supernatant but have no effect upon the assembly or steady-state level of holotoxin present in the periplasmic space. Impaired export of holotoxin from the ptlC strain blocks expression of toxin at a posttranscriptional level, and wild-type levels of ptx mRNA are detected in the mutant strain. The transcription of ptl is subject to modulation by MgSO4 in the same manner as ptx is; however, in B. pertussis strains containing an E. coli tac promoter in place of the native ptx promoter, wild-type levels of ptx mRNA are present and holotoxin is synthesized and exported even in the presence of MgSO4. Promoter mapping of the region extending from the ptxS3 coding region to the ptlC coding region failed to detect the ptl transcription initiation site. Additional RNase protection experiments with ptx promoter deletion and substitution strains indicate that the ptl operon is transcribed from the ptx promoter as part of a > 11-kb mRNA. PMID:7558300

  18. The analysis of the complete mitochondrial genome of Lecanicillium muscarium (synonym Verticillium lecanii) suggests a minimum common gene organization in mtDNAs of Sordariomycetes: phylogenetic implications.

    PubMed

    Kouvelis, Vassili N; Ghikas, Dimitri V; Typas, Milton A

    2004-10-01

    The mitochondrial genome (mtDNA) of the entomopathogenic fungus Lecanicillium muscarium (synonym Verticillium lecanii) with a total size of 24,499-bp has been analyzed. So far, it is the smallest known mitochondrial genome among Pezizomycotina, with an extremely compact gene organization and only one group-I intron in its large ribosomal RNA (rnl) gene. It contains the 14 typical genes coding for proteins related to oxidative phosphorylation, the two rRNA genes, one intronic ORF coding for a possible ribosomal protein (rps), and a set of 25 tRNA genes which recognize codons for all amino acids, except alanine and cysteine. All genes are transcribed from the same DNA strand. Gene order comparison with all available complete fungal mtDNAs-representatives of all four Phyla are included-revealed some characteristic common features like uninterrupted gene pairs, overlapping genes, and extremely variable intergenic regions, that can all be exploited for the study of fungal mitochondrial genomes. Moreover, a minimum common mtDNA gene order could be detected, in two units, for all known Sordariomycetes namely nad1-nad4-atp8-atp6 and rns-cox3-rnl, which can be extended in Hypocreales, to nad4L-nad5-cob-cox1-nad1-nad4-atp8-atp6 and rns-cox3-rnl nad2-nad3, respectively. Phylogenetic analysis of all fungal mtDNA essential protein-coding genes as one unit, clearly demonstrated the superiority of small genome (mtDNA) over single gene comparisons.

  19. Fragmentation of tRNA in Phytophthora infestans asexual life cycle stages and during host plant infection.

    PubMed

    Åsman, Anna K M; Vetukuri, Ramesh R; Jahan, Sultana N; Fogelqvist, Johan; Corcoran, Pádraic; Avrova, Anna O; Whisson, Stephen C; Dixelius, Christina

    2014-12-10

    The oomycete Phytophthora infestans possesses active RNA silencing pathways, which presumably enable this plant pathogen to control the large numbers of transposable elements present in its 240 Mb genome. Small RNAs (sRNAs), central molecules in RNA silencing, are known to also play key roles in this organism, notably in regulation of critical effector genes needed for infection of its potato host. To identify additional classes of sRNAs in oomycetes, we mapped deep sequencing reads to transfer RNAs (tRNAs) thereby revealing the presence of 19-40 nt tRNA-derived RNA fragments (tRFs). Northern blot analysis identified abundant tRFs corresponding to half tRNA molecules. Some tRFs accumulated differentially during infection, as seen by examining sRNAs sequenced from P. infestans-potato interaction libraries. The putative connection between tRF biogenesis and the canonical RNA silencing pathways was investigated by employing hairpin RNA-mediated RNAi to silence the genes encoding P. infestans Argonaute (PiAgo) and Dicer (PiDcl) endoribonucleases. By sRNA sequencing we show that tRF accumulation is PiDcl1-independent, while Northern hybridizations detected reduced levels of specific tRNA-derived species in the PiAgo1 knockdown line. Our findings extend the sRNA diversity in oomycetes to include fragments derived from non-protein-coding RNA transcripts and identify tRFs with elevated levels during infection of potato by P. infestans.

  20. tRNA acceptor-stem and anticodon bases embed separate features of amino acid chemistry

    PubMed Central

    Carter, Charles W.; Wolfenden, Richard

    2016-01-01

    abstract The universal genetic code is a translation table by which nucleic acid sequences can be interpreted as polypeptides with a wide range of biological functions. That information is used by aminoacyl-tRNA synthetases to translate the code. Moreover, amino acid properties dictate protein folding. We recently reported that digital correlation techniques could identify patterns in tRNA identity elements that govern recognition by synthetases. Our analysis, and the functionality of truncated synthetases that cannot recognize the tRNA anticodon, support the conclusion that the tRNA acceptor stem houses an independent code for the same 20 amino acids that likely functioned earlier in the emergence of genetics. The acceptor-stem code, related to amino acid size, is distinct from a code in the anticodon that is related to amino acid polarity. Details of the acceptor-stem code suggest that it was useful in preserving key properties of stereochemically-encoded peptides that had developed the capacity to interact catalytically with RNA. The quantitative embedding of the chemical properties of amino acids into tRNA bases has implications for the origins of molecular biology. PMID:26595350

  1. Evaluation of non-coding variation in GLUT1 deficiency.

    PubMed

    Liu, Yu-Chi; Lee, Jia Wei Audrey; Bellows, Susannah T; Damiano, John A; Mullen, Saul A; Berkovic, Samuel F; Bahlo, Melanie; Scheffer, Ingrid E; Hildebrand, Michael S

    2016-12-01

    Loss-of-function mutations in SLC2A1, encoding glucose transporter-1 (GLUT-1), lead to dysfunction of glucose transport across the blood-brain barrier. Ten percent of cases with hypoglycorrhachia (fasting cerebrospinal fluid [CSF] glucose <2.2mmol/L) do not have mutations. We hypothesized that GLUT1 deficiency could be due to non-coding SLC2A1 variants. We performed whole exome sequencing of one proband with a GLUT1 phenotype and hypoglycorrhachia negative for SLC2A1 sequencing and copy number variants. We studied a further 55 patients with different epilepsies and low CSF glucose who did not have exonic mutations or copy number variants. We sequenced non-coding promoter and intronic regions. We performed mRNA studies for the recurrent intronic variant. The proband had a de novo splice site mutation five base pairs from the intron-exon boundary. Three of 55 patients had deep intronic SLC2A1 variants, including a recurrent variant in two. The recurrent variant produced less SLC2A1 mRNA transcript. Fasting CSF glucose levels show an age-dependent correlation, which makes the definition of hypoglycorrhachia challenging. Low CSF glucose levels may be associated with pathogenic SLC2A1 mutations including deep intronic SLC2A1 variants. Extending genetic screening to non-coding regions will enable diagnosis of more patients with GLUT1 deficiency, allowing implementation of the ketogenic diet to improve outcomes. © 2016 Mac Keith Press.

  2. The agents of natural genome editing.

    PubMed

    Witzany, Guenther

    2011-06-01

    The DNA serves as a stable information storage medium and every protein which is needed by the cell is produced from this blueprint via an RNA intermediate code. More recently it was found that an abundance of various RNA elements cooperate in a variety of steps and substeps as regulatory and catalytic units with multiple competencies to act on RNA transcripts. Natural genome editing on one side is the competent agent-driven generation and integration of meaningful DNA nucleotide sequences into pre-existing genomic content arrangements, and the ability to (re-)combine and (re-)regulate them according to context-dependent (i.e. adaptational) purposes of the host organism. Natural genome editing on the other side designates the integration of all RNA activities acting on RNA transcripts without altering DNA-encoded genes. If we take the genetic code seriously as a natural code, there must be agents that are competent to act on this code because no natural code codes itself as no natural language speaks itself. As code editing agents, viral and subviral agents have been suggested because there are several indicators that demonstrate viruses competent in both RNA and DNA natural genome editing.

  3. Bijective transformation circular codes and nucleotide exchanging RNA transcription.

    PubMed

    Michel, Christian J; Seligmann, Hervé

    2014-04-01

    The C(3) self-complementary circular code X identified in genes of prokaryotes and eukaryotes is a set of 20 trinucleotides enabling reading frame retrieval and maintenance, i.e. a framing code (Arquès and Michel, 1996; Michel, 2012, 2013). Some mitochondrial RNAs correspond to DNA sequences when RNA transcription systematically exchanges between nucleotides (Seligmann, 2013a,b). We study here the 23 bijective transformation codes ΠX of X which may code nucleotide exchanging RNA transcription as suggested by this mitochondrial observation. The 23 bijective transformation codes ΠX are C(3) trinucleotide circular codes, seven of them are also self-complementary. Furthermore, several correlations are observed between the Reading Frame Retrieval (RFR) probability of bijective transformation codes ΠX and the different biological properties of ΠX related to their numbers of RNAs in GenBank's EST database, their polymerization rate, their number of amino acids and the chirality of amino acids they code. Results suggest that the circular code X with the functions of reading frame retrieval and maintenance in regular RNA transcription, may also have, through its bijective transformation codes ΠX, the same functions in nucleotide exchanging RNA transcription. Associations with properties such as amino acid chirality suggest that the RFR of X and its bijective transformations molded the origins of the genetic code's machinery. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  4. The Bean Pod Mottle Virus RNA2-Encoded 58-Kilodalton Protein P58 Is Required in cis for RNA2 Accumulation

    PubMed Central

    Lin, Junyan; Guo, Jiangbo; Finer, John; Dorrance, Anne E.; Redinbaugh, Margaret G.

    2014-01-01

    ABSTRACT Bean pod mottle virus (BPMV) is a bipartite, positive-sense (+) RNA plant virus in the Secoviridae family. Its RNA1 encodes proteins required for genome replication, whereas RNA2 primarily encodes proteins needed for virion assembly and cell-to-cell movement. However, the function of a 58-kDa protein (P58) encoded by RNA2 has not been resolved. P58 and the movement protein (MP) of BPMV are two largely identical proteins differing only at their N termini, with P58 extending MP upstream by 102 amino acid residues. In this report, we unveil a unique role for P58. We show that BPMV RNA2 accumulation in infected cells was abolished when the start codon of P58 was eliminated. The role of P58 does not require the region shared by MP, as RNA2 accumulation in individual cells remained robust even when most of the MP coding sequence was removed. Importantly, the function of P58 required the P58 protein, rather than its coding RNA, as compensatory mutants could be isolated that restored RNA2 accumulation by acquiring new start codons upstream of the original one. Most strikingly, loss of P58 function could not be complemented by P58 provided in trans, suggesting that P58 functions in cis to selectively promote the accumulation of RNA2 copies that encode a functional P58 protein. Finally, we found that all RNA1-encoded proteins are cis-acting relative to RNA1. Together, our results suggest that P58 probably functions by recruiting the RNA1-encoded polyprotein to RNA2 to enable RNA2 reproduction. IMPORTANCE Bean pod mottle virus (BPMV) is one of the most important pathogens of the crop plant soybean, yet its replication mechanism is not well understood, hindering the development of knowledge-based control measures. The current study examined the replication strategy of BPMV RNA2, one of the two genomic RNA segments of this virus, and established an essential role for P58, one of the RNA2-encoded proteins, in the process of RNA2 replication. Our study demonstrates for the first time that P58 functions preferentially with the very RNA from which it is translated, thus greatly advancing our understanding of the replication mechanisms of this and related viruses. Furthermore, this study is important because it provides a potential target for BPMV-specific control, and hence could help to mitigate soybean production losses caused by this virus. PMID:24390330

  5. RBind: computational network method to predict RNA binding sites.

    PubMed

    Wang, Kaili; Jian, Yiren; Wang, Huiwen; Zeng, Chen; Zhao, Yunjie

    2018-04-26

    Non-coding RNA molecules play essential roles by interacting with other molecules to perform various biological functions. However, it is difficult to determine RNA structures due to their flexibility. At present, the number of experimentally solved RNA-ligand and RNA-protein structures is still insufficient. Therefore, binding sites prediction of non-coding RNA is required to understand their functions. Current RNA binding site prediction algorithms produce many false positive nucleotides that are distance away from the binding sites. Here, we present a network approach, RBind, to predict the RNA binding sites. We benchmarked RBind in RNA-ligand and RNA-protein datasets. The average accuracy of 0.82 in RNA-ligand and 0.63 in RNA-protein testing showed that this network strategy has a reliable accuracy for binding sites prediction. The codes and datasets are available at https://zhaolab.com.cn/RBind. yjzhaowh@mail.ccnu.edu.cn. Supplementary data are available at Bioinformatics online.

  6. Dengue Non-coding RNA: TRIMmed for Transmission.

    PubMed

    Göertz, Giel P; Pijlman, Gorben P

    2015-08-12

    Dengue virus RNA is trimmed by the 5'→3' exoribonuclease XRN1 to produce an abundant, non-coding subgenomic flavivirus RNA (sfRNA) in infected cells. In a recent paper in Science, Manokaran et al. (2015) report that sfRNA binds TRIM25 to evade innate immune sensing of viral RNA by RIG-I. Copyright © 2015 Elsevier Inc. All rights reserved.

  7. Three-Dimensional Algebraic Models of the tRNA Code and 12 Graphs for Representing the Amino Acids

    PubMed Central

    José, Marco V.; Morgado, Eberto R.; Guimarães, Romeu Cardoso; Zamudio, Gabriel S.; de Farías, Sávio Torres; Bobadilla, Juan R.; Sosa, Daniela

    2014-01-01

    Three-dimensional algebraic models, also called Genetic Hotels, are developed to represent the Standard Genetic Code, the Standard tRNA Code (S-tRNA-C), and the Human tRNA code (H-tRNA-C). New algebraic concepts are introduced to be able to describe these models, to wit, the generalization of the 2n-Klein Group and the concept of a subgroup coset with a tail. We found that the H-tRNA-C displayed broken symmetries in regard to the S-tRNA-C, which is highly symmetric. We also show that there are only 12 ways to represent each of the corresponding phenotypic graphs of amino acids. The averages of statistical centrality measures of the 12 graphs for each of the three codes are carried out and they are statistically compared. The phenotypic graphs of the S-tRNA-C display a common triangular prism of amino acids in 10 out of the 12 graphs, whilst the corresponding graphs for the H-tRNA-C display only two triangular prisms. The graphs exhibit disjoint clusters of amino acids when their polar requirement values are used. We contend that the S-tRNA-C is in a frozen-like state, whereas the H-tRNA-C may be in an evolving state. PMID:25370377

  8. The development of non-coding RNA ontology.

    PubMed

    Huang, Jingshan; Eilbeck, Karen; Smith, Barry; Blake, Judith A; Dou, Dejing; Huang, Weili; Natale, Darren A; Ruttenberg, Alan; Huan, Jun; Zimmermann, Michael T; Jiang, Guoqian; Lin, Yu; Wu, Bin; Strachan, Harrison J; de Silva, Nisansa; Kasukurthi, Mohan Vamsi; Jha, Vikash Kumar; He, Yongqun; Zhang, Shaojie; Wang, Xiaowei; Liu, Zixing; Borchert, Glen M; Tan, Ming

    2016-01-01

    Identification of non-coding RNAs (ncRNAs) has been significantly improved over the past decade. On the other hand, semantic annotation of ncRNA data is facing critical challenges due to the lack of a comprehensive ontology to serve as common data elements and data exchange standards in the field. We developed the Non-Coding RNA Ontology (NCRO) to handle this situation. By providing a formally defined ncRNA controlled vocabulary, the NCRO aims to fill a specific and highly needed niche in semantic annotation of large amounts of ncRNA biological and clinical data.

  9. a Simple Symmetric Algorithm Using a Likeness with Introns Behavior in RNA Sequences

    NASA Astrophysics Data System (ADS)

    Regoli, Massimo

    2009-02-01

    The RNA-Crypto System (shortly RCS) is a symmetric key algorithm to cipher data. The idea for this new algorithm starts from the observation of nature. In particular from the observation of RNA behavior and some of its properties. The RNA sequences has some sections called Introns. Introns, derived from the term "intragenic regions", are non-coding sections of precursor mRNA (pre-mRNA) or other RNAs, that are removed (spliced out of the RNA) before the mature RNA is formed. Once the introns have been spliced out of a pre-mRNA, the resulting mRNA sequence is ready to be translated into a protein. The corresponding parts of a gene are known as introns as well. The nature and the role of Introns in the pre-mRNA is not clear and it is under ponderous researches by Biologists but, in our case, we will use the presence of Introns in the RNA-Crypto System output as a strong method to add chaotic non coding information and an unnecessary behaviour in the access to the secret key to code the messages. In the RNA-Crypto System algoritnm the introns are sections of the ciphered message with non-coding information as well as in the precursor mRNA.

  10. Co-LncRNA: investigating the lncRNA combinatorial effects in GO annotations and KEGG pathways based on human RNA-Seq data.

    PubMed

    Zhao, Zheng; Bai, Jing; Wu, Aiwei; Wang, Yuan; Zhang, Jinwen; Wang, Zishan; Li, Yongsheng; Xu, Juan; Li, Xia

    2015-01-01

    Long non-coding RNAs (lncRNAs) are emerging as key regulators of diverse biological processes and diseases. However, the combinatorial effects of these molecules in a specific biological function are poorly understood. Identifying co-expressed protein-coding genes of lncRNAs would provide ample insight into lncRNA functions. To facilitate such an effort, we have developed Co-LncRNA, which is a web-based computational tool that allows users to identify GO annotations and KEGG pathways that may be affected by co-expressed protein-coding genes of a single or multiple lncRNAs. LncRNA co-expressed protein-coding genes were first identified in publicly available human RNA-Seq datasets, including 241 datasets across 6560 total individuals representing 28 tissue types/cell lines. Then, the lncRNA combinatorial effects in a given GO annotations or KEGG pathways are taken into account by the simultaneous analysis of multiple lncRNAs in user-selected individual or multiple datasets, which is realized by enrichment analysis. In addition, this software provides a graphical overview of pathways that are modulated by lncRNAs, as well as a specific tool to display the relevant networks between lncRNAs and their co-expressed protein-coding genes. Co-LncRNA also supports users in uploading their own lncRNA and protein-coding gene expression profiles to investigate the lncRNA combinatorial effects. It will be continuously updated with more human RNA-Seq datasets on an annual basis. Taken together, Co-LncRNA provides a web-based application for investigating lncRNA combinatorial effects, which could shed light on their biological roles and could be a valuable resource for this community. Database URL: http://www.bio-bigdata.com/Co-LncRNA/. © The Author(s) 2015. Published by Oxford University Press.

  11. In silico screening of the chicken genome for overlaps between genomic regions: microRNA genes, coding and non-coding transcriptional units, QTL, and genetic variations.

    PubMed

    Zorc, Minja; Kunej, Tanja

    2016-05-01

    MicroRNAs (miRNAs) are a class of non-coding RNAs involved in posttranscriptional regulation of target genes. Regulation requires complementarity between target mRNA and the mature miRNA seed region, responsible for their recognition and binding. It has been estimated that each miRNA targets approximately 200 genes, and genetic variability of miRNA genes has been reported to affect phenotypic variability and disease susceptibility in humans, livestock species, and model organisms. Polymorphisms in miRNA genes could therefore represent biomarkers for phenotypic traits in livestock animals. In our previous study, we collected polymorphisms within miRNA genes in chicken. In the present study, we identified miRNA-related genomic overlaps to prioritize genomic regions of interest for further functional studies and biomarker discovery. Overlapping genomic regions in chicken were analyzed using the following bioinformatics tools and databases: miRNA SNiPer, Ensembl, miRBase, NCBI Blast, and QTLdb. Out of 740 known pre-miRNA genes, 263 (35.5 %) contain polymorphisms; among them, 35 contain more than three polymorphisms The most polymorphic miRNA genes in chicken are gga-miR-6662, containing 23 single nucleotide polymorphisms (SNPs) within the pre-miRNA region, including five consecutive SNPs, and gga-miR-6688, containing ten polymorphisms including three consecutive polymorphisms. Several miRNA-related genomic hotspots have been revealed in chicken genome; polymorphic miRNA genes are located within protein-coding and/or non-coding transcription units and quantitative trait loci (QTL) associated with production traits. The present study includes the first description of an exonic miRNA in a chicken genome, an overlap between the miRNA gene and the exon of the protein-coding gene (gga-miR-6578/HADHB), and the first report of a missense polymorphism located within a mature miRNA seed region. Identified miRNA-related genomic hotspots in chicken can serve researchers as a starting point for further functional studies and association studies with poultry production and health traits and the basis for systematic screening of exonic miRNAs and missense/miRNA seed polymorphisms in other genomes.

  12. Intergenic Transcriptional Interference Is Blocked by RNA Polymerase III Transcription Factor TFIIIB in Saccharomyces cerevisiae

    PubMed Central

    Korde, Asawari; Rosselot, Jessica M.; Donze, David

    2014-01-01

    The major function of eukaryotic RNA polymerase III is to transcribe transfer RNA, 5S ribosomal RNA, and other small non-protein-coding RNA molecules. Assembly of the RNA polymerase III complex on chromosomal DNA requires the sequential binding of transcription factor complexes TFIIIC and TFIIIB. Recent evidence has suggested that in addition to producing RNA transcripts, chromatin-assembled RNA polymerase III complexes may mediate additional nuclear functions that include chromatin boundary, nucleosome phasing, and general genome organization activities. This study provides evidence of another such “extratranscriptional” activity of assembled RNA polymerase III complexes, which is the ability to block progression of intergenic RNA polymerase II transcription. We demonstrate that the RNA polymerase III complex bound to the tRNA gene upstream of the Saccharomyces cerevisiae ATG31 gene protects the ATG31 promoter against readthrough transcriptional interference from the upstream noncoding intergenic SUT467 transcription unit. This protection is predominately mediated by binding of the TFIIIB complex. When TFIIIB binding to this tRNA gene is weakened, an extended SUT467–ATG31 readthrough transcript is produced, resulting in compromised ATG31 translation. Since the ATG31 gene product is required for autophagy, strains expressing the readthrough transcript exhibit defective autophagy induction and reduced fitness under autophagy-inducing nitrogen starvation conditions. Given the recent discovery of widespread pervasive transcription in all forms of life, protection of neighboring genes from intergenic transcriptional interference may be a key extratranscriptional function of assembled RNA polymerase III complexes and possibly other DNA binding proteins. PMID:24336746

  13. The First Mitochondrial Genome for the Superfamily Hagloidea and Implications for Its Systematic Status in Ensifera

    PubMed Central

    Zhou, Zhijun; Shi, Fuming; Zhao, Ling

    2014-01-01

    Hagloidea Handlirsch, 1906 was an ancient group of Ensifera, that was much more diverse in the past extending at least into the Triassic, apparently diminishing in diversity through the Cretaceous, and now only represented by a few extant species. In this paper, we report the complete mitochondrial genome (mitogenome) of Tarragoilus diuturnus Gorochov, 2001, representing the first mitogenome of the superfamily Hagloidea. The size of the entire mitogenome of T. diuturnus is 16144 bp, containing 13 protein-coding genes (PCGs), 2 ribosomal RNA (rRNA) genes, 22 transfer RNA (tRNA) genes and one control region. The order and orientation of the gene arrangement pattern is identical to that of D. yakuba and most ensiferans species. A phylogenomic analysis was carried out based on the concatenated dataset of 13 PCGs and 2 rRNA genes from mitogenome sequences of 15 ensiferan species, comprising four superfamilies Grylloidea, Tettigonioidae, Rhaphidophoroidea and Hagloidea. Both maximum likelihood and Bayesian inference analyses strongly support Hagloidea T. diuturnus and Rhaphidophoroidea Troglophilus neglectus as forming a monophyletic group, sister to the Tettigonioidea. The relationships among four superfamilies of Ensifera were (Grylloidea, (Tettigonioidea, (Hagloidea, Rhaphidophoroidea))). PMID:24465850

  14. S6K2-mediated regulation of TRBP as a determinant of miRNA expression in human primary lymphatic endothelial cells

    PubMed Central

    Warner, Matthew J.; Bridge, Katherine S.; Hewitson, James P.; Hodgkinson, Michael R.; Heyam, Alex; Massa, Bailey C.; Haslam, Jessica C.; Chatzifrangkeskou, Maria; Evans, Gareth J.O.; Plevin, Michael J.; Sharp, Tyson V.; Lagos, Dimitris

    2016-01-01

    MicroRNAs (miRNAs) are short non-coding RNAs that silence mRNAs. They are generated following transcription and cleavage by the DROSHA/DGCR8 and DICER/TRBP/PACT complexes. Although it is known that components of the miRNA biogenesis machinery can be phosphorylated, it remains poorly understood how these events become engaged during physiological cellular activation. We demonstrate that S6 kinases can phosphorylate the extended C-terminal domain of TRBP and interact with TRBP in situ in primary cells. TRBP serines 283/286 are essential for S6K-mediated TRBP phosphorylation, optimal expression of TRBP, and the S6K-TRBP interaction in human primary cells. We demonstrate the functional relevance of this interaction in primary human dermal lymphatic endothelial cells (HDLECs). Angiopoietin-1 (ANG1) can augment miRNA biogenesis in HDLECs through enhancing TRBP phosphorylation and expression in an S6K2-dependent manner. We propose that the S6K2/TRBP node controls miRNA biogenesis in HDLECs and provides a molecular link between the mTOR pathway and the miRNA biogenesis machinery. PMID:27407113

  15. The small RNA complement of adult Schistosoma haematobium.

    PubMed

    Stroehlein, Andreas J; Young, Neil D; Korhonen, Pasi K; Hall, Ross S; Jex, Aaron R; Webster, Bonnie L; Rollinson, David; Brindley, Paul J; Gasser, Robin B

    2018-05-01

    Blood flukes of the genus Schistosoma cause schistosomiasis-a neglected tropical disease (NTD) that affects more than 200 million people worldwide. Studies of schistosome genomes have improved our understanding of the molecular biology of flatworms, but most of them have focused largely on protein-coding genes. Small non-coding RNAs (sncRNAs) have been explored in selected schistosome species and are suggested to play essential roles in the post-transcriptional regulation of genes, and in modulating flatworm-host interactions. However, genome-wide small RNA data are currently lacking for key schistosomes including Schistosoma haematobium-the causative agent of urogenital schistosomiasis of humans. MicroRNAs (miRNAs) and other sncRNAs of male and female adults of S. haematobium and small RNA transcription levels were explored by deep sequencing, genome mapping and detailed bioinformatic analyses. In total, 89 transcribed miRNAs were identified in S. haematobium-a similar complement to those reported for the congeners S. mansoni and S. japonicum. Of these miRNAs, 34 were novel, with no homologs in other schistosomes. Most miRNAs (n = 64) exhibited sex-biased transcription, suggestive of roles in sexual differentiation, pairing of adult worms and reproductive processes. Of the sncRNAs that were not miRNAs, some related to the spliceosome (n = 21), biogenesis of other RNAs (n = 3) or ribozyme functions (n = 16), whereas most others (n = 3798) were novel ('orphans') with unknown functions. This study provides the first genome-wide sncRNA resource for S. haematobium, extending earlier studies of schistosomes. The present work should facilitate the future curation and experimental validation of sncRNA functions in schistosomes to enhance our understanding of post-transcriptional gene regulation and of the roles that sncRNAs play in schistosome reproduction, development and parasite-host cross-talk.

  16. RNAPattMatch: a web server for RNA sequence/structure motif detection based on pattern matching with flexible gaps

    PubMed Central

    Drory Retwitzer, Matan; Polishchuk, Maya; Churkin, Elena; Kifer, Ilona; Yakhini, Zohar; Barash, Danny

    2015-01-01

    Searching for RNA sequence-structure patterns is becoming an essential tool for RNA practitioners. Novel discoveries of regulatory non-coding RNAs in targeted organisms and the motivation to find them across a wide range of organisms have prompted the use of computational RNA pattern matching as an enhancement to sequence similarity. State-of-the-art programs differ by the flexibility of patterns allowed as queries and by their simplicity of use. In particular—no existing method is available as a user-friendly web server. A general program that searches for RNA sequence-structure patterns is RNA Structator. However, it is not available as a web server and does not provide the option to allow flexible gap pattern representation with an upper bound of the gap length being specified at any position in the sequence. Here, we introduce RNAPattMatch, a web-based application that is user friendly and makes sequence/structure RNA queries accessible to practitioners of various background and proficiency. It also extends RNA Structator and allows a more flexible variable gaps representation, in addition to analysis of results using energy minimization methods. RNAPattMatch service is available at http://www.cs.bgu.ac.il/rnapattmatch. A standalone version of the search tool is also available to download at the site. PMID:25940619

  17. microRNA in Cerebral Spinal Fluid as Biomarkers of Alzheimer’s Disease Risk After Brain Injury

    DTIC Science & Technology

    2016-08-01

    protein processing is a key feature of AD. MiRNAs are small non- coding RNA that regulate mRNA transcription, and may be a significant cause of protein...non- coding RNA that regulate mRNA transcription, and may be a significant cause of protein dysregulation. Our investigative team has generated

  18. Pre-Mrna Introns as a Model for Cryptographic Algorithm:. Theory and Experiments

    NASA Astrophysics Data System (ADS)

    Regoli, Massimo

    2010-01-01

    The RNA-Crypto System (shortly RCS) is a symmetric key algorithm to cipher data. The idea for this new algorithm starts from the observation of nature. In particular from the observation of RNA behavior and some of its properties. In particular the RNA sequences have some sections called Introns. Introns, derived from the term "intragenic regions", are non-coding sections of precursor mRNA (pre-mRNA) or other RNAs, that are removed (spliced out of the RNA) before the mature RNA is formed. Once the introns have been spliced out of a pre-mRNA, the resulting mRNA sequence is ready to be translated into a protein. The corresponding parts of a gene are known as introns as well. The nature and the role of Introns in the pre-mRNA is not clear and it is under ponderous researches by Biologists but, in our case, we will use the presence of Introns in the RNA-Crypto System output as a strong method to add chaotic non coding information and an unnecessary behaviour in the access to the secret key to code the messages. In the RNA-Crypto System algorithm the introns are sections of the ciphered message with non-coding information as well as in the precursor mRNA.

  19. Nucleotide sequence determination of guinea-pig casein B mRNA reveals homology with bovine and rat alpha s1 caseins and conservation of the non-coding regions of the mRNA.

    PubMed Central

    Hall, L; Laird, J E; Craig, R K

    1984-01-01

    Nucleotide sequence analysis of cloned guinea-pig casein B cDNA sequences has identified two casein B variants related to the bovine and rat alpha s1 caseins. Amino acid homology was largely confined to the known bovine or predicted rat phosphorylation sites and within the 'signal' precursor sequence. Comparison of the deduced nucleotide sequence of the guinea-pig and rat alpha s1 casein mRNA species showed greater sequence conservation in the non-coding than in the coding regions, suggesting a functional and possibly regulatory role for the non-coding regions of casein mRNA. The results provide insight into the evolution of the casein genes, and raise questions as to the role of conserved nucleotide sequences within the non-coding regions of mRNA species. Images Fig. 1. PMID:6548375

  20. Structural Phylogenomics Retrodicts the Origin of the Genetic Code and Uncovers the Evolutionary Impact of Protein Flexibility

    PubMed Central

    Caetano-Anollés, Gustavo; Wang, Minglei; Caetano-Anollés, Derek

    2013-01-01

    The genetic code shapes the genetic repository. Its origin has puzzled molecular scientists for over half a century and remains a long-standing mystery. Here we show that the origin of the genetic code is tightly coupled to the history of aminoacyl-tRNA synthetase enzymes and their interactions with tRNA. A timeline of evolutionary appearance of protein domain families derived from a structural census in hundreds of genomes reveals the early emergence of the ‘operational’ RNA code and the late implementation of the standard genetic code. The emergence of codon specificities and amino acid charging involved tight coevolution of aminoacyl-tRNA synthetases and tRNA structures as well as episodes of structural recruitment. Remarkably, amino acid and dipeptide compositions of single-domain proteins appearing before the standard code suggest archaic synthetases with structures homologous to catalytic domains of tyrosyl-tRNA and seryl-tRNA synthetases were capable of peptide bond formation and aminoacylation. Results reveal that genetics arose through coevolutionary interactions between polypeptides and nucleic acid cofactors as an exacting mechanism that favored flexibility and folding of the emergent proteins. These enhancements of phenotypic robustness were likely internalized into the emerging genetic system with the early rise of modern protein structure. PMID:23991065

  1. Generating code adapted for interlinking legacy scalar code and extended vector code

    DOEpatents

    Gschwind, Michael K

    2013-06-04

    Mechanisms for intermixing code are provided. Source code is received for compilation using an extended Application Binary Interface (ABI) that extends a legacy ABI and uses a different register configuration than the legacy ABI. First compiled code is generated based on the source code, the first compiled code comprising code for accommodating the difference in register configurations used by the extended ABI and the legacy ABI. The first compiled code and second compiled code are intermixed to generate intermixed code, the second compiled code being compiled code that uses the legacy ABI. The intermixed code comprises at least one call instruction that is one of a call from the first compiled code to the second compiled code or a call from the second compiled code to the first compiled code. The code for accommodating the difference in register configurations is associated with the at least one call instruction.

  2. Improving the genome annotation of the acarbose producer Actinoplanes sp. SE50/110 by sequencing enriched 5'-ends of primary transcripts.

    PubMed

    Schwientek, Patrick; Neshat, Armin; Kalinowski, Jörn; Klein, Andreas; Rückert, Christian; Schneiker-Bekel, Susanne; Wendler, Sergej; Stoye, Jens; Pühler, Alfred

    2014-11-20

    Actinoplanes sp. SE50/110 is the producer of the alpha-glucosidase inhibitor acarbose, which is an economically relevant and potent drug in the treatment of type-2 diabetes mellitus. In this study, we present the detection of transcription start sites on this genome by sequencing enriched 5'-ends of primary transcripts. Altogether, 1427 putative transcription start sites were initially identified. With help of the annotated genome sequence, 661 transcription start sites were found to belong to the leader region of protein-coding genes with the surprising result that roughly 20% of these genes rank among the class of leaderless transcripts. Next, conserved promoter motifs were identified for protein-coding genes with and without leader sequences. The mapped transcription start sites were finally used to improve the annotation of the Actinoplanes sp. SE50/110 genome sequence. Concerning protein-coding genes, 41 translation start sites were corrected and 9 novel protein-coding genes could be identified. In addition to this, 122 previously undetermined non-coding RNA (ncRNA) genes of Actinoplanes sp. SE50/110 were defined. Focusing on antisense transcription start sites located within coding genes or their leader sequences, it was discovered that 96 of those ncRNA genes belong to the class of antisense RNA (asRNA) genes. The remaining 26 ncRNA genes were found outside of known protein-coding genes. Four chosen examples of prominent ncRNA genes, namely the transfer messenger RNA gene ssrA, the ribonuclease P class A RNA gene rnpB, the cobalamin riboswitch RNA gene cobRS, and the selenocysteine-specific tRNA gene selC, are presented in more detail. This study demonstrates that sequencing of enriched 5'-ends of primary transcripts and the identification of transcription start sites are valuable tools for advanced genome annotation of Actinoplanes sp. SE50/110 and most probably also for other bacteria. Copyright © 2014 Elsevier B.V. All rights reserved.

  3. Activity-Dependent Human Brain Coding/Noncoding Gene Regulatory Networks

    PubMed Central

    Lipovich, Leonard; Dachet, Fabien; Cai, Juan; Bagla, Shruti; Balan, Karina; Jia, Hui; Loeb, Jeffrey A.

    2012-01-01

    While most gene transcription yields RNA transcripts that code for proteins, a sizable proportion of the genome generates RNA transcripts that do not code for proteins, but may have important regulatory functions. The brain-derived neurotrophic factor (BDNF) gene, a key regulator of neuronal activity, is overlapped by a primate-specific, antisense long noncoding RNA (lncRNA) called BDNFOS. We demonstrate reciprocal patterns of BDNF and BDNFOS transcription in highly active regions of human neocortex removed as a treatment for intractable seizures. A genome-wide analysis of activity-dependent coding and noncoding human transcription using a custom lncRNA microarray identified 1288 differentially expressed lncRNAs, of which 26 had expression profiles that matched activity-dependent coding genes and an additional 8 were adjacent to or overlapping with differentially expressed protein-coding genes. The functions of most of these protein-coding partner genes, such as ARC, include long-term potentiation, synaptic activity, and memory. The nuclear lncRNAs NEAT1, MALAT1, and RPPH1, composing an RNAse P-dependent lncRNA-maturation pathway, were also upregulated. As a means to replicate human neuronal activity, repeated depolarization of SY5Y cells resulted in sustained CREB activation and produced an inverse pattern of BDNF-BDNFOS co-expression that was not achieved with a single depolarization. RNAi-mediated knockdown of BDNFOS in human SY5Y cells increased BDNF expression, suggesting that BDNFOS directly downregulates BDNF. Temporal expression patterns of other lncRNA-messenger RNA pairs validated the effect of chronic neuronal activity on the transcriptome and implied various lncRNA regulatory mechanisms. lncRNAs, some of which are unique to primates, thus appear to have potentially important regulatory roles in activity-dependent human brain plasticity. PMID:22960213

  4. Regions of extreme synonymous codon selection in mammalian genes

    PubMed Central

    Schattner, Peter; Diekhans, Mark

    2006-01-01

    Recently there has been increasing evidence that purifying selection occurs among synonymous codons in mammalian genes. This selection appears to be a consequence of either cis-regulatory motifs, such as exonic splicing enhancers (ESEs), or mRNA secondary structures, being superimposed on the coding sequence of the gene. We have developed a program to identify regions likely to be enriched for such motifs by searching for extended regions of extreme codon conservation between homologous genes of related species. Here we present the results of applying this approach to five mammalian species (human, chimpanzee, mouse, rat and dog). Even with very conservative selection criteria, we find over 200 regions of extreme codon conservation, ranging in length from 60 to 178 codons. The regions are often found within genes involved in DNA-binding, RNA-binding or zinc-ion-binding. They are highly depleted for synonymous single nucleotide polymorphisms (SNPs) but not for non-synonymous SNPs, further indicating that the observed codon conservation is being driven by negative selection. Forty-three percent of the regions overlap conserved alternative transcript isoforms and are enriched for known ESEs. Other regions are enriched for TpA dinucleotides and may contain conserved motifs/structures relating to mRNA stability and/or degradation. We anticipate that this tool will be useful for detecting regions enriched in other classes of coding-sequence motifs and structures as well. PMID:16556911

  5. Progressive changes in non-coding RNA profile in leucocytes with age

    PubMed Central

    Muñoz-Culla, Maider; Irizar, Haritz; Gorostidi, Ana; Alberro, Ainhoa; Osorio-Querejeta, Iñaki; Ruiz-Martínez, Javier; Olascoaga, Javier; de Munain, Adolfo López; Otaegui, David

    2017-01-01

    It has been observed that immune cell deterioration occurs in the elderly, as well as a chronic low-grade inflammation called inflammaging. These cellular changes must be driven by numerous changes in gene expression and in fact, both protein-coding and non-coding RNA expression alterations have been observed in peripheral blood mononuclear cells from elder people. In the present work we have studied the expression of small non-coding RNA (microRNA and small nucleolar RNA -snoRNA-) from healthy individuals from 24 to 79 years old. We have observed that the expression of 69 non-coding RNAs (56 microRNAs and 13 snoRNAs) changes progressively with chronological age. According to our results, the age range from 47 to 54 is critical given that it is the period when the expression trend (increasing or decreasing) of age-related small non-coding RNAs is more pronounced. Furthermore, age-related miRNAs regulate genes that are involved in immune, cell cycle and cancer-related processes, which had already been associated to human aging. Therefore, human aging could be studied as a result of progressive molecular changes, and different age ranges should be analysed to cover the whole aging process. PMID:28448962

  6. An imprinted non-coding genomic cluster at 14q32 defines clinically relevant molecular subtypes in osteosarcoma across multiple independent datasets.

    PubMed

    Hill, Katherine E; Kelly, Andrew D; Kuijjer, Marieke L; Barry, William; Rattani, Ahmed; Garbutt, Cassandra C; Kissick, Haydn; Janeway, Katherine; Perez-Atayde, Antonio; Goldsmith, Jeffrey; Gebhardt, Mark C; Arredouani, Mohamed S; Cote, Greg; Hornicek, Francis; Choy, Edwin; Duan, Zhenfeng; Quackenbush, John; Haibe-Kains, Benjamin; Spentzos, Dimitrios

    2017-05-15

    A microRNA (miRNA) collection on the imprinted 14q32 MEG3 region has been associated with outcome in osteosarcoma. We assessed the clinical utility of this miRNA set and their association with methylation status. We integrated coding and non-coding RNA data from three independent annotated clinical osteosarcoma cohorts (n = 65, n = 27, and n = 25) and miRNA and methylation data from one in vitro (19 cell lines) and one clinical (NCI Therapeutically Applicable Research to Generate Effective Treatments (TARGET) osteosarcoma dataset, n = 80) dataset. We used time-dependent receiver operating characteristic (tdROC) analysis to evaluate the clinical value of candidate miRNA profiles and machine learning approaches to compare the coding and non-coding transcriptional programs of high- and low-risk osteosarcoma tumors and high- versus low-aggressiveness cell lines. In the cell line and TARGET datasets, we also studied the methylation patterns of the MEG3 imprinting control region on 14q32 and their association with miRNA expression and tumor aggressiveness. In the tdROC analysis, miRNA sets on 14q32 showed strong discriminatory power for recurrence and survival in the three clinical datasets. High- or low-risk tumor classification was robust to using different microRNA sets or classification methods. Machine learning approaches showed that genome-wide miRNA profiles and miRNA regulatory networks were quite different between the two outcome groups and mRNA profiles categorized the samples in a manner concordant with the miRNAs, suggesting potential molecular subtypes. Further, miRNA expression patterns were reproducible in comparing high-aggressiveness versus low-aggressiveness cell lines. Methylation patterns in the MEG3 differentially methylated region (DMR) also distinguished high-aggressiveness from low-aggressiveness cell lines and were associated with expression of several 14q32 miRNAs in both the cell lines and the large TARGET clinical dataset. Within the limits of available CpG array coverage, we observed a potential methylation-sensitive regulation of the non-coding RNA cluster by CTCF, a known enhancer-blocking factor. Loss of imprinting/methylation changes in the 14q32 non-coding region defines reproducible previously unrecognized osteosarcoma subtypes with distinct transcriptional programs and biologic and clinical behavior. Future studies will define the precise relationship between 14q32 imprinting, non-coding RNA expression, genomic enhancer binding, and tumor aggressiveness, with possible therapeutic implications for both early- and advanced-stage patients.

  7. Structural architecture of the human long non-coding RNA, steroid receptor RNA activator

    PubMed Central

    Novikova, Irina V.; Hennelly, Scott P.; Sanbonmatsu, Karissa Y.

    2012-01-01

    While functional roles of several long non-coding RNAs (lncRNAs) have been determined, the molecular mechanisms are not well understood. Here, we report the first experimentally derived secondary structure of a human lncRNA, the steroid receptor RNA activator (SRA), 0.87 kB in size. The SRA RNA is a non-coding RNA that coactivates several human sex hormone receptors and is strongly associated with breast cancer. Coding isoforms of SRA are also expressed to produce proteins, making the SRA gene a unique bifunctional system. Our experimental findings (SHAPE, in-line, DMS and RNase V1 probing) reveal that this lncRNA has a complex structural organization, consisting of four domains, with a variety of secondary structure elements. We examine the coevolution of the SRA gene at the RNA structure and protein structure levels using comparative sequence analysis across vertebrates. Rapid evolutionary stabilization of RNA structure, combined with frame-disrupting mutations in conserved regions, suggests that evolutionary pressure preserves the RNA structural core rather than its translational product. We perform similar experiments on alternatively spliced SRA isoforms to assess their structural features. PMID:22362738

  8. Prediction of plant lncRNA by ensemble machine learning classifiers.

    PubMed

    Simopoulos, Caitlin M A; Weretilnyk, Elizabeth A; Golding, G Brian

    2018-05-02

    In plants, long non-protein coding RNAs are believed to have essential roles in development and stress responses. However, relative to advances on discerning biological roles for long non-protein coding RNAs in animal systems, this RNA class in plants is largely understudied. With comparatively few validated plant long non-coding RNAs, research on this potentially critical class of RNA is hindered by a lack of appropriate prediction tools and databases. Supervised learning models trained on data sets of mostly non-validated, non-coding transcripts have been previously used to identify this enigmatic RNA class with applications largely focused on animal systems. Our approach uses a training set comprised only of empirically validated long non-protein coding RNAs from plant, animal, and viral sources to predict and rank candidate long non-protein coding gene products for future functional validation. Individual stochastic gradient boosting and random forest classifiers trained on only empirically validated long non-protein coding RNAs were constructed. In order to use the strengths of multiple classifiers, we combined multiple models into a single stacking meta-learner. This ensemble approach benefits from the diversity of several learners to effectively identify putative plant long non-coding RNAs from transcript sequence features. When the predicted genes identified by the ensemble classifier were compared to those listed in GreeNC, an established plant long non-coding RNA database, overlap for predicted genes from Arabidopsis thaliana, Oryza sativa and Eutrema salsugineum ranged from 51 to 83% with the highest agreement in Eutrema salsugineum. Most of the highest ranking predictions from Arabidopsis thaliana were annotated as potential natural antisense genes, pseudogenes, transposable elements, or simply computationally predicted hypothetical protein. Due to the nature of this tool, the model can be updated as new long non-protein coding transcripts are identified and functionally verified. This ensemble classifier is an accurate tool that can be used to rank long non-protein coding RNA predictions for use in conjunction with gene expression studies. Selection of plant transcripts with a high potential for regulatory roles as long non-protein coding RNAs will advance research in the elucidation of long non-protein coding RNA function.

  9. Structural and functional analyses of Saccharomyces cerevisiae wild-type and mutant RNA1 genes.

    PubMed Central

    Traglia, H M; Atkinson, N S; Hopper, A K

    1989-01-01

    The yeast gene RNA1 has been defined by the thermosensitive rna1-1 lesion. This lesion interferes with the processing and production of all major classes of RNA. Each class of RNA is affected at a distinct and presumably unrelated step. Furthermore, RNA does not appear to exit the nucleus. To investigate how the RNA1 gene product can pleiotropically affect disparate processes, we undertook a structural analysis of wild-type and mutant RNA1 genes. The wild-type gene was found to contain a 407-amino-acid open reading frame that encodes a hydrophilic protein. No clue regarding the function of the RNA1 protein was obtained by searching banks for similarity to other known gene products. Surprisingly, the rna1-1 lesion was found to code for two amino acid differences from wild type. We found that neither single-amino-acid change alone resulted in temperature sensitivity. The carboxy-terminal region of the RNA1 open reading frame contains a highly acidic domain extending from amino acids 334 to 400. We generated genomic deletions that removed C-terminal regions of this protein. Deletion of amino acids 397 to 407 did not appear to affect cell growth. Removal of amino acids 359 to 397, a region containing 24 acidic residues, caused temperature-sensitive growth. This allele, rna1-delta 359-397, defines a second conditional lesion of the RNA1 locus. We found that strains possessing the rna1-delta 359-397 allele did not show thermosensitive defects in pre-rRNA or pre-tRNA processing. Removal of amino acids 330 to 407 resulted in loss of viability. Images PMID:2674676

  10. Genome-wide screening and identification of long noncoding RNAs and their interaction with protein coding RNAs in bladder urothelial cell carcinoma.

    PubMed

    Wang, Longxin; Fu, Dian; Qiu, Yongbin; Xing, Xiaoxiao; Xu, Feng; Han, Conghui; Xu, Xiaofeng; Wei, Zhifeng; Zhang, Zhengyu; Ge, Jingping; Cheng, Wen; Xie, Hai-Long

    2014-07-10

    To understand lncRNAs expression profiling and their potential functions in bladder cancer, we investigated the lncRNA and coding RNA expression on human bladder cancer and normal bladder tissues. Bioinformatic analysis revealed thousands of significantly differentially expressed lncRNAs and coding mRNA in bladder cancer relative to normal bladder tissue. Co-expression analysis revealed that 50% of lncRNAs and coding RNAs expressed in the same direction. A subset of lncRNAs might be involved in mTOR signaling, p53 signaling, cancer pathways. Our study provides a large scale of co-expression between lncRNA and coding RNAs in bladder cancer cells and lays biological basis for further investigation. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  11. Genome-scale deletion screening of human long non-coding RNAs using a paired-guide RNA CRISPR library

    PubMed Central

    Zhu, Shiyou; Li, Wei; Liu, Jingze; Chen, Chen-Hao; Liao, Qi; Xu, Ping; Xu, Han; Xiao, Tengfei; Cao, Zhongzheng; Peng, Jingyu; Yuan, Pengfei; Brown, Myles; Liu, Xiaole Shirley; Wei, Wensheng

    2017-01-01

    CRISPR/Cas9 screens have been widely adopted to analyse coding gene functions, but high throughput screening of non-coding elements using this method is more challenging, because indels caused by a single cut in non-coding regions are unlikely to produce a functional knockout. A high-throughput method to produce deletions of non-coding DNA is needed. Herein, we report a high throughput genomic deletion strategy to screen for functional long non-coding RNAs (lncRNAs) that is based on a lentiviral paired-guide RNA (pgRNA) library. Applying our screening method, we identified 51 lncRNAs that can positively or negatively regulate human cancer cell growth. We individually validated 9 lncRNAs using CRISPR/Cas9-mediated genomic deletion and functional rescue, CRISPR activation or inhibition, and gene expression profiling. Our high-throughput pgRNA genome deletion method should enable rapid identification of functional mammalian non-coding elements. PMID:27798563

  12. Specific and Modular Binding Code for Cytosine Recognition in Pumilio/FBF (PUF) RNA-binding Domains

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Dong, Shuyun; Wang, Yang; Cassidy-Amstutz, Caleb

    2011-10-28

    Pumilio/fem-3 mRNA-binding factor (PUF) proteins possess a recognition code for bases A, U, and G, allowing designed RNA sequence specificity of their modular Pumilio (PUM) repeats. However, recognition side chains in a PUM repeat for cytosine are unknown. Here we report identification of a cytosine-recognition code by screening random amino acid combinations at conserved RNA recognition positions using a yeast three-hybrid system. This C-recognition code is specific and modular as specificity can be transferred to different positions in the RNA recognition sequence. A crystal structure of a modified PUF domain reveals specific contacts between an arginine side chain and themore » cytosine base. We applied the C-recognition code to design PUF domains that recognize targets with multiple cytosines and to generate engineered splicing factors that modulate alternative splicing. Finally, we identified a divergent yeast PUF protein, Nop9p, that may recognize natural target RNAs with cytosine. This work deepens our understanding of natural PUF protein target recognition and expands the ability to engineer PUF domains to recognize any RNA sequence.« less

  13. The Maximal C³ Self-Complementary Trinucleotide Circular Code X in Genes of Bacteria, Archaea, Eukaryotes, Plasmids and Viruses.

    PubMed

    Michel, Christian J

    2017-04-18

    In 1996, a set X of 20 trinucleotides was identified in genes of both prokaryotes and eukaryotes which has on average the highest occurrence in reading frame compared to its two shifted frames. Furthermore, this set X has an interesting mathematical property as X is a maximal C 3 self-complementary trinucleotide circular code. In 2015, by quantifying the inspection approach used in 1996, the circular code X was confirmed in the genes of bacteria and eukaryotes and was also identified in the genes of plasmids and viruses. The method was based on the preferential occurrence of trinucleotides among the three frames at the gene population level. We extend here this definition at the gene level. This new statistical approach considers all the genes, i.e., of large and small lengths, with the same weight for searching the circular code X . As a consequence, the concept of circular code, in particular the reading frame retrieval, is directly associated to each gene. At the gene level, the circular code X is strengthened in the genes of bacteria, eukaryotes, plasmids, and viruses, and is now also identified in the genes of archaea. The genes of mitochondria and chloroplasts contain a subset of the circular code X . Finally, by studying viral genes, the circular code X was found in DNA genomes, RNA genomes, double-stranded genomes, and single-stranded genomes.

  14. On origin of genetic code and tRNA before translation

    PubMed Central

    2011-01-01

    Background Synthesis of proteins is based on the genetic code - a nearly universal assignment of codons to amino acids (aas). A major challenge to the understanding of the origins of this assignment is the archetypal "key-lock vs. frozen accident" dilemma. Here we re-examine this dilemma in light of 1) the fundamental veto on "foresight evolution", 2) modular structures of tRNAs and aminoacyl-tRNA synthetases, and 3) the updated library of aa-binding sites in RNA aptamers successfully selected in vitro for eight amino acids. Results The aa-binding sites of arginine, isoleucine and tyrosine contain both their cognate triplets, anticodons and codons. We have noticed that these cases might be associated with palindrome-dinucleotides. For example, one-base shift to the left brings arginine codons CGN, with CG at 1-2 positions, to the respective anticodons NCG, with CG at 2-3 positions. Formally, the concomitant presence of codons and anticodons is also expected in the reverse situation, with codons containing palindrome-dinucleotides at their 2-3 positions, and anticodons exhibiting them at 1-2 positions. A closer analysis reveals that, surprisingly, RNA binding sites for Arg, Ile and Tyr "prefer" (exactly as in the actual genetic code) the anticodon(2-3)/codon(1-2) tetramers to their anticodon(1-2)/codon(2-3) counterparts, despite the seemingly perfect symmetry of the latter. However, since in vitro selection of aa-specific RNA aptamers apparently had nothing to do with translation, this striking preference provides a new strong support to the notion of the genetic code emerging before translation, in response to catalytic (and possibly other) needs of ancient RNA life. Consistently with the pre-translation origin of the code, we propose here a new model of tRNA origin by the gradual, Fibonacci process-like, elongation of a tRNA molecule from a primordial coding triplet and 5'DCCA3' quadruplet (D is a base-determinator) to the eventual 76 base-long cloverleaf-shaped molecule. Conclusion Taken together, our findings necessarily imply that primordial tRNAs, tRNA aminoacylating ribozymes, and (later) the translation machinery in general have been co-evolving to ''fit'' the (likely already defined) genetic code, rather than the opposite way around. Coding triplets in this primal pre-translational code were likely similar to the anticodons, with second and third nucleotides being more important than the less specific first one. Later, when the code was expanding in co-evolution with the translation apparatus, the importance of 2-3 nucleotides of coding triplets "transferred" to the 1-2 nucleotides of their complements, thus distinguishing anticodons from codons. This evolutionary primacy of anticodons in genetic coding makes the hypothesis of primal stereo-chemical affinity between amino acids and cognate triplets, the hypothesis of coding coenzyme handles for amino acids, the hypothesis of tRNA-like genomic 3' tags suggesting that tRNAs originated in replication, and the hypothesis of ancient ribozymes-mediated operational code of tRNA aminoacylation not mutually contradicting but rather co-existing in harmony. Reviewers This article was reviewed by Eugene V. Koonin, Wentao Ma (nominated by Juergen Brosius) and Anthony Poole. PMID:21342520

  15. Tuning iteration space slicing based tiled multi-core code implementing Nussinov's RNA folding.

    PubMed

    Palkowski, Marek; Bielecki, Wlodzimierz

    2018-01-15

    RNA folding is an ongoing compute-intensive task of bioinformatics. Parallelization and improving code locality for this kind of algorithms is one of the most relevant areas in computational biology. Fortunately, RNA secondary structure approaches, such as Nussinov's recurrence, involve mathematical operations over affine control loops whose iteration space can be represented by the polyhedral model. This allows us to apply powerful polyhedral compilation techniques based on the transitive closure of dependence graphs to generate parallel tiled code implementing Nussinov's RNA folding. Such techniques are within the iteration space slicing framework - the transitive dependences are applied to the statement instances of interest to produce valid tiles. The main problem at generating parallel tiled code is defining a proper tile size and tile dimension which impact parallelism degree and code locality. To choose the best tile size and tile dimension, we first construct parallel parametric tiled code (parameters are variables defining tile size). With this purpose, we first generate two nonparametric tiled codes with different fixed tile sizes but with the same code structure and then derive a general affine model, which describes all integer factors available in expressions of those codes. Using this model and known integer factors present in the mentioned expressions (they define the left-hand side of the model), we find unknown integers in this model for each integer factor available in the same fixed tiled code position and replace in this code expressions, including integer factors, with those including parameters. Then we use this parallel parametric tiled code to implement the well-known tile size selection (TSS) technique, which allows us to discover in a given search space the best tile size and tile dimension maximizing target code performance. For a given search space, the presented approach allows us to choose the best tile size and tile dimension in parallel tiled code implementing Nussinov's RNA folding. Experimental results, received on modern Intel multi-core processors, demonstrate that this code outperforms known closely related implementations when the length of RNA strands is bigger than 2500.

  16. Transcriptome interrogation of human myometrium identifies differentially expressed sense-antisense pairs of protein-coding and long non-coding RNA genes in spontaneous labor at term

    PubMed Central

    Romero, Roberto; Tarca, Adi; Chaemsaithong, Piya; Miranda, Jezid; Chaiworapongsa, Tinnakorn; Jia, Hui; Hassan, Sonia S.; Kalita, Cynthia A.; Cai, Juan; Yeo, Lami; Lipovich, Leonard

    2014-01-01

    Objective The mechanisms responsible for normal and abnormal parturition are poorly understood. Myometrial activation leading to regular uterine contractions is a key component of labor. Dysfunctional labor (arrest of dilatation and/or descent) is a leading indication for cesarean delivery. Compelling evidence suggests that most of these disorders are functional in nature, and not the result of cephalopelvic disproportion. The methodology and the datasets afforded by the post-genomic era provide novel opportunities to understand and target gene functions in these disorders. In 2012, the ENCODE Consortium elucidated the extraordinary abundance and functional complexity of long non-coding RNA genes in the human genome. The purpose of the study was to identify differentially expressed long non-coding RNA genes in human myometrium in women in spontaneous labor at term. Materials and Methods Myometrium was obtained from women undergoing cesarean deliveries who were not in labor (n=19) and women in spontaneous labor at term (n=20). RNA was extracted and profiled using an Illumina® microarray platform. The analysis of the protein coding genes from this study has been previously reported. Here, we have used computational approaches to bound the extent of long non-coding RNA representation on this platform, and to identify co-differentially expressed and correlated pairs of long non-coding RNA genes and protein-coding genes sharing the same genomic loci. Results Upon considering more than 18,498 distinct lncRNA genes compiled nonredundantly from public experimental data sources, and interrogating 2,634 that matched Illumina microarray probes, we identified co-differential expression and correlation at two genomic loci that contain coding-lncRNA gene pairs: SOCS2-AK054607 and LMCD1-NR_024065 in women in spontaneous labor at term. This co-differential expression and correlation was validated by qRT-PCR, an independent experimental method. Intriguingly, one of the two lncRNA genes differentially expressed in term labor had a key genomic structure element, a splice site that lacked evolutionary conservation beyond primates. Conclusions We provide for the first time evidence for coordinated differential expression and correlation of cis-encoded antisense lncRNAs and protein-coding genes with known, as well as novel roles in pregnancy in the myometrium of women in spontaneous labor at term. PMID:24168098

  17. Advances in RNA Structure Determination | Center for Cancer Research

    Cancer.gov

    The recent years have witnessed a revolution in the field of RNA structure and function. Until recently the main contribution of RNA in cellular and disease functions was considered to be a role defined by the central dogma, namely DNA codes for mRNAs, which in turn encode for proteins, a notion facilitated by non-coding ribosomal RNA and tRNA. It was also assumed at the time

  18. Dnmt2 mediates intergenerational transmission of paternally acquired metabolic disorders through sperm small non-coding RNAs.

    PubMed

    Zhang, Yunfang; Zhang, Xudong; Shi, Junchao; Tuorto, Francesca; Li, Xin; Liu, Yusheng; Liebers, Reinhard; Zhang, Liwen; Qu, Yongcun; Qian, Jingjing; Pahima, Maya; Liu, Ying; Yan, Menghong; Cao, Zhonghong; Lei, Xiaohua; Cao, Yujing; Peng, Hongying; Liu, Shichao; Wang, Yue; Zheng, Huili; Woolsey, Rebekah; Quilici, David; Zhai, Qiwei; Li, Lei; Zhou, Tong; Yan, Wei; Lyko, Frank; Zhang, Ying; Zhou, Qi; Duan, Enkui; Chen, Qi

    2018-05-01

    The discovery of RNAs (for example, messenger RNAs, non-coding RNAs) in sperm has opened the possibility that sperm may function by delivering additional paternal information aside from solely providing the DNA 1 . Increasing evidence now suggests that sperm small non-coding RNAs (sncRNAs) can mediate intergenerational transmission of paternally acquired phenotypes, including mental stress 2,3 and metabolic disorders 4-6 . How sperm sncRNAs encode paternal information remains unclear, but the mechanism may involve RNA modifications. Here we show that deletion of a mouse tRNA methyltransferase, DNMT2, abolished sperm sncRNA-mediated transmission of high-fat-diet-induced metabolic disorders to offspring. Dnmt2 deletion prevented the elevation of RNA modifications (m 5 C, m 2 G) in sperm 30-40 nt RNA fractions that are induced by a high-fat diet. Also, Dnmt2 deletion altered the sperm small RNA expression profile, including levels of tRNA-derived small RNAs and rRNA-derived small RNAs, which might be essential in composing a sperm RNA 'coding signature' that is needed for paternal epigenetic memory. Finally, we show that Dnmt2-mediated m 5 C contributes to the secondary structure and biological properties of sncRNAs, implicating sperm RNA modifications as an additional layer of paternal hereditary information.

  19. Methylation of miRNA genes and oncogenesis.

    PubMed

    Loginov, V I; Rykov, S V; Fridman, M V; Braga, E A

    2015-02-01

    Interaction between microRNA (miRNA) and messenger RNA of target genes at the posttranscriptional level provides fine-tuned dynamic regulation of cell signaling pathways. Each miRNA can be involved in regulating hundreds of protein-coding genes, and, conversely, a number of different miRNAs usually target a structural gene. Epigenetic gene inactivation associated with methylation of promoter CpG-islands is common to both protein-coding genes and miRNA genes. Here, data on functions of miRNAs in development of tumor-cell phenotype are reviewed. Genomic organization of promoter CpG-islands of the miRNA genes located in inter- and intragenic areas is discussed. The literature and our own results on frequency of CpG-island methylation in miRNA genes from tumors are summarized, and data regarding a link between such modification and changed activity of miRNA genes and, consequently, protein-coding target genes are presented. Moreover, the impact of miRNA gene methylation on key oncogenetic processes as well as affected signaling pathways is discussed.

  20. Cross-species inference of long non-coding RNAs greatly expands the ruminant transcriptome.

    PubMed

    Bush, Stephen J; Muriuki, Charity; McCulloch, Mary E B; Farquhar, Iseabail L; Clark, Emily L; Hume, David A

    2018-04-24

    mRNA-like long non-coding RNAs (lncRNAs) are a significant component of mammalian transcriptomes, although most are expressed only at low levels, with high tissue-specificity and/or at specific developmental stages. Thus, in many cases lncRNA detection by RNA-sequencing (RNA-seq) is compromised by stochastic sampling. To account for this and create a catalogue of ruminant lncRNAs, we compared de novo assembled lncRNAs derived from large RNA-seq datasets in transcriptional atlas projects for sheep and goats with previous lncRNAs assembled in cattle and human. We then combined the novel lncRNAs with the sheep transcriptional atlas to identify co-regulated sets of protein-coding and non-coding loci. Few lncRNAs could be reproducibly assembled from a single dataset, even with deep sequencing of the same tissues from multiple animals. Furthermore, there was little sequence overlap between lncRNAs that were assembled from pooled RNA-seq data. We combined positional conservation (synteny) with cross-species mapping of candidate lncRNAs to identify a consensus set of ruminant lncRNAs and then used the RNA-seq data to demonstrate detectable and reproducible expression in each species. In sheep, 20 to 30% of lncRNAs were located close to protein-coding genes with which they are strongly co-expressed, which is consistent with the evolutionary origin of some ncRNAs in enhancer sequences. Nevertheless, most of the lncRNAs are not co-expressed with neighbouring protein-coding genes. Alongside substantially expanding the ruminant lncRNA repertoire, the outcomes of our analysis demonstrate that stochastic sampling can be partly overcome by combining RNA-seq datasets from related species. This has practical implications for the future discovery of lncRNAs in other species.

  1. LncRNA-DANCR: A valuable cancer related long non-coding RNA for human cancers.

    PubMed

    Thin, Khaing Zar; Liu, Xuefang; Feng, Xiaobo; Raveendran, Sudheesh; Tu, Jian Cheng

    2018-06-01

    Long noncoding RNAs (lncRNA) are a type of noncoding RNA that comprise of longer than 200 nucleotides sequences. They can regulate chromosome structure, gene expression and play an essential role in the pathophysiology of human diseases, especially in tumorigenesis and progression. Nowadays, they are being targeted as potential biomarkers for various cancer types. And many research studies have proven that lncRNAs might bring a new era to cancer diagnosis and support treatment management. The purpose of this review was to inspect the molecular mechanism and clinical significance of long non-coding RNA- differentiation antagonizing nonprotein coding RNA(DANCR) in various types of human cancers. In this review, we summarize and figure out recent research studies concerning the expression and biological mechanisms of lncRNA-DANCR in tumour development. The related studies were obtained through a systematic search of PubMed, Embase and Cochrane Library. Long non-coding RNAs-DANCR is a valuable cancer-related lncRNA that its dysregulated expression was found in a variety of malignancies, including hepatocellular carcinoma, breast cancer, glioma, colorectal cancer, gastric cancer, and lung cancer. The aberrant expressions of DANCR have been shown to contribute to proliferation, migration and invasion of cancer cells. Long non-coding RNAs-DANCR likely serves as a useful disease biomarker or therapeutic cancer target. Copyright © 2018 Elsevier GmbH. All rights reserved.

  2. Characterization of mitochondrial genome of sea cucumber Stichopus horrens: a novel gene arrangement in Holothuroidea.

    PubMed

    Fan, SiGang; Hu, ChaoQun; Wen, Jing; Zhang, LvPing

    2011-05-01

    The complete mitochondrial DNA sequence contains useful information for phylogenetic analyses of metazoa. In this study, the complete mitochondrial DNA sequence of sea cucumber Stichopus horrens (Holothuroidea: Stichopodidae: Stichopus) is presented. The complete sequence was determined using normal and long PCRs. The mitochondrial genome of Stichopus horrens is a circular molecule 16257 bps long, composed of 13 protein-coding genes, two ribosomal RNA genes and 22 transfer RNA genes. Most of these genes are coded on the heavy strand except for one protein-coding gene (nad6) and five tRNA genes (tRNA ( Ser(UCN) ), tRNA ( Gln ), tRNA ( Ala ), tRNA ( Val ), tRNA ( Asp )) which are coded on the light strand. The composition of the heavy strand is 30.8% A, 23.7% C, 16.2% G, and 29.3% T bases (AT skew=0.025; GC skew=-0.188). A non-coding region of 675 bp was identified as a putative control region because of its location and AT richness. The intergenic spacers range from 1 to 50 bp in size, totaling 227 bp. A total of 25 overlapping nucleotides, ranging from 1 to 10 bp in size, exist among 11 genes. All 13 protein-coding genes are initiated with an ATG. The TAA codon is used as the stop codon in all the protein coding genes except nad3 and nad4 that use TAG as their termination codon. The most frequently used amino acids are Leu (16.29%), Ser (10.34%) and Phe (8.37%). All of the tRNA genes have the potential to fold into typical cloverleaf secondary structures. We also compared the order of the genes in the mitochondrial DNA from the five holothurians that are now available and found a novel gene arrangement in the mitochondrial DNA of Stichopus horrens.

  3. Divergent transcription is associated with promoters of transcriptional regulators

    PubMed Central

    2013-01-01

    Background Divergent transcription is a wide-spread phenomenon in mammals. For instance, short bidirectional transcripts are a hallmark of active promoters, while longer transcripts can be detected antisense from active genes in conditions where the RNA degradation machinery is inhibited. Moreover, many described long non-coding RNAs (lncRNAs) are transcribed antisense from coding gene promoters. However, the general significance of divergent lncRNA/mRNA gene pair transcription is still poorly understood. Here, we used strand-specific RNA-seq with high sequencing depth to thoroughly identify antisense transcripts from coding gene promoters in primary mouse tissues. Results We found that a substantial fraction of coding-gene promoters sustain divergent transcription of long non-coding RNA (lncRNA)/mRNA gene pairs. Strikingly, upstream antisense transcription is significantly associated with genes related to transcriptional regulation and development. Their promoters share several characteristics with those of transcriptional developmental genes, including very large CpG islands, high degree of conservation and epigenetic regulation in ES cells. In-depth analysis revealed a unique GC skew profile at these promoter regions, while the associated coding genes were found to have large first exons, two genomic features that might enforce bidirectional transcription. Finally, genes associated with antisense transcription harbor specific H3K79me2 epigenetic marking and RNA polymerase II enrichment profiles linked to an intensified rate of early transcriptional elongation. Conclusions We concluded that promoters of a class of transcription regulators are characterized by a specialized transcriptional control mechanism, which is directly coupled to relaxed bidirectional transcription. PMID:24365181

  4. Sounds of silence: synonymous nucleotides as a key to biological regulation and complexity

    PubMed Central

    Shabalina, Svetlana A.; Spiridonov, Nikolay A.; Kashina, Anna

    2013-01-01

    Messenger RNA is a key component of an intricate regulatory network of its own. It accommodates numerous nucleotide signals that overlap protein coding sequences and are responsible for multiple levels of regulation and generation of biological complexity. A wealth of structural and regulatory information, which mRNA carries in addition to the encoded amino acid sequence, raises the question of how these signals and overlapping codes are delineated along non-synonymous and synonymous positions in protein coding regions, especially in eukaryotes. Silent or synonymous codon positions, which do not determine amino acid sequences of the encoded proteins, define mRNA secondary structure and stability and affect the rate of translation, folding and post-translational modifications of nascent polypeptides. The RNA level selection is acting on synonymous sites in both prokaryotes and eukaryotes and is more common than previously thought. Selection pressure on the coding gene regions follows three-nucleotide periodic pattern of nucleotide base-pairing in mRNA, which is imposed by the genetic code. Synonymous positions of the coding regions have a higher level of hybridization potential relative to non-synonymous positions, and are multifunctional in their regulatory and structural roles. Recent experimental evidence and analysis of mRNA structure and interspecies conservation suggest that there is an evolutionary tradeoff between selective pressure acting at the RNA and protein levels. Here we provide a comprehensive overview of the studies that define the role of silent positions in regulating RNA structure and processing that exert downstream effects on proteins and their functions. PMID:23293005

  5. Microprocessor-dependent processing of Splice site Overlapping microRNA exons does not result in changes in alternative splicing.

    PubMed

    Pianigiani, Giulia; Licastro, Danilo; Fortugno, Paola; Castiglia, Daniele; Petrovic, Ivana; Pagani, Franco

    2018-06-12

    MicroRNAs are found throughout the genome and are processed by the microprocessor complex (MPC) from longer precursors. Some precursor miRNAs overlap intron:exon junctions. These Splice site Overlapping microRNAs (SO-miRNAs) are mostly located in coding genes. It has been intimated, in the rarer examples of SO-miRNAs in non-coding RNAs, that the competition between the spliceosome and the MPC modulates alternative splicing. However, the effect of this overlap on coding transcripts is unknown. Unexpectedly, we show that neither Drosha silencing nor SF3b1 silencing changed the inclusion ratio of SO-miRNA exons. Two SO-miRNAs, located in genes that code for basal membrane proteins, are known to inhibit proliferation in primary keratinocytes. These SO-miRNAs were upregulated during differentiation and the host mRNAs were downregulated, but again there was no change in inclusion ratio of the SO-miRNA exons. Interestingly, Drosha silencing increased nascent RNA density, on chromatin, downstream of SO-miRNA exons. Overall our data suggest a novel mechanism for regulating gene expression in which MPC-dependent cleavage of SO-miRNA exons could cause premature transcriptional termination of coding genes rather than affecting alternative splicing. Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  6. DIANA-LncBase v2: indexing microRNA targets on non-coding transcripts

    PubMed Central

    Paraskevopoulou, Maria D.; Vlachos, Ioannis S.; Karagkouni, Dimitra; Georgakilas, Georgios; Kanellos, Ilias; Vergoulis, Thanasis; Zagganas, Konstantinos; Tsanakas, Panayiotis; Floros, Evangelos; Dalamagas, Theodore; Hatzigeorgiou, Artemis G.

    2016-01-01

    microRNAs (miRNAs) are short non-coding RNAs (ncRNAs) that act as post-transcriptional regulators of coding gene expression. Long non-coding RNAs (lncRNAs) have been recently reported to interact with miRNAs. The sponge-like function of lncRNAs introduces an extra layer of complexity in the miRNA interactome. DIANA-LncBase v1 provided a database of experimentally supported and in silico predicted miRNA Recognition Elements (MREs) on lncRNAs. The second version of LncBase (www.microrna.gr/LncBase) presents an extensive collection of miRNA:lncRNA interactions. The significantly enhanced database includes more than 70 000 low and high-throughput, (in)direct miRNA:lncRNA experimentally supported interactions, derived from manually curated publications and the analysis of 153 AGO CLIP-Seq libraries. The new experimental module presents a 14-fold increase compared to the previous release. LncBase v2 hosts in silico predicted miRNA targets on lncRNAs, identified with the DIANA-microT algorithm. The relevant module provides millions of predicted miRNA binding sites, accompanied with detailed metadata and MRE conservation metrics. LncBase v2 caters information regarding cell type specific miRNA:lncRNA regulation and enables users to easily identify interactions in 66 different cell types, spanning 36 tissues for human and mouse. Database entries are also supported by accurate lncRNA expression information, derived from the analysis of more than 6 billion RNA-Seq reads. PMID:26612864

  7. RNA meets disease in paradise.

    PubMed

    Winter, Julia; Roth, Anna; Diederichs, Sven

    2011-01-01

    Getting off the train in Jena-Paradies, 60 participants joined for the 12 (th) Young Scientist Meeting of the German Society for Cell Biology (DGZ) entitled "RNA & Disease". Excellent speakers from around the world, graduate students, postdocs and young group leaders enjoyed a meeting in a familiar atmosphere to exchange inspiring new data and vibrant scientific discussions about the fascinating history and exciting future of non-coding RNA research including microRNA, piRNA and long non-coding RNA as well as their function in cancer, diabetes and neurodegenerative diseases.

  8. Parallel tiled Nussinov RNA folding loop nest generated using both dependence graph transitive closure and loop skewing.

    PubMed

    Palkowski, Marek; Bielecki, Wlodzimierz

    2017-06-02

    RNA secondary structure prediction is a compute intensive task that lies at the core of several search algorithms in bioinformatics. Fortunately, the RNA folding approaches, such as the Nussinov base pair maximization, involve mathematical operations over affine control loops whose iteration space can be represented by the polyhedral model. Polyhedral compilation techniques have proven to be a powerful tool for optimization of dense array codes. However, classical affine loop nest transformations used with these techniques do not optimize effectively codes of dynamic programming of RNA structure predictions. The purpose of this paper is to present a novel approach allowing for generation of a parallel tiled Nussinov RNA loop nest exposing significantly higher performance than that of known related code. This effect is achieved due to improving code locality and calculation parallelization. In order to improve code locality, we apply our previously published technique of automatic loop nest tiling to all the three loops of the Nussinov loop nest. This approach first forms original rectangular 3D tiles and then corrects them to establish their validity by means of applying the transitive closure of a dependence graph. To produce parallel code, we apply the loop skewing technique to a tiled Nussinov loop nest. The technique is implemented as a part of the publicly available polyhedral source-to-source TRACO compiler. Generated code was run on modern Intel multi-core processors and coprocessors. We present the speed-up factor of generated Nussinov RNA parallel code and demonstrate that it is considerably faster than related codes in which only the two outer loops of the Nussinov loop nest are tiled.

  9. Sequence-based heuristics for faster annotation of non-coding RNA families.

    PubMed

    Weinberg, Zasha; Ruzzo, Walter L

    2006-01-01

    Non-coding RNAs (ncRNAs) are functional RNA molecules that do not code for proteins. Covariance Models (CMs) are a useful statistical tool to find new members of an ncRNA gene family in a large genome database, using both sequence and, importantly, RNA secondary structure information. Unfortunately, CM searches are extremely slow. Previously, we created rigorous filters, which provably sacrifice none of a CM's accuracy, while making searches significantly faster for virtually all ncRNA families. However, these rigorous filters make searches slower than heuristics could be. In this paper we introduce profile HMM-based heuristic filters. We show that their accuracy is usually superior to heuristics based on BLAST. Moreover, we compared our heuristics with those used in tRNAscan-SE, whose heuristics incorporate a significant amount of work specific to tRNAs, where our heuristics are generic to any ncRNA. Performance was roughly comparable, so we expect that our heuristics provide a high-quality solution that--unlike family-specific solutions--can scale to hundreds of ncRNA families. The source code is available under GNU Public License at the supplementary web site.

  10. RNA-seq reveals distinctive RNA profiles of small extracellular vesicles from different human liver cancer cell lines.

    PubMed

    Berardocco, Martina; Radeghieri, Annalisa; Busatto, Sara; Gallorini, Marialucia; Raggi, Chiara; Gissi, Clarissa; D'Agnano, Igea; Bergese, Paolo; Felsani, Armando; Berardi, Anna C

    2017-10-10

    Liver cancer (LC) is one of the most common cancers and represents the third highest cause of cancer-related deaths worldwide. Extracellular vesicle (EVs) cargoes, which are selectively enriched in RNA, offer great promise for the diagnosis, prognosis and treatment of LC. Our study analyzed the RNA cargoes of EVs derived from 4 liver-cancer cell lines: HuH7, Hep3B, HepG2 (hepato-cellular carcinoma) and HuH6 (hepatoblastoma), generating two different sets of sequencing libraries for each. One library was size-selected for small RNAs and the other targeted the whole transcriptome. Here are reported genome wide data of the expression level of coding and non-coding transcripts, microRNAs, isomiRs and snoRNAs providing the first comprehensive overview of the extracellular-vesicle RNA cargo released from LC cell lines. The EV-RNA expression profiles of the four liver cancer cell lines share a similar background, but cell-specific features clearly emerge showing the marked heterogeneity of the EV-cargo among the individual cell lines, evident both for the coding and non-coding RNA species.

  11. Characterization of a major late herpes simplex virus type 1 mRNA.

    PubMed

    Costa, R H; Devi, B G; Anderson, K P; Gaylord, B H; Wagner, E K

    1981-05-01

    A major, late 6-kilobase (6-kb) mRNa mapping in the large unique region of herpes simplex virus type 1 (HSV-1) was characterized by using two recombinant DNA clones, one containing EcoRI fragment G (0.190 to 0.30 map units) in lambda. WES.B (L. Enquist, M. Madden, P. Schiop-Stansly, and G. Vandl Woude, Science 203:541-544, 1979) and one containing HindIII fragment J (0.181 to 0.259 map units) in pBR322. This 6-kb mRNA had its 3' end to the left of 0.231 on the prototypical arrangement of the HSV-1 genome and was transcribed from right to left. It was bounded on both sides by regions containing a large number of distinct mRNA species, and its 3' end was partially colinear with a 1.5-kb mRNA which encoded a 35,000-dalton polypeptide. The 6-kb mRNA encoded a 155,000-dalton polypeptide which was shown to be the only one of this size detectable by hybrid-arrested translation encoded by late polyadenylated polyribosomal RNA. The S1 nuclease mapping experiments indicated that there were no introns in the coding sequence for this mRNA and that its 3' end mapped approximately 800 nucleotides to the left of the BglII site at 0.231, whereas its 5' end extended very close to the BamHI site at 0.266.

  12. A small and efficient dimerization/packaging signal of rat VL30 RNA and its use in murine leukemia virus-VL30-derived vectors for gene transfer.

    PubMed

    Torrent, C; Gabus, C; Darlix, J L

    1994-02-01

    Retroviral genomes consist of two identical RNA molecules associated at their 5' ends by the dimer linkage structure located in the packaging element (Psi or E) necessary for RNA dimerization in vitro and packaging in vivo. In murine leukemia virus (MLV)-derived vectors designed for gene transfer, the Psi + sequence of 600 nucleotides directs the packaging of recombinant RNAs into MLV virions produced by helper cells. By using in vitro RNA dimerization as a screening system, a sequence of rat VL30 RNA located next to the 5' end of the Harvey mouse sarcoma virus genome and as small as 67 nucleotides was found to form stable dimeric RNA. In addition, a purine-rich sequence located at the 5' end of this VL30 RNA seems to be critical for RNA dimerization. When this VL30 element was extended by 107 nucleotides at its 3' end and inserted into an MLV-derived vector lacking MLV Psi +, it directed the efficient encapsidation of recombinant RNAs into MLV virions. Because this VL30 packaging signal is smaller and more efficient in packaging recombinant RNAs than the MLV Psi + and does not contain gag or glyco-gag coding sequences, its use in MLV-derived vectors should render even more unlikely recombinations which could generate replication-competent viruses. Therefore, utilization of the rat VL30 packaging sequence should improve the biological safety of MLV vectors for human gene transfer.

  13. One ancestor for two codes viewed from the perspective of two complementary modes of tRNA aminoacylation

    PubMed Central

    Rodin, Andrei S; Szathmáry, Eörs; Rodin, Sergei N

    2009-01-01

    Background The genetic code is brought into action by 20 aminoacyl-tRNA synthetases. These enzymes are evenly divided into two classes (I and II) that recognize tRNAs from the minor and major groove sides of the acceptor stem, respectively. We have reported recently that: (1) ribozymic precursors of the synthetases seem to have used the same two sterically mirror modes of tRNA recognition, (2) having these two modes might have helped in preventing erroneous aminoacylation of ancestral tRNAs with complementary anticodons, yet (3) the risk of confusion for the presumably earliest pairs of complementarily encoded amino acids had little to do with anticodons. Accordingly, in this communication we focus on the acceptor stem. Results Our main result is the emergence of a palindrome structure for the acceptor stem's common ancestor, reconstructed from the phylogenetic trees of Bacteria, Archaea and Eukarya. In parallel, for pairs of ancestral tRNAs with complementary anticodons, we present updated evidence of concerted complementarity of the second bases in the acceptor stems. These two results suggest that the first pairs of "complementary" amino acids that were engaged in primordial coding, such as Gly and Ala, could have avoided erroneous aminoacylation if and only if the acceptor stems of their adaptors were recognized from the same, major groove, side. The class II protein synthetases then inherited this "primary preference" from isofunctional ribozymes. Conclusion Taken together, our results support the hypothesis that the genetic code per se (the one associated with the anticodons) and the operational code of aminoacylation (associated with the acceptor) diverged from a common ancestor that probably began developing before translation. The primordial advantage of linking some amino acids (most likely glycine and alanine) to the ancestral acceptor stem may have been selective retention in a protocell surrounded by a leaky membrane for use in nucleotide and coenzyme synthesis. Such acceptor stems (as cofactors) thus transferred amino acids as groups for biosynthesis. Later, with the advent of an anticodon loop, some amino acids (such as aspartic acid, histidine, arginine) assumed a catalytic role while bound to such extended adaptors, in line with the original coding coenzyme handle (CCH) hypothesis. Reviewers This article was reviewed by Rob Knight, Juergen Brosius and Anthony Poole. PMID:19173731

  14. An intergenic non-coding rRNA correlated with expression of the rRNA and frequency of an rRNA single nucleotide polymorphism in lung cancer cells.

    PubMed

    Shiao, Yih-Horng; Lupascu, Sorin T; Gu, Yuhan D; Kasprzak, Wojciech; Hwang, Christopher J; Fields, Janet R; Leighty, Robert M; Quiñones, Octavio; Shapiro, Bruce A; Alvord, W Gregory; Anderson, Lucy M

    2009-10-19

    Ribosomal RNA (rRNA) is a central regulator of cell growth and may control cancer development. A cis noncoding rRNA (nc-rRNA) upstream from the 45S rRNA transcription start site has recently been implicated in control of rRNA transcription in mouse fibroblasts. We investigated whether a similar nc-rRNA might be expressed in human cancer epithelial cells, and related to any genomic characteristics. Using quantitative rRNA measurement, we demonstrated that a nc-rRNA is transcribed in human lung epithelial and lung cancer cells, starting from approximately -1000 nucleotides upstream of the rRNA transcription start site (+1) and extending at least to +203. This nc-rRNA was significantly more abundant in the majority of lung cancer cell lines, relative to a nontransformed lung epithelial cell line. Its abundance correlated negatively with total 45S rRNA in 12 of 13 cell lines (P = 0.014). During sequence analysis from -388 to +306, we observed diverse, frequent intercopy single nucleotide polymorphisms (SNPs) in rRNA, with a frequency greater than predicted by chance at 12 sites. A SNP at +139 (U/C) in the 5' leader sequence varied among the cell lines and correlated negatively with level of the nc-rRNA (P = 0.014). Modelling of the secondary structure of the rRNA 5'-leader sequence indicated a small increase in structural stability due to the +139 U/C SNP and a minor shift in local configuration occurrences. The results demonstrate occurrence of a sense nc-rRNA in human lung epithelial and cancer cells, and imply a role in regulation of the rRNA gene, which may be affected by a +139 SNP in the 5' leader sequence of the primary rRNA transcript.

  15. Correction of the consequences of mitochondrial 3243A>G mutation in the MT-TL1 gene causing the MELAS syndrome by tRNA import into mitochondria.

    PubMed

    Karicheva, Olga Z; Kolesnikova, Olga A; Schirtz, Tom; Vysokikh, Mikhail Y; Mager-Heckel, Anne-Marie; Lombès, Anne; Boucheham, Abdeldjalil; Krasheninnikov, Igor A; Martin, Robert P; Entelis, Nina; Tarassov, Ivan

    2011-10-01

    Mutations in human mitochondrial DNA are often associated with incurable human neuromuscular diseases. Among these mutations, an important number have been identified in tRNA genes, including 29 in the gene MT-TL1 coding for the tRNA(Leu(UUR)). The m.3243A>G mutation was described as the major cause of the MELAS syndrome (mitochondrial encephalomyopathy with lactic acidosis and stroke-like episodes). This mutation was reported to reduce tRNA(Leu(UUR)) aminoacylation and modification of its anti-codon wobble position, which results in a defective mitochondrial protein synthesis and reduced activities of respiratory chain complexes. In the present study, we have tested whether the mitochondrial targeting of recombinant tRNAs bearing the identity elements for human mitochondrial leucyl-tRNA synthetase can rescue the phenotype caused by MELAS mutation in human transmitochondrial cybrid cells. We demonstrate that nuclear expression and mitochondrial targeting of specifically designed transgenic tRNAs results in an improvement of mitochondrial translation, increased levels of mitochondrial DNA-encoded respiratory complexes subunits, and significant rescue of respiration. These findings prove the possibility to direct tRNAs with changed aminoacylation specificities into mitochondria, thus extending the potential therapeutic strategy of allotopic expression to address mitochondrial disorders.

  16. The crystal structure of the Split End protein SHARP adds a new layer of complexity to proteins containing RNA recognition motifs

    PubMed Central

    Arieti, Fabiana; Gabus, Caroline; Tambalo, Margherita; Huet, Tiphaine; Round, Adam; Thore, Stéphane

    2014-01-01

    The Split Ends (SPEN) protein was originally discovered in Drosophila in the late 1990s. Since then, homologous proteins have been identified in eukaryotic species ranging from plants to humans. Every family member contains three predicted RNA recognition motifs (RRMs) in the N-terminal region of the protein. We have determined the crystal structure of the region of the human SPEN homolog that contains these RRMs—the SMRT/HDAC1 Associated Repressor Protein (SHARP), at 2.0 Å resolution. SHARP is a co-regulator of the nuclear receptors. We demonstrate that two of the three RRMs, namely RRM3 and RRM4, interact via a highly conserved interface. Furthermore, we show that the RRM3–RRM4 block is the main platform mediating the stable association with the H12–H13 substructure found in the steroid receptor RNA activator (SRA), a long, non-coding RNA previously shown to play a crucial role in nuclear receptor transcriptional regulation. We determine that SHARP association with SRA relies on both single- and double-stranded RNA sequences. The crystal structure of the SHARP–RRM fragment, together with the associated RNA-binding studies, extend the repertoire of nucleic acid binding properties of RRM domains suggesting a new hypothesis for a better understanding of SPEN protein functions. PMID:24748666

  17. miRNA-dependent gene silencing involving Ago2-mediated cleavage of a circular antisense RNA

    PubMed Central

    Hansen, Thomas B; Wiklund, Erik D; Bramsen, Jesper B; Villadsen, Sune B; Statham, Aaron L; Clark, Susan J; Kjems, Jørgen

    2011-01-01

    MicroRNAs (miRNAs) are ∼22 nt non-coding RNAs that typically bind to the 3′ UTR of target mRNAs in the cytoplasm, resulting in mRNA destabilization and translational repression. Here, we report that miRNAs can also regulate gene expression by targeting non-coding antisense transcripts in human cells. Specifically, we show that miR-671 directs cleavage of a circular antisense transcript of the Cerebellar Degeneration-Related protein 1 (CDR1) locus in an Ago2-slicer-dependent manner. The resulting downregulation of circular antisense has a concomitant decrease in CDR1 mRNA levels, independently of heterochromatin formation. This study provides the first evidence for non-coding antisense transcripts as functional miRNA targets, and a novel regulatory mechanism involving a positive correlation between mRNA and antisense circular RNA levels. PMID:21964070

  18. Auto-Regulatory RNA Editing Fine-Tunes mRNA Re-Coding and Complex Behaviour in Drosophila

    PubMed Central

    Savva, Yiannis A.; Jepson, James E.C; Sahin, Asli; Sugden, Arthur U.; Dorsky, Jacquelyn S.; Alpert, Lauren; Lawrence, Charles; Reenan, Robert A.

    2014-01-01

    Auto-regulatory feedback loops are a common molecular strategy used to optimize protein function. In Drosophila many mRNAs involved in neuro-transmission are re-coded at the RNA level by the RNA editing enzyme dADAR, leading to the incorporation of amino acids that are not directly encoded by the genome. dADAR also re-codes its own transcript, but the consequences of this auto-regulation in vivo are unclear. Here we show that hard-wiring or abolishing endogenous dADAR auto-regulation dramatically remodels the landscape of re-coding events in a site-specific manner. These molecular phenotypes correlate with altered localization of dADAR within the nuclear compartment. Furthermore, auto-editing exhibits sexually dimorphic patterns of spatial regulation and can be modified by abiotic environmental factors. Finally, we demonstrate that modifying dAdar auto-editing affects adaptive complex behaviors. Our results reveal the in vivo relevance of auto-regulatory control over post-transcriptional mRNA re-coding events in fine-tuning brain function and organismal behavior. PMID:22531175

  19. Standing your Ground to Exoribonucleases: Function of Flavivirus Long Non-coding RNAs

    PubMed Central

    Charley, Phillida A.; Wilusz, Jeffrey

    2015-01-01

    Members of the Flaviviridae (e.g. Dengue virus, West Nile virus, and Hepatitis C virus) contain a positive-sense RNA genome that encodes a large polyprotein. It is now also clear most if not all of these viruses also produce an abundant subgenomic long non-coding RNA. These non-coding RNAs, which are called subgenomicflavivirus RNAs (sfRNAs) or Xrn1-resistant RNAs (xrRNAs), are stable decay intermediates generated from the viral genomic RNA through the stalling of the cellular exoribonuclease Xrn1 at highly structured regions. Several functions of these flavivirus long non-coding RNAs have been revealed in recent years. The generation of these sfRNAs/xrRNAs from viral transcripts results in the repression of Xrn1 and the dysregulation of cellular mRNA stability. The abundant sfRNAs also serve directly as a decoy for important cellular protein regulators of the interferon and RNA interference antiviral pathways. Thus the generation of long non-coding RNAs from flaviviruses, hepaciviruses and pestiviruses likely disrupts aspects of innate immunity and may directly contribute to viral replication, cytopathology and pathogenesis. PMID:26368052

  20. nRC: non-coding RNA Classifier based on structural features.

    PubMed

    Fiannaca, Antonino; La Rosa, Massimo; La Paglia, Laura; Rizzo, Riccardo; Urso, Alfonso

    2017-01-01

    Non-coding RNA (ncRNA) are small non-coding sequences involved in gene expression regulation of many biological processes and diseases. The recent discovery of a large set of different ncRNAs with biologically relevant roles has opened the way to develop methods able to discriminate between the different ncRNA classes. Moreover, the lack of knowledge about the complete mechanisms in regulative processes, together with the development of high-throughput technologies, has required the help of bioinformatics tools in addressing biologists and clinicians with a deeper comprehension of the functional roles of ncRNAs. In this work, we introduce a new ncRNA classification tool, nRC (non-coding RNA Classifier). Our approach is based on features extraction from the ncRNA secondary structure together with a supervised classification algorithm implementing a deep learning architecture based on convolutional neural networks. We tested our approach for the classification of 13 different ncRNA classes. We obtained classification scores, using the most common statistical measures. In particular, we reach an accuracy and sensitivity score of about 74%. The proposed method outperforms other similar classification methods based on secondary structure features and machine learning algorithms, including the RNAcon tool that, to date, is the reference classifier. nRC tool is freely available as a docker image at https://hub.docker.com/r/tblab/nrc/. The source code of nRC tool is also available at https://github.com/IcarPA-TBlab/nrc.

  1. DIANA-LncBase v2: indexing microRNA targets on non-coding transcripts.

    PubMed

    Paraskevopoulou, Maria D; Vlachos, Ioannis S; Karagkouni, Dimitra; Georgakilas, Georgios; Kanellos, Ilias; Vergoulis, Thanasis; Zagganas, Konstantinos; Tsanakas, Panayiotis; Floros, Evangelos; Dalamagas, Theodore; Hatzigeorgiou, Artemis G

    2016-01-04

    microRNAs (miRNAs) are short non-coding RNAs (ncRNAs) that act as post-transcriptional regulators of coding gene expression. Long non-coding RNAs (lncRNAs) have been recently reported to interact with miRNAs. The sponge-like function of lncRNAs introduces an extra layer of complexity in the miRNA interactome. DIANA-LncBase v1 provided a database of experimentally supported and in silico predicted miRNA Recognition Elements (MREs) on lncRNAs. The second version of LncBase (www.microrna.gr/LncBase) presents an extensive collection of miRNA:lncRNA interactions. The significantly enhanced database includes more than 70 000 low and high-throughput, (in)direct miRNA:lncRNA experimentally supported interactions, derived from manually curated publications and the analysis of 153 AGO CLIP-Seq libraries. The new experimental module presents a 14-fold increase compared to the previous release. LncBase v2 hosts in silico predicted miRNA targets on lncRNAs, identified with the DIANA-microT algorithm. The relevant module provides millions of predicted miRNA binding sites, accompanied with detailed metadata and MRE conservation metrics. LncBase v2 caters information regarding cell type specific miRNA:lncRNA regulation and enables users to easily identify interactions in 66 different cell types, spanning 36 tissues for human and mouse. Database entries are also supported by accurate lncRNA expression information, derived from the analysis of more than 6 billion RNA-Seq reads. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  2. The complete nucleotide sequence of the domestic dog (Canis familiaris) mitochondrial genome.

    PubMed

    Kim, K S; Lee, S E; Jeong, H W; Ha, J H

    1998-10-01

    The complete nucleotide sequence of the mitochondrial genome of the domestic dog, Canis familiaris, was determined. The length of the sequence was 16,728 bp; however, the length was not absolute due to the variation (heteroplasmy) caused by differing numbers of the repetitive motif, 5'-GTACACGT(A/G)C-3', in the control region. The genome organization, gene contents, and codon usage conformed to those of other mammalian mitochondrial genomes. Although its features were unknown, the "CTAGA" duplication event which followed the translational stop codon of the COII gene was not observed in other mammalian mitochondrial genomes. In order to determine the possible differences between mtDNAs in carnivores, two rRNA and 13 protein-coding genes from the cat, dog, and seal were compared. The combined molecular differences, in two rRNA genes as well as in the inferred amino acid sequences of the mitochondrial 13 protein-coding genes, suggested that there is a closer relationship between the dog and the seal than there is between either of these species and the cat. Based on the molecular differences of the mtDNA, the evolutionary divergence between the cat, the dog, and the seal was dated to approximately 50 +/- 4 million years ago. The degree of difference between carnivore mtDNAs varied according to the individual protein-coding gene applied, showing that the evolutionary relationships of distantly related species should be presented in an extended study based on ample sequence data like complete mtDNA molecules. Copyright 1998 Academic Press.

  3. A subset of conserved mammalian long non-coding RNAs are fossils of ancestral protein-coding genes.

    PubMed

    Hezroni, Hadas; Ben-Tov Perry, Rotem; Meir, Zohar; Housman, Gali; Lubelsky, Yoav; Ulitsky, Igor

    2017-08-30

    Only a small portion of human long non-coding RNAs (lncRNAs) appear to be conserved outside of mammals, but the events underlying the birth of new lncRNAs in mammals remain largely unknown. One potential source is remnants of protein-coding genes that transitioned into lncRNAs. We systematically compare lncRNA and protein-coding loci across vertebrates, and estimate that up to 5% of conserved mammalian lncRNAs are derived from lost protein-coding genes. These lncRNAs have specific characteristics, such as broader expression domains, that set them apart from other lncRNAs. Fourteen lncRNAs have sequence similarity with the loci of the contemporary homologs of the lost protein-coding genes. We propose that selection acting on enhancer sequences is mostly responsible for retention of these regions. As an example of an RNA element from a protein-coding ancestor that was retained in the lncRNA, we describe in detail a short translated ORF in the JPX lncRNA that was derived from an upstream ORF in a protein-coding gene and retains some of its functionality. We estimate that ~ 55 annotated conserved human lncRNAs are derived from parts of ancestral protein-coding genes, and loss of coding potential is thus a non-negligible source of new lncRNAs. Some lncRNAs inherited regulatory elements influencing transcription and translation from their protein-coding ancestors and those elements can influence the expression breadth and functionality of these lncRNAs.

  4. Tau mRNA 3'UTR-to-CDS ratio is increased in Alzheimer disease.

    PubMed

    García-Escudero, Vega; Gargini, Ricardo; Martín-Maestro, Patricia; García, Esther; García-Escudero, Ramón; Avila, Jesús

    2017-08-10

    Neurons frequently show an imbalance in expression of the 3' untranslated region (3'UTR) relative to the coding DNA sequence (CDS) region of mature messenger RNAs (mRNA). The ratio varies among different cells or parts of the brain. The Map2 protein levels per cell depend on the 3'UTR-to-CDS ratio rather than the total mRNA amount, which suggests powerful regulation of protein expression by 3'UTR sequences. Here we found that MAPT (the microtubule-associated protein tau gene) 3'UTR levels are particularly high with respect to other genes; indeed, the 3'UTR-to-CDS ratio of MAPT is balanced in healthy brain in mouse and human. The tau protein accumulates in Alzheimer diseased brain. We nonetheless observed that the levels of RNA encoding MAPT/tau were diminished in these patients' brains. To explain this apparently contradictory result, we studied MAPT mRNA stoichiometry in coding and non-coding regions, and found that the 3'UTR-to-CDS ratio was higher in the hippocampus of Alzheimer disease patients, with higher tau protein but lower total mRNA levels. Our data indicate that changes in the 3'UTR-to-CDS ratio have a regulatory role in the disease. Future research should thus consider not only mRNA levels, but also the ratios between coding and non-coding regions. Copyright © 2017 Elsevier B.V. All rights reserved.

  5. Circular non-coding RNA ANRIL modulates ribosomal RNA maturation and atherosclerosis in humans

    PubMed Central

    Holdt, Lesca M.; Stahringer, Anika; Sass, Kristina; Pichler, Garwin; Kulak, Nils A.; Wilfert, Wolfgang; Kohlmaier, Alexander; Herbst, Andreas; Northoff, Bernd H.; Nicolaou, Alexandros; Gäbel, Gabor; Beutner, Frank; Scholz, Markus; Thiery, Joachim; Musunuru, Kiran; Krohn, Knut; Mann, Matthias; Teupser, Daniel

    2016-01-01

    Circular RNAs (circRNAs) are broadly expressed in eukaryotic cells, but their molecular mechanism in human disease remains obscure. Here we show that circular antisense non-coding RNA in the INK4 locus (circANRIL), which is transcribed at a locus of atherosclerotic cardiovascular disease on chromosome 9p21, confers atheroprotection by controlling ribosomal RNA (rRNA) maturation and modulating pathways of atherogenesis. CircANRIL binds to pescadillo homologue 1 (PES1), an essential 60S-preribosomal assembly factor, thereby impairing exonuclease-mediated pre-rRNA processing and ribosome biogenesis in vascular smooth muscle cells and macrophages. As a consequence, circANRIL induces nucleolar stress and p53 activation, resulting in the induction of apoptosis and inhibition of proliferation, which are key cell functions in atherosclerosis. Collectively, these findings identify circANRIL as a prototype of a circRNA regulating ribosome biogenesis and conferring atheroprotection, thereby showing that circularization of long non-coding RNAs may alter RNA function and protect from human disease. PMID:27539542

  6. Mapping of RNA accessible sites by extension of random oligonucleotide libraries with reverse transcriptase.

    PubMed Central

    Allawi, H T; Dong, F; Ip, H S; Neri, B P; Lyamichev, V I

    2001-01-01

    A rapid and simple method for determining accessible sites in RNA that is independent of the length of target RNA and does not require RNA labeling is described. In this method, target RNA is allowed to hybridize with sequence-randomized libraries of DNA oligonucleotides linked to a common tag sequence at their 5'-end. Annealed oligonucleotides are extended with reverse transcriptase and the extended products are then amplified by using PCR with a primer corresponding to the tag sequence and a second primer specific to the target RNA sequence. We used the combination of both the lengths of the RT-PCR products and the location of the binding site of the RNA-specific primer to determine which regions of the RNA molecules were RNA extendible sites, that is, sites available for oligonucleotide binding and extension. We then employed this reverse transcription with the random oligonucleotide libraries (RT-ROL) method to determine the accessible sites on four mRNA targets, human activated ras (ha-ras), human intercellular adhesion molecule-1 (ICAM-1), rabbit beta-globin, and human interferon-gamma (IFN-gamma). Our results were concordant with those of other researchers who had used RNase H cleavage or hybridization with arrays of oligonucleotides to identify accessible sites on some of these targets. Further, we found good correlation between sites when we compared the location of extendible sites identified by RT-ROL with hybridization sites of effective antisense oligonucleotides on ICAM-1 mRNA in antisense inhibition studies. Finally, we discuss the relationship between RNA extendible sites and RNA accessibility. PMID:11233988

  7. RNA-seq reveals distinctive RNA profiles of small extracellular vesicles from different human liver cancer cell lines

    PubMed Central

    Berardocco, Martina; Radeghieri, Annalisa; Busatto, Sara; Gallorini, Marialucia; Raggi, Chiara; Gissi, Clarissa; D’Agnano, Igea; Bergese, Paolo; Felsani, Armando; Berardi, Anna C.

    2017-01-01

    Liver cancer (LC) is one of the most common cancers and represents the third highest cause of cancer-related deaths worldwide. Extracellular vesicle (EVs) cargoes, which are selectively enriched in RNA, offer great promise for the diagnosis, prognosis and treatment of LC. Our study analyzed the RNA cargoes of EVs derived from 4 liver-cancer cell lines: HuH7, Hep3B, HepG2 (hepato-cellular carcinoma) and HuH6 (hepatoblastoma), generating two different sets of sequencing libraries for each. One library was size-selected for small RNAs and the other targeted the whole transcriptome. Here are reported genome wide data of the expression level of coding and non-coding transcripts, microRNAs, isomiRs and snoRNAs providing the first comprehensive overview of the extracellular-vesicle RNA cargo released from LC cell lines. The EV-RNA expression profiles of the four liver cancer cell lines share a similar background, but cell-specific features clearly emerge showing the marked heterogeneity of the EV-cargo among the individual cell lines, evident both for the coding and non-coding RNA species. PMID:29137313

  8. The "periodic table" of the genetic code: A new way to look at the code and the decoding process.

    PubMed

    Komar, Anton A

    2016-01-01

    Henri Grosjean and Eric Westhof recently presented an information-rich, alternative view of the genetic code, which takes into account current knowledge of the decoding process, including the complex nature of interactions between mRNA, tRNA and rRNA that take place during protein synthesis on the ribosome, and it also better reflects the evolution of the code. The new asymmetrical circular genetic code has a number of advantages over the traditional codon table and the previous circular diagrams (with a symmetrical/clockwise arrangement of the U, C, A, G bases). Most importantly, all sequence co-variances can be visualized and explained based on the internal logic of the thermodynamics of codon-anticodon interactions.

  9. The Long Non-coding RNA HOTTIP Enhances Pancreatic Cancer Cell Proliferation, Survival and Migration

    EPA Science Inventory

    ABSTRACTHOTTIP is a long non-coding RNA (lncRNA) transcribed from the 5' tip of the HOXA locus and is associated with the polycomb repressor complex 2 (PRC2) and WD repeat containing protein 5 (WDR5)/mixed lineage leukemia 1 (MLL1) chromatin modifying complexes. HOTTIP is expres...

  10. The RNA Exosome Adaptor ZFC3H1 Functionally Competes with Nuclear Export Activity to Retain Target Transcripts.

    PubMed

    Silla, Toomas; Karadoulama, Evdoxia; Mąkosa, Dawid; Lubas, Michal; Jensen, Torben Heick

    2018-05-15

    Mammalian genomes are promiscuously transcribed, yielding protein-coding and non-coding products. Many transcripts are short lived due to their nuclear degradation by the ribonucleolytic RNA exosome. Here, we show that abolished nuclear exosome function causes the formation of distinct nuclear foci, containing polyadenylated (pA + ) RNA secluded from nucleocytoplasmic export. We asked whether exosome co-factors could serve such nuclear retention. Co-localization studies revealed the enrichment of pA + RNA foci with "pA-tail exosome targeting (PAXT) connection" components MTR4, ZFC3H1, and PABPN1 but no overlap with known nuclear structures such as Cajal bodies, speckles, paraspeckles, or nucleoli. Interestingly, ZFC3H1 is required for foci formation, and in its absence, selected pA + RNAs, including coding and non-coding transcripts, are exported to the cytoplasm in a process dependent on the mRNA export factor AlyREF. Our results establish ZFC3H1 as a central nuclear pA + RNA retention factor, counteracting nuclear export activity. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.

  11. The Maximal C3 Self-Complementary Trinucleotide Circular Code X in Genes of Bacteria, Archaea, Eukaryotes, Plasmids and Viruses

    PubMed Central

    Michel, Christian J.

    2017-01-01

    In 1996, a set X of 20 trinucleotides was identified in genes of both prokaryotes and eukaryotes which has on average the highest occurrence in reading frame compared to its two shifted frames. Furthermore, this set X has an interesting mathematical property as X is a maximal C3 self-complementary trinucleotide circular code. In 2015, by quantifying the inspection approach used in 1996, the circular code X was confirmed in the genes of bacteria and eukaryotes and was also identified in the genes of plasmids and viruses. The method was based on the preferential occurrence of trinucleotides among the three frames at the gene population level. We extend here this definition at the gene level. This new statistical approach considers all the genes, i.e., of large and small lengths, with the same weight for searching the circular code X. As a consequence, the concept of circular code, in particular the reading frame retrieval, is directly associated to each gene. At the gene level, the circular code X is strengthened in the genes of bacteria, eukaryotes, plasmids, and viruses, and is now also identified in the genes of archaea. The genes of mitochondria and chloroplasts contain a subset of the circular code X. Finally, by studying viral genes, the circular code X was found in DNA genomes, RNA genomes, double-stranded genomes, and single-stranded genomes. PMID:28420220

  12. The origins and evolutionary history of human non-coding RNA regulatory networks.

    PubMed

    Sherafatian, Masih; Mowla, Seyed Javad

    2017-04-01

    The evolutionary history and origin of the regulatory function of animal non-coding RNAs are not well understood. Lack of conservation of long non-coding RNAs and small sizes of microRNAs has been major obstacles in their phylogenetic analysis. In this study, we tried to shed more light on the evolution of ncRNA regulatory networks by changing our phylogenetic strategy to focus on the evolutionary pattern of their protein coding targets. We used available target databases of miRNAs and lncRNAs to find their protein coding targets in human. We were able to recognize evolutionary hallmarks of ncRNA targets by phylostratigraphic analysis. We found the conventional 3'-UTR and lesser known 5'-UTR targets of miRNAs to be enriched at three consecutive phylostrata. Firstly, in eukaryata phylostratum corresponding to the emergence of miRNAs, our study revealed that miRNA targets function primarily in cell cycle processes. Moreover, the same overrepresentation of the targets observed in the next two consecutive phylostrata, opisthokonta and eumetazoa, corresponded to the expansion periods of miRNAs in animals evolution. Coding sequence targets of miRNAs showed a delayed rise at opisthokonta phylostratum, compared to the 3' and 5' UTR targets of miRNAs. LncRNA regulatory network was the latest to evolve at eumetazoa.

  13. Evolutionary analysis reveals regulatory and functional landscape of coding and non-coding RNA editing.

    PubMed

    Zhang, Rui; Deng, Patricia; Jacobson, Dionna; Li, Jin Billy

    2017-02-01

    Adenosine-to-inosine RNA editing diversifies the transcriptome and promotes functional diversity, particularly in the brain. A plethora of editing sites has been recently identified; however, how they are selected and regulated and which are functionally important are largely unknown. Here we show the cis-regulation and stepwise selection of RNA editing during Drosophila evolution and pinpoint a large number of functional editing sites. We found that the establishment of editing and variation in editing levels across Drosophila species are largely explained and predicted by cis-regulatory elements. Furthermore, editing events that arose early in the species tree tend to be more highly edited in clusters and enriched in slowly-evolved neuronal genes, thus suggesting that the main role of RNA editing is for fine-tuning neurological functions. While nonsynonymous editing events have been long recognized as playing a functional role, in addition to nonsynonymous editing sites, a large fraction of 3'UTR editing sites is evolutionarily constrained, highly edited, and thus likely functional. We find that these 3'UTR editing events can alter mRNA stability and affect miRNA binding and thus highlight the functional roles of noncoding RNA editing. Our work, through evolutionary analyses of RNA editing in Drosophila, uncovers novel insights of RNA editing regulation as well as its functions in both coding and non-coding regions.

  14. Evolutionary analysis reveals regulatory and functional landscape of coding and non-coding RNA editing

    PubMed Central

    Jacobson, Dionna

    2017-01-01

    Adenosine-to-inosine RNA editing diversifies the transcriptome and promotes functional diversity, particularly in the brain. A plethora of editing sites has been recently identified; however, how they are selected and regulated and which are functionally important are largely unknown. Here we show the cis-regulation and stepwise selection of RNA editing during Drosophila evolution and pinpoint a large number of functional editing sites. We found that the establishment of editing and variation in editing levels across Drosophila species are largely explained and predicted by cis-regulatory elements. Furthermore, editing events that arose early in the species tree tend to be more highly edited in clusters and enriched in slowly-evolved neuronal genes, thus suggesting that the main role of RNA editing is for fine-tuning neurological functions. While nonsynonymous editing events have been long recognized as playing a functional role, in addition to nonsynonymous editing sites, a large fraction of 3’UTR editing sites is evolutionarily constrained, highly edited, and thus likely functional. We find that these 3’UTR editing events can alter mRNA stability and affect miRNA binding and thus highlight the functional roles of noncoding RNA editing. Our work, through evolutionary analyses of RNA editing in Drosophila, uncovers novel insights of RNA editing regulation as well as its functions in both coding and non-coding regions. PMID:28166241

  15. Piecemeal Buildup of the Genetic Code, Ribosomes, and Genomes from Primordial tRNA Building Blocks

    PubMed Central

    Caetano-Anollés, Derek; Caetano-Anollés, Gustavo

    2016-01-01

    The origin of biomolecular machinery likely centered around an ancient and central molecule capable of interacting with emergent macromolecular complexity. tRNA is the oldest and most central nucleic acid molecule of the cell. Its co-evolutionary interactions with aminoacyl-tRNA synthetase protein enzymes define the specificities of the genetic code and those with the ribosome their accurate biosynthetic interpretation. Phylogenetic approaches that focus on molecular structure allow reconstruction of evolutionary timelines that describe the history of RNA and protein structural domains. Here we review phylogenomic analyses that reconstruct the early history of the synthetase enzymes and the ribosome, their interactions with RNA, and the inception of amino acid charging and codon specificities in tRNA that are responsible for the genetic code. We also trace the age of domains and tRNA onto ancient tRNA homologies that were recently identified in rRNA. Our findings reveal a timeline of recruitment of tRNA building blocks for the formation of a functional ribosome, which holds both the biocatalytic functions of protein biosynthesis and the ability to store genetic memory in primordial RNA genomic templates. PMID:27918435

  16. Piecemeal Buildup of the Genetic Code, Ribosomes, and Genomes from Primordial tRNA Building Blocks.

    PubMed

    Caetano-Anollés, Derek; Caetano-Anollés, Gustavo

    2016-12-02

    The origin of biomolecular machinery likely centered around an ancient and central molecule capable of interacting with emergent macromolecular complexity. tRNA is the oldest and most central nucleic acid molecule of the cell. Its co-evolutionary interactions with aminoacyl-tRNA synthetase protein enzymes define the specificities of the genetic code and those with the ribosome their accurate biosynthetic interpretation. Phylogenetic approaches that focus on molecular structure allow reconstruction of evolutionary timelines that describe the history of RNA and protein structural domains. Here we review phylogenomic analyses that reconstruct the early history of the synthetase enzymes and the ribosome, their interactions with RNA, and the inception of amino acid charging and codon specificities in tRNA that are responsible for the genetic code. We also trace the age of domains and tRNA onto ancient tRNA homologies that were recently identified in rRNA. Our findings reveal a timeline of recruitment of tRNA building blocks for the formation of a functional ribosome, which holds both the biocatalytic functions of protein biosynthesis and the ability to store genetic memory in primordial RNA genomic templates.

  17. Ribosome biogenesis in replicating cells: Integration of experiment and theory.

    PubMed

    Earnest, Tyler M; Cole, John A; Peterson, Joseph R; Hallock, Michael J; Kuhlman, Thomas E; Luthey-Schulten, Zaida

    2016-10-01

    Ribosomes-the primary macromolecular machines responsible for translating the genetic code into proteins-are complexes of precisely folded RNA and proteins. The ways in which their production and assembly are managed by the living cell is of deep biological importance. Here we extend a recent spatially resolved whole-cell model of ribosome biogenesis in a fixed volume [Earnest et al., Biophys J 2015, 109, 1117-1135] to include the effects of growth, DNA replication, and cell division. All biological processes are described in terms of reaction-diffusion master equations and solved stochastically using the Lattice Microbes simulation software. In order to determine the replication parameters, we construct and analyze a series of Escherichia coli strains with fluorescently labeled genes distributed evenly throughout their chromosomes. By measuring these cells' lengths and number of gene copies at the single-cell level, we could fit a statistical model of the initiation and duration of chromosome replication. We found that for our slow-growing (120 min doubling time) E. coli cells, replication was initiated 42 min into the cell cycle and completed after an additional 42 min. While simulations of the biogenesis model produce the correct ribosome and mRNA counts over the cell cycle, the kinetic parameters for transcription and degradation are lower than anticipated from a recent analytical time dependent model of in vivo mRNA production. Describing expression in terms of a simple chemical master equation, we show that the discrepancies are due to the lack of nonribosomal genes in the extended biogenesis model which effects the competition of mRNA for ribosome binding, and suggest corrections to parameters to be used in the whole-cell model when modeling expression of the entire transcriptome. © 2016 Wiley Periodicals, Inc. Biopolymers 105: 735-751, 2016. © 2016 Wiley Periodicals, Inc.

  18. A Celiac Diasease Associated lncRNA Named HCG14 Regulates NOD1 Expression in Intestinal Cells.

    PubMed

    Santin, Izortze; Jauregi-Miguel, Amaia; Velayos, Teresa; Castellanos-Rubio, Ainara; Garcia-Etxebarria, Koldo; Romero-Garmendia, Irati; Fernandez-Jimenez, Nora; Irastorza, Iñaki; Castaño, Luis; Bilbao, Jose Ramón

    2018-03-29

    To identify additional celiac disease associated loci in the Major Histocompatibility Complex independent from classical HLA risk alleles (HLA-DR3-DQ2) and to characterize their potential functional impact in celiac disease pathogenesis at the intestinal level. We performed a high resolution SNP genotyping of the MHC region, comparing HLA-DR3 homozygous celiac patients and non-celiac controls carrying a single copy of the B8-DR3-DQ2 conserved extended haplotype. Expression level of potential novel risk genes was determined by RT-PCR in intestinal biopsies and in intestinal and immune cells isolated from control and celiac individuals. Small interfering RNA-driven silencing of selected genes was performed in the intestinal cell line T84. MHC genotyping revealed two associated SNPs, one located in TRIM27 gene and another in the non-coding gene HCG14. After stratification analysis, only HCG14 showed significant association independent from HLA-DR-DQ loci Expression of HCG14 was slightly downregulated in epithelial cells isolated from duodenal biopsies of celiac patients, and eQTL analysis revealed that polymorphisms in HCG14 region were associated with decreased NOD1 expression in duodenal intestinal cells. We have sucessfully employed a conserved extended haplotype-matching strategy and identified a novel additional celiac disease risk variant in the lncRNA HGC14. This lncRNA seems to regulate the expression of NOD1 in an allele-specific manner. Further functional studies are needed to clarify the role of HCG14 in the regulation of gene expression and to determine the molecular mechanisms by which the risk variant in HCG14 contributes to celiac disease pathogenesis.

  19. The complete mitochondrial genome of Papilio glaucus and its phylogenetic implications.

    PubMed

    Shen, Jinhui; Cong, Qian; Grishin, Nick V

    2015-09-01

    Due to the intriguing morphology, lifecycle, and diversity of butterflies and moths, Lepidoptera are emerging as model organisms for the study of genetics, evolution and speciation. The progress of these studies relies on decoding Lepidoptera genomes, both nuclear and mitochondrial. Here we describe a protocol to obtain mitogenomes from Next Generation Sequencing reads performed for whole-genome sequencing and report the complete mitogenome of Papilio (Pterourus) glaucus. The circular mitogenome is 15,306 bp in length and rich in A and T. It contains 13 protein-coding genes (PCGs), 22 transfer-RNA-coding genes (tRNA), and 2 ribosomal-RNA-coding genes (rRNA), with a gene order typical for mitogenomes of Lepidoptera. We performed phylogenetic analyses based on PCG and RNA-coding genes or protein sequences using Bayesian Inference and Maximum Likelihood methods. The phylogenetic trees consistently show that among species with available mitogenomes Papilio glaucus is the closest to Papilio (Agehana) maraho from Asia.

  20. Novel coding, translation, and gene expression of a replicating covalently closed circular RNA of 220 nt.

    PubMed

    AbouHaidar, Mounir Georges; Venkataraman, Srividhya; Golshani, Ashkan; Liu, Bolin; Ahmad, Tauqeer

    2014-10-07

    The highly structured (64% GC) covalently closed circular (CCC) RNA (220 nt) of the virusoid associated with rice yellow mottle virus codes for a 16-kDa highly basic protein using novel modalities for coding, translation, and gene expression. This CCC RNA is the smallest among all known viroids and virusoids and the only one that codes proteins. Its sequence possesses an internal ribosome entry site and is directly translated through two (or three) completely overlapping ORFs (shifting to a new reading frame at the end of each round). The initiation and termination codons overlap UGAUGA (underline highlights the initiation codon AUG within the combined initiation-termination sequence). Termination codons can be ignored to obtain larger read-through proteins. This circular RNA with no noncoding sequences is a unique natural supercompact "nanogenome."

  1. Ab initio reconstruction of transcriptomes of pluripotent and lineage committed cells reveals gene structures of thousands of lincRNAs

    PubMed Central

    Guttman, Mitchell; Garber, Manuel; Levin, Joshua Z.; Donaghey, Julie; Robinson, James; Adiconis, Xian; Fan, Lin; Koziol, Magdalena J.; Gnirke, Andreas; Nusbaum, Chad; Rinn, John L.; Lander, Eric S.; Regev, Aviv

    2010-01-01

    RNA-Seq provides an unbiased way to study a transcriptome, including both coding and non-coding genes. To date, most RNA-Seq studies have critically depended on existing annotations, and thus focused on expression levels and variation in known transcripts. Here, we present Scripture, a method to reconstruct the transcriptome of a mammalian cell using only RNA-Seq reads and the genome sequence. We apply it to mouse embryonic stem cells, neuronal precursor cells, and lung fibroblasts to accurately reconstruct the full-length gene structures for the vast majority of known expressed genes. We identify substantial variation in protein-coding genes, including thousands of novel 5′-start sites, 3′-ends, and internal coding exons. We then determine the gene structures of over a thousand lincRNA and antisense loci. Our results open the way to direct experimental manipulation of thousands of non-coding RNAs, and demonstrate the power of ab initio reconstruction to render a comprehensive picture of mammalian transcriptomes. PMID:20436462

  2. Connections Underlying Translation and mRNA Stability.

    PubMed

    Radhakrishnan, Aditya; Green, Rachel

    2016-09-11

    Gene expression and regulation in organisms minimally depends on transcription by RNA polymerase and on the stability of the RNA product (for both coding and non-coding RNAs). For coding RNAs, gene expression is further influenced by the amount of translation by the ribosome and by the stability of the protein product. The stabilities of these two classes of RNA, non-coding and coding, vary considerably: tRNAs and rRNAs tend to be long lived while mRNAs tend to be more short lived. Even among mRNAs, however, there is a considerable range in stability (ranging from seconds to hours in bacteria and up to days in metazoans), suggesting a significant role for stability in the regulation of gene expression. Here, we review recent experiments from bacteria, yeast and metazoans indicating that the stability of most mRNAs is broadly impacted by the actions of ribosomes that translate them. Ribosomal recognition of defective mRNAs triggers "mRNA surveillance" pathways that target the mRNA for degradation [Shoemaker and Green (2012) ]. More generally, even the stability of perfectly functional mRNAs appears to be dictated by overall rates of translation by the ribosome [Herrick et al. (1990), Presnyak et al. (2015) ]. Given that mRNAs are synthesized for the purpose of being translated into proteins, it is reassuring that such intimate connections between mRNA and the ribosome can drive biological regulation. In closing, we consider the likelihood that these connections between protein synthesis and mRNA stability are widespread or whether other modes of regulation dominate the mRNA stability landscape in higher organisms. Copyright © 2016. Published by Elsevier Ltd.

  3. The origin and evolution of tRNA inferred from phylogenetic analysis of structure.

    PubMed

    Sun, Feng-Jie; Caetano-Anollés, Gustavo

    2008-01-01

    The evolutionary history of the two structural and functional domains of tRNA is controversial but harbors the secrets of early translation and the genetic code. To explore the origin and evolution of tRNA, we reconstructed phylogenetic trees directly from molecular structure. Forty-two structural characters describing the geometry of 571 tRNAs and three statistical parameters describing thermodynamic and mechanical features of molecules quantitatively were used to derive phylogenetic trees of molecules and molecular substructures. Trees of molecules failed to group tRNA according to amino acid specificity and did not reveal the tripartite nature of life, probably due to loss of phylogenetic signal or because tRNA diversification predated organismal diversification. Trees of substructures derived from both structural and statistical characters support the origin of tRNA in the acceptor arm and the hypothesis that the top half domain composed of acceptor and pseudouridine (TPsiC) arms is more ancient than the bottom half domain composed of dihydrouridine (DHU) and anticodon arms. This constitutes the cornerstone of the genomic tag hypothesis that postulates tRNAs were ancient telomeres in the RNA world. The trees of substructures suggest a model for the evolution of the major functional and structural components of tRNA. In this model, short RNA hairpins with stems homologous to the acceptor arm of present day tRNAs were extended with regions homologous to TPsiC and anticodon arms. The DHU arm was then incorporated into the resulting three-stemmed structure to form a proto-cloverleaf structure. The variable region was the last structural addition to the molecular repertoire of evolving tRNA substructures.

  4. A small and efficient dimerization/packaging signal of rat VL30 RNA and its use in murine leukemia virus-VL30-derived vectors for gene transfer.

    PubMed Central

    Torrent, C; Gabus, C; Darlix, J L

    1994-01-01

    Retroviral genomes consist of two identical RNA molecules associated at their 5' ends by the dimer linkage structure located in the packaging element (Psi or E) necessary for RNA dimerization in vitro and packaging in vivo. In murine leukemia virus (MLV)-derived vectors designed for gene transfer, the Psi + sequence of 600 nucleotides directs the packaging of recombinant RNAs into MLV virions produced by helper cells. By using in vitro RNA dimerization as a screening system, a sequence of rat VL30 RNA located next to the 5' end of the Harvey mouse sarcoma virus genome and as small as 67 nucleotides was found to form stable dimeric RNA. In addition, a purine-rich sequence located at the 5' end of this VL30 RNA seems to be critical for RNA dimerization. When this VL30 element was extended by 107 nucleotides at its 3' end and inserted into an MLV-derived vector lacking MLV Psi +, it directed the efficient encapsidation of recombinant RNAs into MLV virions. Because this VL30 packaging signal is smaller and more efficient in packaging recombinant RNAs than the MLV Psi + and does not contain gag or glyco-gag coding sequences, its use in MLV-derived vectors should render even more unlikely recombinations which could generate replication-competent viruses. Therefore, utilization of the rat VL30 packaging sequence should improve the biological safety of MLV vectors for human gene transfer. Images PMID:8289369

  5. Post-transcriptional control by bacteriophage T4: mRNA decay and inhibition of translation initiation

    PubMed Central

    2010-01-01

    Over 50 years of biological research with bacteriophage T4 includes notable discoveries in post-transcriptional control, including the genetic code, mRNA, and tRNA; the very foundations of molecular biology. In this review we compile the past 10 - 15 year literature on RNA-protein interactions with T4 and some of its related phages, with particular focus on advances in mRNA decay and processing, and on translational repression. Binding of T4 proteins RegB, RegA, gp32 and gp43 to their cognate target RNAs has been characterized. For several of these, further study is needed for an atomic-level perspective, where resolved structures of RNA-protein complexes are awaiting investigation. Other features of post-transcriptional control are also summarized. These include: RNA structure at translation initiation regions that either inhibit or promote translation initiation; programmed translational bypassing, where T4 orchestrates ribosome bypass of a 50 nucleotide mRNA sequence; phage exclusion systems that involve T4-mediated activation of a latent endoribonuclease (PrrC) and cofactor-assisted activation of EF-Tu proteolysis (Gol-Lit); and potentially important findings on ADP-ribosylation (by Alt and Mod enzymes) of ribosome-associated proteins that might broadly impact protein synthesis in the infected cell. Many of these problems can continue to be addressed with T4, whereas the growing database of T4-related phage genome sequences provides new resources and potentially new phage-host systems to extend the work into a broader biological, evolutionary context. PMID:21129205

  6. The Rodin-Ohno hypothesis that two enzyme superfamilies descended from one ancestral gene: an unlikely scenario for the origins of translation that will not be dismissed

    PubMed Central

    2014-01-01

    Background Because amino acid activation is rate-limiting for uncatalyzed protein synthesis, it is a key puzzle in understanding the origin of the genetic code. Two unrelated classes (I and II) of contemporary aminoacyl-tRNA synthetases (aaRS) now translate the code. Observing that codons for the most highly conserved, Class I catalytic peptides, when read in the reverse direction, are very nearly anticodons for Class II defining catalytic peptides, Rodin and Ohno proposed that the two superfamilies descended from opposite strands of the same ancestral gene. This unusual hypothesis languished for a decade, perhaps because it appeared to be unfalsifiable. Results The proposed sense/antisense alignment makes important predictions. Fragments that align in antiparallel orientations, and contain the respective active sites, should catalyze the same two reactions catalyzed by contemporary synthetases. Recent experiments confirmed that prediction. Invariant cores from both classes, called Urzymes after Ur = primitive, authentic, plus enzyme and representing ~20% of the contemporary structures, can be expressed and exhibit high, proportionate rate accelerations for both amino-acid activation and tRNA acylation. A major fraction (60%) of the catalytic rate acceleration by contemporary synthetases resides in segments that align sense/antisense. Bioinformatic evidence for sense/antisense ancestry extends to codons specifying the invariant secondary and tertiary structures outside the active sites of the two synthetase classes. Peptides from a designed, 46-residue gene constrained by Rosetta to encode Class I and II ATP binding sites with fully complementary sequences both accelerate amino acid activation by ATP ~400 fold. Conclusions Biochemical and bioinformatic results substantially enhance the posterior probability that ancestors of the two synthetase classes arose from opposite strands of the same ancestral gene. The remarkable acceleration by short peptides of the rate-limiting step in uncatalyzed protein synthesis, together with the synergy of synthetase Urzymes and their cognate tRNAs, introduce a new paradigm for the origin of protein catalysts, emphasize the potential relevance of an operational RNA code embedded in the tRNA acceptor stems, and challenge the RNA-World hypothesis. Reviewers This article was reviewed by Dr. Paul Schimmel (nominated by Laura Landweber), Dr. Eugene Koonin and Professor David Ardell. PMID:24927791

  7. Diversity of Antisense and Other Non-Coding RNAs in Archaea Revealed by Comparative Small RNA Sequencing in Four Pyrobaculum Species

    PubMed Central

    Bernick, David L.; Dennis, Patrick P.; Lui, Lauren M.; Lowe, Todd M.

    2012-01-01

    A great diversity of small, non-coding RNA (ncRNA) molecules with roles in gene regulation and RNA processing have been intensely studied in eukaryotic and bacterial model organisms, yet our knowledge of possible parallel roles for small RNAs (sRNA) in archaea is limited. We employed RNA-seq to identify novel sRNA across multiple species of the hyperthermophilic genus Pyrobaculum, known for unusual RNA gene characteristics. By comparing transcriptional data collected in parallel among four species, we were able to identify conserved RNA genes fitting into known and novel families. Among our findings, we highlight three novel cis-antisense sRNAs encoded opposite to key regulatory (ferric uptake regulator), metabolic (triose-phosphate isomerase), and core transcriptional apparatus genes (transcription factor B). We also found a large increase in the number of conserved C/D box sRNA genes over what had been previously recognized; many of these genes are encoded antisense to protein coding genes. The conserved opposition to orthologous genes across the Pyrobaculum genus suggests similarities to other cis-antisense regulatory systems. Furthermore, the genus-specific nature of these sRNAs indicates they are relatively recent, stable adaptations. PMID:22783241

  8. Overview of long non-coding RNA and mRNA expression in response to methamphetamine treatment in vitro.

    PubMed

    Xiong, Kun; Long, Lingling; Zhang, Xudong; Qu, Hongke; Deng, Haixiao; Ding, Yanjun; Cai, Jifeng; Wang, Shuchao; Wang, Mi; Liao, Lvshuang; Huang, Jufang; Yi, Chun-Xia; Yan, Jie

    2017-10-01

    Long non-coding RNAs (lncRNAs) display multiple functions including regulation of neuronal injury. However, their impact in methamphetamine (METH)-induced neurotoxicity has rarely been reported. Here, using microarray analysis, we investigated the expression profiling of lncRNAs and mRNAs in primary cultured prefrontal cortical neurons after METH treatment. We observed a difference in lncRNA and mRNA expression between the experimental and sham control groups. Using bioinformatics, we analyzed the highest enriched gene ontology (GO) terms of biological process, cellular component, and molecular function, and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway and pathway network analysis. Furthermore, an lncRNA-mRNA co-expression sub-network for aberrantly expressed terms revealed possible interactions of lncRNA NR_110713 and NR_027943 with their related genes. Afterwards, three lncRNAs (NR_110713, NR_027943, GAS5) and two mRNAs (Ddit3, Casp12) were targeted to validate the microarray data by qRT-PCR. This presented an overview of lncRNA and mRNA expression profiling and indicated that lncRNA might participate in METH-induced neuronal apoptosis by regulating the coding genes of neurons. Copyright © 2017 Elsevier Ltd. All rights reserved.

  9. Non-coding functions of alternative pre-mRNA splicing in development

    PubMed Central

    Mockenhaupt, Stefan; Makeyev, Eugene V.

    2015-01-01

    A majority of messenger RNA precursors (pre-mRNAs) in the higher eukaryotes undergo alternative splicing to generate more than one mature product. By targeting the open reading frame region this process increases diversity of protein isoforms beyond the nominal coding capacity of the genome. However, alternative splicing also frequently controls output levels and spatiotemporal features of cellular and organismal gene expression programs. Here we discuss how these non-coding functions of alternative splicing contribute to development through regulation of mRNA stability, translational efficiency and cellular localization. PMID:26493705

  10. A genome-wide survey of maternal and embryonic transcripts during Xenopus tropicalis development.

    PubMed

    Paranjpe, Sarita S; Jacobi, Ulrike G; van Heeringen, Simon J; Veenstra, Gert Jan C

    2013-11-06

    Dynamics of polyadenylation vs. deadenylation determine the fate of several developmentally regulated genes. Decay of a subset of maternal mRNAs and new transcription define the maternal-to-zygotic transition, but the full complement of polyadenylated and deadenylated coding and non-coding transcripts has not yet been assessed in Xenopus embryos. To analyze the dynamics and diversity of coding and non-coding transcripts during development, both polyadenylated mRNA and ribosomal RNA-depleted total RNA were harvested across six developmental stages and subjected to high throughput sequencing. The maternally loaded transcriptome is highly diverse and consists of both polyadenylated and deadenylated transcripts. Many maternal genes show peak expression in the oocyte and include genes which are known to be the key regulators of events like oocyte maturation and fertilization. Of all the transcripts that increase in abundance between early blastula and larval stages, about 30% of the embryonic genes are induced by fourfold or more by the late blastula stage and another 35% by late gastrulation. Using a gene model validation and discovery pipeline, we identified novel transcripts and putative long non-coding RNAs (lncRNA). These lncRNA transcripts were stringently selected as spliced transcripts generated from independent promoters, with limited coding potential and a codon bias characteristic of noncoding sequences. Many lncRNAs are conserved and expressed in a developmental stage-specific fashion. These data reveal dynamics of transcriptome polyadenylation and abundance and provides a high-confidence catalogue of novel and long non-coding RNAs.

  11. tRNA-Derived Small RNA: A Novel Regulatory Small Non-Coding RNA.

    PubMed

    Li, Siqi; Xu, Zhengping; Sheng, Jinghao

    2018-05-10

    Deep analysis of next-generation sequencing data unveils numerous small non-coding RNAs with distinct functions. Recently, fragments derived from tRNA, named as tRNA-derived small RNA (tsRNA), have attracted broad attention. There are mainly two types of tsRNAs, including tRNA-derived stress-induced RNA (tiRNA) and tRNA-derived fragment (tRF), which differ in the cleavage position of the precursor or mature tRNA transcript. Emerging evidence has shown that tsRNAs are not merely tRNA degradation debris but have been recognized to play regulatory roles in many specific physiological and pathological processes. In this review, we summarize the biogeneses of various tsRNAs, present the emerging concepts regarding functions and mechanisms of action of tsRNAs, highlight the potential application of tsRNAs in human diseases, and put forward the current problems and future research directions.

  12. Coding and non-coding gene regulatory networks underlie the immune response in liver cirrhosis.

    PubMed

    Gao, Bo; Zhang, Xueming; Huang, Yongming; Yang, Zhengpeng; Zhang, Yuguo; Zhang, Weihui; Gao, Zu-Hua; Xue, Dongbo

    2017-01-01

    Liver cirrhosis is recognized as being the consequence of immune-mediated hepatocyte damage and repair processes. However, the regulation of these immune responses underlying liver cirrhosis has not been elucidated. In this study, we used GEO datasets and bioinformatics methods to established coding and non-coding gene regulatory networks including transcription factor-/lncRNA-microRNA-mRNA, and competing endogenous RNA interaction networks. Our results identified 2224 mRNAs, 70 lncRNAs and 46 microRNAs were differentially expressed in liver cirrhosis. The transcription factor -/lncRNA- microRNA-mRNA network we uncovered that results in immune-mediated liver cirrhosis is comprised of 5 core microRNAs (e.g., miR-203; miR-219-5p), 3 transcription factors (i.e., FOXP3, ETS1 and FOS) and 7 lncRNAs (e.g., ENTS00000671336, ENST00000575137). The competing endogenous RNA interaction network we identified includes a complex immune response regulatory subnetwork that controls the entire liver cirrhosis network. Additionally, we found 10 overlapping GO terms shared by both liver cirrhosis and hepatocellular carcinoma including "immune response" as well. Interestingly, the overlapping differentially expressed genes in liver cirrhosis and hepatocellular carcinoma were enriched in immune response-related functional terms. In summary, a complex gene regulatory network underlying immune response processes may play an important role in the development and progression of liver cirrhosis, and its development into hepatocellular carcinoma.

  13. Novel coding, translation, and gene expression of a replicating covalently closed circular RNA of 220 nt

    PubMed Central

    AbouHaidar, Mounir Georges; Venkataraman, Srividhya; Golshani, Ashkan; Liu, Bolin; Ahmad, Tauqeer

    2014-01-01

    The highly structured (64% GC) covalently closed circular (CCC) RNA (220 nt) of the virusoid associated with rice yellow mottle virus codes for a 16-kDa highly basic protein using novel modalities for coding, translation, and gene expression. This CCC RNA is the smallest among all known viroids and virusoids and the only one that codes proteins. Its sequence possesses an internal ribosome entry site and is directly translated through two (or three) completely overlapping ORFs (shifting to a new reading frame at the end of each round). The initiation and termination codons overlap UGAUGA (underline highlights the initiation codon AUG within the combined initiation-termination sequence). Termination codons can be ignored to obtain larger read-through proteins. This circular RNA with no noncoding sequences is a unique natural supercompact “nanogenome.” PMID:25253891

  14. The Landscape of long non-coding RNA classification

    PubMed Central

    St Laurent, Georges; Wahlestedt, Claes; Kapranov, Philipp

    2015-01-01

    Advances in the depth and quality of transcriptome sequencing have revealed many new classes of long non-coding RNAs (lncRNAs). lncRNA classification has mushroomed to accommodate these new findings, even though the real dimensions and complexity of the non-coding transcriptome remain unknown. Although evidence of functionality of specific lncRNAs continues to accumulate, conflicting, confusing, and overlapping terminology has fostered ambiguity and lack of clarity in the field in general. The lack of fundamental conceptual un-ambiguous classification framework results in a number of challenges in the annotation and interpretation of non-coding transcriptome data. It also might undermine integration of the new genomic methods and datasets in an effort to unravel function of lncRNA. Here, we review existing lncRNA classifications, nomenclature, and terminology. Then we describe the conceptual guidelines that have emerged for their classification and functional annotation based on expanding and more comprehensive use of large systems biology-based datasets. PMID:25869999

  15. Identification of Novel Long Non-coding and Circular RNAs in Human Papillomavirus-Mediated Cervical Cancer

    PubMed Central

    Wang, Hongbo; Zhao, Yingchao; Chen, Mingyue; Cui, Jie

    2017-01-01

    Cervical cancer is the third most common cancer worldwide and the fourth leading cause of cancer-associated mortality in women. Accumulating evidence indicates that long non-coding RNAs (lncRNAs) and circular RNAs (circRNAs) may play key roles in the carcinogenesis of different cancers; however, little is known about the mechanisms of lncRNAs and circRNAs in the progression and metastasis of cervical cancer. In this study, we explored the expression profiles of lncRNAs, circRNAs, miRNAs, and mRNAs in HPV16 (human papillomavirus genotype 16) mediated cervical squamous cell carcinoma and matched adjacent non-tumor (ATN) tissues from three patients with high-throughput RNA sequencing (RNA-seq). In total, we identified 19 lncRNAs, 99 circRNAs, 28 miRNAs, and 304 mRNAs that were commonly differentially expressed (DE) in different patients. Among the non-coding RNAs, 3 lncRNAs and 44 circRNAs are novel to our knowledge. Functional enrichment analysis showed that DE lncRNAs, miRNAs, and mRNAs were enriched in pathways crucial to cancer as well as other gene ontology (GO) terms. Furthermore, the co-expression network and function prediction suggested that all 19 DE lncRNAs could play different roles in the carcinogenesis and development of cervical cancer. The competing endogenous RNA (ceRNA) network based on DE coding and non-coding RNAs showed that each miRNA targeted a number of lncRNAs and circRNAs. The link between part of the miRNAs in the network and cervical cancer has been validated in previous studies, and these miRNAs targeted the majority of the novel non-coding RNAs, thus suggesting that these novel non-coding RNAs may be involved in cervical cancer. Taken together, our study shows that DE non-coding RNAs could be further developed as diagnostic and therapeutic biomarkers of cervical cancer. The complex ceRNA network also lays the foundation for future research of the roles of coding and non-coding RNAs in cervical cancer. PMID:28970820

  16. Pseudouridine profiling reveals regulated mRNA pseudouridylation in yeast and human cells

    PubMed Central

    Carlile, Thomas M.; Rojas-Duran, Maria F.; Zinshteyn, Boris; Shin, Hakyung; Bartoli, Kristen M.; Gilbert, Wendy V.

    2014-01-01

    Post-transcriptional modification of RNA nucleosides occurs in all living organisms. Pseudouridine, the most abundant modified nucleoside in non-coding RNAs1, enhances the function of transfer RNA and ribosomal RNA by stabilizing RNA structure2–8. mRNAs were not known to contain pseudouridine, but artificial pseudouridylation dramatically affects mRNA function – it changes the genetic code by facilitating non-canonical base pairing in the ribosome decoding center9,10. However, without evidence of naturally occurring mRNA pseudouridylation, its physiological was unclear. Here we present a comprehensive analysis of pseudouridylation in yeast and human RNAs using Pseudo-seq, a genome-wide, single-nucleotide-resolution method for pseudouridine identification. Pseudo-seq accurately identifies known modification sites as well as 100 novel sites in non-coding RNAs, and reveals hundreds of pseudouridylated sites in mRNAs. Genetic analysis allowed us to assign most of the new modification sites to one of seven conserved pseudouridine synthases, Pus1–4, 6, 7 and 9. Notably, the majority of pseudouridines in mRNA are regulated in response to environmental signals, such as nutrient deprivation in yeast and serum starvation in human cells. These results suggest a mechanism for the rapid and regulated rewiring of the genetic code through inducible mRNA modifications. Our findings reveal unanticipated roles for pseudouridylation and provide a resource for identifying the targets of pseudouridine synthases implicated in human disease11–13. PMID:25192136

  17. mRNA changes in nucleus accumbens related to methamphetamine addiction in mice

    NASA Astrophysics Data System (ADS)

    Zhu, Li; Li, Jiaqi; Dong, Nan; Guan, Fanglin; Liu, Yufeng; Ma, Dongliang; Goh, Eyleen L. K.; Chen, Teng

    2016-11-01

    Methamphetamine (METH) is a highly addictive psychostimulant that elicits aberrant changes in the expression of microRNAs (miRNAs) and long non-coding RNAs (lncRNAs) in the nucleus accumbens of mice, indicating a potential role of METH in post-transcriptional regulations. To decipher the potential consequences of these post-transcriptional regulations in response to METH, we performed strand-specific RNA sequencing (ssRNA-Seq) to identify alterations in mRNA expression and their alternative splicing in the nucleus accumbens of mice following exposure to METH. METH-mediated changes in mRNAs were analyzed and correlated with previously reported changes in non-coding RNAs (miRNAs and lncRNAs) to determine the potential functions of these mRNA changes observed here and how non-coding RNAs are involved. A total of 2171 mRNAs were differentially expressed in response to METH with functions involved in synaptic plasticity, mitochondrial energy metabolism and immune response. 309 and 589 of these mRNAs are potential targets of miRNAs and lncRNAs respectively. In addition, METH treatment decreases mRNA alternative splicing, and there are 818 METH-specific events not observed in saline-treated mice. Our results suggest that METH-mediated addiction could be attributed by changes in miRNAs and lncRNAs and consequently, changes in mRNA alternative splicing and expression. In conclusion, our study reported a methamphetamine-modified nucleus accumbens transcriptome and provided non-coding RNA-mRNA interaction networks possibly involved in METH addiction.

  18. RNAi screening of subtracted transcriptomes reveals tumor suppression by taurine-activated GABAA receptors involved in volume regulation

    PubMed Central

    van Nierop, Pim; Vormer, Tinke L.; Foijer, Floris; Verheij, Joanne; Lodder, Johannes C.; Andersen, Jesper B.; Mansvelder, Huibert D.; te Riele, Hein

    2018-01-01

    To identify coding and non-coding suppressor genes of anchorage-independent proliferation by efficient loss-of-function screening, we have developed a method for enzymatic production of low complexity shRNA libraries from subtracted transcriptomes. We produced and screened two LEGO (Low-complexity by Enrichment for Genes shut Off) shRNA libraries that were enriched for shRNA vectors targeting coding and non-coding polyadenylated transcripts that were reduced in transformed Mouse Embryonic Fibroblasts (MEFs). The LEGO shRNA libraries included ~25 shRNA vectors per transcript which limited off-target artifacts. Our method identified 79 coding and non-coding suppressor transcripts. We found that taurine-responsive GABAA receptor subunits, including GABRA5 and GABRB3, were induced during the arrest of non-transformed anchor-deprived MEFs and prevented anchorless proliferation. We show that taurine activates chloride currents through GABAA receptors on MEFs, causing seclusion of cell volume in large membrane protrusions. Volume seclusion from cells by taurine correlated with reduced proliferation and, conversely, suppression of this pathway allowed anchorage-independent proliferation. In human cholangiocarcinomas, we found that several proteins involved in taurine signaling via GABAA receptors were repressed. Low GABRA5 expression typified hyperproliferative tumors, and loss of taurine signaling correlated with reduced patient survival, suggesting this tumor suppressive mechanism operates in vivo. PMID:29787571

  19. Variations in the non-coding transcriptome as a driver of inter-strain divergence and physiological adaptation in bacteria.

    PubMed

    Kopf, Matthias; Klähn, Stephan; Scholz, Ingeborg; Hess, Wolfgang R; Voß, Björn

    2015-04-22

    In all studied organisms, a substantial portion of the transcriptome consists of non-coding RNAs that frequently execute regulatory functions. Here, we have compared the primary transcriptomes of the cyanobacteria Synechocystis sp. PCC 6714 and PCC 6803 under 10 different conditions. These strains share 2854 protein-coding genes and a 16S rRNA identity of 99.4%, indicating their close relatedness. Conserved major transcriptional start sites (TSSs) give rise to non-coding transcripts within the sigB gene, from the 5'UTRs of cmpA and isiA, and 168 loci in antisense orientation. Distinct differences include single nucleotide polymorphisms rendering promoters inactive in one of the strains, e.g., for cmpR and for the asRNA PsbA2R. Based on the genome-wide mapped location, regulation and classification of TSSs, non-coding transcripts were identified as the most dynamic component of the transcriptome. We identified a class of mRNAs that originate by read-through from an sRNA that accumulates as a discrete and abundant transcript while also serving as the 5'UTR. Such an sRNA/mRNA structure, which we name 'actuaton', represents another way for bacteria to remodel their transcriptional network. Our findings support the hypothesis that variations in the non-coding transcriptome constitute a major evolutionary element of inter-strain divergence and capability for physiological adaptation.

  20. RNAcentral: A comprehensive database of non-coding RNA sequences

    DOE PAGES

    Williams, Kelly Porter; Lau, Britney Yan

    2016-10-28

    RNAcentral is a database of non-coding RNA (ncRNA) sequences that aggregates data from specialised ncRNA resources and provides a single entry point for accessing ncRNA sequences of all ncRNA types from all organisms. Since its launch in 2014, RNAcentral has integrated twelve new resources, taking the total number of collaborating database to 22, and began importing new types of data, such as modified nucleotides from MODOMICS and PDB. We created new species-specific identifiers that refer to unique RNA sequences within a context of single species. Furthermore, the website has been subject to continuous improvements focusing on text and sequence similaritymore » searches as well as genome browsing functionality.« less

  1. RNAcentral: A comprehensive database of non-coding RNA sequences

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Williams, Kelly Porter; Lau, Britney Yan

    RNAcentral is a database of non-coding RNA (ncRNA) sequences that aggregates data from specialised ncRNA resources and provides a single entry point for accessing ncRNA sequences of all ncRNA types from all organisms. Since its launch in 2014, RNAcentral has integrated twelve new resources, taking the total number of collaborating database to 22, and began importing new types of data, such as modified nucleotides from MODOMICS and PDB. We created new species-specific identifiers that refer to unique RNA sequences within a context of single species. Furthermore, the website has been subject to continuous improvements focusing on text and sequence similaritymore » searches as well as genome browsing functionality.« less

  2. The crystal structure of the Split End protein SHARP adds a new layer of complexity to proteins containing RNA recognition motifs.

    PubMed

    Arieti, Fabiana; Gabus, Caroline; Tambalo, Margherita; Huet, Tiphaine; Round, Adam; Thore, Stéphane

    2014-06-01

    The Split Ends (SPEN) protein was originally discovered in Drosophila in the late 1990s. Since then, homologous proteins have been identified in eukaryotic species ranging from plants to humans. Every family member contains three predicted RNA recognition motifs (RRMs) in the N-terminal region of the protein. We have determined the crystal structure of the region of the human SPEN homolog that contains these RRMs-the SMRT/HDAC1 Associated Repressor Protein (SHARP), at 2.0 Å resolution. SHARP is a co-regulator of the nuclear receptors. We demonstrate that two of the three RRMs, namely RRM3 and RRM4, interact via a highly conserved interface. Furthermore, we show that the RRM3-RRM4 block is the main platform mediating the stable association with the H12-H13 substructure found in the steroid receptor RNA activator (SRA), a long, non-coding RNA previously shown to play a crucial role in nuclear receptor transcriptional regulation. We determine that SHARP association with SRA relies on both single- and double-stranded RNA sequences. The crystal structure of the SHARP-RRM fragment, together with the associated RNA-binding studies, extend the repertoire of nucleic acid binding properties of RRM domains suggesting a new hypothesis for a better understanding of SPEN protein functions. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  3. Microbial metatranscriptomics in a permanent marine oxygen minimum zone.

    PubMed

    Stewart, Frank J; Ulloa, Osvaldo; DeLong, Edward F

    2012-01-01

    Simultaneous characterization of taxonomic composition, metabolic gene content and gene expression in marine oxygen minimum zones (OMZs) has potential to broaden perspectives on the microbial and biogeochemical dynamics in these environments. Here, we present a metatranscriptomic survey of microbial community metabolism in the Eastern Tropical South Pacific OMZ off northern Chile. Community RNA was sampled in late austral autumn from four depths (50, 85, 110, 200 m) extending across the oxycline and into the upper OMZ. Shotgun pyrosequencing of cDNA yielded 180,000 to 550,000 transcript sequences per depth. Based on functional gene representation, transcriptome samples clustered apart from corresponding metagenome samples from the same depth, highlighting the discrepancies between metabolic potential and actual transcription. BLAST-based characterizations of non-ribosomal RNA sequences revealed a dominance of genes involved with both oxidative (nitrification) and reductive (anammox, denitrification) components of the marine nitrogen cycle. Using annotations of protein-coding genes as proxies for taxonomic affiliation, we observed depth-specific changes in gene expression by key functional taxonomic groups. Notably, transcripts most closely matching the genome of the ammonia-oxidizing archaeon Nitrosopumilus maritimus dominated the transcriptome in the upper three depths, representing one in five protein-coding transcripts at 85 m. In contrast, transcripts matching the anammox bacterium Kuenenia stuttgartiensis dominated at the core of the OMZ (200 m; 1 in 12 protein-coding transcripts). The distribution of N. maritimus-like transcripts paralleled that of transcripts matching ammonia monooxygenase genes, which, despite being represented by both bacterial and archaeal sequences in the community DNA, were dominated (> 99%) by archaeal sequences in the RNA, suggesting a substantial role for archaeal nitrification in the upper OMZ. These data, as well as those describing other key OMZ metabolic processes (e.g. sulfur oxidation), highlight gene-specific expression patterns in the context of the entire community transcriptome, as well as identify key functional groups for taxon-specific genomic profiling. © 2011 Society for Applied Microbiology and Blackwell Publishing Ltd.

  4. Low-dose exposure to bisphenols A, F and S of human primary adipocyte impacts coding and non-coding RNA profiles

    PubMed Central

    Leloire, Audrey; Dhennin, Véronique; Coumoul, Xavier; Yengo, Loïc; Froguel, Philippe

    2017-01-01

    Bisphenol A (BPA) exposure has been suspected to be associated with deleterious effects on health including obesity and metabolically-linked diseases. Although bisphenols F (BPF) and S (BPS) are BPA structural analogs commonly used in many marketed products as a replacement for BPA, only sparse toxicological data are available yet. Our objective was to comprehensively characterize bisphenols gene targets in a human primary adipocyte model, in order to determine whether they may induce cellular dysfunction, using chronic exposure at two concentrations: a “low-dose” similar to the dose usually encountered in human biological fluids and a higher dose. Therefore, BPA, BPF and BPS have been added at 10 nM or 10 μM during the differentiation of human primary adipocytes from subcutaneous fat of three non-diabetic Caucasian female patients. Gene expression (mRNA/lncRNA) arrays and microRNA arrays, have been used to assess coding and non-coding RNA changes. We detected significantly deregulated mRNA/lncRNA and miRNA at low and high doses. Enrichment in “cancer” and “organismal injury and abnormalities” related pathways was found in response to the three products. Some long intergenic non-coding RNAs and small nucleolar RNAs were differentially expressed suggesting that bisphenols may also activate multiple cellular processes and epigenetic modifications. The analysis of upstream regulators of deregulated genes highlighted hormones or hormone-like chemicals suggesting that BPS and BPF can be suspected to interfere, just like BPA, with hormonal regulation and have to be considered as endocrine disruptors. All these results suggest that as BPA, its substitutes BPS and BPF should be used with the same restrictions. PMID:28628672

  5. Ribosome profiling: a Hi-Def monitor for protein synthesis at the genome-wide scale

    PubMed Central

    Michel, Audrey M; Baranov, Pavel V

    2013-01-01

    Ribosome profiling or ribo-seq is a new technique that provides genome-wide information on protein synthesis (GWIPS) in vivo. It is based on the deep sequencing of ribosome protected mRNA fragments allowing the measurement of ribosome density along all RNA molecules present in the cell. At the same time, the high resolution of this technique allows detailed analysis of ribosome density on individual RNAs. Since its invention, the ribosome profiling technique has been utilized in a range of studies in both prokaryotic and eukaryotic organisms. Several studies have adapted and refined the original ribosome profiling protocol for studying specific aspects of translation. Ribosome profiling of initiating ribosomes has been used to map sites of translation initiation. These studies revealed the surprisingly complex organization of translation initiation sites in eukaryotes. Multiple initiation sites are responsible for the generation of N-terminally extended and truncated isoforms of known proteins as well as for the translation of numerous open reading frames (ORFs), upstream of protein coding ORFs. Ribosome profiling of elongating ribosomes has been used for measuring differential gene expression at the level of translation, the identification of novel protein coding genes and ribosome pausing. It has also provided data for developing quantitative models of translation. Although only a dozen or so ribosome profiling datasets have been published so far, they have already dramatically changed our understanding of translational control and have led to new hypotheses regarding the origin of protein coding genes. © 2013 John Wiley & Sons, Ltd. PMID:23696005

  6. ADAR1-mediated 3' UTR editing and expression control of antiapoptosis genes fine-tunes cellular apoptosis response.

    PubMed

    Yang, Chang-Ching; Chen, Yi-Tung; Chang, Yi-Feng; Liu, Hsuan; Kuo, Yu-Ping; Shih, Chieh-Tien; Liao, Wei-Chao; Chen, Hui-Wen; Tsai, Wen-Sy; Tan, Bertrand Chin-Ming

    2017-05-25

    Adenosine-to-inosine RNA editing constitutes a crucial component of the cellular transcriptome and critically underpins organism survival and development. While recent high-throughput approaches have provided comprehensive documentation of the RNA editome, its functional output remains mostly unresolved, particularly for events in the non-coding regions. Gene ontology analysis of the known RNA editing targets unveiled a preponderance of genes related to apoptosis regulation, among which proto-oncogenes XIAP and MDM2 encode two the most abundantly edited transcripts. To further decode this potential functional connection, here we showed that the main RNA editor ADAR1 directly targets this 3' UTR editing of XIAP and MDM2, and further exerts a negative regulation on the expression of their protein products. This post-transcriptional silencing role was mediated via the inverted Alu elements in the 3' UTR but independent of alteration in transcript stability or miRNA targeting. Rather, we discovered that ADAR1 competes transcript occupancy with the RNA shuttling factor STAU1 to facilitate nuclear retention of the XIAP and MDM2 mRNAs. As a consequence, ADAR1 may acquire functionality in part by conferring spatial distribution and translation efficiency of the target transcripts. Finally, abrogation of ADAR1 expression or catalytic activity elicited a XIAP-dependent suppression of apoptotic response, whereas ectopic expression reversed this protective effect on cell death. Together, our results extended the known functions of ADAR1 and RNA editing to the critical fine-tuning of the intracellular apoptotic signaling and also provided mechanistic explanation for ADAR1's roles in development and tumorigenesis.

  7. Nuclear export of RNA: Different sizes, shapes and functions.

    PubMed

    Williams, Tobias; Ngo, Linh H; Wickramasinghe, Vihandha O

    2018-03-01

    Export of protein-coding and non-coding RNA molecules from the nucleus to the cytoplasm is critical for gene expression. This necessitates the continuous transport of RNA species of different size, shape and function through nuclear pore complexes via export receptors and adaptor proteins. Here, we provide an overview of the major RNA export pathways in humans, highlighting the similarities and differences between each. Its importance is underscored by the growing appreciation that deregulation of RNA export pathways is associated with human diseases like cancer. Crown Copyright © 2017. Published by Elsevier Ltd. All rights reserved.

  8. Non-coding functions of alternative pre-mRNA splicing in development.

    PubMed

    Mockenhaupt, Stefan; Makeyev, Eugene V

    2015-12-01

    A majority of messenger RNA precursors (pre-mRNAs) in the higher eukaryotes undergo alternative splicing to generate more than one mature product. By targeting the open reading frame region this process increases diversity of protein isoforms beyond the nominal coding capacity of the genome. However, alternative splicing also frequently controls output levels and spatiotemporal features of cellular and organismal gene expression programs. Here we discuss how these non-coding functions of alternative splicing contribute to development through regulation of mRNA stability, translational efficiency and cellular localization. Copyright © 2015 The Authors. Published by Elsevier Ltd.. All rights reserved.

  9. Genetic coding and gene expression - new Quadruplet genetic coding model

    NASA Astrophysics Data System (ADS)

    Shankar Singh, Rama

    2012-07-01

    Successful demonstration of human genome project has opened the door not only for developing personalized medicine and cure for genetic diseases, but it may also answer the complex and difficult question of the origin of life. It may lead to making 21st century, a century of Biological Sciences as well. Based on the central dogma of Biology, genetic codons in conjunction with tRNA play a key role in translating the RNA bases forming sequence of amino acids leading to a synthesized protein. This is the most critical step in synthesizing the right protein needed for personalized medicine and curing genetic diseases. So far, only triplet codons involving three bases of RNA, transcribed from DNA bases, have been used. Since this approach has several inconsistencies and limitations, even the promise of personalized medicine has not been realized. The new Quadruplet genetic coding model proposed and developed here involves all four RNA bases which in conjunction with tRNA will synthesize the right protein. The transcription and translation process used will be the same, but the Quadruplet codons will help overcome most of the inconsistencies and limitations of the triplet codes. Details of this new Quadruplet genetic coding model and its subsequent potential applications including relevance to the origin of life will be presented.

  10. Transcriptional mapping of the ribosomal RNA region of mouse L-cell mitochondrial DNA.

    PubMed Central

    Nagley, P; Clayton, D A

    1980-01-01

    The map positions in mouse mitochondrial DNA of the two ribosomal RNA genes and adjacent genes coding several small transcripts have been determined precisely by application of a procedure in which DNA-RNA hybrids have been subjected to digestion by S1 nuclease under conditions of varying severity. Digestion of the DNA-RNA hybrids with S1 nuclease yielded a series of species which were shown to contain ribosomal RNA molecules together with adjacent transcripts hybridized conjointly to a continuous segment of mitochondrial DNA. There is one small transcript about 60 bases long whose gene adjoins the sequences coding the 5'-end of the small ribosomal RNA (950 bases) and which lies approximately 200 nucleotides from the D-loop origin of heavy strand mitochondrial DNA synthesis. An 80-base transcript lies between the small and large ribosomal RNA genes, and genes for two further short transcript (each about 80 bases in length) abut the sequences coding the 3'-end of the large ribosomal RNA (approximately 1500 bases). The ability to isolate a discrete DNA-RNA hybrid species approximately 2700 base pairs in length containing all these transcripts suggests that there can be few nucleotides in this region of mouse mitochondrial DNA which are not represented as stable RNA species. Images PMID:6253898

  11. Pseudoscorpion mitochondria show rearranged genes and genome-wide reductions of RNA gene sizes and inferred structures, yet typical nucleotide composition bias

    PubMed Central

    2012-01-01

    Background Pseudoscorpions are chelicerates and have historically been viewed as being most closely related to solifuges, harvestmen, and scorpions. No mitochondrial genomes of pseudoscorpions have been published, but the mitochondrial genomes of some lineages of Chelicerata possess unusual features, including short rRNA genes and tRNA genes that lack sequence to encode arms of the canonical cloverleaf-shaped tRNA. Additionally, some chelicerates possess an atypical guanine-thymine nucleotide bias on the major coding strand of their mitochondrial genomes. Results We sequenced the mitochondrial genomes of two divergent taxa from the chelicerate order Pseudoscorpiones. We find that these genomes possess unusually short tRNA genes that do not encode cloverleaf-shaped tRNA structures. Indeed, in one genome, all 22 tRNA genes lack sequence to encode canonical cloverleaf structures. We also find that the large ribosomal RNA genes are substantially shorter than those of most arthropods. We inferred secondary structures of the LSU rRNAs from both pseudoscorpions, and find that they have lost multiple helices. Based on comparisons with the crystal structure of the bacterial ribosome, two of these helices were likely contact points with tRNA T-arms or D-arms as they pass through the ribosome during protein synthesis. The mitochondrial gene arrangements of both pseudoscorpions differ from the ancestral chelicerate gene arrangement. One genome is rearranged with respect to the location of protein-coding genes, the small rRNA gene, and at least 8 tRNA genes. The other genome contains 6 tRNA genes in novel locations. Most chelicerates with rearranged mitochondrial genes show a genome-wide reversal of the CA nucleotide bias typical for arthropods on their major coding strand, and instead possess a GT bias. Yet despite their extensive rearrangement, these pseudoscorpion mitochondrial genomes possess a CA bias on the major coding strand. Phylogenetic analyses of all 13 mitochondrial protein-coding gene sequences consistently yield trees that place pseudoscorpions as sister to acariform mites. Conclusion The well-supported phylogenetic placement of pseudoscorpions as sister to Acariformes differs from some previous analyses based on morphology. However, these two lineages share multiple molecular evolutionary traits, including substantial mitochondrial genome rearrangements, extensive nucleotide substitution, and loss of helices in their inferred tRNA and rRNA structures. PMID:22409411

  12. Coding and non-coding gene regulatory networks underlie the immune response in liver cirrhosis

    PubMed Central

    Zhang, Xueming; Huang, Yongming; Yang, Zhengpeng; Zhang, Yuguo; Zhang, Weihui; Gao, Zu-hua; Xue, Dongbo

    2017-01-01

    Liver cirrhosis is recognized as being the consequence of immune-mediated hepatocyte damage and repair processes. However, the regulation of these immune responses underlying liver cirrhosis has not been elucidated. In this study, we used GEO datasets and bioinformatics methods to established coding and non-coding gene regulatory networks including transcription factor-/lncRNA-microRNA-mRNA, and competing endogenous RNA interaction networks. Our results identified 2224 mRNAs, 70 lncRNAs and 46 microRNAs were differentially expressed in liver cirrhosis. The transcription factor -/lncRNA- microRNA-mRNA network we uncovered that results in immune-mediated liver cirrhosis is comprised of 5 core microRNAs (e.g., miR-203; miR-219-5p), 3 transcription factors (i.e., FOXP3, ETS1 and FOS) and 7 lncRNAs (e.g., ENTS00000671336, ENST00000575137). The competing endogenous RNA interaction network we identified includes a complex immune response regulatory subnetwork that controls the entire liver cirrhosis network. Additionally, we found 10 overlapping GO terms shared by both liver cirrhosis and hepatocellular carcinoma including “immune response” as well. Interestingly, the overlapping differentially expressed genes in liver cirrhosis and hepatocellular carcinoma were enriched in immune response-related functional terms. In summary, a complex gene regulatory network underlying immune response processes may play an important role in the development and progression of liver cirrhosis, and its development into hepatocellular carcinoma. PMID:28355233

  13. Biosynthesis of reovirus-specified polypeptides: the reovirus s1 mRNA encodes two primary translation products

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jacobs, B.L.; Samuel, C.E.

    1985-05-01

    Reovirus serotypes 1 (Lang strain) and 3 (Dearing strain) code for a hitherto unrecognized low-molecular-weight polypeptide of Mr approximately 12,000. This polypeptide (p12) was synthesized in vitro in L-cell-free protein synthesizing systems programmed with either reovirus serotype 1 mRNA, reovirus serotype 3 mRNA, or with denatured reovirus genome double-stranded RNA, and in vivo in L-cell cultures infected with either reovirus serotype. Pulse-chase experiments in vivo, and the relative kinetics of synthesis of p12 in vitro, indicate that it is a primary translation product. Fractionation of reovirus mRNAs by velocity sedimentation and translation of separated mRNAs in vitro suggests that p12more » is coded for by the s1 mRNA, which also codes for the previously recognized sigma 1 polypeptide. Synthesis of both p12 and sigma 1 in vitro in L-cell-free protein synthesizing systems programmed with denatured reovirus genome double-stranded RNA also suggests that these two polypeptides can be coded by the same mRNA species. It is proposed that the Mr approximately 12,000 polypeptide encoded by the S1 genome segment be designated sigma 1bNS, and that the polypeptide previously designated sigma 1 be renamed sigma 1a.« less

  14. Long Non-Coding RNAs Differentially Expressed between Normal versus Primary Breast Tumor Tissues Disclose Converse Changes to Breast Cancer-Related Protein-Coding Genes

    PubMed Central

    Reiche, Kristin; Kasack, Katharina; Schreiber, Stephan; Lüders, Torben; Due, Eldri U.; Naume, Bjørn; Riis, Margit; Kristensen, Vessela N.; Horn, Friedemann; Børresen-Dale, Anne-Lise; Hackermüller, Jörg; Baumbusch, Lars O.

    2014-01-01

    Breast cancer, the second leading cause of cancer death in women, is a highly heterogeneous disease, characterized by distinct genomic and transcriptomic profiles. Transcriptome analyses prevalently assessed protein-coding genes; however, the majority of the mammalian genome is expressed in numerous non-coding transcripts. Emerging evidence supports that many of these non-coding RNAs are specifically expressed during development, tumorigenesis, and metastasis. The focus of this study was to investigate the expression features and molecular characteristics of long non-coding RNAs (lncRNAs) in breast cancer. We investigated 26 breast tumor and 5 normal tissue samples utilizing a custom expression microarray enclosing probes for mRNAs as well as novel and previously identified lncRNAs. We identified more than 19,000 unique regions significantly differentially expressed between normal versus breast tumor tissue, half of these regions were non-coding without any evidence for functional open reading frames or sequence similarity to known proteins. The identified non-coding regions were primarily located in introns (53%) or in the intergenic space (33%), frequently orientated in antisense-direction of protein-coding genes (14%), and commonly distributed at promoter-, transcription factor binding-, or enhancer-sites. Analyzing the most diverse mRNA breast cancer subtypes Basal-like versus Luminal A and B resulted in 3,025 significantly differentially expressed unique loci, including 682 (23%) for non-coding transcripts. A notable number of differentially expressed protein-coding genes displayed non-synonymous expression changes compared to their nearest differentially expressed lncRNA, including an antisense lncRNA strongly anticorrelated to the mRNA coding for histone deacetylase 3 (HDAC3), which was investigated in more detail. Previously identified chromatin-associated lncRNAs (CARs) were predominantly downregulated in breast tumor samples, including CARs located in the protein-coding genes for CALD1, FTX, and HNRNPH1. In conclusion, a number of differentially expressed lncRNAs have been identified with relation to cancer-related protein-coding genes. PMID:25264628

  15. Long non-coding RNAs differentially expressed between normal versus primary breast tumor tissues disclose converse changes to breast cancer-related protein-coding genes.

    PubMed

    Reiche, Kristin; Kasack, Katharina; Schreiber, Stephan; Lüders, Torben; Due, Eldri U; Naume, Bjørn; Riis, Margit; Kristensen, Vessela N; Horn, Friedemann; Børresen-Dale, Anne-Lise; Hackermüller, Jörg; Baumbusch, Lars O

    2014-01-01

    Breast cancer, the second leading cause of cancer death in women, is a highly heterogeneous disease, characterized by distinct genomic and transcriptomic profiles. Transcriptome analyses prevalently assessed protein-coding genes; however, the majority of the mammalian genome is expressed in numerous non-coding transcripts. Emerging evidence supports that many of these non-coding RNAs are specifically expressed during development, tumorigenesis, and metastasis. The focus of this study was to investigate the expression features and molecular characteristics of long non-coding RNAs (lncRNAs) in breast cancer. We investigated 26 breast tumor and 5 normal tissue samples utilizing a custom expression microarray enclosing probes for mRNAs as well as novel and previously identified lncRNAs. We identified more than 19,000 unique regions significantly differentially expressed between normal versus breast tumor tissue, half of these regions were non-coding without any evidence for functional open reading frames or sequence similarity to known proteins. The identified non-coding regions were primarily located in introns (53%) or in the intergenic space (33%), frequently orientated in antisense-direction of protein-coding genes (14%), and commonly distributed at promoter-, transcription factor binding-, or enhancer-sites. Analyzing the most diverse mRNA breast cancer subtypes Basal-like versus Luminal A and B resulted in 3,025 significantly differentially expressed unique loci, including 682 (23%) for non-coding transcripts. A notable number of differentially expressed protein-coding genes displayed non-synonymous expression changes compared to their nearest differentially expressed lncRNA, including an antisense lncRNA strongly anticorrelated to the mRNA coding for histone deacetylase 3 (HDAC3), which was investigated in more detail. Previously identified chromatin-associated lncRNAs (CARs) were predominantly downregulated in breast tumor samples, including CARs located in the protein-coding genes for CALD1, FTX, and HNRNPH1. In conclusion, a number of differentially expressed lncRNAs have been identified with relation to cancer-related protein-coding genes.

  16. Microprocessor mediates transcriptional termination in long noncoding microRNA genes

    PubMed Central

    Dhir, Ashish; Dhir, Somdutta; Proudfoot, Nick J.; Jopling, Catherine L.

    2015-01-01

    MicroRNA (miRNA) play a major role in the post-transcriptional regulation of gene expression. Mammalian miRNA biogenesis begins with co-transcriptional cleavage of RNA polymerase II (Pol II) transcripts by the Microprocessor complex. While most miRNA are located within introns of protein coding genes, a substantial minority of miRNA originate from long non coding (lnc) RNA where transcript processing is largely uncharacterized. We show, by detailed characterization of liver-specific lnc-pri-miR-122 and genome-wide analysis in human cell lines, that most lnc-pri-miRNA do not use the canonical cleavage and polyadenylation (CPA) pathway, but instead use Microprocessor cleavage to terminate transcription. This Microprocessor inactivation leads to extensive transcriptional readthrough of lnc-pri-miRNA and transcriptional interference with downstream genes. Consequently we define a novel RNase III-mediated, polyadenylation-independent mechanism of Pol II transcription termination in mammalian cells. PMID:25730776

  17. Comprehensive analysis of coding-lncRNA gene co-expression network uncovers conserved functional lncRNAs in zebrafish.

    PubMed

    Chen, Wen; Zhang, Xuan; Li, Jing; Huang, Shulan; Xiang, Shuanglin; Hu, Xiang; Liu, Changning

    2018-05-09

    Zebrafish is a full-developed model system for studying development processes and human disease. Recent studies of deep sequencing had discovered a large number of long non-coding RNAs (lncRNAs) in zebrafish. However, only few of them had been functionally characterized. Therefore, how to take advantage of the mature zebrafish system to deeply investigate the lncRNAs' function and conservation is really intriguing. We systematically collected and analyzed a series of zebrafish RNA-seq data, then combined them with resources from known database and literatures. As a result, we obtained by far the most complete dataset of zebrafish lncRNAs, containing 13,604 lncRNA genes (21,128 transcripts) in total. Based on that, a co-expression network upon zebrafish coding and lncRNA genes was constructed and analyzed, and used to predict the Gene Ontology (GO) and the KEGG annotation of lncRNA. Meanwhile, we made a conservation analysis on zebrafish lncRNA, identifying 1828 conserved zebrafish lncRNA genes (1890 transcripts) that have their putative mammalian orthologs. We also found that zebrafish lncRNAs play important roles in regulation of the development and function of nervous system; these conserved lncRNAs present a significant sequential and functional conservation, with their mammalian counterparts. By integrative data analysis and construction of coding-lncRNA gene co-expression network, we gained the most comprehensive dataset of zebrafish lncRNAs up to present, as well as their systematic annotations and comprehensive analyses on function and conservation. Our study provides a reliable zebrafish-based platform to deeply explore lncRNA function and mechanism, as well as the lncRNA commonality between zebrafish and human.

  18. Variations in the non-coding transcriptome as a driver of inter-strain divergence and physiological adaptation in bacteria

    PubMed Central

    Kopf, Matthias; Klähn, Stephan; Scholz, Ingeborg; Hess, Wolfgang R.; Voß, Björn

    2015-01-01

    In all studied organisms, a substantial portion of the transcriptome consists of non-coding RNAs that frequently execute regulatory functions. Here, we have compared the primary transcriptomes of the cyanobacteria Synechocystis sp. PCC 6714 and PCC 6803 under 10 different conditions. These strains share 2854 protein-coding genes and a 16S rRNA identity of 99.4%, indicating their close relatedness. Conserved major transcriptional start sites (TSSs) give rise to non-coding transcripts within the sigB gene, from the 5′UTRs of cmpA and isiA, and 168 loci in antisense orientation. Distinct differences include single nucleotide polymorphisms rendering promoters inactive in one of the strains, e.g., for cmpR and for the asRNA PsbA2R. Based on the genome-wide mapped location, regulation and classification of TSSs, non-coding transcripts were identified as the most dynamic component of the transcriptome. We identified a class of mRNAs that originate by read-through from an sRNA that accumulates as a discrete and abundant transcript while also serving as the 5′UTR. Such an sRNA/mRNA structure, which we name ‘actuaton’, represents another way for bacteria to remodel their transcriptional network. Our findings support the hypothesis that variations in the non-coding transcriptome constitute a major evolutionary element of inter-strain divergence and capability for physiological adaptation. PMID:25902393

  19. Advanced Design of Dumbbell-shaped Genetic Minimal Vectors Improves Non-coding and Coding RNA Expression.

    PubMed

    Jiang, Xiaoou; Yu, Han; Teo, Cui Rong; Tan, Genim Siu Xian; Goh, Sok Chin; Patel, Parasvi; Chua, Yiqiang Kevin; Hameed, Nasirah Banu Sahul; Bertoletti, Antonio; Patzel, Volker

    2016-09-01

    Dumbbell-shaped DNA minimal vectors lacking nontherapeutic genes and bacterial sequences are considered a stable, safe alternative to viral, nonviral, and naked plasmid-based gene-transfer systems. We investigated novel molecular features of dumbbell vectors aiming to reduce vector size and to improve the expression of noncoding or coding RNA. We minimized small hairpin RNA (shRNA) or microRNA (miRNA) expressing dumbbell vectors in size down to 130 bp generating the smallest genetic expression vectors reported. This was achieved by using a minimal H1 promoter with integrated transcriptional terminator transcribing the RNA hairpin structure around the dumbbell loop. Such vectors were generated with high conversion yields using a novel protocol. Minimized shRNA-expressing dumbbells showed accelerated kinetics of delivery and transcription leading to enhanced gene silencing in human tissue culture cells. In primary human T cells, minimized miRNA-expressing dumbbells revealed higher stability and triggered stronger target gene suppression as compared with plasmids and miRNA mimics. Dumbbell-driven gene expression was enhanced up to 56- or 160-fold by implementation of an intron and the SV40 enhancer compared with control dumbbells or plasmids. Advanced dumbbell vectors may represent one option to close the gap between durable expression that is achievable with integrating viral vectors and short-term effects triggered by naked RNA.

  20. Novel variants of the 5S rRNA genes in Eruca sativa.

    PubMed

    Singh, K; Bhatia, S; Lakshmikumaran, M

    1994-02-01

    The 5S ribosomal RNA (rRNA) genes of Eruca sativa were cloned and characterized. They are organized into clusters of tandemly repeated units. Each repeat unit consists of a 119-bp coding region followed by a noncoding spacer region that separates it from the coding region of the next repeat unit. Our study reports novel gene variants of the 5S rRNA genes in plants. Two families of the 5S rDNA, the 0.5-kb size family and the 1-kb size family, coexist in the E. sativa genome. The 0.5-kb size family consists of the 5S rRNA genes (S4) that have coding regions similar to those of other reported plant 5S rDNA sequences, whereas the 1-kb size family consists of the 5S rRNA gene variants (S1) that exist as 1-kb BamHI tandem repeats. S1 is made up of two variant units (V1 and V2) of 5S rDNA where the BamHI site between the two units is mutated. Sequence heterogeneity among S4, V1, and V2 units exists throughout the sequence and is not limited to the noncoding spacer region only. The coding regions of V1 and V2 show approximately 20% dissimilarity to the coding regions of S4 and other reported plant 5S rDNA sequences. Such a large variation in the coding regions of the 5S rDNA units within the same plant species has been observed for the first time. Restriction site variation is observed between the two size classes of 5S rDNA in E. sativa.(ABSTRACT TRUNCATED AT 250 WORDS)

  1. Saturation of recognition elements blocks evolution of new tRNA identities

    PubMed Central

    Saint-Léger, Adélaïde; Bello, Carla; Dans, Pablo D.; Torres, Adrian Gabriel; Novoa, Eva Maria; Camacho, Noelia; Orozco, Modesto; Kondrashov, Fyodor A.; Ribas de Pouplana, Lluís

    2016-01-01

    Understanding the principles that led to the current complexity of the genetic code is a central question in evolution. Expansion of the genetic code required the selection of new transfer RNAs (tRNAs) with specific recognition signals that allowed them to be matured, modified, aminoacylated, and processed by the ribosome without compromising the fidelity or efficiency of protein synthesis. We show that saturation of recognition signals blocks the emergence of new tRNA identities and that the rate of nucleotide substitutions in tRNAs is higher in species with fewer tRNA genes. We propose that the growth of the genetic code stalled because a limit was reached in the number of identity elements that can be effectively used in the tRNA structure. PMID:27386510

  2. Detection of non-coding RNA in bacteria and archaea using the DETR'PROK Galaxy pipeline.

    PubMed

    Toffano-Nioche, Claire; Luo, Yufei; Kuchly, Claire; Wallon, Claire; Steinbach, Delphine; Zytnicki, Matthias; Jacq, Annick; Gautheret, Daniel

    2013-09-01

    RNA-seq experiments are now routinely used for the large scale sequencing of transcripts. In bacteria or archaea, such deep sequencing experiments typically produce 10-50 million fragments that cover most of the genome, including intergenic regions. In this context, the precise delineation of the non-coding elements is challenging. Non-coding elements include untranslated regions (UTRs) of mRNAs, independent small RNA genes (sRNAs) and transcripts produced from the antisense strand of genes (asRNA). Here we present a computational pipeline (DETR'PROK: detection of ncRNAs in prokaryotes) based on the Galaxy framework that takes as input a mapping of deep sequencing reads and performs successive steps of clustering, comparison with existing annotation and identification of transcribed non-coding fragments classified into putative 5' UTRs, sRNAs and asRNAs. We provide a step-by-step description of the protocol using real-life example data sets from Vibrio splendidus and Escherichia coli. Copyright © 2013 The Authors. Published by Elsevier Inc. All rights reserved.

  3. The Complete Mitochondrial DNA Sequence of Scenedesmus obliquus Reflects an Intermediate Stage in the Evolution of the Green Algal Mitochondrial Genome

    PubMed Central

    Nedelcu, Aurora M.; Lee, Robert W.; Lemieux, Claude; Gray, Michael W.; Burger, Gertraud

    2000-01-01

    Two distinct mitochondrial genome types have been described among the green algal lineages investigated to date: a reduced–derived, Chlamydomonas-like type and an ancestral, Prototheca-like type. To determine if this unexpected dichotomy is real or is due to insufficient or biased sampling and to define trends in the evolution of the green algal mitochondrial genome, we sequenced and analyzed the mitochondrial DNA (mtDNA) of Scenedesmus obliquus. This genome is 42,919 bp in size and encodes 42 conserved genes (i.e., large and small subunit rRNA genes, 27 tRNA and 13 respiratory protein-coding genes), four additional free-standing open reading frames with no known homologs, and an intronic reading frame with endonuclease/maturase similarity. No 5S rRNA or ribosomal protein-coding genes have been identified in Scenedesmus mtDNA. The standard protein-coding genes feature a deviant genetic code characterized by the use of UAG (normally a stop codon) to specify leucine, and the unprecedented use of UCA (normally a serine codon) as a signal for termination of translation. The mitochondrial genome of Scenedesmus combines features of both green algal mitochondrial genome types: the presence of a more complex set of protein-coding and tRNA genes is shared with the ancestral type, whereas the lack of 5S rRNA and ribosomal protein-coding genes as well as the presence of fragmented and scrambled rRNA genes are shared with the reduced–derived type of mitochondrial genome organization. Furthermore, the gene content and the fragmentation pattern of the rRNA genes suggest that this genome represents an intermediate stage in the evolutionary process of mitochondrial genome streamlining in green algae. [The sequence data described in this paper have been submitted to the GenBank data library under accession no. AF204057.] PMID:10854413

  4. RNAcentral: an international database of ncRNA sequences

    DOE PAGES

    Williams, Kelly Porter

    2014-10-28

    The field of non-coding RNA biology has been hampered by the lack of availability of a comprehensive, up-to-date collection of accessioned RNA sequences. Here we present the first release of RNAcentral, a database that collates and integrates information from an international consortium of established RNA sequence databases. The initial release contains over 8.1 million sequences, including representatives of all major functional classes. A web portal (http://rnacentral.org) provides free access to data, search functionality, cross-references, source code and an integrated genome browser for selected species.

  5. Chromosome preference of disease genes and vectorization for the prediction of non-coding disease genes.

    PubMed

    Peng, Hui; Lan, Chaowang; Liu, Yuansheng; Liu, Tao; Blumenstein, Michael; Li, Jinyan

    2017-10-03

    Disease-related protein-coding genes have been widely studied, but disease-related non-coding genes remain largely unknown. This work introduces a new vector to represent diseases, and applies the newly vectorized data for a positive-unlabeled learning algorithm to predict and rank disease-related long non-coding RNA (lncRNA) genes. This novel vector representation for diseases consists of two sub-vectors, one is composed of 45 elements, characterizing the information entropies of the disease genes distribution over 45 chromosome substructures. This idea is supported by our observation that some substructures (e.g., the chromosome 6 p-arm) are highly preferred by disease-related protein coding genes, while some (e.g., the 21 p-arm) are not favored at all. The second sub-vector is 30-dimensional, characterizing the distribution of disease gene enriched KEGG pathways in comparison with our manually created pathway groups. The second sub-vector complements with the first one to differentiate between various diseases. Our prediction method outperforms the state-of-the-art methods on benchmark datasets for prioritizing disease related lncRNA genes. The method also works well when only the sequence information of an lncRNA gene is known, or even when a given disease has no currently recognized long non-coding genes.

  6. Chromosome preference of disease genes and vectorization for the prediction of non-coding disease genes

    PubMed Central

    Peng, Hui; Lan, Chaowang; Liu, Yuansheng; Liu, Tao; Blumenstein, Michael; Li, Jinyan

    2017-01-01

    Disease-related protein-coding genes have been widely studied, but disease-related non-coding genes remain largely unknown. This work introduces a new vector to represent diseases, and applies the newly vectorized data for a positive-unlabeled learning algorithm to predict and rank disease-related long non-coding RNA (lncRNA) genes. This novel vector representation for diseases consists of two sub-vectors, one is composed of 45 elements, characterizing the information entropies of the disease genes distribution over 45 chromosome substructures. This idea is supported by our observation that some substructures (e.g., the chromosome 6 p-arm) are highly preferred by disease-related protein coding genes, while some (e.g., the 21 p-arm) are not favored at all. The second sub-vector is 30-dimensional, characterizing the distribution of disease gene enriched KEGG pathways in comparison with our manually created pathway groups. The second sub-vector complements with the first one to differentiate between various diseases. Our prediction method outperforms the state-of-the-art methods on benchmark datasets for prioritizing disease related lncRNA genes. The method also works well when only the sequence information of an lncRNA gene is known, or even when a given disease has no currently recognized long non-coding genes. PMID:29108274

  7. PAR-CLIP data indicate that Nrd1-Nab3-dependent transcription termination regulates expression of hundreds of protein coding genes in yeast

    PubMed Central

    2014-01-01

    Background Nrd1 and Nab3 are essential sequence-specific yeast RNA binding proteins that function as a heterodimer in the processing and degradation of diverse classes of RNAs. These proteins also regulate several mRNA coding genes; however, it remains unclear exactly what percentage of the mRNA component of the transcriptome these proteins control. To address this question, we used the pyCRAC software package developed in our laboratory to analyze CRAC and PAR-CLIP data for Nrd1-Nab3-RNA interactions. Results We generated high-resolution maps of Nrd1-Nab3-RNA interactions, from which we have uncovered hundreds of new Nrd1-Nab3 mRNA targets, representing between 20 and 30% of protein-coding transcripts. Although Nrd1 and Nab3 showed a preference for binding near 5′ ends of relatively short transcripts, they bound transcripts throughout coding sequences and 3′ UTRs. Moreover, our data for Nrd1-Nab3 binding to 3′ UTRs was consistent with a role for these proteins in the termination of transcription. Our data also support a tight integration of Nrd1-Nab3 with the nutrient response pathway. Finally, we provide experimental evidence for some of our predictions, using northern blot and RT-PCR assays. Conclusions Collectively, our data support the notion that Nrd1 and Nab3 function is tightly integrated with the nutrient response and indicate a role for these proteins in the regulation of many mRNA coding genes. Further, we provide evidence to support the hypothesis that Nrd1-Nab3 represents a failsafe termination mechanism in instances of readthrough transcription. PMID:24393166

  8. Interdependence, Reflexivity, Fidelity, Impedance Matching, and the Evolution of Genetic Coding

    PubMed Central

    Carter, Charles W; Wills, Peter R

    2018-01-01

    Abstract Genetic coding is generally thought to have required ribozymes whose functions were taken over by polypeptide aminoacyl-tRNA synthetases (aaRS). Two discoveries about aaRS and their interactions with tRNA substrates now furnish a unifying rationale for the opposite conclusion: that the key processes of the Central Dogma of molecular biology emerged simultaneously and naturally from simple origins in a peptide•RNA partnership, eliminating the epistemological utility of a prior RNA world. First, the two aaRS classes likely arose from opposite strands of the same ancestral gene, implying a simple genetic alphabet. The resulting inversion symmetries in aaRS structural biology would have stabilized the initial and subsequent differentiation of coding specificities, rapidly promoting diversity in the proteome. Second, amino acid physical chemistry maps onto tRNA identity elements, establishing reflexive, nanoenvironmental sensing in protein aaRS. Bootstrapping of increasingly detailed coding is thus intrinsic to polypeptide aaRS, but impossible in an RNA world. These notions underline the following concepts that contradict gradual replacement of ribozymal aaRS by polypeptide aaRS: 1) aaRS enzymes must be interdependent; 2) reflexivity intrinsic to polypeptide aaRS production dynamics promotes bootstrapping; 3) takeover of RNA-catalyzed aminoacylation by enzymes will necessarily degrade specificity; and 4) the Central Dogma’s emergence is most probable when replication and translation error rates remain comparable. These characteristics are necessary and sufficient for the essentially de novo emergence of a coupled gene–replicase–translatase system of genetic coding that would have continuously preserved the functional meaning of genetically encoded protein genes whose phylogenetic relationships match those observed today. PMID:29077934

  9. Basic biology and therapeutic implications of lncRNA.

    PubMed

    Khorkova, O; Hsiao, J; Wahlestedt, C

    2015-06-29

    Long non-coding RNAs (lncRNA), a class of non-coding RNA molecules recently identified largely due to the efforts of FANTOM, and later GENCODE and ENCODE consortia, have been a subject of intense investigation in the past decade. Extensive efforts to get deeper understanding of lncRNA biology have yielded evidence of their diverse structural and regulatory roles in protecting chromosome integrity, maintaining genomic architecture, X chromosome inactivation, imprinting, transcription, translation and epigenetic regulation. Here we will briefly review the recent studies in the field of lncRNA biology focusing mostly on mammalian species and discuss their therapeutic implications. Copyright © 2015 Elsevier B.V. All rights reserved.

  10. RNA therapeutics: RNAi and antisense mechanisms and clinical applications.

    PubMed

    Chery, Jessica

    2016-07-01

    RNA therapeutics refers to the use of oligonucleotides to target primarily ribonucleic acids (RNA) for therapeutic efforts or in research studies to elucidate functions of genes. Oligonucleotides are distinct from other pharmacological modalities, such as small molecules and antibodies that target mainly proteins, due to their mechanisms of action and chemical properties. Nucleic acids come in two forms: deoxyribonucleic acids (DNA) and ribonucleic acids (RNA). Although DNA is more stable, RNA offers more structural variety ranging from messenger RNA (mRNA) that codes for protein to non-coding RNAs, microRNA (miRNA), transfer RNA (tRNA), short interfering RNAs (siRNAs), ribosomal RNA (rRNA), and long-noncoding RNAs (lncRNAs). As our understanding of the wide variety of RNAs deepens, researchers have sought to target RNA since >80% of the genome is estimated to be transcribed. These transcripts include non-coding RNAs such as miRNAs and siRNAs that function in gene regulation by playing key roles in the transfer of genetic information from DNA to protein, the final product of the central dogma in biology 1 . Currently there are two main approaches used to target RNA: double stranded RNA-mediated interference (RNAi) and antisense oligonucleotides (ASO). Both approaches are currently in clinical trials for targeting of RNAs involved in various diseases, such as cancer and neurodegeneration. In fact, ASOs targeting spinal muscular atrophy and amyotrophic lateral sclerosis have shown positive results in clinical trials 2 . Advantages of ASOs include higher affinity due to the development of chemical modifications that increase affinity, selectivity while decreasing toxicity due to off-target effects. This review will highlight the major therapeutic approaches of RNA medicine currently being applied with a focus on RNAi and ASOs.

  11. Xenopus microRNA genes are predominantly located within introns and are differentially expressed in adult frog tissues via post-transcriptional regulation

    PubMed Central

    Tang, Guo-Qing; Maxwell, E. Stuart

    2008-01-01

    The amphibian Xenopus provides a model organism for investigating microRNA expression during vertebrate embryogenesis and development. Searching available Xenopus genome databases using known human pre-miRNAs as query sequences, more than 300 genes encoding 142 Xenopus tropicalis miRNAs were identified. Analysis of Xenopus tropicalis miRNA genes revealed a predominate positioning within introns of protein-coding and nonprotein-coding RNA Pol II-transcribed genes. MiRNA genes were also located in pre-mRNA exons and positioned intergenically between known protein-coding genes. Many miRNA species were found in multiple locations and in more than one genomic context. MiRNA genes were also clustered throughout the genome, indicating the potential for the cotranscription and coordinate expression of miRNAs located in a given cluster. Northern blot analysis confirmed the expression of many identified miRNAs in both X. tropicalis and X. laevis. Comparison of X. tropicalis and X. laevis blots revealed comparable expression profiles, although several miRNAs exhibited species-specific expression in different tissues. More detailed analysis revealed that for some miRNAs, the tissue-specific expression profile of the pri-miRNA precursor was distinctly different from that of the mature miRNA profile. Differential miRNA precursor processing in both the nucleus and cytoplasm was implicated in the observed tissue-specific differences. These observations indicated that post-transcriptional processing plays an important role in regulating miRNA expression in the amphibian Xenopus. PMID:18032731

  12. Expression of short hairpin RNAs using the compact architecture of retroviral microRNA genes.

    PubMed

    Burke, James M; Kincaid, Rodney P; Aloisio, Francesca; Welch, Nicole; Sullivan, Christopher S

    2017-09-29

    Short hairpin RNAs (shRNAs) are effective in generating stable repression of gene expression. RNA polymerase III (RNAP III) type III promoters (U6 or H1) are typically used to drive shRNA expression. While useful for some knockdown applications, the robust expression of U6/H1-driven shRNAs can induce toxicity and generate heterogeneous small RNAs with undesirable off-target effects. Additionally, typical U6/H1 promoters encompass the majority of the ∼270 base pairs (bp) of vector space required for shRNA expression. This can limit the efficacy and/or number of delivery vector options, particularly when delivery of multiple gene/shRNA combinations is required. Here, we develop a compact shRNA (cshRNA) expression system based on retroviral microRNA (miRNA) gene architecture that uses RNAP III type II promoters. We demonstrate that cshRNAs coded from as little as 100 bps of total coding space can precisely generate small interfering RNAs (siRNAs) that are active in the RNA-induced silencing complex (RISC). We provide an algorithm with a user-friendly interface to design cshRNAs for desired target genes. This cshRNA expression system reduces the coding space required for shRNA expression by >2-fold as compared to the typical U6/H1 promoters, which may facilitate therapeutic RNAi applications where delivery vector space is limiting. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  13. Long Non-Coding RNA in Glioma: Target miRNA and Signaling Pathways.

    PubMed

    Dang, Yuan; Wei, Xudong; Xue, Laien; Wen, Fuli; Gu, Jianjun; Zheng, Heping

    2018-06-01

    Glioma is one of the most common and aggressive malignant tumors of the central nervous system. Here, we review and explore the use of long noncoding RNA (lncRNA) as a therapeutic strategy for the targeting of gliomas. LncRNA is a functional RNA molecule with no protein coding function and is involved in the occurrence and progression of glioma. It is reported that the activation of several signaling pathways, including the MAPK, p53, Wnt/β-catenin, PI3K/AKT/mTOR, and epithelial mesenchymal transformation (EMT) pathways, are involved in the regulation of gliomas. In addition, microRNAs in glioma may also interact with lncRNAs and affect tumor growth and progression. Therefore, the exploration of lncRNA participation in signaling pathway regulatory mechanisms and the determination of the interaction between lncRNA and miRNA may help to develop new effective therapies for the treatment of glioma.

  14. Polymerization of non-complementary RNA: systematic symmetric nucleotide exchanges mainly involving uracil produce mitochondrial RNA transcripts coding for cryptic overlapping genes.

    PubMed

    Seligmann, Hervé

    2013-03-01

    Usual DNA→RNA transcription exchanges T→U. Assuming different systematic symmetric nucleotide exchanges during translation, some GenBank RNAs match exactly human mitochondrial sequences (exchange rules listed in decreasing transcript frequencies): C↔U, A↔U, A↔U+C↔G (two nucleotide pairs exchanged), G↔U, A↔G, C↔G, none for A↔C, A↔G+C↔U, and A↔C+G↔U. Most unusual transcripts involve exchanging uracil. Independent measures of rates of rare replicational enzymatic DNA nucleotide misinsertions predict frequencies of RNA transcripts systematically exchanging the corresponding misinserted nucleotides. Exchange transcripts self-hybridize less than other gene regions, self-hybridization increases with length, suggesting endoribonuclease-limited elongation. Blast detects stop codon depleted putative protein coding overlapping genes within exchange-transcribed mitochondrial genes. These align with existing GenBank proteins (mainly metazoan origins, prokaryotic and viral origins underrepresented). These GenBank proteins frequently interact with RNA/DNA, are membrane transporters, or are typical of mitochondrial metabolism. Nucleotide exchange transcript frequencies increase with overlapping gene densities and stop densities, indicating finely tuned counterbalancing regulation of expression of systematic symmetric nucleotide exchange-encrypted proteins. Such expression necessitates combined activities of suppressor tRNAs matching stops, and nucleotide exchange transcription. Two independent properties confirm predicted exchanged overlap coding genes: discrepancy of third codon nucleotide contents from replicational deamination gradients, and codon usage according to circular code predictions. Predictions from both properties converge, especially for frequent nucleotide exchange types. Nucleotide exchanging transcription apparently increases coding densities of protein coding genes without lengthening genomes, revealing unsuspected functional DNA coding potential. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

  15. Origins of tmRNA: the missing link in the birth of protein synthesis?

    PubMed

    Macé, Kevin; Gillet, Reynald

    2016-09-30

    The RNA world hypothesis refers to the early period on earth in which RNA was central in assuring both genetic continuity and catalysis. The end of this era coincided with the development of the genetic code and protein synthesis, symbolized by the apparition of the first non-random messenger RNA (mRNA). Modern transfer-messenger RNA (tmRNA) is a unique hybrid molecule which has the properties of both mRNA and transfer RNA (tRNA). It acts as a key molecule during trans-translation, a major quality control pathway of modern bacterial protein synthesis. tmRNA shares many common characteristics with ancestral RNA. Here, we present a model in which proto-tmRNAs were the first molecules on earth to support non-random protein synthesis, explaining the emergence of early genetic code. In this way, proto-tmRNA could be the missing link between the first mRNA and tRNA molecules and modern ribosome-mediated protein synthesis. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  16. Author Correction: Recognition of RNA N6-methyladenosine by IGF2BP proteins enhances mRNA stability and translation.

    PubMed

    Huang, Huilin; Weng, Hengyou; Sun, Wenju; Qin, Xi; Shi, Hailing; Wu, Huizhe; Zhao, Boxuan Simen; Mesquita, Ana; Liu, Chang; Yuan, Celvie L; Hu, Yueh-Chiang; Hüttelmaier, Stefan; Skibbe, Jennifer R; Su, Rui; Deng, Xiaolan; Dong, Lei; Sun, Miao; Li, Chenying; Nachtergaele, Sigrid; Wang, Yungui; Hu, Chao; Ferchen, Kyle; Greis, Kenneth D; Jiang, Xi; Wei, Minjie; Qu, Lianghu; Guan, Jun-Lin; He, Chuan; Yang, Jianhua; Chen, Jianjun

    2018-06-07

    In the version of this Article originally published, the authors incorrectly listed an accession code as GES90642. The correct code is GSE90642 . This has now been amended in all online versions of the Article.

  17. A large shRNA library approach identifies lncRNA Ntep as an essential regulator of cell proliferation

    PubMed Central

    Beermann, Julia; Kirste, Dominique; Iwanov, Katharina; Lu, Dongchao; Kleemiß, Felix; Kumarswamy, Regalla; Schimmel, Katharina; Bär, Christian; Thum, Thomas

    2018-01-01

    The mammalian cell cycle is a complex and tightly controlled event. Myriads of different control mechanisms are involved in its regulation. Long non-coding RNAs (lncRNA) have emerged as important regulators of many cellular processes including cellular proliferation. However, a more global and unbiased approach to identify lncRNAs with importance for cell proliferation is missing. Here, we present a lentiviral shRNA library-based approach for functional lncRNA profiling. We validated our library approach in NIH3T3 (3T3) fibroblasts by identifying lncRNAs critically involved in cell proliferation. Using stringent selection criteria we identified lncRNA NR_015491.1 out of 3842 different RNA targets represented in our library. We termed this transcript Ntep (non-coding transcript essential for proliferation), as a bona fide lncRNA essential for cell cycle progression. Inhibition of Ntep in 3T3 and primary fibroblasts prevented normal cell growth and expression of key fibroblast markers. Mechanistically, we discovered that Ntep is important to activate P53 concomitant with increased apoptosis and cell cycle blockade in late G2/M. Our findings suggest Ntep to serve as an important regulator of fibroblast proliferation and function. In summary, our study demonstrates the applicability of an innovative shRNA library approach to identify long non-coding RNA functions in a massive parallel approach. PMID:29099486

  18. The long non-coding RNA MALAT1 promotes the migration and invasion of hepatocellular carcinoma by sponging miR-204 and releasing SIRT1.

    PubMed

    Hou, Zhouhua; Xu, Xuwen; Zhou, Ledu; Fu, Xiaoyu; Tao, Shuhui; Zhou, Jiebin; Tan, Deming; Liu, Shuiping

    2017-07-01

    Increasing evidence supports the significance of long non-coding RNA in cancer development. Several recent studies suggest the oncogenic activity of long non-coding RNA metastasis-associated lung adenocarcinoma transcript 1 (MALAT1) in hepatocellular carcinoma. In this study, we explored the molecular mechanisms by which MALAT1 modulates hepatocellular carcinoma biological behaviors. We found that microRNA-204 was significantly downregulated in sh-MALAT1 HepG2 cell and 15 hepatocellular carcinoma tissues by quantitative real-time polymerase chain reaction analysis. Through bioinformatic screening, luciferase reporter assay, RNA-binding protein immunoprecipitation, and RNA pull-down assay, we identified microRNA-204 as a potential interacting partner for MALAT1. Functionally, wound-healing and transwell assays revealed that microRNA-204 significantly inhibited the migration and invasion of hepatocellular carcinoma cells. Notably, sirtuin 1 was recognized as a direct downstream target of microRNA-204 in HepG2 cells. Moreover, si-SIRT1 significantly inhibited cell invasion and migration process. These data elucidated, by sponging and competitive binding to microRNA-204, MALAT1 releases the suppression on sirtuin 1, which in turn promotes hepatocellular carcinoma migration and invasion. This study reveals a novel mechanism by which MALAT1 stimulates hepatocellular carcinoma progression and justifies targeting metastasis-associated lung adenocarcinoma transcript 1 as a potential therapy for hepatocellular carcinoma.

  19. Opposite GC skews at the 5' and 3' ends of genes in unicellular fungi

    PubMed Central

    2011-01-01

    Background GC-skews have previously been linked to transcription in some eukaryotes. They have been associated with transcription start sites, with the coding strand G-biased in mammals and C-biased in fungi and invertebrates. Results We show a consistent and highly significant pattern of GC-skew within genes of almost all unicellular fungi. The pattern of GC-skew is asymmetrical: the coding strand of genes is typically C-biased at the 5' ends but G-biased at the 3' ends, with intermediate skews at the middle of genes. Thus, the initiation, elongation, and termination phases of transcription are associated with different skews. This pattern influences the encoded proteins by generating differential usage of amino acids at the 5' and 3' ends of genes. These biases also affect fourfold-degenerate positions and extend into promoters and 3' UTRs, indicating that skews cannot be accounted by selection for protein function or translation. Conclusions We propose two explanations, the mutational pressure hypothesis, and the adaptive hypothesis. The mutational pressure hypothesis is that different co-factors bind to RNA pol II at different phases of transcription, producing different mutational regimes. The adaptive hypothesis is that cytidine triphosphate deficiency may lead to C-avoidance at the 3' ends of transcripts to control the flow of RNA pol II molecules and reduce their frequency of collisions. PMID:22208287

  20. tRNA acceptor stem and anticodon bases form independent codes related to protein folding

    PubMed Central

    Carter, Charles W.; Wolfenden, Richard

    2015-01-01

    Aminoacyl-tRNA synthetases recognize tRNA anticodon and 3′ acceptor stem bases. Synthetase Urzymes acylate cognate tRNAs even without anticodon-binding domains, in keeping with the possibility that acceptor stem recognition preceded anticodon recognition. Representing tRNA identity elements with two bits per base, we show that the anticodon encodes the hydrophobicity of each amino acid side-chain as represented by its water-to-cyclohexane distribution coefficient, and this relationship holds true over the entire temperature range of liquid water. The acceptor stem codes preferentially for the surface area or size of each side-chain, as represented by its vapor-to-cyclohexane distribution coefficient. These orthogonal experimental properties are both necessary to account satisfactorily for the exposed surface area of amino acids in folded proteins. Moreover, the acceptor stem codes correctly for β-branched and carboxylic acid side-chains, whereas the anticodon codes for a wider range of such properties, but not for size or β-branching. These and other results suggest that genetic coding of 3D protein structures evolved in distinct stages, based initially on the size of the amino acid and later on its compatibility with globular folding in water. PMID:26034281

  1. RAID: a comprehensive resource for human RNA-associated (RNA-RNA/RNA-protein) interaction.

    PubMed

    Zhang, Xiaomeng; Wu, Deng; Chen, Liqun; Li, Xiang; Yang, Jinxurong; Fan, Dandan; Dong, Tingting; Liu, Mingyue; Tan, Puwen; Xu, Jintian; Yi, Ying; Wang, Yuting; Zou, Hua; Hu, Yongfei; Fan, Kaili; Kang, Juanjuan; Huang, Yan; Miao, Zhengqiang; Bi, Miaoman; Jin, Nana; Li, Kongning; Li, Xia; Xu, Jianzhen; Wang, Dong

    2014-07-01

    Transcriptomic analyses have revealed an unexpected complexity in the eukaryote transcriptome, which includes not only protein-coding transcripts but also an expanding catalog of noncoding RNAs (ncRNAs). Diverse coding and noncoding RNAs (ncRNAs) perform functions through interaction with each other in various cellular processes. In this project, we have developed RAID (http://www.rna-society.org/raid), an RNA-associated (RNA-RNA/RNA-protein) interaction database. RAID intends to provide the scientific community with all-in-one resources for efficient browsing and extraction of the RNA-associated interactions in human. This version of RAID contains more than 6100 RNA-associated interactions obtained by manually reviewing more than 2100 published papers, including 4493 RNA-RNA interactions and 1619 RNA-protein interactions. Each entry contains detailed information on an RNA-associated interaction, including RAID ID, RNA/protein symbol, RNA/protein categories, validated method, expressing tissue, literature references (Pubmed IDs), and detailed functional description. Users can query, browse, analyze, and manipulate RNA-associated (RNA-RNA/RNA-protein) interaction. RAID provides a comprehensive resource of human RNA-associated (RNA-RNA/RNA-protein) interaction network. Furthermore, this resource will help in uncovering the generic organizing principles of cellular function network. © 2014 Zhang et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  2. Expression Profiling Smackdown: Human Transcriptome Array HTA 2.0 vs. RNA-Seq

    PubMed Central

    Palermo, Meghann; Driscoll, Heather; Tighe, Scott; Dragon, Julie; Bond, Jeff; Shukla, Arti; Vangala, Mahesh; Vincent, James; Hunter, Tim

    2014-01-01

    The advent of both microarray and massively parallel sequencing have revolutionized high-throughput analysis of the human transcriptome. Due to limitations in microarray technology, detecting and quantifying coding transcript isoforms, in addition to non-coding transcripts, has been challenging. As a result, RNA-Seq has been the preferred method for characterizing the full human transcriptome, until now. A new high-resolution array from Affymetrix, GeneChip Human Transcriptome Array 2.0 (HTA 2.0), has been designed to interrogate all transcript isoforms in the human transcriptome with >6 million probes targeting coding transcripts, exon-exon splice junctions, and non-coding transcripts. Here we compare expression results from GeneChip HTA 2.0 and RNA-Seq data using identical RNA extractions from three samples each of healthy human mesothelial cells in culture, LP9-C1, and healthy mesothelial cells treated with asbestos, LP9-A1. For GeneChip HTA 2.0 sample preparation, we chose to compare two target preparation methods, NuGEN Ovation Pico WTA V2 with the Encore Biotin Module versus Affymetrix's GeneChip WT PLUS with the WT Terminal Labeling Kit, on identical RNA extractions from both untreated and treated samples. These same RNA extractions were used for the RNA-Seq library preparation. All analyses were performed in Partek Genomics Suite 6.6. Expression profiles for control and asbestos-treated mesothelial cells prepared with NuGEN versus Affymetrix target preparation methods (GeneChip HTA 2.0) are compared to each other as well as to RNA-Seq results.

  3. Complete mitochondrial genome of Germain's Peacock-Pheasant Polyplectron germaini (Aves, Galliformes, Phasianidae).

    PubMed

    Omeire, Destiny; Abdin, Shaunte; Brooks, Daniel M; Miranda, Hector C

    2015-04-01

    The Germain's Peacock-Pheasant Polyplectron germaini (Aves, Galliformes, Phasianidae) is classified as Near Threatened on the IUCN Red List. The complete mitochondrial genome of P. germaini is 16,699 bp, consisting of 13 protein-coding genes, 2 rRNA, 22 tRNA genes and 1 control region. All of the 13 protein-coding genes have ATG as start codon. Eight of the 13 protein-coding genes have TAA as stop codon.

  4. Possibilities for the evolution of the genetic code from a preceding form

    NASA Technical Reports Server (NTRS)

    Jukes, T. H.

    1973-01-01

    Analysis of the interaction between mRNA codons and tRNA anticodons suggests a model for the evolution of the genetic code. Modification of the nucleic acid following the anticodon is at present essential in both eukaryotes and prokaryotes to ensure fidelity of translation of codons starting with A, and the amino acids which could be coded for before the evolution of the modifying enzymes can be deduced.

  5. The Complete Mitogenome of the Wood-Feeding Cockroach Cryptocercus meridianus (Blattodea: Cryptocercidae) and Its Phylogenetic Relationship among Cockroach Families.

    PubMed

    Li, Weijun; Wang, Zongqing; Che, Yanli

    2017-11-12

    In this study, the complete mitochondrial genome of Cryptocercus meridianus was sequenced. The circular mitochondrial genome is 15,322 bp in size and contains 13 protein-coding genes, two ribosomal RNA genes (12S rRNA and 16S rRNA), 22 transfer RNA genes, and one D-loop region. We compare the mitogenome of C. meridianus with that of C. relictus and C. kyebangensis . The base composition of the whole genome was 45.20%, 9.74%, 16.06%, and 29.00% for A, G, C, and T, respectively; it shows a high AT content (74.2%), similar to the mitogenomes of C. relictus and C. kyebangensis . The protein-coding genes are initiated with typical mitochondrial start codons except for cox1 with TTG. The gene order of the C. meridianus mitogenome differs from the typical insect pattern for the translocation of tRNA-Ser AGN , while the mitogenomes of the other two Cryptocercus species, C. relictus and C. kyebangensis , are consistent with the typical insect pattern. There are two very long non-coding intergenic regions lying on both sides of the rearranged gene tRNA-Ser AGN . The phylogenetic relationships were constructed based on the nucleotide sequence of 13 protein-coding genes and two ribosomal RNA genes. The mitogenome of C. meridianus is the first representative of the order Blattodea that demonstrates rearrangement, and it will contribute to the further study of the phylogeny and evolution of the genus Cryptocercus and related taxa.

  6. An integrated, structure- and energy-based view of the genetic code.

    PubMed

    Grosjean, Henri; Westhof, Eric

    2016-09-30

    The principles of mRNA decoding are conserved among all extant life forms. We present an integrative view of all the interaction networks between mRNA, tRNA and rRNA: the intrinsic stability of codon-anticodon duplex, the conformation of the anticodon hairpin, the presence of modified nucleotides, the occurrence of non-Watson-Crick pairs in the codon-anticodon helix and the interactions with bases of rRNA at the A-site decoding site. We derive a more information-rich, alternative representation of the genetic code, that is circular with an unsymmetrical distribution of codons leading to a clear segregation between GC-rich 4-codon boxes and AU-rich 2:2-codon and 3:1-codon boxes. All tRNA sequence variations can be visualized, within an internal structural and energy framework, for each organism, and each anticodon of the sense codons. The multiplicity and complexity of nucleotide modifications at positions 34 and 37 of the anticodon loop segregate meaningfully, and correlate well with the necessity to stabilize AU-rich codon-anticodon pairs and to avoid miscoding in split codon boxes. The evolution and expansion of the genetic code is viewed as being originally based on GC content with progressive introduction of A/U together with tRNA modifications. The representation we present should help the engineering of the genetic code to include non-natural amino acids. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  7. COME: a robust coding potential calculation tool for lncRNA identification and characterization based on multiple features.

    PubMed

    Hu, Long; Xu, Zhiyu; Hu, Boqin; Lu, Zhi John

    2017-01-09

    Recent genomic studies suggest that novel long non-coding RNAs (lncRNAs) are specifically expressed and far outnumber annotated lncRNA sequences. To identify and characterize novel lncRNAs in RNA sequencing data from new samples, we have developed COME, a coding potential calculation tool based on multiple features. It integrates multiple sequence-derived and experiment-based features using a decompose-compose method, which makes it more accurate and robust than other well-known tools. We also showed that COME was able to substantially improve the consistency of predication results from other coding potential calculators. Moreover, COME annotates and characterizes each predicted lncRNA transcript with multiple lines of supporting evidence, which are not provided by other tools. Remarkably, we found that one subgroup of lncRNAs classified by such supporting features (i.e. conserved local RNA secondary structure) was highly enriched in a well-validated database (lncRNAdb). We further found that the conserved structural domains on lncRNAs had better chance than other RNA regions to interact with RNA binding proteins, based on the recent eCLIP-seq data in human, indicating their potential regulatory roles. Overall, we present COME as an accurate, robust and multiple-feature supported method for the identification and characterization of novel lncRNAs. The software implementation is available at https://github.com/lulab/COME. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  8. A novel nuclear genetic code alteration in yeasts and the evolution of codon reassignment in eukaryotes.

    PubMed

    Mühlhausen, Stefanie; Findeisen, Peggy; Plessmann, Uwe; Urlaub, Henning; Kollmar, Martin

    2016-07-01

    The genetic code is the cellular translation table for the conversion of nucleotide sequences into amino acid sequences. Changes to the meaning of sense codons would introduce errors into almost every translated message and are expected to be highly detrimental. However, reassignment of single or multiple codons in mitochondria and nuclear genomes, although extremely rare, demonstrates that the code can evolve. Several models for the mechanism of alteration of nuclear genetic codes have been proposed (including "codon capture," "genome streamlining," and "ambiguous intermediate" theories), but with little resolution. Here, we report a novel sense codon reassignment in Pachysolen tannophilus, a yeast related to the Pichiaceae. By generating proteomics data and using tRNA sequence comparisons, we show that Pachysolen translates CUG codons as alanine and not as the more usual leucine. The Pachysolen tRNACAG is an anticodon-mutated tRNA(Ala) containing all major alanine tRNA recognition sites. The polyphyly of the CUG-decoding tRNAs in yeasts is best explained by a tRNA loss driven codon reassignment mechanism. Loss of the CUG-tRNA in the ancient yeast is followed by gradual decrease of respective codons and subsequent codon capture by tRNAs whose anticodon is not part of the aminoacyl-tRNA synthetase recognition region. Our hypothesis applies to all nuclear genetic code alterations and provides several testable predictions. We anticipate more codon reassignments to be uncovered in existing and upcoming genome projects. © 2016 Mühlhausen et al.; Published by Cold Spring Harbor Laboratory Press.

  9. Long non-coding RNA discovery across the genus anopheles reveals conserved secondary structures within and beyond the Gambiae complex.

    PubMed

    Jenkins, Adam M; Waterhouse, Robert M; Muskavitch, Marc A T

    2015-04-23

    Long non-coding RNAs (lncRNAs) have been defined as mRNA-like transcripts longer than 200 nucleotides that lack significant protein-coding potential, and many of them constitute scaffolds for ribonucleoprotein complexes with critical roles in epigenetic regulation. Various lncRNAs have been implicated in the modulation of chromatin structure, transcriptional and post-transcriptional gene regulation, and regulation of genomic stability in mammals, Caenorhabditis elegans, and Drosophila melanogaster. The purpose of this study is to identify the lncRNA landscape in the malaria vector An. gambiae and assess the evolutionary conservation of lncRNAs and their secondary structures across the Anopheles genus. Using deep RNA sequencing of multiple Anopheles gambiae life stages, we have identified 2,949 lncRNAs and more than 300 previously unannotated putative protein-coding genes. The lncRNAs exhibit differential expression profiles across life stages and adult genders. We find that across the genus Anopheles, lncRNAs display much lower sequence conservation than protein-coding genes. Additionally, we find that lncRNA secondary structure is highly conserved within the Gambiae complex, but diverges rapidly across the rest of the genus Anopheles. This study offers one of the first lncRNA secondary structure analyses in vector insects. Our description of lncRNAs in An. gambiae offers the most comprehensive genome-wide insights to date into lncRNAs in this vector mosquito, and defines a set of potential targets for the development of vector-based interventions that may further curb the human malaria burden in disease-endemic countries.

  10. Regulatory consequences of neuronal ELAV-like protein binding to coding and non-coding RNAs in human brain

    PubMed Central

    Scheckel, Claudia; Drapeau, Elodie; Frias, Maria A; Park, Christopher Y; Fak, John; Zucker-Scharff, Ilana; Kou, Yan; Haroutunian, Vahram; Ma'ayan, Avi

    2016-01-01

    Neuronal ELAV-like (nELAVL) RNA binding proteins have been linked to numerous neurological disorders. We performed crosslinking-immunoprecipitation and RNAseq on human brain, and identified nELAVL binding sites on 8681 transcripts. Using knockout mice and RNAi in human neuroblastoma cells, we showed that nELAVL intronic and 3' UTR binding regulates human RNA splicing and abundance. We validated hundreds of nELAVL targets among which were important neuronal and disease-associated transcripts, including Alzheimer's disease (AD) transcripts. We therefore investigated RNA regulation in AD brain, and observed differential splicing of 150 transcripts, which in some cases correlated with differential nELAVL binding. Unexpectedly, the most significant change of nELAVL binding was evident on non-coding Y RNAs. nELAVL/Y RNA complexes were specifically remodeled in AD and after acute UV stress in neuroblastoma cells. We propose that the increased nELAVL/Y RNA association during stress may lead to nELAVL sequestration, redistribution of nELAVL target binding, and altered neuronal RNA splicing. DOI: http://dx.doi.org/10.7554/eLife.10421.001 PMID:26894958

  11. The complete nucleotide sequence of RNA beta from the type strain of barley stripe mosaic virus.

    PubMed Central

    Gustafson, G; Armour, S L

    1986-01-01

    The complete nucleotide sequence of RNA beta from the type strain of barley stripe mosaic virus (BSMV) has been determined. The sequence is 3289 nucleotides in length and contains four open reading frames (ORFs) which code for proteins of Mr 22,147 (ORF1), Mr 58,098 (ORF2), Mr 17,378 (ORF3), and Mr 14,119 (ORF4). The predicted N-terminal amino acid sequence of the polypeptide encoded by the ORF nearest the 5'-end of the RNA (ORF1) is identical (after the initiator methionine) to the published N-terminal amino acid sequence of BSMV coat protein for 29 of the first 30 amino acids. ORF2 occupies the central portion of the coding region of RNA beta and ORF3 is located at the 3'-end. The ORF4 sequence overlaps the 3'-region of ORF2 and the 5'-region of ORF3 and differs in codon usage from the other three RNA beta ORFs. The coding region of RNA beta is followed by a poly(A) tract and a 238 nucleotide tRNA-like structure which are common to all three BSMV genomic RNAs. Images PMID:3754962

  12. Role of RNA interference (RNAi) in the Moss Physcomitrella patens.

    PubMed

    Arif, Muhammad Asif; Frank, Wolfgang; Khraiwesh, Basel

    2013-01-14

    RNA interference (RNAi) is a mechanism that regulates genes by either transcriptional (TGS) or posttranscriptional gene silencing (PTGS), required for genome maintenance and proper development of an organism. Small non-coding RNAs are the key players in RNAi and have been intensively studied in eukaryotes. In plants, several classes of small RNAs with specific sizes and dedicated functions have evolved. The major classes of small RNAs include microRNAs (miRNAs) and small interfering RNAs (siRNAs), which differ in their biogenesis. miRNAs are synthesized from a short hairpin structure while siRNAs are derived from long double-stranded RNAs (dsRNA). Both miRNA and siRNAs control the expression of cognate target RNAs by binding to reverse complementary sequences mediating cleavage or translational inhibition of the target RNA. They also act on the DNA and cause epigenetic changes such as DNA methylation and histone modifications. In the last years, the analysis of plant RNAi pathways was extended to the bryophyte Physcomitrella patens, a non-flowering, non-vascular ancient land plant that diverged from the lineage of seed plants approximately 450 million years ago. Based on a number of characteristic features and its phylogenetic key position in land plant evolution P. patens emerged as a plant model species to address basic as well as applied topics in plant biology. Here we summarize the current knowledge on the role of RNAi in P. patens that shows functional overlap with RNAi pathways from seed plants, and also unique features specific to this species.

  13. A novel TBP-TAF complex on RNA polymerase II-transcribed snRNA genes.

    PubMed

    Zaborowska, Justyna; Taylor, Alice; Roeder, Robert G; Murphy, Shona

    2012-01-01

    Initiation of transcription of most human genes transcribed by RNA polymerase II (RNAP II) requires the formation of a preinitiation complex comprising TFIIA, B, D, E, F, H and RNAP II. The general transcription factor TFIID is composed of the TATA-binding protein and up to 13 TBP-associated factors. During transcription of snRNA genes, RNAP II does not appear to make the transition to long-range productive elongation, as happens during transcription of protein-coding genes. In addition, recognition of the snRNA gene-type specific 3' box RNA processing element requires initiation from an snRNA gene promoter. These characteristics may, at least in part, be driven by factors recruited to the promoter. For example, differences in the complement of TAFs might result in differential recruitment of elongation and RNA processing factors. As precedent, it already has been shown that the promoters of some protein-coding genes do not recruit all the TAFs found in TFIID. Although TAF5 has been shown to be associated with RNAP II-transcribed snRNA genes, the full complement of TAFs associated with these genes has remained unclear. Here we show, using a ChIP and siRNA-mediated approach, that the TBP/TAF complex on snRNA genes differs from that found on protein-coding genes. Interestingly, the largest TAF, TAF1, and the core TAFs, TAF10 and TAF4, are not detected on snRNA genes. We propose that this snRNA gene-specific TAF subset plays a key role in gene type-specific control of expression.

  14. An RNA Phage Lab: MS2 in Walter Fiers' laboratory of molecular biology in Ghent, from genetic code to gene and genome, 1963-1976.

    PubMed

    Pierrel, Jérôme

    2012-01-01

    The importance of viruses as model organisms is well-established in molecular biology and Max Delbrück's phage group set standards in the DNA phage field. In this paper, I argue that RNA phages, discovered in the 1960s, were also instrumental in the making of molecular biology. As part of experimental systems, RNA phages stood for messenger RNA (mRNA), genes and genome. RNA was thought to mediate information transfers between DNA and proteins. Furthermore, RNA was more manageable at the bench than DNA due to the availability of specific RNases, enzymes used as chemical tools to analyse RNA. Finally, RNA phages provided scientists with a pure source of mRNA to investigate the genetic code, genes and even a genome sequence. This paper focuses on Walter Fiers' laboratory at Ghent University (Belgium) and their work on the RNA phage MS2. When setting up his Laboratory of Molecular Biology, Fiers planned a comprehensive study of the virus with a strong emphasis on the issue of structure. In his lab, RNA sequencing, now a little-known technique, evolved gradually from a means to solve the genetic code, to a tool for completing the first genome sequence. Thus, I follow the research pathway of Fiers and his 'RNA phage lab' with their evolving experimental system from 1960 to the late 1970s. This study illuminates two decisive shifts in post-war biology: the emergence of molecular biology as a discipline in the 1960s in Europe and of genomics in the 1990s.

  15. RNA FRABASE 2.0: an advanced web-accessible database with the capacity to search the three-dimensional fragments within RNA structures

    PubMed Central

    2010-01-01

    Background Recent discoveries concerning novel functions of RNA, such as RNA interference, have contributed towards the growing importance of the field. In this respect, a deeper knowledge of complex three-dimensional RNA structures is essential to understand their new biological functions. A number of bioinformatic tools have been proposed to explore two major structural databases (PDB, NDB) in order to analyze various aspects of RNA tertiary structures. One of these tools is RNA FRABASE 1.0, the first web-accessible database with an engine for automatic search of 3D fragments within PDB-derived RNA structures. This search is based upon the user-defined RNA secondary structure pattern. In this paper, we present and discuss RNA FRABASE 2.0. This second version of the system represents a major extension of this tool in terms of providing new data and a wide spectrum of novel functionalities. An intuitionally operated web server platform enables very fast user-tailored search of three-dimensional RNA fragments, their multi-parameter conformational analysis and visualization. Description RNA FRABASE 2.0 has stored information on 1565 PDB-deposited RNA structures, including all NMR models. The RNA FRABASE 2.0 search engine algorithms operate on the database of the RNA sequences and the new library of RNA secondary structures, coded in the dot-bracket format extended to hold multi-stranded structures and to cover residues whose coordinates are missing in the PDB files. The library of RNA secondary structures (and their graphics) is made available. A high level of efficiency of the 3D search has been achieved by introducing novel tools to formulate advanced searching patterns and to screen highly populated tertiary structure elements. RNA FRABASE 2.0 also stores data and conformational parameters in order to provide "on the spot" structural filters to explore the three-dimensional RNA structures. An instant visualization of the 3D RNA structures is provided. RNA FRABASE 2.0 is freely available at http://rnafrabase.cs.put.poznan.pl. Conclusions RNA FRABASE 2.0 provides a novel database and powerful search engine which is equipped with new data and functionalities that are unavailable elsewhere. Our intention is that this advanced version of the RNA FRABASE will be of interest to all researchers working in the RNA field. PMID:20459631

  16. RNA FRABASE 2.0: an advanced web-accessible database with the capacity to search the three-dimensional fragments within RNA structures.

    PubMed

    Popenda, Mariusz; Szachniuk, Marta; Blazewicz, Marek; Wasik, Szymon; Burke, Edmund K; Blazewicz, Jacek; Adamiak, Ryszard W

    2010-05-06

    Recent discoveries concerning novel functions of RNA, such as RNA interference, have contributed towards the growing importance of the field. In this respect, a deeper knowledge of complex three-dimensional RNA structures is essential to understand their new biological functions. A number of bioinformatic tools have been proposed to explore two major structural databases (PDB, NDB) in order to analyze various aspects of RNA tertiary structures. One of these tools is RNA FRABASE 1.0, the first web-accessible database with an engine for automatic search of 3D fragments within PDB-derived RNA structures. This search is based upon the user-defined RNA secondary structure pattern. In this paper, we present and discuss RNA FRABASE 2.0. This second version of the system represents a major extension of this tool in terms of providing new data and a wide spectrum of novel functionalities. An intuitionally operated web server platform enables very fast user-tailored search of three-dimensional RNA fragments, their multi-parameter conformational analysis and visualization. RNA FRABASE 2.0 has stored information on 1565 PDB-deposited RNA structures, including all NMR models. The RNA FRABASE 2.0 search engine algorithms operate on the database of the RNA sequences and the new library of RNA secondary structures, coded in the dot-bracket format extended to hold multi-stranded structures and to cover residues whose coordinates are missing in the PDB files. The library of RNA secondary structures (and their graphics) is made available. A high level of efficiency of the 3D search has been achieved by introducing novel tools to formulate advanced searching patterns and to screen highly populated tertiary structure elements. RNA FRABASE 2.0 also stores data and conformational parameters in order to provide "on the spot" structural filters to explore the three-dimensional RNA structures. An instant visualization of the 3D RNA structures is provided. RNA FRABASE 2.0 is freely available at http://rnafrabase.cs.put.poznan.pl. RNA FRABASE 2.0 provides a novel database and powerful search engine which is equipped with new data and functionalities that are unavailable elsewhere. Our intention is that this advanced version of the RNA FRABASE will be of interest to all researchers working in the RNA field.

  17. Integrative analyses of transcriptome sequencing identify novel functional lncRNAs in esophageal squamous cell carcinoma.

    PubMed

    Li, C-Q; Huang, G-W; Wu, Z-Y; Xu, Y-J; Li, X-C; Xue, Y-J; Zhu, Y; Zhao, J-M; Li, M; Zhang, J; Wu, J-Y; Lei, F; Wang, Q-Y; Li, S; Zheng, C-P; Ai, B; Tang, Z-D; Feng, C-C; Liao, L-D; Wang, S-H; Shen, J-H; Liu, Y-J; Bai, X-F; He, J-Z; Cao, H-H; Wu, B-L; Wang, M-R; Lin, D-C; Koeffler, H P; Wang, L-D; Li, X; Li, E-M; Xu, L-Y

    2017-02-13

    Long non-coding RNAs (lncRNAs) have a critical role in cancer initiation and progression, and thus may mediate oncogenic or tumor suppressing effects, as well as be a new class of cancer therapeutic targets. We performed high-throughput sequencing of RNA (RNA-seq) to investigate the expression level of lncRNAs and protein-coding genes in 30 esophageal samples, comprised of 15 esophageal squamous cell carcinoma (ESCC) samples and their 15 paired non-tumor tissues. We further developed an integrative bioinformatics method, denoted URW-LPE, to identify key functional lncRNAs that regulate expression of downstream protein-coding genes in ESCC. A number of known onco-lncRNA and many putative novel ones were effectively identified by URW-LPE. Importantly, we identified lncRNA625 as a novel regulator of ESCC cell proliferation, invasion and migration. ESCC patients with high lncRNA625 expression had significantly shorter survival time than those with low expression. LncRNA625 also showed specific prognostic value for patients with metastatic ESCC. Finally, we identified E1A-binding protein p300 (EP300) as a downstream executor of lncRNA625-induced transcriptional responses. These findings establish a catalog of novel cancer-associated functional lncRNAs, which will promote our understanding of lncRNA-mediated regulation in this malignancy.

  18. Endogenous short RNAs generated by Dicer 2 and RNA-dependent RNA polymerase 1 regulate mRNAs in the basal fungus Mucor circinelloides

    PubMed Central

    Nicolas, Francisco Esteban; Moxon, Simon; de Haro, Juan P.; Calo, Silvia; Grigoriev, Igor V.; Torres-Martínez, Santiago; Moulton, Vincent; Ruiz-Vázquez, Rosa M.; Dalmay, Tamas

    2010-01-01

    Endogenous short RNAs (esRNAs) play diverse roles in eukaryotes and usually are produced from double-stranded RNA (dsRNA) by Dicer. esRNAs are grouped into different classes based on biogenesis and function but not all classes are present in all three eukaryotic kingdoms. The esRNA register of fungi is poorly described compared to other eukaryotes and it is not clear what esRNA classes are present in this kingdom and whether they regulate the expression of protein coding genes. However, evidence that some dicer mutant fungi display altered phenotypes suggests that esRNAs play an important role in fungi. Here, we show that the basal fungus Mucor circinelloides produces new classes of esRNAs that map to exons and regulate the expression of many protein coding genes. The largest class of these exonic-siRNAs (ex-siRNAs) are generated by RNA-dependent RNA Polymerase 1 (RdRP1) and dicer-like 2 (DCL2) and target the mRNAs of protein coding genes from which they were produced. Our results expand the range of esRNAs in eukaryotes and reveal a new role for esRNAs in fungi. PMID:20427422

  19. Modification of orthogonal tRNAs: unexpected consequences for sense codon reassignment.

    PubMed

    Biddle, Wil; Schmitt, Margaret A; Fisk, John D

    2016-12-01

    Breaking the degeneracy of the genetic code via sense codon reassignment has emerged as a way to incorporate multiple copies of multiple non-canonical amino acids into a protein of interest. Here, we report the modification of a normally orthogonal tRNA by a host enzyme and show that this adventitious modification has a direct impact on the activity of the orthogonal tRNA in translation. We observed nearly equal decoding of both histidine codons, CAU and CAC, by an engineered orthogonal M. jannaschii tRNA with an AUG anticodon: tRNA Opt We suspected a modification of the tRNA Opt AUG anticodon was responsible for the anomalous lack of codon discrimination and demonstrate that adenosine 34 of tRNA Opt AUG is converted to inosine. We identified tRNA Opt AUG anticodon loop variants that increase reassignment of the histidine CAU codon, decrease incorporation in response to the histidine CAC codon, and improve cell health and growth profiles. Recognizing tRNA modification as both a potential pitfall and avenue of directed alteration will be important as the field of genetic code engineering continues to infiltrate the genetic codes of diverse organisms. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  20. GRID-seq reveals the global RNA-chromatin interactome

    PubMed Central

    Li, Xiao; Zhou, Bing; Chen, Liang; Gou, Lan-Tao; Li, Hairi; Fu, Xiang-Dong

    2017-01-01

    Higher eukaryotic genomes are bound by a large number of coding and non-coding RNAs, but approaches to comprehensively map the identity and binding sites of these RNAs are lacking. Here we report a method to in situ capture global RNA interactions with DNA by deep sequencing (GRID-seq), which enables the comprehensive identification of the entire repertoire of chromatin-interacting RNAs and their respective binding sites. In human, mouse and Drosophila cells, we detected a large set of tissue-specific coding and non-coding RNAs that are bound to active promoters and enhancers, especially super-enhancers. Assuming that most mRNA-chromatin interactions indicate the physical proximity of a promoter and an enhancer, we constructed a three-dimensional global connectivity map of promoters and enhancers, revealing transcription activity-linked genomic interactions in the nucleus. PMID:28922346

  1. The complete mitochondrial genome of Hydra vulgaris (Hydroida: Hydridae).

    PubMed

    Pan, Hong-Chun; Fang, Hong-Yan; Li, Shi-Wei; Liu, Jun-Hong; Wang, Ying; Wang, An-Tai

    2014-12-01

    The complete mitochondrial genome of Hydra vulgaris (Hydroida: Hydridae) is composed of two linear DNA molecules. The mitochondrial DNA (mtDNA) molecule 1 is 8010 bp long and contains six protein-coding genes, large subunit rRNA, methionine and tryptophan tRNAs, two pseudogenes consisting respectively of a partial copy of COI, and terminal sequences at two ends of the linear mtDNA, while the mtDNA molecule 2 is 7576 bp long and contains seven protein-coding genes, small subunit rRNA, methionine tRNA, a pseudogene consisting of a partial copy of COI and terminal sequences at two ends of the linear mtDNA. COI gene begins with GTG as start codon, whereas other 12 protein-coding genes start with a typical ATG initiation codon. In addition, all protein-coding genes are terminated with TAA as stop codon.

  2. The fourfold way of the genetic code.

    PubMed

    Jiménez-Montaño, Miguel Angel

    2009-11-01

    We describe a compact representation of the genetic code that factorizes the table in quartets. It represents a "least grammar" for the genetic language. It is justified by the Klein-4 group structure of RNA bases and codon doublets. The matrix of the outer product between the column-vector of bases and the corresponding row-vector V(T)=(C G U A), considered as signal vectors, has a block structure consisting of the four cosets of the KxK group of base transformations acting on doublet AA. This matrix, translated into weak/strong (W/S) and purine/pyrimidine (R/Y) nucleotide classes, leads to a code table with mixed and unmixed families in separate regions. A basic difference between them is the non-commuting (R/Y) doublets: AC/CA, GU/UG. We describe the degeneracy in the canonical code and the systematic changes in deviant codes in terms of the divisors of 24, employing modulo multiplication groups. We illustrate binary sub-codes characterizing mutations in the quartets. We introduce a decision-tree to predict the mode of tRNA recognition corresponding to each codon, and compare our result with related findings by Jestin and Soulé [Jestin, J.-L., Soulé, C., 2007. Symmetries by base substitutions in the genetic code predict 2' or 3' aminoacylation of tRNAs. J. Theor. Biol. 247, 391-394], and the rearrangements of the table by Delarue [Delarue, M., 2007. An asymmetric underlying rule in the assignment of codons: possible clue to a quick early evolution of the genetic code via successive binary choices. RNA 13, 161-169] and Rodin and Rodin [Rodin, S.N., Rodin, A.S., 2008. On the origin of the genetic code: signatures of its primordial complementarity in tRNAs and aminoacyl-tRNA synthetases. Heredity 100, 341-355], respectively.

  3. Non coding RNAs in vascular disease - from basic science to clinical applications: Scientific update from the Working Group of Myocardial Function of the European Society of Cardiology

    PubMed

    Fiedler, Jan; Baker, Andrew H; Dimmeler, Stefanie; Heymans, Stephane; Mayr, Manuel; Thum, Thomas

    2018-05-23

    Non-coding RNAs are increasingly recognized not only as regulators of various biological functions but also as targets for a new generation of RNA therapeutics and biomarkers. We hereby review recent insights relating to non-coding RNAs including microRNAs (e.g. miR-126, miR-146a), long non-coding RNAs (e.g. MIR503HG, GATA6-AS, SMILR) and circular RNAs (e.g. cZNF292) and their role in vascular diseases. This includes identification and therapeutic use of hypoxia-regulated non-coding RNAs and endogenous non-coding RNAs that regulate intrinsic smooth muscle cell signalling, age-related non-coding RNAs and non-coding RNAs involved in the regulation of mitochondrial biology and metabolic control. Finally, we discuss non-coding RNA species with biomarker potential.

  4. The long non-coding RNA LSINCT5 promotes malignancy in non-small cell lung cancer by stabilizing HMGA2.

    PubMed

    Tian, Yuheng; Zhang, Lina; Chen, Shuwen; Ma, Yuan; Liu, Yanyan

    2018-06-08

    Long non-coding RNAs (lncRNAs) can actively participate in tumorigenesis in various cancers. However, the involvement of lncRNA long stress induced non-coding transcripts 5 (LSINCT5) in non-small cell lung cancer (NSCLC) remains largely unknown. Here we showed a novel lncRNA signature in NSCLC through lncRNA profiling. Increased LSINCT5 expression positively correlates with malignant clinicopathological features and poor survival. LSINCT5 can promote migration and viability of various NSCLC cells in vitro and also enhance lung cancer progression in vivo. RNA immunoprecipitation followed by mass spectrometry has identified that LSINCT5 interacts with HMGA2. This physical interaction can increase the stability of HMGA2 by inhibiting proteasome-mediated degradation. Therefore, LSINCT5 may possibly contribute to NSCLC tumorigenesis by stabilizing the oncogenic factor of HMGA2. This novel LSINCT5/HMGA2 axis can modulate lung cancer progression and might be a promising target for pharmacological intervention.

  5. Long non-coding RNA produced by RNA polymerase V determines boundaries of heterochromatin

    PubMed Central

    Böhmdorfer, Gudrun; Sethuraman, Shriya; Rowley, M Jordan; Krzyszton, Michal; Rothi, M Hafiz; Bouzit, Lilia; Wierzbicki, Andrzej T

    2016-01-01

    RNA-mediated transcriptional gene silencing is a conserved process where small RNAs target transposons and other sequences for repression by establishing chromatin modifications. A central element of this process are long non-coding RNAs (lncRNA), which in Arabidopsis thaliana are produced by a specialized RNA polymerase known as Pol V. Here we show that non-coding transcription by Pol V is controlled by preexisting chromatin modifications located within the transcribed regions. Most Pol V transcripts are associated with AGO4 but are not sliced by AGO4. Pol V-dependent DNA methylation is established on both strands of DNA and is tightly restricted to Pol V-transcribed regions. This indicates that chromatin modifications are established in close proximity to Pol V. Finally, Pol V transcription is preferentially enriched on edges of silenced transposable elements, where Pol V transcribes into TEs. We propose that Pol V may play an important role in the determination of heterochromatin boundaries. DOI: http://dx.doi.org/10.7554/eLife.19092.001 PMID:27779094

  6. The microRNA machinery regulates fasting-induced changes in gene expression and longevity in Caenorhabditis elegans.

    PubMed

    Kogure, Akiko; Uno, Masaharu; Ikeda, Takako; Nishida, Eisuke

    2017-07-07

    Intermittent fasting (IF) is a dietary restriction regimen that extends the lifespans of Caenorhabditis elegans and mammals by inducing changes in gene expression. However, how IF induces these changes and promotes longevity remains unclear. One proposed mechanism involves gene regulation by microRNAs (miRNAs), small non-coding RNAs (∼22 nucleotides) that repress gene expression and whose expression can be altered by fasting. To test this proposition, we examined the role of the miRNA machinery in fasting-induced transcriptional changes and longevity in C. elegans We revealed that fasting up-regulated the expression of the miRNA-induced silencing complex (miRISC) components, including Argonaute and GW182, and the miRNA-processing enzyme DRSH-1 (the ortholog of the Drosophila Drosha enzyme). Our lifespan measurements demonstrated that IF-induced longevity was suppressed by knock-out or knockdown of miRISC components and was completely inhibited by drsh-1 ablation. Remarkably, drsh-1 ablation inhibited the fasting-induced changes in the expression of the target genes of DAF-16, the insulin/IGF-1 signaling effector in C. elegans Fasting-induced transcriptome alterations were substantially and modestly suppressed in the drsh-1 null mutant and the null mutant of ain-1 , a gene encoding GW182, respectively. Moreover, miRNA array analyses revealed that the expression levels of numerous miRNAs changed after 2 days of fasting. These results indicate that components of the miRNA machinery, especially the miRNA-processing enzyme DRSH-1, play an important role in mediating IF-induced longevity via the regulation of fasting-induced changes in gene expression. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.

  7. Study characterizes long non-coding RNA’s response to DNA damage in colon cancer cells | Center for Cancer Research

    Cancer.gov

    Researchers led by Ashish Lal, Ph.D., Investigator in the Genetics Branch, have shown that when the DNA in human colon cancer cells is damaged, a long non-coding RNA (lncRNA) regulates the expression of genes that halt growth, which allows the cells to repair the damage and promote survival. Their findings suggest an important pro-survival function of a lncRNA in cancer

  8. Protein structure and the sequential structure of mRNA: alpha-helix and beta-sheet signals at the nucleotide level.

    PubMed

    Brunak, S; Engelbrecht, J

    1996-06-01

    A direct comparison of experimentally determined protein structures and their corresponding protein coding mRNA sequences has been performed. We examine whether real world data support the hypothesis that clusters of rare codons correlate with the location of structural units in the resulting protein. The degeneracy of the genetic code allows for a biased selection of codons which may control the translational rate of the ribosome, and may thus in vivo have a catalyzing effect on the folding of the polypeptide chain. A complete search for GenBank nucleotide sequences coding for structural entries in the Brookhaven Protein Data Bank produced 719 protein chains with matching mRNA sequence, amino acid sequence, and secondary structure assignment. By neural network analysis, we found strong signals in mRNA sequence regions surrounding helices and sheets. These signals do not originate from the clustering of rare codons, but from the similarity of codons coding for very abundant amino acid residues at the N- and C-termini of helices and sheets. No correlation between the positioning of rare codons and the location of structural units was found. The mRNA signals were also compared with conserved nucleotide features of 16S-like ribosomal RNA sequences and related to mechanisms for maintaining the correct reading frame by the ribosome.

  9. A Novel Subgenomic Murine Leukemia Virus RNA Transcript Results from Alternative Splicing

    PubMed Central

    Déjardin, Jérôme; Bompard-Maréchal, Guillaume; Audit, Muriel; Hope, Thomas J.; Sitbon, Marc; Mougel, Marylène

    2000-01-01

    Here we show the existence of a novel subgenomic 4.4-kb RNA in cells infected with the prototypic replication-competent Friend or Moloney murine leukemia viruses (MuLV). This RNA derives by splicing from an alternative donor site (SD′) within the capsid-coding region to the canonical envelope splice acceptor site. The position and the sequence of SD′ was highly conserved among mammalian type C and D oncoviruses. Point mutations used to inactivate SD′ without changing the capsid-coding ability affected viral RNA splicing and reduced viral replication in infected cells. PMID:10729146

  10. The complete mitochondrial genome of Chrysopa pallens (Insecta, Neuroptera, Chrysopidae).

    PubMed

    He, Kun; Chen, Zhe; Yu, Dan-Na; Zhang, Jia-Yong

    2012-10-01

    The complete mitochondrial genome of Chrysopa pallens (Neuroptera, Chrysopidae) was sequenced. It consists of 13 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA (rRNA) genes, and a control region (AT-rich region). The total length of C. pallens mitogenome is 16,723 bp with 79.5% AT content, and the length of control region is 1905 bp with 89.1% AT content. The non-coding regions of C. pallens include control region between 12S rRNA and trnI genes, and a 75-bp space region between trnI and trnQ genes.

  11. tRNA Shifts the G-quadruplex-Hairpin Conformational Equilibrium in RNA towards the Hairpin Conformer.

    PubMed

    Rode, Ambadas B; Endoh, Tamaki; Sugimoto, Naoki

    2016-11-07

    Non-coding RNAs play important roles in cellular homeostasis and are involved in many human diseases including cancer. Intermolecular RNA-RNA interactions are the basis for the diverse functions of many non-coding RNAs. Herein, we show how the presence of tRNA influences the equilibrium between hairpin and G-quadruplex conformations in the 5' untranslated regions of oncogenes and model sequences. Kinetic and equilibrium analyses of the hairpin to G-quadruplex conformational transition of purified RNA as well as during co-transcriptional folding indicate that tRNA significantly shifts the equilibrium toward the hairpin conformer. The enhancement of relative translation efficiency in a reporter gene assay is shown to be due to the tRNA-mediated shift in hairpin-G-quadruplex equilibrium of oncogenic mRNAs. Our findings suggest that tRNA is a possible therapeutic target in diseases in which RNA conformational equilibria is dysregulated. © 2016 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  12. StarScan: a web server for scanning small RNA targets from degradome sequencing data.

    PubMed

    Liu, Shun; Li, Jun-Hao; Wu, Jie; Zhou, Ke-Ren; Zhou, Hui; Yang, Jian-Hua; Qu, Liang-Hu

    2015-07-01

    Endogenous small non-coding RNAs (sRNAs), including microRNAs, PIWI-interacting RNAs and small interfering RNAs, play important gene regulatory roles in animals and plants by pairing to the protein-coding and non-coding transcripts. However, computationally assigning these various sRNAs to their regulatory target genes remains technically challenging. Recently, a high-throughput degradome sequencing method was applied to identify biologically relevant sRNA cleavage sites. In this study, an integrated web-based tool, StarScan (sRNA target Scan), was developed for scanning sRNA targets using degradome sequencing data from 20 species. Given a sRNA sequence from plants or animals, our web server performs an ultrafast and exhaustive search for potential sRNA-target interactions in annotated and unannotated genomic regions. The interactions between small RNAs and target transcripts were further evaluated using a novel tool, alignScore. A novel tool, degradomeBinomTest, was developed to quantify the abundance of degradome fragments located at the 9-11th nucleotide from the sRNA 5' end. This is the first web server for discovering potential sRNA-mediated RNA cleavage events in plants and animals, which affords mechanistic insights into the regulatory roles of sRNAs. The StarScan web server is available at http://mirlab.sysu.edu.cn/starscan/. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  13. Light transport feature for SCINFUL.

    PubMed

    Etaati, G R; Ghal-Eh, N

    2008-03-01

    An extended version of the scintillator response function prediction code SCINFUL has been developed by incorporating PHOTRACK, a Monte Carlo light transport code. Comparisons of calculated and experimental results for organic scintillators exposed to neutrons show that the extended code improves the predictive capability of SCINFUL.

  14. MicroRNAs and other non-coding RNAs as targets for anticancer drug development

    PubMed Central

    Ling, Hui; Fabbri, Muller; Calin, George A.

    2015-01-01

    With the first cancer-targeted microRNA drug, MRX34, a liposome-based miR-34 mimic, entering phase I clinical trial in patients with advanced hepatocellular carcinoma in April 2013, miRNA therapeutics are attracting special attention from both academia and biotechnology companies. Although to date the most studied non-coding RNAs (ncRNAs) are miRNAs, the importance of long non-coding RNAs (lncRNAs) is increasingly being recognized. Here we summarize the roles of miRNAs and lncRNAs in cancer, with a focus on the recently identified novel mechanisms of action, and discuss the current strategies in designing ncRNA-targeting therapeutics, as well as the associated challenges. PMID:24172333

  15. Complex Interplay among DNA Modification, Noncoding RNA Expression and Protein-Coding RNA Expression in Salvia miltiorrhiza Chloroplast Genome

    PubMed Central

    Chen, Haimei; Zhang, Jianhui; Yuan, George; Liu, Chang

    2014-01-01

    Salvia miltiorrhiza is one of the most widely used medicinal plants. As a first step to develop a chloroplast-based genetic engineering method for the over-production of active components from S. miltiorrhiza, we have analyzed the genome, transcriptome, and base modifications of the S. miltiorrhiza chloroplast. Total genomic DNA and RNA were extracted from fresh leaves and then subjected to strand-specific RNA-Seq and Single-Molecule Real-Time (SMRT) sequencing analyses. Mapping the RNA-Seq reads to the genome assembly allowed us to determine the relative expression levels of 80 protein-coding genes. In addition, we identified 19 polycistronic transcription units and 136 putative antisense and intergenic noncoding RNA (ncRNA) genes. Comparison of the abundance of protein-coding transcripts (cRNA) with and without overlapping antisense ncRNAs (asRNA) suggest that the presence of asRNA is associated with increased cRNA abundance (p<0.05). Using the SMRT Portal software (v1.3.2), 2687 potential DNA modification sites and two potential DNA modification motifs were predicted. The two motifs include a TATA box–like motif (CPGDMM1, “TATANNNATNA”), and an unknown motif (CPGDMM2 “WNYANTGAW”). Specifically, 35 of the 97 CPGDMM1 motifs (36.1%) and 91 of the 369 CPGDMM2 motifs (24.7%) were found to be significantly modified (p<0.01). Analysis of genes downstream of the CPGDMM1 motif revealed the significantly increased abundance of ncRNA genes that are less than 400 bp away from the significantly modified CPGDMM1motif (p<0.01). Taking together, the present study revealed a complex interplay among DNA modifications, ncRNA and cRNA expression in chloroplast genome. PMID:24914614

  16. Complex interplay among DNA modification, noncoding RNA expression and protein-coding RNA expression in Salvia miltiorrhiza chloroplast genome.

    PubMed

    Chen, Haimei; Zhang, Jianhui; Yuan, George; Liu, Chang

    2014-01-01

    Salvia miltiorrhiza is one of the most widely used medicinal plants. As a first step to develop a chloroplast-based genetic engineering method for the over-production of active components from S. miltiorrhiza, we have analyzed the genome, transcriptome, and base modifications of the S. miltiorrhiza chloroplast. Total genomic DNA and RNA were extracted from fresh leaves and then subjected to strand-specific RNA-Seq and Single-Molecule Real-Time (SMRT) sequencing analyses. Mapping the RNA-Seq reads to the genome assembly allowed us to determine the relative expression levels of 80 protein-coding genes. In addition, we identified 19 polycistronic transcription units and 136 putative antisense and intergenic noncoding RNA (ncRNA) genes. Comparison of the abundance of protein-coding transcripts (cRNA) with and without overlapping antisense ncRNAs (asRNA) suggest that the presence of asRNA is associated with increased cRNA abundance (p<0.05). Using the SMRT Portal software (v1.3.2), 2687 potential DNA modification sites and two potential DNA modification motifs were predicted. The two motifs include a TATA box-like motif (CPGDMM1, "TATANNNATNA"), and an unknown motif (CPGDMM2 "WNYANTGAW"). Specifically, 35 of the 97 CPGDMM1 motifs (36.1%) and 91 of the 369 CPGDMM2 motifs (24.7%) were found to be significantly modified (p<0.01). Analysis of genes downstream of the CPGDMM1 motif revealed the significantly increased abundance of ncRNA genes that are less than 400 bp away from the significantly modified CPGDMM1motif (p<0.01). Taking together, the present study revealed a complex interplay among DNA modifications, ncRNA and cRNA expression in chloroplast genome.

  17. Transcription initiation complex structures elucidate DNA opening.

    PubMed

    Plaschka, C; Hantsche, M; Dienemann, C; Burzinski, C; Plitzko, J; Cramer, P

    2016-05-19

    Transcription of eukaryotic protein-coding genes begins with assembly of the RNA polymerase (Pol) II initiation complex and promoter DNA opening. Here we report cryo-electron microscopy (cryo-EM) structures of yeast initiation complexes containing closed and open DNA at resolutions of 8.8 Å and 3.6 Å, respectively. DNA is positioned and retained over the Pol II cleft by a network of interactions between the TATA-box-binding protein TBP and transcription factors TFIIA, TFIIB, TFIIE, and TFIIF. DNA opening occurs around the tip of the Pol II clamp and the TFIIE 'extended winged helix' domain, and can occur in the absence of TFIIH. Loading of the DNA template strand into the active centre may be facilitated by movements of obstructing protein elements triggered by allosteric binding of the TFIIE 'E-ribbon' domain. The results suggest a unified model for transcription initiation with a key event, the trapping of open promoter DNA by extended protein-protein and protein-DNA contacts.

  18. Gene and genon concept: coding versus regulation

    PubMed Central

    2007-01-01

    We analyse here the definition of the gene in order to distinguish, on the basis of modern insight in molecular biology, what the gene is coding for, namely a specific polypeptide, and how its expression is realized and controlled. Before the coding role of the DNA was discovered, a gene was identified with a specific phenotypic trait, from Mendel through Morgan up to Benzer. Subsequently, however, molecular biologists ventured to define a gene at the level of the DNA sequence in terms of coding. As is becoming ever more evident, the relations between information stored at DNA level and functional products are very intricate, and the regulatory aspects are as important and essential as the information coding for products. This approach led, thus, to a conceptual hybrid that confused coding, regulation and functional aspects. In this essay, we develop a definition of the gene that once again starts from the functional aspect. A cellular function can be represented by a polypeptide or an RNA. In the case of the polypeptide, its biochemical identity is determined by the mRNA prior to translation, and that is where we locate the gene. The steps from specific, but possibly separated sequence fragments at DNA level to that final mRNA then can be analysed in terms of regulation. For that purpose, we coin the new term “genon”. In that manner, we can clearly separate product and regulative information while keeping the fundamental relation between coding and function without the need to introduce a conceptual hybrid. In mRNA, the program regulating the expression of a gene is superimposed onto and added to the coding sequence in cis - we call it the genon. The complementary external control of a given mRNA by trans-acting factors is incorporated in its transgenon. A consequence of this definition is that, in eukaryotes, the gene is, in most cases, not yet present at DNA level. Rather, it is assembled by RNA processing, including differential splicing, from various pieces, as steered by the genon. It emerges finally as an uninterrupted nucleic acid sequence at mRNA level just prior to translation, in faithful correspondence with the amino acid sequence to be produced as a polypeptide. After translation, the genon has fulfilled its role and expires. The distinction between the protein coding information as materialised in the final polypeptide and the processing information represented by the genon allows us to set up a new information theoretic scheme. The standard sequence information determined by the genetic code expresses the relation between coding sequence and product. Backward analysis asks from which coding region in the DNA a given polypeptide originates. The (more interesting) forward analysis asks in how many polypeptides of how many different types a given DNA segment is expressed. This concerns the control of the expression process for which we have introduced the genon concept. Thus, the information theoretic analysis can capture the complementary aspects of coding and regulation, of gene and genon. PMID:18087760

  19. Androgen-responsive non-coding small RNAs extend the potential of HCG stimulation to act as a bioassay of androgen sufficiency.

    PubMed

    Rodie, M E; Mudaliar, M A V; Herzyk, P; McMillan, M; Boroujerdi, M; Chudleigh, S; Tobias, E S; Ahmed, S F

    2017-10-01

    It is unclear whether a short-term change in circulating androgens is associated with changes in the transcriptome of the peripheral blood mononuclear cells (PBMC). To explore the effect of hCG stimulation on the PBMC transcriptome, 12 boys with a median age (range) of 0.7 years (0.3, 11.2) who received intramuscular hCG 1500u on 3 consecutive days as part of their investigations underwent transcriptomic array analysis on RNA extracted from peripheral blood mononuclear cells before and after hCG stimulation. Median pre- and post-hCG testosterone for the overall group was 0.7 nmol/L (<0.5, 6) and 7.9 nmol/L (<0.5, 31.5), respectively. Of the 12 boys, 3 (25%) did not respond to hCG stimulation with a pre and post median serum testosterone of <0.5 nmol/L and <0.5 nmol/L, respectively. When corrected for gene expression changes in the non-responders to exclude hCG effects, all 9 of the hCG responders consistently demonstrated a 20% or greater increase in the expression of piR-37153 and piR-39248 , non-coding PIWI-interacting RNAs (piRNAs). In addition, of the 9 responders, 8, 6 and 4 demonstrated a 30, 40 and 50% rise, respectively, in a total of 2 further piRNAs. In addition, 3 of the responders showed a 50% or greater rise in the expression of another small RNA, SNORD5 . On comparing fold-change in serum testosterone with fold-change in the above transcripts, a positive correlation was detected for SNORD5 ( P  = 0.01). The identification of a dynamic and androgen-responsive PBMC transcriptome extends the potential value of the hCG test for the assessment of androgen sufficiency. © 2017 The authors.

  20. Deciphering the role of the Gag-Pol ribosomal frameshift signal in HIV-1 RNA genome packaging.

    PubMed

    Nikolaitchik, Olga A; Hu, Wei-Shau

    2014-04-01

    A key step of retroviral replication is packaging of the viral RNA genome during virus assembly. Specific packaging is mediated by interactions between the viral protein Gag and elements in the viral RNA genome. In HIV-1, similar to most retroviruses, the packaging signal is located within the 5' untranslated region and extends into the gag-coding region. A recent study reported that a region including the Gag-Pol ribosomal frameshift signal plays an important role in HIV-1 RNA packaging; deletions or mutations that affect the RNA structure of this signal lead to drastic decreases (10- to 50-fold) in viral RNA packaging and virus titer. We examined here the role of the ribosomal frameshift signal in HIV-1 RNA packaging by studying the RNA packaging and virus titer in the context of proviruses. Three mutants with altered ribosomal frameshift signal, either through direct deletion of the signal, mutation of the 6U slippery sequence, or alterations of the secondary structure were examined. We found that RNAs from all three mutants were packaged efficiently, and they generate titers similar to that of a virus containing the wild-type ribosomal frameshift signal. We conclude that although the ribosomal frameshift signal plays an important role in regulating the replication cycle, this RNA element is not directly involved in regulating RNA encapsidation. To generate infectious viruses, HIV-1 must package viral RNA genome during virus assembly. The specific HIV-1 genome packaging is mediated by interactions between the structural protein Gag and elements near the 5' end of the viral RNA known as packaging signal. In this study, we examined whether the Gag-Pol ribosomal frameshift signal is important for HIV-1 RNA packaging as recently reported. Our results demonstrated that when Gag/Gag-Pol is supplied in trans, none of the tested ribosomal frameshift signal mutants has defects in RNA packaging or virus titer. These studies provide important information on how HIV-1 regulates its genome packaging and generate infectious viruses necessary for transmission to new hosts.

  1. Deciphering the Role of the Gag-Pol Ribosomal Frameshift Signal in HIV-1 RNA Genome Packaging

    PubMed Central

    Nikolaitchik, Olga A.

    2014-01-01

    ABSTRACT A key step of retroviral replication is packaging of the viral RNA genome during virus assembly. Specific packaging is mediated by interactions between the viral protein Gag and elements in the viral RNA genome. In HIV-1, similar to most retroviruses, the packaging signal is located within the 5′ untranslated region and extends into the gag-coding region. A recent study reported that a region including the Gag-Pol ribosomal frameshift signal plays an important role in HIV-1 RNA packaging; deletions or mutations that affect the RNA structure of this signal lead to drastic decreases (10- to 50-fold) in viral RNA packaging and virus titer. We examined here the role of the ribosomal frameshift signal in HIV-1 RNA packaging by studying the RNA packaging and virus titer in the context of proviruses. Three mutants with altered ribosomal frameshift signal, either through direct deletion of the signal, mutation of the 6U slippery sequence, or alterations of the secondary structure were examined. We found that RNAs from all three mutants were packaged efficiently, and they generate titers similar to that of a virus containing the wild-type ribosomal frameshift signal. We conclude that although the ribosomal frameshift signal plays an important role in regulating the replication cycle, this RNA element is not directly involved in regulating RNA encapsidation. IMPORTANCE To generate infectious viruses, HIV-1 must package viral RNA genome during virus assembly. The specific HIV-1 genome packaging is mediated by interactions between the structural protein Gag and elements near the 5′ end of the viral RNA known as packaging signal. In this study, we examined whether the Gag-Pol ribosomal frameshift signal is important for HIV-1 RNA packaging as recently reported. Our results demonstrated that when Gag/Gag-Pol is supplied in trans, none of the tested ribosomal frameshift signal mutants has defects in RNA packaging or virus titer. These studies provide important information on how HIV-1 regulates its genome packaging and generate infectious viruses necessary for transmission to new hosts. PMID:24453371

  2. Identification of small non-coding RNA classes expressed in swine whole blood during HP-PRRSV infection

    USDA-ARS?s Scientific Manuscript database

    It has been established that reduced susceptibility to porcine reproductive and respiratory syndrome virus (PRRSV) has a genetic component. This genetic component may take the form of small non-coding RNAs (sncRNA), which are molecules that function as regulators of gene expression. Various sncRNAs ...

  3. Specificity Protein (Sp) Transcription Factors and Metformin Regulate Expression of the Long Non-coding RNA HULC

    EPA Science Inventory

    There is evidence that specificity protein 1 (Sp1) transcription factor (TF) regulates expression of long non-coding RNAs (lncRNAs) in hepatocellular carcinoma (HCC) cells. RNA interference (RNAi) studies showed that among several lncRNAs expressed in HepG2, SNU-449 and SK-Hep-1...

  4. Identification and characterization of long non-coding RNAs in rainbow trout eggs

    USDA-ARS?s Scientific Manuscript database

    Long non-coding RNAs (lncRNAs) are in general considered as a diverse class of transcripts longer than 200 nucleotides that structurally resemble mRNAs but do not encode proteins. Recent advances in RNA sequencing (RNA-Seq) and bioinformatics methods have provided an opportunity to indentify and ana...

  5. The Big Entity of New RNA World: Long Non-Coding RNAs in Microvascular Complications of Diabetes.

    PubMed

    Raut, Satish K; Khullar, Madhu

    2018-01-01

    A major part of the genome is known to be transcribed into non-protein coding RNAs (ncRNAs), such as microRNA and long non-coding RNA (lncRNA). The importance of ncRNAs is being increasingly recognized in physiological and pathological processes. lncRNAs are a novel class of ncRNAs that do not code for proteins and are important regulators of gene expression. In the past, these molecules were thought to be transcriptional "noise" with low levels of evolutionary conservation. However, recent studies provide strong evidence indicating that lncRNAs are (i) regulated during various cellular processes, (ii) exhibit cell type-specific expression, (iii) localize to specific organelles, and (iv) associated with human diseases. Emerging evidence indicates an aberrant expression of lncRNAs in diabetes and diabetes-related microvascular complications. In the present review, we discuss the current state of knowledge of lncRNAs, their genesis from genome, and the mechanism of action of individual lncRNAs in the pathogenesis of microvascular complications of diabetes and therapeutic approaches.

  6. RAID: a comprehensive resource for human RNA-associated (RNA–RNA/RNA–protein) interaction

    PubMed Central

    Zhang, Xiaomeng; Wu, Deng; Chen, Liqun; Li, Xiang; Yang, Jinxurong; Fan, Dandan; Dong, Tingting; Liu, Mingyue; Tan, Puwen; Xu, Jintian; Yi, Ying; Wang, Yuting; Zou, Hua; Hu, Yongfei; Fan, Kaili; Kang, Juanjuan; Huang, Yan; Miao, Zhengqiang; Bi, Miaoman; Jin, Nana; Li, Kongning; Li, Xia; Xu, Jianzhen; Wang, Dong

    2014-01-01

    Transcriptomic analyses have revealed an unexpected complexity in the eukaryote transcriptome, which includes not only protein-coding transcripts but also an expanding catalog of noncoding RNAs (ncRNAs). Diverse coding and noncoding RNAs (ncRNAs) perform functions through interaction with each other in various cellular processes. In this project, we have developed RAID (http://www.rna-society.org/raid), an RNA-associated (RNA–RNA/RNA–protein) interaction database. RAID intends to provide the scientific community with all-in-one resources for efficient browsing and extraction of the RNA-associated interactions in human. This version of RAID contains more than 6100 RNA-associated interactions obtained by manually reviewing more than 2100 published papers, including 4493 RNA–RNA interactions and 1619 RNA–protein interactions. Each entry contains detailed information on an RNA-associated interaction, including RAID ID, RNA/protein symbol, RNA/protein categories, validated method, expressing tissue, literature references (Pubmed IDs), and detailed functional description. Users can query, browse, analyze, and manipulate RNA-associated (RNA–RNA/RNA–protein) interaction. RAID provides a comprehensive resource of human RNA-associated (RNA–RNA/RNA–protein) interaction network. Furthermore, this resource will help in uncovering the generic organizing principles of cellular function network. PMID:24803509

  7. A multidimensional platform for the purification of non-coding RNA species

    PubMed Central

    Chionh, Yok Hian; Ho, Chia-Hua; Pruksakorn, Dumnoensun; Ramesh Babu, I.; Ng, Chee Sheng; Hia, Fabian; McBee, Megan E.; Su, Dan; Pang, Yan Ling Joy; Gu, Chen; Dong, Hongping; Prestwich, Erin G.; Shi, Pei-Yong; Preiser, Peter Rainer; Alonso, Sylvie; Dedon, Peter C.

    2013-01-01

    A renewed interest in non-coding RNA (ncRNA) has led to the discovery of novel RNA species and post-transcriptional ribonucleoside modifications, and an emerging appreciation for the role of ncRNA in RNA epigenetics. Although much can be learned by amplification-based analysis of ncRNA sequence and quantity, there is a significant need for direct analysis of RNA, which has led to numerous methods for purification of specific ncRNA molecules. However, no single method allows purification of the full range of cellular ncRNA species. To this end, we developed a multidimensional chromatographic platform to resolve, isolate and quantify all canonical ncRNAs in a single sample of cells or tissue, as well as novel ncRNA species. The applicability of the platform is demonstrated in analyses of ncRNA from bacteria, human cells and plasmodium-infected reticulocytes, as well as a viral RNA genome. Among the many potential applications of this platform are a system-level analysis of the dozens of modified ribonucleosides in ncRNA, characterization of novel long ncRNA species, enhanced detection of rare transcript variants and analysis of viral genomes. PMID:23907385

  8. Protein functional features are reflected in the patterns of mRNA translation speed.

    PubMed

    López, Daniel; Pazos, Florencio

    2015-07-09

    The degeneracy of the genetic code makes it possible for the same amino acid string to be coded by different messenger RNA (mRNA) sequences. These "synonymous mRNAs" may differ largely in a number of aspects related to their overall translational efficiency, such as secondary structure content and availability of the encoded transfer RNAs (tRNAs). Consequently, they may render different yields of the translated polypeptides. These mRNA features related to translation efficiency are also playing a role locally, resulting in a non-uniform translation speed along the mRNA, which has been previously related to some protein structural features and also used to explain some dramatic effects of "silent" single-nucleotide-polymorphisms (SNPs). In this work we perform the first large scale analysis of the relationship between three experimental proxies of mRNA local translation efficiency and the local features of the corresponding encoded proteins. We found that a number of protein functional and structural features are reflected in the patterns of ribosome occupancy, secondary structure and tRNA availability along the mRNA. One or more of these proxies of translation speed have distinctive patterns around the mRNA regions coding for certain protein local features. In some cases the three patterns follow a similar trend. We also show specific examples where these patterns of translation speed point to the protein's important structural and functional features. This support the idea that the genome not only codes the protein functional features as sequences of amino acids, but also as subtle patterns of mRNA properties which, probably through local effects on the translation speed, have some consequence on the final polypeptide. These results open the possibility of predicting a protein's functional regions based on a single genomic sequence, and have implications for heterologous protein expression and fine-tuning protein function.

  9. The Long Noncoding RNA Transcriptome of Dictyostelium discoideum Development.

    PubMed

    Rosengarten, Rafael D; Santhanam, Balaji; Kokosar, Janez; Shaulsky, Gad

    2017-02-09

    Dictyostelium discoideum live in the soil as single cells, engulfing bacteria and growing vegetatively. Upon starvation, tens of thousands of amoebae enter a developmental program that includes aggregation, multicellular differentiation, and sporulation. Major shifts across the protein-coding transcriptome accompany these developmental changes. However, no study has presented a global survey of long noncoding RNAs (ncRNAs) in D. discoideum To characterize the antisense and long intergenic noncoding RNA (lncRNA) transcriptome, we analyzed previously published developmental time course samples using an RNA-sequencing (RNA-seq) library preparation method that selectively depletes ribosomal RNAs (rRNAs). We detected the accumulation of transcripts for 9833 protein-coding messenger RNAs (mRNAs), 621 lncRNAs, and 162 putative antisense RNAs (asRNAs). The noncoding RNAs were interspersed throughout the genome, and were distinct in expression level, length, and nucleotide composition. The noncoding transcriptome displayed a temporal profile similar to the coding transcriptome, with stages of gradual change interspersed with larger leaps. The transcription profiles of some noncoding RNAs were strongly correlated with known differentially expressed coding RNAs, hinting at a functional role for these molecules during development. Examining the mitochondrial transcriptome, we modeled two novel antisense transcripts. We applied yet another ribosomal depletion method to a subset of the samples to better retain transfer RNA (tRNA) transcripts. We observed polymorphisms in tRNA anticodons that suggested a post-transcriptional means by which D. discoideum compensates for codons missing in the genomic complement of tRNAs. We concluded that the prevalence and characteristics of long ncRNAs indicate that these molecules are relevant to the progression of molecular and cellular phenotypes during development. Copyright © 2017 Rosengarten et al.

  10. A novel nuclear genetic code alteration in yeasts and the evolution of codon reassignment in eukaryotes

    PubMed Central

    Mühlhausen, Stefanie; Findeisen, Peggy; Plessmann, Uwe; Urlaub, Henning; Kollmar, Martin

    2016-01-01

    The genetic code is the cellular translation table for the conversion of nucleotide sequences into amino acid sequences. Changes to the meaning of sense codons would introduce errors into almost every translated message and are expected to be highly detrimental. However, reassignment of single or multiple codons in mitochondria and nuclear genomes, although extremely rare, demonstrates that the code can evolve. Several models for the mechanism of alteration of nuclear genetic codes have been proposed (including “codon capture,” “genome streamlining,” and “ambiguous intermediate” theories), but with little resolution. Here, we report a novel sense codon reassignment in Pachysolen tannophilus, a yeast related to the Pichiaceae. By generating proteomics data and using tRNA sequence comparisons, we show that Pachysolen translates CUG codons as alanine and not as the more usual leucine. The Pachysolen tRNACAG is an anticodon-mutated tRNAAla containing all major alanine tRNA recognition sites. The polyphyly of the CUG-decoding tRNAs in yeasts is best explained by a tRNA loss driven codon reassignment mechanism. Loss of the CUG-tRNA in the ancient yeast is followed by gradual decrease of respective codons and subsequent codon capture by tRNAs whose anticodon is not part of the aminoacyl-tRNA synthetase recognition region. Our hypothesis applies to all nuclear genetic code alterations and provides several testable predictions. We anticipate more codon reassignments to be uncovered in existing and upcoming genome projects. PMID:27197221

  11. Study characterizes long non-coding RNA’s response to DNA damage in colon cancer cells | Center for Cancer Research

    Cancer.gov

    Researchers led by Ashish Lal, Ph.D., Investigator in the Genetics Branch, have shown that when the DNA in human colon cancer cells is damaged, a long non-coding RNA (lncRNA) regulates the expression of genes that halt growth, which allows the cells to repair the damage and promote survival. Their findings suggest an important pro-survival function of a lncRNA in cancer cells.  Read more...

  12. Selective inhibitors of trypanosomal uridylyl transferase RET1 establish druggability of RNA post-transcriptional modifications

    PubMed Central

    Cording, Amy; Gormally, Michael; Bond, Peter J.; Carrington, Mark; Balasubramanian, Shankar; Miska, Eric A.; Thomas, Beth

    2017-01-01

    ABSTRACT Non-coding RNAs are crucial regulators for a vast array of cellular processes and have been implicated in human disease. These biological processes represent a hitherto untapped resource in our fight against disease. In this work we identify small molecule inhibitors of a non-coding RNA uridylylation pathway. The TUTase family of enzymes is important for modulating non-coding RNA pathways in both human cancer and pathogen systems. We demonstrate that this new class of drug target can be accessed with traditional drug discovery techniques. Using the Trypanosoma brucei TUTase, RET1, we identify TUTase inhibitors and lay the groundwork for the use of this new target class as a therapeutic opportunity for the under-served disease area of African Trypanosomiasis. In a broader sense this work demonstrates the therapeutic potential for targeting RNA post-transcriptional modifications with small molecules in human disease. PMID:26786754

  13. Selective inhibitors of trypanosomal uridylyl transferase RET1 establish druggability of RNA post-transcriptional modifications.

    PubMed

    Cording, Amy; Gormally, Michael; Bond, Peter J; Carrington, Mark; Balasubramanian, Shankar; Miska, Eric A; Thomas, Beth

    2017-05-04

    Non-coding RNAs are crucial regulators for a vast array of cellular processes and have been implicated in human disease. These biological processes represent a hitherto untapped resource in our fight against disease. In this work we identify small molecule inhibitors of a non-coding RNA uridylylation pathway. The TUTase family of enzymes is important for modulating non-coding RNA pathways in both human cancer and pathogen systems. We demonstrate that this new class of drug target can be accessed with traditional drug discovery techniques. Using the Trypanosoma brucei TUTase, RET1, we identify TUTase inhibitors and lay the groundwork for the use of this new target class as a therapeutic opportunity for the under-served disease area of African Trypanosomiasis. In a broader sense this work demonstrates the therapeutic potential for targeting RNA post-transcriptional modifications with small molecules in human disease.

  14. A Network Based Method for Analysis of lncRNA-Disease Associations and Prediction of lncRNAs Implicated in Diseases

    PubMed Central

    Yang, Xiaofei; Gao, Lin; Guo, Xingli; Shi, Xinghua; Wu, Hao; Song, Fei; Wang, Bingbo

    2014-01-01

    Increasing evidence has indicated that long non-coding RNAs (lncRNAs) are implicated in and associated with many complex human diseases. Despite of the accumulation of lncRNA-disease associations, only a few studies had studied the roles of these associations in pathogenesis. In this paper, we investigated lncRNA-disease associations from a network view to understand the contribution of these lncRNAs to complex diseases. Specifically, we studied both the properties of the diseases in which the lncRNAs were implicated, and that of the lncRNAs associated with complex diseases. Regarding the fact that protein coding genes and lncRNAs are involved in human diseases, we constructed a coding-non-coding gene-disease bipartite network based on known associations between diseases and disease-causing genes. We then applied a propagation algorithm to uncover the hidden lncRNA-disease associations in this network. The algorithm was evaluated by leave-one-out cross validation on 103 diseases in which at least two genes were known to be involved, and achieved an AUC of 0.7881. Our algorithm successfully predicted 768 potential lncRNA-disease associations between 66 lncRNAs and 193 diseases. Furthermore, our results for Alzheimer's disease, pancreatic cancer, and gastric cancer were verified by other independent studies. PMID:24498199

  15. T cells are influenced by a long non-coding RNA in the autoimmune associated PTPN2 locus.

    PubMed

    Houtman, Miranda; Shchetynsky, Klementy; Chemin, Karine; Hensvold, Aase Haj; Ramsköld, Daniel; Tandre, Karolina; Eloranta, Maija-Leena; Rönnblom, Lars; Uebe, Steffen; Catrina, Anca Irinel; Malmström, Vivianne; Padyukov, Leonid

    2018-06-01

    Non-coding SNPs in the protein tyrosine phosphatase non-receptor type 2 (PTPN2) locus have been linked with several autoimmune diseases, including rheumatoid arthritis, type I diabetes, and inflammatory bowel disease. However, the functional consequences of these SNPs are poorly characterized. Herein, we show in blood cells that SNPs in the PTPN2 locus are highly correlated with DNA methylation levels at four CpG sites downstream of PTPN2 and expression levels of the long non-coding RNA (lncRNA) LINC01882 downstream of these CpG sites. We observed that LINC01882 is mainly expressed in T cells and that anti-CD3/CD28 activated naïve CD4 + T cells downregulate the expression of LINC01882. RNA sequencing analysis of LINC01882 knockdown in Jurkat T cells, using a combination of antisense oligonucleotides and RNA interference, revealed the upregulation of the transcription factor ZEB1 and kinase MAP2K4, both involved in IL-2 regulation. Overall, our data suggests the involvement of LINC01882 in T cell activation and hints towards an auxiliary role of these non-coding SNPs in autoimmunity associated with the PTPN2 locus. Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.

  16. 3D RNA and functional interactions from evolutionary couplings

    PubMed Central

    Weinreb, Caleb; Riesselman, Adam; Ingraham, John B.; Gross, Torsten; Sander, Chris; Marks, Debora S.

    2016-01-01

    Summary Non-coding RNAs are ubiquitous, but the discovery of new RNA gene sequences far outpaces research on their structure and functional interactions. We mine the evolutionary sequence record to derive precise information about function and structure of RNAs and RNA-protein complexes. As in protein structure prediction, we use maximum entropy global probability models of sequence co-variation to infer evolutionarily constrained nucleotide-nucleotide interactions within RNA molecules, and nucleotide-amino acid interactions in RNA-protein complexes. The predicted contacts allow all-atom blinded 3D structure prediction at good accuracy for several known RNA structures and RNA-protein complexes. For unknown structures, we predict contacts in 160 non-coding RNA families. Beyond 3D structure prediction, evolutionary couplings help identify important functional interactions, e.g., at switch points in riboswitches and at a complex nucleation site in HIV. Aided by accelerating sequence accumulation, evolutionary coupling analysis can accelerate the discovery of functional interactions and 3D structures involving RNA. PMID:27087444

  17. Characterization of circulating transfer RNA-derived RNA fragments in cattle

    PubMed Central

    Casas, Eduardo; Cai, Guohong; Neill, John D.

    2015-01-01

    The objective was to characterize naturally occurring circulating transfer RNA-derived RNA fragments (tRFs) in cattle1. Serum from eight clinically normal adult dairy cows was collected, and small non-coding RNAs were extracted immediately after collection and sequenced by Illumina MiSeq. Sequences aligned to transfer RNA (tRNA) genes or their flanking sequences were characterized. Sequences aligned to the beginning of 5′ end of the mature tRNA were classified as tRF5; those aligned to the 3′ end of mature tRNA were classified as tRF3; and those aligned to the beginning of the 3′ end flanking sequences were classified as tRF1. There were 3,190,962 sequences that mapped to transfer RNA and small non-coding RNAs in the bovine genome. Of these, 2,323,520 were identified as tRF5s, 562 were tRF3s, and 81 were tRF1s. There were 866,799 sequences identified as other small non-coding RNAs (microRNA, rRNA, snoRNA, etc.) and were excluded from the study. The tRF5s ranged from 28 to 40 nucleotides; and 98.7% ranged from 30 to 34 nucleotides in length. The tRFs with the greatest number of sequences were derived from tRNA of histidine, glutamic acid, lysine, glycine, and valine. There was no association between number of codons for each amino acid and number of tRFs in the samples. The reason for tRF5s being the most abundant can only be explained if these sequences are associated with function within the animal. PMID:26379699

  18. In silico methods for co-transcriptional RNA secondary structure prediction and for investigating alternative RNA structure expression.

    PubMed

    Meyer, Irmtraud M

    2017-05-01

    RNA transcripts are the primary products of active genes in any living organism, including many viruses. Their cellular destiny not only depends on primary sequence signals, but can also be determined by RNA structure. Recent experimental evidence shows that many transcripts can be assigned more than a single functional RNA structure throughout their cellular life and that structure formation happens co-transcriptionally, i.e. as the transcript is synthesised in the cell. Moreover, functional RNA structures are not limited to non-coding transcripts, but can also feature in coding transcripts. The picture that now emerges is that RNA structures constitute an additional layer of information that can be encoded in any RNA transcript (and on top of other layers of information such as protein-context) in order to exert a wide range of functional roles. Moreover, different encoded RNA structures can be expressed at different stages of a transcript's life in order to alter the transcript's behaviour depending on its actual cellular context. Similar to the concept of alternative splicing for protein-coding genes, where a single transcript can yield different proteins depending on cellular context, it is thus appropriate to propose the notion of alternative RNA structure expression for any given transcript. This review introduces several computational strategies that my group developed to detect different aspects of RNA structure expression in vivo. Two aspects are of particular interest to us: (1) RNA secondary structure features that emerge during co-transcriptional folding and (2) functional RNA structure features that are expressed at different times of a transcript's life and potentially mutually exclusive. Copyright © 2017. Published by Elsevier Inc.

  19. Allele-Selective Transcriptome Recruitment to Polysomes Primed for Translation: Protein-Coding and Noncoding RNAs, and RNA Isoforms.

    PubMed

    Mascarenhas, Roshan; Pietrzak, Maciej; Smith, Ryan M; Webb, Amy; Wang, Danxin; Papp, Audrey C; Pinsonneault, Julia K; Seweryn, Michal; Rempala, Grzegorz; Sadee, Wolfgang

    2015-01-01

    mRNA translation into proteins is highly regulated, but the role of mRNA isoforms, noncoding RNAs (ncRNAs), and genetic variants remains poorly understood. mRNA levels on polysomes have been shown to correlate well with expressed protein levels, pointing to polysomal loading as a critical factor. To study regulation and genetic factors of protein translation we measured levels and allelic ratios of mRNAs and ncRNAs (including microRNAs) in lymphoblast cell lines (LCL) and in polysomal fractions. We first used targeted assays to measure polysomal loading of mRNA alleles, confirming reported genetic effects on translation of OPRM1 and NAT1, and detecting no effect of rs1045642 (3435C>T) in ABCB1 (MDR1) on polysomal loading while supporting previous results showing increased mRNA turnover of the 3435T allele. Use of high-throughput sequencing of complete transcript profiles (RNA-Seq) in three LCLs revealed significant differences in polysomal loading of individual RNA classes and isoforms. Correlated polysomal distribution between protein-coding and non-coding RNAs suggests interactions between them. Allele-selective polysome recruitment revealed strong genetic influence for multiple RNAs, attributable either to differential expression of RNA isoforms or to differential loading onto polysomes, the latter defining a direct genetic effect on translation. Genes identified by different allelic RNA ratios between cytosol and polysomes were enriched with published expression quantitative trait loci (eQTLs) affecting RNA functions, and associations with clinical phenotypes. Polysomal RNA-Seq combined with allelic ratio analysis provides a powerful approach to study polysomal RNA recruitment and regulatory variants affecting protein translation.

  20. Identification of novel non-coding RNA-based negative feedback regulating the expression of the oncogenic transcription factor GLI1.

    PubMed

    Villegas, Victoria E; Rahman, Mohammed Ferdous-Ur; Fernandez-Barrena, Maite G; Diao, Yumei; Liapi, Eleni; Sonkoly, Enikö; Ståhle, Mona; Pivarcsi, Andor; Annaratone, Laura; Sapino, Anna; Ramírez Clavijo, Sandra; Bürglin, Thomas R; Shimokawa, Takashi; Ramachandran, Saraswathi; Kapranov, Philipp; Fernandez-Zapico, Martin E; Zaphiropoulos, Peter G

    2014-07-01

    Non-coding RNAs are a complex class of nucleic acids, with growing evidence supporting regulatory roles in gene expression. Here we identify a non-coding RNA located head-to-head with the gene encoding the Glioma-associated oncogene 1 (GLI1), a transcriptional effector of multiple cancer-associated signaling pathways. The expression of this three-exon GLI1 antisense (GLI1AS) RNA in cancer cells was concordant with GLI1 levels. siRNAs knockdown of GLI1AS up-regulated GLI1 and increased cellular proliferation and tumor growth in a xenograft model system. Conversely, GLI1AS overexpression decreased the levels of GLI1, its target genes PTCH1 and PTCH2, and cellular proliferation. Additionally, we demonstrate that GLI1 knockdown reduced GLI1AS, while GLI1 overexpression increased GLI1AS, supporting the role of GLI1AS as a target gene of the GLI1 transcription factor. Activation of TGFβ and Hedgehog signaling, two known regulators of GLI1 expression, conferred a concordant up-regulation of GLI1 and GLI1AS in cancer cells. Finally, analysis of the mechanism underlying the interplay between GLI1 and GLI1AS indicates that the non-coding RNA elicits a local alteration of chromatin structure by increasing the silencing mark H3K27me3 and decreasing the recruitment of RNA polymerase II to this locus. Taken together, the data demonstrate the existence of a novel non-coding RNA-based negative feedback loop controlling GLI1 levels, thus expanding the repertoire of mechanisms regulating the expression of this oncogenic transcription factor. Copyright © 2014 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.

  1. Digital data for quick response (QR) codes of alkalophilic Bacillus pumilus to identify and to compare bacilli isolated from Lonar Crator Lake, India.

    PubMed

    Rekadwad, Bhagwan N; Khobragade, Chandrahasya N

    2016-06-01

    Microbiologists are routinely engaged isolation, identification and comparison of isolated bacteria for their novelty. 16S rRNA sequences of Bacillus pumilus were retrieved from NCBI repository and generated QR codes for sequences (FASTA format and full Gene Bank information). 16SrRNA were used to generate quick response (QR) codes of Bacillus pumilus isolated from Lonar Crator Lake (19° 58' N; 76° 31' E), India. Bacillus pumilus 16S rRNA gene sequences were used to generate CGR, FCGR and PCA. These can be used for visual comparison and evaluation respectively. The hyperlinked QR codes, CGR, FCGR and PCA of all the isolates are made available to the users on a portal https://sites.google.com/site/bhagwanrekadwad/. This generated digital data helps to evaluate and compare any Bacillus pumilus strain, minimizes laboratory efforts and avoid misinterpretation of the species.

  2. GENCODE: the reference human genome annotation for The ENCODE Project.

    PubMed

    Harrow, Jennifer; Frankish, Adam; Gonzalez, Jose M; Tapanari, Electra; Diekhans, Mark; Kokocinski, Felix; Aken, Bronwen L; Barrell, Daniel; Zadissa, Amonida; Searle, Stephen; Barnes, If; Bignell, Alexandra; Boychenko, Veronika; Hunt, Toby; Kay, Mike; Mukherjee, Gaurab; Rajan, Jeena; Despacio-Reyes, Gloria; Saunders, Gary; Steward, Charles; Harte, Rachel; Lin, Michael; Howald, Cédric; Tanzer, Andrea; Derrien, Thomas; Chrast, Jacqueline; Walters, Nathalie; Balasubramanian, Suganthi; Pei, Baikang; Tress, Michael; Rodriguez, Jose Manuel; Ezkurdia, Iakes; van Baren, Jeltje; Brent, Michael; Haussler, David; Kellis, Manolis; Valencia, Alfonso; Reymond, Alexandre; Gerstein, Mark; Guigó, Roderic; Hubbard, Tim J

    2012-09-01

    The GENCODE Consortium aims to identify all gene features in the human genome using a combination of computational analysis, manual annotation, and experimental validation. Since the first public release of this annotation data set, few new protein-coding loci have been added, yet the number of alternative splicing transcripts annotated has steadily increased. The GENCODE 7 release contains 20,687 protein-coding and 9640 long noncoding RNA loci and has 33,977 coding transcripts not represented in UCSC genes and RefSeq. It also has the most comprehensive annotation of long noncoding RNA (lncRNA) loci publicly available with the predominant transcript form consisting of two exons. We have examined the completeness of the transcript annotation and found that 35% of transcriptional start sites are supported by CAGE clusters and 62% of protein-coding genes have annotated polyA sites. Over one-third of GENCODE protein-coding genes are supported by peptide hits derived from mass spectrometry spectra submitted to Peptide Atlas. New models derived from the Illumina Body Map 2.0 RNA-seq data identify 3689 new loci not currently in GENCODE, of which 3127 consist of two exon models indicating that they are possibly unannotated long noncoding loci. GENCODE 7 is publicly available from gencodegenes.org and via the Ensembl and UCSC Genome Browsers.

  3. Upregulation of long non-coding RNA M26317 correlates with tumor progression and poor prognosis in gastric cancer.

    PubMed

    Li, Li; Wang, Yuan-Yu; Mou, Xiao Zhou; Ye, Zai-Yuan; Zhao, Zhong-Sheng

    2018-04-23

    To investigate the expression and clinical significance of long non-coding RNA (lnc RNA) in gastric cancer, we applied microarray analysis to obtain expression profiles of protein coding genes and lncRNAs in tumor and paired adjacent non-tumor tissues. We found that 41 lncRNAs were upregulated and 31 lncRNAs were downregulated more than 2-fold in gastric cancer versus noncancerous tissues (ratio>2.0, P<.01). We established a co-expression network of the differentially expressed lncRNAs and targeted coding genes that included 17 lncRNAs and 16 coding genes. As the results of microarray analysis showed that lncRNA M26317 was upregulated in gastric cancer tissues we examined the expression level of M26317 in 103 gastric cancer tissues by RT-PCR and 436 gastric cancer tissues by in situ hybridization. Our data confirmed that M26317 was upregulated in gastric cancer tissues. Moreover, expression of M26317 correlated with patient age, size of tumor, Lauren's classification, depth of invasion, lymph node and distant metastasis, TNM stage and poor prognosis (P<.05), but was not associated with gender, location of tumor, and differentiation (P>.05). M26317 may have an important role in malignant transformation and metastasis of gastric cancer. Copyright © 2018. Published by Elsevier Inc.

  4. Genome-wide piRNA profiles of virus transmitting whitefly Bemisia tabaci during feeding on TYLCV-infected tomato

    USDA-ARS?s Scientific Manuscript database

    Small RNAs (sRNAs) are 20-31 nucleotide (nt) non-coding regulatory elements commonly found in plants and animals, which are classified as short interfering RNA (siRNA), microRNA (miRNA) and Piwi-interacting RNA (piRNA). The whitefly Bemisia tabaci MEAM1 is a vector capable of transmitting many devas...

  5. Association between long non-coding RNA polymorphisms and cancer risk: a meta-analysis.

    PubMed

    Huang, Xin; Zhang, Weiyue; Shao, Zengwu

    2018-05-25

    Several studies have suggested that long non-coding RNA (lncRNA) gene polymorphisms are associated with cancer risk. In the present study, we conducted a meta-analysis related to studies on the association between lncRNA single-nucleotide polymorphisms (SNPs) and the overall risk of cancer. A total 12 SNPs in five common lncRNA genes were finally included in the meta-analysis. In the lncRNA antisense noncoding RNA in the INK4 locus (ANRIL), the rs1333048 A/C, rs4977574 A/G, and rs10757278 A/G polymorphisms, but not rs1333045 C/T, were correlated with overall cancer risk. Our study also demonstrated that other SNPs were correlated with overall cancer risk, namely, metastasis-associated lung adenocarcinoma transcript 1 (MALAT1, rs619586 A/G), HOXA distal transcript antisense RNA (HOTTIP, rs1859168 A/C) and highly up-regulated in liver cancer (HULC, rs7763881 A/C). Moreover, four prostate cancer‑associated non‑coding RNA 1 (PRNCR1, rs16901946 G/A, rs13252298 G/A, rs1016343 T/C, and rs1456315 G/A) SNPs were in association with cancer risk. No association was found between the PRNCR1 (rs7007694 C/T) SNP and the risk of cancer. In conclusion, our results suggest that several studied lncRNA SNPs are associated with overall cancer risk. Therefore, they might be potential predictive biomarkers for the risk of cancer. More studies based on larger sample sizes and more lncRNA SNPs are warranted to confirm these findings. ©2018 The Author(s).

  6. Genome-wide identification and analysis of A-to-I RNA editing events in bovine by transcriptome sequencing

    PubMed Central

    Salehi, Abdolreza; Rivera, Rocío Melissa

    2018-01-01

    RNA editing increases the diversity of the transcriptome and proteome. Adenosine-to-inosine (A-to-I) editing is the predominant type of RNA editing in mammals and it is catalyzed by the adenosine deaminases acting on RNA (ADARs) family. Here, we used a largescale computational analysis of transcriptomic data from brain, heart, colon, lung, spleen, kidney, testes, skeletal muscle and liver, from three adult animals in order to identify RNA editing sites in bovine. We developed a computational pipeline and used a rigorous strategy to identify novel editing sites from RNA-Seq data in the absence of corresponding DNA sequence information. Our methods take into account sequencing errors, mapping bias, as well as biological replication to reduce the probability of obtaining a false-positive result. We conducted a detailed characterization of sequence and structural features related to novel candidate sites and found 1,600 novel canonical A-to-I editing sites in the nine bovine tissues analyzed. Results show that these sites 1) occur frequently in clusters and short interspersed nuclear elements (SINE) repeats, 2) have a preference for guanines depletion/enrichment in the flanking 5′/3′ nucleotide, 3) occur less often in coding sequences than other regions of the genome, and 4) have low evolutionary conservation. Further, we found that a positive correlation exists between expression of ADAR family members and tissue-specific RNA editing. Most of the genes with predicted A-to-I editing in each tissue were significantly enriched in biological terms relevant to the function of the corresponding tissue. Lastly, the results highlight the importance of the RNA editome in nervous system regulation. The present study extends the list of RNA editing sites in bovine and provides pipelines that may be used to investigate the editome in other organisms. PMID:29470549

  7. Present Scenario of Long Non-Coding RNAs in Plants

    PubMed Central

    Bhatia, Garima; Goyal, Neetu; Sharma, Shailesh; Upadhyay, Santosh Kumar; Singh, Kashmir

    2017-01-01

    Small non-coding RNAs have been extensively studied in plants over the last decade. In contrast, genome-wide identification of plant long non-coding RNAs (lncRNAs) has recently gained momentum. LncRNAs are now being recognized as important players in gene regulation, and their potent regulatory roles are being studied comprehensively in eukaryotes. LncRNAs were first reported in humans in 1992. Since then, research in animals, particularly in humans, has rapidly progressed, and a vast amount of data has been generated, collected, and organized using computational approaches. Additionally, numerous studies have been conducted to understand the roles of these long RNA species in several diseases. However, the status of lncRNA investigation in plants lags behind that in animals (especially humans). Efforts are being made in this direction using computational tools and high-throughput sequencing technologies, such as the lncRNA microarray technique, RNA-sequencing (RNA-seq), RNA capture sequencing, (RNA CaptureSeq), etc. Given the current scenario, significant amounts of data have been produced regarding plant lncRNAs, and this amount is likely to increase in the subsequent years. In this review we have documented brief information about lncRNAs and their status of research in plants, along with the plant-specific resources/databases for information retrieval on lncRNAs. PMID:29657289

  8. Mitochondrial and cytoplasmic isoleucyl-, glutamyl- and arginyl-tRNA synthetases of yeast are encoded by separate genes.

    PubMed

    Tzagoloff, A; Shtanko, A

    1995-06-01

    Three complementation groups of a pet mutant collection have been found to be composed of respiratory-deficient deficient mutants with lesions in mitochondrial protein synthesis. Recombinant plasmids capable of restoring respiration were cloned by transformation of representatives of each complementation group with a yeast genomic library. The plasmids were used to characterize the complementing genes and to institute disruption of the chromosomal copies of each gene in respiratory-proficient yeast. The sequences of the cloned genes indicate that they code for isoleucyl-, arginyl- and glutamyl-tRNA synthetases. The properties of the mutants used to obtain the genes and of strains with the disrupted genes indicate that all three aminoacyl-tRNA synthetases function exclusively in mitochondrial proteins synthesis. The ISM1 gene for mitochondrial isoleucyl-tRNA synthetase has been localized to chromosome XVI next to UME5. The MSR1 gene for the arginyl-tRNA synthetase was previously located on yeast chromosome VIII. The third gene MSE1 for the mitochondrial glutamyl-tRNA synthetase has not been localized. The identification of three new genes coding for mitochondrial-specific aminoacyl-tRNA synthetases indicates that in Saccharomyces cerevisiae at least 11 members of this protein family are encoded by genes distinct from those coding for the homologous cytoplasmic enzymes.

  9. Complete mitochondrial genome of Chuanzhong black goat in southwest of China (Capra hircus).

    PubMed

    Huang, Yong-Fu; Chen, Li-Peng; Zhao, Yong-Ju; Zhang, Hao; Na, Ri-Su; Zhao, Zhong-Quan; Zhang, Jia-Hua; Jiang, Cao-De; Ma, Yue-Hui; Sun, Ya-Wang; E, Guang-Xin

    2016-09-01

    The Chuanzhong black goat (Capra hircus) is a breed native to southwest of China. Its complete mitochondrial genome is 16,641 nt in length, consisting of 13 protein-coding genes, 22 transfer RNA (tRNA) genes, two ribosomal RNA (rRNA) genes, and a non-coding control region. As in other mammals, most mitochondrial genes are encoded on the heavy strand, except for ND6 and eight tRNA genes, which are encoded on the light strand. Its overall base composition is A: 33.5%, T: 27.3%, C: 26.1%, and G: 13.1%. The complete mitogenome of the Chinese indigenous breed of goat could provide a basic data for further phylogenetics analysis.

  10. Coding of Class I and II aminoacyl-tRNA synthetases

    PubMed Central

    Carter, Charles W.

    2018-01-01

    SUMMARY The aminoacyl-tRNA synthetases and their cognate transfer RNAs translate the universal genetic code. The twenty canonical amino acids are sufficiently diverse to create a selective advantage for dividing amino acid activation between two distinct, apparently unrelated superfamilies of synthetases, Class I amino acids being generally larger and less polar, Class II amino acids smaller and more polar. Biochemical, bioinformatic, and protein engineering experiments support the hypothesis that the two Classes descended from opposite strands of the same ancestral gene. Parallel experimental deconstructions of Class I and II synthetases reveal parallel losses in catalytic proficiency at two novel modular levels—protozymes and Urzymes—associated with the evolution of catalytic activity. Bi-directional coding supports an important unification of the proteome; affords a genetic relatedness metric—middle base-pairing frequencies in sense/antisense alignments—that probes more deeply into the evolutionary history of translation than do single multiple sequence alignments; and has facilitated the analysis of hitherto unknown coding relationships in tRNA sequences. Reconstruction of native synthetases by modular thermodynamic cycles facilitated by domain engineering emphasizes the subtlety associated with achieving high specificity, shedding new light on allosteric relationships in contemporary synthetases. Synthetase Urzyme structural biology suggests that they are catalytically active molten globules, broadening the potential manifold of polypeptide catalysts accessible to primitive genetic coding and motivating revisions of the origins of catalysis. Finally, bi-directional genetic coding of some of the oldest genes in the proteome places major limitations on the likelihood that any RNA World preceded the origins of coded proteins. PMID:28828732

  11. Non-coding RNAs and Their Roles in Stress Response in Plants.

    PubMed

    Wang, Jingjing; Meng, Xianwen; Dobrovolskaya, Oxana B; Orlov, Yuriy L; Chen, Ming

    2017-10-01

    Eukaryotic genomes encode thousands of non-coding RNAs (ncRNAs), which play crucial roles in transcriptional and post-transcriptional regulation of gene expression. Accumulating evidence indicates that ncRNAs, especially microRNAs (miRNAs) and long ncRNAs (lncRNAs), have emerged as key regulatory molecules in plant stress responses. In this review, we have summarized the current progress on the understanding of plant miRNA and lncRNA identification, characteristics, bioinformatics tools, and resources, and provided examples of mechanisms of miRNA- and lncRNA-mediated plant stress tolerance. Copyright © 2017 The Authors. Production and hosting by Elsevier B.V. All rights reserved.

  12. Genome-wide identification of long non-coding RNA genes and their association with insecticide resistance and metamorphosis in diamondback moth, Plutella xylostella.

    PubMed

    Liu, Feiling; Guo, Dianhao; Yuan, Zhuting; Chen, Chen; Xiao, Huamei

    2017-11-20

    Long non-coding RNA (lncRNA) is a class of noncoding RNA >200 bp in length that has essential roles in regulating a variety of biological processes. Here, we constructed a computational pipeline to identify lncRNA genes in the diamondback moth (Plutella xylostella), a major insect pest of cruciferous vegetables. In total, 3,324 lncRNAs corresponding to 2,475 loci were identified from 13 RNA-Seq datasets, including samples from parasitized, insecticide-resistant strains and different developmental stages. The identified P. xylostella lncRNAs had shorter transcripts and fewer exons than protein-coding genes. Seven out of nine randomly selected lncRNAs were validated by strand-specific RT-PCR. In total, 54-172 lncRNAs were specifically expressed in the insecticide resistant strains, among which one lncRNA was located adjacent to the sodium channel gene. In addition, 63-135 lncRNAs were specifically expressed in different developmental stages, among which three lncRNAs overlapped or were located adjacent to the metamorphosis-associated genes. These lncRNAs were either strongly or weakly co-expressed with their overlapping or neighboring mRNA genes. In summary, we identified thousands of lncRNAs and presented evidence that lncRNAs might have key roles in conferring insecticide resistance and regulating the metamorphosis development in P. xylostella.

  13. The gene coding for small ribosomal subunit RNA in the basidiomycete Ustilago maydis contains a group I intron.

    PubMed Central

    De Wachter, R; Neefs, J M; Goris, A; Van de Peer, Y

    1992-01-01

    The nucleotide sequence of the gene coding for small ribosomal subunit RNA in the basidiomycete Ustilago maydis was determined. It revealed the presence of a group I intron with a length of 411 nucleotides. This is the third occurrence of such an intron discovered in a small subunit rRNA gene encoded by a eukaryotic nuclear genome. The other two occurrences are in Pneumocystis carinii, a fungus of uncertain taxonomic status, and Ankistrodesmus stipitatus, a green alga. The nucleotides of the conserved core structure of 101 group I intron sequences present in different genes and genome types were aligned and their evolutionary relatedness was examined. This revealed a cluster including all group I introns hitherto found in eukaryotic nuclear genes coding for small and large subunit rRNAs. A secondary structure model was designed for the area of the Ustilago maydis small ribosomal subunit RNA precursor where the intron is situated. It shows that the internal guide sequence pairing with the intron boundaries fits between two helices of the small subunit rRNA, and that minimal rearrangement of base pairs suffices to achieve the definitive secondary structure of the 18S rRNA upon splicing. PMID:1561081

  14. BcMF11, a novel non-coding RNA gene from Brassica campestris, is required for pollen development and male fertility.

    PubMed

    Song, Jiang-Hua; Cao, Jia-Shu; Wang, Cheng-Gang

    2013-01-01

    KEY MESSAGE : BcMF11 as a non-coding RNA gene has an essential role in pollen development, and might be useful for regulating the pollen fertility of crops by antisense RNA technology. We previously identified a 828-bp full-length cDNA of BcMF11, a novel pollen-specific non-coding mRNA-like gene from Chinese cabbage (Brassica campestris L. ssp. chinensis Makino). However, little information is known about the function of BcMF11 in pollen development. To investigate its exact biological roles in pollen development, the BcMF11 cDNA was antisense inhibited in transgenic Chinese cabbage under the control of a tapetum-specific promoter BcA9 and a constitutive promoter CaMV 35S. Antisense RNA transgenic plants displayed decreasing expression of BcMF11 and showed distinct morphological defects. Pollen germination test in vitro and in vivo of the transgenic plants suggested that inhibition of BcMF11 decreased pollen germination efficiency and delayed the pollen tubes' extension in the style. Under scanning electron microscopy, many shrunken and collapsed pollen grains were detected in the antisense BcMF11 transgenic Chinese cabbage. Further cytological observation revealed abnormal pollen development process in transgenic plants, including delayed degradation of tapetum, asynchronous separation of microspore, and aborted development of pollen grain. These results suggest that BcMF11, as a non-coding RNA, plays an essential role in pollen development and male fertility.

  15. FEELnc: a tool for long non-coding RNA annotation and its application to the dog transcriptome.

    PubMed

    Wucher, Valentin; Legeai, Fabrice; Hédan, Benoît; Rizk, Guillaume; Lagoutte, Lætitia; Leeb, Tosso; Jagannathan, Vidhya; Cadieu, Edouard; David, Audrey; Lohi, Hannes; Cirera, Susanna; Fredholm, Merete; Botherel, Nadine; Leegwater, Peter A J; Le Béguec, Céline; Fieten, Hille; Johnson, Jeremy; Alföldi, Jessica; André, Catherine; Lindblad-Toh, Kerstin; Hitte, Christophe; Derrien, Thomas

    2017-05-05

    Whole transcriptome sequencing (RNA-seq) has become a standard for cataloguing and monitoring RNA populations. One of the main bottlenecks, however, is to correctly identify the different classes of RNAs among the plethora of reconstructed transcripts, particularly those that will be translated (mRNAs) from the class of long non-coding RNAs (lncRNAs). Here, we present FEELnc (FlExible Extraction of LncRNAs), an alignment-free program that accurately annotates lncRNAs based on a Random Forest model trained with general features such as multi k-mer frequencies and relaxed open reading frames. Benchmarking versus five state-of-the-art tools shows that FEELnc achieves similar or better classification performance on GENCODE and NONCODE data sets. The program also provides specific modules that enable the user to fine-tune classification accuracy, to formalize the annotation of lncRNA classes and to identify lncRNAs even in the absence of a training set of non-coding RNAs. We used FEELnc on a real data set comprising 20 canine RNA-seq samples produced by the European LUPA consortium to substantially expand the canine genome annotation to include 10 374 novel lncRNAs and 58 640 mRNA transcripts. FEELnc moves beyond conventional coding potential classifiers by providing a standardized and complete solution for annotating lncRNAs and is freely available at https://github.com/tderrien/FEELnc. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  16. Evolution of coding and non-coding genes in HOX clusters of a marsupial.

    PubMed

    Yu, Hongshi; Lindsay, James; Feng, Zhi-Ping; Frankenberg, Stephen; Hu, Yanqiu; Carone, Dawn; Shaw, Geoff; Pask, Andrew J; O'Neill, Rachel; Papenfuss, Anthony T; Renfree, Marilyn B

    2012-06-18

    The HOX gene clusters are thought to be highly conserved amongst mammals and other vertebrates, but the long non-coding RNAs have only been studied in detail in human and mouse. The sequencing of the kangaroo genome provides an opportunity to use comparative analyses to compare the HOX clusters of a mammal with a distinct body plan to those of other mammals. Here we report a comparative analysis of HOX gene clusters between an Australian marsupial of the kangaroo family and the eutherians. There was a strikingly high level of conservation of HOX gene sequence and structure and non-protein coding genes including the microRNAs miR-196a, miR-196b, miR-10a and miR-10b and the long non-coding RNAs HOTAIR, HOTAIRM1 and HOXA11AS that play critical roles in regulating gene expression and controlling development. By microRNA deep sequencing and comparative genomic analyses, two conserved microRNAs (miR-10a and miR-10b) were identified and one new candidate microRNA with typical hairpin precursor structure that is expressed in both fibroblasts and testes was found. The prediction of microRNA target analysis showed that several known microRNA targets, such as miR-10, miR-414 and miR-464, were found in the tammar HOX clusters. In addition, several novel and putative miRNAs were identified that originated from elsewhere in the tammar genome and that target the tammar HOXB and HOXD clusters. This study confirms that the emergence of known long non-coding RNAs in the HOX clusters clearly predate the marsupial-eutherian divergence 160 Ma ago. It also identified a new potentially functional microRNA as well as conserved miRNAs. These non-coding RNAs may participate in the regulation of HOX genes to influence the body plan of this marsupial.

  17. Evolution of coding and non-coding genes in HOX clusters of a marsupial

    PubMed Central

    2012-01-01

    Background The HOX gene clusters are thought to be highly conserved amongst mammals and other vertebrates, but the long non-coding RNAs have only been studied in detail in human and mouse. The sequencing of the kangaroo genome provides an opportunity to use comparative analyses to compare the HOX clusters of a mammal with a distinct body plan to those of other mammals. Results Here we report a comparative analysis of HOX gene clusters between an Australian marsupial of the kangaroo family and the eutherians. There was a strikingly high level of conservation of HOX gene sequence and structure and non-protein coding genes including the microRNAs miR-196a, miR-196b, miR-10a and miR-10b and the long non-coding RNAs HOTAIR, HOTAIRM1 and HOXA11AS that play critical roles in regulating gene expression and controlling development. By microRNA deep sequencing and comparative genomic analyses, two conserved microRNAs (miR-10a and miR-10b) were identified and one new candidate microRNA with typical hairpin precursor structure that is expressed in both fibroblasts and testes was found. The prediction of microRNA target analysis showed that several known microRNA targets, such as miR-10, miR-414 and miR-464, were found in the tammar HOX clusters. In addition, several novel and putative miRNAs were identified that originated from elsewhere in the tammar genome and that target the tammar HOXB and HOXD clusters. Conclusions This study confirms that the emergence of known long non-coding RNAs in the HOX clusters clearly predate the marsupial-eutherian divergence 160 Ma ago. It also identified a new potentially functional microRNA as well as conserved miRNAs. These non-coding RNAs may participate in the regulation of HOX genes to influence the body plan of this marsupial. PMID:22708672

  18. Identification of small ORFs in vertebrates using ribosome footprinting and evolutionary conservation

    PubMed Central

    Bazzini, Ariel A; Johnstone, Timothy G; Christiano, Romain; Mackowiak, Sebastian D; Obermayer, Benedikt; Fleming, Elizabeth S; Vejnar, Charles E; Lee, Miler T; Rajewsky, Nikolaus; Walther, Tobias C; Giraldez, Antonio J

    2014-01-01

    Identification of the coding elements in the genome is a fundamental step to understanding the building blocks of living systems. Short peptides (< 100 aa) have emerged as important regulators of development and physiology, but their identification has been limited by their size. We have leveraged the periodicity of ribosome movement on the mRNA to define actively translated ORFs by ribosome footprinting. This approach identifies several hundred translated small ORFs in zebrafish and human. Computational prediction of small ORFs from codon conservation patterns corroborates and extends these findings and identifies conserved sequences in zebrafish and human, suggesting functional peptide products (micropeptides). These results identify micropeptide-encoding genes in vertebrates, providing an entry point to define their function in vivo. PMID:24705786

  19. RNA G-quadruplexes: emerging mechanisms in disease

    PubMed Central

    Cammas, Anne

    2017-01-01

    Abstract RNA G-quadruplexes (G4s) are formed by G-rich RNA sequences in protein-coding (mRNA) and non-coding (ncRNA) transcripts that fold into a four-stranded conformation. Experimental studies and bioinformatic predictions support the view that these structures are involved in different cellular functions associated to both DNA processes (telomere elongation, recombination and transcription) and RNA post-transcriptional mechanisms (including pre-mRNA processing, mRNA turnover, targeting and translation). An increasing number of different diseases have been associated with the inappropriate regulation of RNA G4s exemplifying the potential importance of these structures on human health. Here, we review the different molecular mechanisms underlying the link between RNA G4s and human diseases by proposing several overlapping models of deregulation emerging from recent research, including (i) sequestration of RNA-binding proteins, (ii) aberrant expression or localization of RNA G4-binding proteins, (iii) repeat associated non-AUG (RAN) translation, (iv) mRNA translational blockade and (v) disabling of protein–RNA G4 complexes. This review also provides a comprehensive survey of the functional RNA G4 and their mechanisms of action. Finally, we highlight future directions for research aimed at improving our understanding on RNA G4-mediated regulatory mechanisms linked to diseases. PMID:28013268

  20. Genome-wide Discovery of Circular RNAs in the Leaf and Seedling Tissues of Arabidopsis Thaliana

    PubMed Central

    Dou, Yongchao; Li, Shengjun; Yang, Weilong; Liu, Kan; Du, Qian; Ren, Guodong; Yu, Bin; Zhang, Chi

    2017-01-01

    Background: Recently, identification and functional studies of circular RNAs, a type of non-coding RNAs arising from a ligation of 3’ and 5’ ends of a linear RNA molecule, were conducted in mammalian cells with the development of RNA-seq technology. Method: Since compared with animals, studies on circular RNAs in plants are less thorough, a genome-wide identification of circular RNA candidates in Arabidopsis was conducted with our own developed bioinformatics tool to several existing RNA-seq datasets specifically for non-coding RNAs. Results: A total of 164 circular RNA candidates were identified from RNA-seq data, and 4 circular RNA transcripts, including both exonic and intronic circular RNAs, were experimentally validated. Interestingly, our results show that circular RNA transcripts are enriched in the photosynthesis system for the leaf tissue and correlated to the higher expression levels of their parent genes. Sixteen out of all 40 genes that have circular RNA candidates are related to the photosynthesis system, and out of the total 146 exonic circular RNA candidates, 63 are found in chloroplast. PMID:29081691

  1. NEAT1 Scaffolds RNA Binding Proteins and the Microprocessor to Globally Enhance Pri-miRNA Processing

    PubMed Central

    Jiang, Li; Shao, Changwei; Wu, Qi-Jia; Chen, Geng; Zhou, Jie; Yang, Bo; Li, Hairi; Gou, Lan-Tao; Zhang, Yi; Wang, Yangming; Yeo, Gene W.; Zhou, Yu; Fu, Xiang-Dong

    2018-01-01

    Summary MicroRNA biogenesis is known to be modulated by a variety of RNA binding proteins (RBPs), but in most cases, individual RBPs appear to influence the processing of a small subset of target miRNAs. We herein report that the RNA binding NONO/PSF heterodimer binds a large number of expressed pri-miRNAs in HeLa cells to globally enhance pri-miRNA processing by the Drosha/DGCR8 Microprocessor. Because NONO/PSF are key components of paraspeckles organized by the lncRNA NEAT1, we further demonstrate that NEAT1 also has a profound effect on global pri-miRNA processing. Mechanistic dissection reveals that NEAT1 broadly interacts with NONO/PSF as well as many other RBPs, and that multiple RNA segments in NEAT1, including a “pseudo pri-miRNA” near its 3′ end, help attract the Microprocessor. These findings suggest a bird nest model for a large non-coding RNA to orchestrate efficient processing of almost an entire class of small non-coding RNAs in the nucleus. PMID:28846091

  2. Stable CoT-1 repeat RNA is abundant and associated with euchromatic interphase chromosomes

    PubMed Central

    Hall, Lisa L.; Carone, Dawn M.; Gomez, Alvin; Kolpa, Heather J.; Byron, Meg; Mehta, Nitish; Fackelmayer, Frank O.; Lawrence, Jeanne B.

    2014-01-01

    SUMMARY Recent studies recognize a vast diversity of non-coding RNAs with largely unknown functions, but few have examined interspersed repeat sequences, which constitute almost half our genome. RNA hybridization in situ using CoT-1 (highly repeated) DNA probes detects surprisingly abundant euchromatin-associated RNA comprised predominantly of repeat sequences (“CoT-1 RNA”), including LINE-1. CoT-1-hybridizing RNA strictly localizes to the interphase chromosome territory in cis, and remains stably associated with the chromosome territory following prolonged transcriptional inhibition. The CoT-1 RNA territory resists mechanical disruption and fractionates with the non-chromatin scaffold, but can be experimentally released. Loss of repeat-rich, stable nuclear RNAs from euchromatin corresponds to aberrant chromatin distribution and condensation. CoT-1 RNA has several properties similar to XIST chromosomal RNA, but is excluded from chromatin condensed by XIST. These findings impact two “black boxes” of genome science: the poorly understood diversity of non-coding RNA and the unexplained abundance of repetitive elements. PMID:24581492

  3. Cis-acting RNA elements in the Hepatitis C virus RNA genome

    PubMed Central

    Sagan, Selena M.; Chahal, Jasmin; Sarnow, Peter

    2017-01-01

    Hepatitis C virus (HCV) infection is a rapidly increasing global health problem with an estimated 170 million people infected worldwide. HCV is a hepatotropic, positive-sense RNA virus of the family Flaviviridae. As a positive-sense RNA virus, the HCV genome itself must serve as a template for translation, replication and packaging. The viral RNA must therefore be a dynamic structure that is able to readily accommodate structural changes to expose different regions of the genome to viral and cellular proteins to carry out the HCV life cycle. The ∼9600 nucleotide viral genome contains a single long open reading frame flanked by 5′ and 3′ non-coding regions that contain cis-acting RNA elements important for viral translation, replication and stability. Additional cis-acting RNA elements have also been identified in the coding sequences as well as in the 3′ end of the negative-strand replicative intermediate. Herein, we provide an overview of the importance of these cis-acting RNA elements in the HCV life cycle. PMID:25576644

  4. The small non-coding RNA response to virus infection in the Leishmania vector Lutzomyia longipalpis.

    PubMed

    Ferreira, Flávia Viana; Aguiar, Eric Roberto Guimarães Rocha; Olmo, Roenick Proveti; de Oliveira, Karla Pollyanna Vieira; Silva, Emanuele Guimarães; Sant'Anna, Maurício Roberto Viana; Gontijo, Nelder de Figueiredo; Kroon, Erna Geessien; Imler, Jean Luc; Marques, João Trindade

    2018-06-01

    Sandflies are well known vectors for Leishmania but also transmit a number of arthropod-borne viruses (arboviruses). Few studies have addressed the interaction between sandflies and arboviruses. RNA interference (RNAi) mechanisms utilize small non-coding RNAs to regulate different aspects of host-pathogen interactions. The small interfering RNA (siRNA) pathway is a broad antiviral mechanism in insects. In addition, at least in mosquitoes, another RNAi mechanism mediated by PIWI interacting RNAs (piRNAs) is activated by viral infection. Finally, endogenous microRNAs (miRNA) may also regulate host immune responses. Here, we analyzed the small non-coding RNA response to Vesicular stomatitis virus (VSV) infection in the sandfly Lutzoymia longipalpis. We detected abundant production of virus-derived siRNAs after VSV infection in adult sandflies. However, there was no production of virus-derived piRNAs and only mild changes in the expression of vector miRNAs in response to infection. We also observed abundant production of virus-derived siRNAs against two other viruses in Lutzomyia Lulo cells. Together, our results suggest that the siRNA but not the piRNA pathway mediates an antiviral response in sandflies. In agreement with this hypothesis, pre-treatment of cells with dsRNA against VSV was able to inhibit viral replication while knock-down of the central siRNA component, Argonaute-2, led to increased virus levels. Our work begins to elucidate the role of RNAi mechanisms in the interaction between L. longipalpis and viruses and should also open the way for studies with other sandfly-borne pathogens.

  5. Correction of the consequences of mitochondrial 3243A>G mutation in the MT-TL1 gene causing the MELAS syndrome by tRNA import into mitochondria

    PubMed Central

    Karicheva, Olga Z.; Kolesnikova, Olga A.; Schirtz, Tom; Vysokikh, Mikhail Y.; Mager-Heckel, Anne-Marie; Lombès, Anne; Boucheham, Abdeldjalil; Krasheninnikov, Igor A.; Martin, Robert P.; Entelis, Nina; Tarassov, Ivan

    2011-01-01

    Mutations in human mitochondrial DNA are often associated with incurable human neuromuscular diseases. Among these mutations, an important number have been identified in tRNA genes, including 29 in the gene MT-TL1 coding for the tRNALeu(UUR). The m.3243A>G mutation was described as the major cause of the MELAS syndrome (mitochondrial encephalomyopathy with lactic acidosis and stroke-like episodes). This mutation was reported to reduce tRNALeu(UUR) aminoacylation and modification of its anti-codon wobble position, which results in a defective mitochondrial protein synthesis and reduced activities of respiratory chain complexes. In the present study, we have tested whether the mitochondrial targeting of recombinant tRNAs bearing the identity elements for human mitochondrial leucyl-tRNA synthetase can rescue the phenotype caused by MELAS mutation in human transmitochondrial cybrid cells. We demonstrate that nuclear expression and mitochondrial targeting of specifically designed transgenic tRNAs results in an improvement of mitochondrial translation, increased levels of mitochondrial DNA-encoded respiratory complexes subunits, and significant rescue of respiration. These findings prove the possibility to direct tRNAs with changed aminoacylation specificities into mitochondria, thus extending the potential therapeutic strategy of allotopic expression to address mitochondrial disorders. PMID:21724600

  6. cncRNAs: Bi-functional RNAs with protein coding and non-coding functions

    PubMed Central

    Kumari, Pooja; Sampath, Karuna

    2015-01-01

    For many decades, the major function of mRNA was thought to be to provide protein-coding information embedded in the genome. The advent of high-throughput sequencing has led to the discovery of pervasive transcription of eukaryotic genomes and opened the world of RNA-mediated gene regulation. Many regulatory RNAs have been found to be incapable of protein coding and are hence termed as non-coding RNAs (ncRNAs). However, studies in recent years have shown that several previously annotated non-coding RNAs have the potential to encode proteins, and conversely, some coding RNAs have regulatory functions independent of the protein they encode. Such bi-functional RNAs, with both protein coding and non-coding functions, which we term as ‘cncRNAs’, have emerged as new players in cellular systems. Here, we describe the functions of some cncRNAs identified from bacteria to humans. Because the functions of many RNAs across genomes remains unclear, we propose that RNAs be classified as coding, non-coding or both only after careful analysis of their functions. PMID:26498036

  7. Replication of poliovirus RNA and subgenomic RNA transcripts in transfected cells.

    PubMed Central

    Collis, P S; O'Donnell, B J; Barton, D J; Rogers, J A; Flanegan, J B

    1992-01-01

    Full-length and subgenomic poliovirus RNAs were transcribed in vitro and transfected into HeLa cells to study viral RNA replication in vivo. RNAs with deletion mutations were analyzed for the ability to replicate in either the absence or the presence of helper RNA by using a cotransfection procedure and Northern (RNA) blot analysis. An advantage of this approach was that viral RNA replication and genetic complementation could be characterized without first isolating conditional-lethal mutants. A subgenomic RNA with a large in-frame deletion in the capsid coding region (P1) replicated more efficiently than full-length viral RNA transcripts. In cotransfection experiments, both the full-length and subgenomic RNAs replicated at slightly reduced levels and appeared to interfere with each other's replication. In contrast, a subgenomic RNA with a similarly sized out-of-frame deletion in P1 did not replicate in transfected cells, either alone or in the presence of helper RNA. Similar results were observed with an RNA transcript containing a large in-frame deletion spanning the P1, P2, and P3 coding regions. A mutant RNA with an in-frame deletion in the P1-2A coding sequence was self-replicating but at a significantly reduced level. The replication of this RNA was fully complemented after cotransfection with a helper RNA that provided 2A in trans. A P1-2A-2B in-frame deletion, however, totally blocked RNA replication and was not complemented. Control experiments showed that all of the expected viral proteins were both synthesized and processed when the RNA transcripts were translated in vitro. Thus, our results indicated that 2A was a trans-acting protein and that 2B and perhaps other viral proteins were cis acting during poliovirus RNA replication in vivo. Our data support a model for poliovirus RNA replication which directly links the translation of a molecule of plus-strand RNA with the formation of a replication complex for minus-strand RNA synthesis. Images PMID:1328676

  8. [Relevance of long non-coding RNAs in tumour biology].

    PubMed

    Nagy, Zoltán; Szabó, Diána Rita; Zsippai, Adrienn; Falus, András; Rácz, Károly; Igaz, Péter

    2012-09-23

    The discovery of the biological relevance of non-coding RNA molecules represents one of the most significant advances in contemporary molecular biology. It has turned out that a major fraction of the non-coding part of the genome is transcribed. Beside small RNAs (including microRNAs) more and more data are disclosed concerning long non-coding RNAs of 200 nucleotides to 100 kb length that are implicated in the regulation of several basic molecular processes (cell proliferation, chromatin functioning, microRNA-mediated effects, etc.). Some of these long non-coding RNAs have been associated with human tumours, including H19, HOTAIR, MALAT1, etc., the different expression of which has been noted in various neoplasms relative to healthy tissues. Long non-coding RNAs may represent novel markers of molecular diagnostics and they might even turn out to be targets of therapeutic intervention.

  9. Characterization of circulating transfer RNA-Derived RNA fragments in cattle

    USDA-ARS?s Scientific Manuscript database

    The objective was to characterize naturally occurring circulating transfer RNA-derived RNA Fragments (tRFs) in cattle. Serum from eight clinically normal adult dairy cows was collected, and small non-coding RNAs were extracted immediately after collection and sequenced by Illumina MiSeq. Sequences a...

  10. Draft Genome Sequence of the Deinococcus-Thermus Bacterium Meiothermus ruber Strain A

    DOE PAGES

    Thiel, Vera; Tomsho, Lynn P.; Burhans, Richard; ...

    2015-03-26

    The draft genome sequence of the Deinococcus-Thermus group bacterium Meiothermus ruber strain A, isolated from a cyanobacterial enrichment culture obtained from Octopus Spring (Yellowstone National Park, WY), comprises 2,968,099 bp in 170 contigs. It is predicted to contain 2,895 protein-coding genes, 44 tRNA-coding genes, and 2 rRNA operons.

  11. Complete mitochondrial genome sequence of the heart failure model of cardiomyopathic Syrian hamster (Mesocricetus auratus).

    PubMed

    Hu, Bo; Liu, Dong-Xing; Zhang, Yu-Qing; Song, Jian-Tao; Ji, Xian-Fei; Hou, Zhi-Qiang; Zhang, Zhen-Hai

    2016-05-01

    In this study we sequenced the complete mitochondrial genome sequencing of a heart failure model of cardiomyopathic Syrian hamster (Mesocricetus auratus) for the first time. The total length of the mitogenome was 16,267 bp. It harbored 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes and 1 non-coding control region.

  12. Small non-coding RNAs (sncRNA) regulate gene silencing and modify homeostatic status in animals faced with porcine reproductive and respiratory syndrome virus (PRRSV)

    USDA-ARS?s Scientific Manuscript database

    It has been established that reduced susceptibility to porcine reproductive and respiratory syndrome virus (PRRSV) has a genetic component. This genetic component may take the form of small non-coding RNAs (sncRNA), which are molecules that function as regulators of gene expression. Various sncRNAs ...

  13. Long non-coding RNA and Polycomb: an intricate partnership in cancer biology.

    PubMed

    Achour, Cyrinne; Aguilo, Francesca

    2018-06-01

    High-throughput analyses have revealed that the vast majority of the transcriptome does not code for proteins. These non-translated transcripts, when larger than 200 nucleotides, are termed long non-coding RNAs (lncRNAs), and play fundamental roles in diverse cellular processes. LncRNAs are subject to dynamic chemical modification, adding another layer of complexity to our understanding of the potential roles that lncRNAs play in health and disease. Many lncRNAs regulate transcriptional programs by influencing the epigenetic state through direct interactions with chromatin-modifying proteins. Among these proteins, Polycomb repressive complexes 1 and 2 (PRC1 and PRC2) have been shown to be recruited by lncRNAs to silence target genes. Aberrant expression, deficiency or mutation of both lncRNA and Polycomb have been associated with numerous human diseases, including cancer. In this review, we have highlighted recent findings regarding the concerted mechanism of action of Polycomb group proteins (PcG), acting together with some classically defined lncRNAs including X-inactive specific transcript ( XIST ), antisense non-coding RNA in the INK4 locus ( ANRIL ), metastasis associated lung adenocarcinoma transcript 1 ( MALAT1 ), and HOX transcript antisense RNA ( HOTAIR ).

  14. RNA-protein interactions in an unstructured context.

    PubMed

    Zagrovic, Bojan; Bartonek, Lukas; Polyansky, Anton A

    2018-05-31

    Despite their importance, our understanding of noncovalent RNA-protein interactions is incomplete. This especially concerns the binding between RNA and unstructured protein regions, a widespread class of such interactions. Here, we review the recent experimental and computational work on RNA-protein interactions in an unstructured context with a particular focus on how such interactions may be shaped by the intrinsic interaction affinities between individual nucleobases and protein side chains. Specifically, we articulate the claim that the universal genetic code reflects the binding specificity between nucleobases and protein side chains and that, in turn, the code may be seen as the Rosetta stone for understanding RNA-protein interactions in general. © 2018 The Authors. FEBS Letters published by John Wiley & Sons Ltd on behalf of Federation of European Biochemical Societies.

  15. Flavivirus RNAi suppression: decoding non-coding RNA.

    PubMed

    Pijlman, Gorben P

    2014-08-01

    Flaviviruses are important human pathogens that are transmitted by invertebrate vectors, mostly mosquitoes and ticks. During replication in their vector, flaviviruses are subject to a potent innate immune response known as antiviral RNA interference (RNAi). This defense mechanism is associated with the production of small interfering (si)RNA that lead to degradation of viral RNA. To what extent flaviviruses would benefit from counteracting antiviral RNAi is subject of debate. Here, the experimental evidence to suggest the existence of flavivirus RNAi suppressors is discussed. I will highlight the putative role of non-coding, subgenomic flavivirus RNA in suppression of RNAi in insect and mammalian cells. Novel insights from ongoing research will reveal how arthropod-borne viruses modulate innate immunity including antiviral RNAi. Copyright © 2014 Elsevier B.V. All rights reserved.

  16. Junk DNA and the long non-coding RNA twist in cancer genetics

    PubMed Central

    Ling, Hui; Vincent, Kimberly; Pichler, Martin; Fodde, Riccardo; Berindan-Neagoe, Ioana; Slack, Frank J.; Calin, George A

    2015-01-01

    The central dogma of molecular biology states that the flow of genetic information moves from DNA to RNA to protein. However, in the last decade this dogma has been challenged by new findings on non-coding RNAs (ncRNAs) such as microRNAs (miRNAs). More recently, long non-coding RNAs (lncRNAs) have attracted much attention due to their large number and biological significance. Many lncRNAs have been identified as mapping to regulatory elements including gene promoters and enhancers, ultraconserved regions, and intergenic regions of protein-coding genes. Yet, the biological function and molecular mechanisms of lncRNA in human diseases in general and cancer in particular remain largely unknown. Data from the literature suggest that lncRNA, often via interaction with proteins, functions in specific genomic loci or use their own transcription loci for regulatory activity. In this review, we summarize recent findings supporting the importance of DNA loci in lncRNA function, and the underlying molecular mechanisms via cis or trans regulation, and discuss their implications in cancer. In addition, we use the 8q24 genomic locus, a region containing interactive SNPs, DNA regulatory elements and lncRNAs, as an example to illustrate how single nucleotide polymorphism (SNP) located within lncRNAs may be functionally associated with the individual’s susceptibility to cancer. PMID:25619839

  17. In Silico Characterization of miRNA and Long Non-Coding RNA Interplay in Multiple Myeloma

    PubMed Central

    Ronchetti, Domenica; Manzoni, Martina; Todoerti, Katia; Neri, Antonino; Agnelli, Luca

    2016-01-01

    The identification of deregulated microRNAs (miRNAs) and long non-coding RNAs (lncRNAs) in multiple myeloma (MM) has progressively added a further level of complexity to MM biology. In addition, the cross-regulation between lncRNAs and miRNAs has begun to emerge, and theoretical and experimental studies have demonstrated the competing endogenous RNA (ceRNA) activity of lncRNAs as natural miRNA decoys in pathophysiological conditions, including cancer. Currently, information concerning lncRNA and miRNA interplay in MM is virtually absent. Herein, we investigated in silico the lncRNA and miRNA relationship in a representative datasets encompassing 95 MM and 30 plasma cell leukemia patients at diagnosis and in four normal controls, whose expression profiles were generated by a custom annotation pipeline to detect specific lncRNAs. We applied target prediction analysis based on miRanda and RNA22 algorithms to 235 lncRNAs and 459 miRNAs selected with a potential pivotal role in the pathology of MM. Among pairs that showed a significant correlation between lncRNA and miRNA expression levels, we identified 11 lncRNA–miRNA relationships suggestive of a novel ceRNA network with relevance in MM. PMID:27916857

  18. Conserved pattern of embryonic actin gene expression in several sea urchins and a sand dollar.

    PubMed

    Bushman, F D; Crain, W R

    1983-08-01

    An examination of the size and relative abundance of actin-coding RNA in embryos of four sea urchins (Strongylocentrotus purpuratus, Strongylocentrotus droebachiensis, Arbacia punctulata, Lytechinus variegatus) and one sand dollar (Echinarachnius parma) reveals a generally conserved program of expression. In each species the relative abundance of these sequences is low in early embryos and begins to rise during late cleavage or blastula stages. In the four sea urchins, actin-coding RNAs increase between approximately 9- and 35-fold by pluteus or an earlier stage, and in the sand dollar about 5.5-fold by blastula. A major actin-coding RNA class of 2.0-2.2 kilobases (kb) is found in each species. A smaller actin-coding RNA class, which accumulates during embryogenesis, is also present in S. purpuratus (1.8 kb), S. droebachiensis (1.9 kb), and A. punctulata (1.6 kb), but apparently absent in L. variegatus and E. parma. In S. droebachiensis, actin-coding RNA is relatively abundant in unfertilized eggs and drops sharply by the 16-cell stage. This is in contrast to the other sea urchins where the actin message content is relatively low in eggs and does not change substantially in the embryos throughout early cleavage. The observations in this study suggest that the pattern of embryonic expression of at least some members of this gene family is ancient and conserved.

  19. Complete mitochondrial genome of Cynopterus sphinx (Pteropodidae: Cynopterus).

    PubMed

    Li, Linmiao; Li, Min; Wu, Zhengjun; Chen, Jinping

    2015-01-01

    We have characterized the complete mitochondrial genome of Cynopterus sphinx (Pteropodidae: Cynopterus) and described its organization in this study. The total length of C. sphinx complete mitochondrial genome was 16,895 bp with the base composition of 32.54% A, 14.05% G, 25.82% T and 27.59% C. The complete mitochondrial genome included 13 protein-coding genes, 22 tRNA genes, 2 rRNA genes (12S rRNA and 16S rRNA) and 1 control region (D-loop). The control region was 1435 bp long with the sequence CATACG repeat 64 times. Three protein-coding genes (ND1, COI and ND4) were ended with incomplete stop codon TA or T.

  20. Mechanisms and consequences of alternative polyadenylation

    PubMed Central

    Di Giammartino, Dafne Campigli; Nishida, Kensei; Manley, James L.

    2011-01-01

    Summary Alternative polyadenylation (APA) is emerging as a widespread mechanism used to control gene expression. Like alternative splicing, usage of alternative poly(A) sites allows a single gene to encode multiple mRNA transcripts. In some cases, this changes the mRNA coding potential; in other cases, the code remains unchanged but the 3’UTR length is altered, influencing the fate of mRNAs in several ways, for example, by altering the availability of RNA binding protein sites and microRNA binding sites. The mechansims governing both global and gene-specific APA are only starting to be deciphered. Here we review what is known about these mechanisms and the functional consequences of alternative polyadenlyation. PMID:21925375

  1. Complete mitochondrial genome of the Tyto longimembris (Strigiformes: Tytonidae).

    PubMed

    Xu, Peng; Li, Yankuo; Miao, Lujun; Xie, Guangyong; Huang, Yan

    2016-07-01

    The complete mitochondrial genome of Tyto longimembris has been determined in this study. It is 18,466 bp in length and consists of 13 protein-coding genes, 22 transfer RNA (tRNA) genes, 2 ribosomal RNA (rRNA) genes and a non-coding control region (D-loop). The overall base composition of the heavy strand of the T. longimembris mitochondrial genome is A: 30.1%, T: 23.5%, C: 31.8% and G: 14.6%. The structure of control region should be characterized by a region containing tandem repeats as two definitely separated clusters of tandem repeats were found. This study provided an important data set for phylogenetic and taxonomic analyses of Tyto species.

  2. A-to-I RNA editing independent of ADARs in filamentous fungi

    PubMed Central

    Wang, Chenfang; Xu, Jin-Rong; Liu, Huiquan

    2016-01-01

    ABSTRACT ADAR mediated A-to-I RNA editing is thought to be unique to animals and occurs mainly in the non-coding regions. Recently filamentous fungi such as Fusarium graminearum were found to lack orthologs of animal ADARs but have stage-specific A-to-I editing during sexual reproduction. Unlike animals, majority of editing sites are in the coding regions and often result in missense and stop loss changes in fungi. Furthermore, whereas As in RNA stems are targeted by animal ADARs, RNA editing in fungi preferentially targets As in hairpin loops, implying that fungal RNA editing involves mechanisms related to editing of the anticodon loop by ADATs. Identification and characterization of fungal adenosine deaminases and their stage-specific co-factors may be helpful to understand the evolution of human ADARs. Fungi also can be used to study biological functions of missense and stop loss RNA editing events in eukaryotic organisms. PMID:27533598

  3. Informational structure of genetic sequences and nature of gene splicing

    NASA Astrophysics Data System (ADS)

    Trifonov, E. N.

    1991-10-01

    Only about 1/20 of DNA of higher organisms codes for proteins, by means of classical triplet code. The rest of DNA sequences is largely silent, with unclear functions, if any. The triplet code is not the only code (message) carried by the sequences. There are three levels of molecular communication, where the same sequence ``talks'' to various bimolecules, while having, respectively, three different appearances: DNA, RNA and protein. Since the molecular structures and, hence, sequence specific preferences of these are substantially different, the original DNA sequence has to carry simultaneously three types of sequence patterns (codes, messages), thus, being a composite structure in which one had the same letter (nucleotide) is frequently involved in several overlapping codes of different nature. This multiplicity and overlapping of the codes is a unique feature of the Gnomic, language of genetic sequences. The coexisting codes have to be degenerate in various degrees to allow an optimal and concerted performance of all the encoded functions. There is an obvious conflict between the best possible performance of a given function and necessity to compromise the quality of a given sequence pattern in favor of other patterns. It appears that the major role of various changes in the sequences on their ``ontogenetic'' way from DNA to RNA to protein, like RNA editing and splicing, or protein post-translational modifications is to resolve such conflicts. New data are presented strongly indicating that the gene splicing is such a device to resolve the conflict between the code of DNA folding in chromatin and the triplet code for protein synthesis.

  4. Characterization of the genomic organization of the region bordering the centromere of chromosome V of Podospora anserina by direct sequencing.

    PubMed

    Silar, Philippe; Barreau, Christian; Debuchy, Robert; Kicka, Sébastien; Turcq, Béatrice; Sainsard-Chanet, Annie; Sellem, Carole H; Billault, Alain; Cattolico, Laurence; Duprat, Simone; Weissenbach, Jean

    2003-08-01

    A Podospora anserina BAC library of 4800 clones has been constructed in the vector pBHYG allowing direct selection in fungi. Screening of the BAC collection for centromeric sequences of chromosome V allowed the recovery of clones localized on either sides of the centromere, but no BAC clone was found to contain the centromere. Seven BAC clones containing 322,195 and 156,244bp from either sides of the centromeric region were sequenced and annotated. One 5S rRNA gene, 5 tRNA genes, and 163 putative coding sequences (CDS) were identified. Among these, only six CDS seem specific to P. anserina. The gene density in the centromeric region is approximately one gene every 2.8kb. Extrapolation of this gene density to the whole genome of P. anserina suggests that the genome contains about 11,000 genes. Synteny analyses between P. anserina and Neurospora crassa show that co-linearity extends at the most to a few genes, suggesting rapid genome rearrangements between these two species.

  5. Drosophila Melanogaster Mitochondrial DNA: Gene Organization and Evolutionary Considerations

    PubMed Central

    Garesse, R.

    1988-01-01

    The sequence of a 8351-nucleotide mitochondrial DNA (mtDNA) fragment has been obtained extending the knowledge of the Drosophila melanogaster mitochondrial genome to 90% of its coding region. The sequence encodes seven polypeptides, 12 tRNAs and the 3' end of the 16S rRNA and CO III genes. The gene organization is strictly conserved with respect to the Drosophila yakuba mitochondrial genome, and different from that found in mammals and Xenopus. The high A + T content of D. melanogaster mitochondrial DNA is reflected in a reiterative codon usage, with more than 90% of the codons ending in T or A, G + C rich codons being practically absent. The average level of homology between the D. melanogaster and D. yakuba sequences is very high (roughly 94%), although insertion and deletions have been detected in protein, tRNA and large ribosomal genes. The analysis of nucleotide changes reveals a similar frequency for transitions and transversions, and reflects a strong bias against G+C on both strands. The predominant type of transition is strand specific. PMID:3130291

  6. Translational autocontrol of the Escherichia coli ribosomal protein S15.

    PubMed

    Portier, C; Dondon, L; Grunberg-Manago, M

    1990-01-20

    When rpsO, the gene encoding the ribosomal protein S15 in Escherichia coli, is carried by a multicopy plasmid, the mRNA synthesis rate of S15 increases with the gene dosage but the rate of synthesis of S15 does not rise. A translational fusion between S15 and beta-galactosidase was introduced on the chromosome in a delta lac strain and the expression of beta-galactosidase studied under different conditions. The presence of S15 in trans represses the beta-galactosidase level five- to sixfold, while the synthesis rate of the S15-beta-galactosidase mRNA decreases by only 30 to 50%. These data indicate that S15 is subject to autogenous translational control. Derepressed mutants were isolated and sequenced. All the point mutations map in the second codon of S15, suggesting a location for the operator site that is very near to the translation initiation codon. However, the creation of deletion mutations shows that the operator extends into the 5' non-coding part of the message, thus overlapping the ribosome loading site.

  7. Application of miRNAs as Biomarkers of Exposure and Effects in Risk Evaluation

    EPA Science Inventory

    Of the known epigenetic mechanisms, non-coding RNA and more specifically, microRNA (miRNA), offer the most immediate promise for risk assessment applications because these molecules can serve as excellent biomarkers of toxicity. The advantages of miRNA versus more classical prot...

  8. Regulatory BC1 RNA in Cognitive Control

    ERIC Educational Resources Information Center

    Iacoangeli, Anna; Dosunmu, Aderemi; Eom, Taesun; Stefanov, Dimitre G.; Tiedge, Henri

    2017-01-01

    Dendritic regulatory BC1 RNA is a non-protein-coding (npc) RNA that operates in the translational control of gene expression. The absence of BC1 RNA in BC1 knockout (KO) animals causes translational dysregulation that entails neuronal phenotypic alterations including prolonged epileptiform discharges, audiogenic seizure activity in vivo, and…

  9. Nuclear poly(A) binding protein 1 (PABPN1) and Matrin3 interact in muscle cells and regulate RNA processing.

    PubMed

    Banerjee, Ayan; Vest, Katherine E; Pavlath, Grace K; Corbett, Anita H

    2017-10-13

    The polyadenylate binding protein 1 (PABPN1) is a ubiquitously expressed RNA binding protein vital for multiple steps in RNA metabolism. Although PABPN1 plays a critical role in the regulation of RNA processing, mutation of the gene encoding this ubiquitously expressed RNA binding protein causes a specific form of muscular dystrophy termed oculopharyngeal muscular dystrophy (OPMD). Despite the tissue-specific pathology that occurs in this disease, only recently have studies of PABPN1 begun to explore the role of this protein in skeletal muscle. We have used co-immunoprecipitation and mass spectrometry to identify proteins that interact with PABPN1 in mouse skeletal muscles. Among the interacting proteins we identified Matrin 3 (MATR3) as a novel protein interactor of PABPN1. The MATR3 gene is mutated in a form of distal myopathy and amyotrophic lateral sclerosis (ALS). We demonstrate, that like PABPN1, MATR3 is critical for myogenesis. Furthermore, MATR3 controls critical aspects of RNA processing including alternative polyadenylation and intron retention. We provide evidence that MATR3 also binds and regulates the levels of long non-coding RNA (lncRNA) Neat1 and together with PABPN1 is required for normal paraspeckle function. We demonstrate that PABPN1 and MATR3 are required for paraspeckles, as well as for adenosine to inosine (A to I) RNA editing of Ctn RNA in muscle cells. We provide a functional link between PABPN1 and MATR3 through regulation of a common lncRNA target with downstream impact on paraspeckle morphology and function. We extend our analysis to a mouse model of OPMD and demonstrate altered paraspeckle morphology in the presence of endogenous levels of alanine-expanded PABPN1. In this study, we report protein-binding partners of PABPN1, which could provide insight into novel functions of PABPN1 in skeletal muscle and identify proteins that could be sequestered with alanine-expanded PABPN1 in the nuclear aggregates found in OPMD. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  10. Nuclear poly(A) binding protein 1 (PABPN1) and Matrin3 interact in muscle cells and regulate RNA processing

    PubMed Central

    Banerjee, Ayan; Vest, Katherine E.

    2017-01-01

    Abstract The polyadenylate binding protein 1 (PABPN1) is a ubiquitously expressed RNA binding protein vital for multiple steps in RNA metabolism. Although PABPN1 plays a critical role in the regulation of RNA processing, mutation of the gene encoding this ubiquitously expressed RNA binding protein causes a specific form of muscular dystrophy termed oculopharyngeal muscular dystrophy (OPMD). Despite the tissue-specific pathology that occurs in this disease, only recently have studies of PABPN1 begun to explore the role of this protein in skeletal muscle. We have used co-immunoprecipitation and mass spectrometry to identify proteins that interact with PABPN1 in mouse skeletal muscles. Among the interacting proteins we identified Matrin 3 (MATR3) as a novel protein interactor of PABPN1. The MATR3 gene is mutated in a form of distal myopathy and amyotrophic lateral sclerosis (ALS). We demonstrate, that like PABPN1, MATR3 is critical for myogenesis. Furthermore, MATR3 controls critical aspects of RNA processing including alternative polyadenylation and intron retention. We provide evidence that MATR3 also binds and regulates the levels of long non-coding RNA (lncRNA) Neat1 and together with PABPN1 is required for normal paraspeckle function. We demonstrate that PABPN1 and MATR3 are required for paraspeckles, as well as for adenosine to inosine (A to I) RNA editing of Ctn RNA in muscle cells. We provide a functional link between PABPN1 and MATR3 through regulation of a common lncRNA target with downstream impact on paraspeckle morphology and function. We extend our analysis to a mouse model of OPMD and demonstrate altered paraspeckle morphology in the presence of endogenous levels of alanine-expanded PABPN1. In this study, we report protein-binding partners of PABPN1, which could provide insight into novel functions of PABPN1 in skeletal muscle and identify proteins that could be sequestered with alanine-expanded PABPN1 in the nuclear aggregates found in OPMD. PMID:28977530

  11. Ribonucleoprotein complexes in neurologic diseases.

    PubMed

    Ule, Jernej

    2008-10-01

    Ribonucleoprotein (RNP) complexes regulate the tissue-specific RNA processing and transport that increases the coding capacity of our genome and the ability to respond quickly and precisely to the diverse set of signals. This review focuses on three proteins that are part of RNP complexes in most cells of our body: TAR DNA-binding protein (TDP-43), the survival motor neuron protein (SMN), and fragile-X mental retardation protein (FMRP). In particular, the review asks the question why these ubiquitous proteins are primarily associated with defects in specific regions of the central nervous system? To understand this question, it is important to understand the role of genetic and cellular environment in causing the defect in the protein, as well as how the defective protein leads to misregulation of specific target RNAs. Two approaches for comprehensive analysis of defective RNA-protein interactions are presented. The first approach defines the RNA code or the collection of proteins that bind to a certain cis-acting RNA site in order to lead to a predictable outcome. The second approach defines the RNA map or the summary of positions on target RNAs where binding of a particular RNA-binding protein leads to a predictable outcome. As we learn more about the RNA codes and maps that guide the action of the dynamic RNP world in our brain, possibilities for new treatments of neurologic diseases are bound to emerge.

  12. A new way to generate cytolytic tumor-specific T cells: electroporation of RNA coding for a T cell receptor into T lymphocytes.

    PubMed

    Schaft, Niels; Dörrie, Jan; Müller, Ina; Beck, Verena; Baumann, Stefanie; Schunder, Tanja; Kämpgen, Eckhart; Schuler, Gerold

    2006-09-01

    Effective T cell receptor (TCR) transfer until now required stable retroviral transduction. However, retroviral transduction poses the threat of irreversible genetic manipulation of autologous cells. We, therefore, used optimized RNA transfection for transient manipulation. The transfection efficiency, using EGFP RNA, was >90%. The electroporation of primary T cells, isolated from blood, with TCR-coding RNA resulted in functional cytotoxic T lymphocytes (CTLs) (>60% killing at an effector to target ratio of 20:1) with the same HLA-A2/gp100-specificity as the parental CTL clone. The TCR-transfected T cells specifically recognized peptide-pulsed T2 cells, or dendritic cells electroporated with gp100-coding RNA, in an IFNgamma-secretion assay and retained this ability, even after cryopreservation, over 3 days. Most importantly, we show here for the first time that the electroporated T cells also displayed cytotoxicity, and specifically lysed peptide-loaded T2 cells and HLA-A2+/gp100+ melanoma cells over a period of at least 72 h. Peptide-titration studies showed that the lytic efficiency of the RNA-transfected T cells was similar to that of retrovirally transduced T cells, and approximated that of the parental CTL clone. Functional TCR transfer by RNA electroporation is now possible without the disadvantages of retroviral transduction, and forms a new strategy for the immunotherapy of cancer.

  13. Non-coding RNA in cystic fibrosis.

    PubMed

    Glasgow, Arlene M A; De Santi, Chiara; Greene, Catherine M

    2018-05-09

    Non-coding RNAs (ncRNAs) are an abundant class of RNAs that include small ncRNAs, long non-coding RNAs (lncRNA) and pseudogenes. The human ncRNA atlas includes thousands of these specialised RNA molecules that are further subcategorised based on their size or function. Two of the more well-known and widely studied ncRNA species are microRNAs (miRNAs) and lncRNAs. These are regulatory RNAs and their altered expression has been implicated in the pathogenesis of a variety of human diseases. Failure to express a functional cystic fibrosis (CF) transmembrane receptor (CFTR) chloride ion channel in epithelial cells underpins CF. Secondary to the CFTR defect, it is known that other pathways can be altered and these may contribute to the pathophysiology of CF lung disease in particular. For example, quantitative alterations in expression of some ncRNAs are associated with CF. In recent years, there has been a series of published studies exploring ncRNA expression and function in CF. The majority have focussed principally on miRNAs, with just a handful of reports to date on lncRNAs. The present study reviews what is currently known about ncRNA expression and function in CF, and discusses the possibility of applying this knowledge to the clinical management of CF in the near future. © 2018 The Author(s). Published by Portland Press Limited on behalf of the Biochemical Society.

  14. RNA-Sequencing of Primary Retinoblastoma Tumors Provides New Insights and Challenges Into Tumor Development.

    PubMed

    Elchuri, Sailaja V; Rajasekaran, Swetha; Miles, Wayne O

    2018-01-01

    Retinoblastoma is rare tumor of the retina caused by the homozygous loss of the Retinoblastoma 1 tumor suppressor gene (RB1). Loss of the RB1 protein, pRB, results in de-regulated activity of the E2F transcription factors, chromatin changes and developmental defects leading to tumor development. Extensive microarray profiles of these tumors have enabled the identification of genes sensitive to pRB disruption, however, this technology has a number of limitations in the RNA profiles that they generate. The advent of RNA-sequencing has enabled the global profiling of all of the RNA within the cell including both coding and non-coding features and the detection of aberrant RNA processing events. In this perspective, we focus on discussing how RNA-sequencing of rare Retinoblastoma tumors will build on existing data and open up new area's to improve our understanding of the biology of these tumors. In particular, we discuss how the RB-research field may be to use this data to determine how RB1 loss results in the expression of; non-coding RNAs, causes aberrant RNA processing events and how a deeper analysis of metabolic RNA changes can be utilized to model tumor specific shifts in metabolism. Each section discusses new opportunities and challenges associated with these types of analyses and aims to provide an honest assessment of how understanding these different processes may contribute to the treatment of Retinoblastoma.

  15. Non-coding RNAs in crop genetic modification: considerations and predictable environmental risk assessments (ERA).

    PubMed

    Ramesh, S V

    2013-09-01

    Of late non-coding RNAs (ncRNAs)-mediated gene silencing is an influential tool deliberately deployed to negatively regulate the expression of targeted genes. In addition to the widely employed small interfering RNA (siRNA)-mediated gene silencing approach, other variants like artificial miRNA (amiRNA), miRNA mimics, and artificial transacting siRNAs (tasiRNAs) are being explored and successfully deployed in developing non-coding RNA-based genetically modified plants. The ncRNA-based gene manipulations are typified with mobile nature of silencing signals, interference from viral genome-derived suppressor proteins, and an obligation for meticulous computational analysis to prevaricate any inadvertent effects. In a broad sense, risk assessment inquiries for genetically modified plants based on the expression of ncRNAs are competently addressed by the environmental risk assessment (ERA) models, currently in vogue, designed for the first generation transgenic plants which are based on the expression of heterologous proteins. Nevertheless, transgenic plants functioning on the foundation of ncRNAs warrant due attention with respect to their unique attributes like off-target or non-target gene silencing effects, small RNAs (sRNAs) persistence, food and feed safety assessments, problems in detection and tracking of sRNAs in food, impact of ncRNAs in plant protection measures, effect of mutations etc. The role of recent developments in sequencing techniques like next generation sequencing (NGS) and the ERA paradigm of the different countries in vogue are also discussed in the context of ncRNA-based gene manipulations.

  16. Long noncoding RNA DANCR promotes colorectal cancer proliferation and metastasis via miR-577 sponging.

    PubMed

    Wang, Yong; Lu, Zhi; Wang, Ningnin; Feng, Jianzhou; Zhang, Junjie; Luan, Lan; Zhao, Wei; Zeng, Xiandong

    2018-05-01

    Long non-coding RNAs (lncRNAs) play key roles in various malignant tumors, including colorectal cancer (CRC). Long non-coding RNA differentiation antagonizing non-protein coding RNA (DANCR) is overexpressed in CRC patients, but whether it affects CRC proliferation and metastasis via regulation of heat shock protein 27 (HSP27) remains unclear. In the present study, we found that DANCR was highly expressed and correlated with proliferation and metastasis in CRC. In addition, we demonstrated that DANCR and HSP27 were both targets of microRNA-577 (miR-577) and shared the same binding site. Furthermore, we revealed that DANCR promoted HSP27 expression and its mediation of proliferation/metastasis via miR-577 sponging. Finally, using an in vivo study, we confirmed that overexpression of DANCR promoted CRC tumor growth and liver metastasis. The present study demonstrated the function of DANCR in CRC and might provide a new target in the treatment of CRC.

  17. Evolution and Diversity of the Human Hepatitis D Virus Genome

    PubMed Central

    Huang, Chi-Ruei; Lo, Szecheng J.

    2010-01-01

    Human hepatitis delta virus (HDV) is the smallest RNA virus in genome. HDV genome is divided into a viroid-like sequence and a protein-coding sequence which could have originated from different resources and the HDV genome was eventually constituted through RNA recombination. The genome subsequently diversified through accumulation of mutations selected by interactions between the mutated RNA and proteins with host factors to successfully form the infectious virions. Therefore, we propose that the conservation of HDV nucleotide sequence is highly related with its functionality. Genome analysis of known HDV isolates shows that the C-terminal coding sequences of large delta antigen (LDAg) are the highest diversity than other regions of protein-coding sequences but they still retain biological functionality to interact with the heavy chain of clathrin can be selected and maintained. Since viruses interact with many host factors, including escaping the host immune response, how to design a program to predict RNA genome evolution is a great challenging work. PMID:20204073

  18. Identification and role of regulatory non-coding RNAs in Listeria monocytogenes.

    PubMed

    Izar, Benjamin; Mraheil, Mobarak Abu; Hain, Torsten

    2011-01-01

    Bacterial regulatory non-coding RNAs control numerous mRNA targets that direct a plethora of biological processes, such as the adaption to environmental changes, growth and virulence. Recently developed high-throughput techniques, such as genomic tiling arrays and RNA-Seq have allowed investigating prokaryotic cis- and trans-acting regulatory RNAs, including sRNAs, asRNAs, untranslated regions (UTR) and riboswitches. As a result, we obtained a more comprehensive view on the complexity and plasticity of the prokaryotic genome biology. Listeria monocytogenes was utilized as a model system for intracellular pathogenic bacteria in several studies, which revealed the presence of about 180 regulatory RNAs in the listerial genome. A regulatory role of non-coding RNAs in survival, virulence and adaptation mechanisms of L. monocytogenes was confirmed in subsequent experiments, thus, providing insight into a multifaceted modulatory function of RNA/mRNA interference. In this review, we discuss the identification of regulatory RNAs by high-throughput techniques and in their functional role in L. monocytogenes.

  19. VaDiR: an integrated approach to Variant Detection in RNA.

    PubMed

    Neums, Lisa; Suenaga, Seiji; Beyerlein, Peter; Anders, Sara; Koestler, Devin; Mariani, Andrea; Chien, Jeremy

    2018-02-01

    Advances in next-generation DNA sequencing technologies are now enabling detailed characterization of sequence variations in cancer genomes. With whole-genome sequencing, variations in coding and non-coding sequences can be discovered. But the cost associated with it is currently limiting its general use in research. Whole-exome sequencing is used to characterize sequence variations in coding regions, but the cost associated with capture reagents and biases in capture rate limit its full use in research. Additional limitations include uncertainty in assigning the functional significance of the mutations when these mutations are observed in the non-coding region or in genes that are not expressed in cancer tissue. We investigated the feasibility of uncovering mutations from expressed genes using RNA sequencing datasets with a method called Variant Detection in RNA(VaDiR) that integrates 3 variant callers, namely: SNPiR, RVBoost, and MuTect2. The combination of all 3 methods, which we called Tier 1 variants, produced the highest precision with true positive mutations from RNA-seq that could be validated at the DNA level. We also found that the integration of Tier 1 variants with those called by MuTect2 and SNPiR produced the highest recall with acceptable precision. Finally, we observed a higher rate of mutation discovery in genes that are expressed at higher levels. Our method, VaDiR, provides a possibility of uncovering mutations from RNA sequencing datasets that could be useful in further functional analysis. In addition, our approach allows orthogonal validation of DNA-based mutation discovery by providing complementary sequence variation analysis from paired RNA/DNA sequencing datasets.

  20. Stranded Whole Transcriptome RNA-Seq for All RNA Types

    PubMed Central

    Yan, Pearlly X.; Fang, Fang; Buechlein, Aaron; Ford, James B.; Tang, Haixu; Huang, Tim H.; Burow, Matthew E.; Liu, Yunlong; Rusch, Douglas B.

    2015-01-01

    Stranded whole transcriptome RNA-Seq described in this unit captures quantitative expression data for all types of RNA including, but not limited to miRNA (microRNA), piRNA (Piwi-interacting RNA), snoRNA (small nucleolar RNA), lincRNA (large non-coding intergenic RNA), SRP RNA (signal recognition particle RNA), tRNA (transfer RNA), mtRNA (mitochondrial RNA) and mRNA (messenger RNA). The size and nature of these types of RNA are irrelevant to the approach described here. Barcoded libraries for multiplexing on the Illumina platform are generated with this approach but it can be applied to other platforms with a few modifications. PMID:25599667

  1. Analysis and recognition of 5′ UTR intron splice sites in human pre-mRNA

    PubMed Central

    Eden, E.; Brunak, S.

    2004-01-01

    Prediction of splice sites in non-coding regions of genes is one of the most challenging aspects of gene structure recognition. We perform a rigorous analysis of such splice sites embedded in human 5′ untranslated regions (UTRs), and investigate correlations between this class of splice sites and other features found in the adjacent exons and introns. By restricting the training of neural network algorithms to ‘pure’ UTRs (not extending partially into protein coding regions), we for the first time investigate the predictive power of the splicing signal proper, in contrast to conventional splice site prediction, which typically relies on the change in sequence at the transition from protein coding to non-coding. By doing so, the algorithms were able to pick up subtler splicing signals that were otherwise masked by ‘coding’ noise, thus enhancing significantly the prediction of 5′ UTR splice sites. For example, the non-coding splice site predicting networks pick up compositional and positional bias in the 3′ ends of non-coding exons and 5′ non-coding intron ends, where cytosine and guanine are over-represented. This compositional bias at the true UTR donor sites is also visible in the synaptic weights of the neural networks trained to identify UTR donor sites. Conventional splice site prediction methods perform poorly in UTRs because the reading frame pattern is absent. The NetUTR method presented here performs 2–3-fold better compared with NetGene2 and GenScan in 5′ UTRs. We also tested the 5′ UTR trained method on protein coding regions, and discovered, surprisingly, that it works quite well (although it cannot compete with NetGene2). This indicates that the local splicing pattern in UTRs and coding regions is largely the same. The NetUTR method is made publicly available at www.cbs.dtu.dk/services/NetUTR. PMID:14960723

  2. The complete mitochondrial genome of the mudsnail Cipangopaludina cathayensis (Gastropoda: Viviparidae).

    PubMed

    Yang, Huirong; Zhang, Jia-En; Luo, Hao; Luo, Mingzhu; Guo, Jing; Deng, Zhixin; Zhao, Benliang

    2016-05-01

    We present the complete mitochondrial genome of Cipangopaludina cathayensis in this study. The mitochondrial genome is 17,157 bp in length, containing 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes. All of them are encoded on the heavy strand except 7 tRNA genes on the light strand. Overall nucleotide compositions of the light strand are 44.51% of A, 26.74% of T, 20.48% of C and 8.28% of G. All the protein-coding genes start with ATG initiation codon except ATP6 with ATA and ND4 with TTG, and 2 types of termination codons are TAA (ATP6, ND2, COX1, COX2, ATP8, ND1, ND6, Cytb, COX3, ND4) and TAG (ND4L, ND5, ND3). There are 29 intergenic spacers and 5 gene overlaps. The tandem repeat sequences are observed in COX2, tRNA(Asp), ATP6, tRNA(Cys), S-rRNA, ND1, Cytb, ND4 and COX3 genes. Gene arrangement and distribution are different from the typical vertebrates. The absence of D-loop is consistent with the Gastropoda, but at least one lengthy non-coding region is essential regulatory element for the initiation of transcription and replication.

  3. Functional Interplay between Small Non-Coding RNAs and RNA Modification in the Brain.

    PubMed

    Leighton, Laura J; Bredy, Timothy W

    2018-06-07

    Small non-coding RNAs are essential for transcription, translation and gene regulation in all cell types, but are particularly important in neurons, with known roles in neurodevelopment, neuroplasticity and neurological disease. Many small non-coding RNAs are directly involved in the post-transcriptional modification of other RNA species, while others are themselves substrates for modification, or are functionally modulated by modification of their target RNAs. In this review, we explore the known and potential functions of several distinct classes of small non-coding RNAs in the mammalian brain, focusing on the newly recognised interplay between the epitranscriptome and the activity of small RNAs. We discuss the potential for this relationship to influence the spatial and temporal dynamics of gene activation in the brain, and predict that further research in the field of epitranscriptomics will identify interactions between small RNAs and RNA modifications which are essential for higher order brain functions such as learning and memory.

  4. Digital data for quick response (QR) codes of alkalophilic Bacillus pumilus to identify and to compare bacilli isolated from Lonar Crator Lake, India

    PubMed Central

    Rekadwad, Bhagwan N.; Khobragade, Chandrahasya N.

    2016-01-01

    Microbiologists are routinely engaged isolation, identification and comparison of isolated bacteria for their novelty. 16S rRNA sequences of Bacillus pumilus were retrieved from NCBI repository and generated QR codes for sequences (FASTA format and full Gene Bank information). 16SrRNA were used to generate quick response (QR) codes of Bacillus pumilus isolated from Lonar Crator Lake (19° 58′ N; 76° 31′ E), India. Bacillus pumilus 16S rRNA gene sequences were used to generate CGR, FCGR and PCA. These can be used for visual comparison and evaluation respectively. The hyperlinked QR codes, CGR, FCGR and PCA of all the isolates are made available to the users on a portal https://sites.google.com/site/bhagwanrekadwad/. This generated digital data helps to evaluate and compare any Bacillus pumilus strain, minimizes laboratory efforts and avoid misinterpretation of the species. PMID:27141529

  5. Beyond the Triplet Code: Context Cues Transform Translation.

    PubMed

    Brar, Gloria A

    2016-12-15

    The elucidation of the genetic code remains among the most influential discoveries in biology. While innumerable studies have validated the general universality of the code and its value in predicting and analyzing protein coding sequences, established and emerging work has also suggested that full genome decryption may benefit from a greater consideration of a codon's neighborhood within an mRNA than has been broadly applied. This Review examines the evidence for context cues in translation, with a focus on several recent studies that reveal broad roles for mRNA context in programming translation start sites, the rate of translation elongation, and stop codon identity. Copyright © 2016 Elsevier Inc. All rights reserved.

  6. Complete mitochondrial genome of the Yellow-spotted skate Okamejei hollandi (Rajiformes: Rajidae).

    PubMed

    Li, Weidong; Chen, Xiao; Liu, Wenai; Sun, Renjie; Zhou, Haolang

    2016-07-01

    The complete mitochondrial genome of the Yellow-spotted skate Okamejei hollandi was determined in this study. It is 16,974 bp in length and contains 13 protein-coding genes, two rRNA genes, 22 tRNA genes, and one putative control region. The overall base composition is 30.5% A, 27.8% C, 14.0% G, and 27.8% T. There are 28 bp short intergenic spaces located in 12 gene junctions and 31 bp overlaps located in nine gene junctions in the whole mitogenome. Two start codons (ATG and GTG) and two stop codons (TAG and TAA/T) were used in the protein-coding genes. The lengths of 22 tRNA genes range from 68 (tRNA-Ser2) to 75 (tRNA-Leu1) bp. The origin of L-strand replication (OL) sequence (37 bp) was identified between the tRNA-Asn and tRNA-Cys genes. The control region is 1311 bp in length with high A + T and poor G content.

  7. Gene expression profiles in promoted-growth rice seedlings that germinated from the seeds implanted by low-energy N+ beam

    PubMed Central

    Ya, Huiyuan; Chen, Qiufang; Wang, Weidong; Chen, Wanguang; Qin, Guangyong; Jiao, Zhen

    2012-01-01

    The stimulation effect that some beneficial agronomic qualities have exhibited in present-generation plants have also been observed due to ion implantation on plants. However, there is relatively little knowledge regarding the molecular mechanism of the stimulation effects of ion-beam implantation. In order to extend our current knowledge about the functional genes related to this stimulation effect, we have reported a comprehensive microarray analysis of the transcriptome features of the promoted-growth rice seedlings germinating from seeds implanted by a low-energy N+ beam. The results showed that 351 up-regulated transcripts and 470 down-regulated transcripts, including signaling proteins, kinases, plant hormones, transposable elements, transcription factors, non-coding protein RNA (including miRNA), secondary metabolites, resistance proteins, peroxidase and chromatin modification, are all involved in the stimulating effects of ion-beam implantation. The divergences of the functional catalog between the vacuum and ion implantation suggest that ion implantation is the principle cause of the ion-beam implantation biological effects, and revealed the complex molecular networks required to adapt to ion-beam implantation stress in plants, including enhanced transposition of transposable elements, promoted ABA biosynthesis and changes in chromatin modification. Our data will extend the current understanding of the molecular mechanisms and gene regulation of stimulation effects. Further research on the candidates reported in this study should provide new insights into the molecular mechanisms of biological effects induced by ion-beam implantation. PMID:22843621

  8. The central nervous system transcriptome of the weakly electric brown ghost knifefish (Apteronotus leptorhynchus): de novo assembly, annotation, and proteomics validation.

    PubMed

    Salisbury, Joseph P; Sîrbulescu, Ruxandra F; Moran, Benjamin M; Auclair, Jared R; Zupanc, Günther K H; Agar, Jeffrey N

    2015-03-11

    The brown ghost knifefish (Apteronotus leptorhynchus) is a weakly electric teleost fish of particular interest as a versatile model system for a variety of research areas in neuroscience and biology. The comprehensive information available on the neurophysiology and neuroanatomy of this organism has enabled significant advances in such areas as the study of the neural basis of behavior, the development of adult-born neurons in the central nervous system and their involvement in the regeneration of nervous tissue, as well as brain aging and senescence. Despite substantial scientific interest in this species, no genomic resources are currently available. Here, we report the de novo assembly and annotation of the A. leptorhynchus transcriptome. After evaluating several trimming and transcript reconstruction strategies, de novo assembly using Trinity uncovered 42,459 unique contigs containing at least a partial protein-coding sequence based on alignment to a reference set of known Actinopterygii sequences. As many as 11,847 of these contigs contained full or near-full length protein sequences, providing broad coverage of the proteome. A variety of non-coding RNA sequences were also identified and annotated, including conserved long intergenic non-coding RNA and other long non-coding RNA observed previously to be expressed in adult zebrafish (Danio rerio) brain, as well as a variety of miRNA, snRNA, and snoRNA. Shotgun proteomics confirmed translation of open reading frames from over 2,000 transcripts, including alternative splice variants. Assignment of tandem mass spectra was greatly improved by use of the assembly compared to databases of sequences from closely related organisms. The assembly and raw reads have been deposited at DDBJ/EMBL/GenBank under the accession number GBKR00000000. Tandem mass spectrometry data is available via ProteomeXchange with identifier PXD001285. Presented here is the first release of an annotated de novo transcriptome assembly from Apteronotus leptorhynchus, providing a broad overview of RNA expressed in central nervous system tissue. The assembly, which includes substantial coverage of a wide variety of both protein coding and non-coding transcripts, will allow the development of better tools to understand the mechanisms underlying unique characteristics of the knifefish model system, such as their tremendous regenerative capacity and negligible brain senescence.

  9. The Long Noncoding RNA Landscape of the Mouse Eye.

    PubMed

    Chen, Weiwei; Yang, Shuai; Zhou, Zhonglou; Zhao, Xiaoting; Zhong, Jiayun; Reinach, Peter S; Yan, Dongsheng

    2017-12-01

    Long noncoding RNAs (lncRNAs) are important regulators of diverse biological functions. However, an extensive in-depth analysis of their expression profile and function in mammalian eyes is still lacking. Here we describe comprehensive landscapes of stage-dependent and tissue-specific lncRNA expression in the mouse eye. Affymetrix transcriptome array profiled lncRNA signatures from six different ocular tissue subsets (i.e., cornea, lens, retina, RPE, choroid, and sclera) in newborn and 8-week-old mice. Quantitative RT-PCR analysis validated array findings. Cis analyses and Gene Ontology (GO) annotation of protein-coding genes adjacent to signature lncRNA loci clarified potential lncRNA roles in maintaining tissue identity and regulating eye maturation during the aforementioned phase. In newborn and 8-week-old mice, we identified 47,332 protein-coding and noncoding gene transcripts. LncRNAs comprise 19,313 of these transcripts annotated in public data banks. During this maturation phase of these six different tissue subsets, more than 1000 lncRNAs expression levels underwent ≥2-fold changes. qRT-PCR analysis confirmed part of the gene microarray analysis results. K-means clustering identified 910 lncRNAs in the P0 groups and 686 lncRNAs in the postnatal 8-week-old groups, suggesting distinct tissue-specific lncRNA clusters. GO analysis of protein-coding genes proximal to lncRNA signatures resolved close correlations with their tissue-specific functional maturation between P0 and 8 weeks of age in the 6 tissue subsets. Characterizating maturational changes in lncRNA expression patterns as well as tissue-specific lncRNA signatures in six ocular tissues suggest important contributions made by lncRNA to the control of developmental processes in the mouse eye.

  10. RNA-dependent RNA polymerase of hepatitis C virus binds to its coding region RNA stem-loop structure, 5BSL3.2, and its negative strand.

    PubMed

    Kanamori, Hiroshi; Yuhashi, Kazuhito; Ohnishi, Shin; Koike, Kazuhiko; Kodama, Tatsuhiko

    2010-05-01

    The hepatitis C virus NS5B RNA-dependent RNA polymerase (RdRp) is a key enzyme involved in viral replication. Interaction between NS5B RdRp and the viral RNA sequence is likely to be an important step in viral RNA replication. The C-terminal half of the NS5B-coding sequence, which contains the important cis-acting replication element, has been identified as an NS5B-binding sequence. In the present study, we confirm the specific binding of NS5B to one of the RNA stem-loop structures in the region, 5BSL3.2. In addition, we show that NS5B binds to the complementary strand of 5BSL3.2 (5BSL3.2N). The bulge structure of 5BSL3.2N was shown to be indispensable for tight binding to NS5B. In vitro RdRp activity was inhibited by 5BSL3.2N, indicating the importance of the RNA element in the polymerization by RdRp. These results suggest the involvement of the RNA stem-loop structure of the negative strand in the replication process.

  11. Reduced levels of protein recoding by A-to-I RNA editing in Alzheimer's disease

    PubMed Central

    Khermesh, Khen; D'Erchia, Anna Maria; Barak, Michal; Annese, Anita; Wachtel, Chaim; Levanon, Erez Y.; Picardi, Ernesto; Eisenberg, Eli

    2016-01-01

    Adenosine to inosine (A-to-I) RNA editing, catalyzed by the ADAR enzyme family, acts on dsRNA structures within pre-mRNA molecules. Editing of the coding part of the mRNA may lead to recoding, amino acid substitution in the resulting protein, possibly modifying its biochemical and biophysical properties. Altered RNA editing patterns have been observed in various neurological pathologies. Here, we present a comprehensive study of recoding by RNA editing in Alzheimer's disease (AD), the most common cause of irreversible dementia. We have used a targeted resequencing approach supplemented by a microfluidic-based high-throughput PCR coupled with next-generation sequencing to accurately quantify A-to-I RNA editing levels in a preselected set of target sites, mostly located within the coding sequence of synaptic genes. Overall, editing levels decreased in AD patients’ brain tissues, mainly in the hippocampus and to a lesser degree in the temporal and frontal lobes. Differential RNA editing levels were observed in 35 target sites within 22 genes. These results may shed light on a possible association between the neurodegenerative processes typical for AD and deficient RNA editing. PMID:26655226

  12. SHARAKU: an algorithm for aligning and clustering read mapping profiles of deep sequencing in non-coding RNA processing.

    PubMed

    Tsuchiya, Mariko; Amano, Kojiro; Abe, Masaya; Seki, Misato; Hase, Sumitaka; Sato, Kengo; Sakakibara, Yasubumi

    2016-06-15

    Deep sequencing of the transcripts of regulatory non-coding RNA generates footprints of post-transcriptional processes. After obtaining sequence reads, the short reads are mapped to a reference genome, and specific mapping patterns can be detected called read mapping profiles, which are distinct from random non-functional degradation patterns. These patterns reflect the maturation processes that lead to the production of shorter RNA sequences. Recent next-generation sequencing studies have revealed not only the typical maturation process of miRNAs but also the various processing mechanisms of small RNAs derived from tRNAs and snoRNAs. We developed an algorithm termed SHARAKU to align two read mapping profiles of next-generation sequencing outputs for non-coding RNAs. In contrast with previous work, SHARAKU incorporates the primary and secondary sequence structures into an alignment of read mapping profiles to allow for the detection of common processing patterns. Using a benchmark simulated dataset, SHARAKU exhibited superior performance to previous methods for correctly clustering the read mapping profiles with respect to 5'-end processing and 3'-end processing from degradation patterns and in detecting similar processing patterns in deriving the shorter RNAs. Further, using experimental data of small RNA sequencing for the common marmoset brain, SHARAKU succeeded in identifying the significant clusters of read mapping profiles for similar processing patterns of small derived RNA families expressed in the brain. The source code of our program SHARAKU is available at http://www.dna.bio.keio.ac.jp/sharaku/, and the simulated dataset used in this work is available at the same link. Accession code: The sequence data from the whole RNA transcripts in the hippocampus of the left brain used in this work is available from the DNA DataBank of Japan (DDBJ) Sequence Read Archive (DRA) under the accession number DRA004502. yasu@bio.keio.ac.jp Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.

  13. The complete mitochondrial genome of the invasive Africanized Honey Bee, Apis mellifera scutellata (Insecta: Hymenoptera: Apidae).

    PubMed

    Gibson, Joshua D; Hunt, Greg J

    2016-01-01

    The complete mitochondrial genome from an Africanized honey bee population (AHB, derived from Apis mellifera scutellata) was assembled and analyzed. The mitogenome is 16,411 bp long and contains the same gene repertoire and gene order as the European honey bee (13 protein coding genes, 22 tRNA genes and 2 rRNA genes). ND4 appears to use an alternate start codon and the long rRNA gene is 48 bp shorter in AHB due to a deletion in a terminal AT dinucleotide repeat. The dihydrouracil arm is missing from tRNA-Ser (AGN) and tRNA-Glu is missing the TV loop. The A + T content is comparable to the European honey bee (84.7%), which increases to 95% for the 3rd position in the protein coding genes.

  14. Optimizing exosomal RNA isolation for RNA-Seq analyses of archival sera specimens.

    PubMed

    Prendergast, Emily N; de Souza Fonseca, Marcos Abraão; Dezem, Felipe Segato; Lester, Jenny; Karlan, Beth Y; Noushmehr, Houtan; Lin, Xianzhi; Lawrenson, Kate

    2018-01-01

    Exosomes are endosome-derived membrane vesicles that contain proteins, lipids, and nucleic acids. The exosomal transcriptome mediates intercellular communication, and represents an understudied reservoir of novel biomarkers for human diseases. Next-generation sequencing enables complex quantitative characterization of exosomal RNAs from diverse sources. However, detailed protocols describing exosome purification for preparation of exosomal RNA-sequence (RNA-Seq) libraries are lacking. Here we compared methods for isolation of exosomes and extraction of exosomal RNA from human cell-free serum, as well as strategies for attaining equal representation of samples within pooled RNA-Seq libraries. We compared commercial precipitation with ultracentrifugation for exosome purification and confirmed the presence of exosomes via both transmission electron microscopy and immunoblotting. Exosomal RNA extraction was compared using four different RNA purification methods. We determined the minimal starting volume of serum required for exosome preparation and showed that high quality exosomal RNA can be isolated from sera stored for over a decade. Finally, RNA-Seq libraries were successfully prepared with exosomal RNAs extracted from human cell-free serum, cataloguing both coding and non-coding exosomal transcripts. This method provides researchers with strategic options to prepare RNA-Seq libraries and compare RNA-Seq data quantitatively from minimal volumes of fresh and archival human cell-free serum for disease biomarker discovery.

  15. Structural RNAs of known and unknown function identified in malaria parasites by comparative genomics and RNA analysis

    PubMed Central

    Chakrabarti, Kausik; Pearson, Michael; Grate, Leslie; Sterne-Weiler, Timothy; Deans, Jonathan; Donohue, John Paul; Ares, Manuel

    2007-01-01

    As the genomes of more eukaryotic pathogens are sequenced, understanding how molecular differences between parasite and host might be exploited to provide new therapies has become a major focus. Central to cell function are RNA-containing complexes involved in gene expression, such as the ribosome, the spliceosome, snoRNAs, RNase P, and telomerase, among others. In this article we identify by comparative genomics and validate by RNA analysis numerous previously unknown structural RNAs encoded by the Plasmodium falciparum genome, including the telomerase RNA, U3, 31 snoRNAs, as well as previously predicted spliceosomal snRNAs, SRP RNA, MRP RNA, and RNAse P RNA. Furthermore, we identify six new RNA coding genes of unknown function. To investigate the relationships of the RNA coding genes to other genomic features in related parasites, we developed a genome browser for P. falciparum (http://areslab.ucsc.edu/cgi-bin/hgGateway). Additional experiments provide evidence supporting the prediction that snoRNAs guide methylation of a specific position on U4 snRNA, as well as predicting an snRNA promoter element particular to Plasmodium sp. These findings should allow detailed structural comparisons between the RNA components of the gene expression machinery of the parasite and its vertebrate hosts. PMID:17901154

  16. MicroRNA-200c Modulates the Expression of MUC4 and MUC16 by Directly Targeting Their Coding Sequences in Human Pancreatic Cancer

    PubMed Central

    Radhakrishnan, Prakash; Mohr, Ashley M.; Grandgenett, Paul M.; Steele, Maria M.; Batra, Surinder K.; Hollingsworth, Michael A.

    2013-01-01

    Transmembrane mucins, MUC4 and MUC16 are associated with tumor progression and metastatic potential in human pancreatic adenocarcinoma. We discovered that miR-200c interacts with specific sequences within the coding sequence of MUC4 and MUC16 mRNAs, and evaluated the regulatory nature of this association. Pancreatic cancer cell lines S2.028 and T3M-4 transfected with miR-200c showed a 4.18 and 8.50 fold down regulation of MUC4 mRNA, and 4.68 and 4.82 fold down regulation of MUC16 mRNA compared to mock-transfected cells, respectively. A significant reduction of glycoprotein expression was also observed. These results indicate that miR-200c overexpression regulates MUC4 and MUC16 mucins in pancreatic cancer cells by directly targeting the mRNA coding sequence of each, resulting in reduced levels of MUC4 and MUC16 mRNA and protein. These data suggest that, in addition to regulating proteins that modulate EMT, miR-200c influences expression of cell surface mucins in pancreatic cancer. PMID:24204560

  17. MicroRNA-200c modulates the expression of MUC4 and MUC16 by directly targeting their coding sequences in human pancreatic cancer.

    PubMed

    Radhakrishnan, Prakash; Mohr, Ashley M; Grandgenett, Paul M; Steele, Maria M; Batra, Surinder K; Hollingsworth, Michael A

    2013-01-01

    Transmembrane mucins, MUC4 and MUC16 are associated with tumor progression and metastatic potential in human pancreatic adenocarcinoma. We discovered that miR-200c interacts with specific sequences within the coding sequence of MUC4 and MUC16 mRNAs, and evaluated the regulatory nature of this association. Pancreatic cancer cell lines S2.028 and T3M-4 transfected with miR-200c showed a 4.18 and 8.50 fold down regulation of MUC4 mRNA, and 4.68 and 4.82 fold down regulation of MUC16 mRNA compared to mock-transfected cells, respectively. A significant reduction of glycoprotein expression was also observed. These results indicate that miR-200c overexpression regulates MUC4 and MUC16 mucins in pancreatic cancer cells by directly targeting the mRNA coding sequence of each, resulting in reduced levels of MUC4 and MUC16 mRNA and protein. These data suggest that, in addition to regulating proteins that modulate EMT, miR-200c influences expression of cell surface mucins in pancreatic cancer.

  18. A single U/C nucleotide substitution changing alanine to valine in the beet necrotic yellow vein virus P25 protein promotes increased virus accumulation in roots of mechanically inoculated, partially resistant sugar beet seedlings.

    PubMed

    Koenig, R; Loss, S; Specht, J; Varrelmann, M; Lüddecke, P; Deml, G

    2009-03-01

    Beet necrotic yellow vein virus (BNYVV) A type isolates E12 and S8, originating from areas where resistance-breaking had or had not been observed, respectively, served as starting material for studying the influence of sequence variations in BNYVV RNA 3 on virus accumulation in partially resistant sugar beet varieties. Sub-isolates containing only RNAs 1 and 2 were obtained by serial local lesion passages; biologically active cDNA clones were prepared for RNAs 3 which differed in their coding sequences for P25 aa 67, 68 and 129. Sugar beet seedlings were mechanically inoculated with RNA 1+2/RNA 3 pseudorecombinants. The origin of RNAs 1+2 had little influence on virus accumulation in rootlets. E12 RNA 3 coding for V(67)C(68)Y(129) P25, however, enabled a much higher virus accumulation than S8 RNA 3 coding for A(67)H(68)H(129) P25. Mutants revealed that this was due only to the V(67) 'GUU' codon as opposed to the A(67) 'GCU' codon.

  19. The RNA world in the 21st century-a systems approach to finding non-coding keys to clinical questions.

    PubMed

    Schmitz, Ulf; Naderi-Meshkin, Hojjat; Gupta, Shailendra K; Wolkenhauer, Olaf; Vera, Julio

    2016-05-01

    There was evidence that RNAs are a functionally rich class of molecules not only since the arrival of the next-generation sequencing technology. Non-coding RNAs (ncRNA) could be the key to accelerated diagnosis and enhanced prediction of disease and therapy outcomes as well as the design of advanced therapeutic strategies to overcome yet unsatisfactory approaches.In this review, we discuss the state of the art in RNA systems biology with focus on the application in the systems biomedicine field. We propose guidelines for analysing the role of microRNAs and long non-coding RNAs in human pathologies. We introduce RNA expression profiling and network approaches for the identification of stable and effective RNomics-based biomarkers, providing insights into the role of ncRNAs in disease regulation. Towards this, we discuss ways to model the dynamics of gene regulatory networks and signalling pathways that involve ncRNAs. We also describe data resources and computational methods for finding putative mechanisms of action of ncRNAs. Finally, we discuss avenues for the computer-aided design of novel RNA-based therapeutics. © The Author 2015. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.

  20. A Bioinformatics-Based Alternative mRNA Splicing Code that May Explain Some Disease Mutations Is Conserved in Animals.

    PubMed

    Qu, Wen; Cingolani, Pablo; Zeeberg, Barry R; Ruden, Douglas M

    2017-01-01

    Deep sequencing of cDNAs made from spliced mRNAs indicates that most coding genes in many animals and plants have pre-mRNA transcripts that are alternatively spliced. In pre-mRNAs, in addition to invariant exons that are present in almost all mature mRNA products, there are at least 6 additional types of exons, such as exons from alternative promoters or with alternative polyA sites, mutually exclusive exons, skipped exons, or exons with alternative 5' or 3' splice sites. Our bioinformatics-based hypothesis is that, in analogy to the genetic code, there is an "alternative-splicing code" in introns and flanking exon sequences, analogous to the genetic code, that directs alternative splicing of many of the 36 types of introns. In humans, we identified 42 different consensus sequences that are each present in at least 100 human introns. 37 of the 42 top consensus sequences are significantly enriched or depleted in at least one of the 36 types of introns. We further supported our hypothesis by showing that 96 out of 96 analyzed human disease mutations that affect RNA splicing, and change alternative splicing from one class to another, can be partially explained by a mutation altering a consensus sequence from one type of intron to that of another type of intron. Some of the alternative splicing consensus sequences, and presumably their small-RNA or protein targets, are evolutionarily conserved from 50 plant to animal species. We also noticed the set of introns within a gene usually share the same splicing codes, thus arguing that one sub-type of splicesosome might process all (or most) of the introns in a given gene. Our work sheds new light on a possible mechanism for generating the tremendous diversity in protein structure by alternative splicing of pre-mRNAs.

  1. The complete mitochondrial DNA of endemic Eastern Pacific coral (Porites panamensis).

    PubMed

    Del Río-Portilla, Miguel A; Vargas-Peralta, Carmen E; Paz-García, David A; Lafarga De La Cruz, Fabiola; Balart, Eduardo F; García-de-León, Francisco J

    2016-01-01

    The mitogenome of the endemic coral Porites panamensis (Genbank accession number KJ546638) has a total length of 18,628 bp, and the arrangement consist of 13 protein-coding genes, 2 ribosomal RNA (rRNA) genes and 2 transfer RNA (tRNA) genes. Gene order was equal to other scleractinian coral mitogenomes.

  2. Non-coding effects of circular RNA CCDC66 promote colon cancer growth and metastasis

    PubMed Central

    Hsiao, Kuei-Yang; Lin, Ya-Chi; Gupta, Sachin Kumar; Chang, Ning; Yen, Laising; Sun, H. Sunny; Tsai, Shaw-Jenq

    2018-01-01

    Circular RNA (circRNA) is a class of non-coding RNA whose functions remain mostly unknown. Recent studies indicate circRNA may be involved in disease pathogenesis, but direct evidence is scarce. Here we characterize the functional role of a novel circRNA, circCCDC66, in colorectal cancer (CRC). RNA-Seq data from matched normal and tumor colon tissue samples identified numerous circRNAs specifically elevated in cancer cells, several of which were verified by quantitative RT-PCR. CircCCDC66 expression was elevated in polyps and colon cancer and was associated with poor prognosis. Gain-of-function and loss-of-function studies in CRC cell-lines demonstrated that circCCDC66 controlled multiple pathological processes, including cell proliferation, migration, invasion, and anchorage-independent growth. In-depth characterization revealed that circCCDC66 exerts its function via regulation of a subset of oncogenes, and knockdown of circCCDC66 inhibited tumor growth and cancer invasion in xenograft and orthotopic mouse models, respectively. Taken together, these findings highlight a novel oncogenic function of circRNA in cancer progression and metastasis. PMID:28249903

  3. RNA editing: trypanosomes rewrite the genetic code.

    PubMed

    Stuart, K

    1998-01-01

    The understanding of how genetic information is stored and expressed has advanced considerably since the "central dogma" asserted that genetic information flows from the nucleotide sequence of DNA to that of messenger RNA (mRNA) which in turn specifies the amino acid sequence of a protein. It was found that genetic information can be stored as RNA (e.g. in RNA viruses) and can flow from RNA to DNA by reverse transcriptase enzyme activity. In addition, some genes contain introns, nucleotide sequences that are removed from their RNA (by RNA splicing) and thus are not represented in the resultant protein. Furthermore, alternative splicing was found to produce variant proteins from a single gene. More recently, the study of trypanosome parasites revealed an unexpected and indeed counter-intuitive genetic complexity. Genetic information for a single protein can be dispersed among several (DNA) genes in these organisms. One of these genes specifies an encrypted precursor mRNA that is converted to a functional mRNA by a process called RNA editing that inserts and deletes uridylate nucleotides. The sequence of the edited mRNA is specified by multiple small RNAs, named guide RNAs, (gRNAs) each of which is encoded in a separate gene. Thus, edited mRNA sequences are assembled from multiple genes by the transfer of information from one type of RNA to another. The existence of editing was surprising but has stimulated the discovery of other types of RNA editing. The Stuart laboratory has been exploring RNA editing in trypanosomes from the time of its discovery. They found dramatic differences between the mitochondrial gene sequences and those of the corresponding mRNAs, which indicated editing by the insertion and deletion of uridylates. Some editing was modest; simply eliminating shifts in sequence register of minimally extending the protein coding sequence. However, editing of many mRNAs was startingly extensive. The RNA sequence was essentially entirely remodeled with its sequence more the result of editing than the gene sequence. The identities of genes for such extensively edited RNA were not recognizable from the DNA sequence but they were readily identifiable from the edited mRNA sequence. Thus, despite the complex and extensive editing the resultant mRNA sequence is precise. Characterization of partially edited RNAs indicated that editing proceeds in the direction opposite to that used to specify the protein which reflects the use of the gRNAs. The numerous gRNAs that are used for editing are encoded in the DNA molecules whose role was previously a mystery. Using information gained in our earlier studies, the Stuart group developed an in vitro system that reproduces the fundamental process of editing in order to resolve the mechanism by which it occurs. They determined that editing entails a series of enzymatic steps rather than the mechanism used in RNA splicing. They also showed that chimeric gRNA-mRNA molecules are aberrant by-products of editing rather than intermediates in the process as had been proposed. Additional studies are exploring precisely how the number of added and deleted uridylates is specified by the gRNA. The Stuart laboratory showed that editing is performed by an aggregation of enzymes that catalyze the separate steps of editing. It also developed a method to purify this multimolecule complex that contains several, perhaps tens of, proteins. This will allow the study of its composition and the functions of its component parts. Indeed, the gene for one component has been identified and its detailed characterization begun. These studies are developing tools to explore related processes. An early finding in the lab was that the various mRNAs are differentially edited during the life cycle of the parasite. The pattern of this editing indicates that editing serves to regulate the alternation between two modes of energy generation. This regulation is coordinated with other events that are occurring during the life c

  4. Post-transcriptional gene silencing triggered by sense transgenes involves uncapped antisense RNA and differs from silencing intentionally triggered by antisense transgenes

    PubMed Central

    Parent, Jean-Sébastien; Jauvion, Vincent; Bouché, Nicolas; Béclin, Christophe; Hachet, Mélanie; Zytnicki, Matthias; Vaucheret, Hervé

    2015-01-01

    Although post-transcriptional gene silencing (PTGS) has been studied for more than a decade, there is still a gap in our understanding of how de novo silencing is initiated against genetic elements that are not supposed to produce double-stranded (ds)RNA. Given the pervasive transcription occurring throughout eukaryote genomes, we tested the hypothesis that unintended transcription could produce antisense (as)RNA molecules that participate to the initiation of PTGS triggered by sense transgenes (S-PTGS). Our results reveal a higher level of asRNA in Arabidopsis thaliana lines that spontaneously trigger S-PTGS than in lines that do not. However, PTGS triggered by antisense transgenes (AS-PTGS) differs from S-PTGS. In particular, a hypomorphic ago1 mutation that suppresses S-PTGS prevents the degradation of asRNA but not sense RNA during AS-PTGS, suggesting a different treatment of coding and non-coding RNA by AGO1, likely because of AGO1 association to polysomes. Moreover, the intended asRNA produced during AS-PTGS is capped whereas the asRNA produced during S-PTGS derives from 3′ maturation of a read-through transcript and is uncapped. Thus, we propose that uncapped asRNA corresponds to the aberrant RNA molecule that is converted to dsRNA by RNA-DEPENDENT RNA POLYMERASE 6 in siRNA-bodies to initiate S-PTGS, whereas capped asRNA must anneal with sense RNA to produce dsRNA that initiate AS-PTGS. PMID:26209135

  5. The complete mitochondrial genome of the bagarius yarrelli from honghe river

    NASA Astrophysics Data System (ADS)

    Du, M.; Zhou, C. J.; Niu, B. Z.; Liu, Y. H.; Li, N.; Ai, J. L.; Xu, G. L.

    2016-08-01

    The total length of mitochondrial DNA sequence of the Bagarius yarrelli from the Honghe river of China is determined in this paper. The total length of the circular molecule is 16524 base pair which denoted a similar gene order to that of the other bony fishes, which include a non-coding control region, a replicated origin, two ribosome RNA (rRNA) genes, 22 transfer RNA (tRNA) genes as well as 13 protein-coding genes. Its whole base constitution is 31.4% for A, 26.9% for C, 15.7% for G and 26.0% for T, with an A+T bias of 57.4%. Those mitochondrial data would contribute to further study molecular evolution and population genetics of this species.

  6. Quantification of non-coding RNA target localization diversity and its application in cancers.

    PubMed

    Cheng, Lixin; Leung, Kwong-Sak

    2018-04-01

    Subcellular localization is pivotal for RNAs and proteins to implement biological functions. The localization diversity of protein interactions has been studied as a crucial feature of proteins, considering that the protein-protein interactions take place in various subcellular locations. Nevertheless, the localization diversity of non-coding RNA (ncRNA) target proteins has not been systematically studied, especially its characteristics in cancers. In this study, we provide a new algorithm, non-coding RNA target localization coefficient (ncTALENT), to quantify the target localization diversity of ncRNAs based on the ncRNA-protein interaction and protein subcellular localization data. ncTALENT can be used to calculate the target localization coefficient of ncRNAs and measure how diversely their targets are distributed among the subcellular locations in various scenarios. We focus our study on long non-coding RNAs (lncRNAs), and our observations reveal that the target localization diversity is a primary characteristic of lncRNAs in different biotypes. Moreover, we found that lncRNAs in multiple cancers, differentially expressed cancer lncRNAs, and lncRNAs with multiple cancer target proteins are prone to have high target localization diversity. Furthermore, the analysis of gastric cancer helps us to obtain a better understanding that the target localization diversity of lncRNAs is an important feature closely related to clinical prognosis. Overall, we systematically studied the target localization diversity of the lncRNAs and uncovered its association with cancer.

  7. Long non-coding RNA HOTAIR, a c-Myc activated driver of malignancy, negatively regulates miRNA-130a in gallbladder cancer

    PubMed Central

    2014-01-01

    Background Protein coding genes account for only about 2% of the human genome, whereas the vast majority of transcripts are non-coding RNAs including long non-coding RNAs. A growing volume of literature has proposed that lncRNAs are important players in cancer. HOTAIR was previously shown to be an oncogene and negative prognostic factor in a variety of cancers. However, the factors that contribute to its upregulation and the interaction between HOTAIR and miRNAs are largely unknown. Methods A computational screen of HOTAIR promoter was conducted to search for transcription-factor-binding sites. HOTAIR promoter activities were examined by luciferase reporter assay. The function of the c-Myc binding site in the HOTAIR promoter region was tested by a promoter assay with nucleotide substitutions in the putative E-box. The association of c-Myc with the HOTAIR promoter in vivo was confirmed by chromatin immunoprecipitation assay and Electrophoretic mobility shift assay. A search for miRNAs with complementary base paring with HOTAIR was performed utilizing online software program. Gain and loss of function approaches were employed to investigate the expression changes of HOTAIR or miRNA-130a. The expression levels of HOTAIR, c-Myc and miRNA-130a were examined in 65 matched pairs of gallbladder cancer tissues. The effects of HOTAIR and miRNA-130a on gallbladder cancer cell invasion and proliferation was tested using in vitro cell invasion and flow cytometric assays. Results We demonstrate that HOTAIR is a direct target of c-Myc through interaction with putative c-Myc target response element (RE) in the upstream region of HOTAIR in gallbladder cancer cells. A positive correlation between c-Myc and HOTAIR mRNA levels was observed in gallbladder cancer tissues. We predicted that HOTAIR harbors a miRNA-130a binding site. Our data showed that this binding site is vital for the regulation of miRNA-130a by HOTAIR. Moreover, a negative correlation between HOTAIR and miRNA-130a was observed in gallbladder cancer tissues. Finally, we demonstrate that the oncogenic activity of HOTAIR is in part through its negative regulation of miRNA-130a. Conclusion Together, these results suggest that HOTAIR is a c-Myc-activated driver of malignancy, which acts in part through repression of miRNA-130a. PMID:24953832

  8. Differential expression of small non-coding RNAs in serum from cattle challenged with viruses causing bovine respiratory disease

    USDA-ARS?s Scientific Manuscript database

    MicroRNAs and tRNA-derived RNA fragments (tRFs) are the two most abundant groups of small non-coding RNAs. The potential for microRNAs and tRFs to be used as pathogen exposure indicators is yet to be fully explored. Our objective was to identify microRNAs and tRFs in cattle challenged with a non-cy...

  9. Integrative analysis of long non-coding RNA acting as ceRNAs involved in chilling injury in tomato fruit.

    PubMed

    Wang, Yunxiang; Gao, Lipu; Zhu, Benzhong; Zhu, Hongliang; Luo, Yunbo; Wang, Qing; Zuo, Jinhua

    2018-08-15

    Long-non-coding RNA (LncRNA) is a kind of non-coding endogenous RNA that plays essential roles in diverse biological processes and various stress responses. To identify and elucidate the intricate regulatory roles of lncRNAs in chilling injury in tomato fruit, deep sequencing and bioinformatics methods were performed here. After strict screening, a total of 1411 lncRNAs were identified. Among these lncRNAs, 239 of them were significantly differentially expressed. A large amount of target genes were identified and many of them were found to code chilling stress related proteins, including redox reaction related enzyme, important enzymes about cell wall degradation, membrane lipid peroxidation related enzymes, heat and cold shock protein, energy metabolism related enzymes, salicylic acid and abscisic acid metabolism related genes. Interestingly, 41 lncRNAs were found to be the precursor of 33 miRNAs, and 186 lncRNAs were targets of 45 miRNAs. These lncRNAs targeted by miRNAs might be potential ceRNAs. Particularly, a sophisticated regulatory model including miRNAs, lncRNAs and their targets was set up. This model revealed that some miRNAs and lncRNAs may be involved in chilling injury, which provided a new perspective of lncRNAs role. Copyright © 2018 Elsevier B.V. All rights reserved.

  10. Analysis of Antisense Expression by Whole Genome Tiling Microarrays and siRNAs Suggests Mis-Annotation of Arabidopsis Orphan Protein-Coding Genes

    PubMed Central

    Richardson, Casey R.; Luo, Qing-Jun; Gontcharova, Viktoria; Jiang, Ying-Wen; Samanta, Manoj; Youn, Eunseog; Rock, Christopher D.

    2010-01-01

    Background MicroRNAs (miRNAs) and trans-acting small-interfering RNAs (tasi-RNAs) are small (20–22 nt long) RNAs (smRNAs) generated from hairpin secondary structures or antisense transcripts, respectively, that regulate gene expression by Watson-Crick pairing to a target mRNA and altering expression by mechanisms related to RNA interference. The high sequence homology of plant miRNAs to their targets has been the mainstay of miRNA prediction algorithms, which are limited in their predictive power for other kingdoms because miRNA complementarity is less conserved yet transitive processes (production of antisense smRNAs) are active in eukaryotes. We hypothesize that antisense transcription and associated smRNAs are biomarkers which can be computationally modeled for gene discovery. Principal Findings We explored rice (Oryza sativa) sense and antisense gene expression in publicly available whole genome tiling array transcriptome data and sequenced smRNA libraries (as well as C. elegans) and found evidence of transitivity of MIRNA genes similar to that found in Arabidopsis. Statistical analysis of antisense transcript abundances, presence of antisense ESTs, and association with smRNAs suggests several hundred Arabidopsis ‘orphan’ hypothetical genes are non-coding RNAs. Consistent with this hypothesis, we found novel Arabidopsis homologues of some MIRNA genes on the antisense strand of previously annotated protein-coding genes. A Support Vector Machine (SVM) was applied using thermodynamic energy of binding plus novel expression features of sense/antisense transcription topology and siRNA abundances to build a prediction model of miRNA targets. The SVM when trained on targets could predict the “ancient” (deeply conserved) class of validated Arabidopsis MIRNA genes with an accuracy of 84%, and 76% for “new” rapidly-evolving MIRNA genes. Conclusions Antisense and smRNA expression features and computational methods may identify novel MIRNA genes and other non-coding RNAs in plants and potentially other kingdoms, which can provide insight into antisense transcription, miRNA evolution, and post-transcriptional gene regulation. PMID:20520764

  11. Structure and mechanism of the T-box riboswitches

    PubMed Central

    Zhang, Jinwei

    2015-01-01

    In most Gram-positive bacteria, including many clinically devastating pathogens from genera such as Bacillus, Clostridium, Listeria and Staphylococcus, T-box riboswitches sense and regulate intracellular availability of amino acids through a multipartite mRNA-tRNA interaction. The T-box mRNA leaders respond to nutrient starvation by specifically binding cognate tRNAs and sensing whether the bound tRNA is aminoacylated, as a proxy for amino acid availability. Based on this readout, T-boxes direct a transcriptional or translational switch to control the expression of downstream genes involved in various aspects of amino acid metabolism: biosynthesis, transport, aminoacylation, transamidation, etc. Two decades after its discovery, the structural and mechanistic underpinnings of the T-box riboswitch were recently elucidated, producing a wealth of insights into how two structured RNAs can recognize each other with robust affinity and exquisite selectivity. The T-box paradigm exemplifies how natural non-coding RNAs can interact not just through sequence complementarity, but can add molecular specificity by precisely juxtaposing RNA structural motifs, exploiting inherently flexible elements and the biophysical properties of post-transcriptional modifications, ultimately achieving a high degree of shape complementarity through mutually induced fit. The T-box also provides a proof-of-principle that compact RNA domains can recognize minute chemical changes (such as tRNA aminoacylation) on another RNA. The unveiling of the structure and mechanism of the T-box system thus expands our appreciation of the range of capabilities and modes of action of structured non-coding RNAs, and hints at the existence of networks of non-coding RNAs that communicate through both, structural and sequence specificity. PMID:25959893

  12. NetMiner-an ensemble pipeline for building genome-wide and high-quality gene co-expression network using massive-scale RNA-seq samples.

    PubMed

    Yu, Hua; Jiao, Bingke; Lu, Lu; Wang, Pengfei; Chen, Shuangcheng; Liang, Chengzhi; Liu, Wei

    2018-01-01

    Accurately reconstructing gene co-expression network is of great importance for uncovering the genetic architecture underlying complex and various phenotypes. The recent availability of high-throughput RNA-seq sequencing has made genome-wide detecting and quantifying of the novel, rare and low-abundance transcripts practical. However, its potential merits in reconstructing gene co-expression network have still not been well explored. Using massive-scale RNA-seq samples, we have designed an ensemble pipeline, called NetMiner, for building genome-scale and high-quality Gene Co-expression Network (GCN) by integrating three frequently used inference algorithms. We constructed a RNA-seq-based GCN in one species of monocot rice. The quality of network obtained by our method was verified and evaluated by the curated gene functional association data sets, which obviously outperformed each single method. In addition, the powerful capability of network for associating genes with functions and agronomic traits was shown by enrichment analysis and case studies. In particular, we demonstrated the potential value of our proposed method to predict the biological roles of unknown protein-coding genes, long non-coding RNA (lncRNA) genes and circular RNA (circRNA) genes. Our results provided a valuable and highly reliable data source to select key candidate genes for subsequent experimental validation. To facilitate identification of novel genes regulating important biological processes and phenotypes in other plants or animals, we have published the source code of NetMiner, making it freely available at https://github.com/czllab/NetMiner.

  13. HYBRIDIZATION PROPERTIES OF DNA SEQUENCES DIRECTING THE SYNTHESIS OF MESSENGER RNA AND HETEROGENEOUS NUCLEAR RNA

    PubMed Central

    Greenberg, Jay R.; Perry, Robert P.

    1971-01-01

    The relationship of the DNA sequences from which polyribosomal messenger RNA (mRNA) and heterogeneous nuclear RNA (NRNA) of mouse L cells are transcribed was investigated by means of hybridization kinetics and thermal denaturation of the hybrids. Hybridization was performed in formamide solutions at DNA excess. Under these conditions most of the hybridizing mRNA and NRNA react at values of Dot (DNA concentration multiplied by time) expected for RNA transcribed from the nonrepeated or rarely repeated fraction of the genome. However, a fraction of both mRNA and NRNA hybridize at values of Dot about 10,000 times lower, and therefore must be transcribed from highly redundant DNA sequences. The fraction of NRNA hybridizing to highly repeated sequences is about 1.7 times greater than the corresponding fraction of mRNA. The hybrids formed by the rapidly reacting fractions of both NRNA and mRNA melt over a narrow temperature range with a midpoint about 11°C below that of native L cell DNA. This indicates that these hybrids consist of partially complementary sequences with approximately 11% mismatching of bases. Hybrids formed by the slowly reacting fraction of NRNA melt within 4°–6°C of native DNA, indicating very little, if any, mismatching of bases. Hybrids of the slowly reacting components of mRNA, formed under conditions of sufficiently low RNA input, have a high thermal stability, similar to that observed for hybrids of the slowly reacting NRNA component. However, when higher inputs of mRNA are used, hybrids are formed which have a strikingly lower thermal stability. This observation can be explained by assuming that there is sufficient similarity among the relatively rare DNA sequences coding for mRNA so that under hybridization conditions, in which these DNA sequences are not truly in excess, reversible hybrids exhibiting a considerable amount of mispairing are formed. The fact that a comparable phenomenon has not been observed for NRNA may mean that there is less similarity among the relatively rare DNA sequences coding for NRNA than there is among the rare sequences coding for mRNA. PMID:4999767

  14. Coupled transcription and processing of mouse ribosomal RNA in a cell-free system.

    PubMed Central

    Mishima, Y; Mitsuma, T; Ogata, K

    1985-01-01

    An in vitro processing system of mouse rRNA was achieved using an RNA polymerase I-specific transcription system, (S100) and recombinant plasmids consisting of mouse rRNA gene (rDNA) segments containing the transcription initiation and 5'-terminal region of 18S (or 41S) rRNA. Pulse-chase experiments showed that a specific processing occurred with transcripts of the plasmid DNAs when the direction of transcription was the correct orientation relative to the 18S rRNA coding sequence, but not with transcripts of the DNA templates in which this coding sequence was in the opposite orientation. From the S1 nuclease protection analyses, we concluded that there are several steps of endonucleolytic cleavage including one 105 nucleotides upstream from the 5' end of 18S rRNA. Intermediates cleaved at this site were identified in in vivo processing of rRNA. This result indicates that endonucleolytic cleavage takes place 105 nucleotides upstream from the 5' terminus of 18S rRNA prior to the formation of mature 18S rRNA. Trimming or cleavage of the 105 nucleotides may be involved in the formation of the 5' terminus of mature 18S rRNA. Images Fig. 2. Fig. 3. Fig. 4. Fig. 5. Fig. 6. PMID:3004977

  15. The mitochondrial genome of the ethanol-metabolizing, wine cellar mold Zasmidium cellare is the smallest for a filamentous ascomycete

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Goodwin, Stephen; McCorison, Cassandra B.; Cavaletto, Jessica R.

    Fungi in the class Dothideomycetes often live in extreme environments or have unusual physiology. One of these, the wine cellar mold Zasmidium cellare, produces thick curtains of mycelial growth in cellars with high humidity, and its ability to metabolize volatile organic compounds including alcohols, esters and formaldehyde is thought to improve air quality. It grows slowly but appears to outcompete ordinarily faster-growing species under anaerobic conditions.Whether these abilities have affected its mitochondrial genome is not known.To fill this gap, its mitochondrial genome was assembled as part of a whole- genome shotgun-sequencing project.The circular-mapping mitochondrial genome of Z. cellare, at onlymore » 23,743 bp, is the smallest yet reported for a filamentous fungus.It contains the complete set of 14 protein-coding genes seen typically in other filamentous fungi, along with genes for large and small ribosomal RNA subunits, 25 predicted tRNA genes capable of decoding all 20 amino acids, and a single open reading frame potentially coding for a protein of unknown function.The Z. cellare mitochondrial genome had genes encoded on both strands with a single change of direction, different from most other fungi but consistent with the Dothideomycetes. The high synteny among mitochondrial genomes of fungi in the Eurotiomycetes broke down almost completely in the Dothideomycetes.Only a low level of microsynteny was observed among protein-coding and tRNA genes in comparison with Mycosphaerella graminicola (synonym Zymoseptoria tritici), the only other fungus in the order Capnodiales with a sequenced mitochondrial genome, involving the three gene pairs atp8-atp9, nad2-nad3, and nad4L-nad5.However, even this low level of microsynteny did not extend to other fungi in the Dothideomycetes and Eurotiomycetes. Phylogenetic analysis of concatenated protein-coding genes confirmed the relationship between Z. cellare and M. graminicola in the Capnodiales, although conclusions were limited due to low sampling density.Other than its small size, the only unusual feature of the Z. cellare mitochondrial genome was two copies of a 110-bp sequence that were duplicated, inverted and separated by approximately 1 kb. This inverted-repeat sequence confused the assembly program but appears to have no functional significance.The small size of the Z. cellare mitochondrial genome was due to slightly smaller genes, lack of introns and non-essential genes, reduced intergenic spaces and very few ORFs relative to other fungi rather than a loss of essential genes. Whether this reduction facilitates its unusual biology remains unknown.« less

  16. Methylation of microRNA genes regulates gene expression in bisexual flower development in andromonoecious poplar.

    PubMed

    Song, Yuepeng; Tian, Min; Ci, Dong; Zhang, Deqiang

    2015-04-01

    Previous studies showed sex-specific DNA methylation and expression of candidate genes in bisexual flowers of andromonoecious poplar, but the regulatory relationship between methylation and microRNAs (miRNAs) remains unclear. To investigate whether the methylation of miRNA genes regulates gene expression in bisexual flower development, the methylome, microRNA, and transcriptome were examined in female and male flowers of andromonoecious poplar. 27 636 methylated coding genes and 113 methylated miRNA genes were identified. In the coding genes, 64.5% of the methylated reads mapped to the gene body region; by contrast, 60.7% of methylated reads in miRNA genes mainly mapped in the 5' and 3' flanking regions. CHH methylation showed the highest methylation levels and CHG showed the lowest methylation levels. Correlation analysis showed a significant, negative, strand-specific correlation of methylation and miRNA gene expression (r=0.79, P <0.05). The methylated miRNA genes included eight long miRNAs (lmiRNAs) of 24 nucleotides and 11 miRNAs related to flower development. miRNA172b might play an important role in the regulation of bisexual flower development-related gene expression in andromonoecious poplar, via modification of methylation. Gynomonoecious, female, and male poplars were used to validate the methylation patterns of the miRNA172b gene, implying that hyper-methylation in andromonoecious and gynomonoecious poplar might function as an important regulator in bisexual flower development. Our data provide a useful resource for the study of flower development in poplar and improve our understanding of the effect of epigenetic regulation on genes other than protein-coding genes. © The Author 2015. Published by Oxford University Press on behalf of the Society for Experimental Biology. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  17. Methylation of microRNA genes regulates gene expression in bisexual flower development in andromonoecious poplar

    PubMed Central

    Song, Yuepeng; Tian, Min; Ci, Dong; Zhang, Deqiang

    2015-01-01

    Previous studies showed sex-specific DNA methylation and expression of candidate genes in bisexual flowers of andromonoecious poplar, but the regulatory relationship between methylation and microRNAs (miRNAs) remains unclear. To investigate whether the methylation of miRNA genes regulates gene expression in bisexual flower development, the methylome, microRNA, and transcriptome were examined in female and male flowers of andromonoecious poplar. 27 636 methylated coding genes and 113 methylated miRNA genes were identified. In the coding genes, 64.5% of the methylated reads mapped to the gene body region; by contrast, 60.7% of methylated reads in miRNA genes mainly mapped in the 5′ and 3′ flanking regions. CHH methylation showed the highest methylation levels and CHG showed the lowest methylation levels. Correlation analysis showed a significant, negative, strand-specific correlation of methylation and miRNA gene expression (r=0.79, P <0.05). The methylated miRNA genes included eight long miRNAs (lmiRNAs) of 24 nucleotides and 11 miRNAs related to flower development. miRNA172b might play an important role in the regulation of bisexual flower development-related gene expression in andromonoecious poplar, via modification of methylation. Gynomonoecious, female, and male poplars were used to validate the methylation patterns of the miRNA172b gene, implying that hyper-methylation in andromonoecious and gynomonoecious poplar might function as an important regulator in bisexual flower development. Our data provide a useful resource for the study of flower development in poplar and improve our understanding of the effect of epigenetic regulation on genes other than protein-coding genes. PMID:25617468

  18. Long non-coding RNA repertoire and targeting by nuclear exosome, cytoplasmic exonuclease and RNAi in fission yeast.

    PubMed

    Atkinson, Sophie; Marguerat, Samuel; Bitton, Danny; Bachand, Francois; Rodriguez-Lopez, Maria; Rallis, Charalampos; Lemay, Jean-Francois; Cotobal, Cristina; Malecki, Michal; Smialowski, Pawel; Mata, Juan; Korber, Philipp; Bahler, Jurg

    2018-06-18

    Long non-coding RNAs (lncRNAs), which are longer than 200 nucleotides but often unstable, contribute a substantial and diverse portion to pervasive non-coding transcriptomes. Most lncRNAs are poorly annotated and understood, although several play important roles in gene regulation and diseases. Here we systematically uncover and analyse lncRNAs in Schizosaccharomyces pombe. Based on RNA-seq data from twelve RNA-processing mutants and nine physiological conditions, we identify 5775 novel lncRNAs, nearly 4-times the previously annotated lncRNAs. The expression of most lncRNAs becomes strongly induced under the genetic and physiological perturbations, most notably during late meiosis. Most lncRNAs are cryptic and suppressed by three RNA-processing pathways: the nuclear exosome, cytoplasmic exonuclease, and RNAi. Double-mutant analyses reveal substantial coordination and redundancy among these pathways. We classify lncRNAs by their dominant pathway into cryptic unstable transcripts (CUTs), Xrn1-sensitive unstable transcripts (XUTs), and Dicer-sensitive unstable transcripts (DUTs). XUTs and DUTs are enriched for antisense lncRNAs, while CUTs are often bidirectional and actively translated. The cytoplasmic exonuclease, along with RNAi, dampens the expression of thousands of lncRNAs and mRNAs that become induced during meiosis. Antisense lncRNA expression mostly negatively correlates with sense mRNA expression in the physiological, but not the genetic conditions. Intergenic and bidirectional lncRNAs emerge from nucleosome-depleted regions, upstream of positioned nucleosomes. Our results highlight both similarities and differences to lncRNA regulation in budding yeast. This broad survey of the lncRNA repertoire and characteristics in S. pombe, and the interwoven regulatory pathways that target lncRNAs, provides a rich framework for their further functional analyses. Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  19. Genome-wide identification of long non-coding RNA and mRNA profiling using RNA sequencing in subjects with sensitive skin

    PubMed Central

    Tu, Ying; Xu, Dan; Feng, Jiaqi; He, Li

    2017-01-01

    Sensitive skin (SS) is a condition of subjective cutaneous hyper-reactivity. The role of long non-coding RNAs (lncRNAs) in subjects with SS is unclear. Therefore, the aim of the present study was to provide a comprehensive profile of the mRNAs and lncRNAs in subjects with SS. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis presented the characteristics of associated protein-coding genes. In addition, a co-expression network of lncRNA and mRNA was constructed to identify potential underlying regulation targets; the results were verified by quantitative real-time PCR (qRT-PCR) and RNA-seq analyses in patients with SS and normal samples. Compared with the normal skin group, 266 novel lncRNAs and 6750 annotated lncRNAs were identified in the SS group. A total of 71 lncRNA transcripts and 2615 mRNA transcripts were differentially expressed (P < 0.05). The heat signature of the SS samples could be distinguished from the normal skin samples, whereas the majority of the genes that were present in enriched pathways were those that participated in focal adhesion, PI3K-Akt signaling, and cancer-related pathways. Five transcripts were selected for qRT-PCR analysis and the results were consistent with RNA-seq. The results suggested that LNC_000265 may play a role in the epidermal barrier structure of patient with SS. The data suggest novel genes and pathways that may be involved in the pathogenesis of SS and highlight potential targets that could be used for individualized treatment applications. PMID:29383128

  20. Identification of mRNA-like non-coding RNAs and validation of a mighty one named MAR in Panax ginseng.

    PubMed

    Wang, Meizhen; Wu, Bin; Chen, Chao; Lu, Shanfa

    2015-03-01

    Increasing evidence suggests that long non-coding RNAs (lncRNAs) play significant roles in plants. However, little is known about lncRNAs in Panax ginseng C. A. Meyer, an economically significant medicinal plant species. A total of 3,688 mRNA-like non-coding RNAs (mlncRNAs), a class of lncRNAs, were identified in P. ginseng. Approximately 40% of the identified mlncRNAs were processed into small RNAs, implying their regulatory roles via small RNA-mediated mechanisms. Eleven miRNA-generating mlncRNAs also produced siRNAs, suggesting the coordinated production of miRNAs and siRNAs in P. ginseng. The mlncRNA-derived small RNAs might be 21-, 22-, or 24-nt phased and could be generated from both or only one strand of mlncRNAs, or from super long hairpin structures. A full-length mlncRNA, termed MAR (multiple-function-associated mlncRNA), was cloned. It generated the most abundant siRNAs. The MAR siRNAs were predominantly 24-nt and some of them were distributed in a phased pattern. A total of 228 targets were predicted for 71 MAR siRNAs. Degradome sequencing validated 68 predicted targets involved in diverse metabolic pathways, suggesting the significance of MAR in P. ginseng. Consistently, MAR was detected in all tissues analyzed and responded to methyl jasmonate (MeJA) treatment. It sheds light on the function of mlncRNAs in plants. © 2014 Institute of Botany, Chinese Academy of Sciences.

  1. Mycobacterial RNA isolation optimized for non-coding RNA: high fidelity isolation of 5S rRNA from Mycobacterium bovis BCG reveals novel post-transcriptional processing and a complete spectrum of modified ribonucleosides.

    PubMed

    Hia, Fabian; Chionh, Yok Hian; Pang, Yan Ling Joy; DeMott, Michael S; McBee, Megan E; Dedon, Peter C

    2015-03-11

    A major challenge in the study of mycobacterial RNA biology is the lack of a comprehensive RNA isolation method that overcomes the unusual cell wall to faithfully yield the full spectrum of non-coding RNA (ncRNA) species. Here, we describe a simple and robust procedure optimized for the isolation of total ncRNA, including 5S, 16S and 23S ribosomal RNA (rRNA) and tRNA, from mycobacteria, using Mycobacterium bovis BCG to illustrate the method. Based on a combination of mechanical disruption and liquid and solid-phase technologies, the method produces all major species of ncRNA in high yield and with high integrity, enabling direct chemical and sequence analysis of the ncRNA species. The reproducibility of the method with BCG was evident in bioanalyzer electrophoretic analysis of isolated RNA, which revealed quantitatively significant differences in the ncRNA profiles of exponentially growing and non-replicating hypoxic bacilli. The method also overcame an historical inconsistency in 5S rRNA isolation, with direct sequencing revealing a novel post-transcriptional processing of 5S rRNA to its functional form and with chemical analysis revealing seven post-transcriptional ribonucleoside modifications in the 5S rRNA. This optimized RNA isolation procedure thus provides a means to more rigorously explore the biology of ncRNA species in mycobacteria. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  2. Long non-coding RNA reprogramming (lncRNA-ROR) regulates cell apoptosis and autophagy in chondrocytes.

    PubMed

    Yang, Zhongmeng; Tang, Yuxing; Lu, Huading; Shi, Bo; Ye, Yongheng; Xu, Guoyong; Zhao, Qing

    2018-06-12

    Long Non-Coding RNA Reprogramming (lncRNA-ROR) plays an important role in regulating various biologic processes, whereas the effect of lncRNA-ROR in osteoarthritis (OA) is little studied. This study aimed to explore lncRNA-ROR expression in articular cartilage and identify the functional mechanism of lncRNA-ROR in OA. OA cartilage tissues were obtained from 15 OA patients, and 6 normal cartilage tissues were set as controls. Chondrocytes were isolated from the collected cartilage tissues. lncRNA-ROR was knockdown in normal cells and overexpressed in OA cells. Cell viability was determined with Cell Counting Kit-8 assay, and apoptosis was measured using flow cytometric analysis. Moreover, proteins and mRNAs involved in this study were also measured using Western blotting and quantitative real-time PCR (qPCR). Level of lncRNA-ROR was decreased in OA compared with normal chondrocytes, and overexpression of lncRNA-ROR dramatically promoted cell viability of OA chondrocytes. In addition, knockdown lncRNA-ROR inhibited apoptosis and promoted autophagy of normal chondrocytes. Moreover, lncRNA-ROR inhibited the expression of p53 in both mRNA and protein levels. Furthermore, we revealed that lncRNA-ROR regulated apoptosis and autophagy of chondrocytes via HIF1α and p53. The results indicated that lncRNA-ROR played a critical role in the pathogenesis of OA, suggesting that lncRNA-ROR could serve as a new potential therapeutic target for OA. © 2018 Wiley Periodicals, Inc.

  3. A long and abundant non-coding RNA in Lactobacillus salivarius.

    PubMed

    Cousin, Fabien J; Lynch, Denise B; Chuat, Victoria; Bourin, Maxence J B; Casey, Pat G; Dalmasso, Marion; Harris, Hugh M B; McCann, Angela; O'Toole, Paul W

    2017-09-01

    Lactobacillus salivarius , found in the intestinal microbiota of humans and animals, is studied as an example of the sub-dominant intestinal commensals that may impart benefits upon their host. Strains typically harbour at least one megaplasmid that encodes functions contributing to contingency metabolism and environmental adaptation. RNA sequencing (RNA-seq)transcriptomic analysis of L. salivarius strain UCC118 identified the presence of a novel unusually abundant long non-coding RNA (lncRNA) encoded by the megaplasmid, and which represented more than 75 % of the total RNA-seq reads after depletion of rRNA species. The expression level of this 520 nt lncRNA in L. salivarius UCC118 exceeded that of the 16S rRNA, it accumulated during growth, was very stable over time and was also expressed during intestinal transit in a mouse. This lncRNA sequence is specific to the L. salivarius species; however, among 45 L . salivarius genomes analysed, not all (only 34) harboured the sequence for the lncRNA. This lncRNA was produced in 27 tested L. salivarius strains, but at strain-specific expression levels. High-level lncRNA expression correlated with high megaplasmid copy number. Transcriptome analysis of a deletion mutant lacking this lncRNA identified altered expression levels of genes in a number of pathways, but a definitive function of this new lncRNA was not identified. This lncRNA presents distinctive and unique properties, and suggests potential basic and applied scientific developments of this phenomenon.

  4. Facts and updates about cardiovascular non-coding RNAs in heart failure.

    PubMed

    Thum, Thomas

    2015-09-01

    About 11% of all deaths include heart failure as a contributing cause. The annual cost of heart failure amounts to US $34,000,000,000 in the United States alone. With the exception of heart transplantation, there is no curative therapy available. Only occasionally there are new areas in science that develop into completely new research fields. The topic on non-coding RNAs, including microRNAs, long non-coding RNAs, and circular RNAs, is such a field. In this short review, we will discuss the latest developments about non-coding RNAs in cardiovascular disease. MicroRNAs are short regulatory non-coding endogenous RNA species that are involved in virtually all cellular processes. Long non-coding RNAs also regulate gene and protein levels; however, by much more complicated and diverse mechanisms. In general, non-coding RNAs have been shown to be of great value as therapeutic targets in adverse cardiac remodelling and also as diagnostic and prognostic biomarkers for heart failure. In the future, non-coding RNA-based therapeutics are likely to enter the clinical reality offering a new treatment approach of heart failure.

  5. The Magnetic Reconnection Code: an AMR-based fully implicit simulation suite

    NASA Astrophysics Data System (ADS)

    Germaschewski, K.; Bhattacharjee, A.; Ng, C.-S.

    2006-12-01

    Extended MHD models, which incorporate two-fluid effects, are promising candidates to enhance understanding of collisionless reconnection phenomena in laboratory, space and astrophysical plasma physics. In this paper, we introduce two simulation codes in the Magnetic Reconnection Code suite which integrate reduced and full extended MHD models. Numerical integration of these models comes with two challenges: Small-scale spatial structures, e.g. thin current sheets, develop and must be well resolved by the code. Adaptive mesh refinement (AMR) is employed to provide high resolution where needed while maintaining good performance. Secondly, the two-fluid effects in extended MHD give rise to dispersive waves, which lead to a very stringent CFL condition for explicit codes, while reconnection happens on a much slower time scale. We use a fully implicit Crank--Nicholson time stepping algorithm. Since no efficient preconditioners are available for our system of equations, we instead use a direct solver to handle the inner linear solves. This requires us to actually compute the Jacobian matrix, which is handled by a code generator that calculates the derivative symbolically and then outputs code to calculate it.

  6. Long Non-Coding RNA as Potential Biomarker for Prostate Cancer: Is It Making a Difference?

    PubMed

    Deng, Junli; Tang, Jie; Wang, Guo; Zhu, Yuan-Shan

    2017-03-07

    Whole genome transcriptomic analyses have identified numerous long non-coding RNA (lncRNA) transcripts that are increasingly implicated in cancer biology. LncRNAs are found to promote essential cancer cell functions such as proliferation, invasion, and metastasis, with the potential to serve as novel biomarkers of various cancers and to further reveal uncharacterized aspects of tumor biology. However, the biological and molecular mechanisms as well as the clinical applications of lncRNAs in diverse diseases are not completely understood, and remain to be fully explored. LncRNAs may be critical players and regulators in prostate cancer carcinogenesis and progression, and could serve as potential biomarkers for prostate cancer. This review focuses on lncRNA biomarkers that are already available for clinical use and provides an overview of lncRNA biomarkers that are under investigation for clinical development in prostate cancer.

  7. The Ftx Noncoding Locus Controls X Chromosome Inactivation Independently of Its RNA Products.

    PubMed

    Furlan, Giulia; Gutierrez Hernandez, Nancy; Huret, Christophe; Galupa, Rafael; van Bemmel, Joke Gerarda; Romito, Antonio; Heard, Edith; Morey, Céline; Rougeulle, Claire

    2018-05-03

    Accumulation of the Xist long noncoding RNA (lncRNA) on one X chromosome is the trigger for X chromosome inactivation (XCI) in female mammals. Xist expression, which needs to be tightly controlled, involves a cis-acting region, the X-inactivation center (Xic), containing many lncRNA genes that evolved concomitantly to Xist from protein-coding ancestors through pseudogeneization and loss of coding potential. Here, we uncover an essential role for the Xic-linked noncoding gene Ftx in the regulation of Xist expression. We show that Ftx is required in cis to promote Xist transcriptional activation and establishment of XCI. Importantly, we demonstrate that this function depends on Ftx transcription and not on the RNA products. Our findings illustrate the multiplicity of layers operating in the establishment of XCI and highlight the diversity in the modus operandi of the noncoding players. Copyright © 2018 Elsevier Inc. All rights reserved.

  8. Fluorogenic RNA Mango aptamers for imaging small non-coding RNAs in mammalian cells.

    PubMed

    Autour, Alexis; C Y Jeng, Sunny; D Cawte, Adam; Abdolahzadeh, Amir; Galli, Angela; Panchapakesan, Shanker S S; Rueda, David; Ryckelynck, Michael; Unrau, Peter J

    2018-02-13

    Despite having many key roles in cellular biology, directly imaging biologically important RNAs has been hindered by a lack of fluorescent tools equivalent to the fluorescent proteins available to study cellular proteins. Ideal RNA labelling systems must preserve biological function, have photophysical properties similar to existing fluorescent proteins, and be compatible with established live and fixed cell protein labelling strategies. Here, we report a microfluidics-based selection of three new high-affinity RNA Mango fluorogenic aptamers. Two of these are as bright or brighter than enhanced GFP when bound to TO1-Biotin. Furthermore, we show that the new Mangos can accurately image the subcellular localization of three small non-coding RNAs (5S, U6, and a box C/D scaRNA) in fixed and live mammalian cells. These new aptamers have many potential applications to study RNA function and dynamics both in vitro and in mammalian cells.

  9. Reversible RNA adenosine methylation in biological regulation

    PubMed Central

    Jia, Guifang; Fu, Ye; He, Chuan

    2012-01-01

    N6-methyladenosine (m6A) is a ubiquitous modification in messenger RNA (mRNA) and other RNAs across most eukaryotes. For many years, however, the exact functions of m6A were not clearly understood. The discovery that the fat mass and obesity associated protein (FTO) is an m6A demethylase indicates that this modification is reversible and dynamically regulated, suggesting it has regulatory roles. In addition, it has been shown that m6A affects cell fate decisions in yeast and plant development. Recent affinity-based m6A profiling in mouse and human cells further showed that this modification is a widespread mark in coding and non-coding RNA transcripts and is likely dynamically regulated throughout developmental processes. Therefore, reversible RNA methylation, analogous to reversible DNA and histone modifications, may affect gene expression and cell fate decisions by modulating multiple RNA-related cellular pathways, which potentially provides rapid responses to various cellular and environmental signals, including energy and nutrient availability in mammals. PMID:23218460

  10. Mutations in the RNA exosome component gene EXOSC3 cause pontocerebellar hypoplasia and spinal motor neuron degeneration

    PubMed Central

    Wan, Jijun; Yourshaw, Michael; Mamsa, Hafsa; Rudnik-Schöneborn, Sabine; Menezes, Manoj P.; Hong, Ji Eun; Leong, Derek W.; Senderek, Jan; Salman, Michael S.; Chitayat, David; Seeman, Pavel; von Moers, Arpad; Graul-Neumann, Luitgard; Kornberg, Andrew J.; Castro-Gago, Manuel; Sobrido, María-Jesús; Sanefuji, Masafumi; Shieh, Perry B.; Salamon, Noriko; Kim, Ronald C.; Vinters, Harry V.; Chen, Zugen; Zerres, Klaus; Ryan, Monique M.; Nelson, Stanley F.; Jen, Joanna C.

    2012-01-01

    RNA exosomes are multi-subunit complexes conserved throughout evolution1 and emerging as the major cellular machinery for processing, surveillance, and turnover of a diverse spectrum of coding and non-coding RNA substrates essential for viability2. By exome sequencing, we discovered recessive mutations in exosome component 3 (EXOSC3) in four siblings with infantile spinal motor neuron disease, cerebellar atrophy, progressive microcephaly, and profound global developmental delay, consistent with pontocerebellar hypoplasia type 1 [PCH1; OMIM 607596]3–6. We identified mutations in EXOSC3 in an additional 8 of 12 families with PCH1. Morpholino knockdown of exosc3 in zebrafish embryos caused embryonic maldevelopment with small brain and poor motility, reminiscent of human clinical features and largely rescued by coinjected wildtype but not mutant exosc3 mRNA. These findings represent the first example of an RNA exosome gene responsible for a human disease and further implicate dysregulation of RNA processing in cerebellar and spinal motor neuron maldevelopment and degeneration. PMID:22544365

  11. Splicing factor SFRS1 recognizes a functionally diverse landscape of RNA transcripts.

    PubMed

    Sanford, Jeremy R; Wang, Xin; Mort, Matthew; Vanduyn, Natalia; Cooper, David N; Mooney, Sean D; Edenberg, Howard J; Liu, Yunlong

    2009-03-01

    Metazoan genes are encrypted with at least two superimposed codes: the genetic code to specify the primary structure of proteins and the splicing code to expand their proteomic output via alternative splicing. Here, we define the specificity of a central regulator of pre-mRNA splicing, the conserved, essential splicing factor SFRS1. Cross-linking immunoprecipitation and high-throughput sequencing (CLIP-seq) identified 23,632 binding sites for SFRS1 in the transcriptome of cultured human embryonic kidney cells. SFRS1 was found to engage many different classes of functionally distinct transcripts including mRNA, miRNA, snoRNAs, ncRNAs, and conserved intergenic transcripts of unknown function. The majority of these diverse transcripts share a purine-rich consensus motif corresponding to the canonical SFRS1 binding site. The consensus site was not only enriched in exons cross-linked to SFRS1 in vivo, but was also enriched in close proximity to splice sites. mRNAs encoding RNA processing factors were significantly overrepresented, suggesting that SFRS1 may broadly influence the post-transcriptional control of gene expression in vivo. Finally, a search for the SFRS1 consensus motif within the Human Gene Mutation Database identified 181 mutations in 82 different genes that disrupt predicted SFRS1 binding sites. This comprehensive analysis substantially expands the known roles of human SR proteins in the regulation of a diverse array of RNA transcripts.

  12. Origin of sphinx, a young chimeric RNA gene in Drosophila melanogaster

    PubMed Central

    Wang, Wen; Brunet, Frédéric G.; Nevo, Eviatar; Long, Manyuan

    2002-01-01

    Non-protein-coding RNA genes play an important role in various biological processes. How new RNA genes originated and whether this process is controlled by similar evolutionary mechanisms for the origin of protein-coding genes remains unclear. A young chimeric RNA gene that we term sphinx (spx) provides the first insight into the early stage of evolution of RNA genes. spx originated as an insertion of a retroposed sequence of the ATP synthase chain F gene at the cytological region 60DB since the divergence of Drosophila melanogaster from its sibling species 2–3 million years ago. This retrosequence, which is located at 102F on the fourth chromosome, recruited a nearby exon and intron, thereby evolving a chimeric gene structure. This molecular process suggests that the mechanism of exon shuffling, which can generate protein-coding genes, also plays a role in the origin of RNA genes. The subsequent evolutionary process of spx has been associated with a high nucleotide substitution rate, possibly driven by a continuous positive Darwinian selection for a novel function, as is shown in its sex- and development-specific alternative splicing. To test whether spx has adapted to different environments, we investigated its population genetic structure in the unique “Evolution Canyon” in Israel, revealing a similar haplotype structure in spx, and thus similar evolutionary forces operating on spx between environments. PMID:11904380

  13. Extracellular Vesicle-Associated RNA as a Carrier of Epigenetic Information

    PubMed Central

    2017-01-01

    Post-transcriptional regulation of messenger RNA (mRNA) metabolism and subcellular localization is of the utmost importance both during development and in cell differentiation. Besides carrying genetic information, mRNAs contain cis-acting signals (zip codes), usually present in their 5′- and 3′-untranslated regions (UTRs). By binding to these signals, trans-acting factors, such as RNA-binding proteins (RBPs), and/or non-coding RNAs (ncRNAs), control mRNA localization, translation and stability. RBPs can also form complexes with non-coding RNAs of different sizes. The release of extracellular vesicles (EVs) is a conserved process that allows both normal and cancer cells to horizontally transfer molecules, and hence properties, to neighboring cells. By interacting with proteins that are specifically sorted to EVs, mRNAs as well as ncRNAs can be transferred from cell to cell. In this review, we discuss the mechanisms underlying the sorting to EVs of different classes of molecules, as well as the role of extracellular RNAs and the associated proteins in altering gene expression in the recipient cells. Importantly, if, on the one hand, RBPs play a critical role in transferring RNAs through EVs, RNA itself could, on the other hand, function as a carrier to transfer proteins (i.e., chromatin modifiers, and transcription factors) that, once transferred, can alter the cell’s epigenome. PMID:28937658

  14. Differential expression of lncRNAs during the HIV replication cycle: an underestimated layer in the HIV-host interplay.

    PubMed

    Trypsteen, Wim; Mohammadi, Pejman; Van Hecke, Clarissa; Mestdagh, Pieter; Lefever, Steve; Saeys, Yvan; De Bleser, Pieter; Vandesompele, Jo; Ciuffi, Angela; Vandekerckhove, Linos; De Spiegelaere, Ward

    2016-10-26

    Studying the effects of HIV infection on the host transcriptome has typically focused on protein-coding genes. However, recent advances in the field of RNA sequencing revealed that long non-coding RNAs (lncRNAs) add an extensive additional layer to the cell's molecular network. Here, we performed transcriptome profiling throughout a primary HIV infection in vitro to investigate lncRNA expression at the different HIV replication cycle processes (reverse transcription, integration and particle production). Subsequently, guilt-by-association, transcription factor and co-expression analysis were performed to infer biological roles for the lncRNAs identified in the HIV-host interplay. Many lncRNAs were suggested to play a role in mechanisms relying on proteasomal and ubiquitination pathways, apoptosis, DNA damage responses and cell cycle regulation. Through transcription factor binding analysis, we found that lncRNAs display a distinct transcriptional regulation profile as compared to protein coding mRNAs, suggesting that mRNAs and lncRNAs are independently modulated. In addition, we identified five differentially expressed lncRNA-mRNA pairs with mRNA involvement in HIV pathogenesis with possible cis regulatory lncRNAs that control nearby mRNA expression and function. Altogether, the present study demonstrates that lncRNAs add a new dimension to the HIV-host interplay and should be further investigated as they may represent targets for controlling HIV replication.

  15. Rapid upregulation and clearance of distinct circulating microRNAs after prolonged aerobic exercise.

    PubMed

    Baggish, Aaron L; Park, Joseph; Min, Pil-Ki; Isaacs, Stephanie; Parker, Beth A; Thompson, Paul D; Troyanos, Chris; D'Hemecourt, Pierre; Dyer, Sophia; Thiel, Marissa; Hale, Andrew; Chan, Stephen Y

    2014-03-01

    Short nonprotein coding RNA molecules, known as microRNAs (miRNAs), are intracellular mediators of adaptive processes, including muscle hypertrophy, contractile force generation, and inflammation. During basal conditions and tissue injury, miRNAs are released into the bloodstream as "circulating" miRNAs (c-miRNAs). To date, the impact of extended-duration, submaximal aerobic exercise on plasma concentrations of c-miRNAs remains incompletely characterized. We hypothesized that specific c-miRNAs are differentially upregulated following prolonged aerobic exercise. To test this hypothesis, we measured concentrations of c-miRNAs enriched in muscle (miR-1, miR-133a, miR-499-5p), cardiac tissue (miR-208a), and the vascular endothelium (miR-126), as well as those important in inflammation (miR-146a) in healthy male marathon runners (N = 21) at rest, immediately after a marathon (42-km foot race), and 24 h after the race. In addition, we compared c-miRNA profiles to those of conventional protein biomarkers reflective of skeletal muscle damage, cardiac stress and necrosis, and systemic inflammation. Candidate c-miRNAs increased immediately after the marathon and declined to prerace levels or lower after 24 h of race completion. However, the magnitude of change for each c-miRNA differed, even when originating from the same tissue type. In contrast, traditional biomarkers increased after exercise but remained elevated 24 h postexercise. Thus c-miRNAs respond differentially to prolonged exercise, suggesting the existence of specific mechanisms of c-miRNA release and clearance not fully explained by generalized cellular injury. Furthermore, c-miRNA expression patterns differ in a temporal fashion from corollary conventional tissue-specific biomarkers, emphasizing the potential of c-miRNAs as unique, real-time markers of exercise-induced tissue adaptation.

  16. Alternative splicing of a viral mirtron differentially affects the expression of other microRNAs from its cluster and of the host transcript

    PubMed Central

    Rasschaert, Perrine; Dambrine, Ginette; Rasschaert, Denis; Laurent, Sylvie

    2016-01-01

    ABSTRACT Interplay between alternative splicing and the Microprocessor may have differential effects on the expression of intronic miRNAs organized into clusters. We used a viral model — the LAT long non-coding RNA (LAT lncRNA) of Marek's disease oncogenic herpesvirus (MDV-1), which has the mdv1-miR-M8-M6-M7-M10 cluster embedded in its first intron — to assess the impact of splicing modifications on the biogenesis of each of the miRNAs from the cluster. Drosha silencing and alternative splicing of an extended exon 2 of the LAT lncRNA from a newly identified 3′ splice site (SS) at the end of the second miRNA of the cluster showed that mdv1-miR-M6 was a 5′-tailed mirtron. We have thus identified the first 5′-tailed mirtron within a cluster of miRNAs for which alternative splicing is directly associated with differential expression of the other miRNAs of the cluster, with an increase in intronic mdv1-miR-M8 expression and a decrease in expression of the exonic mdv1-miR-M7, and indirectly associated with regulation of the host transcript. According to the alternative 3SS used for the host intron splicing, the mdv1-miR-M6 is processed as a mirtron by the spliceosome, dispatching the other miRNAs of the cluster into intron and exon, or as a canonical miRNA by the Microprocessor complex. The viral mdv1-miR-M6 mirtron is the first mirtron described that can also follow the canonical pathway. PMID:27715458

  17. MicroRNA-503 and the Extended MicroRNA-16 Family in Angiogenesis

    PubMed Central

    Caporali, Andrea; Emanueli, Costanza

    2011-01-01

    MicroRNAs (miRs) are post-transcriptional inhibitory regulators of gene expression acting by direct binding to complementary messenger RNA (mRNA) transcripts. Recent studies have demonstrated that miRs are crucial determinants of endothelial cell behavior and angiogenesis. We have provided evidence of the prominent role of miR-503 in impairment of postischemic reparative angiogenesis in the setting of diabetes. Because miR-503 belongs to the miR-16 extended family of miRs, in this review, we describe the cardiovascular functions of miR-503 and other members of the miR-16 family and their impact on angiogenesis. PMID:22814423

  18. Dicer cleaves 5'-extended microRNA precursors originating from RNA polymerase II transcription start sites.

    PubMed

    Sheng, Peike; Fields, Christopher; Aadland, Kelsey; Wei, Tianqi; Kolaczkowski, Oralia; Gu, Tongjun; Kolaczkowski, Bryan; Xie, Mingyi

    2018-05-09

    MicroRNAs (miRNAs) are approximately 22 nucleotide (nt) long and play important roles in post-transcriptional regulation in both plants and animals. In animals, precursor (pre-) miRNAs are ∼70 nt hairpins produced by Drosha cleavage of long primary (pri-) miRNAs in the nucleus. Exportin-5 (XPO5) transports pre-miRNAs into the cytoplasm for Dicer processing. Alternatively, pre-miRNAs containing a 5' 7-methylguanine (m7G-) cap can be generated independently of Drosha and XPO5. Here we identify a class of m7G-capped pre-miRNAs with 5' extensions up to 39 nt long. The 5'-extended pre-miRNAs are transported by Exportin-1 (XPO1). Unexpectedly, a long 5' extension does not block Dicer processing. Rather, Dicer directly cleaves 5'-extended pre-miRNAs by recognizing its 3' end to produce mature 3p miRNA and extended 5p miRNA both in vivo and in vitro. The recognition of 5'-extended pre-miRNAs by the Dicer Platform-PAZ-Connector (PPC) domain can be traced back to ancestral animal Dicers, suggesting that this previously unrecognized Dicer reaction mode is evolutionarily conserved. Our work reveals additional genetic sources for small regulatory RNAs and substantiates Dicer's essential role in RNAi-based gene regulation.

  19. The majority of total nuclear-encoded non-ribosomal RNA in a human cell is 'dark matter' un-annotated RNA.

    PubMed

    Kapranov, Philipp; St Laurent, Georges; Raz, Tal; Ozsolak, Fatih; Reynolds, C Patrick; Sorensen, Poul H B; Reaman, Gregory; Milos, Patrice; Arceci, Robert J; Thompson, John F; Triche, Timothy J

    2010-12-21

    Discovery that the transcriptional output of the human genome is far more complex than predicted by the current set of protein-coding annotations and that most RNAs produced do not appear to encode proteins has transformed our understanding of genome complexity and suggests new paradigms of genome regulation. However, the fraction of all cellular RNA whose function we do not understand and the fraction of the genome that is utilized to produce that RNA remain controversial. This is not simply a bookkeeping issue because the degree to which this un-annotated transcription is present has important implications with respect to its biologic function and to the general architecture of genome regulation. For example, efforts to elucidate how non-coding RNAs (ncRNAs) regulate genome function will be compromised if that class of RNAs is dismissed as simply 'transcriptional noise'. We show that the relative mass of RNA whose function and/or structure we do not understand (the so called 'dark matter' RNAs), as a proportion of all non-ribosomal, non-mitochondrial human RNA (mt-RNA), can be greater than that of protein-encoding transcripts. This observation is obscured in studies that focus only on polyA-selected RNA, a method that enriches for protein coding RNAs and at the same time discards the vast majority of RNA prior to analysis. We further show the presence of a large number of very long, abundantly-transcribed regions (100's of kb) in intergenic space and further show that expression of these regions is associated with neoplastic transformation. These overlap some regions found previously in normal human embryonic tissues and raises an interesting hypothesis as to the function of these ncRNAs in both early development and neoplastic transformation. We conclude that 'dark matter' RNA can constitute the majority of non-ribosomal, non-mitochondrial-RNA and a significant fraction arises from numerous very long, intergenic transcribed regions that could be involved in neoplastic transformation.

  20. The Complete Mitochondrial Genome of Mindarus keteleerifoliae (Insecta: Hemiptera: Aphididae) and Comparison with Other Aphididae Insects.

    PubMed

    Wang, Yuan; Chen, Jing; Jiang, Li-Yun; Qiao, Ge-Xia

    2015-12-17

    The mitogenome of Mindarus keteleerifoliae Zhang (Hemiptera: Aphididae) is a 15,199 bp circular molecule. The gene order and orientation of M. keteleerifoliae is similarly arranged to that of the ancestral insect of other aphid mitogenomes, and, a tRNA isomerism event maybe identified in the mitogenome of M. keteleerifoliae. The tRNA-Trp gene is coded in the J-strand and the same sequence in the N-strand codes for the tRNA-Ser gene. A similar phenomenon was also found in the mitogenome of Eriosoma lanigerum. However, whether tRNA isomers in aphids exist requires further study. Phylogenetic analyses, using all available protein-coding genes, support Mindarinae as the basal position of Aphididae. Two tribes of Aphidinae were recovered with high statistical significance. Characteristics of the M. keteleerifoliae mitogenome revealed distinct mitogenome structures and provided abundant phylogenetic signals, thus advancing our understanding of insect mitogenomic architecture and evolution. But, because only eight complete aphid mitogenomes, including M. keteleerifoliae, were published, future studies with larger taxon sampling sizes are necessary.

  1. Regulated Formation of lncRNA-DNA Hybrids Enables Faster Transcriptional Induction and Environmental Adaptation.

    PubMed

    Cloutier, Sara C; Wang, Siwen; Ma, Wai Kit; Al Husini, Nadra; Dhoondia, Zuzer; Ansari, Athar; Pascuzzi, Pete E; Tran, Elizabeth J

    2016-02-04

    Long non-coding (lnc)RNAs, once thought to merely represent noise from imprecise transcription initiation, have now emerged as major regulatory entities in all eukaryotes. In contrast to the rapidly expanding identification of individual lncRNAs, mechanistic characterization has lagged behind. Here we provide evidence that the GAL lncRNAs in the budding yeast S. cerevisiae promote transcriptional induction in trans by formation of lncRNA-DNA hybrids or R-loops. The evolutionarily conserved RNA helicase Dbp2 regulates formation of these R-loops as genomic deletion or nuclear depletion results in accumulation of these structures across the GAL cluster gene promoters and coding regions. Enhanced transcriptional induction is manifested by lncRNA-dependent displacement of the Cyc8 co-repressor and subsequent gene looping, suggesting that these lncRNAs promote induction by altering chromatin architecture. Moreover, the GAL lncRNAs confer a competitive fitness advantage to yeast cells because expression of these non-coding molecules correlates with faster adaptation in response to an environmental switch. Copyright © 2016 Elsevier Inc. All rights reserved.

  2. lncRScan-SVM: A Tool for Predicting Long Non-Coding RNAs Using Support Vector Machine.

    PubMed

    Sun, Lei; Liu, Hui; Zhang, Lin; Meng, Jia

    2015-01-01

    Functional long non-coding RNAs (lncRNAs) have been bringing novel insight into biological study, however it is still not trivial to accurately distinguish the lncRNA transcripts (LNCTs) from the protein coding ones (PCTs). As various information and data about lncRNAs are preserved by previous studies, it is appealing to develop novel methods to identify the lncRNAs more accurately. Our method lncRScan-SVM aims at classifying PCTs and LNCTs using support vector machine (SVM). The gold-standard datasets for lncRScan-SVM model training, lncRNA prediction and method comparison were constructed according to the GENCODE gene annotations of human and mouse respectively. By integrating features derived from gene structure, transcript sequence, potential codon sequence and conservation, lncRScan-SVM outperforms other approaches, which is evaluated by several criteria such as sensitivity, specificity, accuracy, Matthews correlation coefficient (MCC) and area under curve (AUC). In addition, several known human lncRNA datasets were assessed using lncRScan-SVM. LncRScan-SVM is an efficient tool for predicting the lncRNAs, and it is quite useful for current lncRNA study.

  3. Novel base-pairing interactions at the tRNA wobble position crucial for accurate reading of the genetic code.

    PubMed

    Rozov, Alexey; Demeshkina, Natalia; Khusainov, Iskander; Westhof, Eric; Yusupov, Marat; Yusupova, Gulnara

    2016-01-21

    Posttranscriptional modifications at the wobble position of transfer RNAs play a substantial role in deciphering the degenerate genetic code on the ribosome. The number and variety of modifications suggest different mechanisms of action during messenger RNA decoding, of which only a few were described so far. Here, on the basis of several 70S ribosome complex X-ray structures, we demonstrate how Escherichia coli tRNA(Lys)(UUU) with hypermodified 5-methylaminomethyl-2-thiouridine (mnm(5)s(2)U) at the wobble position discriminates between cognate codons AAA and AAG, and near-cognate stop codon UAA or isoleucine codon AUA, with which it forms pyrimidine-pyrimidine mismatches. We show that mnm(5)s(2)U forms an unusual pair with guanosine at the wobble position that expands general knowledge on the degeneracy of the genetic code and specifies a powerful role of tRNA modifications in translation. Our models consolidate the translational fidelity mechanism proposed previously where the steric complementarity and shape acceptance dominate the decoding mechanism.

  4. Hypoxic exosomes facilitate bladder tumor growth and development through transferring long non-coding RNA-UCA1.

    PubMed

    Xue, Mei; Chen, Wei; Xiang, An; Wang, Ruiqi; Chen, He; Pan, Jingjing; Pang, Huan; An, Hongli; Wang, Xiang; Hou, Huilian; Li, Xu

    2017-08-25

    To overcome the hostile hypoxic microenvironment of solid tumors, tumor cells secrete a large number of non-coding RNA-containing exosomes that facilitate tumor development and metastasis. However, the precise mechanisms of tumor cell-derived exosomes during hypoxia are unknown. Here, we aim to clarify whether hypoxia affects tumor growth and progression by transferring long non-coding RNA-urothelial cancer-associated 1 (lncRNA-UCA1) enriched exosomes secreted from bladder cancer cells. We used bladder cancer 5637 cells with high expression of lncRNA-UCA1 as exosome-generating cells and bladder cancer UMUC2 cells with low expression of lncRNA-UCA1 as recipient cells. Exosomes derived from 5637 cells cultured under normoxic or hypoxic conditions were isolated and identified by transmission electron microscopy, nanoparticle tracking analysis and western blotting analysis. These exosomes were co-cultured with UMUC2 cells to evaluate cell proliferation, migration and invasion. We further investigated the roles of exosomal lncRNA-UCA1 derived from hypoxic 5637 cells by xenograft models. The availability of lncRNA-UCA1 in serum-derived exosomes as a biomarker for bladder cancer was also assessed. We found that hypoxic exosomes derived from 5637 cells promoted cell proliferation, migration and invasion, and hypoxic exosomal RNAs could be internalized by three bladder cancer cell lines. Importantly, lncRNA-UCA1 was secreted in hypoxic 5637 cell-derived exosomes. Compared with normoxic exosomes, hypoxic exosomes derived from 5637 cells showed the higher expression levels of lncRNA-UCA1. Moreover, Hypoxic exosomal lncRNA-UCA1 could promote tumor growth and progression though epithelial-mesenchymal transition, in vitro and in vivo. In addition, the expression levels of lncRNA-UCA1 in the human serum-derived exosomes of bladder cancer patients were higher than that in the healthy controls. Together, our results demonstrate that hypoxic bladder cancer cells remodel tumor microenvironment to facilitate tumor growth and development though secreting the oncogenic lncRNA-UCA1-enriched exosomes and exosomal lncRNA-UCA1 in human serum has the possibility as a diagnostic biomarker for bladder cancer.

  5. Development of new two-dimensional spectral/spatial code based on dynamic cyclic shift code for OCDMA system

    NASA Astrophysics Data System (ADS)

    Jellali, Nabiha; Najjar, Monia; Ferchichi, Moez; Rezig, Houria

    2017-07-01

    In this paper, a new two-dimensional spectral/spatial codes family, named two dimensional dynamic cyclic shift codes (2D-DCS) is introduced. The 2D-DCS codes are derived from the dynamic cyclic shift code for the spectral and spatial coding. The proposed system can fully eliminate the multiple access interference (MAI) by using the MAI cancellation property. The effect of shot noise, phase-induced intensity noise and thermal noise are used to analyze the code performance. In comparison with existing two dimensional (2D) codes, such as 2D perfect difference (2D-PD), 2D Extended Enhanced Double Weight (2D-Extended-EDW) and 2D hybrid (2D-FCC/MDW) codes, the numerical results show that our proposed codes have the best performance. By keeping the same code length and increasing the spatial code, the performance of our 2D-DCS system is enhanced: it provides higher data rates while using lower transmitted power and a smaller spectral width.

  6. Complete mitochondrial genome of the agarophyte red alga Gelidium vagum (Gelidiales).

    PubMed

    Yang, Eun Chan; Kim, Kyeong Mi; Boo, Ga Hun; Lee, Jung-Hyun; Boo, Sung Min; Yoon, Hwan Su

    2014-08-01

    We describe the first complete mitochondrial genome of Gelidium vagum (Gelidiales) (24,901 bp, 30.4% GC content), an agar-producing red alga. The circular mitochondrial genome contains 43 genes, including 23 protein-coding, 18 tRNA and 2 rRNA genes. All the protein-coding genes have a typical ATG start codon. No introns were found. Two genes, secY and rps12, were overlapped by 41 bp.

  7. Molecular Regulatory Pathways Link Sepsis With Metabolic Syndrome: Non-coding RNA Elements Underlying the Sepsis/Metabolic Cross-Talk.

    PubMed

    Meydan, Chanan; Bekenstein, Uriya; Soreq, Hermona

    2018-01-01

    Sepsis and metabolic syndrome (MetS) are both inflammation-related entities with high impact for human health and the consequences of concussions. Both represent imbalanced parasympathetic/cholinergic response to insulting triggers and variably uncontrolled inflammation that indicates shared upstream regulators, including short microRNAs (miRs) and long non-coding RNAs (lncRNAs). These may cross talk across multiple systems, leading to complex molecular and clinical outcomes. Notably, biomedical and RNA-sequencing based analyses both highlight new links between the acquired and inherited pathogenic, cardiac and inflammatory traits of sepsis/MetS. Those include the HOTAIR and MIAT lncRNAs and their targets, such as miR-122, -150, -155, -182, -197, -375, -608 and HLA-DRA. Implicating non-coding RNA regulators in sepsis and MetS may delineate novel high-value biomarkers and targets for intervention.

  8. Long non-coding RNA XIST promotes cell growth by regulating miR-139-5p/PDK1/AKT axis in hepatocellular carcinoma.

    PubMed

    Mo, Yichao; Lu, Yaoyong; Wang, Peng; Huang, Simin; He, Longguang; Li, Dasheng; Li, Fuliang; Huang, Junwei; Lin, Xiaoxia; Li, Xueru; Che, Siyao; Chen, Qinshou

    2017-02-01

    Abnormal expression of long non-coding RNA often contributes to unrestricted growth of cancer cells. Long non-coding RNA XIST expression is upregulated in several cancers; however, its modulatory mechanisms have not been reported in hepatocellular carcinoma. In this study, we found that XIST expression was significantly increased in hepatocellular carcinoma tissues and cell lines. XIST promoted cell cycle progression from the G1 phase to the S phase and protected cells from apoptosis, which contributed to hepatocellular carcinoma cell growth. In addition, we revealed that there was reciprocal repression between XIST and miR-139-5p. PDK1 was identified as a direct target of miR-139-5p. We proposed that XIST was responsible for hepatocellular carcinoma cell proliferation, and XIST exerted its function through the miR-139-5p/PDK1 axis.

  9. The aminoacyl-tRNA synthetases had only a marginal role in the origin of the organization of the genetic code: Evidence in favor of the coevolution theory.

    PubMed

    Di Giulio, Massimo

    2017-11-07

    The coevolution theory of the origin of the genetic code suggests that the organization of the genetic code coevolved with the biosynthetic relationships between amino acids. The mechanism that allowed this coevolution was based on tRNA-like molecules on which-this theory-would postulate the biosynthetic transformations between amino acids to have occurred. This mechanism makes a prediction on how the role conducted by the aminoacyl-tRNA synthetases (ARSs), in the origin of the genetic code, should have been. Indeed, if the biosynthetic transformations between amino acids occurred on tRNA-like molecules, then there was no need to link amino acids to these molecules because amino acids were already charged on tRNA-like molecules, as the coevolution theory suggests. In spite of the fact that ARSs make the genetic code responsible for the first interaction between a component of nucleic acids and that of proteins, for the coevolution theory the role of ARSs should have been entirely marginal in the genetic code origin. Therefore, I have conducted a further analysis of the distribution of the two classes of ARSs and of their subclasses-in the genetic code table-in order to perform a falsification test of the coevolution theory. Indeed, in the case in which the distribution of ARSs within the genetic code would have been highly significant, then the coevolution theory would be falsified since the mechanism on which it is based would not predict a fundamental role of ARSs in the origin of the genetic code. I found that the statistical significance of the distribution of the two classes of ARSs in the table of the genetic code is low or marginal, whereas that of the subclasses of ARSs statistically significant. However, this is in perfect agreement with the postulates of the coevolution theory. Indeed, the only case of statistical significance-regarding the classes of ARSs-is appreciable for the CAG code, whereas for its complement-the UNN/NUN code-only a marginal significance is measurable. These two codes codify roughly for the two ARS classes, in particular, the CAG code for the class II while the UNN/NUN code for the class I. Furthermore, the subclasses of ARSs show a statistical significance of their distribution in the genetic code table. Nevertheless, the more sensible explanation for these observations would be the following. The observation that would link the two classes of ARSs to the CAG and UNN/NUN codes, and the statistical significance of the distribution of the subclasses of ARSs in the genetic code table, would be only a secondary effect due to the highly significant distribution of the polarity of amino acids and their biosynthetic relationships in the genetic code. That is to say, the polarity of amino acids and their biosynthetic relationships would have conditioned the evolution of ARSs so that their presence in the genetic code would have been detectable. Even if the ARSs would not have-on their own-influenced directly the evolutionary organization of the genetic code. In other words, the role that ARSs had in the origin of the genetic code would have been entirely marginal. This conclusion would be in perfect accord with the predictions of the coevolution theory. Conversely, this conclusion would be in contrast-at least partially-with the physicochemical theories of the origin of the genetic code because they would foresee an absolutely more active role of ARSs in the origin of the organization of the genetic code. Copyright © 2017 Elsevier Ltd. All rights reserved.

  10. Conserved Non-Coding Sequences are Associated with Rates of mRNA Decay in Arabidopsis.

    PubMed

    Spangler, Jacob B; Feltus, Frank Alex

    2013-01-01

    Steady-state mRNA levels are tightly regulated through a combination of transcriptional and post-transcriptional control mechanisms. The discovery of cis-acting DNA elements that encode these control mechanisms is of high importance. We have investigated the influence of conserved non-coding sequences (CNSs), DNA patterns retained after an ancient whole genome duplication event, on the breadth of gene expression and the rates of mRNA decay in Arabidopsis thaliana. The absence of CNSs near α duplicate genes was associated with a decrease in breadth of gene expression and slower mRNA decay rates while the presence CNSs near α duplicates was associated with an increase in breadth of gene expression and faster mRNA decay rates. The observed difference in mRNA decay rate was fastest in genes with CNSs in both non-transcribed and transcribed regions, albeit through an unknown mechanism. This study supports the notion that some Arabidopsis CNSs regulate the steady-state mRNA levels through post-transcriptional control mechanisms and that CNSs also play a role in controlling the breadth of gene expression.

  11. Conserved Non-Coding Sequences are Associated with Rates of mRNA Decay in Arabidopsis

    PubMed Central

    Spangler, Jacob B.; Feltus, Frank Alex

    2013-01-01

    Steady-state mRNA levels are tightly regulated through a combination of transcriptional and post-transcriptional control mechanisms. The discovery of cis-acting DNA elements that encode these control mechanisms is of high importance. We have investigated the influence of conserved non-coding sequences (CNSs), DNA patterns retained after an ancient whole genome duplication event, on the breadth of gene expression and the rates of mRNA decay in Arabidopsis thaliana. The absence of CNSs near α duplicate genes was associated with a decrease in breadth of gene expression and slower mRNA decay rates while the presence CNSs near α duplicates was associated with an increase in breadth of gene expression and faster mRNA decay rates. The observed difference in mRNA decay rate was fastest in genes with CNSs in both non-transcribed and transcribed regions, albeit through an unknown mechanism. This study supports the notion that some Arabidopsis CNSs regulate the steady-state mRNA levels through post-transcriptional control mechanisms and that CNSs also play a role in controlling the breadth of gene expression. PMID:23675377

  12. Polyspecific pyrrolysyl-tRNA synthetases from directed evolution.

    PubMed

    Guo, Li-Tao; Wang, Yane-Shih; Nakamura, Akiyoshi; Eiler, Daniel; Kavran, Jennifer M; Wong, Margaret; Kiessling, Laura L; Steitz, Thomas A; O'Donoghue, Patrick; Söll, Dieter

    2014-11-25

    Pyrrolysyl-tRNA synthetase (PylRS) and its cognate tRNA(Pyl) have emerged as ideal translation components for genetic code innovation. Variants of the enzyme facilitate the incorporation >100 noncanonical amino acids (ncAAs) into proteins. PylRS variants were previously selected to acylate N(ε)-acetyl-Lys (AcK) onto tRNA(Pyl). Here, we examine an N(ε)-acetyl-lysyl-tRNA synthetase (AcKRS), which is polyspecific (i.e., active with a broad range of ncAAs) and 30-fold more efficient with Phe derivatives than it is with AcK. Structural and biochemical data reveal the molecular basis of polyspecificity in AcKRS and in a PylRS variant [iodo-phenylalanyl-tRNA synthetase (IFRS)] that displays both enhanced activity and substrate promiscuity over a chemical library of 313 ncAAs. IFRS, a product of directed evolution, has distinct binding modes for different ncAAs. These data indicate that in vivo selections do not produce optimally specific tRNA synthetases and suggest that translation fidelity will become an increasingly dominant factor in expanding the genetic code far beyond 20 amino acids.

  13. RNA Relics and Origin of Life

    PubMed Central

    Demongeot, Jacques; Glade, Nicolas; Moreira, Andrés; Vial, Laurent

    2009-01-01

    A number of small RNA sequences, located in different non-coding sequences and highly preserved across the tree of life, have been suggested to be molecular fossils, of ancient (and possibly primordial) origin. On the other hand, recent years have revealed the existence of ubiquitous roles for small RNA sequences in modern organisms, in functions ranging from cell regulation to antiviral activity. We propose that a single thread can be followed from the beginning of life in RNA structures selected only for stability reasons through the RNA relics and up to the current coevolution of RNA sequences; such an understanding would shed light both on the history and on the present development of the RNA machinery and interactions. After presenting the evidence (by comparing their sequences) that points toward a common thread, we discuss a scenario of genome coevolution (with emphasis on viral infectious processes) and finally propose a plan for the reevaluation of the stereochemical theory of the genetic code; we claim that it may still be relevant, and not only for understanding the origin of life, but also for a comprehensive picture of regulation in present-day cells. PMID:20111682

  14. GRIL-seq provides a method for identifying direct targets of bacterial small regulatory RNA by in vivo proximity ligation.

    PubMed

    Han, Kook; Tjaden, Brian; Lory, Stephen

    2016-12-22

    The first step in the post-transcriptional regulatory function of most bacterial small non-coding RNAs (sRNAs) is base pairing with partially complementary sequences of targeted transcripts. We present a simple method for identifying sRNA targets in vivo and defining processing sites of the regulated transcripts. The technique, referred to as global small non-coding RNA target identification by ligation and sequencing (GRIL-seq), is based on preferential ligation of sRNAs to the ends of base-paired targets in bacteria co-expressing T4 RNA ligase, followed by sequencing to identify the chimaeras. In addition to the RNA chaperone Hfq, the GRIL-seq method depends on the activity of the pyrophosphorylase RppH. Using PrrF1, an iron-regulated sRNA in Pseudomonas aeruginosa, we demonstrated that direct regulatory targets of this sRNA can readily be identified. Therefore, GRIL-seq represents a powerful tool not only for identifying direct targets of sRNAs in a variety of environments, but also for uncovering novel roles for sRNAs and their targets in complex regulatory networks.

  15. The effects of potato virus Y-derived virus small interfering RNAs of three biologically distinct strains on potato (Solanum tuberosum) transcriptome.

    PubMed

    Moyo, Lindani; Ramesh, Shunmugiah V; Kappagantu, Madhu; Mitter, Neena; Sathuvalli, Vidyasagar; Pappu, Hanu R

    2017-07-17

    Potato virus Y (PVY) is one of the most economically important pathogen of potato that is present as biologically distinct strains. The virus-derived small interfering RNAs (vsiRNAs) from potato cv. Russet Burbank individually infected with PVY-N, PVY-NTN and PVY-O strains were recently characterized. Plant defense RNA-silencing mechanisms deployed against viruses produce vsiRNAs to degrade homologous viral transcripts. Based on sequence complementarity, the vsiRNAs can potentially degrade host RNA transcripts raising the prospect of vsiRNAs as pathogenicity determinants in virus-host interactions. This study investigated the global effects of PVY vsiRNAs on the host potato transcriptome. The strain-specific vsiRNAs of PVY, expressed in high copy number, were analyzed in silico for their proclivity to target potato coding and non-coding RNAs using psRobot and psRNATarget algorithms. Functional annotation of target coding transcripts was carried out to predict physiological effects of the vsiRNAs on the potato cv. Russet Burbank. The downregulation of selected target coding transcripts was further validated using qRT-PCR. The vsiRNAs derived from biologically distinct strains of PVY displayed diversity in terms of absolute number, copy number and hotspots for siRNAs on their respective genomes. The vsiRNAs populations were derived with a high frequency from 6 K1, P1 and Hc-Pro for PVY-N, P1, Hc-Pro and P3 for PVY-NTN, and P1, 3' UTR and NIa for PVY-O genomic regions. The number of vsiRNAs that displayed interaction with potato coding transcripts and number of putative coding target transcripts were comparable between PVY-N and PVY-O, and were relatively higher for PVY-NTN. The most abundant target non-coding RNA transcripts for the strain specific PVY-derived vsiRNAs were found to be MIR821, 28S rRNA,18S rRNA, snoR71, tRNA-Met and U5. Functional annotation and qRT-PCR validation suggested that the vsiRNAs target genes involved in plant hormone signaling, genetic information processing, plant-pathogen interactions, plant defense and stress response processes in potato. The findings suggested that the PVY-derived vsiRNAs could act as a pathogenicity determinant and as a counter-defense strategy to host RNA silencing in PVY-potato interactions. The broad range of host genes targeted by PVY vsiRNAs in infected potato suggests a diverse role for vsiRNAs that includes suppression of host stress responses and developmental processes. The interactome scenario is the first report on the interaction between one of the most important Potyvirus genome-derived siRNAs and the potato transcripts.

  16. Post-transcriptional gene silencing triggered by sense transgenes involves uncapped antisense RNA and differs from silencing intentionally triggered by antisense transgenes.

    PubMed

    Parent, Jean-Sébastien; Jauvion, Vincent; Bouché, Nicolas; Béclin, Christophe; Hachet, Mélanie; Zytnicki, Matthias; Vaucheret, Hervé

    2015-09-30

    Although post-transcriptional gene silencing (PTGS) has been studied for more than a decade, there is still a gap in our understanding of how de novo silencing is initiated against genetic elements that are not supposed to produce double-stranded (ds)RNA. Given the pervasive transcription occurring throughout eukaryote genomes, we tested the hypothesis that unintended transcription could produce antisense (as)RNA molecules that participate to the initiation of PTGS triggered by sense transgenes (S-PTGS). Our results reveal a higher level of asRNA in Arabidopsis thaliana lines that spontaneously trigger S-PTGS than in lines that do not. However, PTGS triggered by antisense transgenes (AS-PTGS) differs from S-PTGS. In particular, a hypomorphic ago1 mutation that suppresses S-PTGS prevents the degradation of asRNA but not sense RNA during AS-PTGS, suggesting a different treatment of coding and non-coding RNA by AGO1, likely because of AGO1 association to polysomes. Moreover, the intended asRNA produced during AS-PTGS is capped whereas the asRNA produced during S-PTGS derives from 3' maturation of a read-through transcript and is uncapped. Thus, we propose that uncapped asRNA corresponds to the aberrant RNA molecule that is converted to dsRNA by RNA-DEPENDENT RNA POLYMERASE 6 in siRNA-bodies to initiate S-PTGS, whereas capped asRNA must anneal with sense RNA to produce dsRNA that initiate AS-PTGS. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  17. Automated and fast building of three-dimensional RNA structures.

    PubMed

    Zhao, Yunjie; Huang, Yangyu; Gong, Zhou; Wang, Yanjie; Man, Jianfen; Xiao, Yi

    2012-01-01

    Building tertiary structures of non-coding RNA is required to understand their functions and design new molecules. Current algorithms of RNA tertiary structure prediction give satisfactory accuracy only for small size and simple topology and many of them need manual manipulation. Here, we present an automated and fast program, 3dRNA, for RNA tertiary structure prediction with reasonable accuracy for RNAs of larger size and complex topology.

  18. Organization and transient expression of the gene for human U11 snRNA

    PubMed Central

    Clemens, Suter-Crazzolara; Walter, Keller

    1991-01-01

    The nucleotide sequence of U11 small nuclear RNA, a minor U RNA from HeLa cells, was determined. Computer analysis of the sequence (135 residues) predicts two strong hairpin loops which are separated by seventeen nucleotides containing an Sm binding site (AAUUUUUUGG). A synthetic gene was constructed in which the coding region of U11 RNA is under the control of a T7 promoter. This vector can be used to produce U11 RNA in vitro. Southern hybridization and PCR analysis of HeLa genomic DNA suggest that U11 RNA is encoded by a single copy gene, and that at least three genomic regions could be U11 RNA pseudogenes. A HeLa genomic copy of a U11 gene was isolated by inverted PCR. This gene contains the U11 RNA coding sequence and several sequence elements unique for the U RNA genes. These include a Distal Sequence Element (DSE, ATTTGCATA) present between positions −215 and −223 relative to the start of transcription; a Proximal Sequence Element (PSE, TTCACCTTTACCAAAAATG) located between positions −43 and −63 ; and a 3′box (GTTAGGCGAAATATTA) between positions +150 and +166. Transfection of HeLa cells with this gene revealed that it is functioning in vivo and can produce U11 RNA. PMID:1820214

  19. Live-cell imaging of budding yeast telomerase RNA and TERRA.

    PubMed

    Laprade, Hadrien; Lalonde, Maxime; Guérit, David; Chartrand, Pascal

    2017-02-01

    In most eukaryotes, the ribonucleoprotein complex telomerase is responsible for maintaining telomere length. In recent years, single-cell microscopy techniques such as fluorescent in situ hybridization and live-cell imaging have been developed to image the RNA subunit of the telomerase holoenzyme. These techniques are now becoming important tools for the study of telomerase biogenesis, its association with telomeres and its regulation. Here, we present detailed protocols for live-cell imaging of the Saccharomyces cerevisiae telomerase RNA subunit, called TLC1, and also of the non-coding telomeric repeat-containing RNA TERRA. We describe the approach used for genomic integration of MS2 stem-loops in these transcripts, and provide information for optimal live-cell imaging of these non-coding RNAs. Copyright © 2016 Elsevier Inc. All rights reserved.

  20. Long non-coding RNA CTA sensitizes osteosarcoma cells to doxorubicin through inhibition of autophagy

    PubMed Central

    Wang, Zhengguang; Liu, Zhendong; Wu, Song

    2017-01-01

    Recently, several long non-coding RNAs (lncRNAs) have been implicated in osteosarcoma (OS). However, the regulatory roles of lncRNAs in chemotherapy resistance of OS still remain unclear. This study aimed to screen a novel lncRNA that contributes to chemotherapeutic resistance of OS, and to explore the underlying mechanisms. Our data showed that lncRNA CTA was markedly downregulated in OS tissues compared to their matched non-tumor tissues, and low expression of lncRNA CTA was significantly associated with the advanced clinical stage and tumor size. In addition, OS patients with low lncRNA CTA levels showed a worse prognosis when compared with those with high expression of lncRNA CTA. Furthermore, we report that lncRNA CTA has an inverse relationship with miR-210 expression in OS tissues. LncRNA CTA could be activated by doxorubicin (DOX), and could promote OS cell apoptosis by competitively binding miR-210, while inhibit cell autophagy. On the other hand, lncRNA CTA was downregulated in DOX-resistant OS cells. Overexpression of lncRNA CTA reduced autophagy and subsequently overcame DOX resistance of OS in vitro and in vivo. Therefore, we demonstrate that lncRNA CTA is an essential regulator in DOX-induced OS cell apoptosis, and the lncRNA CTA-miR-210 axis plays an important role in reducing OS chemoresistance. PMID:28415557

  1. Elementary screening of lymph node metastatic-related genes in gastric cancer based on the co-expression network of messenger RNA, microRNA and long non-coding RNA.

    PubMed

    Song, Zhonghua; Zhao, Wenhua; Cao, Danfeng; Zhang, Jinqing; Chen, Shouhua

    2018-01-01

    Gastric cancer (GC) is the fifth most common cancer and the third leading cause of cancer-related deaths worldwide. The high mortality might be attributed to delay in detection and is closely related to lymph node metastasis. Therefore, it is of great importance to explore the mechanism of lymph node metastasis and find strategies to block GC metastasis. Messenger RNA (mRNA), microRNA (miRNA) and long non-coding RNA (lncRNA) expression data and clinical data were downloaded from The Cancer Genome Atlas (TCGA) database. A total of 908 differentially expressed factors with variance >0.5 including 542 genes, 42 miRNA, and 324 lncRNA were screened using significant analysis microarray algorithm, and interaction networks were constructed using these differentially expressed factors. Furthermore, we conducted functional modules analysis in the network, and found that yellow and turquoise modules could separate samples efficiently. The groups classified in the yellow and turquoise modules had a significant difference in survival time, which was verified in another independent GC mRNA dataset (GSE62254). The results suggested that differentially expressed factors in the yellow and turquoise modules may participate in lymph node metastasis of GC and could be applied as potential biomarkers or therapeutic targets for GC.

  2. Elementary screening of lymph node metastatic-related genes in gastric cancer based on the co-expression network of messenger RNA, microRNA and long non-coding RNA

    PubMed Central

    Song, Zhonghua; Zhao, Wenhua; Cao, Danfeng; Zhang, Jinqing; Chen, Shouhua

    2018-01-01

    Gastric cancer (GC) is the fifth most common cancer and the third leading cause of cancer-related deaths worldwide. The high mortality might be attributed to delay in detection and is closely related to lymph node metastasis. Therefore, it is of great importance to explore the mechanism of lymph node metastasis and find strategies to block GC metastasis. Messenger RNA (mRNA), microRNA (miRNA) and long non-coding RNA (lncRNA) expression data and clinical data were downloaded from The Cancer Genome Atlas (TCGA) database. A total of 908 differentially expressed factors with variance >0.5 including 542 genes, 42 miRNA, and 324 lncRNA were screened using significant analysis microarray algorithm, and interaction networks were constructed using these differentially expressed factors. Furthermore, we conducted functional modules analysis in the network, and found that yellow and turquoise modules could separate samples efficiently. The groups classified in the yellow and turquoise modules had a significant difference in survival time, which was verified in another independent GC mRNA dataset (GSE62254). The results suggested that differentially expressed factors in the yellow and turquoise modules may participate in lymph node metastasis of GC and could be applied as potential biomarkers or therapeutic targets for GC. PMID:29489999

  3. CUDR promotes liver cancer stem cell growth through upregulating TERT and C-Myc

    PubMed Central

    Pu, Hu; Zheng, Qidi; Li, Haiyan; Wu, Mengying; An, Jiahui; Gui, Xin; Li, Tianming; Lu, Dongdong

    2015-01-01

    Cancer up-regulated drug resistant (CUDR) is a novel non-coding RNA gene. Herein, we demonstrate excessive CUDR cooperates with excessive CyclinD1 or PTEN depletion to accelerate liver cancer stem cells growth and liver stem cell malignant transformation in vitro and in vivo. Mechanistically, we reveal the decrease of PTEN in cells may lead to increase binding capacity of CUDR to CyclinD1. Therefore, CUDR-CyclinD1 complex loads onto the long noncoding RNA H19 promoter region that may lead to reduce the DNA methylation on H19 promoter region and then to enhance the H19 expression. Strikingly, the overexpression of H19 increases the binding of TERT to TERC and reduces the interplay between TERT with TERRA, thus enhancing the cell telomerase activity and extending the telomere length. On the other hand, insulator CTCF recruits the CUDR-CyclinD1 complx to form the composite CUDR-CyclinD1-insulator CTCF complex which occupancied on the C-myc gene promoter region, increasing the outcome of oncogene C-myc. Ultimately, excessive TERT and C-myc lead to liver cancer stem cell and hepatocyte-like stem cell malignant proliferation. To understand the novel functions of long noncoding RNA CUDR will help in the development of new liver cancer therapeutic and diagnostic approaches. PMID:26513297

  4. The complete mitochondrial genome of Rapana venosa (Gastropoda, Muricidae).

    PubMed

    Sun, Xiujun; Yang, Aiguo

    2016-01-01

    The complete mitochondrial (mt) genome of the veined rapa whelk, Rapana venosa, was determined using genome walking techniques in this study. The total length of the mt genome sequence of R. venosa was 15,271 bp, which is comparable to the reported Muricidae mitogenomes to date. It contained 13 protein-coding genes, 21 transfer RNA genes, and two ribosomal RNA genes. A bias towards a higher representation of nucleotides A and T (69%) was detected in the mt genome of R. venosa. A small number of non-coding nucleotides (302 bp) was detected, and the largest non-coding region was 74 bp in length.

  5. Genetic Code Expansion of Mammalian Cells with Unnatural Amino Acids.

    PubMed

    Brown, Kalyn A; Deiters, Alexander

    2015-09-01

    The expansion of the genetic code of mammalian cells enables the incorporation of unnatural amino acids into proteins. This is achieved by adding components to the protein biosynthetic machinery, specifically an engineered aminoacyl-tRNA synthetase/tRNA pair. The unnatural amino acids are chemically synthesized and supplemented to the growth medium. Using this methodology, fundamental new chemistries can be added to the functional repertoire of the genetic code of mammalian cells. This protocol outlines the steps necessary to incorporate a photocaged lysine into proteins and showcases its application in the optical triggering of protein translocation to the nucleus. Copyright © 2015 John Wiley & Sons, Inc.

  6. Size, Shape, and Sequence-Dependent Immunogenicity of RNA Nanoparticles.

    PubMed

    Guo, Sijin; Li, Hui; Ma, Mengshi; Fu, Jian; Dong, Yizhou; Guo, Peixuan

    2017-12-15

    RNA molecules have emerged as promising therapeutics. Like all other drugs, the safety profile and immune response are important criteria for drug evaluation. However, the literature on RNA immunogenicity has been controversial. Here, we used the approach of RNA nanotechnology to demonstrate that the immune response of RNA nanoparticles is size, shape, and sequence dependent. RNA triangle, square, pentagon, and tetrahedron with same shape but different sizes, or same size but different shapes were used as models to investigate the immune response. The levels of pro-inflammatory cytokines induced by these RNA nanoarchitectures were assessed in macrophage-like cells and animals. It was found that RNA polygons without extension at the vertexes were immune inert. However, when single-stranded RNA with a specific sequence was extended from the vertexes of RNA polygons, strong immune responses were detected. These immunostimulations are sequence specific, because some other extended sequences induced little or no immune response. Additionally, larger-size RNA square induced stronger cytokine secretion. 3D RNA tetrahedron showed stronger immunostimulation than planar RNA triangle. These results suggest that the immunogenicity of RNA nanoparticles is tunable to produce either a minimal immune response that can serve as safe therapeutic vectors, or a strong immune response for cancer immunotherapy or vaccine adjuvants. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  7. Foxo3 activity promoted by non-coding effects of circular RNA and Foxo3 pseudogene in the inhibition of tumor growth and angiogenesis.

    PubMed

    Yang, W; Du, W W; Li, X; Yee, A J; Yang, B B

    2016-07-28

    It has recently been shown that the upregulation of a pseudogene specific to a protein-coding gene could function as a sponge to bind multiple potential targeting microRNAs (miRNAs), resulting in increased gene expression. Similarly, it was recently demonstrated that circular RNAs can function as sponges for miRNAs, and could upregulate expression of mRNAs containing an identical sequence. Furthermore, some mRNAs are now known to not only translate protein, but also function to sponge miRNA binding, facilitating gene expression. Collectively, these appear to be effective mechanisms to ensure gene expression and protein activity. Here we show that expression of a member of the forkhead family of transcription factors, Foxo3, is regulated by the Foxo3 pseudogene (Foxo3P), and Foxo3 circular RNA, both of which bind to eight miRNAs. We found that the ectopic expression of the Foxo3P, Foxo3 circular RNA and Foxo3 mRNA could all suppress tumor growth and cancer cell proliferation and survival. Our results showed that at least three mechanisms are used to ensure protein translation of Foxo3, which reflects an essential role of Foxo3 and its corresponding non-coding RNAs.

  8. Long non-coding RNA XIST inhibited breast cancer cell growth, migration, and invasion via miR-155/CDX1 axis.

    PubMed

    Zheng, Ruinian; Lin, Shunhuan; Guan, Ling; Yuan, Huiling; Liu, Kejun; Liu, Chun; Ye, Weibiao; Liao, Yuting; Jia, Jun; Zhang, Ruopeng

    2018-04-15

    Long non-coding RNA (lncRNA) is an important member of non-coding RNA family and emerging evidence has indicated that it plays a pivotal role in many physiological and pathological processes. The lncRNA X inactive specific transcript (XIST) is a potential tumour suppressor in some types of cancers. However, the expression and function of XIST in breast cancer remain largely unclear. The objective of this study was to evaluate the expression and biological role of XIST in breast cancer. The results showed that XIST was significantly down-regulated in breast cancer tissues and cell lines. Further functional analysis indicated that overexpression of XIST remarkably inhibited breast cancer cell growth, migration, and invasion. The results of luciferase reporter assays verified that miR-155 was a direct target of XIST in breast cancer. Moreover, caudal-type homeobox 1 (CDX1) was identified as a direct target of miR-155 and miR-155/CDX1 rescued the effects of XIST in breast cancer cells. Taken together, our results suggest that XIST is down-regulated in breast cancer and suppresses breast cancer cell growth, migration, and invasion via the miR-155/CDX1 axis. Copyright © 2018. Published by Elsevier Inc.

  9. Transcriptional role of androgen receptor in the expression of long non-coding RNA Sox2OT in neurogenesis

    PubMed Central

    Tosetti, Valentina; Sassone, Jenny; Ferri, Anna L. M.; Taiana, Michela; Bedini, Gloria; Nava, Sara; Brenna, Greta; Di Resta, Chiara; Pareyson, Davide; Di Giulio, Anna Maria; Carelli, Stephana

    2017-01-01

    The complex architecture of adult brain derives from tightly regulated migration and differentiation of precursor cells generated during embryonic neurogenesis. Changes at transcriptional level of genes that regulate migration and differentiation may lead to neurodevelopmental disorders. Androgen receptor (AR) is a transcription factor that is already expressed during early embryonic days. However, AR role in the regulation of gene expression at early embryonic stage is yet to be determinate. Long non-coding RNA (lncRNA) Sox2 overlapping transcript (Sox2OT) plays a crucial role in gene expression control during development but its transcriptional regulation is still to be clearly defined. Here, using Bicalutamide in order to pharmacologically inactivated AR, we investigated whether AR participates in the regulation of the transcription of the lncRNASox2OTat early embryonic stage. We identified a new DNA binding region upstream of Sox2 locus containing three androgen response elements (ARE), and found that AR binds such a sequence in embryonic neural stem cells and in mouse embryonic brain. Our data suggest that through this binding, AR can promote the RNA polymerase II dependent transcription of Sox2OT. Our findings also suggest that AR participates in embryonic neurogenesis through transcriptional control of the long non-coding RNA Sox2OT. PMID:28704421

  10. Transcriptional role of androgen receptor in the expression of long non-coding RNA Sox2OT in neurogenesis.

    PubMed

    Tosetti, Valentina; Sassone, Jenny; Ferri, Anna L M; Taiana, Michela; Bedini, Gloria; Nava, Sara; Brenna, Greta; Di Resta, Chiara; Pareyson, Davide; Di Giulio, Anna Maria; Carelli, Stephana; Parati, Eugenio A; Gorio, Alfredo

    2017-01-01

    The complex architecture of adult brain derives from tightly regulated migration and differentiation of precursor cells generated during embryonic neurogenesis. Changes at transcriptional level of genes that regulate migration and differentiation may lead to neurodevelopmental disorders. Androgen receptor (AR) is a transcription factor that is already expressed during early embryonic days. However, AR role in the regulation of gene expression at early embryonic stage is yet to be determinate. Long non-coding RNA (lncRNA) Sox2 overlapping transcript (Sox2OT) plays a crucial role in gene expression control during development but its transcriptional regulation is still to be clearly defined. Here, using Bicalutamide in order to pharmacologically inactivated AR, we investigated whether AR participates in the regulation of the transcription of the lncRNASox2OTat early embryonic stage. We identified a new DNA binding region upstream of Sox2 locus containing three androgen response elements (ARE), and found that AR binds such a sequence in embryonic neural stem cells and in mouse embryonic brain. Our data suggest that through this binding, AR can promote the RNA polymerase II dependent transcription of Sox2OT. Our findings also suggest that AR participates in embryonic neurogenesis through transcriptional control of the long non-coding RNA Sox2OT.

  11. The complete mitochondrial genome of the gray garden slug Deroceras reticulatum (Gastropoda: Pulmonata: Stylommatophora)

    USDA-ARS?s Scientific Manuscript database

    The complete circular mitochondrial genome of D. reticulatum is 14,048 bp in length, consisting of 13 protein-coding genes (PCGs), 22 transfer RNA (tRNA) genes, and 2 ribosomal RNA (rRNA) genes (GenBank accession number: KY765589). The overall base composition was 31.0 % A, 12.2 % C, 17.7 % G and 39...

  12. A computational search for box C/D snoRNA genes in the Drosophila melanogaster genome.

    PubMed

    Accardo, M C; Giordano, E; Riccardo, S; Digilio, F A; Iazzetti, G; Calogero, R A; Furia, M

    2004-12-12

    In eukaryotes, the family of non-coding RNA genes includes a number of genes encoding small nucleolar RNAs (mainly C/D and H/ACA snoRNAs), which act as guides in the maturation or post-transcriptional modifications of target RNA molecules. Since in Drosophila melanogaster (Dm) only few examples of snoRNAs have been identified so far by cDNA libraries screening, integration of the molecular data with in silico identification of these types of genes could throw light on their organization in the Dm genome. We have performed a computational screening of the Dm genome for C/D snoRNA genes, followed by experimental validation of the putative candidates. Few of the 26 confirmed snoRNAs had been recognized by cDNA library analysis. Organization of the Dm genome was also found to be more variegated than previously suspected, with snoRNA genes nested in both the introns and exons of protein-coding genes. This finding suggests that the presence of additional mechanisms of snoRNA biogenesis based on the alternative production of overlapping mRNA/snoRNA molecules. Additional information is available at http://www.bioinformatica.unito.it/bioinformatics/snoRNAs.

  13. Long non-coding RNA PlncRNA-1 promotes cell proliferation and hepatic metastasis in colorectal cancer.

    PubMed

    Jia, Gui-Qing; Zhang, Ming-Ming; Wang, Kang; Zhao, Gao-Ping; Pang, Ming-Hui; Chen, Zhe-Yu

    2018-05-08

    Emerging evidence has identified that long non-coding RNAs (lncRNAs) may play an important role in the pathogenesis of many cancer types, including colorectal cancer (CRC). However, the role of PlncRNA-1 in CRC remains unclear. The aim of our present study was to investigate the potential functions of PlncRNA-1 in CRC and to identify the underlying mechanisms of action. We demonstrated that up-regulated PlncRNA-1 in CRC tissues and cells promoted cell proliferation by accelerating cell cycle process and inhibiting cell apoptosis in vitro, enhanced tumor growth and matastasis in vivo and was associated with cell migration and invasion, EMT process of CRC cells. In addition, PlncRNA-1 was a target of miR-204 and enhanced the expression of an endogenous miR-204 target, MMP9 in CRC cells. Furthermore, we found that PlncRNA-1 activates Wnt/β-catenin pathway through the miR-204 in CRC cells. These results suggest that the PlncRNA-1/miR-204/ Wnt/β-catenin regulatory network may shed light on tumorigenesis in CRC. © 2018 Wiley Periodicals, Inc.

  14. Extending Mondrian Memory Protection

    DTIC Science & Technology

    2010-11-01

    a kernel semaphore is locked or unlocked. In addition, we extended the system call interface to receive notifications about user-land locking...operations (such as calls to the mutex and semaphore code provided by the C library). By patching the dynamically loadable GLibC5, we are able to test... semaphores , and spinlocks. RTO-MP-IST-091 10- 9 Extending Mondrian Memory Protection to loading extension plugins. This prevents any untrusted code

  15. A deep learning method for lincRNA detection using auto-encoder algorithm.

    PubMed

    Yu, Ning; Yu, Zeng; Pan, Yi

    2017-12-06

    RNA sequencing technique (RNA-seq) enables scientists to develop novel data-driven methods for discovering more unidentified lincRNAs. Meantime, knowledge-based technologies are experiencing a potential revolution ignited by the new deep learning methods. By scanning the newly found data set from RNA-seq, scientists have found that: (1) the expression of lincRNAs appears to be regulated, that is, the relevance exists along the DNA sequences; (2) lincRNAs contain some conversed patterns/motifs tethered together by non-conserved regions. The two evidences give the reasoning for adopting knowledge-based deep learning methods in lincRNA detection. Similar to coding region transcription, non-coding regions are split at transcriptional sites. However, regulatory RNAs rather than message RNAs are generated. That is, the transcribed RNAs participate the biological process as regulatory units instead of generating proteins. Identifying these transcriptional regions from non-coding regions is the first step towards lincRNA recognition. The auto-encoder method achieves 100% and 92.4% prediction accuracy on transcription sites over the putative data sets. The experimental results also show the excellent performance of predictive deep neural network on the lincRNA data sets compared with support vector machine and traditional neural network. In addition, it is validated through the newly discovered lincRNA data set and one unreported transcription site is found by feeding the whole annotated sequences through the deep learning machine, which indicates that deep learning method has the extensive ability for lincRNA prediction. The transcriptional sequences of lincRNAs are collected from the annotated human DNA genome data. Subsequently, a two-layer deep neural network is developed for the lincRNA detection, which adopts the auto-encoder algorithm and utilizes different encoding schemes to obtain the best performance over intergenic DNA sequence data. Driven by those newly annotated lincRNA data, deep learning methods based on auto-encoder algorithm can exert their capability in knowledge learning in order to capture the useful features and the information correlation along DNA genome sequences for lincRNA detection. As our knowledge, this is the first application to adopt the deep learning techniques for identifying lincRNA transcription sequences.

  16. The Non-Coding RNA Ontology (NCRO): a comprehensive resource for the unification of non-coding RNA biology.

    PubMed

    Huang, Jingshan; Eilbeck, Karen; Smith, Barry; Blake, Judith A; Dou, Dejing; Huang, Weili; Natale, Darren A; Ruttenberg, Alan; Huan, Jun; Zimmermann, Michael T; Jiang, Guoqian; Lin, Yu; Wu, Bin; Strachan, Harrison J; He, Yongqun; Zhang, Shaojie; Wang, Xiaowei; Liu, Zixing; Borchert, Glen M; Tan, Ming

    2016-01-01

    In recent years, sequencing technologies have enabled the identification of a wide range of non-coding RNAs (ncRNAs). Unfortunately, annotation and integration of ncRNA data has lagged behind their identification. Given the large quantity of information being obtained in this area, there emerges an urgent need to integrate what is being discovered by a broad range of relevant communities. To this end, the Non-Coding RNA Ontology (NCRO) is being developed to provide a systematically structured and precisely defined controlled vocabulary for the domain of ncRNAs, thereby facilitating the discovery, curation, analysis, exchange, and reasoning of data about structures of ncRNAs, their molecular and cellular functions, and their impacts upon phenotypes. The goal of NCRO is to serve as a common resource for annotations of diverse research in a way that will significantly enhance integrative and comparative analysis of the myriad resources currently housed in disparate sources. It is our belief that the NCRO ontology can perform an important role in the comprehensive unification of ncRNA biology and, indeed, fill a critical gap in both the Open Biological and Biomedical Ontologies (OBO) Library and the National Center for Biomedical Ontology (NCBO) BioPortal. Our initial focus is on the ontological representation of small regulatory ncRNAs, which we see as the first step in providing a resource for the annotation of data about all forms of ncRNAs. The NCRO ontology is free and open to all users, accessible at: http://purl.obolibrary.org/obo/ncro.owl.

  17. The analysis of convolutional codes via the extended Smith algorithm

    NASA Technical Reports Server (NTRS)

    Mceliece, R. J.; Onyszchuk, I.

    1993-01-01

    Convolutional codes have been the central part of most error-control systems in deep-space communication for many years. Almost all such applications, however, have used the restricted class of (n,1), also known as 'rate 1/n,' convolutional codes. The more general class of (n,k) convolutional codes contains many potentially useful codes, but their algebraic theory is difficult and has proved to be a stumbling block in the evolution of convolutional coding systems. In this article, the situation is improved by describing a set of practical algorithms for computing certain basic things about a convolutional code (among them the degree, the Forney indices, a minimal generator matrix, and a parity-check matrix), which are usually needed before a system using the code can be built. The approach is based on the classic Forney theory for convolutional codes, together with the extended Smith algorithm for polynomial matrices, which is introduced in this article.

  18. Transcripts with in silico predicted RNA structure are enriched everywhere in the mouse brain

    PubMed Central

    2012-01-01

    Background Post-transcriptional control of gene expression is mostly conducted by specific elements in untranslated regions (UTRs) of mRNAs, in collaboration with specific binding proteins and RNAs. In several well characterized cases, these RNA elements are known to form stable secondary structures. RNA secondary structures also may have major functional implications for long noncoding RNAs (lncRNAs). Recent transcriptional data has indicated the importance of lncRNAs in brain development and function. However, no methodical efforts to investigate this have been undertaken. Here, we aim to systematically analyze the potential for RNA structure in brain-expressed transcripts. Results By comprehensive spatial expression analysis of the adult mouse in situ hybridization data of the Allen Mouse Brain Atlas, we show that transcripts (coding as well as non-coding) associated with in silico predicted structured probes are highly and significantly enriched in almost all analyzed brain regions. Functional implications of these RNA structures and their role in the brain are discussed in detail along with specific examples. We observe that mRNAs with a structure prediction in their UTRs are enriched for binding, transport and localization gene ontology categories. In addition, after manual examination we observe agreement between RNA binding protein interaction sites near the 3’ UTR structures and correlated expression patterns. Conclusions Our results show a potential use for RNA structures in expressed coding as well as noncoding transcripts in the adult mouse brain, and describe the role of structured RNAs in the context of intracellular signaling pathways and regulatory networks. Based on this data we hypothesize that RNA structure is widely involved in transcriptional and translational regulatory mechanisms in the brain and ultimately plays a role in brain function. PMID:22651826

  19. microRNA-122 target sites in the hepatitis C virus RNA NS5B coding region and 3' untranslated region: function in replication and influence of RNA secondary structure.

    PubMed

    Gerresheim, Gesche K; Dünnes, Nadia; Nieder-Röhrmann, Anika; Shalamova, Lyudmila A; Fricke, Markus; Hofacker, Ivo; Höner Zu Siederdissen, Christian; Marz, Manja; Niepmann, Michael

    2017-02-01

    We have analyzed the binding of the liver-specific microRNA-122 (miR-122) to three conserved target sites of hepatitis C virus (HCV) RNA, two in the non-structural protein 5B (NS5B) coding region and one in the 3' untranslated region (3'UTR). miR-122 binding efficiency strongly depends on target site accessibility under conditions when the range of flanking sequences available for the formation of local RNA secondary structures changes. Our results indicate that the particular sequence feature that contributes most to the correlation between target site accessibility and binding strength varies between different target sites. This suggests that the dynamics of miRNA/Ago2 binding not only depends on the target site itself but also on flanking sequence context to a considerable extent, in particular in a small viral genome in which strong selection constraints act on coding sequence and overlapping cis-signals and model the accessibility of cis-signals. In full-length genomes, single and combination mutations in the miR-122 target sites reveal that site 5B.2 is positively involved in regulating overall genome replication efficiency, whereas mutation of site 5B.3 showed a weaker effect. Mutation of the 3'UTR site and double or triple mutants showed no significant overall effect on genome replication, whereas in a translation reporter RNA, the 3'UTR target site inhibits translation directed by the HCV 5'UTR. Thus, the miR-122 target sites in the 3'-region of the HCV genome are involved in a complex interplay in regulating different steps of the HCV replication cycle.

  20. Interleukin-1 homologues IL-1F7b and IL-18 contain functional mRNA instability elements within the coding region responsive to lipopolysaccharide

    PubMed Central

    2004-01-01

    IL-1F7b, a novel homologue of the IL-1 (interleukin 1) family, was discovered by computational cloning. We demonstrated that IL-1F7b shares critical amino acid residues with IL-18 and binds to the IL-18-binding protein enhancing its ability to inhibit IL-18-induced interferon-γ. We also showed that low levels of IL-1F7b are constitutively present intracellularly in human blood monocytes. In this study, we demonstrate that similar to IL-18, both mRNA and intracellular protein expression of IL-1F7b are up-regulated by LPS (lipopolysaccharide) in human monocytes. In stable transfectants of murine RAW264.7 macrophage cells, there was no IL-1F7b protein expression despite a highly active CMV promoter. We found that IL-1F7b-specific mRNA was rapidly degraded in transfected cells, via a 3′-UTR (untranslated region)-independent control of IL-1F7b transcript stability. After LPS stimulation, there was a rapid transient increase in IL-1F7b-specific mRNA and concomitant protein levels. Using sequence alignment, we found a conserved ten-nucleotide homology box within the open reading frame of IL-F7b, which is flanking the coding region instability elements of some selective genes. In-frame deletion of downstream exon 5 from the full-length IL-1F7b cDNA markedly increased the levels of IL-1F7b mRNA. A similar coding region element is located in IL-18. When transfected into RAW264.7 macrophages, IL-18 mRNA was also unstable unless treated with LPS. These results indicate that both IL-1F7b and IL-18 mRNA contain functional instability determinants within their coding region, which influence mRNA decay as a novel mechanism to regulate the expression of IL-1 family members. PMID:15046617

  1. DNA methylation of miRNA coding sequences putatively associated with childhood obesity.

    PubMed

    Mansego, M L; Garcia-Lacarte, M; Milagro, F I; Marti, A; Martinez, J A

    2017-02-01

    Epigenetic mechanisms may be involved in obesity onset and its consequences. The aim of the present study was to evaluate whether DNA methylation status in microRNA (miRNA) coding regions is associated with childhood obesity. DNA isolated from white blood cells of 24 children (identification sample: 12 obese and 12 non-obese) from the Grupo Navarro de Obesidad Infantil study was hybridized in a 450 K methylation microarray. Several CpGs whose DNA methylation levels were statistically different between obese and non-obese were validated by MassArray® in 95 children (validation sample) from the same study. Microarray analysis identified 16 differentially methylated CpGs between both groups (6 hypermethylated and 10 hypomethylated). DNA methylation levels in miR-1203, miR-412 and miR-216A coding regions significantly correlated with body mass index standard deviation score (BMI-SDS) and explained up to 40% of the variation of BMI-SDS. The network analysis identified 19 well-defined obesity-relevant biological pathways from the KEGG database. MassArray® validation identified three regions located in or near miR-1203, miR-412 and miR-216A coding regions differentially methylated between obese and non-obese children. The current work identified three CpG sites located in coding regions of three miRNAs (miR-1203, miR-412 and miR-216A) that were differentially methylated between obese and non-obese children, suggesting a role of miRNA epigenetic regulation in childhood obesity. © 2016 World Obesity Federation.

  2. Identification of novel non-coding small RNAs from Streptococcus pneumoniae TIGR4 using high-resolution genome tiling arrays

    PubMed Central

    2010-01-01

    Background The identification of non-coding transcripts in human, mouse, and Escherichia coli has revealed their widespread occurrence and functional importance in both eukaryotic and prokaryotic life. In prokaryotes, studies have shown that non-coding transcripts participate in a broad range of cellular functions like gene regulation, stress and virulence. However, very little is known about non-coding transcripts in Streptococcus pneumoniae (pneumococcus), an obligate human respiratory pathogen responsible for significant worldwide morbidity and mortality. Tiling microarrays enable genome wide mRNA profiling as well as identification of novel transcripts at a high-resolution. Results Here, we describe a high-resolution transcription map of the S. pneumoniae clinical isolate TIGR4 using genomic tiling arrays. Our results indicate that approximately 66% of the genome is expressed under our experimental conditions. We identified a total of 50 non-coding small RNAs (sRNAs) from the intergenic regions, of which 36 had no predicted function. Half of the identified sRNA sequences were found to be unique to S. pneumoniae genome. We identified eight overrepresented sequence motifs among sRNA sequences that correspond to sRNAs in different functional categories. Tiling arrays also identified approximately 202 operon structures in the genome. Conclusions In summary, the pneumococcal operon structures and novel sRNAs identified in this study enhance our understanding of the complexity and extent of the pneumococcal 'expressed' genome. Furthermore, the results of this study open up new avenues of research for understanding the complex RNA regulatory network governing S. pneumoniae physiology and virulence. PMID:20525227

  3. Does the Genetic Code Have A Eukaryotic Origin?

    PubMed Central

    Zhang, Zhang; Yu, Jun

    2013-01-01

    In the RNA world, RNA is assumed to be the dominant macromolecule performing most, if not all, core “house-keeping” functions. The ribo-cell hypothesis suggests that the genetic code and the translation machinery may both be born of the RNA world, and the introduction of DNA to ribo-cells may take over the informational role of RNA gradually, such as a mature set of genetic code and mechanism enabling stable inheritance of sequence and its variation. In this context, we modeled the genetic code in two content variables—GC and purine contents—of protein-coding sequences and measured the purine content sensitivities for each codon when the sensitivity (% usage) is plotted as a function of GC content variation. The analysis leads to a new pattern—the symmetric pattern—where the sensitivity of purine content variation shows diagonally symmetry in the codon table more significantly in the two GC content invariable quarters in addition to the two existing patterns where the table is divided into either four GC content sensitivity quarters or two amino acid diversity halves. The most insensitive codon sets are GUN (valine) and CAN (CAR for asparagine and CAY for aspartic acid) and the most biased amino acid is valine (always over-estimated) followed by alanine (always under-estimated). The unique position of valine and its codons suggests its key roles in the final recruitment of the complete codon set of the canonical table. The distinct choice may only be attributable to sequence signatures or signals of splice sites for spliceosomal introns shared by all extant eukaryotes. PMID:23402863

  4. Stable RNA nanoparticles as potential new generation drugs for cancer therapy☆

    PubMed Central

    Shu, Yi; Pi, Fengmei; Sharma, Ashwani; Rajabi, Mehdi; Haque, Farzin; Shu, Dan; Leggas, Markos; Evers, B. Mark; Guo, Peixuan

    2014-01-01

    Human genome sequencing revealed that only ~1.5% of the DNA sequence coded for proteins. More and more evidence has uncovered that a substantial part of the 98.5% so-called “junk” DNAs actually code for noncoding RNAs. Two milestones, chemical drugs and protein drugs, have already appeared in the history of drug development, and it is expected that the third milestone in drug development will be RNA drugs or drugs that target RNA. This review focuses on the development of RNA therapeutics for potential cancer treatment by applying RNA nanotechnology. A therapeutic RNA nanoparticle is unique in that its scaffold, ligand, and therapeutic component can all be composed of RNA. The special physicochemical properties lend to the delivery of siRNA, miRNA, ribozymes, or riboswitches; imaging using fluogenenic RNA; and targeting using RNA aptamers. With recent advances in solving the chemical, enzymatic, and thermodynamic stability issues, RNA nanoparticles have been found to be advantageous for in vivo applications due to their uniform nano-scale size, precise stoichiometry, polyvalent nature, low immunogenicity, low toxicity, and target specificity. In vivo animal studies have revealed that RNA nanoparticles can specifically target tumors with favorable pharmacokinetic and pharmacodynamic parameters without unwanted accumulation in normal organs. This review summarizes the key studies that have led to the detailed understanding of RNA nanoparticle formation as well as chemical and thermodynamic stability issue. The methods for RNA nanoparticle construction, and the current challenges in the clinical application of RNA nanotechnology, such as endosome trapping and production costs, are also discussed. PMID:24270010

  5. Characterization of regulatory elements within the coat protein (CP) coding region of Tobacco mosaic virus affecting subgenomic transcription and green fluorescent protein expression from the CP subgenomic RNA promoter.

    PubMed

    Man, Michal; Epel, Bernard L

    2004-06-01

    A replicon based on Tobacco mosaic virus that was engineered to express the open reading frame (ORF) of the green fluorescent protein (GFP) gene in place of the native coat protein (CP) gene from a minimal CP subgenomic (sg) RNA promoter was found to accumulate very low levels of GFP. Regulatory regions within the CP ORF were identified that, when presented as untranslated regions flanking the GFP ORF, enhanced or inhibited sg transcription and GFP expression. Full GFP expression from the CP sgRNA promoter required more than the first 20 nt of the CP ORF but not beyond the first 56 nt. Further analysis indicated the presence of an enhancer element between nt +25 and +55 with respect to the CP translation start site. The inclusion of this enhancer sequence upstream of the GFP ORF led to elevated sg transcription and to a 50-fold increase in GFP accumulation in comparison with a minimal CP promoter in which the entire CP ORF was displaced by the GFP ORF. Inclusion of the 3'-terminal 22 nt had a minor positive effect on GFP accumulation, but the addition of extended untranslated sequences from the 3' terminus of the CP ORF downstream of the GFP ORF was basically found to inhibit sg transcription. Secondary structure analysis programs predicted the CP sgRNA promoter to reside within two stable stem-loop structures, which are followed by an enhancer region.

  6. Combined actions of multiple hairpin loop structures and sites of rate-limiting endonucleolytic cleavage determine differential degradation rates of individual segments within polycistronic puf operon mRNA.

    PubMed Central

    Klug, G; Cohen, S N

    1990-01-01

    Differential expression of the genes within the puf operon of Rhodobacter capsulatus is accomplished in part by differences in the rate of degradation of different segments of the puf transcript. We report here that decay of puf mRNA sequences specifying the light-harvesting I (LHI) and reaction center (RC) photosynthetic membrane peptides is initiated endoribonucleolytically within a discrete 1.4-kilobase segment of the RC-coding region. Deletion of this segment increased the half-life of the RC-coding region from 8 to 20 min while not affecting decay of LHI-coding sequences upstream from an intercistronic hairpin loop structure shown previously to impede 3'-to-5' degradation. Prolongation of RC segment half-life was dependent on the presence of other hairpin structures 3' to the RC region. Inserting the endonuclease-sensitive sites into the LHI-coding segment markedly accelerated its degradation. Our results suggest that differential degradation of the RC- and LHI-coding segments of puf mRNA is accomplished at least in part by the combined actions of RC region-specific endonuclease(s), one or more exonucleases, and several strategically located exonuclease-impeding hairpins. Images PMID:2394682

  7. The CASC15 long intergenic non-coding RNA locus is involved in melanoma progression and phenotype-switching

    PubMed Central

    Lessard, Laurent; Liu, Michelle; Marzese, Diego M.; Wang, Hongwei; Chong, Kelly; Kawas, Neal; Donovan, Nicholas C; Kiyohara, Eiji; Hsu, Sandy; Nelson, Nellie; Izraely, Sivan; Sagi-Assif, Orit; Witz, Isaac P; Ma, Xiao-Jun; Luo, Yuling; Hoon, Dave SB

    2015-01-01

    In recent years, considerable advances have been made in the characterization of protein-coding alterations involved in the pathogenesis of melanoma. However, despite their growing implication in cancer, little is known about the role of long non-coding RNAs in melanoma progression. We hypothesized that copy number alterations of intergenic non-protein coding domains could help identify long intergenic non-coding RNAs (lincRNAs) associated with metastatic cutaneous melanoma. Among several candidates, our approach uncovered the chromosome 6p22.3 CASC15 lincRNA locus as a frequently gained genomic segment in metastatic melanoma tumors and cell lines. The locus was actively transcribed in metastatic melanoma cells, and up-regulation of CASC15 expression was associated with metastatic progression to brain metastasis in a mouse xenograft model. In clinical specimens, CASC15 levels increased during melanoma progression and were independent predictors of disease recurrence in a cohort of 141 patients with AJCC stage III lymph node metastasis. Moreover, siRNA knockdown experiments revealed that CASC15 regulates melanoma cell phenotype switching between proliferative and invasive states. Accordingly, CASC15 levels correlated with known gene signatures corresponding to melanoma proliferative and invasive phenotypes. These findings support a key role for CASC15 in metastatic melanoma. PMID:26016895

  8. Structural and functional analysis of 5S rRNA in Saccharomyces cerevisiae

    PubMed Central

    Kiparisov, S.; Sergiev, P. V.; Dontsova, O. A.; Petrov, A.; Meskauskas, A.; Dinman, J. D.

    2005-01-01

    5S rRNA extends from the central protuberance of the large ribosomal subunit, through the A-site finger, and down to the GTPase-associated center. Here, we present a structure-function analysis of seven 5S rRNA alleles which are sufficient for viability in the yeast Saccharomyces cerevisiae when expressed in the absence of wild-type 5S rRNAs, and extend this analysis using a large bank of mutant alleles that show semidominant phenotypes in the presence of wild-type 5S rRNA. This analysis supports the hypothesis that 5S rRNA serves to link together several different functional centers of the ribosome. Data are also presented which suggest that in eukaryotic genomes selection has favored the maintenance of multiple alleles of 5S rRNA, and that these may provide cells with a mechanism to post-transcriptionally regulate gene expression. PMID:16047201

  9. CRNDE: An important oncogenic long non-coding RNA in human cancers.

    PubMed

    Zhang, Jiaming; Yin, Minuo; Peng, Gang; Zhao, Yingchao

    2018-06-01

    Aberrant overexpression of long non-coding RNA CRNDE (Colorectal Neoplasia Differentially Expressed) is confirmed in various human cancers, which is correlated with advanced clinicopathological features and poor prognosis. CRNDE promotes cancer cell proliferation, migration and invasion, and suppresses apoptosis in complicated mechanisms, which result in the initialization and development of human cancers. In this review, we provide an overview of the oncogenic role and potential clinical applications of CRNDE. © 2018 John Wiley & Sons Ltd.

  10. Long Non-Coding RNA CASC2 Improves Diabetic Nephropathy by Inhibiting JNK Pathway.

    PubMed

    Yang, Huihui; Kan, Quan E; Su, Yong; Man, Hua

    2018-06-11

    It's known that long non-coding RNA CASC2 overexpression inhibit the JNK pathway in some disease models, while JNK pathway activation exacerbates diabetic nephropathy. Therefore we speculate that long non-coding RNA CASC2 can improve diabetic nephropathy by inhibiting JNK pathway. Thus, our study was carried out to investigate the involvement of CASC2 in diabetic nephropathy. We found that serum level of CASC2 was significantly lower in diabetic nephropathy patients than in normal people, and serum level of CASC2 showed no significant correlations with age, gender, alcohol consumption and smoking habits, but was correlated with course of disease. ROC curve analysis showed that serum level of CASC2 could be used to accurately predict diabetic nephropathy. Diabetes mellitus has many complications. This study also included a series of complications of diabetes, such as diabetic retinopathy, diabetic ketoacidosis, diabetic foot infections and diabetic cardiopathy, while serum level of CASC2 was specifically reduced in diabetic nephropathy. CASC2 expression level decreased, while JNK1 phosphorylation level increased in mouse podocyte cells treated with high glucose. CASC2 overexpression inhibited apoptosis of podocyte cells and reduced phosphorylation level of JNK1. We conclude that long non-coding RNA CASC2 may improve diabetic nephropathy by inhibiting JNK pathway. © Georg Thieme Verlag KG Stuttgart · New York.

  11. Ontological function annotation of long non-coding RNAs through hierarchical multi-label classification.

    PubMed

    Zhang, Jingpu; Zhang, Zuping; Wang, Zixiang; Liu, Yuting; Deng, Lei

    2018-05-15

    Long non-coding RNAs (lncRNAs) are an enormous collection of functional non-coding RNAs. Over the past decades, a large number of novel lncRNA genes have been identified. However, most of the lncRNAs remain function uncharacterized at present. Computational approaches provide a new insight to understand the potential functional implications of lncRNAs. Considering that each lncRNA may have multiple functions and a function may be further specialized into sub-functions, here we describe NeuraNetL2GO, a computational ontological function prediction approach for lncRNAs using hierarchical multi-label classification strategy based on multiple neural networks. The neural networks are incrementally trained level by level, each performing the prediction of gene ontology (GO) terms belonging to a given level. In NeuraNetL2GO, we use topological features of the lncRNA similarity network as the input of the neural networks and employ the output results to annotate the lncRNAs. We show that NeuraNetL2GO achieves the best performance and the overall advantage in maximum F-measure and coverage on the manually annotated lncRNA2GO-55 dataset compared to other state-of-the-art methods. The source code and data are available at http://denglab.org/NeuraNetL2GO/. leideng@csu.edu.cn. Supplementary data are available at Bioinformatics online.

  12. Nodeomics: Pathogen Detection in Vertebrate Lymph Nodes Using Meta-Transcriptomics

    USGS Publications Warehouse

    Wittekindt, Nicola E.; Padhi, Abinash; Schuster, Stephan C.; Qi, Ji; Zhao, Fangqing; Tomsho, Lynn P.; Kasson, Lindsay R.; Packard, Michael; Cross, Paul C.; Poss, Mary

    2010-01-01

    The ongoing emergence of human infections originating from wildlife highlights the need for better knowledge of the microbial community in wildlife species where traditional diagnostic approaches are limited. Here we evaluate the microbial biota in healthy mule deer (Odocoileus hemionus) by analyses of lymph node meta-transcriptomes. cDNA libraries from five individuals and two pools of samples were prepared from retropharyngeal lymph node RNA enriched for polyadenylated RNA and sequenced using Roche-454 Life Sciences technology. Protein-coding and 16S ribosomal RNA (rRNA) sequences were taxonomically profiled using protein and rRNA specific databases. Representatives of all bacterial phyla were detected in the seven libraries based on protein-coding transcripts indicating that viable microbiota were present in lymph nodes. Residents of skin and rumen, and those ubiquitous in mule deer habitat dominated classifiable bacterial species. Based on detection of both rRNA and protein-coding transcripts, we identified two new proteobacterial species; a Helicobacter closely related to Helicobacter cetorum in the Helicobacter pylori/Helicobacter acinonychis complex and an Acinetobacter related to Acinetobacter schindleri. Among viruses, a novel gamma retrovirus and other members of the Poxviridae and Retroviridae were identified. We additionally evaluated bacterial diversity by amplicon sequencing the hypervariable V6 region of 16S rRNA and demonstrate that overall taxonomic diversity is higher with the meta-transcriptomic approach. These data provide the most complete picture to date of the microbial diversity within a wildlife host. Our research advances the use of meta-transcriptomics to study microbiota in wildlife tissues, which will facilitate detection of novel organisms with pathogenic potential to human and animals.

  13. Downregulation of long non-coding RNA LET predicts poor prognosis and increases Notch signaling in non-small cell lung cancer

    PubMed Central

    Li, Shengwen; Zhao, Hui; Li, Jianqiang; Zhang, Aizheng; Wang, Haibin

    2018-01-01

    Long non-coding RNAs (lncRNAs) have been found to be dysregulated in a variety of tumors. The lncRNA-Low Expression in Tumor (LET) is a recently identified lncRNA, but its expression pattern and biological significance in human non-small cell lung cancer (NSCLC) are still largely unknown. In this study, we found that lncRNA-LET was significantly downregulated in human NSCLC lung tissues and cell lines. Decreased lncRNA-LET expression was strongly associated with advanced tumor stages and poorer overall survival of NSCLC patients. Functionally, overexpression of lncRNA-LET in NSCLC H292 cells significantly suppressed cell proliferation, migration and invasion, and promoted cell cycle arrest and apoptosis, while knockdown of lncRNA-LET in NSCLC H1975 cells showed an opposite effect, pointing to a tumor-suppressive role for lncRNA-LET in NSCLC. Mechanistically, we demonstrated that lncRNA-LET overexpression significantly reduced the expression of Notch1 intracellular Domain (NICD1) in H292 cells while knockdown of lncRNA-LET increased NICD1 expression in H1975 cells. Similarly, NSCLC lung tissues with high levels of lncRNA-LET had lower NICD1 expression. Thus, our results provide a strong rationale for lncRNA-LET to be used as a prognostic indicator and a potent therapeutic target for NSCLC patients, and highlight a novel lncRNA-LET/Notch axis in regulating NSCLC cell fate and tumor progression. PMID:29416684

  14. Fast decoding techniques for extended single-and-double-error-correcting Reed Solomon codes

    NASA Technical Reports Server (NTRS)

    Costello, D. J., Jr.; Deng, H.; Lin, S.

    1984-01-01

    A problem in designing semiconductor memories is to provide some measure of error control without requiring excessive coding overhead or decoding time. For example, some 256K-bit dynamic random access memories are organized as 32K x 8 bit-bytes. Byte-oriented codes such as Reed Solomon (RS) codes provide efficient low overhead error control for such memories. However, the standard iterative algorithm for decoding RS codes is too slow for these applications. Some special high speed decoding techniques for extended single and double error correcting RS codes. These techniques are designed to find the error locations and the error values directly from the syndrome without having to form the error locator polynomial and solve for its roots.

  15. Cross-site comparison of ribosomal depletion kits for Illumina RNAseq library construction.

    PubMed

    Herbert, Zachary T; Kershner, Jamie P; Butty, Vincent L; Thimmapuram, Jyothi; Choudhari, Sulbha; Alekseyev, Yuriy O; Fan, Jun; Podnar, Jessica W; Wilcox, Edward; Gipson, Jenny; Gillaspy, Allison; Jepsen, Kristen; BonDurant, Sandra Splinter; Morris, Krystalynne; Berkeley, Maura; LeClerc, Ashley; Simpson, Stephen D; Sommerville, Gary; Grimmett, Leslie; Adams, Marie; Levine, Stuart S

    2018-03-15

    Ribosomal RNA (rRNA) comprises at least 90% of total RNA extracted from mammalian tissue or cell line samples. Informative transcriptional profiling using massively parallel sequencing technologies requires either enrichment of mature poly-adenylated transcripts or targeted depletion of the rRNA fraction. The latter method is of particular interest because it is compatible with degraded samples such as those extracted from FFPE and also captures transcripts that are not poly-adenylated such as some non-coding RNAs. Here we provide a cross-site study that evaluates the performance of ribosomal RNA removal kits from Illumina, Takara/Clontech, Kapa Biosystems, Lexogen, New England Biolabs and Qiagen on intact and degraded RNA samples. We find that all of the kits are capable of performing significant ribosomal depletion, though there are differences in their ease of use. All kits were able to remove ribosomal RNA to below 20% with intact RNA and identify ~ 14,000 protein coding genes from the Universal Human Reference RNA sample at >1FPKM. Analysis of differentially detected genes between kits suggests that transcript length may be a key factor in library production efficiency. These results provide a roadmap for labs on the strengths of each of these methods and how best to utilize them.

  16. Conserved Curvature of RNA Polymerase I Core Promoter Beyond rRNA Genes: The Case of the Tritryps

    PubMed Central

    Smircich, Pablo; Duhagon, María Ana; Garat, Beatriz

    2015-01-01

    In trypanosomatids, the RNA polymerase I (RNAPI)-dependent promoters controlling the ribosomal RNA (rRNA) genes have been well identified. Although the RNAPI transcription machinery recognizes the DNA conformation instead of the DNA sequence of promoters, no conformational study has been reported for these promoters. Here we present the in silico analysis of the intrinsic DNA curvature of the rRNA gene core promoters in Trypanosoma brucei, Trypanosoma cruzi, and Leishmania major. We found that, in spite of the absence of sequence conservation, these promoters hold conformational properties similar to other eukaryotic rRNA promoters. Our results also indicated that the intrinsic DNA curvature pattern is conserved within the Leishmania genus and also among strains of T. cruzi and T. brucei. Furthermore, we analyzed the impact of point mutations on the intrinsic curvature and their impact on the promoter activity. Furthermore, we found that the core promoters of protein-coding genes transcribed by RNAPI in T. brucei show the same conserved conformational characteristics. Overall, our results indicate that DNA intrinsic curvature of the rRNA gene core promoters is conserved in these ancient eukaryotes and such conserved curvature might be a requirement of RNAPI machinery for transcription of not only rRNA genes but also protein-coding genes. PMID:26718450

  17. MicroRNAs and non-coding RNAs in virus-infected cells

    PubMed Central

    Ouellet, Dominique L.; Provost, Patrick

    2010-01-01

    Within the past few years, microRNAs (miRNAs) and other non-coding RNAs (ncRNAs) have emerged as elements with critically high importance in post-transcriptional control of cellular and, more recently, viral processes. Endogenously produced by a component of the miRNA-guided RNA silencing machinery known as Dicer, miRNAs are known to control messenger RNA (mRNA) translation through recognition of specific binding sites usually located in their 3′ untranslated region. Recent evidences indicate that the host miRNA pathway may represent an adapted antiviral defense mechanism that can act either by direct miRNA-mediated modulation of viral gene expression or through recognition and inactivation of structured viral RNA species by the protein components of the RNA silencing machinery, such as Dicer. This latter process, however, is a double-edge sword, as it may yield viral miRNAs exerting gene regulatory properties on both host and viral mRNAs. Our knowledge of the interaction between viruses and host RNA silencing machineries, and how this influences the course of infection, is becoming increasingly complex. This review article aims to summarize our current knowledge about viral miRNAs/ncRNAs and their targets, as well as cellular miRNAs that are modulated by viruses upon infection. PMID:20217543

  18. mTOR referees memory and disease through mRNA repression and competition.

    PubMed

    Raab-Graham, Kimberly F; Niere, Farr

    2017-06-01

    Mammalian target of rapamycin (mTOR) activity is required for memory and is dysregulated in disease. Activation of mTOR promotes protein synthesis; however, new studies are demonstrating that mTOR activity also represses the translation of mRNAs. Almost three decades ago, Kandel and colleagues hypothesised that memory was due to the induction of positive regulators and removal of negative constraints. Are these negative constraints repressed mRNAs that code for proteins that block memory formation? Herein, we will discuss the mRNAs coded by putative memory suppressors, how activation/inactivation of mTOR repress protein expression at the synapse, how mTOR activity regulates RNA binding proteins, mRNA stability, and translation, and what the possible implications of mRNA repression are to memory and neurodegenerative disorders. © 2017 Federation of European Biochemical Societies.

  19. Influence of genome-scale RNA structure disruption on the replication of murine norovirus—similar replication kinetics in cell culture but attenuation of viral fitness in vivo

    PubMed Central

    McFadden, Nora; Arias, Armando; Dry, Inga; Bailey, Dalan; Witteveldt, Jeroen; Evans, David J.; Goodfellow, Ian; Simmonds, Peter

    2013-01-01

    Mechanisms by which certain RNA viruses, such as hepatitis C virus, establish persistent infections and cause chronic disease are of fundamental importance in viral pathogenesis. Mammalian positive-stranded RNA viruses establishing persistence typically possess genome-scale ordered RNA secondary structure (GORS) in their genomes. Murine norovirus (MNV) persists in immunocompetent mice and provides an experimental model to functionally characterize GORS. Substitution mutants were constructed with coding sequences in NS3/4- and NS6/7-coding regions replaced with sequences with identical coding and (di-)nucleotide composition but disrupted RNA secondary structure (F1, F2, F1/F2 mutants). Mutants replicated with similar kinetics to wild-type (WT) MNV3 in RAW264.7 cells and primary macrophages, exhibited similar (highly restricted) induction and susceptibility to interferon-coupled cellular responses and equal replication fitness by serial passaging of co-cultures. In vivo, both WT and F1/F2 mutant viruses persistently infected mice, although F1, F2 and F1/F2 mutant viruses were rapidly eliminated 1–7 days post-inoculation in competition experiments with WT. F1/F2 mutants recovered from tissues at 9 months showed higher synonymous substitution rates than WT and nucleotide substitutions that potentially restored of RNA secondary structure. GORS plays no role in basic replication of MNV but potentially contributes to viral fitness and persistence in vivo. PMID:23630317

  20. The CRISPR RNA-guided surveillance complex in Escherichia coli accommodates extended RNA spacers

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Luo, Michelle L.; Jackson, Ryan N.; Denny, Steven R.

    Bacteria and archaea acquire resistance to foreign genetic elements by integrating fragments of foreign DNA into CRISPR (clustered regularly interspaced short palindromic repeats) loci. In Escherichia coli, CRISPR-derived RNAs (crRNAs) assemble with Cas proteins into a multi-subunit surveillance complex called Cascade (CRISPR-associated complex for antiviral defense). Cascade recognizes DNA targets via protein-mediated recognition of a protospacer adjacent motif and complementary base pairing between the crRNA spacer and the DNA target. Previously determined structures of Cascade showed that the crRNA is stretched along an oligomeric protein assembly, leading us to ask how crRNA length impacts the assembly and function of thismore » complex. We found that extending the spacer portion of the crRNA resulted in larger Cascade complexes with altered stoichiometry and preserved in vitro binding affinity for target DNA. Longer spacers also preserved the in vivo ability of Cascade to repress target gene expression and to recruit the Cas3 endonuclease for target degradation. Lastly, longer spacers exhibited enhanced silencing at particular target locations and were sensitive to mismatches within the extended region. These findings demonstrate the flexibility of the Type I-E CRISPR machinery and suggest that spacer length can be modified to fine-tune Cascade activity.« less

  1. The CRISPR RNA-guided surveillance complex in Escherichia coli accommodates extended RNA spacers

    DOE PAGES

    Luo, Michelle L.; Jackson, Ryan N.; Denny, Steven R.; ...

    2016-05-12

    Bacteria and archaea acquire resistance to foreign genetic elements by integrating fragments of foreign DNA into CRISPR (clustered regularly interspaced short palindromic repeats) loci. In Escherichia coli, CRISPR-derived RNAs (crRNAs) assemble with Cas proteins into a multi-subunit surveillance complex called Cascade (CRISPR-associated complex for antiviral defense). Cascade recognizes DNA targets via protein-mediated recognition of a protospacer adjacent motif and complementary base pairing between the crRNA spacer and the DNA target. Previously determined structures of Cascade showed that the crRNA is stretched along an oligomeric protein assembly, leading us to ask how crRNA length impacts the assembly and function of thismore » complex. We found that extending the spacer portion of the crRNA resulted in larger Cascade complexes with altered stoichiometry and preserved in vitro binding affinity for target DNA. Longer spacers also preserved the in vivo ability of Cascade to repress target gene expression and to recruit the Cas3 endonuclease for target degradation. Lastly, longer spacers exhibited enhanced silencing at particular target locations and were sensitive to mismatches within the extended region. These findings demonstrate the flexibility of the Type I-E CRISPR machinery and suggest that spacer length can be modified to fine-tune Cascade activity.« less

  2. A common class of transcripts with 5'-intron depletion, distinct early coding sequence features, and N1-methyladenosine modification.

    PubMed

    Cenik, Can; Chua, Hon Nian; Singh, Guramrit; Akef, Abdalla; Snyder, Michael P; Palazzo, Alexander F; Moore, Melissa J; Roth, Frederick P

    2017-03-01

    Introns are found in 5' untranslated regions (5'UTRs) for 35% of all human transcripts. These 5'UTR introns are not randomly distributed: Genes that encode secreted, membrane-bound and mitochondrial proteins are less likely to have them. Curiously, transcripts lacking 5'UTR introns tend to harbor specific RNA sequence elements in their early coding regions. To model and understand the connection between coding-region sequence and 5'UTR intron status, we developed a classifier that can predict 5'UTR intron status with >80% accuracy using only sequence features in the early coding region. Thus, the classifier identifies transcripts with 5 ' proximal- i ntron- m inus-like-coding regions ("5IM" transcripts). Unexpectedly, we found that the early coding sequence features defining 5IM transcripts are widespread, appearing in 21% of all human RefSeq transcripts. The 5IM class of transcripts is enriched for non-AUG start codons, more extensive secondary structure both preceding the start codon and near the 5' cap, greater dependence on eIF4E for translation, and association with ER-proximal ribosomes. 5IM transcripts are bound by the exon junction complex (EJC) at noncanonical 5' proximal positions. Finally, N 1 -methyladenosines are specifically enriched in the early coding regions of 5IM transcripts. Taken together, our analyses point to the existence of a distinct 5IM class comprising ∼20% of human transcripts. This class is defined by depletion of 5' proximal introns, presence of specific RNA sequence features associated with low translation efficiency, N 1 -methyladenosines in the early coding region, and enrichment for noncanonical binding by the EJC. © 2017 Cenik et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  3. Efficient full wave code for the coupling of large multirow multijunction LH grills

    NASA Astrophysics Data System (ADS)

    Preinhaelter, Josef; Hillairet, Julien; Milanesio, Daniele; Maggiora, Riccardo; Urban, Jakub; Vahala, Linda; Vahala, George

    2017-11-01

    The full wave code OLGA, for determining the coupling of a single row lower hybrid launcher (waveguide grills) to the plasma, is extended to handle multirow multijunction active passive structures (like the C3 and C4 launchers on TORE SUPRA) by implementing the scattering matrix formalism. The extended code is still computationally fast because of the use of (i) 2D splines of the plasma surface admittance in the accessibility region of the k-space, (ii) high order Gaussian quadrature rules for the integration of the coupling elements and (iii) utilizing the symmetries of the coupling elements in the multiperiodic structures. The extended OLGA code is benchmarked against the ALOHA-1D, ALOHA-2D and TOPLHA codes for the coupling of the C3 and C4 TORE SUPRA launchers for several plasma configurations derived from reflectometry and interferometery. Unlike nearly all codes (except the ALOHA-1D code), OLGA does not require large computational resources and can be used for everyday usage in planning experimental runs. In particular, it is shown that the OLGA code correctly handles the coupling of the C3 and C4 launchers over a very wide range of plasma densities in front of the grill.

  4. Towards a Model for Protein Production Rates

    NASA Astrophysics Data System (ADS)

    Dong, J. J.; Schmittmann, B.; Zia, R. K. P.

    2007-07-01

    In the process of translation, ribosomes read the genetic code on an mRNA and assemble the corresponding polypeptide chain. The ribosomes perform discrete directed motion which is well modeled by a totally asymmetric simple exclusion process (TASEP) with open boundaries. Using Monte Carlo simulations and a simple mean-field theory, we discuss the effect of one or two "bottlenecks" (i.e., slow codons) on the production rate of the final protein. Confirming and extending previous work by Chou and Lakatos, we find that the location and spacing of the slow codons can affect the production rate quite dramatically. In particular, we observe a novel "edge" effect, i.e., an interaction of a single slow codon with the system boundary. We focus in detail on ribosome density profiles and provide a simple explanation for the length scale which controls the range of these interactions.

  5. Identification of phasiRNAs in wild rice (Oryza rufipogon).

    PubMed

    Liu, Yang; Wang, Yu; Zhu, Qian-Hao; Fan, Longjiang

    2013-08-01

    Plant miRNAs can trigger the production of phased, secondary siRNAs from either non-coding or protein-coding genes. In this study, at least 864 and 3,961 loci generating 21-nt and 24-nt phased siRNAs (phasiRNAs),respectively, were identified in three tissues from wild rice. Of these phasiRNA-producing loci, or PHAS genes, biogenesis of phasiRNAs in at least 160 of 21-nt and 254 of 24-nt loci could be triggered by interaction with miRNA(s). Developing seeds had more PHAS genes than leaves and roots. Genetic constrain on miRNA-triggered PHAS genes suggests that phasiRNAs might be one of the driving forces contributed to rice domestication.

  6. NOBAI: a web server for character coding of geometrical and statistical features in RNA structure

    PubMed Central

    Knudsen, Vegeir; Caetano-Anollés, Gustavo

    2008-01-01

    The Numeration of Objects in Biology: Alignment Inferences (NOBAI) web server provides a web interface to the applications in the NOBAI software package. This software codes topological and thermodynamic information related to the secondary structure of RNA molecules as multi-state phylogenetic characters, builds character matrices directly in NEXUS format and provides sequence randomization options. The web server is an effective tool that facilitates the search for evolutionary history embedded in the structure of functional RNA molecules. The NOBAI web server is accessible at ‘http://www.manet.uiuc.edu/nobai/nobai.php’. This web site is free and open to all users and there is no login requirement. PMID:18448469

  7. An Extended Proof-Carrying Code Framework for Security Enforcement

    NASA Astrophysics Data System (ADS)

    Pirzadeh, Heidar; Dubé, Danny; Hamou-Lhadj, Abdelwahab

    The rapid growth of the Internet has resulted in increased attention to security to protect users from being victims of security threats. In this paper, we focus on security mechanisms that are based on Proof-Carrying Code (PCC) techniques. In a PCC system, a code producer sends a code along with its safety proof to the consumer. The consumer executes the code only if the proof is valid. Although PCC has been shown to be a useful security framework, it suffers from the sheer size of typical proofs -proofs of even small programs can be considerably large. In this paper, we propose an extended PCC framework (EPCC) in which, instead of the proof, a proof generator for the program in question is transmitted. This framework enables the execution of the proof generator and the recovery of the proof on the consumer's side in a secure manner using a newly created virtual machine called the VEP (Virtual Machine for Extended PCC).

  8. Efficient pairwise RNA structure prediction using probabilistic alignment constraints in Dynalign

    PubMed Central

    2007-01-01

    Background Joint alignment and secondary structure prediction of two RNA sequences can significantly improve the accuracy of the structural predictions. Methods addressing this problem, however, are forced to employ constraints that reduce computation by restricting the alignments and/or structures (i.e. folds) that are permissible. In this paper, a new methodology is presented for the purpose of establishing alignment constraints based on nucleotide alignment and insertion posterior probabilities. Using a hidden Markov model, posterior probabilities of alignment and insertion are computed for all possible pairings of nucleotide positions from the two sequences. These alignment and insertion posterior probabilities are additively combined to obtain probabilities of co-incidence for nucleotide position pairs. A suitable alignment constraint is obtained by thresholding the co-incidence probabilities. The constraint is integrated with Dynalign, a free energy minimization algorithm for joint alignment and secondary structure prediction. The resulting method is benchmarked against the previous version of Dynalign and against other programs for pairwise RNA structure prediction. Results The proposed technique eliminates manual parameter selection in Dynalign and provides significant computational time savings in comparison to prior constraints in Dynalign while simultaneously providing a small improvement in the structural prediction accuracy. Savings are also realized in memory. In experiments over a 5S RNA dataset with average sequence length of approximately 120 nucleotides, the method reduces computation by a factor of 2. The method performs favorably in comparison to other programs for pairwise RNA structure prediction: yielding better accuracy, on average, and requiring significantly lesser computational resources. Conclusion Probabilistic analysis can be utilized in order to automate the determination of alignment constraints for pairwise RNA structure prediction methods in a principled fashion. These constraints can reduce the computational and memory requirements of these methods while maintaining or improving their accuracy of structural prediction. This extends the practical reach of these methods to longer length sequences. The revised Dynalign code is freely available for download. PMID:17445273

  9. Efficient Translation of Pelargonium line pattern virus RNAs Relies on a TED-Like 3´-Translational Enhancer that Communicates with the Corresponding 5´-Region through a Long-Distance RNA-RNA Interaction

    PubMed Central

    Blanco-Pérez, Marta; Pérez-Cañamás, Miryam; Ruiz, Leticia; Hernández, Carmen

    2016-01-01

    Cap-independent translational enhancers (CITEs) have been identified at the 3´-terminal regions of distinct plant positive-strand RNA viruses belonging to families Tombusviridae and Luteoviridae. On the bases of their structural and/or functional requirements, at least six classes of CITEs have been defined whose distribution does not correlate with taxonomy. The so-called TED class has been relatively under-studied and its functionality only confirmed in the case of Satellite tobacco necrosis virus, a parasitic subviral agent. The 3´-untranslated region of the monopartite genome of Pelargonium line pattern virus (PLPV), the recommended type member of a tentative new genus (Pelarspovirus) in the family Tombusviridae, was predicted to contain a TED-like CITE. Similar CITEs can be anticipated in some other related viruses though none has been experimentally verified. Here, in the first place, we have performed a reassessment of the structure of the putative PLPV-TED through in silico predictions and in vitro SHAPE analysis with the full-length PLPV genome, which has indicated that the presumed TED element is larger than previously proposed. The extended conformation of the TED is strongly supported by the pattern of natural sequence variation, thus providing comparative structural evidence in support of the structural data obtained by in silico and in vitro approaches. Next, we have obtained experimental evidence demonstrating the in vivo activity of the PLPV-TED in the genomic (g) RNA, and also in the subgenomic (sg) RNA that the virus produces to express 3´-proximal genes. Besides other structural features, the results have highlighted the key role of long-distance kissing-loop interactions between the 3´-CITE and 5´-proximal hairpins for gRNA and sgRNA translation. Bioassays of CITE mutants have confirmed the importance of the identified 5´-3´ RNA communication for viral infectivity and, moreover, have underlined the strong evolutionary constraints that may operate on genome stretches with both regulatory and coding functions. PMID:27043436

  10. Efficient Translation of Pelargonium line pattern virus RNAs Relies on a TED-Like 3´-Translational Enhancer that Communicates with the Corresponding 5´-Region through a Long-Distance RNA-RNA Interaction.

    PubMed

    Blanco-Pérez, Marta; Pérez-Cañamás, Miryam; Ruiz, Leticia; Hernández, Carmen

    2016-01-01

    Cap-independent translational enhancers (CITEs) have been identified at the 3´-terminal regions of distinct plant positive-strand RNA viruses belonging to families Tombusviridae and Luteoviridae. On the bases of their structural and/or functional requirements, at least six classes of CITEs have been defined whose distribution does not correlate with taxonomy. The so-called TED class has been relatively under-studied and its functionality only confirmed in the case of Satellite tobacco necrosis virus, a parasitic subviral agent. The 3´-untranslated region of the monopartite genome of Pelargonium line pattern virus (PLPV), the recommended type member of a tentative new genus (Pelarspovirus) in the family Tombusviridae, was predicted to contain a TED-like CITE. Similar CITEs can be anticipated in some other related viruses though none has been experimentally verified. Here, in the first place, we have performed a reassessment of the structure of the putative PLPV-TED through in silico predictions and in vitro SHAPE analysis with the full-length PLPV genome, which has indicated that the presumed TED element is larger than previously proposed. The extended conformation of the TED is strongly supported by the pattern of natural sequence variation, thus providing comparative structural evidence in support of the structural data obtained by in silico and in vitro approaches. Next, we have obtained experimental evidence demonstrating the in vivo activity of the PLPV-TED in the genomic (g) RNA, and also in the subgenomic (sg) RNA that the virus produces to express 3´-proximal genes. Besides other structural features, the results have highlighted the key role of long-distance kissing-loop interactions between the 3´-CITE and 5´-proximal hairpins for gRNA and sgRNA translation. Bioassays of CITE mutants have confirmed the importance of the identified 5´-3´ RNA communication for viral infectivity and, moreover, have underlined the strong evolutionary constraints that may operate on genome stretches with both regulatory and coding functions.

  11. DCU@TRECMed 2012: Using Ad-Hoc Baselines for Domain-Specific Retrieval

    DTIC Science & Technology

    2012-11-01

    description to extend the query, for example: Patients with complicated GERD who receive endoscopy will be extended with Gastroesophageal reflux disease ... Diseases and Related Health Problems, version 9) for the patient’s admission or discharge status [1, 5]; treating negation (e.g. negative test results or...codes were mapped to a description of the code, usually a short phrase/sentence. For instance, the ICD9 code 253.5 corresponds to the disease Diabetes

  12. Engineering naturally occurring trans-acting non-coding RNAs to sense molecular signals

    PubMed Central

    Qi, Lei; Lucks, Julius B.; Liu, Chang C.; Mutalik, Vivek K.; Arkin, Adam P.

    2012-01-01

    Non-coding RNAs (ncRNAs) are versatile regulators in cellular networks. While most trans-acting ncRNAs possess well-defined mechanisms that can regulate transcription or translation, they generally lack the ability to directly sense cellular signals. In this work, we describe a set of design principles for fusing ncRNAs to RNA aptamers to engineer allosteric RNA fusion molecules that modulate the activity of ncRNAs in a ligand-inducible way in Escherichia coli. We apply these principles to ncRNA regulators that can regulate translation (IS10 ncRNA) and transcription (pT181 ncRNA), and demonstrate that our design strategy exhibits high modularity between the aptamer ligand-sensing motif and the ncRNA target-recognition motif, which allows us to reconfigure these two motifs to engineer orthogonally acting fusion molecules that respond to different ligands and regulate different targets in the same cell. Finally, we show that the same ncRNA fused with different sensing domains results in a sensory-level NOR gate that integrates multiple input signals to perform genetic logic. These ligand-sensing ncRNA regulators provide useful tools to modulate the activity of structurally related families of ncRNAs, and building upon the growing body of RNA synthetic biology, our ability to design aptamer–ncRNA fusion molecules offers new ways to engineer ligand-sensing regulatory circuits. PMID:22383579

  13. PACCMIT/PACCMIT-CDS: identifying microRNA targets in 3' UTRs and coding sequences.

    PubMed

    Šulc, Miroslav; Marín, Ray M; Robins, Harlan S; Vaníček, Jiří

    2015-07-01

    The purpose of the proposed web server, publicly available at http://paccmit.epfl.ch, is to provide a user-friendly interface to two algorithms for predicting messenger RNA (mRNA) molecules regulated by microRNAs: (i) PACCMIT (Prediction of ACcessible and/or Conserved MIcroRNA Targets), which identifies primarily mRNA transcripts targeted in their 3' untranslated regions (3' UTRs), and (ii) PACCMIT-CDS, designed to find mRNAs targeted within their coding sequences (CDSs). While PACCMIT belongs among the accurate algorithms for predicting conserved microRNA targets in the 3' UTRs, the main contribution of the web server is 2-fold: PACCMIT provides an accurate tool for predicting targets also of weakly conserved or non-conserved microRNAs, whereas PACCMIT-CDS addresses the lack of similar portals adapted specifically for targets in CDS. The web server asks the user for microRNAs and mRNAs to be analyzed, accesses the precomputed P-values for all microRNA-mRNA pairs from a database for all mRNAs and microRNAs in a given species, ranks the predicted microRNA-mRNA pairs, evaluates their significance according to the false discovery rate and finally displays the predictions in a tabular form. The results are also available for download in several standard formats. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  14. Multiple independent insertions of 5S rRNA genes in the spliced-leader gene family of trypanosome species.

    PubMed

    Beauparlant, Marc A; Drouin, Guy

    2014-02-01

    Analyses of the 5S rRNA genes found in the spliced-leader (SL) gene repeat units of numerous trypanosome species suggest that such linkages were not inherited from a common ancestor, but were the result of independent 5S rRNA gene insertions. In trypanosomes, 5S rRNA genes are found either in the tandemly repeated units coding for SL genes or in independent tandemly repeated units. Given that trypanosome species where 5S rRNA genes are within the tandemly repeated units coding for SL genes are phylogenetically related, one might hypothesize that this arrangement is the result of an ancestral insertion of 5S rRNA genes into the tandemly repeated SL gene family of trypanosomes. Here, we use the types of 5S rRNA genes found associated with SL genes, the flanking regions of the inserted 5S rRNA genes and the position of these insertions to show that most of the 5S rRNA genes found within SL gene repeat units of trypanosome species were not acquired from a common ancestor but are the results of independent insertions. These multiple 5S rRNA genes insertion events in trypanosomes are likely the result of frequent founder events in different hosts and/or geographical locations in species having short generation times.

  15. Computational analysis of ribonomics datasets identifies long non-coding RNA targets of γ-herpesviral miRNAs.

    PubMed

    Sethuraman, Sunantha; Thomas, Merin; Gay, Lauren A; Renne, Rolf

    2018-05-29

    Ribonomics experiments involving crosslinking and immuno-precipitation (CLIP) of Ago proteins have expanded the understanding of the miRNA targetome of several organisms. These techniques, collectively referred to as CLIP-seq, have been applied to identifying the mRNA targets of miRNAs expressed by Kaposi's Sarcoma-associated herpes virus (KSHV) and Epstein-Barr virus (EBV). However, these studies focused on identifying only those RNA targets of KSHV and EBV miRNAs that are known to encode proteins. Recent studies have demonstrated that long non-coding RNAs (lncRNAs) are also targeted by miRNAs. In this study, we performed a systematic re-analysis of published datasets from KSHV- and EBV-driven cancers. We used CLIP-seq data from lymphoma cells or EBV-transformed B cells, and a crosslinking, ligation and sequencing of hybrids dataset from KSHV-infected endothelial cells, to identify novel lncRNA targets of viral miRNAs. Here, we catalog the lncRNA targetome of KSHV and EBV miRNAs, and provide a detailed in silico analysis of lncRNA-miRNA binding interactions. Viral miRNAs target several hundred lncRNAs, including a subset previously shown to be aberrantly expressed in human malignancies. In addition, we identified thousands of lncRNAs to be putative targets of human miRNAs, suggesting that miRNA-lncRNA interactions broadly contribute to the regulation of gene expression.

  16. Detection of human microRNAs across miRNA Array and Next Generation DNA Sequencing Platforms

    EPA Science Inventory

    microRNA (miRNAs) are non-coding RNA molecules between 19 and 30 nucleotides in length that are believed to regulate approximately 30 per cent of all human genes. They act as negative regulators of their gene targets in many biological processes. Recent developments in microar...

  17. Decoding critical long non-coding RNA in ovarian cancer epithelial-to-mesenchymal transition.

    PubMed

    Mitra, Ramkrishna; Chen, Xi; Greenawalt, Evan J; Maulik, Ujjwal; Jiang, Wei; Zhao, Zhongming; Eischen, Christine M

    2017-11-17

    Long non-coding RNA (lncRNA) are emerging as contributors to malignancies. Little is understood about the contribution of lncRNA to epithelial-to-mesenchymal transition (EMT), which correlates with metastasis. Ovarian cancer is usually diagnosed after metastasis. Here we report an integrated analysis of >700 ovarian cancer molecular profiles, including genomic data sets, from four patient cohorts identifying lncRNA DNM3OS, MEG3, and MIAT overexpression and their reproducible gene regulation in ovarian cancer EMT. Genome-wide mapping shows 73% of MEG3-regulated EMT-linked pathway genes contain MEG3 binding sites. DNM3OS overexpression, but not MEG3 or MIAT, significantly correlates to worse overall patient survival. DNM3OS knockdown results in altered EMT-linked genes/pathways, mesenchymal-to-epithelial transition, and reduced cell migration and invasion. Proteotranscriptomic characterization further supports the DNM3OS and ovarian cancer EMT connection. TWIST1 overexpression and DNM3OS amplification provides an explanation for increased DNM3OS levels. Therefore, our results elucidate lncRNA that regulate EMT and demonstrate DNM3OS specifically contributes to EMT in ovarian cancer.

  18. Green fluorescent protein expression from recombinant lettuce infectious yellows virus-defective RNAs originating from RNA 2.

    PubMed

    Yeh, H H; Tian, T; Medina, V; Falk, B W

    2001-10-10

    Lettuce infectious yellows virus (LIYV) RNA 2 defective RNAs (D RNAs) were compared in protoplasts for their ability to replicate and to express the green fluorescent protein (GFP) from recombinant D RNA constructs. Initially four LIYV D RNAs of different genetic composition were compared, but only two (LIYV D RNA M5 and M18) replicated to high levels. Both of these contained at least two complete ORFs, one being the 3'-terminal ORF encoding P26. Northern hybridization analysis using probes corresponding to 3' regions of LIYV RNA 2 detected the P26 subgenomic RNA from protoplasts infected with LIYV RNAs 1 and 2 or protoplasts inoculated only with RNA 1 plus either the LIYV D RNA M5 or M18, suggesting that these LIYV D RNAs served as templates to generate the P26 subgenomic RNA. The GFP coding region was inserted as an in-frame insertion into the P26 coding region of the LIYV M5 and M18 D RNAs, yielding M5gfp and M18gfp. When transcripts of M5gfp and M18gfp were used to inoculate protoplasts, bright fluorescence was seen only when they were co-inoculated with LIYV RNA 1. The percentage of fluorescent protoplasts ranged from experiment to experiment, but was as high as 5.8%. Time course analyses showed that fluorescence was not detected before 48 h pi, and this correlated with the timing of LIYV RNA 2 and RNA 2 D RNA accumulation, but not with that of LIYV RNA 1. Copyright 2001 Academic Press.

  19. The Long Non-Coding RNA Transcriptome Landscape in CHO Cells Under Batch and Fed-Batch Conditions.

    PubMed

    Vito, Davide; Smales, C Mark

    2018-05-21

    The role of non-coding RNAs in determining growth, productivity and recombinant product quality attributes in Chinese hamster ovary (CHO) cells has received much attention in recent years, exemplified by studies into microRNAs in particular. However, other classes of non-coding RNAs have received less attention. One such class are the non-coding RNAs known collectively as long non-coding RNAs (lncRNAs). We have undertaken the first landscape analysis of the lncRNA transcriptome in CHO using a mouse based microarray that also provided for the surveillance of the coding transcriptome. We report on those lncRNAs present in a model host CHO cell line under batch and fed-batch conditions on two different days and relate the expression of different lncRNAs to each other. We demonstrate that the mouse microarray was suitable for the detection and analysis of thousands of CHO lncRNAs and validated a number of these by qRT-PCR. We then further analysed the data to identify those lncRNAs whose expression changed the most between growth and stationary phases of culture or between batch and fed-batch culture to identify potential lncRNA targets for further functional studies with regard to their role in controlling growth of CHO cells. We discuss the implications for the publication of this rich dataset and how this may be used by the community. This article is protected by copyright. All rights reserved.

  20. Transcriptomes of six mutants in the Sen1 pathway reveal combinatorial control of transcription termination across the Saccharomyces cerevisiae genome

    PubMed Central

    Carver, Melissa N.; Müller, Ulrika; Bekiranov, Stefan; Auble, David T.

    2017-01-01

    Transcriptome studies on eukaryotic cells have revealed an unexpected abundance and diversity of noncoding RNAs synthesized by RNA polymerase II (Pol II), some of which influence the expression of protein-coding genes. Yet, much less is known about biogenesis of Pol II non-coding RNA than mRNAs. In the budding yeast Saccharomyces cerevisiae, initiation of non-coding transcripts by Pol II appears to be similar to that of mRNAs, but a distinct pathway is utilized for termination of most non-coding RNAs: the Sen1-dependent or “NNS” pathway. Here, we examine the effect on the S. cerevisiae transcriptome of conditional mutations in the genes encoding six different essential proteins that influence Sen1-dependent termination: Sen1, Nrd1, Nab3, Ssu72, Rpb11, and Hrp1. We observe surprisingly diverse effects on transcript abundance for the different proteins that cannot be explained simply by differing severity of the mutations. Rather, we infer from our results that termination of Pol II transcription of non-coding RNA genes is subject to complex combinatorial control that likely involves proteins beyond those studied here. Furthermore, we identify new targets and functions of Sen1-dependent termination, including a role in repression of meiotic genes in vegetative cells. In combination with other recent whole-genome studies on termination of non-coding RNAs, our results provide promising directions for further investigation. PMID:28665995

  1. Perspectives on the mechanism of transcriptional regulation by long non-coding RNAs.

    PubMed

    Roberts, Thomas C; Morris, Kevin V; Weinberg, Marc S

    2014-01-01

    Long non-coding RNAs (lncRNAs) are increasingly being recognized as epigenetic regulators of gene transcription. The diversity and complexity of lncRNA genes means that they exert their regulatory effects by a variety of mechanisms. Although there is still much to be learned about the mechanism of lncRNA function, general principles are starting to emerge. In particular, the application of high throughput (deep) sequencing methodologies has greatly advanced our understanding of lncRNA gene function. lncRNAs function as adaptors that link specific chromatin loci with chromatin-remodeling complexes and transcription factors. lncRNAs can act in cis or trans to guide epigenetic-modifier complexes to distinct genomic sites, or act as scaffolds which recruit multiple proteins simultaneously, thereby coordinating their activities. In this review we discuss the genomic organization of lncRNAs, the importance of RNA secondary structure to lncRNA functionality, the multitude of ways in which they interact with the genome, and what evolutionary conservation tells us about their function.

  2. RNA Editing in Plant Mitochondria

    NASA Astrophysics Data System (ADS)

    Hiesel, Rudolf; Wissinger, Bernd; Schuster, Wolfgang; Brennicke, Axel

    1989-12-01

    Comparative sequence analysis of genomic and complementary DNA clones from several mitochondrial genes in the higher plant Oenothera revealed nucleotide sequence divergences between the genomic and the messenger RNA-derived sequences. These sequence alterations could be most easily explained by specific post-transcriptional nucleotide modifications. Most of the nucleotide exchanges in coding regions lead to altered codons in the mRNA that specify amino acids better conserved in evolution than those encoded by the genomic DNA. Several instances show that the genomic arginine codon CGG is edited in the mRNA to the tryptophan codon TGG in amino acid positions that are highly conserved as tryptophan in the homologous proteins of other species. This editing suggests that the standard genetic code is used in plant mitochondria and resolves the frequent coincidence of CGG codons and tryptophan in different plant species. The apparently frequent and non-species-specific equivalency of CGG and TGG codons in particular suggests that RNA editing is a common feature of all higher plant mitochondria.

  3. Non-protein coding RNA genes as the novel diagnostic markers for the discrimination of Salmonella species using PCR.

    PubMed

    Nithya, Ravichantar; Ahmed, Siti Aminah; Hoe, Chee-Hock; Gopinath, Subash C B; Citartan, Marimuthu; Chinni, Suresh V; Lee, Li Pin; Rozhdestvensky, Timofey S; Tang, Thean-Hock

    2015-01-01

    Salmonellosis, a communicable disease caused by members of the Salmonella species, transmitted to humans through contaminated food or water. It is of paramount importance, to generate accurate detection methods for discriminating the various Salmonella species that cause severe infection in humans, including S. Typhi and S. Paratyphi A. Here, we formulated a strategy of detection and differentiation of salmonellosis by a multiplex polymerase chain reaction assay using S. Typhi non-protein coding RNA (sRNA) genes. With the designed sequences that specifically detect sRNA genes from S. Typhi and S. Paratyphi A, a detection limit of up to 10 pg was achieved. Moreover, in a stool-seeding experiment with S. Typhi and S. Paratyphi A, we have attained a respective detection limit of 15 and 1.5 CFU/mL. The designed strategy using sRNA genes shown here is comparatively sensitive and specific, suitable for clinical diagnosis and disease surveillance, and sRNAs represent an excellent molecular target for infectious disease.

  4. Non-Coding RNA Analysis Using the Rfam Database.

    PubMed

    Kalvari, Ioanna; Nawrocki, Eric P; Argasinska, Joanna; Quinones-Olvera, Natalia; Finn, Robert D; Bateman, Alex; Petrov, Anton I

    2018-06-01

    Rfam is a database of non-coding RNA families in which each family is represented by a multiple sequence alignment, a consensus secondary structure, and a covariance model. Using a combination of manual and literature-based curation and a custom software pipeline, Rfam converts descriptions of RNA families found in the scientific literature into computational models that can be used to annotate RNAs belonging to those families in any DNA or RNA sequence. Valuable research outputs that are often locked up in figures and supplementary information files are encapsulated in Rfam entries and made accessible through the Rfam Web site. The data produced by Rfam have a broad application, from genome annotation to providing training sets for algorithm development. This article gives an overview of how to search and navigate the Rfam Web site, and how to annotate sequences with RNA families. The Rfam database is freely available at http://rfam.org. © 2018 by John Wiley & Sons, Inc. Copyright © 2018 John Wiley & Sons, Inc.

  5. Mitochondrial genome of Pteronotus personatus (Chiroptera: Mormoopidae): comparison with selected bats and phylogenetic considerations.

    PubMed

    López-Wilchis, Ricardo; Del Río-Portilla, Miguel Ángel; Guevara-Chumacero, Luis Manuel

    2017-02-01

    We described the complete mitochondrial genome (mitogenome) of the Wagner's mustached bat, Pteronotus personatus, a species belonging to the family Mormoopidae, and compared it with other published mitogenomes of bats (Chiroptera). The mitogenome of P. personatus was 16,570 bp long and contained a typically conserved structure including 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes, and one control region (D-loop). Most of the genes were encoded on the H-strand, except for eight tRNA and the ND6 genes. The order of protein-coding and rRNA genes was highly conserved in all mitogenomes. All protein-coding genes started with an ATG codon, except for ND2, ND3, and ND5, which initiated with ATA, and terminated with the typical stop codon TAA/TAG or the codon AGA. Phylogenetic trees constructed using Maximum Parsimony, Maximum Likelihood, and Bayesian inference methods showed an identical topology and indicated the monophyly of different families of bats (Mormoopidae, Phyllostomidae, Vespertilionidae, Rhinolophidae, and Pteropopidae) and the existence of two major clades corresponding to the suborders Yangochiroptera and Yinpterochiroptera. The mitogenome sequence provided here will be useful for further phylogenetic analyses and population genetic studies in mormoopid bats.

  6. RNAi mediates post-transcriptional repression of gene expression in fission yeast Schizosaccharomyces pombe

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Smialowska, Agata, E-mail: smialowskaa@gmail.com; School of Life Sciences, Södertörn Högskola, Huddinge 141-89; Djupedal, Ingela

    Highlights: • Protein coding genes accumulate anti-sense sRNAs in fission yeast S. pombe. • RNAi represses protein-coding genes in S. pombe. • RNAi-mediated gene repression is post-transcriptional. - Abstract: RNA interference (RNAi) is a gene silencing mechanism conserved from fungi to mammals. Small interfering RNAs are products and mediators of the RNAi pathway and act as specificity factors in recruiting effector complexes. The Schizosaccharomyces pombe genome encodes one of each of the core RNAi proteins, Dicer, Argonaute and RNA-dependent RNA polymerase (dcr1, ago1, rdp1). Even though the function of RNAi in heterochromatin assembly in S. pombe is established, its rolemore » in controlling gene expression is elusive. Here, we report the identification of small RNAs mapped anti-sense to protein coding genes in fission yeast. We demonstrate that these genes are up-regulated at the protein level in RNAi mutants, while their mRNA levels are not significantly changed. We show that the repression by RNAi is not a result of heterochromatin formation. Thus, we conclude that RNAi is involved in post-transcriptional gene silencing in S. pombe.« less

  7. Complete mitochondrial genome of Taharana fasciana (Insecta, Hemiptera: Cicadellidae) and comparison with other Cicadellidae insects.

    PubMed

    Wang, Jiajia; Li, Hu; Dai, Renhuai

    2017-12-01

    Here, we describe the first complete mitochondrial genome (mitogenome) sequence of the leafhopper Taharana fasciana (Coelidiinae). The mitogenome sequence contains 15,161 bp with an A + T content of 77.9%. It includes 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes, and one non-coding (A + T-rich) region; in addition, a repeat region is also present (GenBank accession no. KY886913). These genes/regions are in the same order as in the inferred insect ancestral mitogenome. All protein-coding genes have ATN as the start codon, and TAA or single T as the stop codons, except the gene ND3, which ends with TAG. Furthermore, we predicted the secondary structures of the rRNAs in T. fasciana. Six domains (domain III is absent in arthropods) and 41 helices were predicted for 16S rRNA, and 12S rRNA comprised three structural domains and 24 helices. Phylogenetic tree analysis confirmed that T. fasciana and other members of the Cicadellidae are clustered into a clade, and it identified the relationships among the subfamilies Deltocephalinae, Coelidiinae, Idiocerinae, Cicadellinae, and Typhlocybinae.

  8. The complete mitochondrial genome of the gall-forming fly, Fergusonina taylori Nelson and Yeates (Diptera: Fergusoninidae).

    PubMed

    Nelson, Leigh A; Cameron, Stephen L; Yeates, David K

    2011-10-01

    The monogeneric family Fergusoninidae consists of gall-forming flies that, together with Fergusobia (Tylenchida: Neotylenchidae) nematodes, form the only known mutualistic association between insects and nematodes. In this study, the entire 16,000 bp mitochondrial genome of Fergusonina taylori Nelson and Yeates was sequenced. The circular genome contains one encoding region including 27 genes and one non-coding A+T-rich region. The arrangement of the protein-coding, ribosomal RNA (rRNA) and transfer RNA (tRNA) genes was the same as that found in the ancestral insect. Nucleotide composition is highly A+T biased. All of the protein initiation codons are ATN, except for nad1 which begins with TTT. All 22 tRNA anticodons of F. taylori match those observed in Drosophila yakuba, and all form the typical cloverleaf structure except for tRNA-Ser((AGN)) which lacks a dihydrouridine (DHU) arm. Secondary structural features of the rRNA genes of Fergusonina are similar to those proposed for other insects, with minor modifications. The mitochondrial genome of Fergusonina presented here may prove valuable for resolving the sister group to the Fergusoninidae, and expands the available mtDNA data sources for acalyptrates overall.

  9. Armored long non-coding RNA MEG3 targeting EGFR based on recombinant MS2 bacteriophage virus-like particles against hepatocellular carcinoma.

    PubMed

    Chang, Le; Wang, Guojing; Jia, Tingting; Zhang, Lei; Li, Yulong; Han, Yanxi; Zhang, Kuo; Lin, Guigao; Zhang, Rui; Li, Jinming; Wang, Lunan

    2016-04-26

    Hepatocellular carcinoma (HCC) is one of the most frequently diagnosed cancers worldwide. However, the treatment of patients with HCC is particularly challenging. Long non-coding RNA maternally expressed gene 3 (MEG3) has been identified as a potential suppressor of several types of tumors, but the delivery of long RNA remains problematic, limiting its applications. In the present study, we designed a novel delivery system based on MS2 virus-like particles (VLPs) crosslinked with GE11 polypeptide. This vector was found to be fast, effective and safe for the targeted delivery of lncRNA MEG3 RNA to the epidermal growth factor receptor (EGFR)-positive HCC cell lines without the activation of EGFR downstream pathways, and significantly attenuated both in vitro and in vivo tumor cell growth. Our study also revealed that the targeted delivery was mainly dependent on clathrin-mediated endocytosis and MEG3 RNA suppresses tumor growth mainly via increasing the expression of p53 and its downstream gene GDF15, but decreasing the expression of MDM2. Thus, this vector is promising as a novel delivery system and may facilitate a new approach to lncRNA based cancer therapy.

  10. Matrix factorization-based data fusion for the prediction of lncRNA-disease associations.

    PubMed

    Fu, Guangyuan; Wang, Jun; Domeniconi, Carlotta; Yu, Guoxian

    2018-05-01

    Long non-coding RNAs (lncRNAs) play crucial roles in complex disease diagnosis, prognosis, prevention and treatment, but only a small portion of lncRNA-disease associations have been experimentally verified. Various computational models have been proposed to identify lncRNA-disease associations by integrating heterogeneous data sources. However, existing models generally ignore the intrinsic structure of data sources or treat them as equally relevant, while they may not be. To accurately identify lncRNA-disease associations, we propose a Matrix Factorization based LncRNA-Disease Association prediction model (MFLDA in short). MFLDA decomposes data matrices of heterogeneous data sources into low-rank matrices via matrix tri-factorization to explore and exploit their intrinsic and shared structure. MFLDA can select and integrate the data sources by assigning different weights to them. An iterative solution is further introduced to simultaneously optimize the weights and low-rank matrices. Next, MFLDA uses the optimized low-rank matrices to reconstruct the lncRNA-disease association matrix and thus to identify potential associations. In 5-fold cross validation experiments to identify verified lncRNA-disease associations, MFLDA achieves an area under the receiver operating characteristic curve (AUC) of 0.7408, at least 3% higher than those given by state-of-the-art data fusion based computational models. An empirical study on identifying masked lncRNA-disease associations again shows that MFLDA can identify potential associations more accurately than competing models. A case study on identifying lncRNAs associated with breast, lung and stomach cancers show that 38 out of 45 (84%) associations predicted by MFLDA are supported by recent biomedical literature and further proves the capability of MFLDA in identifying novel lncRNA-disease associations. MFLDA is a general data fusion framework, and as such it can be adopted to predict associations between other biological entities. The source code for MFLDA is available at: http://mlda.swu.edu.cn/codes.php? name = MFLDA. gxyu@swu.edu.cn. Supplementary data are available at Bioinformatics online.

  11. Next stop for the CRISPR revolution: RNA-guided epigenetic regulators.

    PubMed

    Vora, Suhani; Tuttle, Marcelle; Cheng, Jenny; Church, George

    2016-09-01

    Clustered regularly interspaced short palindromic repeats (CRISPRs) and CRISPR-associated (Cas) proteins offer a breakthrough platform for cheap, programmable, and effective sequence-specific DNA targeting. The CRISPR-Cas system is naturally equipped for targeted DNA cutting through its native nuclease activity. As such, groups researching a broad spectrum of biological organisms have quickly adopted the technology with groundbreaking applications to genomic sequence editing in over 20 different species. However, the biological code of life is not only encoded in genetics but also in epigenetics as well. While genetic sequence editing is a powerful ability, we must also be able to edit and regulate transcriptional and epigenetic code. Taking inspiration from work on earlier sequence-specific targeting technologies such as zinc fingers (ZFs) and transcription activator-like effectors (TALEs), researchers quickly expanded the CRISPR-Cas toolbox to include transcriptional activation, repression, and epigenetic modification. In this review, we highlight advances that extend the CRISPR-Cas toolkit for transcriptional and epigenetic regulation, as well as best practice guidelines for these tools, and a perspective on future applications. © 2016 The Authors. The FEBS Journal published by John Wiley & Sons Ltd on behalf of Federation of European Biochemical Societies.

  12. Complete mitochondrial genome sequence from an endangered Indian snake, Python molurus molurus (Serpentes, Pythonidae).

    PubMed

    Dubey, Bhawna; Meganathan, P R; Haque, Ikramul

    2012-07-01

    This paper reports the complete mitochondrial genome sequence of an endangered Indian snake, Python molurus molurus (Indian Rock Python). A typical snake mitochondrial (mt) genome of 17258 bp length comprising of 37 genes including the 13 protein coding genes, 22 tRNA genes, and 2 ribosomal RNA genes along with duplicate control regions is described herein. The P. molurus molurus mt. genome is relatively similar to other snake mt. genomes with respect to gene arrangement, composition, tRNA structures and skews of AT/GC bases. The nucleotide composition of the genome shows that there are more A-C % than T-G% on the positive strand as revealed by positive AT and CG skews. Comparison of individual protein coding genes, with other snake genomes suggests that ATP8 and NADH3 genes have high divergence rates. Codon usage analysis reveals a preference of NNC codons over NNG codons in the mt. genome of P. molurus. Also, the synonymous and non-synonymous substitution rates (ka/ks) suggest that most of the protein coding genes are under purifying selection pressure. The phylogenetic analyses involving the concatenated 13 protein coding genes of P. molurus molurus conformed to the previously established snake phylogeny.

  13. Poly(A) code analyses reveal key determinants for tissue-specific mRNA alternative polyadenylation

    PubMed Central

    Weng, Lingjie; Li, Yi; Xie, Xiaohui; Shi, Yongsheng

    2016-01-01

    mRNA alternative polyadenylation (APA) is a critical mechanism for post-transcriptional gene regulation and is often regulated in a tissue- and/or developmental stage-specific manner. An ultimate goal for the APA field has been to be able to computationally predict APA profiles under different physiological or pathological conditions. As a first step toward this goal, we have assembled a poly(A) code for predicting tissue-specific poly(A) sites (PASs). Based on a compendium of over 600 features that have known or potential roles in PAS selection, we have generated and refined a machine-learning algorithm using multiple high-throughput sequencing-based data sets of tissue-specific and constitutive PASs. This code can predict tissue-specific PASs with >85% accuracy. Importantly, by analyzing the prediction performance based on different RNA features, we found that PAS context, including the distance between alternative PASs and the relative position of a PAS within the gene, is a key feature for determining the susceptibility of a PAS to tissue-specific regulation. Our poly(A) code provides a useful tool for not only predicting tissue-specific APA regulation, but also for studying its underlying molecular mechanisms. PMID:27095026

  14. Code Development in Coupled PARCS/RELAP5 for Supercritical Water Reactor

    DOE PAGES

    Hu, Po; Wilson, Paul

    2014-01-01

    The new capability is added to the existing coupled code package PARCS/RELAP5, in order to analyze SCWR design under supercritical pressure with the separated water coolant and moderator channels. This expansion is carried out on both codes. In PARCS, modification is focused on extending the water property tables to supercritical pressure, modifying the variable mapping input file and related code module for processing thermal-hydraulic information from separated coolant/moderator channels, and modifying neutronics feedback module to deal with the separated coolant/moderator channels. In RELAP5, modification is focused on incorporating more accurate water properties near SCWR operation/transient pressure and temperature in themore » code. Confirming tests of the modifications is presented and the major analyzing results from the extended codes package are summarized.« less

  15. Competing endogenous RNA network crosstalk reveals novel molecular markers in colorectal cancer.

    PubMed

    Samir, Nehal; Matboli, Marwa; El-Tayeb, Hanaa; El-Tawdi, Ahmed; Hassan, Mohmed K; Waly, Amr; El-Akkad, Hesham A E; Ramadan, Mohamed G; Al-Belkini, Tarek N; El-Khamisy, Sherif; El-Asmar, Farid

    2018-05-08

    The competing endogenous RNA networks play a pivotal role in cancer diagnosis and progression. Novel properstrategies for early detection of colorectal cancer (CRC) are strongly needed. We investigated a novel CRC-specific RNA-based integrated competing endogenous network composed of lethal3 malignant brain tumor like1 (L3MBTL1) gene, long non-coding intergenic RNA- (lncRNA RP11-909B2.1) and homo sapiens microRNA-595 (hsa-miRNA-595) using in silico data analysis. RT-qPCR-based validation of the network was achieved in serum of 70 patients with CRC, 40 patients with benign colorectal neoplasm, and 20 healthy controls. Moreover, in cancer tissues of 20 of the 70 CRC cases were involved in the study. The expression of RNA-based biomarker network in both CRC and adjacent non-tumor tissues and their correlation with the serum levels of this network members was investigated. Lastly, the expression levels of the chosen ceRNA was verified in CRC cell line. Our results revealed that the three RNAs-based biomarker network (long non-coding intergenic RNA-[lncRNA RP11-909B2.1], Homo sapiens microRNA-595 [hsa-miRNA-595], and L3MBTL1 mRNA), had high sensitivity and specificity for discriminating CRC from healthy controls and also from benign colorectal neoplasm. The data suggest that among these three RNAs, serum lncRNA RP11-909B2.1 could be a promising independent prognostic factors in CRC. The circulatory RNA based biomarker panel can act as potential biomarker for CRC diagnosis and prognosis. © 2018 Wiley Periodicals, Inc.

  16. Localization of the mRNA for the dopamine D sub 2 receptor in the rat brain by in situ hybridization histochemistry

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mengod, G.; Martinez-Mir, M.I.; Vilaro, M.T.

    1989-11-01

    {sup 32}P-labeled oligonucleotides derived from the coding region of rat dopamine D{sub 2} receptor cDNA were used as probes to localize cells in the rat brain that contain the mRNA coding for this receptor by using in situ hybridization histochemistry. The highest level of hybridization was found in the intermediate lobe of the pituitary gland. High mRNA content was observed in the anterior lobe of the pituitary gland, the nuclei caudate-putamen and accumbens, and the olfactory tubercle. Lower levels were seen in the substantia nigra pars compacta and the ventral tegmental area, as well as in the lateral mammillary body.more » In these areas the distribution was comparable to that of the dopamine D{sub 2} receptor binding sites as visualized by autoradiography using ({sup 3}H)SDZ 205-502 as a ligand. However, in some areas such as the olfactory bulb, neocortex, hippocampus, superior colliculus, and cerebellum, D{sub 2} receptors have been visualized but no significant hybridization signal could be detected. The mRNA coding for these receptors in these areas could be contained in cells outside those brain regions, be different from the one recognized by our probes, or be present at levels below the detection limits of our procedure. The possibility of visualizing and quantifying the mRNA coding for dopamine D{sub 2} receptor at the microscopic level will yield more information about the in vivo regulation of the synthesis of these receptor and their alteration following selective lesions or drug treatments.« less

  17. Global assessment of small RNAs reveals a non-coding transcript involved in biofilm formation and attachment in Acinetobacter baumannii ATCC 17978

    PubMed Central

    Pérez, Astrid; Gómez, Manuel J.; Gayoso, Carmen; Vallejo, Juan A.; Ohneck, Emily J.; Valle, Jaione; Actis, Luis A.; Beceiro, Alejandro; Bou, Germán

    2017-01-01

    Many strains of Acinetobacter baumannii have been described as being able to form biofilm. Small non-coding RNAs (sRNAs) control gene expression in many regulatory circuits in bacteria. The aim of the present work was to provide a global description of the sRNAs produced both by planktonic and biofilm-associated (sessile) cells of A. baumannii ATCC 17978, and to compare the corresponding gene expression profiles to identify sRNAs molecules associated to biofilm formation and virulence. sRNA was extracted from both planktonic and sessile cells and reverse transcribed. cDNA was subjected to 454-pyrosequencing using the GS-FLX Titanium chemistry. The global analysis of the small RNA transcriptome revealed different sRNA expression patterns in planktonic and biofilm associated cells, with some of the transcripts only expressed or repressed in sessile bacteria. A total of 255 sRNAs were detected, with 185 of them differentially expressed in the different types of cells. A total of 9 sRNAs were expressed only in biofilm cells, while the expression of other 21 coding regions were repressed only in biofilm cells. Strikingly, the expression level of the sRNA 13573 was 120 times higher in biofilms than in planktonic cells, an observation that prompted us to further investigate the biological role of this non-coding transcript. Analyses of an isogenic mutant and over-expressing strains revealed that the sRNA 13573 gene is involved in biofilm formation and attachment to A549 human alveolar epithelial cells. The present work serves as a basis for future studies examining the complex regulatory network that regulate biofilm biogenesis and attachment to eukaryotic cells in A. baumannii ATCC 17978. PMID:28763494

  18. Amino acid codes in mitochondria as possible clues to primitive codes

    NASA Technical Reports Server (NTRS)

    Jukes, T. H.

    1981-01-01

    Differences between mitochondrial codes and the universal code indicate that an evolutionary simplification has taken place, rather than a return to a more primitive code. However, these differences make it evident that the universal code is not the only code possible, and therefore earlier codes may have differed markedly from the previous code. The present universal code is probably a 'frozen accident.' The change in CUN codons from leucine to threonine (Neurospora vs. yeast mitochondria) indicates that neutral or near-neutral changes occurred in the corresponding proteins when this code change took place, caused presumably by a mutation in a tRNA gene.

  19. Expression of versican 3'-untranslated region modulates endogenous microRNA functions.

    PubMed

    Lee, Daniel Y; Jeyapalan, Zina; Fang, Ling; Yang, Jennifer; Zhang, Yaou; Yee, Albert Y; Li, Minhui; Du, William W; Shatseva, Tatiana; Yang, Burton B

    2010-10-25

    Mature microRNAs (miRNAs) are single-stranded RNAs that regulate post-transcriptional gene expression. In our previous study, we have shown that versican 3'UTR, a fragment of non-coding transcript, has the ability to antagonize miR-199a-3p function thereby regulating expression of the matrix proteins versican and fibronectin, and thus resulting in enhanced cell-cell adhesion and organ adhesion. However, the impact of this non-coding fragment on tumorigenesis is yet to be determined. Using computational prediction confirmed with in vitro and in vivo experiments, we report that the expression of versican 3'UTR not only antagonizes miR-199a-3p but can also lower its steady state expression. We found that expression of versican 3'UTR in a mouse breast carcinoma cell line, 4T1, decreased miR-199a-3p levels. The decrease in miRNA activity consequently translated into differences in tumor growth. Computational analysis indicated that both miR-199a-3p and miR-144 targeted a cell cycle regulator, Rb1. In addition, miR-144 and miR-136, which have also been shown to interact with versican 3'UTR, was found to target PTEN. Expression of Rb1 and PTEN were up-regulated synergistically in vitro and in vivo, suggesting that the 3'UTR binds and modulates miRNA activities, freeing Rb1 and PTEN mRNAs for translation. In tumor formation assays, cells transfected with the 3'UTR formed smaller tumors compared with cells transfected with a control vector. Our results demonstrated that a 3'UTR fragment can be used to modulate miRNA functions. Our study also suggests that miRNAs in the cancer cells are more susceptible to degradation, due to its interaction with a non-coding 3'UTR. This non-coding component of mRNA may be used retrospectively to modulate miRNA activities.

  20. CANT1 lncRNA Triggers Efficient Therapeutic Efficacy by Correcting Aberrant lncing Cascade in Malignant Uveal Melanoma.

    PubMed

    Xing, Yue; Wen, Xuyang; Ding, Xia; Fan, Jiayan; Chai, Peiwei; Jia, Renbing; Ge, Shengfang; Qian, Guanxiang; Zhang, He; Fan, Xianqun

    2017-05-03

    Uveal melanoma (UM) is an intraocular malignant tumor with a high mortality rate. Recent studies have shown the functions of long non-coding RNAs (lncRNAs) in tumorigenesis; thus, targeting tumor-specific lncRNA abnormalities has become an attractive approach for developing therapeutics to treat uveal melanoma. In this study, we identified a novel nuclear CANT1 lncRNA (CASC15-New-Transcript 1) that acts as a necessary UM suppressor. CANT1 significantly reduced tumor metastatic capacity and tumor formation, either in cell culture or in animals harboring tumor xenograft. Intriguingly, XIST lncRNA serves as a potential target of CANT1, and JPX or FTX lncRNA subsequently serves as a contextual hinge to activate a novel CANT1-JPX/FTX-XIST long non-coding (lncing) pathway in UM. Moreover, CANT1 triggers the expression of JPX and FTX by directly binding to their promoters and promoting H3K4 methylation. These observations delineate a novel lncing cascade in which lncRNAs directly build a lncing cascade without coding genes that aims to modulate UM tumorigenesis, thereby specifying a novel "lncing-cascade renewal" anti-tumor therapeutic strategy by correcting aberrant lncing cascade in uveal melanoma. Copyright © 2017 The American Society of Gene and Cell Therapy. Published by Elsevier Inc. All rights reserved.

  1. The long non-coding RNA PARTICLE is associated with WWOX and the absence of FRA16D breakage in osteosarcoma patients.

    PubMed

    O'Leary, Valerie Bríd; Maugg, Doris; Smida, Jan; Baumhoer, Daniel; Nathrath, Michaela; Ovsepian, Saak Victor; Atkinson, Michael John

    2017-10-20

    Breakage of the fragile site FRA16D disrupts the WWOX (WW Domain Containing Oxidoreductase) tumor suppressor gene in osteosarcoma. However, the frequency of breakage is not sufficient to explain the rate of WWOX loss in pathogenesis. The involvement of non-coding RNA transcripts is proposed due to their accumulation at fragile sites, where they are advocated to influence specific chromosomal regions associated with malignancy. The long ncRNA PARTICLE (promoter of MAT2A antisense radiation-induced circulating long non-coding RNA) is transiently elevated in response to irradiation and influences epigenetic silencing modification within WWOX . It now emerges that elevated PARTICLE levels are significantly associated with FRA16D non-breakage in OS patients. Although not associated with overall survival, high PARTICLE levels were found to be significantly linked to metastasis free outcome. The transcription of both PARTICLE and WWOX are transiently responsive to exposure to low doses of radiation in osteosarcoma cell lines. Herein, a relationship between WWOX and PARTICLE transcription is suggested in human osteosarcoma cell lines representing alternative genetic backgrounds. PARTICLE over-expression ameliorated WWOX promoter activity in U2OS harboring FRA16D non-breakage. It can be concluded that the lncRNA PARTICLE influences the WWOX tumor suppressor and in the absence of WWOX FRA16D breakage, it is associated with OS metastasis-free survival.

  2. Reduced expression of the long non-coding RNA AI364715 in gastric cancer and its clinical significance.

    PubMed

    Zhu, Shengqian; Mao, Jinqin; Shao, Yongfu; Chen, Fang; Zhu, Xiaoqin; Xu, Dingli; Zhang, Xinjun; Guo, Junming

    2015-09-01

    Long non-coding RNA (lncRNA), which is greater than 200 nucleotides, is a class of RNA molecules without protein coding function. In recent years, studies have shown that lncRNAs are associated with cancers. They are affecting the occurrence and development of cancers. However, the diagnostic significances of lncRNAs in gastric cancer are largely unknown. In this study, we focused on AI364715, one typical lncRNA. A total of 186 samples were collected from two cancer centers. To find the potential association between its level and gastric cancer, we first collected 75 paired gastric cancer tissues and normal tissues, which are 5 cm away from the edge of carcinoma. Besides, 18 human healthy gastric mucosa and 18 gastric precancerous lesions (dysplasia) were also collected. Quantitative reverse transcription-polymerase chain reaction (RT-PCR) was first used to detect the expression level of AI364715 at multiple stages of gastric tumorigenesis. Then, the relationships between AI364715 level and the clinicopathological factors of patients with gastric cancer were analyzed. The results showed that the expression level of AI364715 in gastric cancer tissues was downregulated. Meanwhile, its expression level was closely associated with tumor size and differentiation. More importantly, AI364715 expression level was significantly changed in dysplasia, the typical precancerous lesions. Taken together, AI364715 may be a potential biomarker for the diagnosis of gastric cancer.

  3. Exploration of sequence space as the basis of viral RNA genome segmentation.

    PubMed

    Moreno, Elena; Ojosnegros, Samuel; García-Arriaza, Juan; Escarmís, Cristina; Domingo, Esteban; Perales, Celia

    2014-05-06

    The mechanisms of viral RNA genome segmentation are unknown. On extensive passage of foot-and-mouth disease virus in baby hamster kidney-21 cells, the virus accumulated multiple point mutations and underwent a transition akin to genome segmentation. The standard single RNA genome molecule was replaced by genomes harboring internal in-frame deletions affecting the L- or capsid-coding region. These genomes were infectious and killed cells by complementation. Here we show that the point mutations in the nonstructural protein-coding region (P2, P3) that accumulated in the standard genome before segmentation increased the relative fitness of the segmented version relative to the standard genome. Fitness increase was documented by intracellular expression of virus-coded proteins and infectious progeny production by RNAs with the internal deletions placed in the sequence context of the parental and evolved genome. The complementation activity involved several viral proteins, one of them being the leader proteinase L. Thus, a history of genetic drift with accumulation of point mutations was needed to allow a major variation in the structure of a viral genome. Thus, exploration of sequence space by a viral genome (in this case an unsegmented RNA) can reach a point of the space in which a totally different genome structure (in this case, a segmented RNA) is favored over the form that performed the exploration.

  4. PACCMIT/PACCMIT-CDS: identifying microRNA targets in 3′ UTRs and coding sequences

    PubMed Central

    Šulc, Miroslav; Marín, Ray M.; Robins, Harlan S.; Vaníček, Jiří

    2015-01-01

    The purpose of the proposed web server, publicly available at http://paccmit.epfl.ch, is to provide a user-friendly interface to two algorithms for predicting messenger RNA (mRNA) molecules regulated by microRNAs: (i) PACCMIT (Prediction of ACcessible and/or Conserved MIcroRNA Targets), which identifies primarily mRNA transcripts targeted in their 3′ untranslated regions (3′ UTRs), and (ii) PACCMIT-CDS, designed to find mRNAs targeted within their coding sequences (CDSs). While PACCMIT belongs among the accurate algorithms for predicting conserved microRNA targets in the 3′ UTRs, the main contribution of the web server is 2-fold: PACCMIT provides an accurate tool for predicting targets also of weakly conserved or non-conserved microRNAs, whereas PACCMIT-CDS addresses the lack of similar portals adapted specifically for targets in CDS. The web server asks the user for microRNAs and mRNAs to be analyzed, accesses the precomputed P-values for all microRNA–mRNA pairs from a database for all mRNAs and microRNAs in a given species, ranks the predicted microRNA–mRNA pairs, evaluates their significance according to the false discovery rate and finally displays the predictions in a tabular form. The results are also available for download in several standard formats. PMID:25948580

  5. Genomewide analysis of Drosophila circular RNAs reveals their structural and sequence properties and age-dependent neural accumulation

    PubMed Central

    Westholm, Jakub O.; Miura, Pedro; Olson, Sara; Shenker, Sol; Joseph, Brian; Sanfilippo, Piero; Celniker, Susan E.; Graveley, Brenton R.; Lai, Eric C.

    2014-01-01

    Circularization was recently recognized to broadly expand transcriptome complexity. Here, we exploit massive Drosophila total RNA-sequencing data, >5 billion paired-end reads from >100 libraries covering diverse developmental stages, tissues and cultured cells, to rigorously annotate >2500 fruitfly circular RNAs. These mostly derive from back-splicing of protein-coding genes and lack poly(A) tails, and circularization of hundreds of genes is conserved across multiple Drosophila species. We elucidate structural and sequence properties of Drosophila circular RNAs, which exhibit commonalities and distinctions from mammalian circles. Notably, Drosophila circular RNAs harbor >1000 well-conserved canonical miRNA seed matches, especially within coding regions, and coding conserved miRNA sites reside preferentially within circularized exons. Finally, we analyze the developmental and tissue specificity of circular RNAs, and note their preferred derivation from neural genes and enhanced accumulation in neural tissues. Interestingly, circular isoforms increase dramatically relative to linear isoforms during CNS aging, and constitute a novel aging biomarker. PMID:25544350

  6. Genetic code translation displays a linear trade-off between efficiency and accuracy of tRNA selection.

    PubMed

    Johansson, Magnus; Zhang, Jingji; Ehrenberg, Måns

    2012-01-03

    Rapid and accurate translation of the genetic code into protein is fundamental to life. Yet due to lack of a suitable assay, little is known about the accuracy-determining parameters and their correlation with translational speed. Here, we develop such an assay, based on Mg(2+) concentration changes, to determine maximal accuracy limits for a complete set of single-mismatch codon-anticodon interactions. We found a simple, linear trade-off between efficiency of cognate codon reading and accuracy of tRNA selection. The maximal accuracy was highest for the second codon position and lowest for the third. The results rationalize the existence of proofreading in code reading and have implications for the understanding of tRNA modifications, as well as of translation error-modulating ribosomal mutations and antibiotics. Finally, the results bridge the gap between in vivo and in vitro translation and allow us to calibrate our test tube conditions to represent the environment inside the living cell.

  7. Non-coding RNAs' partitioning in the evolution of photosynthetic organisms via energy transduction and redox signaling.

    PubMed

    Kotakis, Christos

    2015-01-01

    Ars longa, vita brevis -Hippocrates Chloroplasts and mitochondria are genetically semi-autonomous organelles inside the plant cell. These constructions formed after endosymbiosis and keep evolving throughout the history of life. Experimental evidence is provided for active non-coding RNAs (ncRNAs) in these prokaryote-like structures, and a possible functional imprinting on cellular electrophysiology by those RNA entities is described. Furthermore, updated knowledge on RNA metabolism of organellar genomes uncovers novel inter-communication bridges with the nucleus. This class of RNA molecules is considered as a unique ontogeny which transforms their biological role as a genetic rheostat into a synchronous biochemical one that can affect the energetic charge and redox homeostasis inside cells. A hypothesis is proposed where such modulation by non-coding RNAs is integrated with genetic signals regulating gene transfer. The implications of this working hypothesis are discussed, with particular reference to ncRNAs involvement in the organellar and nuclear genomes evolution since their integrity is functionally coupled with redox signals in photosynthetic organisms.

  8. RNA catalysis and the origins of life

    NASA Technical Reports Server (NTRS)

    Orgel, Leslie E.

    1986-01-01

    The role of RNA catalysis in the origins of life is considered in connection with the discovery of riboszymes, which are RNA molecules that catalyze sequence-specific hydrolysis and transesterification reactions of RNA substrates. Due to this discovery, theories positing protein-free replication as preceding the appearance of the genetic code are more plausible. The scope of RNA catalysis in biology and chemistry is discussed, and it is noted that the development of methods to select (or predict) RNA sequences with preassigned catalytic functions would be a major contribution to the study of life's origins.

  9. RNA Helicase Associated with AU-rich Element (RHAU/DHX36) Interacts with the 3′-Tail of the Long Non-coding RNA BC200 (BCYRN1)*

    PubMed Central

    Booy, Evan P.; McRae, Ewan K. S.; Howard, Ryan; Deo, Soumya R.; Ariyo, Emmanuel O.; Dzananovic, Edis; Meier, Markus; Stetefeld, Jörg; McKenna, Sean A.

    2016-01-01

    RNA helicase associated with AU-rich element (RHAU) is an ATP-dependent RNA helicase that demonstrates high affinity for quadruplex structures in DNA and RNA. To elucidate the significance of these quadruplex-RHAU interactions, we have performed RNA co-immunoprecipitation screens to identify novel RNAs bound to RHAU and characterize their function. In the course of this study, we have identified the non-coding RNA BC200 (BCYRN1) as specifically enriched upon RHAU immunoprecipitation. Although BC200 does not adopt a quadruplex structure and does not bind the quadruplex-interacting motif of RHAU, it has direct affinity for RHAU in vitro. Specifically designed BC200 truncations and RNase footprinting assays demonstrate that RHAU binds to an adenosine-rich region near the 3′-end of the RNA. RHAU truncations support binding that is dependent upon a region within the C terminus and is specific to RHAU isoform 1. Tests performed to assess whether BC200 interferes with RHAU helicase activity have demonstrated the ability of BC200 to act as an acceptor of unwound quadruplexes via a cytosine-rich region near the 3′-end of the RNA. Furthermore, an interaction between BC200 and the quadruplex-containing telomerase RNA was confirmed by pull-down assays of the endogenous RNAs. This leads to the possibility that RHAU may direct BC200 to bind and exert regulatory functions at quadruplex-containing RNA or DNA sequences. PMID:26740632

  10. The role of sequence context, nucleotide pool balance and stress in 2′-deoxynucleotide misincorporation in viral, bacterial and mammalian RNA

    PubMed Central

    Wang, Jin; Dong, Hongping; Chionh, Yok Hian; McBee, Megan E.; Sirirungruang, Sasilada; Cunningham, Richard P.; Shi, Pei-Yong; Dedon, Peter C.

    2016-01-01

    The misincorporation of 2′-deoxyribonucleotides (dNs) into RNA has important implications for the function of non-coding RNAs, the translational fidelity of coding RNAs and the mutagenic evolution of viral RNA genomes. However, quantitative appreciation for the degree to which dN misincorporation occurs is limited by the lack of analytical tools. Here, we report a method to hydrolyze RNA to release 2′-deoxyribonucleotide-ribonucleotide pairs (dNrN) that are then quantified by chromatography-coupled mass spectrometry (LC-MS). Using this platform, we found misincorporated dNs occurring at 1 per 103 to 105 ribonucleotide (nt) in mRNA, rRNAs and tRNA in human cells, Escherichia coli, Saccharomyces cerevisiae and, most abundantly, in the RNA genome of dengue virus. The frequency of dNs varied widely among organisms and sequence contexts, and partly reflected the in vitro discrimination efficiencies of different RNA polymerases against 2′-deoxyribonucleoside 5′-triphosphates (dNTPs). Further, we demonstrate a strong link between dN frequencies in RNA and the balance of dNTPs and ribonucleoside 5′-triphosphates (rNTPs) in the cellular pool, with significant stress-induced variation of dN incorporation. Potential implications of dNs in RNA are discussed, including the possibilities of dN incorporation in RNA as a contributing factor in viral evolution and human disease, and as a host immune defense mechanism against viral infections. PMID:27365049

  11. Dynamic landscape and regulation of RNA editing in mammals

    PubMed Central

    Tan, Meng How; Li, Qin; Shanmugam, Raghuvaran; Piskol, Robert; Kohler, Jennefer; Young, Amy N.; Liu, Kaiwen Ivy; Zhang, Rui; Ramaswami, Gokul; Ariyoshi, Kentaro; Gupte, Ankita; Keegan, Liam P.; George, Cyril X.; Ramu, Avinash; Huang, Ni; Pollina, Elizabeth A.; Leeman, Dena S.; Rustighi, Alessandra; Sharon Goh, Y. P.; Chawla, Ajay; Del Sal, Giannino; Peltz, Gary; Brunet, Anne; Conrad, Donald F.; Samuel, Charles E.; O’Connell, Mary A.; Walkley, Carl R.; Nishikura, Kazuko; Li, Jin Billy

    2017-01-01

    Adenosine-to-inosine (A-to-I) RNA editing is a conserved post-transcriptional mechanism mediated by ADAR enzymes that diversifies the transcriptome by altering selected nucleotides in RNA molecules1. Although many editing sites have recently been discovered2–7, the extent to which most sites are edited and how the editing is regulated in different biological contexts are not fully understood8–10. Here we report dynamic spatiotemporal patterns and new regulators of RNA editing, discovered through an extensive profiling of A-to-I RNA editing in 8,551 human samples (representing 53 body sites from 552 individuals) from the Genotype-Tissue Expression (GTEx) project and in hundreds of other primate and mouse samples. We show that editing levels in non-repetitive coding regions vary more between tissues than editing levels in repetitive regions. Globally, ADAR1 is the primary editor of repetitive sites and ADAR2 is the primary editor of non-repetitive coding sites, whereas the catalytically inactive ADAR3 predominantly acts as an inhibitor of editing. Cross-species analysis of RNA editing in several tissues revealed that species, rather than tissue type, is the primary determinant of editing levels, suggesting stronger cis-directed regulation of RNA editing for most sites, although the small set of conserved coding sites is under stronger trans-regulation. In addition, we curated an extensive set of ADAR1 and ADAR2 targets and showed that many editing sites display distinct tissue-specific regulation by the ADAR enzymes in vivo. Further analysis of the GTEx data revealed several potential regulators of editing, such as AIMP2, which reduces editing in muscles by enhancing the degradation of the ADAR proteins. Collectively, our work provides insights into the complex cis- and trans-regulation of A-to-I editing. PMID:29022589

  12. Dynamic landscape and regulation of RNA editing in mammals.

    PubMed

    Tan, Meng How; Li, Qin; Shanmugam, Raghuvaran; Piskol, Robert; Kohler, Jennefer; Young, Amy N; Liu, Kaiwen Ivy; Zhang, Rui; Ramaswami, Gokul; Ariyoshi, Kentaro; Gupte, Ankita; Keegan, Liam P; George, Cyril X; Ramu, Avinash; Huang, Ni; Pollina, Elizabeth A; Leeman, Dena S; Rustighi, Alessandra; Goh, Y P Sharon; Chawla, Ajay; Del Sal, Giannino; Peltz, Gary; Brunet, Anne; Conrad, Donald F; Samuel, Charles E; O'Connell, Mary A; Walkley, Carl R; Nishikura, Kazuko; Li, Jin Billy

    2017-10-11

    Adenosine-to-inosine (A-to-I) RNA editing is a conserved post-transcriptional mechanism mediated by ADAR enzymes that diversifies the transcriptome by altering selected nucleotides in RNA molecules. Although many editing sites have recently been discovered, the extent to which most sites are edited and how the editing is regulated in different biological contexts are not fully understood. Here we report dynamic spatiotemporal patterns and new regulators of RNA editing, discovered through an extensive profiling of A-to-I RNA editing in 8,551 human samples (representing 53 body sites from 552 individuals) from the Genotype-Tissue Expression (GTEx) project and in hundreds of other primate and mouse samples. We show that editing levels in non-repetitive coding regions vary more between tissues than editing levels in repetitive regions. Globally, ADAR1 is the primary editor of repetitive sites and ADAR2 is the primary editor of non-repetitive coding sites, whereas the catalytically inactive ADAR3 predominantly acts as an inhibitor of editing. Cross-species analysis of RNA editing in several tissues revealed that species, rather than tissue type, is the primary determinant of editing levels, suggesting stronger cis-directed regulation of RNA editing for most sites, although the small set of conserved coding sites is under stronger trans-regulation. In addition, we curated an extensive set of ADAR1 and ADAR2 targets and showed that many editing sites display distinct tissue-specific regulation by the ADAR enzymes in vivo. Further analysis of the GTEx data revealed several potential regulators of editing, such as AIMP2, which reduces editing in muscles by enhancing the degradation of the ADAR proteins. Collectively, our work provides insights into the complex cis- and trans-regulation of A-to-I editing.

  13. Altered long non-coding RNA expression profile in rabbit atria with atrial fibrillation: TCONS_00075467 modulates atrial electrical remodeling by sponging miR-328 to regulate CACNA1C.

    PubMed

    Li, Zhan; Wang, Ximin; Wang, Weizong; Du, Juanjuan; Wei, Jinqiu; Zhang, Yong; Wang, Jiangrong; Hou, Yinglong

    2017-07-01

    Electrical remodeling has been reported to play a major role in the initiation and maintenance of atrial fibrillation (AF). Long non-coding RNAs (lncRNAs) have been increasingly recognized as contributors to the pathology of heart diseases. However, the roles and mechanisms of lncRNAs in electrical remodeling during AF remain unknown. In this study, the lncRNA expression profiles of right atria were investigated in AF and non-AF rabbit models by using RNA sequencing technique and validated using quantitative real-time polymerase chain reaction (qRT-PCR). A total of 99,843 putative new lncRNAs were identified, in which 1220 differentially expressed transcripts exhibited >2-fold change. Bioinformatics analysis was conducted to predict the functions and interactions of the aberrantly expressed genes. On the basis of a series of filtering pipelines, one lncRNA, TCONS_00075467, was selected to explore its effects and mechanisms on electrical remodeling. The atrial effective refractory period was shortened in vivo and the L-type calcium current and action potential duration were decreased in vitro by silencing of TCONS_00075467 with lentiviruses. Besides, the expression of miRNA-328 was negatively correlated with TCONS_00075467. We further demonstrated that TCONS_00075467 could sponge miRNA-328 in vitro and in vivo to regulate the downstream protein coding gene CACNA1C. In addition, miRNA-328 could partly reverse the effects of TCONS_00075467 on electrical remodeling. In summary, dysregulated lncRNAs may play important roles in modulating electrical remodeling during AF. Our study may facilitate the mechanism studies of lncRNAs in AF pathogenesis and provide potential therapeutic targets for AF. Copyright © 2017 Elsevier Ltd. All rights reserved.

  14. Systematic analyses reveal long non-coding RNA (PTAF)-mediated promotion of EMT and invasion-metastasis in serous ovarian cancer.

    PubMed

    Liang, Haihai; Zhao, Xiaoguang; Wang, Chengyu; Sun, Jian; Chen, Yingzhun; Wang, Guoyuan; Fang, Lei; Yang, Rui; Yu, Mengxue; Gu, Yunyan; Shan, Hongli

    2018-06-21

    A deeper mechanistic understanding of epithelial-to-mesenchymal transition (EMT) regulation is needed to improve current anti-metastasis strategies in ovarian cancer (OvCa). This study was designed to investigate the role of lncRNAs in EMT regulation during process of invasion-metastasis in serous OvCa to improve current anti-metastasis strategies for OvCa. We systematically analyzes high-throughput gene expression profiles of both lncRNAs and protein-coding genes in OvCa samples with integrated epithelial (iE) subtype and integrated mesenchymal (iM) subtype labels. Mouse models, cytobiology, molecular biology assays and clinical samples were performed to elucidate the function and underlying mechanisms of lncRNA PTAF-mediated promotion of EMT and invasion-metastasis in serous OvCa. We constructed a lncRNA-mediated competing endogenous RNA (ceRNA) regulatory network that affects the expression of many EMT-related protein-coding genes in mesenchymal OvCa. Using a combination of in vitro and in vivo studies, we provided evidence that the lncRNA PTAF-miR-25-SNAI2 axis controlled EMT in OvCa. Our results revealed that up-regulated PTAF induced elevated SNAI2 expression by competitively binding to miR-25, which in turn promoted OvCa cell EMT and invasion. Moreover, we found that silencing of PTAF inhibited tumor progression and metastasis in an orthotopic mouse model of OvCa. We then observed a significant correlation between PTAF expression and EMT markers in OvCa patients. The lncRNA PTAF, a mediator of TGF-β signaling, can predispose OvCa patients to metastases and may serve as a potential target for anti-metastatic therapies for mesenchymal OvCa patients.

  15. PLMItRNA, a database on the heterogeneous genetic origin of mitochondrial tRNA genes and tRNAs in photosynthetic eukaryotes.

    PubMed

    Rainaldi, Guglielmo; Volpicella, Mariateresa; Licciulli, Flavio; Liuni, Sabino; Gallerani, Raffaele; Ceci, Luigi R

    2003-01-01

    The updated version of PLMItRNA reports information and multialignments on 609 genes and 34 tRNA molecules active in the mitochondria of Viridiplantae (27 Embryophyta and 10 Chlorophyta), and photosynthetic algae (one Cryptophyta, four Rhodophyta and two Stramenopiles). Colour-code based tables reporting the different genetic origin of identified genes allow hyper-textual link to single entries. Promoter sequences identified for tRNA genes in the mitochondrial genomes of Angiospermae are also reported. The PLMItRNA database is accessible at http://bighost.area.ba.cnr.it/PLMItRNA/.

  16. Non-coding RNAs in lung cancer

    PubMed Central

    Ricciuti, Biagio; Mecca, Carmen; Crinò, Lucio; Baglivo, Sara; Cenci, Matteo; Metro, Giulio

    2014-01-01

    The discovery that protein-coding genes represent less than 2% of all human genome, and the evidence that more than 90% of it is actively transcribed, changed the classical point of view of the central dogma of molecular biology, which was always based on the assumption that RNA functions mainly as an intermediate bridge between DNA sequences and protein synthesis machinery. Accumulating data indicates that non-coding RNAs are involved in different physiological processes, providing for the maintenance of cellular homeostasis. They are important regulators of gene expression, cellular differentiation, proliferation, migration, apoptosis, and stem cell maintenance. Alterations and disruptions of their expression or activity have increasingly been associated with pathological changes of cancer cells, this evidence and the prospect of using these molecules as diagnostic markers and therapeutic targets, make currently non-coding RNAs among the most relevant molecules in cancer research. In this paper we will provide an overview of non-coding RNA function and disruption in lung cancer biology, also focusing on their potential as diagnostic, prognostic and predictive biomarkers. PMID:25593996

  17. Identification of small non-coding RNA classes expressed in swine whole blood during HP-PRRSV infection.

    PubMed

    Fleming, Damarius S; Miller, Laura C

    2018-04-01

    It has been established that reduced susceptibility to porcine reproductive and respiratory syndrome virus (PRRSV) has a genetic component. This genetic component may take the form of small non-coding RNAs (sncRNA), which are molecules that function as regulators of gene expression. Various sncRNAs have emerged as having an important role in the immune system in humans. The study uses transcriptomic read counts to profile the type and quantity of both well and lesser characterized sncRNAs, such as microRNAs and small nucleolar RNAs to identify and quantify the classes of sncRNA expressed in whole blood between healthy and highly pathogenic PRRSV-infected pigs. Our results returned evidence on nine classes of sncRNA, four of which were consistently statistically significantly different based on Fisher's Exact Test, that can be detected and possibly interrogated for their effect on host dysregulation during PRRSV infections. Published by Elsevier Inc.

  18. Circular RNA profiling reveals that circular RNAs from ANXA2 can be used as new biomarkers for multiple sclerosis.

    PubMed

    Iparraguirre, Leire; Muñoz-Culla, Maider; Prada-Luengo, Iñigo; Castillo-Triviño, Tamara; Olascoaga, Javier; Otaegui, David

    2017-09-15

    Multiple sclerosis is an autoimmune disease, with higher prevalence in women, in whom the immune system is dysregulated. This dysregulation has been shown to correlate with changes in transcriptome expression as well as in gene-expression regulators, such as non-coding RNAs (e.g. microRNAs). Indeed, some of these have been suggested as biomarkers for multiple sclerosis even though few biomarkers have reached the clinical practice. Recently, a novel family of non-coding RNAs, circular RNAs, has emerged as a new player in the complex network of gene-expression regulation. MicroRNA regulation function through a 'sponge system' and a RNA splicing regulation function have been proposed for the circular RNAs. This regulating role together with their high stability in biofluids makes them seemingly good candidates as biomarkers. Given the dysregulation of both protein-coding and non-coding transcriptome that have been reported in multiple sclerosis patients, we hypothesised that circular RNA expression may also be altered. Therefore, we carried out expression profiling of 13.617 circular RNAs in peripheral blood leucocytes from multiple sclerosis patients and healthy controls finding 406 differentially expressed (P-value < 0.05, Fold change > 1.5) and demonstrate after validation that, circ_0005402 and circ_0035560 are underexpressed in multiple sclerosis patients and could be used as biomarkers of the disease. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  19. Genome-wide identification and functional prediction of nitrogen-responsive intergenic and intronic long non-coding RNAs in maize (Zea mays L.).

    PubMed

    Lv, Yuanda; Liang, Zhikai; Ge, Min; Qi, Weicong; Zhang, Tifu; Lin, Feng; Peng, Zhaohua; Zhao, Han

    2016-05-11

    Nitrogen (N) is an essential and often limiting nutrient to plant growth and development. Previous studies have shown that the mRNA expressions of numerous genes are regulated by nitrogen supplies; however, little is known about the expressed non-coding elements, for example long non-coding RNAs (lncRNAs) that control the response of maize (Zea mays L.) to nitrogen. LncRNAs are a class of non-coding RNAs larger than 200 bp, which have emerged as key regulators in gene expression. In this study, we surveyed the intergenic/intronic lncRNAs in maize B73 leaves at the V7 stage under conditions of N-deficiency and N-sufficiency using ribosomal RNA depletion and ultra-deep total RNA sequencing approaches. By integration with mRNA expression profiles and physiological evaluations, 7245 lncRNAs and 637 nitrogen-responsive lncRNAs were identified that exhibited unique expression patterns. Co-expression network analysis showed that the nitrogen-responsive lncRNAs were enriched mainly in one of the three co-expressed modules. The genes in the enriched module are mainly involved in NADH dehydrogenase activity, oxidative phosphorylation and the nitrogen compounds metabolic process. We identified a large number of lncRNAs in maize and illustrated their potential regulatory roles in response to N stress. The results lay the foundation for further in-depth understanding of the molecular mechanisms of lncRNAs' role in response to nitrogen stresses.

  20. Both coding exons of the c-myc gene contribute to its posttranscriptional regulation in the quiescent liver and regenerating liver and after protein synthesis inhibition.

    PubMed Central

    Lavenu, A; Pistoi, S; Pournin, S; Babinet, C; Morello, D

    1995-01-01

    In vivo, the steady-state level of c-myc mRNA is mainly controlled by posttranscriptional mechanisms. Using a panel of transgenic mice in which various versions of the human c-myc proto-oncogene were under the control of major histocompatibility complex H-2Kb class I regulatory sequences, we have shown that the 5' and the 3' noncoding sequences are dispensable for obtaining a regulated expression of the transgene in adult quiescent tissues, at the start of liver regeneration, and after inhibition of protein synthesis. These results indicated that the coding sequences were sufficient to ensure a regulated c-myc expression. In the present study, we have pursued this analysis with transgenes containing one or the other of the two c-myc coding exons either alone or in association with the c-myc 3' untranslated region. We demonstrate that each of the exons contains determinants which control c-myc mRNA expression. Moreover, we show that in the liver, c-myc exon 2 sequences are able to down-regulate an otherwise stable H-2K mRNA when embedded within it and to induce its transient accumulation after cycloheximide treatment and soon after liver ablation. Finally, the use of transgenes with different coding capacities has allowed us to postulate that the primary mRNA sequence itself and not c-Myc peptides is an important component of c-myc posttranscriptional regulation. PMID:7623834

Top