highly transcribed noncoding: Topics by Science.gov

Sample records for highly transcribed noncoding

The primary transcriptome of the marine diazotroph Trichodesmium erythraeum IMS101

NASA Astrophysics Data System (ADS)

Pfreundt, Ulrike; Kopf, Matthias; Belkin, Natalia; Berman-Frank, Ilana; Hess, Wolfgang R.

2014-08-01

Blooms of the dinitrogen-fixing marine cyanobacterium Trichodesmium considerably contribute to new nitrogen inputs into tropical oceans. Intriguingly, only 60% of the Trichodesmium erythraeum IMS101 genome sequence codes for protein, compared with ~85% in other sequenced cyanobacterial genomes. The extensive non-coding genome fraction suggests space for an unusually high number of unidentified, potentially regulatory non-protein-coding RNAs (ncRNAs). To identify the transcribed fraction of the genome, here we present a genome-wide map of transcriptional start sites (TSS) at single nucleotide resolution, revealing the activity of 6,080 promoters. We demonstrate that T. erythraeum has the highest number of actively splicing group II introns and the highest percentage of TSS yielding ncRNAs of any bacterium examined to date. We identified a highly transcribed retroelement that serves as template repeat for the targeted mutation of at least 12 different genes by mutagenic homing. Our findings explain the non-coding portion of the T. erythraeum genome by the transcription of an unusually high number of non-coding transcripts in addition to the known high incidence of transposable elements. We conclude that riboregulation and RNA maturation-dependent processes constitute a major part of the Trichodesmium regulatory apparatus.
Highly conserved elements discovered in vertebrates are present in non-syntenic loci of tunicates, act as enhancers and can be transcribed during development

PubMed Central

Sanges, Remo; Hadzhiev, Yavor; Gueroult-Bellone, Marion; Roure, Agnes; Ferg, Marco; Meola, Nicola; Amore, Gabriele; Basu, Swaraj; Brown, Euan R.; De Simone, Marco; Petrera, Francesca; Licastro, Danilo; Strähle, Uwe; Banfi, Sandro; Lemaire, Patrick; Birney, Ewan; Müller, Ferenc; Stupka, Elia

2013-01-01

Co-option of cis-regulatory modules has been suggested as a mechanism for the evolution of expression sites during development. However, the extent and mechanisms involved in mobilization of cis-regulatory modules remains elusive. To trace the history of non-coding elements, which may represent candidate ancestral cis-regulatory modules affirmed during chordate evolution, we have searched for conserved elements in tunicate and vertebrate (Olfactores) genomes. We identified, for the first time, 183 non-coding sequences that are highly conserved between the two groups. Our results show that all but one element are conserved in non-syntenic regions between vertebrate and tunicate genomes, while being syntenic among vertebrates. Nevertheless, in all the groups, they are significantly associated with transcription factors showing specific functions fundamental to animal development, such as multicellular organism development and sequence-specific DNA binding. The majority of these regions map onto ultraconserved elements and we demonstrate that they can act as functional enhancers within the organism of origin, as well as in cross-transgenesis experiments, and that they are transcribed in extant species of Olfactores. We refer to the elements as ‘Olfactores conserved non-coding elements’. PMID:23393190
Identification and Functional Prediction of Large Intergenic Noncoding RNAs (lincRNAs) in Rainbow Trout (Oncorhynchus mykiss)

USDA-ARS?s Scientific Manuscript database

Long noncoding RNAs (lncRNAs) have been recognized in recent years as key regulators of diverse cellular processes. Genome-wide large-scale projects have uncovered thousands of lncRNAs in many model organisms. Large intergenic noncoding RNAs (lincRNAs) are lncRNAs that are transcribed from intergeni...
Long non-coding RNA produced by RNA polymerase V determines boundaries of heterochromatin

PubMed Central

Böhmdorfer, Gudrun; Sethuraman, Shriya; Rowley, M Jordan; Krzyszton, Michal; Rothi, M Hafiz; Bouzit, Lilia; Wierzbicki, Andrzej T

2016-01-01

RNA-mediated transcriptional gene silencing is a conserved process where small RNAs target transposons and other sequences for repression by establishing chromatin modifications. A central element of this process are long non-coding RNAs (lncRNA), which in Arabidopsis thaliana are produced by a specialized RNA polymerase known as Pol V. Here we show that non-coding transcription by Pol V is controlled by preexisting chromatin modifications located within the transcribed regions. Most Pol V transcripts are associated with AGO4 but are not sliced by AGO4. Pol V-dependent DNA methylation is established on both strands of DNA and is tightly restricted to Pol V-transcribed regions. This indicates that chromatin modifications are established in close proximity to Pol V. Finally, Pol V transcription is preferentially enriched on edges of silenced transposable elements, where Pol V transcribes into TEs. We propose that Pol V may play an important role in the determination of heterochromatin boundaries. DOI: http://dx.doi.org/10.7554/eLife.19092.001 PMID:27779094
[Long non-coding RNAs in the pathophysiology of atherosclerosis].

PubMed

Novak, Jan; Vašků, Julie Bienertová; Souček, Miroslav

2018-01-01

The human genome contains about 22 000 protein-coding genes that are transcribed to an even larger amount of messenger RNAs (mRNA). Interestingly, the results of the project ENCODE from 2012 show, that despite up to 90 % of our genome being actively transcribed, protein-coding mRNAs make up only 2-3 % of the total amount of the transcribed RNA. The rest of RNA transcripts is not translated to proteins and that is why they are referred to as "non-coding RNAs". Earlier the non-coding RNA was considered "the dark matter of genome", or "the junk", whose genes has accumulated in our DNA during the course of evolution. Today we already know that non-coding RNAs fulfil a variety of regulatory functions in our body - they intervene into epigenetic processes from chromatin remodelling to histone methylation, or into the transcription process itself, or even post-transcription processes. Long non-coding RNAs (lncRNA) are one of the classes of non-coding RNAs that have more than 200 nucleotides in length (non-coding RNAs with less than 200 nucleotides in length are called small non-coding RNAs). lncRNAs represent a widely varied and large group of molecules with diverse regulatory functions. We can identify them in all thinkable cell types or tissues, or even in an extracellular space, which includes blood, specifically plasma. Their levels change during the course of organogenesis, they are specific to different tissues and their changes also occur along with the development of different illnesses, including atherosclerosis. This review article aims to present lncRNAs problematics in general and then focuses on some of their specific representatives in relation to the process of atherosclerosis (i.e. we describe lncRNA involvement in the biology of endothelial cells, vascular smooth muscle cells or immune cells), and we further describe possible clinical potential of lncRNA, whether in diagnostics or therapy of atherosclerosis and its clinical manifestations.Key words: atherosclerosis - lincRNA - lncRNA - MALAT - MIAT.
Birth, coming of age and death: The intriguing life of long noncoding RNAs.

PubMed

Samudyata; Castelo-Branco, Gonçalo; Bonetti, Alessandro

2018-07-01

Mammalian genomes are pervasively transcribed, with long noncoding RNAs being the most abundant fraction. Recent studies have highlighted the central role played by these transcripts in several physiological and pathological processes. Despite several metabolic features shared between coding and noncoding transcripts, these two classes of RNAs exhibit multiple differences regarding their biogenesis and processing. Here we review such distinctions, focusing on the unique features of specific long noncoding RNAs. Copyright © 2017 Elsevier Ltd. All rights reserved.
The Long Non-coding RNA HOTTIP Enhances Pancreatic Cancer Cell Proliferation, Survival and Migration

EPA Science Inventory

ABSTRACTHOTTIP is a long non-coding RNA (lncRNA) transcribed from the 5' tip of the HOXA locus and is associated with the polycomb repressor complex 2 (PRC2) and WD repeat containing protein 5 (WDR5)/mixed lineage leukemia 1 (MLL1) chromatin modifying complexes. HOTTIP is expres...
RNA Polymerase III promoter screen uncovers a novel noncoding RNA family conserved in Caenorhabditis and other clade V nematodes.

PubMed

Gruber, Andreas R

2014-07-10

RNA Polymerase III is a highly specialized enzyme complex responsible for the transcription of a very distinct set of housekeeping noncoding RNAs including tRNAs, 7SK snRNA, Y RNAs, U6 snRNA, and the RNA components of RNaseP and RNaseMRP. In this work we have utilized the conserved promoter structure of known RNA Polymerase III transcripts consisting of characteristic sequence elements termed proximal sequence elements (PSE) A and B and a TATA-box to uncover a novel RNA Polymerase III-transcribed, noncoding RNA family found to be conserved in Caenorhabditis as well as other clade V nematode species. Homology search in combination with detailed sequence and secondary structure analysis revealed that members of this novel ncRNA family evolve rapidly, and only maintain a potentially functional small stem structure that links the 5' end to the very 3' end of the transcript and a small hairpin structure at the 3' end. This is most likely required for efficient transcription termination. In addition, our study revealed evidence that canonical C/D box snoRNAs are also transcribed from a PSE A-PSE B-TATA-box promoter in Caenorhabditis elegans. Copyright © 2014 Elsevier B.V. All rights reserved.
[Relevance of long non-coding RNAs in tumour biology].

PubMed

Nagy, Zoltán; Szabó, Diána Rita; Zsippai, Adrienn; Falus, András; Rácz, Károly; Igaz, Péter

2012-09-23

The discovery of the biological relevance of non-coding RNA molecules represents one of the most significant advances in contemporary molecular biology. It has turned out that a major fraction of the non-coding part of the genome is transcribed. Beside small RNAs (including microRNAs) more and more data are disclosed concerning long non-coding RNAs of 200 nucleotides to 100 kb length that are implicated in the regulation of several basic molecular processes (cell proliferation, chromatin functioning, microRNA-mediated effects, etc.). Some of these long non-coding RNAs have been associated with human tumours, including H19, HOTAIR, MALAT1, etc., the different expression of which has been noted in various neoplasms relative to healthy tissues. Long non-coding RNAs may represent novel markers of molecular diagnostics and they might even turn out to be targets of therapeutic intervention.
Technological Developments in lncRNA Biology.

PubMed

Jathar, Sonali; Kumar, Vikram; Srivastava, Juhi; Tripathi, Vidisha

2017-01-01

It is estimated that more than 90% of the mammalian genome is transcribed as non-coding RNAs. Recent evidences have established that these non-coding transcripts are not junk or just transcriptional noise, but they do serve important biological purpose. One of the rapidly expanding fields of this class of transcripts is the regulatory lncRNAs, which had been a major challenge in terms of their molecular functions and mechanisms of action. The emergence of high-throughput technologies and the development in various conventional approaches have led to the expansion of the lncRNA world. The combination of multidisciplinary approaches has proven to be essential to unravel the complexity of their regulatory networks and helped establish the importance of their existence. Here, we review the current methodologies available for discovering and investigating functions of long non-coding RNAs (lncRNAs) and focus on the powerful technological advancement available to specifically address their functional importance.
Conserved Non-Coding Sequences are Associated with Rates of mRNA Decay in Arabidopsis.

PubMed

Spangler, Jacob B; Feltus, Frank Alex

2013-01-01

Steady-state mRNA levels are tightly regulated through a combination of transcriptional and post-transcriptional control mechanisms. The discovery of cis-acting DNA elements that encode these control mechanisms is of high importance. We have investigated the influence of conserved non-coding sequences (CNSs), DNA patterns retained after an ancient whole genome duplication event, on the breadth of gene expression and the rates of mRNA decay in Arabidopsis thaliana. The absence of CNSs near α duplicate genes was associated with a decrease in breadth of gene expression and slower mRNA decay rates while the presence CNSs near α duplicates was associated with an increase in breadth of gene expression and faster mRNA decay rates. The observed difference in mRNA decay rate was fastest in genes with CNSs in both non-transcribed and transcribed regions, albeit through an unknown mechanism. This study supports the notion that some Arabidopsis CNSs regulate the steady-state mRNA levels through post-transcriptional control mechanisms and that CNSs also play a role in controlling the breadth of gene expression.
Conserved Non-Coding Sequences are Associated with Rates of mRNA Decay in Arabidopsis

PubMed Central

Spangler, Jacob B.; Feltus, Frank Alex

2013-01-01

Steady-state mRNA levels are tightly regulated through a combination of transcriptional and post-transcriptional control mechanisms. The discovery of cis-acting DNA elements that encode these control mechanisms is of high importance. We have investigated the influence of conserved non-coding sequences (CNSs), DNA patterns retained after an ancient whole genome duplication event, on the breadth of gene expression and the rates of mRNA decay in Arabidopsis thaliana. The absence of CNSs near α duplicate genes was associated with a decrease in breadth of gene expression and slower mRNA decay rates while the presence CNSs near α duplicates was associated with an increase in breadth of gene expression and faster mRNA decay rates. The observed difference in mRNA decay rate was fastest in genes with CNSs in both non-transcribed and transcribed regions, albeit through an unknown mechanism. This study supports the notion that some Arabidopsis CNSs regulate the steady-state mRNA levels through post-transcriptional control mechanisms and that CNSs also play a role in controlling the breadth of gene expression. PMID:23675377
Noncoding RNAs in DNA Repair and Genome Integrity

PubMed Central

Wan, Guohui; Liu, Yunhua; Han, Cecil; Zhang, Xinna

2014-01-01

Abstract Significance: The well-studied sequences in the human genome are those of protein-coding genes, which account for only 1%–2% of the total genome. However, with the advent of high-throughput transcriptome sequencing technology, we now know that about 90% of our genome is extensively transcribed and that the vast majority of them are transcribed into noncoding RNAs (ncRNAs). It is of great interest and importance to decipher the functions of these ncRNAs in humans. Recent Advances: In the last decade, it has become apparent that ncRNAs play a crucial role in regulating gene expression in normal development, in stress responses to internal and environmental stimuli, and in human diseases. Critical Issues: In addition to those constitutively expressed structural RNA, such as ribosomal and transfer RNAs, regulatory ncRNAs can be classified as microRNAs (miRNAs), Piwi-interacting RNAs (piRNAs), small interfering RNAs (siRNAs), small nucleolar RNAs (snoRNAs), and long noncoding RNAs (lncRNAs). However, little is known about the biological features and functional roles of these ncRNAs in DNA repair and genome instability, although a number of miRNAs and lncRNAs are regulated in the DNA damage response. Future Directions: A major goal of modern biology is to identify and characterize the full profile of ncRNAs with regard to normal physiological functions and roles in human disorders. Clinically relevant ncRNAs will also be evaluated and targeted in therapeutic applications. Antioxid. Redox Signal. 20, 655–677. PMID:23879367
3' terminal diversity of MRP RNA and other human noncoding RNAs revealed by deep sequencing.

PubMed

Goldfarb, Katherine C; Cech, Thomas R

2013-09-21

Post-transcriptional 3' end processing is a key component of RNA regulation. The abundant and essential RNA subunit of RNase MRP has been proposed to function in three distinct cellular compartments and therefore may utilize this mode of regulation. Here we employ 3' RACE coupled with high-throughput sequencing to characterize the 3' terminal sequences of human MRP RNA and other noncoding RNAs that form RNP complexes. The 3' terminal sequence of MRP RNA from HEK293T cells has a distinctive distribution of genomically encoded termini (including an assortment of U residues) with a portion of these selectively tagged by oligo(A) tails. This profile contrasts with the relatively homogenous 3' terminus of an in vitro transcribed MRP RNA control and the differing 3' terminal profiles of U3 snoRNA, RNase P RNA, and telomerase RNA (hTR). 3' RACE coupled with deep sequencing provides a valuable framework for the functional characterization of 3' terminal sequences of noncoding RNAs.
Decoding the function of nuclear long non-coding RNAs.

PubMed

Chen, Ling-Ling; Carmichael, Gordon G

2010-06-01

Long non-coding RNAs (lncRNAs) are mRNA-like, non-protein-coding RNAs that are pervasively transcribed throughout eukaryotic genomes. Rather than silently accumulating in the nucleus, many of these are now known or suspected to play important roles in nuclear architecture or in the regulation of gene expression. In this review, we highlight some recent progress in how lncRNAs regulate these important nuclear processes at the molecular level. Copyright 2010 Elsevier Ltd. All rights reserved.
Scanning the Human Genome for Novel Therapeutic Targets for Breast Cancer

DTIC Science & Technology

2006-04-01

action of this class of non-coding regulatory RNAs13,14. MicroRNAs are transcribed by RNA polymerase II as long primary polyadenylated transcripts...Artificial miRNAs can be expressed from both RNA polymerase II and III promoters resulting in silencing to varying degrees. At present there...the highest levels of mature microRNA in RISC and generally effective silencing. These structures can be transcribed by either RNA polymerase II or
Noncoding RNAs of the Ultrabithorax Domain of the Drosophila Bithorax Complex

PubMed Central

Pease, Benjamin; Borges, Ana C.; Bender, Welcome

2013-01-01

RNA transcripts without obvious coding potential are widespread in many creatures, including the fruit fly, Drosophila melanogaster. Several noncoding RNAs have been identified within the Drosophila bithorax complex. These first appear in blastoderm stage embryos, and their expression patterns indicate that they are transcribed only from active domains of the bithorax complex. It has been suggested that these noncoding RNAs have a role in establishing active domains, perhaps by setting the state of Polycomb Response Elements A comprehensive survey across the proximal half of the bithorax complex has now revealed nine distinct noncoding RNA transcripts, including four within the Ultrabithorax transcription unit. At the blastoderm stage, the noncoding transcripts collectively span ∼75% of the 135 kb surveyed. Recombination-mediated cassette exchange was used to invert the promoter of one of the noncoding RNAs, a 23-kb transcript from the bxd domain of the bithorax complex. The resulting animals fail to make the normal bxd noncoding RNA and show no transcription across the bxd Polycomb Response Element in early embryos. The mutant flies look normal; the regulation of the bxd domain appears unaffected. Thus, the bxd noncoding RNA has no apparent function. PMID:24077301
RNAP-II transcribes two small RNAs at the promoter and terminator regions of the RNAP-I gene in Saccharomyces cerevisiae.

PubMed

Mayán, Maria D

2013-01-01

Three RNA polymerases coexist in the ribosomal DNA of Saccharomyces cerevisiae. RNAP-I transcribes the 35S rRNA, RNAP-III transcribes the 5S rRNA and RNAP-II is found in both intergenic non-coding regions. Previously, we demonstrated that RNAP-II molecules bound to the intergenic non-coding regions (IGS) of the ribosomal locus are mainly found in a stalled conformation, and the stalled polymerase mediates chromatin interactions, which isolate RNAP-I from the RNAP-III transcriptional domain. Besides, RNAP-II transcribes both IGS regions at low levels, using different cryptic promoters. This report demonstrates that RNAP-II also transcribes two sequences located in the 5'- and 3'-ends of the 35S rRNA gene that overlap with the sequences of the 35S rRNA precursor transcribed by RNAP-I. The sequence located at the promoter region of RNAP-I, called the p-RNA transcript, binds to the transcription termination-related protein, Reb1p, while the T-RNA sequence, located in the termination sites of RNAP-I gene, contains the stem-loop recognized by Rtn1p, which is necessary for proper termination of RNAP-I. Because of their location, these small RNAs may play a key role in the initiation and termination of RNAP-I transcription. To correctly synthesize proteins, eukaryotic cells may retain a mechanism that connects the three main polymerases. This report suggests that cryptic transcription by RNAP-II may be required for normal transcription by RNAP-I in the ribosomal locus of S. cerevisiae. Copyright © 2012 John Wiley & Sons, Ltd.
Long noncoding RNAs as enhancers of gene expression.

PubMed

Ørom, U A; Derrien, T; Guigo, R; Shiekhattar, R

2010-01-01

The human genome contains thousands of long noncoding RNAs (ncRNAs) transcribed from diverse genomic locations. A large set of long ncRNAs is transcribed independent of protein-coding genes. We have used the GENCODE annotation of the human genome to identify 3019 long ncRNAs expressed in various human cell lines and tissue. This set of long ncRNAs responds to differentiation signals in primary human keratinocytes and is coexpressed with important regulators of keratinocyte development. Depletion of a number of these long ncRNAs leads to the repression of specific genes in their surrounding locus, supportive of an activating function for ncRNAs. Using reporter assays, we confirmed such activating function and show that such transcriptional enhancement is mediated through the long ncRNA transcripts. Our studies show that long ncRNAs exhibit functions similar to classically defined enhancers, through an RNA-dependent mechanism.
Role of non-coding RNAs in non-aging-related neurological disorders.

PubMed

Vieira, A S; Dogini, D B; Lopes-Cendes, I

2018-06-11

Protein coding sequences represent only 2% of the human genome. Recent advances have demonstrated that a significant portion of the genome is actively transcribed as non-coding RNA molecules. These non-coding RNAs are emerging as key players in the regulation of biological processes, and act as "fine-tuners" of gene expression. Neurological disorders are caused by a wide range of genetic mutations, epigenetic and environmental factors, and the exact pathophysiology of many of these conditions is still unknown. It is currently recognized that dysregulations in the expression of non-coding RNAs are present in many neurological disorders and may be relevant in the mechanisms leading to disease. In addition, circulating non-coding RNAs are emerging as potential biomarkers with great potential impact in clinical practice. In this review, we discuss mainly the role of microRNAs and long non-coding RNAs in several neurological disorders, such as epilepsy, Huntington disease, fragile X-associated ataxia, spinocerebellar ataxias, amyotrophic lateral sclerosis (ALS), and pain. In addition, we give information about the conditions where microRNAs have demonstrated to be potential biomarkers such as in epilepsy, pain, and ALS.

3′ terminal diversity of MRP RNA and other human noncoding RNAs revealed by deep sequencing

PubMed Central

2013-01-01

Background Post-transcriptional 3′ end processing is a key component of RNA regulation. The abundant and essential RNA subunit of RNase MRP has been proposed to function in three distinct cellular compartments and therefore may utilize this mode of regulation. Here we employ 3′ RACE coupled with high-throughput sequencing to characterize the 3′ terminal sequences of human MRP RNA and other noncoding RNAs that form RNP complexes. Results The 3′ terminal sequence of MRP RNA from HEK293T cells has a distinctive distribution of genomically encoded termini (including an assortment of U residues) with a portion of these selectively tagged by oligo(A) tails. This profile contrasts with the relatively homogenous 3′ terminus of an in vitro transcribed MRP RNA control and the differing 3′ terminal profiles of U3 snoRNA, RNase P RNA, and telomerase RNA (hTR). Conclusions 3′ RACE coupled with deep sequencing provides a valuable framework for the functional characterization of 3′ terminal sequences of noncoding RNAs. PMID:24053768
Targeting Non-Coding RNAs in Plants with the CRISPR-Cas Technology is a Challenge yet Worth Accepting.

PubMed

Basak, Jolly; Nithin, Chandran

2015-01-01

Non-coding RNAs (ncRNAs) have emerged as versatile master regulator of biological functions in recent years. MicroRNAs (miRNAs) are small endogenous ncRNAs of 18-24 nucleotides in length that originates from long self-complementary precursors. Besides their direct involvement in developmental processes, plant miRNAs play key roles in gene regulatory networks and varied biological processes. Alternatively, long ncRNAs (lncRNAs) are a large and diverse class of transcribed ncRNAs whose length exceed that of 200 nucleotides. Plant lncRNAs are transcribed by different RNA polymerases, showing diverse structural features. Plant lncRNAs also are important regulators of gene expression in diverse biological processes. There has been a breakthrough in the technology of genome editing, the CRISPR-Cas9 (clustered regulatory interspaced short palindromic repeats/CRISPR-associated protein 9) technology, in the last decade. CRISPR loci are transcribed into ncRNA and eventually form a functional complex with Cas9 and further guide the complex to cleave complementary invading DNA. The CRISPR-Cas technology has been successfully applied in model plants such as Arabidopsis and tobacco and important crops like wheat, maize, and rice. However, all these studies are focused on protein coding genes. Information about targeting non-coding genes is scarce. Hitherto, the CRISPR-Cas technology has been exclusively used in vertebrate systems to engineer miRNA/lncRNAs, but it is still relatively unexplored in plants. While briefing miRNAs, lncRNAs and applications of the CRISPR-Cas technology in human and animals, this review essentially elaborates several strategies to overcome the challenges of applying the CRISPR-Cas technology in editing ncRNAs in plants and the future perspective of this field.
Conserved noncoding sequences (CNSs) in higher plants.

PubMed

Freeling, Michael; Subramaniam, Shabarinath

2009-04-01

Plant conserved noncoding sequences (CNSs)--a specific category of phylogenetic footprint--have been shown experimentally to function. No plant CNS is conserved to the extent that ultraconserved noncoding sequences are conserved in vertebrates. Plant CNSs are enriched in known transcription factor or other cis-acting binding sites, and are usually clustered around genes. Genes that encode transcription factors and/or those that respond to stimuli are particularly CNS-rich. Only rarely could this function involve small RNA binding. Some transcribed CNSs encode short translation products as a form of negative control. Approximately 4% of Arabidopsis gene content is estimated to be both CNS-rich and occupies a relatively long stretch of chromosome: Bigfoot genes (long phylogenetic footprints). We discuss a 'DNA-templated protein assembly' idea that might help explain Bigfoot gene CNSs.
[Long non-coding RNAs in plants].

PubMed

Xiaoqing, Huang; Dandan, Li; Juan, Wu

2015-04-01

Long non-coding RNAs (lncRNAs), which are longer than 200 nucleotides in length, widely exist in organisms and function in a variety of biological processes. Currently, most of lncRNAs found in plants are transcribed by RNA polymerase Ⅱ and mediate gene expression through multiple mechanisms, such as target mimicry, transcription interference, histone methylation and DNA methylation, and play important roles in flowering, male sterility, nutrition metabolism, biotic and abiotic stress and other biological processes as regulators in plants. In this review, we summarize the databases, prediction methods, and possible functions of plant lncRNAs discovered in recent years.
Non-coding RNAs in lung cancer

PubMed Central

Ricciuti, Biagio; Mecca, Carmen; Crinò, Lucio; Baglivo, Sara; Cenci, Matteo; Metro, Giulio

2014-01-01

The discovery that protein-coding genes represent less than 2% of all human genome, and the evidence that more than 90% of it is actively transcribed, changed the classical point of view of the central dogma of molecular biology, which was always based on the assumption that RNA functions mainly as an intermediate bridge between DNA sequences and protein synthesis machinery. Accumulating data indicates that non-coding RNAs are involved in different physiological processes, providing for the maintenance of cellular homeostasis. They are important regulators of gene expression, cellular differentiation, proliferation, migration, apoptosis, and stem cell maintenance. Alterations and disruptions of their expression or activity have increasingly been associated with pathological changes of cancer cells, this evidence and the prospect of using these molecules as diagnostic markers and therapeutic targets, make currently non-coding RNAs among the most relevant molecules in cancer research. In this paper we will provide an overview of non-coding RNA function and disruption in lung cancer biology, also focusing on their potential as diagnostic, prognostic and predictive biomarkers. PMID:25593996
Detection of non-coding RNA in bacteria and archaea using the DETR'PROK Galaxy pipeline.

PubMed

Toffano-Nioche, Claire; Luo, Yufei; Kuchly, Claire; Wallon, Claire; Steinbach, Delphine; Zytnicki, Matthias; Jacq, Annick; Gautheret, Daniel

2013-09-01

RNA-seq experiments are now routinely used for the large scale sequencing of transcripts. In bacteria or archaea, such deep sequencing experiments typically produce 10-50 million fragments that cover most of the genome, including intergenic regions. In this context, the precise delineation of the non-coding elements is challenging. Non-coding elements include untranslated regions (UTRs) of mRNAs, independent small RNA genes (sRNAs) and transcripts produced from the antisense strand of genes (asRNA). Here we present a computational pipeline (DETR'PROK: detection of ncRNAs in prokaryotes) based on the Galaxy framework that takes as input a mapping of deep sequencing reads and performs successive steps of clustering, comparison with existing annotation and identification of transcribed non-coding fragments classified into putative 5' UTRs, sRNAs and asRNAs. We provide a step-by-step description of the protocol using real-life example data sets from Vibrio splendidus and Escherichia coli. Copyright © 2013 The Authors. Published by Elsevier Inc. All rights reserved.
The Large Mitochondrial Genome of Symbiodinium minutum Reveals Conserved Noncoding Sequences between Dinoflagellates and Apicomplexans

PubMed Central

Shoguchi, Eiichi; Shinzato, Chuya; Hisata, Kanako; Satoh, Nori; Mungpakdee, Sutada

2015-01-01

Even though mitochondrial genomes, which characterize eukaryotic cells, were first discovered more than 50 years ago, mitochondrial genomics remains an important topic in molecular biology and genome sciences. The Phylum Alveolata comprises three major groups (ciliates, apicomplexans, and dinoflagellates), the mitochondrial genomes of which have diverged widely. Even though the gene content of dinoflagellate mitochondrial genomes is reportedly comparable to that of apicomplexans, the highly fragmented and rearranged genome structures of dinoflagellates have frustrated whole genomic analysis. Consequently, noncoding sequences and gene arrangements of dinoflagellate mitochondrial genomes have not been well characterized. Here we report that the continuous assembled genome (∼326 kb) of the dinoflagellate, Symbiodinium minutum, is AT-rich (∼64.3%) and that it contains three protein-coding genes. Based upon in silico analysis, the remaining 99% of the genome comprises transcriptomic noncoding sequences. RNA edited sites and unique, possible start and stop codons clarify conserved regions among dinoflagellates. Our massive transcriptome analysis shows that almost all regions of the genome are transcribed, including 27 possible fragmented ribosomal RNA genes and 12 uncharacterized small RNAs that are similar to mitochondrial RNA genes of the malarial parasite, Plasmodium falciparum. Gene map comparisons show that gene order is only slightly conserved between S. minutum and P. falciparum. However, small RNAs and intergenic sequences share sequence similarities with P. falciparum, suggesting that the function of noncoding sequences has been preserved despite development of very different genome structures. PMID:26199191
Long noncoding RNA growth arrest-specific 5 promotes proliferation and survival of female germline stem cells in vitro.

PubMed

Wang, Jie; Gong, Xiaowen; Tian, Geng G; Hou, Changliang; Zhu, Xiaoqin; Pei, Xiuying; Wang, Yanrong; Wu, Ji

2018-05-05

Female germline stem cells (FGSCs) are proposed to be a key factor for ameliorating female infertility. Previously we have shown that neonatal and adult FGSCs could be isolated and purified from mouse ovarian tissues. The long noncoding (lnc) RNA growth arrest-specific 5 sequence (GAS5) transcribed from mammalian genomes plays important regulatory roles in various developmental processes. However, there is no study on the relationship between GAS5 and FGSC development in vitro. In this study, we showed that GAS5 was highly expressed in the neonatal mouse ovary and was located in both FGSCs and oocytes. GAS5 facilitated FGSC proliferation and promoted their survival in vitro. Moreover, GAS5 also inhibited apoptosis of cultured FGSCs. These findings indicate that GAS5 is a crucial regulator of FGSC development. This might serve as a foundation for a strategy of lncRNA-directed diagnosis or treatment of female infertility. Copyright © 2018. Published by Elsevier B.V.
Non-coding RNAs as regulators of gene expression and epigenetics

PubMed Central

Kaikkonen, Minna U.; Lam, Michael T.Y.; Glass, Christopher K.

2011-01-01

Genome-wide studies have revealed that mammalian genomes are pervasively transcribed. This has led to the identification and isolation of novel classes of non-coding RNAs (ncRNAs) that influence gene expression by a variety of mechanisms. Here we review the characteristics and functions of regulatory ncRNAs in chromatin remodelling and at multiple levels of transcriptional and post-transcriptional regulation. We also describe the potential roles of ncRNAs in vascular biology and in mediating epigenetic modifications that might play roles in cardiovascular disease susceptibility. The emerging recognition of the diverse functions of ncRNAs in regulation of gene expression suggests that they may represent new targets for therapeutic intervention. PMID:21558279
Non-coding RNAs: Therapeutic Strategies and Delivery Systems.

PubMed

Ling, Hui

The vast majority of the human genome is transcribed into RNA molecules that do not code for proteins, which could be small ones approximately 20 nucleotide in length, known as microRNAs, or transcripts longer than 200 bp, defined as long noncoding RNAs. The prevalent deregulation of microRNAs in human cancers prompted immediate interest on the therapeutic value of microRNAs as drugs and drug targets. Many features of microRNAs such as well-defined mechanisms, and straightforward oligonucleotide design further make them attractive candidates for therapeutic development. The intensive efforts of exploring microRNA therapeutics are reflected by the large body of preclinical studies using oligonucleotide-based mimicking and blocking, culminated by the recent entry of microRNA therapeutics in clinical trial for several human diseases including cancer. Meanwhile, microRNA therapeutics faces the challenge of effective and safe delivery of nucleic acid therapeutics into the target site. Various chemical modifications of nucleic acids and delivery systems have been developed to increase targeting specificity and efficacy, and reduce the associated side effects including activation of immune response. Recently, long noncoding RNAs become attractive targets for therapeutic intervention because of their association with complex and delicate phenotypes, and their unconventional pharmaceutical activities such as capacity of increasing output of proteins. Here I discuss the general therapeutic strategies targeting noncoding RNAs, review delivery systems developed to maximize noncoding RNA therapeutic efficacy, and offer perspectives on the future development of noncoding RNA targeting agents for colorectal cancer.
The Large Mitochondrial Genome of Symbiodinium minutum Reveals Conserved Noncoding Sequences between Dinoflagellates and Apicomplexans.

PubMed

Shoguchi, Eiichi; Shinzato, Chuya; Hisata, Kanako; Satoh, Nori; Mungpakdee, Sutada

2015-07-20

Even though mitochondrial genomes, which characterize eukaryotic cells, were first discovered more than 50 years ago, mitochondrial genomics remains an important topic in molecular biology and genome sciences. The Phylum Alveolata comprises three major groups (ciliates, apicomplexans, and dinoflagellates), the mitochondrial genomes of which have diverged widely. Even though the gene content of dinoflagellate mitochondrial genomes is reportedly comparable to that of apicomplexans, the highly fragmented and rearranged genome structures of dinoflagellates have frustrated whole genomic analysis. Consequently, noncoding sequences and gene arrangements of dinoflagellate mitochondrial genomes have not been well characterized. Here we report that the continuous assembled genome (∼326 kb) of the dinoflagellate, Symbiodinium minutum, is AT-rich (∼64.3%) and that it contains three protein-coding genes. Based upon in silico analysis, the remaining 99% of the genome comprises transcriptomic noncoding sequences. RNA edited sites and unique, possible start and stop codons clarify conserved regions among dinoflagellates. Our massive transcriptome analysis shows that almost all regions of the genome are transcribed, including 27 possible fragmented ribosomal RNA genes and 12 uncharacterized small RNAs that are similar to mitochondrial RNA genes of the malarial parasite, Plasmodium falciparum. Gene map comparisons show that gene order is only slightly conserved between S. minutum and P. falciparum. However, small RNAs and intergenic sequences share sequence similarities with P. falciparum, suggesting that the function of noncoding sequences has been preserved despite development of very different genome structures. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Chromatin looping and eRNA transcription precede the transcriptional activation of gene in the β-globin locus

PubMed Central

Kim, Yea Woon; Lee, Sungkung; Yun, Jangmi; Kim, AeRi

2015-01-01

Enhancers are closely positioned with actively transcribed target genes by chromatin looping. Non-coding RNAs are often transcribed on active enhancers, referred to as eRNAs (enhancer RNAs). To explore the kinetics of enhancer–promoter looping and eRNA transcription during transcriptional activation, we induced the β-globin locus by chemical treatment and analysed cross-linking frequency between the β-globin gene and locus control region (LCR) and the amount of eRNAs transcribed on the LCR in a time course manner. The cross-linking frequency was increased after chemical induction but before the transcriptional activation of gene in the β-globin locus. Transcription of eRNAs was increased in concomitant with the increase in cross-linking frequency. These results show that chromatin looping and eRNA transcription precedes the transcriptional activation of gene. Concomitant occurrence of the two events suggests functional relationship between them. PMID:25588787
Long Noncoding RNAs: Past, Present, and Future

PubMed Central

Kung, Johnny T. Y.; Colognori, David; Lee, Jeannie T.

2013-01-01

Long noncoding RNAs (lncRNAs) have gained widespread attention in recent years as a potentially new and crucial layer of biological regulation. lncRNAs of all kinds have been implicated in a range of developmental processes and diseases, but knowledge of the mechanisms by which they act is still surprisingly limited, and claims that almost the entirety of the mammalian genome is transcribed into functional noncoding transcripts remain controversial. At the same time, a small number of well-studied lncRNAs have given us important clues about the biology of these molecules, and a few key functional and mechanistic themes have begun to emerge, although the robustness of these models and classification schemes remains to be seen. Here, we review the current state of knowledge of the lncRNA field, discussing what is known about the genomic contexts, biological functions, and mechanisms of action of lncRNAs. We also reflect on how the recent interest in lncRNAs is deeply rooted in biology’s longstanding concern with the evolution and function of genomes. PMID:23463798
Circular non-coding RNA ANRIL modulates ribosomal RNA maturation and atherosclerosis in humans

PubMed Central

Holdt, Lesca M.; Stahringer, Anika; Sass, Kristina; Pichler, Garwin; Kulak, Nils A.; Wilfert, Wolfgang; Kohlmaier, Alexander; Herbst, Andreas; Northoff, Bernd H.; Nicolaou, Alexandros; Gäbel, Gabor; Beutner, Frank; Scholz, Markus; Thiery, Joachim; Musunuru, Kiran; Krohn, Knut; Mann, Matthias; Teupser, Daniel

2016-01-01

Circular RNAs (circRNAs) are broadly expressed in eukaryotic cells, but their molecular mechanism in human disease remains obscure. Here we show that circular antisense non-coding RNA in the INK4 locus (circANRIL), which is transcribed at a locus of atherosclerotic cardiovascular disease on chromosome 9p21, confers atheroprotection by controlling ribosomal RNA (rRNA) maturation and modulating pathways of atherogenesis. CircANRIL binds to pescadillo homologue 1 (PES1), an essential 60S-preribosomal assembly factor, thereby impairing exonuclease-mediated pre-rRNA processing and ribosome biogenesis in vascular smooth muscle cells and macrophages. As a consequence, circANRIL induces nucleolar stress and p53 activation, resulting in the induction of apoptosis and inhibition of proliferation, which are key cell functions in atherosclerosis. Collectively, these findings identify circANRIL as a prototype of a circRNA regulating ribosome biogenesis and conferring atheroprotection, thereby showing that circularization of long non-coding RNAs may alter RNA function and protect from human disease. PMID:27539542
Non-coding RNA derived from the region adjacent to the human HO-1 E2 enhancer selectively regulates HO-1 gene induction by modulating Pol II binding

PubMed Central

Maruyama, Atsushi; Mimura, Junsei; Itoh, Ken

2014-01-01

Recent studies have disclosed the function of enhancer RNAs (eRNAs), which are long non-coding RNAs transcribed from gene enhancer regions, in transcriptional regulation. However, it remains unclear whether eRNAs are involved in the regulation of human heme oxygenase-1 gene (HO-1) induction. Here, we report that multiple nuclear-enriched eRNAs are transcribed from the regions adjacent to two human HO-1 enhancers (i.e. the distal E2 and proximal E1 enhancers), and some of these eRNAs are induced by the oxidative stress-causing reagent diethyl maleate (DEM). We demonstrated that the expression of one forward direction (5′ to 3′) eRNA transcribed from the human HO-1 E2 enhancer region (named human HO-1enhancer RNA E2-3; hereafter called eRNA E2-3) was induced by DEM in an NRF2-dependent manner in HeLa cells. Conversely, knockdown of BACH1, a repressor of HO-1 transcription, further increased DEM-inducible eRNA E2-3 transcription as well as HO-1 expression. In addition, we showed that knockdown of eRNA E2-3 selectively down-regulated DEM-induced HO-1 expression. Furthermore, eRNA E2-3 knockdown attenuated DEM-induced Pol II binding to the promoter and E2 enhancer regions of HO-1 without affecting NRF2 recruitment to the E2 enhancer. These findings indicate that eRNAE2-3 is functional and is required for HO-1 induction. PMID:25404134
Genome-Wide Discovery of Long Non-Coding RNAs in Rainbow Trout.

PubMed

Al-Tobasei, Rafet; Paneru, Bam; Salem, Mohamed

2016-01-01

The ENCODE project revealed that ~70% of the human genome is transcribed. While only 1-2% of the RNAs encode for proteins, the rest are non-coding RNAs. Long non-coding RNAs (lncRNAs) form a diverse class of non-coding RNAs that are longer than 200 nt. Emerging evidence indicates that lncRNAs play critical roles in various cellular processes including regulation of gene expression. LncRNAs show low levels of gene expression and sequence conservation, which make their computational identification in genomes difficult. In this study, more than two billion Illumina sequence reads were mapped to the genome reference using the TopHat and Cufflinks software. Transcripts shorter than 200 nt, with more than 83-100 amino acids ORF, or with significant homologies to the NCBI nr-protein database were removed. In addition, a computational pipeline was used to filter the remaining transcripts based on a protein-coding-score test. Depending on the filtering stringency conditions, between 31,195 and 54,503 lncRNAs were identified, with only 421 matching known lncRNAs in other species. A digital gene expression atlas revealed 2,935 tissue-specific and 3,269 ubiquitously-expressed lncRNAs. This study annotates the lncRNA rainbow trout genome and provides a valuable resource for functional genomics research in salmonids.
A riboswitch-regulated antisense RNA in Listeria monocytogenes.

PubMed

Mellin, J R; Tiensuu, Teresa; Bécavin, Christophe; Gouin, Edith; Johansson, Jörgen; Cossart, Pascale

2013-08-06

Riboswitches are ligand-binding elements located in 5' untranslated regions of messenger RNAs, which regulate expression of downstream genes. In Listeria monocytogenes, a vitamin B12-binding (B12) riboswitch was identified, not upstream of a gene but downstream, and antisense to the adjacent gene, pocR, suggesting it might regulate pocR in a nonclassical manner. In Salmonella enterica, PocR is a transcription factor that is activated by 1,2-propanediol, and subsequently activates expression of the pdu genes. The pdu genes mediate propanediol catabolism and are implicated in pathogenesis. As enzymes involved in propanediol catabolism require B12 as a cofactor, we hypothesized that the Listeria B12 riboswitch might be involved in pocR regulation. Here we demonstrate that the B12 riboswitch is transcribed as part of a noncoding antisense RNA, herein named AspocR. In the presence of B12, the riboswitch induces transcriptional termination, causing aspocR to be transcribed as a short transcript. In contrast, in the absence of B12, aspocR is transcribed as a long antisense RNA, which inhibits pocR expression. Regulation by AspocR ensures that pocR, and consequently the pdu genes, are maximally expressed only when both propanediol and B12 are present. Strikingly, AspocR can inhibit pocR expression in trans, suggesting it acts through a direct interaction with pocR mRNA. Together, this study demonstrates how pocR and the pdu genes can be regulated by B12 in bacteria and extends the classical definition of riboswitches from elements governing solely the expression of mRNAs to a wider role in controlling transcription of noncoding RNAs.
A 3' UTR-Derived Small RNA Provides the Regulatory Noncoding Arm of the Inner Membrane Stress Response.

PubMed

Chao, Yanjie; Vogel, Jörg

2016-02-04

Small RNAs (sRNAs) from conserved noncoding genes are crucial regulators in bacterial signaling pathways but have remained elusive in the Cpx response to inner membrane stress. Here we report that an alternative biogenesis pathway releasing the conserved mRNA 3' UTR of stress chaperone CpxP as an ∼60-nt sRNA provides the noncoding arm of the Cpx response. This so-called CpxQ sRNA, generated by general mRNA decay through RNase E, acts as an Hfq-dependent repressor of multiple mRNAs encoding extracytoplasmic proteins. Both CpxQ and the Cpx pathway are required for cell survival under conditions of dissipation of membrane potential. Our discovery of CpxQ illustrates how the conversion of a transcribed 3' UTR into an sRNA doubles the output of a single mRNA to produce two factors with spatially segregated functions during inner membrane stress: a chaperone that targets problematic proteins in the periplasm and a regulatory RNA that dampens their synthesis in the cytosol. Copyright © 2016 Elsevier Inc. All rights reserved.
The FANTOM5 collection, a data series underpinning mammalian transcriptome atlases in diverse cell types.

PubMed

Kawaji, Hideya; Kasukawa, Takeya; Forrest, Alistair; Carninci, Piero; Hayashizaki, Yoshihide

2017-08-29

The latest project from the FANTOM consortium, an international collaborative effort initiated by RIKEN, generated atlases of transcriptomes, in particular promoters, transcribed enhancers, and long-noncoding RNAs, across a diverse set of mammalian cell types. Here, we introduce the FANTOM5 collection, bringing together data descriptors, articles and analyses of FANTOM5 data published across the Nature Research journals. Associated data are openly available for reuse by all.
Cell cycle, oncogenic and tumor suppressor pathways regulate numerous long and macro non-protein-coding RNAs

PubMed Central

2014-01-01

Background The genome is pervasively transcribed but most transcripts do not code for proteins, constituting non-protein-coding RNAs. Despite increasing numbers of functional reports of individual long non-coding RNAs (lncRNAs), assessing the extent of functionality among the non-coding transcriptional output of mammalian cells remains intricate. In the protein-coding world, transcripts differentially expressed in the context of processes essential for the survival of multicellular organisms have been instrumental in the discovery of functionally relevant proteins and their deregulation is frequently associated with diseases. We therefore systematically identified lncRNAs expressed differentially in response to oncologically relevant processes and cell-cycle, p53 and STAT3 pathways, using tiling arrays. Results We found that up to 80% of the pathway-triggered transcriptional responses are non-coding. Among these we identified very large macroRNAs with pathway-specific expression patterns and demonstrated that these are likely continuous transcripts. MacroRNAs contain elements conserved in mammals and sauropsids, which in part exhibit conserved RNA secondary structure. Comparing evolutionary rates of a macroRNA to adjacent protein-coding genes suggests a local action of the transcript. Finally, in different grades of astrocytoma, a tumor disease unrelated to the initially used cell lines, macroRNAs are differentially expressed. Conclusions It has been shown previously that the majority of expressed non-ribosomal transcripts are non-coding. We now conclude that differential expression triggered by signaling pathways gives rise to a similar abundance of non-coding content. It is thus unlikely that the prevalence of non-coding transcripts in the cell is a trivial consequence of leaky or random transcription events. PMID:24594072

Genetic differentiation of Artyfechinostomum malayanum and A. sufrartyfex (Trematoda: Echinostomatidae) based on internal transcribed spacer sequences.

PubMed

Tantrawatpan, Chairat; Saijuntha, Weerachai; Sithithaworn, Paiboon; Andrews, Ross H; Petney, Trevor N

2013-01-01

Genetic differentiation between two synonymous echinostomes species, Artyfechinostomum malayanum and Artyfechinostomum sufrartyfex was determined by using the first and second internal transcribed spacers (ITS1 and ITS2), the non-coding region of rDNA as genetic makers. Of the 699 bp of combined ITS1 and ITS2 sequences examined, 18 variable nucleotide positions (2.58 %) were observed. Of these, 17 positions could be used as diagnostic position between these two sibling species, whereas the other one variation was intraspecific variation of A. malayanum. A clade of A. malayanum was closely aligned with A. sufrartyfex and clearly distance from the cluster of other echinostomes. Our results may sufficiently suggest that the current synonymy of these species is not valid.
An intronic microRNA silences genes that are functionally antagonistic to its host gene.

PubMed

Barik, Sailen

2008-09-01

MicroRNAs (miRNAs) are short noncoding RNAs that down-regulate gene expression by silencing specific target mRNAs. While many miRNAs are transcribed from their own genes, nearly half map within introns of 'host' genes, the significance of which remains unclear. We report that transcriptional activation of apoptosis-associated tyrosine kinase (AATK), essential for neuronal differentiation, also generates miR-338 from an AATK gene intron that silences a family of mRNAs whose protein products are negative regulators of neuronal differentiation. We conclude that an intronic miRNA, transcribed together with the host gene mRNA, may serve the interest of its host gene by silencing a cohort of genes that are functionally antagonistic to the host gene itself.
A new method for species identification via protein-coding and non-coding DNA barcodes by combining machine learning with bioinformatic methods.

PubMed

Zhang, Ai-bing; Feng, Jie; Ward, Robert D; Wan, Ping; Gao, Qiang; Wu, Jun; Zhao, Wei-zhong

2012-01-01

Species identification via DNA barcodes is contributing greatly to current bioinventory efforts. The initial, and widely accepted, proposal was to use the protein-coding cytochrome c oxidase subunit I (COI) region as the standard barcode for animals, but recently non-coding internal transcribed spacer (ITS) genes have been proposed as candidate barcodes for both animals and plants. However, achieving a robust alignment for non-coding regions can be problematic. Here we propose two new methods (DV-RBF and FJ-RBF) to address this issue for species assignment by both coding and non-coding sequences that take advantage of the power of machine learning and bioinformatics. We demonstrate the value of the new methods with four empirical datasets, two representing typical protein-coding COI barcode datasets (neotropical bats and marine fish) and two representing non-coding ITS barcodes (rust fungi and brown algae). Using two random sub-sampling approaches, we demonstrate that the new methods significantly outperformed existing Neighbor-joining (NJ) and Maximum likelihood (ML) methods for both coding and non-coding barcodes when there was complete species coverage in the reference dataset. The new methods also out-performed NJ and ML methods for non-coding sequences in circumstances of potentially incomplete species coverage, although then the NJ and ML methods performed slightly better than the new methods for protein-coding barcodes. A 100% success rate of species identification was achieved with the two new methods for 4,122 bat queries and 5,134 fish queries using COI barcodes, with 95% confidence intervals (CI) of 99.75-100%. The new methods also obtained a 96.29% success rate (95%CI: 91.62-98.40%) for 484 rust fungi queries and a 98.50% success rate (95%CI: 96.60-99.37%) for 1094 brown algae queries, both using ITS barcodes.
Functional importance of cardiac enhancer-associated noncoding RNAs in heart development and disease

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ounzain, Samir; Pezzuto, Iole; Micheletti, Rudi

We report here that the key information processing units within gene regulatory networks are enhancers. Enhancer activity is associated with the production of tissue-specific noncoding RNAs, yet the existence of such transcripts during cardiac development has not been established. Using an integrated genomic approach, we demonstrate that fetal cardiac enhancers generate long noncoding RNAs (lncRNAs) during cardiac differentiation and morphogenesis. Enhancer expression correlates with the emergence of active enhancer chromatin states, the initiation of RNA polymerase II at enhancer loci and expression of target genes. Orthologous human sequences are also transcribed in fetal human hearts and cardiac progenitor cells. Throughmore » a systematic bioinformatic analysis, we identified and characterized, for the first time, a catalog of lncRNAs that are expressed during embryonic stem cell differentiation into cardiomyocytes and associated with active cardiac enhancer sequences. RNA-sequencing demonstrates that many of these transcripts are polyadenylated, multi-exonic long noncoding RNAs. Moreover, knockdown of two enhancer-associated lncRNAs resulted in the specific downregulation of their predicted target genes. Interestingly, the reactivation of the fetal gene program, a hallmark of the stress response in the adult heart, is accompanied by increased expression of fetal cardiac enhancer transcripts. Altogether, these findings demonstrate that the activity of cardiac enhancers and expression of their target genes are associated with the production of enhancer-derived lncRNAs.« less
Functional importance of cardiac enhancer-associated noncoding RNAs in heart development and disease

DOE PAGES

Ounzain, Samir; Pezzuto, Iole; Micheletti, Rudi; ...

2014-08-19

We report here that the key information processing units within gene regulatory networks are enhancers. Enhancer activity is associated with the production of tissue-specific noncoding RNAs, yet the existence of such transcripts during cardiac development has not been established. Using an integrated genomic approach, we demonstrate that fetal cardiac enhancers generate long noncoding RNAs (lncRNAs) during cardiac differentiation and morphogenesis. Enhancer expression correlates with the emergence of active enhancer chromatin states, the initiation of RNA polymerase II at enhancer loci and expression of target genes. Orthologous human sequences are also transcribed in fetal human hearts and cardiac progenitor cells. Throughmore » a systematic bioinformatic analysis, we identified and characterized, for the first time, a catalog of lncRNAs that are expressed during embryonic stem cell differentiation into cardiomyocytes and associated with active cardiac enhancer sequences. RNA-sequencing demonstrates that many of these transcripts are polyadenylated, multi-exonic long noncoding RNAs. Moreover, knockdown of two enhancer-associated lncRNAs resulted in the specific downregulation of their predicted target genes. Interestingly, the reactivation of the fetal gene program, a hallmark of the stress response in the adult heart, is accompanied by increased expression of fetal cardiac enhancer transcripts. Altogether, these findings demonstrate that the activity of cardiac enhancers and expression of their target genes are associated with the production of enhancer-derived lncRNAs.« less
The sequence of camelpox virus shows it is most closely related to variola virus, the cause of smallpox.

PubMed

Gubser, Caroline; Smith, Geoffrey L

2002-04-01

Camelpox virus (CMPV) and variola virus (VAR) are orthopoxviruses (OPVs) that share several biological features and cause high mortality and morbidity in their single host species. The sequence of a virulent CMPV strain was determined; it is 202182 bp long, with inverted terminal repeats (ITRs) of 6045 bp and has 206 predicted open reading frames (ORFs). As for other poxviruses, the genes are tightly packed with little non-coding sequence. Most genes within 25 kb of each terminus are transcribed outwards towards the terminus, whereas genes within the centre of the genome are transcribed from either DNA strand. The central region of the genome contains genes that are highly conserved in other OPVs and 87 of these are conserved in all sequenced chordopoxviruses. In contrast, genes towards either terminus are more variable and encode proteins involved in host range, virulence or immunomodulation. In some cases, these are broken versions of genes found in other OPVs. The relationship of CMPV to other OPVs was analysed by comparisons of DNA and predicted protein sequences, repeats within the ITRs and arrangement of ORFs within the terminal regions. Each comparison gave the same conclusion: CMPV is the closest known virus to variola virus, the cause of smallpox.
Mining for Micropeptides.

PubMed

Makarewich, Catherine A; Olson, Eric N

2017-09-01

Advances in computational biology and large-scale transcriptome analyses have revealed that a much larger portion of the genome is transcribed than was previously recognized, resulting in the production of a diverse population of RNA molecules with both protein-coding and noncoding potential. Emerging evidence indicates that several RNA molecules have been mis-annotated as noncoding and in fact harbor short open reading frames (sORFs) that encode functional peptides and that have evaded detection until now due to their small size. sORF-encoded peptides (SEPs), or micropeptides, have been shown to have important roles in fundamental biological processes and in the maintenance of cellular homeostasis. These small proteins can act independently, for example as ligands or signaling molecules, or they can exert their biological functions by engaging with and modulating larger regulatory proteins. Given their small size, micropeptides may be uniquely suited to fine-tune complex biological systems. Copyright © 2017 Elsevier Ltd. All rights reserved.
Impact of Noncoding Satellite Repeats on Pancreatic Cancer Metastasis

DTIC Science & Technology

2015-11-01

in 2D and Xenografts . B) Panel of cancer cell lines grown in 2D or 3D culture. 5 cancers (Fig. 3). We have completed RNA-seq analysis of 2D and 3D...reverse transcribed (RT) as a means to expand these regions in tumor genomes. We evaluated the presence of HSATII RT products by treating xenograft small...specific for satellite repeats in human cells. These RNA derived DNAs (rdDNA) are found in primary tumors, xenografts , and tumorspheres in large
Interplay between chromatin modulators and histone acetylation regulates the formation of accessible chromatin in the upstream regulatory region of fission yeast fbp1.

PubMed

Adachi, Akira; Senmatsu, Satoshi; Asada, Ryuta; Abe, Takuya; Hoffman, Charles S; Ohta, Kunihiro; Hirota, Kouji

2018-05-03

Numerous noncoding RNA transcripts are detected in eukaryotic cells. Noncoding RNAs transcribed across gene promoters are involved in the regulation of mRNA transcription via chromatin modulation. This function of noncoding RNA transcription was first demonstrated for the fission yeast fbp1 gene, where a cascade of noncoding RNA transcription events induces chromatin remodeling to facilitate transcription factor binding. We recently demonstrated that the noncoding RNAs from the fbp1 upstream region facilitate binding of the transcription activator Atf1 and thereby promote histone acetylation. Histone acetylation by histone acetyl transferases (HATs) and ATP-dependent chromatin remodelers (ADCRs) are implicated in chromatin remodeling, but the interplay between HATs and ADCRs in this process has not been fully elucidated. Here, we examine the roles played by two distinct ADCRs, Snf22 and Hrp3, and by the HAT Gcn5 in the transcriptional activation of fbp1. Snf22 and Hrp3 redundantly promote disassembly of chromatin in the fbp1 upstream region. Gcn5 critically contributes to nucleosome eviction in the absence of either Snf22 or Hrp3, presumably by recruiting Hrp3 in snf22∆ cells and Snf22 in hrp3∆ cells. Conversely, Gcn5-dependent histone H3 acetylation is impaired in snf22∆/hrp3∆ cells, suggesting that both redundant ADCRs induce recruitment of Gcn5 to the chromatin array in the fbp1 upstream region. These results reveal a previously unappreciated interplay between ADCRs and histone acetylation in which histone acetylation facilitates recruitment of ADCRs, while ADCRs are required for histone acetylation.
Computational Identification and Functional Predictions of Long Noncoding RNA in Zea mays

PubMed Central

Boerner, Susan; McGinnis, Karen M.

2012-01-01

Background Computational analysis of cDNA sequences from multiple organisms suggests that a large portion of transcribed DNA does not code for a functional protein. In mammals, noncoding transcription is abundant, and often results in functional RNA molecules that do not appear to encode proteins. Many long noncoding RNAs (lncRNAs) appear to have epigenetic regulatory function in humans, including HOTAIR and XIST. While epigenetic gene regulation is clearly an essential mechanism in plants, relatively little is known about the presence or function of lncRNAs in plants. Methodology/Principal Findings To explore the connection between lncRNA and epigenetic regulation of gene expression in plants, a computational pipeline using the programming language Python has been developed and applied to maize full length cDNA sequences to identify, classify, and localize potential lncRNAs. The pipeline was used in parallel with an SVM tool for identifying ncRNAs to identify the maximal number of ncRNAs in the dataset. Although the available library of sequences was small and potentially biased toward protein coding transcripts, 15% of the sequences were predicted to be noncoding. Approximately 60% of these sequences appear to act as precursors for small RNA molecules and may function to regulate gene expression via a small RNA dependent mechanism. ncRNAs were predicted to originate from both genic and intergenic loci. Of the lncRNAs that originated from genic loci, ∼20% were antisense to the host gene loci. Conclusions/Significance Consistent with similar studies in other organisms, noncoding transcription appears to be widespread in the maize genome. Computational predictions indicate that maize lncRNAs may function to regulate expression of other genes through multiple RNA mediated mechanisms. PMID:22916204
Origin and evolution of the long non-coding genes in the X-inactivation center.

PubMed

Romito, Antonio; Rougeulle, Claire

2011-11-01

Random X chromosome inactivation (XCI), the eutherian mechanism of X-linked gene dosage compensation, is controlled by a cis-acting locus termed the X-inactivation center (Xic). One of the striking features that characterize the Xic landscape is the abundance of loci transcribing non-coding RNAs (ncRNAs), including Xist, the master regulator of the inactivation process. Recent comparative genomic analyses have depicted the evolutionary scenario behind the origin of the X-inactivation center, revealing that this locus evolved from a region harboring protein-coding genes. During mammalian radiation, this ancestral protein-coding region was disrupted in the marsupial group, whilst it provided in eutherian lineage the starting material for the non-translated RNAs of the X-inactivation center. The emergence of non-coding genes occurred by a dual mechanism involving loss of protein-coding function of the pre-existing genes and integration of different classes of mobile elements, some of which modeled the structure and sequence of the non-coding genes in a species-specific manner. The rising genes started to produce transcripts that acquired function in regulating the epigenetic status of the X chromosome, as shown for Xist, its antisense Tsix, Jpx, and recently suggested for Ftx. Thus, the appearance of the Xic, which occurred after the divergence between eutherians and marsupials, was the basis for the evolution of random X inactivation as a strategy to achieve dosage compensation. Copyright © 2011. Published by Elsevier Masson SAS.
A transcribed ultraconserved noncoding RNA, Uc.173, is a key molecule for the inhibition of lead-induced neuronal apoptosis

PubMed Central

Chen, Lijian; Liu, Meiling; Zhang, Nan; Zhang, Li; Luo, Yuanwei; Liu, Zhenzhong; Dai, Lijun; Jiang, Yiguo

2016-01-01

As a common toxic metal, lead has significant neurotoxicity to brain development. Long non-coding RNAs (lncRNAs) function in multiple biological processes. However, whether lncRNAs are involved in lead-induced neurotoxicity remains unclear. Uc.173 is a lncRNA from a transcribed ultra-conservative region (T-UCR) of human, mouse and rat genomes. We established a lead-induced nerve injury mouse model. It showed the levels of Uc.173 decreased significantly in hippocampus tissue and serum of the model. We further tested the expression of Uc.173 in serum of lead-exposed children, which also showed a tendency to decrease. To explore the effects of Uc.173 on lead-induced nerve injury, we overexpressed Uc.173 in an N2a mouse nerve cell line and found Uc.173 had an inhibitory effect on lead-induced apoptosis of N2a. To investigate the molecular mechanisms of Uc.173 in apoptosis associated with lead-induced nerve injury, we predicted the target microRNAs of Uc.173 by using miRanda, TargetScan and RegRNA. After performing quantitative real-time PCR and bioinformatics analysis, we showed Uc.173 might inter-regulate with miR-291a-3p in lead-induced apoptosis and regulate apoptosis-associated genes. Our study suggests Uc.173 significantly inhibits the apoptosis of nerve cells, which may be mediated by inter-regulation with miRNAs in lead-induced nerve injury. PMID:26683706
Phylogeny and classification of Naucleeae s.l. (Rubiaceae) inferred from molecular (ITS, rBCL, and tRNT-F) and morphological data.

PubMed

Razafimandimbison, Sylvain G; Bremer, Birgitta

2002-07-01

Parsimony analyses of the tribe Naucleeae sensu lato (s.l.) using the noncoding internal transcribed spacer (ITS) regions of nuclear rDNA, the protein-coding rbcL and noncoding trnT-F regions of chloroplast DNA, and morphological data were performed to construct new intratribal classification, test the monophyly of previous subtribal circumscriptions, and evaluate the generic status of Naucleeae s.l. Fifty-two ITS, 45 rbcL, and 55 trnT-F new sequences are published here. Our study supports the monophyly of the subtribes Anthocephalidae, Mitragynae, Uncariae all sensu Haviland and Naucleinae sensu Ridsdale. There was no support for Cephalanthidae sensu Haviland and Adininae sensu Ridsdale. Naucleeae can be subdivided into six highly supported and morphologically distinct subtribes, Breoniinae, Cephalanthinae, Corynantheinae, Naucleinae, and Mitragyninae, Uncarinae, plus one, Adininae, which is poorly supported. The relationships among these subtribes were largely unresolved. We maintain the following 22 genera: Adina, Adinauclea, Breonadia, Breonia, Burttdavya, Cephalanthus, Gyrostipula, Haldina, Janotia, Ludekia, Metadina, Mitragyna, Myrmeconauclea, Nauclea, Neolamarckia, Neonauclea, Ochreinauclea, Pausinystalia, Pertusadina, Sarcocephalus, Sinoadina, and Uncaria. Pseudocinchona is reestablished. Corynanthe is restricted to C. paniculata and Hallea is reincluded in Mitragyna. Our results were inconclusive for assessing the relationships among Adina, Adinauclea, Metadina, and Pertusadina due to lack of resolution.
Non-coding RNA may be associated with cytoplasmic male sterility in Silene vulgaris

PubMed Central

Stone, James D.; Koloušková, Pavla; Sloan, Daniel B.

2017-01-01

Abstract Cytoplasmic male sterility (CMS) is a widespread phenomenon in flowering plants caused by mitochondrial (mt) genes. CMS genes typically encode novel proteins that interfere with mt functions and can be silenced by nuclear fertility-restorer genes. Although the molecular basis of CMS is well established in a number of crop systems, our understanding of it in natural populations is far more limited. To identify CMS genes in a gynodioecious plant, Silene vulgaris, we constructed mt transcriptomes and compared transcript levels and RNA editing patterns in floral bud tissue from female and hermaphrodite full siblings. The transcriptomes from female and hermaphrodite individuals were very similar overall with respect to variation in levels of transcript abundance across the genome, the extent of RNA editing, and the order in which RNA editing and intron splicing events occurred. We found only a single genomic region that was highly overexpressed and differentially edited in females relative to hermaphrodites. This region is not located near any other transcribed elements and lacks an open-reading frame (ORF) of even moderate size. To our knowledge, this transcript would represent the first non-coding mt RNA associated with CMS in plants and is, therefore, an important target for future functional validation studies. PMID:28369520
Long Non-Coding RNAs As Potential Novel Prognostic Biomarkers in Colorectal Cancer

PubMed Central

Saus, Ester; Brunet-Vega, Anna; Iraola-Guzmán, Susana; Pegueroles, Cinta; Gabaldón, Toni; Pericay, Carles

2016-01-01

Colorectal cancer (CRC) is the fourth most common cause of death worldwide. Surgery is usually the first line of treatment for patients with CRC but many tumors with similar histopathological features show significantly different clinical outcomes. The discovery of robust prognostic biomarkers in patients with CRC is imperative to achieve more effective treatment strategies and improve patient's care. Recent progress in next generation sequencing methods and transcriptome analysis has revealed that a much larger part of the genome is transcribed into RNA than previously assumed. Collectively referred to as non-coding RNAs (ncRNAs), some of these RNA molecules such as microRNAs (miRNAs) and long non-coding RNAs (lncRNAs) have been shown to be altered and to play critical roles in tumor biology. This discovery leads to exciting possibilities for personalized cancer diagnosis, and therapy. Many lncRNAs are tissue and cancer-type specific and have already revealed to be useful as prognostic markers. In this review, we focus on recent findings concerning aberrant expression of lncRNAs in CRC tumors and emphasize their prognostic potential in CRC. Further studies focused on the mechanisms of action of lncRNAs will contribute to the development of novel biomarkers for diagnosis and disease progression. PMID:27148353
The CASC15 long intergenic non-coding RNA locus is involved in melanoma progression and phenotype-switching

PubMed Central

Lessard, Laurent; Liu, Michelle; Marzese, Diego M.; Wang, Hongwei; Chong, Kelly; Kawas, Neal; Donovan, Nicholas C; Kiyohara, Eiji; Hsu, Sandy; Nelson, Nellie; Izraely, Sivan; Sagi-Assif, Orit; Witz, Isaac P; Ma, Xiao-Jun; Luo, Yuling; Hoon, Dave SB

2015-01-01

In recent years, considerable advances have been made in the characterization of protein-coding alterations involved in the pathogenesis of melanoma. However, despite their growing implication in cancer, little is known about the role of long non-coding RNAs in melanoma progression. We hypothesized that copy number alterations of intergenic non-protein coding domains could help identify long intergenic non-coding RNAs (lincRNAs) associated with metastatic cutaneous melanoma. Among several candidates, our approach uncovered the chromosome 6p22.3 CASC15 lincRNA locus as a frequently gained genomic segment in metastatic melanoma tumors and cell lines. The locus was actively transcribed in metastatic melanoma cells, and up-regulation of CASC15 expression was associated with metastatic progression to brain metastasis in a mouse xenograft model. In clinical specimens, CASC15 levels increased during melanoma progression and were independent predictors of disease recurrence in a cohort of 141 patients with AJCC stage III lymph node metastasis. Moreover, siRNA knockdown experiments revealed that CASC15 regulates melanoma cell phenotype switching between proliferative and invasive states. Accordingly, CASC15 levels correlated with known gene signatures corresponding to melanoma proliferative and invasive phenotypes. These findings support a key role for CASC15 in metastatic melanoma. PMID:26016895
Evolution of the unspliced transcriptome.

PubMed

Engelhardt, Jan; Stadler, Peter F

2015-08-20

Despite their abundance, unspliced EST data have received little attention as a source of information on non-coding RNAs. Very little is know, therefore, about the genomic distribution of unspliced non-coding transcripts and their relationship with the much better studied regularly spliced products. In particular, their evolution has remained virtually unstudied. We systematically study the evidence on unspliced transcripts available in EST annotation tracks for human and mouse, comprising 104,980 and 66,109 unspliced EST clusters, respectively. Roughly one third of these are located totally inside introns of known genes (TINs) and another third overlaps exonic regions (PINs). Eleven percent are "intergenic", far away from any annotated gene. Direct evidence for the independent transcription of many PINs and TINs is obtained from CAGE tag and chromatin data. We predict more than 2000 3'UTR-associated RNA candidates for each human and mouse. Fifteen to twenty percent of the unspliced EST cluster are conserved between human and mouse. With the exception of TINs, the sequences of unspliced EST clusters evolve significantly slower than genomic background. Furthermore, like spliced lincRNAs, they show highly tissue-specific expression patterns. Unspliced long non-coding RNAs are an important, rapidly evolving, component of mammalian transcriptomes. Their analysis is complicated by their preferential association with complex transcribed loci that usually also harbor a plethora of spliced transcripts. Unspliced EST data, although typically disregarded in transcriptome analysis, can be used to gain insights into this rarely investigated transcriptome component. The frequently postulated connection between lack of splicing and nuclear retention and the surprising overlap of chromatin-associated transcripts suggests that this class of transcripts might be involved in chromatin organization and possibly other mechanisms of epigenetic control.
Dual Analysis of the Murine Cytomegalovirus and Host Cell Transcriptomes Reveal New Aspects of the Virus-Host Cell Interface

PubMed Central

Juranic Lisnic, Vanda; Babic Cac, Marina; Lisnic, Berislav; Trsan, Tihana; Mefferd, Adam; Das Mukhopadhyay, Chitrangada; Cook, Charles H.; Jonjic, Stipan; Trgovcich, Joanne

2013-01-01

Major gaps in our knowledge of pathogen genes and how these gene products interact with host gene products to cause disease represent a major obstacle to progress in vaccine and antiviral drug development for the herpesviruses. To begin to bridge these gaps, we conducted a dual analysis of Murine Cytomegalovirus (MCMV) and host cell transcriptomes during lytic infection. We analyzed the MCMV transcriptome during lytic infection using both classical cDNA cloning and sequencing of viral transcripts and next generation sequencing of transcripts (RNA-Seq). We also investigated the host transcriptome using RNA-Seq combined with differential gene expression analysis, biological pathway analysis, and gene ontology analysis. We identify numerous novel spliced and unspliced transcripts of MCMV. Unexpectedly, the most abundantly transcribed viral genes are of unknown function. We found that the most abundant viral transcript, recently identified as a noncoding RNA regulating cellular microRNAs, also codes for a novel protein. To our knowledge, this is the first viral transcript that functions both as a noncoding RNA and an mRNA. We also report that lytic infection elicits a profound cellular response in fibroblasts. Highly upregulated and induced host genes included those involved in inflammation and immunity, but also many unexpected transcription factors and host genes related to development and differentiation. Many top downregulated and repressed genes are associated with functions whose roles in infection are obscure, including host long intergenic noncoding RNAs, antisense RNAs or small nucleolar RNAs. Correspondingly, many differentially expressed genes cluster in biological pathways that may shed new light on cytomegalovirus pathogenesis. Together, these findings provide new insights into the molecular warfare at the virus-host interface and suggest new areas of research to advance the understanding and treatment of cytomegalovirus-associated diseases. PMID:24086132
The full mitochondrial genome sequence of Raillietina tetragona from chicken (Cestoda: Davaineidae).

PubMed

Liang, Jian-Ying; Lin, Rui-Qing

2016-11-01

In the present study, the complete mitochondrial DNA (mtDNA) sequence of Raillietina tetragona was sequenced and its gene contents and genome organizations was compared with that of other tapeworm. The complete mt genome sequence of R. tetragona is 14,444 bp in length. It contains 12 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes, and two non-coding region. All genes are transcribed in the same direction and have a nucleotide composition high in A and T. The contents of A + T of the complete mt genome are 71.4% for R. tetragona. The R. tetragona mt genome sequence provides novel mtDNA marker for studying the molecular epidemiology and population genetics of Raillietina and has implications for the molecular diagnosis of chicken cestodosis caused by Raillietina.
Quantitative Profiling of Peptides from RNAs classified as non-coding

PubMed Central

Prabakaran, Sudhakaran; Hemberg, Martin; Chauhan, Ruchi; Winter, Dominic; Tweedie-Cullen, Ry Y.; Dittrich, Christian; Hong, Elizabeth; Gunawardena, Jeremy; Steen, Hanno; Kreiman, Gabriel; Steen, Judith A.

2014-01-01

Only a small fraction of the mammalian genome codes for messenger RNAs destined to be translated into proteins, and it is generally assumed that a large portion of transcribed sequences - including introns and several classes of non-coding RNAs (ncRNAs) do not give rise to peptide products. A systematic examination of translation and physiological regulation of ncRNAs has not been conducted. Here, we use computational methods to identify the products of non-canonical translation in mouse neurons by analyzing unannotated transcripts in combination with proteomic data. This study supports the existence of non-canonical translation products from both intragenic and extragenic genomic regions, including peptides derived from anti-sense transcripts and introns. Moreover, the studied novel translation products exhibit temporal regulation similar to that of proteins known to be involved in neuronal activity processes. These observations highlight a potentially large and complex set of biologically regulated translational events from transcripts formerly thought to lack coding potential. PMID:25403355

Molecular Evolution of the Non-Coding Eosinophil Granule Ontogeny Transcript

PubMed Central

Rose, Dominic; Stadler, Peter F.

2011-01-01

Eukaryotic genomes are pervasively transcribed. A large fraction of the transcriptional output consists of long, mRNA-like, non-protein-coding transcripts (mlncRNAs). The evolutionary history of mlncRNAs is still largely uncharted territory. In this contribution, we explore in detail the evolutionary traces of the eosinophil granule ontogeny transcript (EGOT), an experimentally confirmed representative of an abundant class of totally intronic non-coding transcripts (TINs). EGOT is located antisense to an intron of the ITPR1 gene. We computationally identify putative EGOT orthologs in the genomes of 32 different amniotes, including orthologs from primates, rodents, ungulates, carnivores, afrotherians, and xenarthrans, as well as putative candidates from basal amniotes, such as opossum or platypus. We investigate the EGOT gene phylogeny, analyze patterns of sequence conservation, and the evolutionary conservation of the EGOT gene structure. We show that EGO-B, the spliced isoform, may be present throughout the placental mammals, but most likely dates back even further. We demonstrate here for the first time that the whole EGOT locus is highly structured, containing several evolutionary conserved, and thermodynamic stable secondary structures. Our analyses allow us to postulate novel functional roles of a hitherto poorly understood region at the intron of EGO-B which is highly conserved at the sequence level. The region contains a novel ITPR1 exon and also conserved RNA secondary structures together with a conserved TATA-like element, which putatively acts as a promoter of an independent regulatory element. PMID:22303364
Advanced Design of Dumbbell-shaped Genetic Minimal Vectors Improves Non-coding and Coding RNA Expression.

PubMed

Jiang, Xiaoou; Yu, Han; Teo, Cui Rong; Tan, Genim Siu Xian; Goh, Sok Chin; Patel, Parasvi; Chua, Yiqiang Kevin; Hameed, Nasirah Banu Sahul; Bertoletti, Antonio; Patzel, Volker

2016-09-01

Dumbbell-shaped DNA minimal vectors lacking nontherapeutic genes and bacterial sequences are considered a stable, safe alternative to viral, nonviral, and naked plasmid-based gene-transfer systems. We investigated novel molecular features of dumbbell vectors aiming to reduce vector size and to improve the expression of noncoding or coding RNA. We minimized small hairpin RNA (shRNA) or microRNA (miRNA) expressing dumbbell vectors in size down to 130 bp generating the smallest genetic expression vectors reported. This was achieved by using a minimal H1 promoter with integrated transcriptional terminator transcribing the RNA hairpin structure around the dumbbell loop. Such vectors were generated with high conversion yields using a novel protocol. Minimized shRNA-expressing dumbbells showed accelerated kinetics of delivery and transcription leading to enhanced gene silencing in human tissue culture cells. In primary human T cells, minimized miRNA-expressing dumbbells revealed higher stability and triggered stronger target gene suppression as compared with plasmids and miRNA mimics. Dumbbell-driven gene expression was enhanced up to 56- or 160-fold by implementation of an intron and the SV40 enhancer compared with control dumbbells or plasmids. Advanced dumbbell vectors may represent one option to close the gap between durable expression that is achievable with integrating viral vectors and short-term effects triggered by naked RNA.
The Mediator complex: a central integrator of transcription

PubMed Central

Allen, Benjamin L.; Taatjes, Dylan J.

2016-01-01

The RNA polymerase II (pol II) enzyme transcribes all protein-coding and most non-coding RNA genes and is globally regulated by Mediator, a large, conformationally flexible protein complex with variable subunit composition (for example, a four-subunit CDK8 module can reversibly associate). These biochemical characteristics are fundamentally important for Mediator's ability to control various processes important for transcription, including organization of chromatin architecture and regulation of pol II pre-initiation, initiation, re-initiation, pausing, and elongation. Although Mediator exists in all eukaryotes, a variety of Mediator functions appear to be specific to metazoans, indicative of more diverse regulatory requirements. PMID:25693131
Antisense and sense poly(A)-RNAs from the Xenopus laevis pyruvate dehydrogenase gene loci are regulated with message production during embryogenesis.

PubMed

Islam, N; Poitras, L; Gagnon, F; Moss, T

1996-10-17

The structure and temporal expression of two Xenopus cDNAs encoding the beta subunit of pyruvate dehydrogenase (XPdhE1 beta) have been determined. XPdhE1 beta was 88% homologous to mature human PdhE1 beta, but the putative N-terminal mitochondrial signal peptide was poorly conserved. Zygotic expression of XPdhE1 beta mRNA was detected at neural tube closure and increased until stage 40. RT-PCR cloning identified a short homology to a protein kinase open reading frame within the 3' non-coding sequence of the XPdhE1 beta cDNAs. This homology, which occurred on the antisense cDNA strand, was shown by strand specific RT-PCR to be transcribed in vivo as part of an antisense RNA. Northern analysis showed that this RNA formed part of an abundant and heterogeneous population of antisense and sense poly(A)-RNAs transcribed from the XPdhE1 beta loci and coordinately regulated with message production.
Keeping abreast with long non-coding RNAs in mammary gland development and breast cancer

PubMed Central

Hansji, Herah; Leung, Euphemia Y.; Baguley, Bruce C.; Finlay, Graeme J.; Askarian-Amiri, Marjan E.

2014-01-01

The majority of the human genome is transcribed, even though only 2% of transcripts encode proteins. Non-coding transcripts were originally dismissed as evolutionary junk or transcriptional noise, but with the development of whole genome technologies, these non-coding RNAs (ncRNAs) are emerging as molecules with vital roles in regulating gene expression. While shorter ncRNAs have been extensively studied, the functional roles of long ncRNAs (lncRNAs) are still being elucidated. Studies over the last decade show that lncRNAs are emerging as new players in a number of diseases including cancer. Potential roles in both oncogenic and tumor suppressive pathways in cancer have been elucidated, but the biological functions of the majority of lncRNAs remain to be identified. Accumulated data are identifying the molecular mechanisms by which lncRNA mediates both structural and functional roles. LncRNA can regulate gene expression at both transcriptional and post-transcriptional levels, including splicing and regulating mRNA processing, transport, and translation. Much current research is aimed at elucidating the function of lncRNAs in breast cancer and mammary gland development, and at identifying the cellular processes influenced by lncRNAs. In this paper we review current knowledge of lncRNAs contributing to these processes and present lncRNA as a new paradigm in breast cancer development. PMID:25400658
Mediator directs co-transcriptional heterochromatin assembly by RNA interference-dependent and -independent pathways.

PubMed

Oya, Eriko; Kato, Hiroaki; Chikashige, Yuji; Tsutsumi, Chihiro; Hiraoka, Yasushi; Murakami, Yota

2013-01-01

Heterochromatin at the pericentromeric repeats in fission yeast is assembled and spread by an RNAi-dependent mechanism, which is coupled with the transcription of non-coding RNA from the repeats by RNA polymerase II. In addition, Rrp6, a component of the nuclear exosome, also contributes to heterochromatin assembly and is coupled with non-coding RNA transcription. The multi-subunit complex Mediator, which directs initiation of RNA polymerase II-dependent transcription, has recently been suggested to function after initiation in processes such as elongation of transcription and splicing. However, the role of Mediator in the regulation of chromatin structure is not well understood. We investigated the role of Mediator in pericentromeric heterochromatin formation and found that deletion of specific subunits of the head domain of Mediator compromised heterochromatin structure. The Mediator head domain was required for Rrp6-dependent heterochromatin nucleation at the pericentromere and for RNAi-dependent spreading of heterochromatin into the neighboring region. In the latter process, Mediator appeared to contribute to efficient processing of siRNA from transcribed non-coding RNA, which was required for efficient spreading of heterochromatin. Furthermore, the head domain directed efficient transcription in heterochromatin. These results reveal a pivotal role for Mediator in multiple steps of transcription-coupled formation of pericentromeric heterochromatin. This observation further extends the role of Mediator to co-transcriptional chromatin regulation.
Noncoding transcripts in sense and antisense orientation regulate the epigenetic state of ribosomal RNA genes.

PubMed

Bierhoff, H; Schmitz, K; Maass, F; Ye, J; Grummt, I

2010-01-01

Alternative transcription of the same gene in sense and antisense orientation regulates expression of protein-coding genes. Here we show that noncoding RNA (ncRNA) in sense and antisense orientation also controls transcription of rRNA genes (rDNA). rDNA exists in two types of chromatin--a euchromatic conformation that is permissive to transcription and a heterochromatic conformation that is transcriptionally silent. Silencing of rDNA is mediated by NoRC, a chromatin-remodeling complex that triggers heterochromatin formation. NoRC function requires RNA that is complementary to the rDNA promoter (pRNA). pRNA forms a DNA:RNA triplex with a regulatory element in the rDNA promoter, and this triplex structure is recognized by DNMT3b. The results imply that triplex-mediated targeting of DNMT3b to specific sequences may be a common pathway in epigenetic regulation. We also show that rDNA is transcribed in antisense orientation. The level of antisense RNA (asRNA) is down-regulated in cancer cells and up-regulated in senescent cells. Ectopic asRNA triggers trimethylation of histone H4 at lysine 20 (H4K20me3), suggesting that antisense transcripts guide the histone methyltransferase Suv4-20 to rDNA. The results reveal that noncoding RNAs in sense and antisense orientation are important determinants of the epigenetic state of rDNA.
Natural Antisense Transcripts: Molecular Mechanisms and Implications in Breast Cancers

PubMed Central

Latgé, Guillaume; Poulet, Christophe; Bours, Vincent; Jerusalem, Guy

2018-01-01

Natural antisense transcripts are RNA sequences that can be transcribed from both DNA strands at the same locus but in the opposite direction from the gene transcript. Because strand-specific high-throughput sequencing of the antisense transcriptome has only been available for less than a decade, many natural antisense transcripts were first described as long non-coding RNAs. Although the precise biological roles of natural antisense transcripts are not known yet, an increasing number of studies report their implication in gene expression regulation. Their expression levels are altered in many physiological and pathological conditions, including breast cancers. Among the potential clinical utilities of the natural antisense transcripts, the non-coding|coding transcript pairs are of high interest for treatment. Indeed, these pairs can be targeted by antisense oligonucleotides to specifically tune the expression of the coding-gene. Here, we describe the current knowledge about natural antisense transcripts, their varying molecular mechanisms as gene expression regulators, and their potential as prognostic or predictive biomarkers in breast cancers. PMID:29301303
Natural Antisense Transcripts: Molecular Mechanisms and Implications in Breast Cancers.

PubMed

Latgé, Guillaume; Poulet, Christophe; Bours, Vincent; Josse, Claire; Jerusalem, Guy

2018-01-02

Natural antisense transcripts are RNA sequences that can be transcribed from both DNA strands at the same locus but in the opposite direction from the gene transcript. Because strand-specific high-throughput sequencing of the antisense transcriptome has only been available for less than a decade, many natural antisense transcripts were first described as long non-coding RNAs. Although the precise biological roles of natural antisense transcripts are not known yet, an increasing number of studies report their implication in gene expression regulation. Their expression levels are altered in many physiological and pathological conditions, including breast cancers. Among the potential clinical utilities of the natural antisense transcripts, the non-coding|coding transcript pairs are of high interest for treatment. Indeed, these pairs can be targeted by antisense oligonucleotides to specifically tune the expression of the coding-gene. Here, we describe the current knowledge about natural antisense transcripts, their varying molecular mechanisms as gene expression regulators, and their potential as prognostic or predictive biomarkers in breast cancers.
Identification and Characterization of Long Non-Coding RNAs Related to Mouse Embryonic Brain Development from Available Transcriptomic Data

PubMed Central

He, Hongjuan; Xiu, Youcheng; Guo, Jing; Liu, Hui; Liu, Qi; Zeng, Tiebo; Chen, Yan; Zhang, Yan; Wu, Qiong

2013-01-01

Long non-coding RNAs (lncRNAs) as a key group of non-coding RNAs have gained widely attention. Though lncRNAs have been functionally annotated and systematic explored in higher mammals, few are under systematical identification and annotation. Owing to the expression specificity, known lncRNAs expressed in embryonic brain tissues remain still limited. Considering a large number of lncRNAs are only transcribed in brain tissues, studies of lncRNAs in developmental brain are therefore of special interest. Here, publicly available RNA-sequencing (RNA-seq) data in embryonic brain are integrated to identify thousands of embryonic brain lncRNAs by a customized pipeline. A significant proportion of novel transcripts have not been annotated by available genomic resources. The putative embryonic brain lncRNAs are shorter in length, less spliced and show less conservation than known genes. The expression of putative lncRNAs is in one tenth on average of known coding genes, while comparable with known lncRNAs. From chromatin data, putative embryonic brain lncRNAs are associated with active chromatin marks, comparable with known lncRNAs. Embryonic brain expressed lncRNAs are also indicated to have expression though not evident in adult brain. Gene Ontology analysis of putative embryonic brain lncRNAs suggests that they are associated with brain development. The putative lncRNAs are shown to be related to possible cis-regulatory roles in imprinting even themselves are deemed to be imprinted lncRNAs. Re-analysis of one knockdown data suggests that four regulators are associated with lncRNAs. Taken together, the identification and systematic analysis of putative lncRNAs would provide novel insights into uncharacterized mouse non-coding regions and the relationships with mammalian embryonic brain development. PMID:23967161
The Ever-Evolving Concept of the Gene: The Use of RNA/Protein Experimental Techniques to Understand Genome Functions

PubMed Central

Cipriano, Andrea; Ballarino, Monica

2018-01-01

The completion of the human genome sequence together with advances in sequencing technologies have shifted the paradigm of the genome, as composed of discrete and hereditable coding entities, and have shown the abundance of functional noncoding DNA. This part of the genome, previously dismissed as “junk” DNA, increases proportionally with organismal complexity and contributes to gene regulation beyond the boundaries of known protein-coding genes. Different classes of functionally relevant nonprotein-coding RNAs are transcribed from noncoding DNA sequences. Among them are the long noncoding RNAs (lncRNAs), which are thought to participate in the basal regulation of protein-coding genes at both transcriptional and post-transcriptional levels. Although knowledge of this field is still limited, the ability of lncRNAs to localize in different cellular compartments, to fold into specific secondary structures and to interact with different molecules (RNA or proteins) endows them with multiple regulatory mechanisms. It is becoming evident that lncRNAs may play a crucial role in most biological processes such as the control of development, differentiation and cell growth. This review places the evolution of the concept of the gene in its historical context, from Darwin's hypothetical mechanism of heredity to the post-genomic era. We discuss how the original idea of protein-coding genes as unique determinants of phenotypic traits has been reconsidered in light of the existence of noncoding RNAs. We summarize the technological developments which have been made in the genome-wide identification and study of lncRNAs and emphasize the methodologies that have aided our understanding of the complexity of lncRNA-protein interactions in recent years. PMID:29560353
Divergent transcription is associated with promoters of transcriptional regulators

PubMed Central

2013-01-01

Background Divergent transcription is a wide-spread phenomenon in mammals. For instance, short bidirectional transcripts are a hallmark of active promoters, while longer transcripts can be detected antisense from active genes in conditions where the RNA degradation machinery is inhibited. Moreover, many described long non-coding RNAs (lncRNAs) are transcribed antisense from coding gene promoters. However, the general significance of divergent lncRNA/mRNA gene pair transcription is still poorly understood. Here, we used strand-specific RNA-seq with high sequencing depth to thoroughly identify antisense transcripts from coding gene promoters in primary mouse tissues. Results We found that a substantial fraction of coding-gene promoters sustain divergent transcription of long non-coding RNA (lncRNA)/mRNA gene pairs. Strikingly, upstream antisense transcription is significantly associated with genes related to transcriptional regulation and development. Their promoters share several characteristics with those of transcriptional developmental genes, including very large CpG islands, high degree of conservation and epigenetic regulation in ES cells. In-depth analysis revealed a unique GC skew profile at these promoter regions, while the associated coding genes were found to have large first exons, two genomic features that might enforce bidirectional transcription. Finally, genes associated with antisense transcription harbor specific H3K79me2 epigenetic marking and RNA polymerase II enrichment profiles linked to an intensified rate of early transcriptional elongation. Conclusions We concluded that promoters of a class of transcription regulators are characterized by a specialized transcriptional control mechanism, which is directly coupled to relaxed bidirectional transcription. PMID:24365181
The Big Entity of New RNA World: Long Non-Coding RNAs in Microvascular Complications of Diabetes.

PubMed

Raut, Satish K; Khullar, Madhu

2018-01-01

A major part of the genome is known to be transcribed into non-protein coding RNAs (ncRNAs), such as microRNA and long non-coding RNA (lncRNA). The importance of ncRNAs is being increasingly recognized in physiological and pathological processes. lncRNAs are a novel class of ncRNAs that do not code for proteins and are important regulators of gene expression. In the past, these molecules were thought to be transcriptional "noise" with low levels of evolutionary conservation. However, recent studies provide strong evidence indicating that lncRNAs are (i) regulated during various cellular processes, (ii) exhibit cell type-specific expression, (iii) localize to specific organelles, and (iv) associated with human diseases. Emerging evidence indicates an aberrant expression of lncRNAs in diabetes and diabetes-related microvascular complications. In the present review, we discuss the current state of knowledge of lncRNAs, their genesis from genome, and the mechanism of action of individual lncRNAs in the pathogenesis of microvascular complications of diabetes and therapeutic approaches.
Noncoding RNA:RNA Regulatory Networks in Cancer

PubMed Central

Chan, Jia Jia; Tay, Yvonne

2018-01-01

Noncoding RNAs (ncRNAs) constitute the majority of the human transcribed genome. This largest class of RNA transcripts plays diverse roles in a multitude of cellular processes, and has been implicated in many pathological conditions, especially cancer. The different subclasses of ncRNAs include microRNAs, a class of short ncRNAs; and a variety of long ncRNAs (lncRNAs), such as lincRNAs, antisense RNAs, pseudogenes, and circular RNAs. Many studies have demonstrated the involvement of these ncRNAs in competitive regulatory interactions, known as competing endogenous RNA (ceRNA) networks, whereby lncRNAs can act as microRNA decoys to modulate gene expression. These interactions are often interconnected, thus aberrant expression of any network component could derail the complex regulatory circuitry, culminating in cancer development and progression. Recent integrative analyses have provided evidence that new computational platforms and experimental approaches can be harnessed together to distinguish key ceRNA interactions in specific cancers, which could facilitate the identification of robust biomarkers and therapeutic targets, and hence, more effective cancer therapies and better patient outcome and survival. PMID:29702599
Mutation in a primate-conserved retrotransposon reveals a noncoding RNA as a mediator of infantile encephalopathy

PubMed Central

Cartault, François; Munier, Patrick; Benko, Edgar; Desguerre, Isabelle; Hanein, Sylvain; Boddaert, Nathalie; Bandiera, Simonetta; Vellayoudom, Jeanine; Krejbich-Trotot, Pascale; Bintner, Marc; Hoarau, Jean-Jacques; Girard, Muriel; Génin, Emmanuelle; de Lonlay, Pascale; Fourmaintraux, Alain; Naville, Magali; Rodriguez, Diana; Feingold, Josué; Renouil, Michel; Munnich, Arnold; Westhof, Eric; Fähling, Michael; Lyonnet, Stanislas; Henrion-Caude, Alexandra

2012-01-01

The human genome is densely populated with transposons and transposon-like repetitive elements. Although the impact of these transposons and elements on human genome evolution is recognized, the significance of subtle variations in their sequence remains mostly unexplored. Here we report homozygosity mapping of an infantile neurodegenerative disease locus in a genetic isolate. Complete DNA sequencing of the 400-kb linkage locus revealed a point mutation in a primate-specific retrotransposon that was transcribed as part of a unique noncoding RNA, which was expressed in the brain. In vitro knockdown of this RNA increased neuronal apoptosis, consistent with the inappropriate dosage of this RNA in vivo and with the phenotype. Moreover, structural analysis of the sequence revealed a small RNA-like hairpin that was consistent with the putative gain of a functional site when mutated. We show here that a mutation in a unique transposable element-containing RNA is associated with lethal encephalopathy, and we suggest that RNAs that harbor evolutionarily recent repetitive elements may play important roles in human brain development. PMID:22411793
The RNA Exosome Adaptor ZFC3H1 Functionally Competes with Nuclear Export Activity to Retain Target Transcripts.

PubMed

Silla, Toomas; Karadoulama, Evdoxia; Mąkosa, Dawid; Lubas, Michal; Jensen, Torben Heick

2018-05-15

Mammalian genomes are promiscuously transcribed, yielding protein-coding and non-coding products. Many transcripts are short lived due to their nuclear degradation by the ribonucleolytic RNA exosome. Here, we show that abolished nuclear exosome function causes the formation of distinct nuclear foci, containing polyadenylated (pA + ) RNA secluded from nucleocytoplasmic export. We asked whether exosome co-factors could serve such nuclear retention. Co-localization studies revealed the enrichment of pA + RNA foci with "pA-tail exosome targeting (PAXT) connection" components MTR4, ZFC3H1, and PABPN1 but no overlap with known nuclear structures such as Cajal bodies, speckles, paraspeckles, or nucleoli. Interestingly, ZFC3H1 is required for foci formation, and in its absence, selected pA + RNAs, including coding and non-coding transcripts, are exported to the cytoplasm in a process dependent on the mRNA export factor AlyREF. Our results establish ZFC3H1 as a central nuclear pA + RNA retention factor, counteracting nuclear export activity. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.
Adipocyte Long-Noncoding RNA Transcriptome Analysis of Obese Mice Identified Lnc-Leptin, Which Regulates Leptin.

PubMed

Lo, Kinyui Alice; Huang, Shiqi; Walet, Arcinas Camille Esther; Zhang, Zhi-Chun; Leow, Melvin Khee-Shing; Liu, Meihui; Sun, Lei

2018-06-01

Obesity induces profound transcriptome changes in adipocytes, and recent evidence suggests that long-noncoding RNAs (lncRNAs) play key roles in this process. We performed a comprehensive transcriptome study by RNA sequencing in adipocytes isolated from interscapular brown, inguinal, and epididymal white adipose tissue in diet-induced obese mice. The analysis revealed a set of obesity-dysregulated lncRNAs, many of which exhibit dynamic changes in the fed versus fasted state, potentially serving as novel molecular markers of adipose energy status. Among the most prominent lncRNAs is Lnc-leptin , which is transcribed from an enhancer region upstream of leptin ( Lep ). Expression of Lnc-leptin is sensitive to insulin and closely correlates to Lep expression across diverse pathophysiological conditions. Functionally, induction of Lnc-leptin is essential for adipogenesis, and its presence is required for the maintenance of Lep expression in vitro and in vivo. Direct interaction was detected between DNA loci of Lnc-leptin and Lep in mature adipocytes, which diminished upon Lnc-leptin knockdown. Our study establishes Lnc-leptin as a new regulator of Lep . © 2018 by the American Diabetes Association.
LncRNAs: the bridge linking RNA and colorectal cancer

PubMed Central

Yang, Qilian; Le, Xiaobing; Yang, Huiliang; Wang, Chenlu; Luo, Zhongyue; Xuan, Yu; Chen, Yi; Deng, Xiangbing; Xu, Lian; Feng, Min; Yi, Tao; Zhao, Xia; Zhou, Shengtao

2017-01-01

Long noncoding RNAs (lncRNAs) are transcribed by genomic regions (exceeding 200 nucleotides in length) that do not encode proteins. While the exquisite regulation of lncRNA transcription can provide signals of malignant transformation, lncRNAs control pleiotropic cancer phenotypes through interactions with other cellular molecules including DNA, protein, and RNA. Recent studies have demonstrated that dysregulation of lncRNAs is influential in proliferation, angiogenesis, metastasis, invasion, apoptosis, stemness, and genome instability in colorectal cancer (CRC), with consequent clinical implications. In this review, we explicate the roles of different lncRNAs in CRC, and the potential implications for their clinical application. PMID:27888635
Specific inhibition of aphthovirus infection by RNAs transcribed from both the 5' and the 3' noncoding regions.

PubMed Central

Gutiérrez, A; Martínez-Salas, E; Pintado, B; Sobrino, F

1994-01-01

RNA molecules containing the 3' terminal region of foot-and-mouth disease virus (FMDV) RNA in both antisense and sense orientations were able to inhibit viral FMDV translation and infective particle formation in BHK-21 cells following comicroinjection or cotransfection with infectious viral RNA. Antisense, but not sense, transcripts from the 5' noncoding region including the proximal element of the internal ribosome entry site and the two functional initiation AUGs were also inhibitory, both in in vitro translation and in vivo in comicroinjected or cotransfected BHK-21 cells. This effect was not observed with nonrelated RNA transcripts from lambda phage. The inhibitions found were permanent, sequence specific, and dose dependent; an inverse correlation between the length of the transcript and the extent of the antiviral effect was seen. In all cases, the extent of inhibition increased when viral RNAs and transcripts were allowed to reanneal before transfection, concomitant with a decrease in the doses required. The antiviral effect was specific for FMDV, since transcripts failed to inhibit infective particle formation by other picornavirus, such as encephalomyocarditis virus. These results indicate that the ability of RNA transcripts to inhibit viral multiplication depends on their efficient hybridization with target regions on the viral genome. Furthermore, cells transfected with the 5'1as transcript, which is complementary to the 5' noncoding region, showed a significant reduction of plaque-forming ability during the course of a natural infection. RNA 5'1as was able to inhibit FMDV RNA translation in vitro, suggesting that the inhibitions observed are mediated by a blockage of the viral translation initiation. Conversely, hybridization of short sequences of both sense and antisense transcripts from the 3' end induces distortion of predicted highly ordered structural motifs, which could be required for the synthesis of negative-stranded viral RNA, and correlates with inhibition of viral propagation. Images PMID:7933126
Specific inhibition of aphthovirus infection by RNAs transcribed from both the 5' and the 3' noncoding regions.

PubMed

Gutiérrez, A; Martínez-Salas, E; Pintado, B; Sobrino, F

1994-11-01

RNA molecules containing the 3' terminal region of foot-and-mouth disease virus (FMDV) RNA in both antisense and sense orientations were able to inhibit viral FMDV translation and infective particle formation in BHK-21 cells following comicroinjection or cotransfection with infectious viral RNA. Antisense, but not sense, transcripts from the 5' noncoding region including the proximal element of the internal ribosome entry site and the two functional initiation AUGs were also inhibitory, both in in vitro translation and in vivo in comicroinjected or cotransfected BHK-21 cells. This effect was not observed with nonrelated RNA transcripts from lambda phage. The inhibitions found were permanent, sequence specific, and dose dependent; an inverse correlation between the length of the transcript and the extent of the antiviral effect was seen. In all cases, the extent of inhibition increased when viral RNAs and transcripts were allowed to reanneal before transfection, concomitant with a decrease in the doses required. The antiviral effect was specific for FMDV, since transcripts failed to inhibit infective particle formation by other picornavirus, such as encephalomyocarditis virus. These results indicate that the ability of RNA transcripts to inhibit viral multiplication depends on their efficient hybridization with target regions on the viral genome. Furthermore, cells transfected with the 5'1as transcript, which is complementary to the 5' noncoding region, showed a significant reduction of plaque-forming ability during the course of a natural infection. RNA 5'1as was able to inhibit FMDV RNA translation in vitro, suggesting that the inhibitions observed are mediated by a blockage of the viral translation initiation. Conversely, hybridization of short sequences of both sense and antisense transcripts from the 3' end induces distortion of predicted highly ordered structural motifs, which could be required for the synthesis of negative-stranded viral RNA, and correlates with inhibition of viral propagation.

Genomic assessment of the evolution of the prion protein gene family in vertebrates.

PubMed

Harrison, Paul M; Khachane, Amit; Kumar, Manish

2010-05-01

Prion diseases are devastating neurological disorders caused by the propagation of particles containing an alternative beta-sheet-rich form of the prion protein (PrP). Genes paralogous to PrP, called Doppel and Shadoo, have been identified, that also have neuropathological relevance. To aid in the further functional characterization of PrP and its relatives, we annotated completely the PrP gene family (PrP-GF), in the genomes of 42 vertebrates, through combined strategic application of gene prediction programs and advanced remote homology detection techniques (such as HMMs, PSI-TBLASTN and pGenThreader). We have uncovered several previously undescribed paralogous genes and pseudogenes. We find that current high-quality genomic evidence indicates that the PrP relative Doppel, was likely present in the last common ancestor of present-day Tetrapoda, but was lost in the bird lineage, since its divergence from reptiles. Using the new gene annotations, we have defined the consensus of structural features that are characteristic of the PrP and Doppel structures, across diverse Tetrapoda clades. Furthermore, we describe in detail a transcribed pseudogene derived from Shadoo that is conserved across primates, and that overlaps the meiosis gene, SYCE1, thus possibly regulating its expression. In addition, we analysed the locus of PRNP/PRND for significant conservation across the genomic DNA of eleven mammals, and determined the phylogenetic penetration of non-coding exons. The genomic evidence indicates that the second PRNP non-coding exon found in even-toed ungulates and rodents, is conserved in all high-coverage genome assemblies of primates (human, chimp, orang utan and macaque), and is, at least, likely to have fallen out of use during primate speciation. Furthermore, we have demonstrated that the PRNT gene (at the PRNP human locus) is conserved across at least sixteen mammals, and evolves like a long non-coding RNA, fashioned from fragments of ancient, long, interspersed elements. These annotations and evolutionary analyses will be of further use for functional characterisation of the PrP-GF, and will be updatable in a semi-automated fashion as more genomes accumulate. Copyright 2010 Elsevier Inc. All rights reserved.
Melatonin promotes Cashmere goat (Capra hircus) secondary hair follicle growth: A view from integrated analysis of long non-coding and coding RNAs.

PubMed

Ge, Wei; Wang, Shan-He; Sun, Bing; Zhang, Yue-Lang; Shen, Wei; Khatib, Hasan; Wang, Xin

2018-06-12

The role of melatonin in promoting the yield of Cashmere goat wool has been demonstrated for decades though there remains a lack of knowledge regarding melatonin mediated hair follicle growth. Recent studies have demonstrated that long non-coding RNAs (lncRNAs) are widely transcribed in the genome and play ubiquitous roles in regulating biological processes. However, the role of lncRNAs in regulating melatonin mediated hair follicle growth remains unclear. In this study, we established an in vitro Cashmere goat secondary hair follicle culture system, and demonstrated that 500 ng/L melatonin exposure promoted hair follicle fiber growth. Based on long intergenic RNA sequencing, we demonstrated that melatonin promoted hair follicle elongation via regulating genes involved in focal adhesion and extracellular matrix receptor pathways and further cis predicting of lncRNAs targeted genes indicated that melatonin mediated lncRNAs mainly targeted vascular smooth muscle contraction and signaling pathways regulating the pluripotency of stem cells. We proposed that melatonin exposure not only perturbed key signals secreted from hair follicle stem cells to regulate hair follicle development, but also mediated lncRNAs mainly targeted to pathways involved in the microvascular system and extracellular matrix, which constitute the highly orchestrated microenvironment for hair follicle stem cell. Taken together, our findings here provide a profound view of lncRNAs in regulating Cashmere goat hair follicle circadian rhythms and broaden our knowledge on melatonin mediated hair follicle morphological changes.
Conserved features of eukaryotic hsp70 genes revealed by comparison with the nucleotide sequence of human hsp70.

PubMed Central

Hunt, C; Morimoto, R I

1985-01-01

We have determined the nucleotide sequence of the human hsp70 gene and 5' flanking region. The hsp70 gene is transcribed as an uninterrupted primary transcript of 2440 nucleotides composed of a 5' noncoding leader sequence of 212 nucleotides, a 3' noncoding region of 242 nucleotides, and a continuous open reading frame of 1986 nucleotides that encodes a protein with predicted molecular mass of 69,800 daltons. Upstream of the 5' terminus are the canonical TATAAA box, the sequence ATTGG that corresponds in the inverted orientation to the CCAAT motif, and the dyad sequence CTGGAAT/ATTCCCG that shares homology in 12 of 14 positions with the consensus transcription regulatory sequence common to Drosophila heat shock genes. Comparison of the predicted amino acid sequences of human hsp70 with the published sequences of Drosophila hsp70 and Escherichia coli dnaK reveals that human hsp70 is 73% identical to Drosophila hsp70 and 47% identical to E. coli dnaK. Surprisingly, the nucleotide sequences of the human and Drosophila genes are 72% identical and human and E. coli genes are 50% identical, which is more highly conserved than necessary given the degeneracy of the genetic code. The lack of accumulated silent nucleotide substitutions leads us to propose that there may be additional information in the nucleotide sequence of the hsp70 gene or the corresponding mRNA that precludes the maximum divergence allowed in the silent codon positions. PMID:3931075
Parallel computation of genome-scale RNA secondary structure to detect structural constraints on human genome.

PubMed

Kawaguchi, Risa; Kiryu, Hisanori

2016-05-06

RNA secondary structure around splice sites is known to assist normal splicing by promoting spliceosome recognition. However, analyzing the structural properties of entire intronic regions or pre-mRNA sequences has been difficult hitherto, owing to serious experimental and computational limitations, such as low read coverage and numerical problems. Our novel software, "ParasoR", is designed to run on a computer cluster and enables the exact computation of various structural features of long RNA sequences under the constraint of maximal base-pairing distance. ParasoR divides dynamic programming (DP) matrices into smaller pieces, such that each piece can be computed by a separate computer node without losing the connectivity information between the pieces. ParasoR directly computes the ratios of DP variables to avoid the reduction of numerical precision caused by the cancellation of a large number of Boltzmann factors. The structural preferences of mRNAs computed by ParasoR shows a high concordance with those determined by high-throughput sequencing analyses. Using ParasoR, we investigated the global structural preferences of transcribed regions in the human genome. A genome-wide folding simulation indicated that transcribed regions are significantly more structural than intergenic regions after removing repeat sequences and k-mer frequency bias. In particular, we observed a highly significant preference for base pairing over entire intronic regions as compared to their antisense sequences, as well as to intergenic regions. A comparison between pre-mRNAs and mRNAs showed that coding regions become more accessible after splicing, indicating constraints for translational efficiency. Such changes are correlated with gene expression levels, as well as GC content, and are enriched among genes associated with cytoskeleton and kinase functions. We have shown that ParasoR is very useful for analyzing the structural properties of long RNA sequences such as mRNAs, pre-mRNAs, and long non-coding RNAs whose lengths can be more than a million bases in the human genome. In our analyses, transcribed regions including introns are indicated to be subject to various types of structural constraints that cannot be explained from simple sequence composition biases. ParasoR is freely available at https://github.com/carushi/ParasoR .
Genome-scale deletion screening of human long non-coding RNAs using a paired-guide RNA CRISPR library

PubMed Central

Zhu, Shiyou; Li, Wei; Liu, Jingze; Chen, Chen-Hao; Liao, Qi; Xu, Ping; Xu, Han; Xiao, Tengfei; Cao, Zhongzheng; Peng, Jingyu; Yuan, Pengfei; Brown, Myles; Liu, Xiaole Shirley; Wei, Wensheng

2017-01-01

CRISPR/Cas9 screens have been widely adopted to analyse coding gene functions, but high throughput screening of non-coding elements using this method is more challenging, because indels caused by a single cut in non-coding regions are unlikely to produce a functional knockout. A high-throughput method to produce deletions of non-coding DNA is needed. Herein, we report a high throughput genomic deletion strategy to screen for functional long non-coding RNAs (lncRNAs) that is based on a lentiviral paired-guide RNA (pgRNA) library. Applying our screening method, we identified 51 lncRNAs that can positively or negatively regulate human cancer cell growth. We individually validated 9 lncRNAs using CRISPR/Cas9-mediated genomic deletion and functional rescue, CRISPR activation or inhibition, and gene expression profiling. Our high-throughput pgRNA genome deletion method should enable rapid identification of functional mammalian non-coding elements. PMID:27798563
The Secret Life of RNA: Lessons from Emerging Methodologies.

PubMed

Medioni, Caroline; Besse, Florence

2018-01-01

The last past decade has witnessed a revolution in our appreciation of transcriptome complexity and regulation. This remarkable expansion in our knowledge largely originates from the advent of high-throughput methodologies, and the consecutive discovery that up to 90% of eukaryotic genomes are transcribed, thus generating an unanticipated large range of noncoding RNAs (Hangauer et al., 15(4):112, 2014). Besides leading to the identification of new noncoding RNA species, transcriptome-wide studies have uncovered novel layers of posttranscriptional regulatory mechanisms controlling RNA processing, maturation or translation, and each contributing to the precise and dynamic regulation of gene expression. Remarkably, the development of systems-level studies has been accompanied by tremendous progress in the visualization of individual RNA molecules in single cells, such that it is now possible to image RNA species with a single-molecule resolution from birth to translation or decay. Monitoring quantitatively, with unprecedented spatiotemporal resolution, the fate of individual molecules has been key to understanding the molecular mechanisms underlying the different steps of RNA regulation. This has also revealed biologically relevant, intracellular and intercellular heterogeneities in RNA distribution or regulation. More recently, the convergence of imaging and high-throughput technologies has led to the emergence of spatially resolved transcriptomic techniques that provide a means to perform large-scale analyses while preserving spatial information. By generating transcriptome-wide data on single-cell RNA content, or even subcellular RNA distribution, these methodologies are opening avenues to a wide range of network-level studies at the cell and organ-level, and promise to strongly improve disease diagnostic and treatment.In this introductory chapter, we highlight how recently developed technologies aiming at detecting and visualizing RNA molecules have contributed to the emergence of entirely new research fields, and to dramatic progress in our understanding of gene expression regulation.
Current Research on Non-Coding Ribonucleic Acid (RNA).

PubMed

Wang, Jing; Samuels, David C; Zhao, Shilin; Xiang, Yu; Zhao, Ying-Yong; Guo, Yan

2017-12-05

Non-coding ribonucleic acid (RNA) has without a doubt captured the interest of biomedical researchers. The ability to screen the entire human genome with high-throughput sequencing technology has greatly enhanced the identification, annotation and prediction of the functionality of non-coding RNAs. In this review, we discuss the current landscape of non-coding RNA research and quantitative analysis. Non-coding RNA will be categorized into two major groups by size: long non-coding RNAs and small RNAs. In long non-coding RNA, we discuss regular long non-coding RNA, pseudogenes and circular RNA. In small RNA, we discuss miRNA, transfer RNA, piwi-interacting RNA, small nucleolar RNA, small nuclear RNA, Y RNA, single recognition particle RNA, and 7SK RNA. We elaborate on the origin, detection method, and potential association with disease, putative functional mechanisms, and public resources for these non-coding RNAs. We aim to provide readers with a complete overview of non-coding RNAs and incite additional interest in non-coding RNA research.
From Discovery to Function: The Expanding Roles of Long NonCoding RNAs in Physiology and Disease

PubMed Central

Sun, Miao

2015-01-01

Long noncoding RNAs (lncRNAs) are a relatively poorly understood class of RNAs with little or no coding capacity transcribed from a set of incompletely annotated genes. They have received considerable attention in the past few years and are emerging as potentially important players in biological regulation. Here we discuss the evolving understanding of this new class of molecular regulators that has emerged from ongoing research, which continues to expand our databases of annotated lncRNAs and provide new insights into their physical properties, molecular mechanisms of action, and biological functions. We outline the current strategies and approaches that have been employed to identify and characterize lncRNAs, which have been instrumental in revealing their multifaceted roles ranging from cis- to trans-regulation of gene expression and from epigenetic modulation in the nucleus to posttranscriptional control in the cytoplasm. In addition, we highlight the molecular and biological functions of some of the best characterized lncRNAs in physiology and disease, especially those relevant to endocrinology, reproduction, metabolism, immunology, neurobiology, muscle biology, and cancer. Finally, we discuss the tremendous diagnostic and therapeutic potential of lncRNAs in cancer and other diseases. PMID:25426780
Long Non-Coding RNA Emergence During Renal Cell Carcinoma Tumorigenesis.

PubMed

Liu, Xiaobing; Hao, Yaxing; Yu, Wei; Yang, Xia; Luo, Xing; Zhao, Jiang; Li, Jia; Hu, Xiaoyan; Li, Longkun

2018-05-22

Renal cell carcinoma (RCC) is the most common kidney cancer diagnosed across the globe and has steadily increased in incidence in recent decades. Techniques for diagnosing or treating RCC are limited, and confined mostly to later stages of the disease. Almost all RCC pathological types are resistant to chemotherapeutics and radiation therapy. To this effect, new markers for diagnosis and target therapy are urgently needed. Advanced genome sequencing technologies have revealed long non-coding RNAs (lncRNAs) as a novel marker, transcribed throughout the human genome. The emergence of lncRNAs is an aberrant expression and is involved in the tumorigenesis of RCC. LncRNAs drive cancer phenotypes through their interaction with other cellular macromolecules including DNA, protein, and RNA. Recent research on lncRNA molecular mechanisms has revealed new markers to functionally annotate these cancers' associated transcripts, making them targets for effective diagnosis and therapeutic intervention in the fight against cancer. In this review, we first highlight the common mechanisms that underlie aberrant lncRNA expression in RCC. We go on to discuss the potential translational application of lncRNA research in the diagnosis, prognosis, and treatment of RCC. © 2018 The Author(s). Published by S. Karger AG, Basel.
The long non-coding RNA HOTTIP enhances pancreatic cancer cell proliferation, survival and migration.

PubMed

Cheng, Yating; Jutooru, Indira; Chadalapaka, Gayathri; Corton, J Christopher; Safe, Stephen

2015-05-10

HOTTIP is a long non-coding RNA (lncRNA) transcribed from the 5' tip of the HOXA locus and is associated with the polycomb repressor complex 2 (PRC2) and WD repeat containing protein 5 (WDR5)/mixed lineage leukemia 1 (MLL1) chromatin modifying complexes. HOTTIP is expressed in pancreatic cancer cell lines and knockdown of HOTTIP by RNA interference (siHOTTIP) in Panc1 pancreatic cancer cells decreased proliferation, induced apoptosis and decreased migration. In Panc1 cells transfected with siHOTTIP, there was a decrease in expression of 757 genes and increased expression of 514 genes, and a limited gene analysis indicated that HOTTIP regulation of genes is complex. For example, Aurora kinase A, an important regulator of cell growth, is coregulated by MLL and not WDR5 and, in contrast to previous studies in liver cancer cells, HOTTIP does not regulate HOXA13 but plays a role in regulation of several other HOX genes including HOXA10, HOXB2, HOXA11, HOXA9 and HOXA1. Although HOTTIP and the HOX-associated lncRNA HOTAIR have similar pro-oncogenic functions, they regulate strikingly different sets of genes in Panc1 cells and in pancreatic tumors.
From discovery to function: the expanding roles of long noncoding RNAs in physiology and disease.

PubMed

Sun, Miao; Kraus, W Lee

2015-02-01

Long noncoding RNAs (lncRNAs) are a relatively poorly understood class of RNAs with little or no coding capacity transcribed from a set of incompletely annotated genes. They have received considerable attention in the past few years and are emerging as potentially important players in biological regulation. Here we discuss the evolving understanding of this new class of molecular regulators that has emerged from ongoing research, which continues to expand our databases of annotated lncRNAs and provide new insights into their physical properties, molecular mechanisms of action, and biological functions. We outline the current strategies and approaches that have been employed to identify and characterize lncRNAs, which have been instrumental in revealing their multifaceted roles ranging from cis- to trans-regulation of gene expression and from epigenetic modulation in the nucleus to posttranscriptional control in the cytoplasm. In addition, we highlight the molecular and biological functions of some of the best characterized lncRNAs in physiology and disease, especially those relevant to endocrinology, reproduction, metabolism, immunology, neurobiology, muscle biology, and cancer. Finally, we discuss the tremendous diagnostic and therapeutic potential of lncRNAs in cancer and other diseases.
A noncoding RNA transcribed from the AGAMOUS (AG) second intron binds to CURLY LEAF and represses AG expression in leaves.

PubMed

Wu, Hui-Wen; Deng, Shulin; Xu, Haiying; Mao, Hui-Zhu; Liu, Jun; Niu, Qi-Wen; Wang, Huan; Chua, Nam-Hai

2018-06-04

Dispersed H3K27 trimethylation (H3K27me3) of the AGAMOUS (AG) genomic locus is mediated by CURLY LEAF (CLF), a component of the Polycomb Repressive Complex (PRC) 2. Previous reports have shown that the AG second intron, which confers AG tissue-specific expression, harbors sequences targeted by several positive and negative regulators. Using RACE reverse transcription polymerase chain reaction, we found that the AG intron 2 encodes several noncoding RNAs. RNAi experiment showed that incRNA4 is needed for CLF repressive activity. AG-incRNA4RNAi lines showed increased leaf AG mRNA levels associated with a decrease of H3K27me3 levels; these plants displayed AG overexpression phenotypes. Genetic and biochemical analyses demonstrated that the AG-incRNA4 can associate with CLF to repress AG expression in leaf tissues through H3K27me3-mediated repression and to autoregulate its own expression level. The mechanism of AG-incRNA4-mediated repression may be relevant to investigations on tissue-specific expression of Arabidopsis MADS-box genes. © 2018 The Authors New Phytologist © 2018 New Phytologist Trust.
A lncRNA Perspective into (Re)Building the Heart.

PubMed

Frank, Stefan; Aguirre, Aitor; Hescheler, Juergen; Kurian, Leo

2016-01-01

Our conception of the human genome, long focused on the 2% that codes for proteins, has profoundly changed since its first draft assembly in 2001. Since then, an unanticipatedly expansive functionality and convolution has been attributed to the majority of the genome that is transcribed in a cell-type/context-specific manner into transcripts with no apparent protein coding ability. While the majority of these transcripts, currently annotated as long non-coding RNAs (lncRNAs), are functionally uncharacterized, their prominent role in embryonic development and tissue homeostasis, especially in the context of the heart, is emerging. In this review, we summarize and discuss the latest advances in understanding the relevance of lncRNAs in (re)building the heart.
De Novo ORFs in Drosophila Are Important to Organismal Fitness and Evolved Rapidly from Previously Non-coding Sequences

PubMed Central

Reinhardt, Josephine A.; Wanjiru, Betty M.; Brant, Alicia T.; Saelao, Perot; Begun, David J.; Jones, Corbin D.

2013-01-01

How non-coding DNA gives rise to new protein-coding genes (de novo genes) is not well understood. Recent work has revealed the origins and functions of a few de novo genes, but common principles governing the evolution or biological roles of these genes are unknown. To better define these principles, we performed a parallel analysis of the evolution and function of six putatively protein-coding de novo genes described in Drosophila melanogaster. Reconstruction of the transcriptional history of de novo genes shows that two de novo genes emerged from novel long non-coding RNAs that arose at least 5 MY prior to evolution of an open reading frame. In contrast, four other de novo genes evolved a translated open reading frame and transcription within the same evolutionary interval suggesting that nascent open reading frames (proto-ORFs), while not required, can contribute to the emergence of a new de novo gene. However, none of the genes arose from proto-ORFs that existed long before expression evolved. Sequence and structural evolution of de novo genes was rapid compared to nearby genes and the structural complexity of de novo genes steadily increases over evolutionary time. Despite the fact that these genes are transcribed at a higher level in males than females, and are most strongly expressed in testes, RNAi experiments show that most of these genes are essential in both sexes during metamorphosis. This lethality suggests that protein coding de novo genes in Drosophila quickly become functionally important. PMID:24146629
High-density functional-RNA arrays as a versatile platform for studying RNA-based interactions.

PubMed

Phillips, Jack O; Butt, Louise E; Henderson, Charlotte A; Devonshire, Martin; Healy, Jess; Conway, Stuart J; Locker, Nicolas; Pickford, Andrew R; Vincent, Helen A; Callaghan, Anastasia J

2018-05-28

We are just beginning to unravel the myriad of interactions in which non-coding RNAs participate. The intricate RNA interactome is the foundation of many biological processes, including bacterial virulence and human disease, and represents unexploited resources for the development of potential therapeutic interventions. However, identifying specific associations of a given RNA from the multitude of possible binding partners within the cell requires robust high-throughput systems for their rapid screening. Here, we present the first demonstration of functional-RNA arrays as a novel platform technology designed for the study of such interactions using immobilized, active RNAs. We have generated high-density RNA arrays by an innovative method involving surface-capture of in vitro transcribed RNAs. This approach has significant advantages over existing technologies, particularly in its versatility in regards to binding partner character. Indeed, proof-of-principle application of RNA arrays to both RNA-small molecule and RNA-RNA pairings is demonstrated, highlighting their potential as a platform technology for mapping RNA-based networks and for pharmaceutical screening. Furthermore, the simplicity of the method supports greater user-accessibility over currently available technologies. We anticipate that functional-RNA arrays will find broad utility in the expanding field of RNA characterization.
Transcriptional Coupling of Neighboring Genes and Gene Expression Noise: Evidence that Gene Orientation and Noncoding Transcripts Are Modulators of Noise

PubMed Central

Wang, Guang-Zhong; Lercher, Martin J.; Hurst, Laurence D.

2011-01-01

Abstract How is noise in gene expression modulated? Do mechanisms of noise control impact genome organization? In yeast, the expression of one gene can affect that of a very close neighbor. As the effect is highly regionalized, we hypothesize that genes in different orientations will have differing degrees of coupled expression and, in turn, different noise levels. Divergently organized gene pairs, in particular those with bidirectional promoters, have close promoters, maximizing the likelihood that expression of one gene affects the neighbor. With more distant promoters, the same is less likely to hold for gene pairs in nondivergent orientation. Stochastic models suggest that coupled chromatin dynamics will typically result in low abundance-corrected noise (ACN). Transcription of noncoding RNA (ncRNA) from a bidirectional promoter, we thus hypothesize to be a noise-reduction, expression-priming, mechanism. The hypothesis correctly predicts that protein-coding genes with a bidirectional promoter, including those with a ncRNA partner, have lower ACN than other genes and divergent gene pairs uniquely have correlated ACN. Moreover, as predicted, ACN increases with the distance between promoters. The model also correctly predicts ncRNA transcripts to be often divergently transcribed from genes that a priori would be under selection for low noise (essential genes, protein complex genes) and that the latter genes should commonly reside in divergent orientation. Likewise, that genes with bidirectional promoters are rare subtelomerically, cluster together, and are enriched in essential gene clusters is expected and observed. We conclude that gene orientation and transcription of ncRNAs are candidate modulators of noise. PMID:21402863
A screen for nuclear transcripts identifies two linked noncoding RNAs associated with SC35 splicing domains

PubMed Central

Hutchinson, John N; Ensminger, Alexander W; Clemson, Christine M; Lynch, Christopher R; Lawrence, Jeanne B; Chess, Andrew

2007-01-01

Background Noncoding RNA species play a diverse set of roles in the eukaryotic cell. While much recent attention has focused on smaller RNA species, larger noncoding transcripts are also thought to be highly abundant in mammalian cells. To search for large noncoding RNAs that might control gene expression or mRNA metabolism, we used Affymetrix expression arrays to identify polyadenylated RNA transcripts displaying nuclear enrichment. Results This screen identified no more than three transcripts; XIST, and two unique noncoding nuclear enriched abundant transcripts (NEAT) RNAs strikingly located less than 70 kb apart on human chromosome 11: NEAT1, a noncoding RNA from the locus encoding for TncRNA, and NEAT2 (also known as MALAT-1). While the two NEAT transcripts share no significant homology with each other, each is conserved within the mammalian lineage, suggesting significant function for these noncoding RNAs. NEAT2 is extraordinarily well conserved for a noncoding RNA, more so than even XIST. Bioinformatic analyses of publicly available mouse transcriptome data support our findings from human cells as they confirm that the murine homologs of these noncoding RNAs are also nuclear enriched. RNA FISH analyses suggest that these noncoding RNAs function in mRNA metabolism as they demonstrate an intimate association of these RNA species with SC35 nuclear speckles in both human and mouse cells. These studies show that one of these transcripts, NEAT1 localizes to the periphery of such domains, whereas the neighboring transcript, NEAT2, is part of the long-sought polyadenylated component of nuclear speckles. Conclusion Our genome-wide screens in two mammalian species reveal no more than three abundant large non-coding polyadenylated RNAs in the nucleus; the canonical large noncoding RNA XIST and NEAT1 and NEAT2. The function of these noncoding RNAs in mRNA metabolism is suggested by their high levels of conservation and their intimate association with SC35 splicing domains in multiple mammalian species. PMID:17270048
Splicing-independent loading of TREX on nascent RNA is required for efficient expression of dual-strand piRNA clusters in Drosophila

PubMed Central

Hur, Junho K.; Luo, Yicheng; Moon, Sungjin; Ninova, Maria; Marinov, Georgi K.; Chung, Yun D.; Aravin, Alexei A.

2016-01-01

The conserved THO/TREX (transcription/export) complex is critical for pre-mRNA processing and mRNA nuclear export. In metazoa, TREX is loaded on nascent RNA transcribed by RNA polymerase II in a splicing-dependent fashion; however, how TREX functions is poorly understood. Here we show that Thoc5 and other TREX components are essential for the biogenesis of piRNA, a distinct class of small noncoding RNAs that control expression of transposable elements (TEs) in the Drosophila germline. Mutations in TREX lead to defects in piRNA biogenesis, resulting in derepression of multiple TE families, gametogenesis defects, and sterility. TREX components are enriched on piRNA precursors transcribed from dual-strand piRNA clusters and colocalize in distinct nuclear foci that overlap with sites of piRNA transcription. The localization of TREX in nuclear foci and its loading on piRNA precursor transcripts depend on Cutoff, a protein associated with chromatin of piRNA clusters. Finally, we show that TREX is required for accumulation of nascent piRNA precursors. Our study reveals a novel splicing-independent mechanism for TREX loading on nascent RNA and its importance in piRNA biogenesis. PMID:27036967
[Sequencing and analysis of the complete mitochondrial genome of the King Cobra, Ophiophagus hannah (Serpents: Elapidae)].

PubMed

Chen, Nian; Lai, Xiao-Ping

2010-07-01

We obtained the complete mitochondrial genome of King Cobra(GenBank accession number: EU_921899) by Ex Taq-PCR, TA-cloning and primer-walking methods. This genome is very similar to other vertebrate, which is 17 267 bp in length and encodes 38 genes (including 13 protein-coding, 2 ribosomal RNA and 23 transfer RNA genes) and two long non-coding regions. The duplication of tRNA-Ile gene forms a new mitochondrial gene rearrangement model. Eight tRNA genes and one protein genes were transcribed from L strand, and the other genes were transcribed genes from H strand. Genes on the H strand show a fairly similar content of Adenosine and Thymine respectively, whereas those on the L strand have higher proportion of A than T. Combined rDNA sequence data (12S+16S rRNA) were used to reconstruct the phylogeny of 21 snake species for which complete mitochondrial genome sequences were available in the public databases. This large data set and an appropriate range of outgroup taxa demonstrated that Elapidae is more closely related to colubridae than viperidae, which supports the traditional viewpoints.
The identification and functional annotation of RNA structures conserved in vertebrates

PubMed Central

Seemann, Stefan E.; Mirza, Aashiq H.; Hansen, Claus; Bang-Berthelsen, Claus H.; Garde, Christian; Christensen-Dalsgaard, Mikkel; Torarinsson, Elfar; Yao, Zizhen; Workman, Christopher T.; Pociot, Flemming; Nielsen, Henrik; Tommerup, Niels; Ruzzo, Walter L.; Gorodkin, Jan

2017-01-01

Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization, and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for conserved RNA structures (CRSs), leveraging structure-based, rather than sequence-based, alignments. After careful correction for sequence identity and GC content, we predict ∼516,000 human genomic regions containing CRSs. We find that a substantial fraction of human–mouse CRS regions (1) colocalize consistently with binding sites of the same RNA binding proteins (RBPs) or (2) are transcribed in corresponding tissues. Additionally, a CaptureSeq experiment revealed expression of many of our CRS regions in human fetal brain, including 662 novel ones. For selected human and mouse candidate pairs, qRT-PCR and in vitro RNA structure probing supported both shared expression and shared structure despite low abundance and low sequence identity. About 30,000 CRS regions are located near coding or long noncoding RNA genes or within enhancers. Structured (CRS overlapping) enhancer RNAs and extended 3′ ends have significantly increased expression levels over their nonstructured counterparts. Our findings of transcribed uncharacterized regulatory regions that contain CRSs support their RNA-mediated functionality. PMID:28487280

Elevated Rate of Fixation of Endogenous Retroviral Elements in Haplorhini TRIM5 and TRIM22 Genomic Sequences: Impact on Transcriptional Regulation

PubMed Central

Diehl, William E.; Johnson, Welkin E.; Hunter, Eric

2013-01-01

All genes in the TRIM6/TRIM34/TRIM5/TRIM22 locus are type I interferon inducible, with TRIM5 and TRIM22 possessing antiviral properties. Evolutionary studies involving the TRIM6/34/5/22 locus have predominantly focused on the coding sequence of the genes, finding that TRIM5 and TRIM22 have undergone high rates of both non-synonymous nucleotide replacements and in-frame insertions and deletions. We sought to understand if divergent evolutionary pressures on TRIM6/34/5/22 coding regions have selected for modifications in the non-coding regions of these genes and explore whether such non-coding changes may influence the biological function of these genes. The transcribed genomic regions, including the introns, of TRIM6, TRIM34, TRIM5, and TRIM22 from ten Haplorhini primates and one prosimian species were analyzed for transposable element content. In Haplorhini species, TRIM5 displayed an exaggerated interspecies variability, predominantly resulting from changes in the composition of transposable elements in the large first and fourth introns. Multiple lineage-specific endogenous retroviral long terminal repeats (LTRs) were identified in the first intron of TRIM5 and TRIM22. In the prosimian genome, we identified a duplication of TRIM5 with a concomitant loss of TRIM22. The transposable element content of the prosimian TRIM5 genes appears to largely represent the shared Haplorhini/prosimian ancestral state for this gene. Furthermore, we demonstrated that one such differentially fixed LTR provides for species-specific transcriptional regulation of TRIM22 in response to p53 activation. Our results identify a previously unrecognized source of species-specific variation in the antiviral TRIM genes, which can lead to alterations in their transcriptional regulation. These observations suggest that there has existed long-term pressure for exaptation of retroviral LTRs in the non-coding regions of these genes. This likely resulted from serial viral challenges and provided a mechanism for rapid alteration of transcriptional regulation. To our knowledge, this represents the first report of persistent evolutionary pressure for the capture of retroviral LTR insertions. PMID:23516500
An Alu-like RNA promotes cell differentiation and reduces malignancy of human neuroblastoma cells.

PubMed

Castelnuovo, Manuele; Massone, Sara; Tasso, Roberta; Fiorino, Gloria; Gatti, Monica; Robello, Mauro; Gatta, Elena; Berger, Audrey; Strub, Katharina; Florio, Tullio; Dieci, Giorgio; Cancedda, Ranieri; Pagano, Aldo

2010-10-01

Neuroblastoma (NB) is a pediatric cancer characterized by remarkable cell heterogeneity within the tumor nodules. Here, we demonstrate that the synthesis of a pol III-transcribed noncoding (nc) RNA (NDM29) strongly restricts NB development by promoting cell differentiation, a drop of malignancy processes, and a dramatic reduction of the tumor initiating cell (TIC) fraction in the NB cell population. Notably, the overexpression of NDM29 also confers to malignant NB cells an unpredicted susceptibility to the effects of antiblastic drugs used in NB therapy. Altogether, these results suggest the induction of NDM29 expression as possible treatment to increase cancer cells vulnerability to therapeutics and the measure of its synthesis in NB explants as prognostic factor of this cancer type.
Biological significance of long non-coding RNA FTX expression in human colorectal cancer.

PubMed

Guo, Xiao-Bo; Hua, Zhu; Li, Chen; Peng, Li-Pan; Wang, Jing-Shen; Wang, Bo; Zhi, Qiao-Ming

2015-01-01

The purpose of this study was to determine the expression of long non-coding RNA (lncRNA) FTX and analyze its prognostic and biological significance in colorectal cancer (CRC). A quantitative reverse transcription PCR was performed to detect the expression of long non-coding RNA FTX in 35 pairs of colorectal cancer and corresponding noncancerous tissues. The expression of long non-coding RNA FTX was detected in 187 colorectal cancer tissues and its correlations with clinicopathological factors of patients were examined. Univariate and multivariate analyses were performed to analyze the prognostic significance of Long Non-coding RNA FTX expression. The effects of long non-coding RNA FTX expression on malignant phenotypes of colorectal cancer cells and its possible biological significances were further determined. Long non-coding RNA FTX was significantly upregulated in colorectal cancer tissues, and low long non-coding RNA FTX expression was significantly correlated with differentiation grade, lymph vascular invasion, and clinical stage. Patients with high long non-coding RNA FTX showed poorer overall survival than those with low long non-coding RNA FTX. Multivariate analyses indicated that status of long non-coding RNA FTX was an independent prognostic factor for patients. Functional analyses showed that upregulation of long non-coding RNA FTX significantly promoted growth, migration, invasion, and increased colony formation in colorectal cancer cells. Therefore, long non-coding RNA FTX may be a potential biomarker for predicting the survival of colorectal cancer patients and might be a molecular target for treatment of human colorectal cancer.
The non-coding RNA landscape of human hematopoiesis and leukemia.

PubMed

Schwarzer, Adrian; Emmrich, Stephan; Schmidt, Franziska; Beck, Dominik; Ng, Michelle; Reimer, Christina; Adams, Felix Ferdinand; Grasedieck, Sarah; Witte, Damian; Käbler, Sebastian; Wong, Jason W H; Shah, Anushi; Huang, Yizhou; Jammal, Razan; Maroz, Aliaksandra; Jongen-Lavrencic, Mojca; Schambach, Axel; Kuchenbauer, Florian; Pimanda, John E; Reinhardt, Dirk; Heckl, Dirk; Klusmann, Jan-Henning

2017-08-09

Non-coding RNAs have emerged as crucial regulators of gene expression and cell fate decisions. However, their expression patterns and regulatory functions during normal and malignant human hematopoiesis are incompletely understood. Here we present a comprehensive resource defining the non-coding RNA landscape of the human hematopoietic system. Based on highly specific non-coding RNA expression portraits per blood cell population, we identify unique fingerprint non-coding RNAs-such as LINC00173 in granulocytes-and assign these to critical regulatory circuits involved in blood homeostasis. Following the incorporation of acute myeloid leukemia samples into the landscape, we further uncover prognostically relevant non-coding RNA stem cell signatures shared between acute myeloid leukemia blasts and healthy hematopoietic stem cells. Our findings highlight the importance of the non-coding transcriptome in the formation and maintenance of the human blood hierarchy.While micro-RNAs are known regulators of haematopoiesis and leukemogenesis, the role of long non-coding RNAs is less clear. Here the authors provide a non-coding RNA expression landscape of the human hematopoietic system, highlighting their role in the formation and maintenance of the human blood hierarchy.
Non-coding landscapes of colorectal cancer

PubMed Central

Ragusa, Marco; Barbagallo, Cristina; Statello, Luisa; Condorelli, Angelo Giuseppe; Battaglia, Rosalia; Tamburello, Lucia; Barbagallo, Davide; Di Pietro, Cinzia; Purrello, Michele

2015-01-01

For two decades Vogelstein’s model has been the paradigm for describing the sequence of molecular changes within protein-coding genes that would lead to overt colorectal cancer (CRC). This model is now too simplistic in the light of recent studies, which have shown that our genome is pervasively transcribed in RNAs other than mRNAs, denominated non-coding RNAs (ncRNAs). The discovery that mutations in genes encoding these RNAs [i.e., microRNAs (miRNAs), long non-coding RNAs, and circular RNAs] are causally involved in cancer phenotypes has profoundly modified our vision of tumour molecular genetics and pathobiology. By exploiting a wide range of different mechanisms, ncRNAs control fundamental cellular processes, such as proliferation, differentiation, migration, angiogenesis and apoptosis: these data have also confirmed their role as oncogenes or tumor suppressors in cancer development and progression. The existence of a sophisticated RNA-based regulatory system, which dictates the correct functioning of protein-coding networks, has relevant biological and biomedical consequences. Different miRNAs involved in neoplastic and degenerative diseases exhibit potential predictive and prognostic properties. Furthermore, the key roles of ncRNAs make them very attractive targets for innovative therapeutic approaches. Several recent reports have shown that ncRNAs can be secreted by cells into the extracellular environment (i.e., blood and other body fluids): this suggests the existence of extracellular signalling mechanisms, which may be exploited by cells in physiology and pathology. In this review, we will summarize the most relevant issues on the involvement of cellular and extracellular ncRNAs in disease. We will then specifically describe their involvement in CRC pathobiology and their translational applications to CRC diagnosis, prognosis and therapy. PMID:26556998
RNA therapeutics: RNAi and antisense mechanisms and clinical applications.

PubMed

Chery, Jessica

2016-07-01

RNA therapeutics refers to the use of oligonucleotides to target primarily ribonucleic acids (RNA) for therapeutic efforts or in research studies to elucidate functions of genes. Oligonucleotides are distinct from other pharmacological modalities, such as small molecules and antibodies that target mainly proteins, due to their mechanisms of action and chemical properties. Nucleic acids come in two forms: deoxyribonucleic acids (DNA) and ribonucleic acids (RNA). Although DNA is more stable, RNA offers more structural variety ranging from messenger RNA (mRNA) that codes for protein to non-coding RNAs, microRNA (miRNA), transfer RNA (tRNA), short interfering RNAs (siRNAs), ribosomal RNA (rRNA), and long-noncoding RNAs (lncRNAs). As our understanding of the wide variety of RNAs deepens, researchers have sought to target RNA since >80% of the genome is estimated to be transcribed. These transcripts include non-coding RNAs such as miRNAs and siRNAs that function in gene regulation by playing key roles in the transfer of genetic information from DNA to protein, the final product of the central dogma in biology 1 . Currently there are two main approaches used to target RNA: double stranded RNA-mediated interference (RNAi) and antisense oligonucleotides (ASO). Both approaches are currently in clinical trials for targeting of RNAs involved in various diseases, such as cancer and neurodegeneration. In fact, ASOs targeting spinal muscular atrophy and amyotrophic lateral sclerosis have shown positive results in clinical trials 2 . Advantages of ASOs include higher affinity due to the development of chemical modifications that increase affinity, selectivity while decreasing toxicity due to off-target effects. This review will highlight the major therapeutic approaches of RNA medicine currently being applied with a focus on RNAi and ASOs.
Global assessment of small RNAs reveals a non-coding transcript involved in biofilm formation and attachment in Acinetobacter baumannii ATCC 17978

PubMed Central

Pérez, Astrid; Gómez, Manuel J.; Gayoso, Carmen; Vallejo, Juan A.; Ohneck, Emily J.; Valle, Jaione; Actis, Luis A.; Beceiro, Alejandro; Bou, Germán

2017-01-01

Many strains of Acinetobacter baumannii have been described as being able to form biofilm. Small non-coding RNAs (sRNAs) control gene expression in many regulatory circuits in bacteria. The aim of the present work was to provide a global description of the sRNAs produced both by planktonic and biofilm-associated (sessile) cells of A. baumannii ATCC 17978, and to compare the corresponding gene expression profiles to identify sRNAs molecules associated to biofilm formation and virulence. sRNA was extracted from both planktonic and sessile cells and reverse transcribed. cDNA was subjected to 454-pyrosequencing using the GS-FLX Titanium chemistry. The global analysis of the small RNA transcriptome revealed different sRNA expression patterns in planktonic and biofilm associated cells, with some of the transcripts only expressed or repressed in sessile bacteria. A total of 255 sRNAs were detected, with 185 of them differentially expressed in the different types of cells. A total of 9 sRNAs were expressed only in biofilm cells, while the expression of other 21 coding regions were repressed only in biofilm cells. Strikingly, the expression level of the sRNA 13573 was 120 times higher in biofilms than in planktonic cells, an observation that prompted us to further investigate the biological role of this non-coding transcript. Analyses of an isogenic mutant and over-expressing strains revealed that the sRNA 13573 gene is involved in biofilm formation and attachment to A549 human alveolar epithelial cells. The present work serves as a basis for future studies examining the complex regulatory network that regulate biofilm biogenesis and attachment to eukaryotic cells in A. baumannii ATCC 17978. PMID:28763494
The long non-coding RNA HOTTIP enhances pancreatic cancer cell proliferation, survival and migration

PubMed Central

Cheng, Yating; Jutooru, Indira; Chadalapaka, Gayathri; Corton, J. Christopher; Safe, Stephen

2015-01-01

HOTTIP is a long non-coding RNA (lncRNA) transcribed from the 5′ tip of the HOXA locus and is associated with the polycomb repressor complex 2 (PRC2) and WD repeat containing protein 5 (WDR5)/mixed lineage leukemia 1 (MLL1) chromatin modifying complexes. HOTTIP is expressed in pancreatic cancer cell lines and knockdown of HOTTIP by RNA interference (siHOTTIP) in Panc1 pancreatic cancer cells decreased proliferation, induced apoptosis and decreased migration. In Panc1 cells transfected with siHOTTIP, there was a decrease in expression of 757 genes and increased expression of 514 genes, and a limited gene analysis indicated that HOTTIP regulation of genes is complex. For example, Aurora kinase A, an important regulator of cell growth, is coregulated by MLL and not WDR5 and, in contrast to previous studies in liver cancer cells, HOTTIP does not regulate HOXA13 but plays a role in regulation of several other HOX genes including HOXA10, HOXB2, HOXA11, HOXA9 and HOXA1. Although HOTTIP and the HOX-associated lncRNA HOTAIR have similar pro-oncogenic functions, they regulate strikingly different sets of genes in Panc1 cells and in pancreatic tumors. PMID:25912306
Long Non-Coding RNAs in Multiple Myeloma

PubMed Central

Ronchetti, Domenica; Taiana, Elisa; Vinci, Cristina; Neri, Antonino

2018-01-01

Multiple myeloma (MM) is an incurable disease caused by the malignant proliferation of bone marrow plasma cells, whose pathogenesis remains largely unknown. Although a large fraction of the genome is actively transcribed, most of the transcripts do not serve as templates for proteins and are referred to as non-coding RNAs (ncRNAs), broadly divided into short and long transcripts on the basis of a 200-nucleotide threshold. Short ncRNAs, especially microRNAs, have crucial roles in virtually all types of cancer, including MM, and have gained importance in cancer diagnosis and prognosis, predicting the response to therapy and, notably, as innovative therapeutic targets. Long ncRNAs (lncRNAs) are a very heterogeneous group, involved in many physiological cellular and genomic processes as well as in carcinogenesis, cancer metastasis, and invasion. LncRNAs are aberrantly expressed in various types of cancers, including hematological malignancies, showing either oncogenic or tumor suppressive functions. However, the mechanisms of the related disease-causing events are not yet revealed in most cases. Besides emerging as key players in cancer initiation and progression, lncRNAs own many interesting features as biomarkers with diagnostic and prognostic importance and, possibly, for their utility in therapeutic terms as druggable molecules. This review focuses on the role of lncRNAs in the pathogenesis of MM and summarizes the recent literature. PMID:29389884
The Emerging Roles of Long Non-coding RNA in Cancer.

PubMed

Sanchez Calle, Anna; Kawamura, Yumi; Yamamoto, Yusuke; Takeshita, Fumitaka; Ochiya, Takahiro

2018-05-17

Since comprehensive analysis of the mammalian genome has revealed that the vast majority of genomic products are transcribed in long non-coding RNAs (lncRNAs), increasing attention has been paid towards these transcripts. The applied next-generation sequencing technologies have provided accumulating evidence of dysregulated lncRNAs in cancer. The implication of this finding may be seen in many forms and at multiple levels. With impacts ranging from integrating chromatin remodeling complexes to regulating transcription and post-transcriptional processes, aberrant expression of lncRNAs may have repercussions in cell proliferation, tumor progression or metastasis. lncRNAs may act as enhancers, scaffolds or decoys by physically interacting with other RNA species or proteins, resulting in a direct impact on cell signaling cascades. Even though their functional classification is well-established in the context of cancer, clearer characterization in terms of their phenotypic outputs is needed to optimize and identify suitable candidates that enable the development of new therapeutic strategies and the design of novel diagnostic approaches. The present article aims to outline different cancer-associated lncRNAs according to their contribution to tumor suppression or tumor promotion based on their most current functional annotations. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Biological significance of long non-coding RNA FTX expression in human colorectal cancer

PubMed Central

Guo, Xiao-Bo; Hua, Zhu; Li, Chen; Peng, Li-Pan; Wang, Jing-Shen; Wang, Bo; Zhi, Qiao-Ming

2015-01-01

The purpose of this study was to determine the expression of long non-coding RNA (lncRNA) FTX and analyze its prognostic and biological significance in colorectal cancer (CRC). A quantitative reverse transcription PCR was performed to detect the expression of long non-coding RNA FTX in 35 pairs of colorectal cancer and corresponding noncancerous tissues. The expression of long non-coding RNA FTX was detected in 187 colorectal cancer tissues and its correlations with clinicopathological factors of patients were examined. Univariate and multivariate analyses were performed to analyze the prognostic significance of Long Non-coding RNA FTX expression. The effects of long non-coding RNA FTX expression on malignant phenotypes of colorectal cancer cells and its possible biological significances were further determined. Long non-coding RNA FTX was significantly upregulated in colorectal cancer tissues, and low long non-coding RNA FTX expression was significantly correlated with differentiation grade, lymph vascular invasion, and clinical stage. Patients with high long non-coding RNA FTX showed poorer overall survival than those with low long non-coding RNA FTX. Multivariate analyses indicated that status of long non-coding RNA FTX was an independent prognostic factor for patients. Functional analyses showed that upregulation of long non-coding RNA FTX significantly promoted growth, migration, invasion, and increased colony formation in colorectal cancer cells. Therefore, long non-coding RNA FTX may be a potential biomarker for predicting the survival of colorectal cancer patients and might be a molecular target for treatment of human colorectal cancer. PMID:26629053
U6 small nuclear RNA is transcribed by RNA polymerase III.

PubMed Central

Kunkel, G R; Maser, R L; Calvet, J P; Pederson, T

1986-01-01

A DNA fragment homologous to U6 small nuclear RNA was isolated from a human genomic library and sequenced. The immediate 5'-flanking region of the U6 DNA clone had significant homology with a potential mouse U6 gene, including a "TATA box" at a position 26-29 nucleotides upstream from the transcription start site. Although this sequence element is characteristic of RNA polymerase II promoters, the U6 gene also contained a polymerase III "box A" intragenic control region and a typical run of five thymines at the 3' terminus (noncoding strand). The human U6 DNA clone was accurately transcribed in a HeLa cell S100 extract lacking polymerase II activity. U6 RNA transcription in the S100 extract was resistant to alpha-amanitin at 1 microgram/ml but was completely inhibited at 200 micrograms/ml. A comparison of fingerprints of the in vitro transcript and of U6 RNA synthesized in vivo revealed sequence congruence. U6 RNA synthesis in isolated HeLa cell nuclei also displayed low sensitivity to alpha-amanitin, in contrast to U1 and U2 RNA transcription, which was inhibited greater than 90% at 1 microgram/ml. In addition, U6 RNA synthesized in isolated nuclei was efficiently immunoprecipitated by an antibody against the La antigen, a protein known to bind most other RNA polymerase III transcripts. These results establish that, in contrast to the polymerase II-directed transcription of mammalian genes for U1-U5 small nuclear RNAs, human U6 RNA is transcribed by RNA polymerase III. Images PMID:3464970
High-throughput screens in mammalian cells using the CRISPR-Cas9 system.

PubMed

Peng, Jingyu; Zhou, Yuexin; Zhu, Shiyou; Wei, Wensheng

2015-06-01

As a powerful genome-editing tool, the clustered regularly interspaced short palindromic repeats (CRISPR)-clustered regularly interspaced short palindromic repeats-associated protein 9 (Cas9) system has been quickly developed into a large-scale function-based screening strategy in mammalian cells. This new type of genetic library is constructed through the lentiviral delivery of single-guide RNA collections that direct Cas9 or inactive dead Cas9 fused with effectors to interrogate gene function or regulate gene transcription in targeted cells. Compared with RNA interference screening, the CRISPR-Cas9 system demonstrates much higher levels of effectiveness and reliability with respect to both loss-of-function and gain-of-function screening. Unlike the RNA interference strategy, a CRISPR-Cas9 library can target both protein-coding sequences and regulatory elements, including promoters, enhancers and elements transcribing microRNAs and long noncoding RNAs. This powerful genetic tool will undoubtedly accelerate the mechanistic discovery of various biological processes. In this mini review, we summarize the general procedure of CRISPR-Cas9 library mediated functional screening, system optimization strategies and applications of this new genetic toolkit. © 2015 FEBS.
Multiple horizontal transfers of nuclear ribosomal genes between phylogenetically distinct grass lineages.

PubMed

Mahelka, Václav; Krak, Karol; Kopecký, David; Fehrer, Judith; Šafář, Jan; Bartoš, Jan; Hobza, Roman; Blavet, Nicolas; Blattner, Frank R

2017-02-14

The movement of nuclear DNA from one vascular plant species to another in the absence of fertilization is thought to be rare. Here, nonnative rRNA gene [ribosomal DNA (rDNA)] copies were identified in a set of 16 diploid barley ( Hordeum ) species; their origin was traceable via their internal transcribed spacer (ITS) sequence to five distinct Panicoideae genera, a lineage that split from the Pooideae about 60 Mya. Phylogenetic, cytogenetic, and genomic analyses implied that the nonnative sequences were acquired between 1 and 5 Mya after a series of multiple events, with the result that some current Hordeum sp. individuals harbor up to five different panicoid rDNA units in addition to the native Hordeum rDNA copies. There was no evidence that any of the nonnative rDNA units were transcribed; some showed indications of having been silenced via pseudogenization. A single copy of a Panicum sp. rDNA unit present in H. bogdanii had been interrupted by a native transposable element and was surrounded by about 70 kbp of mostly noncoding sequence of panicoid origin. The data suggest that horizontal gene transfer between vascular plants is not a rare event, that it is not necessarily restricted to one or a few genes only, and that it can be selectively neutral.
The identification and functional annotation of RNA structures conserved in vertebrates.

PubMed

Seemann, Stefan E; Mirza, Aashiq H; Hansen, Claus; Bang-Berthelsen, Claus H; Garde, Christian; Christensen-Dalsgaard, Mikkel; Torarinsson, Elfar; Yao, Zizhen; Workman, Christopher T; Pociot, Flemming; Nielsen, Henrik; Tommerup, Niels; Ruzzo, Walter L; Gorodkin, Jan

2017-08-01

Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization, and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for conserved RNA structures (CRSs), leveraging structure-based, rather than sequence-based, alignments. After careful correction for sequence identity and GC content, we predict ∼516,000 human genomic regions containing CRSs. We find that a substantial fraction of human-mouse CRS regions (1) colocalize consistently with binding sites of the same RNA binding proteins (RBPs) or (2) are transcribed in corresponding tissues. Additionally, a CaptureSeq experiment revealed expression of many of our CRS regions in human fetal brain, including 662 novel ones. For selected human and mouse candidate pairs, qRT-PCR and in vitro RNA structure probing supported both shared expression and shared structure despite low abundance and low sequence identity. About 30,000 CRS regions are located near coding or long noncoding RNA genes or within enhancers. Structured (CRS overlapping) enhancer RNAs and extended 3' ends have significantly increased expression levels over their nonstructured counterparts. Our findings of transcribed uncharacterized regulatory regions that contain CRSs support their RNA-mediated functionality. © 2017 Seemann et al.; Published by Cold Spring Harbor Laboratory Press.
Perspectives of Long Non-Coding RNAs in Cancer Diagnostics

PubMed Central

Reis, Eduardo M.; Verjovski-Almeida, Sergio

2012-01-01

Long non-coding RNAs (lncRNAs) transcribed from intergenic and intronic regions of the human genome constitute a broad class of cellular transcripts that are under intensive investigation. While only a handful of lncRNAs have been characterized, their involvement in fundamental cellular processes that control gene expression highlights a central role in cell homeostasis. Not surprisingly, aberrant expression of regulatory lncRNAs has been increasingly documented in different types of cancer, where they can mediate both oncogenic or tumor suppressor effects. Interaction with chromatin remodeling complexes that promote silencing of specific genes or modulation of splicing factor proteins seem to be two general modes of lncRNA regulation, but it is conceivable that additional mechanisms of action are yet to be unveiled. LncRNAs show greater tissue specificity compared to protein-coding mRNAs making them attractive in the search of novel diagnostics/prognostics cancer biomarkers in body fluid samples. In fact, lncRNA prostate cancer antigen 3 can be detected in urine samples and has been shown to improve diagnosis of prostate cancer. We suggest that an unbiased screening of the presence of RNAs in easily accessible body fluids such as serum and urine might reveal novel circulating lncRNAs as potential biomarkers in many types of cancer. Annotation and functional characterization of the lncRNA complement of the cancer transcriptome will conceivably provide new venues for early diagnosis and treatment of the disease. PMID:22408643
Long noncoding RNA FTX inhibits hepatocellular carcinoma proliferation and metastasis by binding MCM2 and miR-374a.

PubMed

Liu, F; Yuan, J-H; Huang, J-F; Yang, F; Wang, T-T; Ma, J-Z; Zhang, L; Zhou, C-C; Wang, F; Yu, J; Zhou, W-P; Sun, S-H

2016-10-13

It has long been known that males are more susceptible than females to hepatocellular carcinoma (HCC), but the reason remains elusive. In this study, we investigated the expression and function of the long noncoding RNA FTX (lnc-FTX), an X-inactive-specific transcript (XIST) regulator transcribed from the X chromosome inactivation center, in both HCC and HCC gender disparity. lnc-FTX is expressed at higher levels in female livers than in male livers and is significantly downregulated in HCC tissues compared with normal liver tissues. Patients with higher lnc-FTX expression exhibited longer survival, suggesting that lnc-FTX is a useful prognostic factor for HCC patients. lnc-FTX inhibits HCC cell growth and metastasis both in vitro and in vivo. Mechanistically, lnc-FTX represses Wnt/β-catenin signaling activity by competitively sponging miR-374a and inhibits HCC cell epithelial-mesenchymal transition and invasion. In addition, lnc-FTX binds to the DNA replication licensing factor MCM2, thereby impeding DNA replication and inhibiting proliferation in HCC cells. In conclusion, these findings suggest that lnc-FTX may act as a tumor suppressor in HCC through physically binding miR-374a and MCM2. It may also be one of the reasons for HCC gender disparity and may potentially contribute to HCC treatment.
A 3' UTR-derived non-coding RNA RibS increases expression of cfa and promotes biofilm formation of Salmonella enterica serovar Typhi.

PubMed

Zhao, Xin; Liu, Rui; Tang, Hao; Osei-Adjei, George; Xu, Shungao; Zhang, Ying; Huang, Xinxiang

2018-05-08

Bacterial non-coding RNAs (ncRNAs) are widely studied and found to play important roles in regulating various cellular processes. Recently, many ncRNAs have been discovered to be transcribed or processed from 3' untranslated regions (3' UTRs). Here we reported a novel 3' UTR-derived ncRNA, RibS, which could influence biofilm formation of Salmonella enterica serovar Typhi (S. Typhi). RibS was confirmed to be a ∼700 nt processed product produced by RNase III-catalyzed cleavage from the 3' UTR of riboflavin synthase subunit alpha mRNA, RibE. Overexpression of RibS increased the expression of the cyclopropane fatty acid synthase gene, cfa, which was located at the antisense strand. Biofilm formation of S. Typhi was enhanced by overexpressing RibS both in the wild type strain and cfa deletion mutant. Deletion of cfa attenuated biofilm formation of S. Typhi, while complementation of cfa partly restored the phenotype. Moreover, overexpressing cfa enhanced the biofilm formation of S. Typhi. In summary, RibS has been identified as a novel ncRNA derived from the 3' UTR of RibE that promotes biofilm formation of S. Typhi, and it appears to do so, at least in part, by increasing the expression of cfa. Copyright © 2018 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
Genetic diversity of Histoplasma and Sporothrix complexes based on sequences of their ITS1-5.8S-ITS2 regions from the BOLD System.

PubMed

Estrada-Bárcenas, Daniel Alfonso; Vite-Garín, Tania; Navarro-Barranco, Hortensia; de la Torre-Arciniega, Raúl; Pérez-Mejía, Amelia; Rodríguez-Arellanes, Gabriela; Ramirez, Jose Antonio; Humberto Sahaza, Jorge; Taylor, Maria Lucia; Toriello, Conchita

2014-01-01

High sensitivity and specificity of molecular biology techniques have proven usefulness for the detection, identification and typing of different pathogens. The ITS (Internal Transcribed Spacer) regions of the ribosomal DNA are highly conserved non-coding regions, and have been widely used in different studies including the determination of the genetic diversity of human fungal pathogens. This article wants to contribute to the understanding of the intra- and interspecific genetic diversity of isolates of the Histoplasma capsulatum and Sporothrix schenckii species complexes by an analysis of the available sequences of the ITS regions from different sequence databases. ITS1-5.8S-ITS2 sequences of each fungus, either deposited in GenBank, or from our research groups (registered in the Fungi Barcode of Life Database), were analyzed using the maximum likelihood (ML) method. ML analysis of the ITS sequences discriminated isolates from distant geographic origins and particular wild hosts, depending on the fungal species analyzed. This manuscript is part of the series of works presented at the "V International Workshop: Molecular genetic approaches to the study of human pathogenic fungi" (Oaxaca, Mexico, 2012). Copyright © 2013 Revista Iberoamericana de Micología. Published by Elsevier Espana. All rights reserved.
Non-isotopic Method for In Situ LncRNA Visualization and Quantitation.

PubMed

Maqsodi, Botoul; Nikoloff, Corina

2016-01-01

In mammals and other eukaryotes, most of the genome is transcribed in a developmentally regulated manner to produce large numbers of long noncoding RNAs (lncRNAs). Genome-wide studies have identified thousands of lncRNAs lacking protein-coding capacity. RNA in situ hybridization technique is especially beneficial for the visualization of RNA (mRNA and lncRNA) expression in a heterogeneous population of cells/tissues; however its utility has been hampered by complicated procedures typically developed and optimized for the detection of a specific gene and therefore not amenable to a wide variety of genes and tissues.Recently, bDNA has revolutionized RNA in situ detection with fully optimized, robust assays for the detection of any mRNA and lncRNA targets in formalin-fixed paraffin-embedded (FFPE) and fresh frozen tissue sections using manual processing.

Generation of Infectious Poliovirus with Altered Genetic Information from Cloned cDNA.

PubMed

Bujaki, Erika

2016-01-01

The effect of specific genetic alterations on virus biology and phenotype can be studied by a great number of available assays. The following method describes the basic protocol to generate infectious poliovirus with altered genetic information from cloned cDNA in cultured cells.The example explained here involves generation of a recombinant poliovirus genome by simply replacing a portion of the 5' noncoding region with a synthetic gene by restriction cloning. The vector containing the full length poliovirus genome and the insert DNA with the known mutation(s) are cleaved for directional cloning, then ligated and transformed into competent bacteria. The recombinant plasmid DNA is then propagated in bacteria and transcribed to RNA in vitro before RNA transfection of cultured cells is performed. Finally, viral particles are recovered from the cell culture.
Long noncoding RNA HOTTIP cooperates with CCCTC-binding factor to coordinate HOXA gene expression.

PubMed

Wang, Feng; Tang, Zhongqiong; Shao, Honglian; Guo, Jun; Tan, Tao; Dong, Yang; Lin, Lianbing

2018-06-12

The spatiotemporal control of HOX gene expression is dependent on positional identity and often correlated to their genomic location within each loci. Maintenance of HOX expression patterns is under complex transcriptional and epigenetic regulation, which is not well understood. Here we demonstrate that HOTTIP, a lincRNA transcribed from the 5' edge of the HOXA locus, physically associates with the CCCTC-binding factor (CTCF) that serves as an insulator by organizing HOXA cluster into disjoint domains, to cooperatively maintain the chromatin modifications of HOXA genes and thus coordinate the transcriptional activation of distal HOXA genes in human foreskin fibroblasts. Our results reveal the functional connection of HOTTIP and CTCF, and shed light on lincRNAs in gene activation and CTCF mediated chromatin organization. Copyright © 2018 Elsevier Inc. All rights reserved.
Digital genome-wide ncRNA expression, including SnoRNAs, across 11 human tissues using polyA-neutral amplification.

PubMed

Castle, John C; Armour, Christopher D; Löwer, Martin; Haynor, David; Biery, Matthew; Bouzek, Heather; Chen, Ronghua; Jackson, Stuart; Johnson, Jason M; Rohl, Carol A; Raymond, Christopher K

2010-07-26

Non-coding RNAs (ncRNAs) are an essential class of molecular species that have been difficult to monitor on high throughput platforms due to frequent lack of polyadenylation. Using a polyadenylation-neutral amplification protocol and next-generation sequencing, we explore ncRNA expression in eleven human tissues. ncRNAs 7SL, U2, 7SK, and HBII-52 are expressed at levels far exceeding mRNAs. C/D and H/ACA box snoRNAs are associated with rRNA methylation and pseudouridylation, respectively: spleen expresses both, hypothalamus expresses mainly C/D box snoRNAs, and testes show enriched expression of both H/ACA box snoRNAs and RNA telomerase TERC. Within the snoRNA 14q cluster, 14q(I-6) is expressed at much higher levels than other cluster members. More reads align to mitochondrial than nuclear tRNAs. Many lincRNAs are actively transcribed, particularly those overlapping known ncRNAs. Within the Prader-Willi syndrome loci, the snoRNA HBII-85 (group I) cluster is highly expressed in hypothalamus, greater than in other tissues and greater than group II or III. Additionally, within the disease locus we find novel transcription across a 400,000 nt span in ovaries. This genome-wide polyA-neutral expression compendium demonstrates the richness of ncRNA expression, their high expression patterns, their function-specific expression patterns, and is publicly available.
Variation in MHC genotypes in two populations of house sparrow (Passer domesticus) with different population histories.

PubMed

Borg, Asa Alexandra; Pedersen, Sindre Andre; Jensen, Henrik; Westerdahl, Helena

2011-10-01

Small populations are likely to have a low genetic ability for disease resistance due to loss of genetic variation through inbreeding and genetic drift. In vertebrates, the highest genetic diversity of the immune system is located at genes within the major histocompatibility complex (MHC). Interestingly, parasite-mediated selection is thought to potentially maintain variation at MHC loci even in populations that are monomorphic at other loci. Therefore, general loss of genetic variation in the genome may not necessarily be associated with low variation at MHC loci. We evaluated inter- and intrapopulation variation in MHC genotypes between an inbred (Aldra) and a relatively outbred population (Hestmannøy) of house sparrows (Passer domesticus) in a metapopulation at Helgeland, Norway. Genomic (gDNA) and transcribed (cDNA) alleles of functional MHC class I and IIB loci, along with neutral noncoding microsatellite markers, were analyzed to obtain relevant estimates of genetic variation. We found lower allelic richness in microsatellites in the inbred population, but high genetic variation in MHC class I and IIB loci in both populations. This suggests that also the inbred population could be under balancing selection to maintain genetic variation for pathogen resistance.
Variation in MHC genotypes in two populations of house sparrow (Passer domesticus) with different population histories

PubMed Central

Borg, Åsa Alexandra; Pedersen, Sindre Andre; Jensen, Henrik; Westerdahl, Helena

2011-01-01

Small populations are likely to have a low genetic ability for disease resistance due to loss of genetic variation through inbreeding and genetic drift. In vertebrates, the highest genetic diversity of the immune system is located at genes within the major histocompatibility complex (MHC). Interestingly, parasite-mediated selection is thought to potentially maintain variation at MHC loci even in populations that are monomorphic at other loci. Therefore, general loss of genetic variation in the genome may not necessarily be associated with low variation at MHC loci. We evaluated inter- and intrapopulation variation in MHC genotypes between an inbred (Aldra) and a relatively outbred population (Hestmannøy) of house sparrows (Passer domesticus) in a metapopulation at Helgeland, Norway. Genomic (gDNA) and transcribed (cDNA) alleles of functional MHC class I and IIB loci, along with neutral noncoding microsatellite markers, were analyzed to obtain relevant estimates of genetic variation. We found lower allelic richness in microsatellites in the inbred population, but high genetic variation in MHC class I and IIB loci in both populations. This suggests that also the inbred population could be under balancing selection to maintain genetic variation for pathogen resistance. PMID:22393491
Molecular phylogeny of 21 tropical bamboo species reconstructed by integrating non-coding internal transcribed spacer (ITS1 and 2) sequences and their consensus secondary structure.

PubMed

Ghosh, Jayadri Sekhar; Bhattacharya, Samik; Pal, Amita

2017-06-01

The unavailability of the reproductive structure and unpredictability of vegetative characters for the identification and phylogenetic study of bamboo prompted the application of molecular techniques for greater resolution and consensus. We first employed internal transcribed spacer (ITS1, 5.8S rRNA and ITS2) sequences to construct the phylogenetic tree of 21 tropical bamboo species. While the sequence alone could grossly reconstruct the traditional phylogeny amongst the 21-tropical species studied, some anomalies were encountered that prompted a further refinement of the phylogenetic analyses. Therefore, we integrated the secondary structure of the ITS sequences to derive individual sequence-structure matrix to gain more resolution on the phylogenetic reconstruction. The results showed that ITS sequence-structure is the reliable alternative to the conventional phenotypic method for the identification of bamboo species. The best-fit topology obtained by the sequence-structure based phylogeny over the sole sequence based one underscores closer clustering of all the studied Bambusa species (Sub-tribe Bambusinae), while Melocanna baccifera, which belongs to Sub-Tribe Melocanneae, disjointedly clustered as an out-group within the consensus phylogenetic tree. In this study, we demonstrated the dependability of the combined (ITS sequence+structure-based) approach over the only sequence-based analysis for phylogenetic relationship assessment of bamboo.
RNA- and protein-mediated control of Listeria monocytogenes virulence gene expression

PubMed Central

Lebreton, Alice; Cossart, Pascale

2017-01-01

ABSTRACT The model opportunistic pathogen Listeria monocytogenes has been the object of extensive research, aiming at understanding its ability to colonize diverse environmental niches and animal hosts. Bacterial transcriptomes in various conditions reflect this efficient adaptability. We review here our current knowledge of the mechanisms allowing L. monocytogenes to respond to environmental changes and trigger pathogenicity, with a special focus on RNA-mediated control of gene expression. We highlight how these studies have brought novel concepts in prokaryotic gene regulation, such as the ‘excludon’ where the 5′-UTR of a messenger also acts as an antisense regulator of an operon transcribed in opposite orientation, or the notion that riboswitches can regulate non-coding RNAs to integrate complex metabolic stimuli into regulatory networks. Overall, the Listeria model exemplifies that fine RNA tuners act together with master regulatory proteins to orchestrate appropriate transcriptional programmes. PMID:27217337
Point-of-care diagnostic tools to detect circulating microRNAS as biomarkers of disease.

PubMed

Vaca, Luis

2014-05-22

MicroRNAs or miRNAs are a form of small non-coding RNAs (ncRNAs) of 19-22 nucleotides in length in their mature form. miRNAs are transcribed in the nucleus of all cells from large precursors, many of which have several kilobases in length. Originally identified as intracellular modulators of protein synthesis via posttranscriptional gene silencing, more recently it has been found that miRNAs can travel in extracellular human fluids inside specialized vesicles known as exosomes. We will be referring to this miRNAs as circulating microRNAs. More interestingly, the miRNA content inside exosomes changes during pathological events. In the present review we analyze the literature about circulating miRNAs and their possible use as biomarkers. Furthermore, we explore their future in point-of-care (POC) diagnostics and provide an example of a portable POC apparatus useful in the detection of circulating miRNAs.
Xist recruits the X chromosome to the nuclear lamina to enable chromosome-wide silencing.

PubMed

Chen, Chun-Kan; Blanco, Mario; Jackson, Constanza; Aznauryan, Erik; Ollikainen, Noah; Surka, Christine; Chow, Amy; Cerase, Andrea; McDonel, Patrick; Guttman, Mitchell

2016-10-28

The Xist long noncoding RNA orchestrates X chromosome inactivation, a process that entails chromosome-wide silencing and remodeling of the three-dimensional (3D) structure of the X chromosome. Yet, it remains unclear whether these changes in nuclear structure are mediated by Xist and whether they are required for silencing. Here, we show that Xist directly interacts with the Lamin B receptor, an integral component of the nuclear lamina, and that this interaction is required for Xist-mediated silencing by recruiting the inactive X to the nuclear lamina and by doing so enables Xist to spread to actively transcribed genes across the X. Our results demonstrate that lamina recruitment changes the 3D structure of DNA, enabling Xist and its silencing proteins to spread across the X to silence transcription. Copyright © 2016, American Association for the Advancement of Science.
Heterochromatin-Encoded Satellite RNAs Induce Breast Cancer.

PubMed

Zhu, Quan; Hoong, Nien; Aslanian, Aaron; Hara, Toshiro; Benner, Christopher; Heinz, Sven; Miga, Karen H; Ke, Eugene; Verma, Sachin; Soroczynski, Jan; Yates, John R; Hunter, Tony; Verma, Inder M

2018-06-07

Heterochromatic repetitive satellite RNAs are extensively transcribed in a variety of human cancers, including BRCA1 mutant breast cancer. Aberrant expression of satellite RNAs in cultured cells induces the DNA damage response, activates cell cycle checkpoints, and causes defects in chromosome segregation. However, the mechanism by which satellite RNA expression leads to genomic instability is not well understood. Here we provide evidence that increased levels of satellite RNAs in mammary glands induce tumor formation in mice. Using mass spectrometry, we further show that genomic instability induced by satellite RNAs occurs through interactions with BRCA1-associated protein networks required for the stabilization of DNA replication forks. Additionally, de-stabilized replication forks likely promote the formation of RNA-DNA hybrids in cells expressing satellite RNAs. These studies lay the foundation for developing novel therapeutic strategies that block the effects of non-coding satellite RNAs in cancer cells. Copyright © 2018 Elsevier Inc. All rights reserved.
Deep sequencing approaches for the analysis of prokaryotic transcriptional boundaries and dynamics.

PubMed

James, Katherine; Cockell, Simon J; Zenkin, Nikolay

2017-05-01

The identification of the protein-coding regions of a genome is straightforward due to the universality of start and stop codons. However, the boundaries of the transcribed regions, conditional operon structures, non-coding RNAs and the dynamics of transcription, such as pausing of elongation, are non-trivial to identify, even in the comparatively simple genomes of prokaryotes. Traditional methods for the study of these areas, such as tiling arrays, are noisy, labour-intensive and lack the resolution required for densely-packed bacterial genomes. Recently, deep sequencing has become increasingly popular for the study of the transcriptome due to its lower costs, higher accuracy and single nucleotide resolution. These methods have revolutionised our understanding of prokaryotic transcriptional dynamics. Here, we review the deep sequencing and data analysis techniques that are available for the study of transcription in prokaryotes, and discuss the bioinformatic considerations of these analyses. Copyright © 2017 Elsevier Inc. All rights reserved.
Nuclear Proximity of Mtr4 with RNA exosome restricts DNA mutational asymmetry

PubMed Central

Lim, Junghyun; Giri, Pankaj Kumar; Kazadi, David; Laffleur, Brice; Zhang, Wanwei; Grinstein, Veronika; Pefanis, Evangelos; Brown, Lewis M.; Ladewig, Erik; Martin, Ophélie; Chen, Yuling; Rabadan, Raul; Boyer, François; Rothschild, Gerson; Cogné, Michel; Pinaud, Eric; Deng, Haiteng; Basu, Uttiya

2017-01-01

SUMMARY The distribution of sense and antisense strand DNA mutations on transcribed duplex DNA contributes to the development of immune and neural systems along with the progression of cancer. Because developmentally matured B cells undergo biologically programmed strand-specific DNA mutagenesis at focal DNA/RNA hybrid structures, they make a convenient system to investigate strand-specific mutagenesis mechanisms. We demonstrate that the sense and antisense strand DNA mutagenesis at the immunoglobulin heavy chain locus and some other regions of the B cell genome depends upon localized RNA processing protein complex formation in the nucleus. Both the physical proximity and coupled activities of RNA helicase Mtr4 (and Senataxin) with the noncoding RNA processing function of RNA exosome determine the strand specific distribution of DNA mutations. Our study suggests that strand-specific DNA mutagenesis-associated mechanisms will play major roles in other undiscovered aspects of organismic development. PMID:28431250
LncRNAs: key players and novel insights into diabetes mellitus

PubMed Central

He, Xiaoyun; Ou, Chunlin; Xiao, Yanhua; Han, Qing; Li, Hao; Zhou, Suxian

2017-01-01

Long non-coding RNAs (LncRNAs) are a class of endogenous RNA molecules, which have a transcribing length of over 200 nt, lack a complete functional open reading frame (ORF), and rarely encode a functional short peptide. Recent studies have revealed that disruption of LncRNAs levels correlates with several human diseases, including diabetes mellitus (DM), a complex multifactorial metabolic disorder affecting more than 400 million people worldwide. LncRNAs are emerging as pivotal regulators in various biological processes, in the progression of DM and its associated complications, involving pancreatic β-cell disorder, insulin resistance, and epigenetic regulation, etc. Further investigation into the mechanisms of action of LncRNAs in DM will be of great value in the thorough understanding of pathogenesis. However, prior to successful application of LncRNAs, further search for molecular biomarkers and drug targets to provide a new strategy for DM prevention, early diagnosis, and therapy is warranted. PMID:29050364
Staphylococcus aureus Transcriptome Architecture: From Laboratory to Infection-Mimicking Conditions

PubMed Central

Depke, Maren; Pané-Farré, Jan; Debarbouille, Michel; van der Kooi-Pol, Magdalena M.; Guérin, Cyprien; Dérozier, Sandra; Hiron, Aurelia; Jarmer, Hanne; Leduc, Aurélie; Michalik, Stephan; Reilman, Ewoud; Schaffer, Marc; Schmidt, Frank; Bessières, Philippe; Noirot, Philippe; Hecker, Michael; Msadek, Tarek; Völker, Uwe; van Dijl, Jan Maarten

2016-01-01

Staphylococcus aureus is a major pathogen that colonizes about 20% of the human population. Intriguingly, this Gram-positive bacterium can survive and thrive under a wide range of different conditions, both inside and outside the human body. Here, we investigated the transcriptional adaptation of S. aureus HG001, a derivative of strain NCTC 8325, across experimental conditions ranging from optimal growth in vitro to intracellular growth in host cells. These data establish an extensive repertoire of transcription units and non-coding RNAs, a classification of 1412 promoters according to their dependence on the RNA polymerase sigma factors SigA or SigB, and allow identification of new potential targets for several known transcription factors. In particular, this study revealed a relatively low abundance of antisense RNAs in S. aureus, where they overlap only 6% of the coding genes, and only 19 antisense RNAs not co-transcribed with other genes were found. Promoter analysis and comparison with Bacillus subtilis links the small number of antisense RNAs to a less profound impact of alternative sigma factors in S. aureus. Furthermore, we revealed that Rho-dependent transcription termination suppresses pervasive antisense transcription, presumably originating from abundant spurious transcription initiation in this A+T-rich genome, which would otherwise affect expression of the overlapped genes. In summary, our study provides genome-wide information on transcriptional regulation and non-coding RNAs in S. aureus as well as new insights into the biological function of Rho and the implications of spurious transcription in bacteria. PMID:27035918
A Molecular Portrait of De Novo Genes in Yeasts.

PubMed

Vakirlis, Nikolaos; Hebert, Alex S; Opulente, Dana A; Achaz, Guillaume; Hittinger, Chris Todd; Fischer, Gilles; Coon, Joshua J; Lafontaine, Ingrid

2018-03-01

New genes, with novel protein functions, can evolve "from scratch" out of intergenic sequences. These de novo genes can integrate the cell's genetic network and drive important phenotypic innovations. Therefore, identifying de novo genes and understanding how the transition from noncoding to coding occurs are key problems in evolutionary biology. However, identifying de novo genes is a difficult task, hampered by the presence of remote homologs, fast evolving sequences and erroneously annotated protein coding genes. To overcome these limitations, we developed a procedure that handles the usual pitfalls in de novo gene identification and predicted the emergence of 703 de novo gene candidates in 15 yeast species from 2 genera whose phylogeny spans at least 100 million years of evolution. We validated 85 candidates by proteomic data, providing new translation evidence for 25 of them through mass spectrometry experiments. We also unambiguously identified the mutations that enabled the transition from noncoding to coding for 30 Saccharomyces de novo genes. We established that de novo gene origination is a widespread phenomenon in yeasts, only a few being ultimately maintained by selection. We also found that de novo genes preferentially emerge next to divergent promoters in GC-rich intergenic regions where the probability of finding a fortuitous and transcribed ORF is the highest. Finally, we found a more than 3-fold enrichment of de novo genes at recombination hot spots, which are GC-rich and nucleosome-free regions, suggesting that meiotic recombination contributes to de novo gene emergence in yeasts.
Stimulation of Pol III-dependent 5S rRNA and U6 snRNA gene expression by AP-1 transcription factors.

PubMed

Ahuja, Richa; Kumar, Vijay

2017-07-01

RNA polymerase III transcribes structurally diverse group of essential noncoding RNAs including 5S ribosomal RNA (5SrRNA) and U6 snRNA. These noncoding RNAs are involved in RNA processing and ribosome biogenesis, thus, coupling Pol III activity to the rate of protein synthesis, cell growth, and proliferation. Even though a few Pol II-associated transcription factors have been reported to participate in Pol III-dependent transcription, its activation by activator protein 1 (AP-1) factors, c-Fos and c-Jun, has remained unexplored. Here, we show that c-Fos and c-Jun bind to specific sites in the regulatory regions of 5S rRNA (type I) and U6 snRNA (type III) gene promoters and stimulate their transcription. Our chromatin immunoprecipitation studies suggested that endogenous AP-1 factors bind to their cognate promoter elements during the G1/S transition of cell cycle apparently synchronous with Pol III transcriptional activity. Furthermore, the interaction of c-Jun with histone acetyltransferase p300 promoted the recruitment of p300/CBP complex on the promoters and facilitated the occupancy of Pol III transcriptional machinery via histone acetylation and chromatin remodeling. The findings of our study, together, suggest that AP-1 factors are novel regulators of Pol III-driven 5S rRNA and U6 snRNA expression with a potential role in cell proliferation. © 2017 Federation of European Biochemical Societies.
Non-coding RNAs and Berberine: A new mechanism of its anti-diabetic activities.

PubMed

Chang, Wenguang

2017-01-15

Type 2 Diabetes (T2D) is a metabolic disease with high mortality and morbidity. Non-coding RNAs, including small and long non-coding RNAs, are a novel class of functional RNA molecules that regulate multiple biological functions through diverse mechanisms. Studies in the last decade have demonstrated that non-coding RNAs may represent compelling therapeutic targets and play important roles in regulating the course of insulin resistance and T2D. Berberine, a plant-based alkaloid, has shown promise as an anti-hyperglycaemic, anti-hyperlipidaemic agent against T2D. Previous studies have primarily focused on a diverse array of efficacy end points of berberine in the pathogenesis of metabolic syndromes and inflammation or oxidative stress. Currently, an increasing number of studies have revealed the importance of non-coding RNAs as regulators of the anti-diabetic effects of berberine. The regulation of non-coding RNAs has been associated with several therapeutic actions of berberine in T2D progression. Thus, this review summarizes the anti-diabetic mechanisms of berberine by focusing on its role in regulating non-coding RNA, thus demonstrating that berberine exerts global anti-diabetic effects by targeting non-coding RNAs and that these effects involve several miRNAs, lncRNAs and multiple signal pathways, which may enhance the current understanding of the anti-diabetic mechanism actions of berberine and provide new pathological targets for the development of berberine-related drugs. Copyright © 2016 Elsevier B.V. All rights reserved.
The mitochondrial genomes of the human hookworms, Ancylostoma duodenale and Necator americanus (Nematoda: Secernentea).

PubMed

Hu, Min; Chilton, Neil B; Gasser, Robin B

2002-02-01

The complete mitochondrial genome sequences were determined for two species of human hookworms, Ancylostoma duodenale (13,721 bp) and Necator americanus (13,604 bp). The circular hookworm genomes are amongst the smallest reported to date for any metazoan organism. Their relatively small size relates mainly to a reduced length in the AT-rich region. Both hookworm genomes encode 12 protein, two ribosomal RNA and 22 transfer RNA genes, but lack the ATP synthetase subunit 8 gene, which is consistent with three other species of Secernentea studied to date. All genes are transcribed in the same direction and have a nucleotide composition high in A and T, but low in G and C. The AT bias had a significant effect on both the codon usage pattern and amino acid composition of proteins. For both hookworm species, genes were arranged in the same order as for Caenorhabditis elegans, except for the presence of a non-coding region between genes nad3 and nad5. In A. duodenale, this non-coding region is predicted to form a stem-and-loop structure which is not present in N. americanus. The mitochondrial genome structure for both hookworms differs from Ascaris suum only in the location of the AT-rich region, whereas there are substantial differences when compared with Onchocerca volvulus, including four gene or gene-block translocations and the positions of some transfer RNA genes and the AT-rich region. Based on genome organisation and amino acid sequence identity, A. duodenale and N. americanus were more closely related to C. elegans than to A. suum or O. volvulus (all secernentean nematodes), consistent with a previous phylogenetic study using ribosomal DNA sequence data. Determination of the complete mitochondrial genome sequences for two human hookworms (the first members of the order Strongylida ever sequenced) provides a foundation for studying the systematics, population genetics and ecology of these and other nematodes of socio-economic importance.
Characterization of noncoding regulatory DNA in the human genome.

PubMed

Elkon, Ran; Agami, Reuven

2017-08-08

Genetic variants associated with common diseases are usually located in noncoding parts of the human genome. Delineation of the full repertoire of functional noncoding elements, together with efficient methods for probing their biological roles, is therefore of crucial importance. Over the past decade, DNA accessibility and various epigenetic modifications have been associated with regulatory functions. Mapping these features across the genome has enabled researchers to begin to document the full complement of putative regulatory elements. High-throughput reporter assays to probe the functions of regulatory regions have also been developed but these methods separate putative regulatory elements from the chromosome so that any effects of chromatin context and long-range regulatory interactions are lost. Definitive assignment of function(s) to putative cis-regulatory elements requires perturbation of these elements. Genome-editing technologies are now transforming our ability to perturb regulatory elements across entire genomes. Interpretation of high-throughput genetic screens that incorporate genome editors might enable the construction of an unbiased map of functional noncoding elements in the human genome.
Rational Design of Small Molecules Targeting Oncogenic Noncoding RNAs from Sequence.

PubMed

Disney, Matthew D; Angelbello, Alicia J

2016-12-20

The discovery of RNA catalysis in the 1980s and the dissemination of the human genome sequence at the start of this century inspired investigations of the regulatory roles of noncoding RNAs in biology. In fact, the Encyclopedia of DNA Elements (ENCODE) project has shown that only 1-2% of the human genome encodes protein, yet 75% is transcribed into RNA. Functional studies both preceding and following the ENCODE project have shown that these noncoding RNAs have important roles in regulating gene expression, developmental timing, and other critical functions. RNA's diverse roles are often a consequence of the various folds that it adopts. The single-stranded nature of the biopolymer enables it to adopt intramolecular folds with noncanonical pairings to lower its free energy. These folds can be scaffolds to bind proteins or to form frameworks to interact with other RNAs. Not surprisingly, dysregulation of certain noncoding RNAs has been shown to be causative of disease. Given this as the background, it is easy to see why it would be useful to develop methods that target RNA and manipulate its biology in rational and predictable ways. The antisense approach has afforded strategies to target RNAs via Watson-Crick base pairing and has typically focused on targeting partially unstructured regions of RNA. Small molecule strategies to target RNA would be desirable not only because compounds could be lead optimized via medicinal chemistry but also because structured regions within an RNA of interest could be targeted to directly interfere with RNA folds that contribute to disease. Additionally, small molecules have historically been the most successful drug candidates. Until recently, the ability to design small molecules that target non-ribosomal RNAs has been elusive, creating the perception that they are "undruggable". In this Account, approaches to demystify targeting RNA with small molecules are described. Rather than bulk screening for compounds that bind to singular targets, which is the purview of the pharmaceutical industry and academic institutions with high throughput screening facilities, we focus on methods that allow for the rational design of small molecules toward biological RNAs. One enabling and foundational technology that has been developed is two-dimensional combinatorial screening (2DCS), a library-versus-library selection approach that allows the identification of the RNA motif binding preferences of small molecules from millions of combinations. A landscape map of the 2DCS-defined and annotated RNA motif-small molecule interactions is then placed into Inforna, a computational tool that allows one to mine these interactions against an RNA of interest or an entire transcriptome. Indeed, this approach has been enabled by tools to annotate RNA structure from sequence, an invaluable asset to the RNA community and this work, and has allowed for the rational identification of "druggable" RNAs in a target agnostic fashion.

The development of non-coding RNA ontology.

PubMed

Huang, Jingshan; Eilbeck, Karen; Smith, Barry; Blake, Judith A; Dou, Dejing; Huang, Weili; Natale, Darren A; Ruttenberg, Alan; Huan, Jun; Zimmermann, Michael T; Jiang, Guoqian; Lin, Yu; Wu, Bin; Strachan, Harrison J; de Silva, Nisansa; Kasukurthi, Mohan Vamsi; Jha, Vikash Kumar; He, Yongqun; Zhang, Shaojie; Wang, Xiaowei; Liu, Zixing; Borchert, Glen M; Tan, Ming

2016-01-01

Identification of non-coding RNAs (ncRNAs) has been significantly improved over the past decade. On the other hand, semantic annotation of ncRNA data is facing critical challenges due to the lack of a comprehensive ontology to serve as common data elements and data exchange standards in the field. We developed the Non-Coding RNA Ontology (NCRO) to handle this situation. By providing a formally defined ncRNA controlled vocabulary, the NCRO aims to fill a specific and highly needed niche in semantic annotation of large amounts of ncRNA biological and clinical data.
A systemic identification approach for primary transcription start site of Arabidopsis miRNAs from multidimensional omics data.

PubMed

You, Qi; Yan, Hengyu; Liu, Yue; Yi, Xin; Zhang, Kang; Xu, Wenying; Su, Zhen

2017-05-01

The 22-nucleotide non-coding microRNAs (miRNAs) are mostly transcribed by RNA polymerase II and are similar to protein-coding genes. Unlike the clear process from stem-loop precursors to mature miRNAs, the primary transcriptional regulation of miRNA, especially in plants, still needs to be further clarified, including the original transcription start site, functional cis-elements and primary transcript structures. Due to several well-characterized transcription signals in the promoter region, we proposed a systemic approach integrating multidimensional "omics" (including genomics, transcriptomics, and epigenomics) data to improve the genome-wide identification of primary miRNA transcripts. Here, we used the model plant Arabidopsis thaliana to improve the ability to identify candidate promoter locations in intergenic miRNAs and to determine rules for identifying primary transcription start sites of miRNAs by integrating high-throughput omics data, such as the DNase I hypersensitive sites, chromatin immunoprecipitation-sequencing of polymerase II and H3K4me3, as well as high throughput transcriptomic data. As a result, 93% of refined primary transcripts could be confirmed by the primer pairs from a previous study. Cis-element and secondary structure analyses also supported the feasibility of our results. This work will contribute to the primary transcriptional regulatory analysis of miRNAs, and the conserved regulatory pattern may be a suitable miRNA characteristic in other plant species.
High-resolution transcriptional analysis of the regulatory influence of cell-to-cell signalling reveals novel genes that contribute to Xanthomonas phytopathogenesis

PubMed Central

An, Shi-Qi; Febrer, Melanie; McCarthy, Yvonne; Tang, Dong-Jie; Clissold, Leah; Kaithakottil, Gemy; Swarbreck, David; Tang, Ji-Liang; Rogers, Jane; Dow, J Maxwell; Ryan, Robert P

2013-01-01

The bacterium Xanthomonas campestris is an economically important pathogen of many crop species and a model for the study of bacterial phytopathogenesis. In X. campestris, a regulatory system mediated by the signal molecule DSF controls virulence to plants. The synthesis and recognition of the DSF signal depends upon different Rpf proteins. DSF signal generation requires RpfF whereas signal perception and transduction depends upon a system comprising the sensor RpfC and regulator RpfG. Here we have addressed the action and role of Rpf/DSF signalling in phytopathogenesis by high-resolution transcriptional analysis coupled to functional genomics. We detected transcripts for many genes that were unidentified by previous computational analysis of the genome sequence. Novel transcribed regions included intergenic transcripts predicted as coding or non-coding as well as those that were antisense to coding sequences. In total, mutation of rpfF, rpfG and rpfC led to alteration in transcript levels (more than fourfold) of approximately 480 genes. The regulatory influence of RpfF and RpfC demonstrated considerable overlap. Contrary to expectation, the regulatory influence of RpfC and RpfG had limited overlap, indicating complexities of the Rpf signalling system. Importantly, functional analysis revealed over 160 new virulence factors within the group of Rpf-regulated genes. PMID:23617851
A genome-wide survey of maternal and embryonic transcripts during Xenopus tropicalis development.

PubMed

Paranjpe, Sarita S; Jacobi, Ulrike G; van Heeringen, Simon J; Veenstra, Gert Jan C

2013-11-06

Dynamics of polyadenylation vs. deadenylation determine the fate of several developmentally regulated genes. Decay of a subset of maternal mRNAs and new transcription define the maternal-to-zygotic transition, but the full complement of polyadenylated and deadenylated coding and non-coding transcripts has not yet been assessed in Xenopus embryos. To analyze the dynamics and diversity of coding and non-coding transcripts during development, both polyadenylated mRNA and ribosomal RNA-depleted total RNA were harvested across six developmental stages and subjected to high throughput sequencing. The maternally loaded transcriptome is highly diverse and consists of both polyadenylated and deadenylated transcripts. Many maternal genes show peak expression in the oocyte and include genes which are known to be the key regulators of events like oocyte maturation and fertilization. Of all the transcripts that increase in abundance between early blastula and larval stages, about 30% of the embryonic genes are induced by fourfold or more by the late blastula stage and another 35% by late gastrulation. Using a gene model validation and discovery pipeline, we identified novel transcripts and putative long non-coding RNAs (lncRNA). These lncRNA transcripts were stringently selected as spliced transcripts generated from independent promoters, with limited coding potential and a codon bias characteristic of noncoding sequences. Many lncRNAs are conserved and expressed in a developmental stage-specific fashion. These data reveal dynamics of transcriptome polyadenylation and abundance and provides a high-confidence catalogue of novel and long non-coding RNAs.
Digital Genome-Wide ncRNA Expression, Including SnoRNAs, across 11 Human Tissues Using PolyA-Neutral Amplification

PubMed Central

Castle, John C.; Armour, Christopher D.; Löwer, Martin; Haynor, David; Biery, Matthew; Bouzek, Heather; Chen, Ronghua; Jackson, Stuart; Johnson, Jason M.; Rohl, Carol A.; Raymond, Christopher K.

2010-01-01

Non-coding RNAs (ncRNAs) are an essential class of molecular species that have been difficult to monitor on high throughput platforms due to frequent lack of polyadenylation. Using a polyadenylation-neutral amplification protocol and next-generation sequencing, we explore ncRNA expression in eleven human tissues. ncRNAs 7SL, U2, 7SK, and HBII-52 are expressed at levels far exceeding mRNAs. C/D and H/ACA box snoRNAs are associated with rRNA methylation and pseudouridylation, respectively: spleen expresses both, hypothalamus expresses mainly C/D box snoRNAs, and testes show enriched expression of both H/ACA box snoRNAs and RNA telomerase TERC. Within the snoRNA 14q cluster, 14q(I-6) is expressed at much higher levels than other cluster members. More reads align to mitochondrial than nuclear tRNAs. Many lincRNAs are actively transcribed, particularly those overlapping known ncRNAs. Within the Prader-Willi syndrome loci, the snoRNA HBII-85 (group I) cluster is highly expressed in hypothalamus, greater than in other tissues and greater than group II or III. Additionally, within the disease locus we find novel transcription across a 400,000 nt span in ovaries. This genome-wide polyA-neutral expression compendium demonstrates the richness of ncRNA expression, their high expression patterns, their function-specific expression patterns, and is publicly available. PMID:20668672
The developmental transcriptome of Drosophila melanogaster

DOE Office of Scientific and Technical Information (OSTI.GOV)

University of Connecticut; Graveley, Brenton R.; Brooks, Angela N.

Drosophila melanogaster is one of the most well studied genetic model organisms; nonetheless, its genome still contains unannotated coding and non-coding genes, transcripts, exons and RNA editing sites. Full discovery and annotation are pre-requisites for understanding how the regulation of transcription, splicing and RNA editing directs the development of this complex organism. Here we used RNA-Seq, tiling microarrays and cDNA sequencing to explore the transcriptome in 30 distinct developmental stages. We identified 111,195 new elements, including thousands of genes, coding and non-coding transcripts, exons, splicing and editing events, and inferred protein isoforms that previously eluded discovery using established experimental, predictionmore » and conservation-based approaches. These data substantially expand the number of known transcribed elements in the Drosophila genome and provide a high-resolution view of transcriptome dynamics throughout development. Drosophila melanogaster is an important non-mammalian model system that has had a critical role in basic biological discoveries, such as identifying chromosomes as the carriers of genetic information and uncovering the role of genes in development. Because it shares a substantial genic content with humans, Drosophila is increasingly used as a translational model for human development, homeostasis and disease. High-quality maps are needed for all functional genomic elements. Previous studies demonstrated that a rich collection of genes is deployed during the life cycle of the fly. Although expression profiling using microarrays has revealed the expression of, 13,000 annotated genes, it is difficult to map splice junctions and individual base modifications generated by RNA editing using such approaches. Single-base resolution is essential to define precisely the elements that comprise the Drosophila transcriptome. Estimates of the number of transcript isoforms are less accurate than estimates of the number of genes. Whereas, 20% of Drosophila genes are annotated as encoding alternatively spliced premRNAs, splice-junction microarray experiments indicate that this number is at least 40% (ref. 7). Determining the diversity of mRNAs generated by alternative promoters, alternative splicing and RNA editing will substantially increase the inferred protein repertoire. Non-coding RNA genes (ncRNAs) including short interfering RNAs (siRNAs) and microRNAS (miRNAs) (reviewed in ref. 10), and longer ncRNAs such as bxd (ref. 11) and rox (ref. 12), have important roles in gene regulation, whereas others such as small nucleolar RNAs (snoRNAs)and small nuclear RNAs (snRNAs) are important components of macromolecular machines such as the ribosome and spliceosome. The transcription and processing of these ncRNAs must also be fully documented and mapped. As part of the modENCODE project to annotate the functional elements of the D. melanogaster and Caenorhabditis elegans genomes, we used RNA-Seq and tiling microarrays to sample the Drosophila transcriptome at unprecedented depth throughout development from early embryo to ageing male and female adults. We report on a high-resolution view of the discovery, structure and dynamic expression of the D. melanogaster transcriptome.« less
A 5' UTR-Overlapping LncRNA Activates the Male-Determining Gene doublesex1 in the Crustacean Daphnia magna.

PubMed

Kato, Yasuhiko; Perez, Christelle Alexa G; Mohamad Ishak, Nur Syafiqah; Nong, Quang D; Sudo, Yuumi; Matsuura, Tomoaki; Wada, Tadashi; Watanabe, Hajime

2018-06-04

Long noncoding RNAs (lncRNAs) are pervasively transcribed in the eukaryotic genome [1] and are important for the control of master regulatory genes that are involved in cell differentiation and development [2, 3]. Here, we show that a 5' UTR-overlapping lncRNA regulates the male-specific expression of the DM-domain gene doublesex1 (dsx1) in the crustacean Daphnia magna, which produces males in response to environmental stimuli. This lncRNA, named doublesex1 alpha promoter-associated long RNA (DAPALR), is transcribed upstream the transcription start site (TSS) in a sense orientation and subjected to 5' end capping and 3' end processing at a stem-loop structure before the dsx1 coding exon. Similar to dsx1, its expression is only activated in males by the juvenile hormone (JH) and basic-leucine zipper (bZIP) transcription factor Vrille (Vri) and is maintained during embryogenesis. Knockdown of DAPALR in males silenced dsx1 and led to feminization, including egg production, whereas ectopic expression of DAPALR in dsx1-silenced females resulted in the de-repression of dsx1. We further demonstrate that the DAPALR transcript overlaps the dsx1 5'-UTR, and this overlapping region is required for dsx1 activation. Our results suggest that DAPALR can transactivate and possibly maintain dsx1 expression. This might be important for converting transient environmental signals into stable male development, controlled by the continuous expression of dsx1. Copyright © 2018 Elsevier Ltd. All rights reserved.
Natural antisense transcript-targeted regulation of inducible nitric oxide synthase mRNA levels.

PubMed

Yoshigai, Emi; Hara, Takafumi; Araki, Yoshiro; Tanaka, Yoshito; Oishi, Masaharu; Tokuhara, Katsuji; Kaibori, Masaki; Okumura, Tadayoshi; Kwon, A-Hon; Nishizawa, Mikio

2013-04-01

Natural antisense transcripts (asRNAs) are frequently transcribed from mammalian genes. Recently, we found that non-coding asRNAs are transcribed from the 3' untranslated region (3'UTR) of the rat and mouse genes encoding inducible nitric oxide synthase (iNOS), which catalyzes the production of the inflammatory mediator nitric oxide. The iNOS asRNA stabilizes iNOS mRNA by interacting with the mRNA 3'UTR. Furthermore, single-stranded 'sense' oligonucleotides corresponding to the iNOS mRNA sequence were found to reduce iNOS mRNA levels by interfering with mRNA-asRNA interactions in rat hepatocytes. This method was named natural antisense transcript-targeted regulation (NATRE) technology. In this study, we detected human iNOS asRNA expressed in hepatocarcinoma and colon carcinoma tissues. The human iNOS asRNA harbored a sequence complementary to an evolutionarily conserved region of the iNOS mRNA 3'UTR. When introduced into hepatocytes, iNOS sense oligonucleotides that were modified by substitution with partial phosphorothioate bonds and locked nucleic acids or 2'-O-methyl nucleic acids greatly reduced levels of iNOS mRNA and iNOS protein. Moreover, sense oligonucleotides and short interfering RNAs decreased iNOS mRNA to comparable levels. These results suggest that NATRE technology using iNOS sense oligonucleotides could potentially be used to treat human inflammatory diseases and cancers by reducing iNOS mRNA levels. Copyright © 2013 Elsevier Inc. All rights reserved.
Cis-encoded non-coding antisense RNAs in streptococci and other low GC Gram (+) bacterial pathogens

PubMed Central

Cho, Kyu Hong; Kim, Jeong-Ho

2015-01-01

Due to recent advances of bioinformatics and high throughput sequencing technology, discovery of regulatory non-coding RNAs in bacteria has been increased to a great extent. Based on this bandwagon, many studies searching for trans-acting small non-coding RNAs in streptococci have been performed intensively, especially in the important human pathogen, group A and B streptococci. However, studies for cis-encoded non-coding antisense RNAs in streptococci have been scarce. A recent study shows antisense RNAs are involved in virulence gene regulation in group B streptococcus, S. agalactiae. This suggests antisense RNAs could have important roles in the pathogenesis of streptococcal pathogens. In this review, we describe recent discoveries of chromosomal cis-encoded antisense RNAs in streptococcal pathogens and other low GC Gram (+) bacteria to provide a guide for future studies. PMID:25859258
Heavy Chronic Intermittent Ethanol Exposure Alters Small Noncoding RNAs in Mouse Sperm and Epididymosomes.

PubMed

Rompala, Gregory R; Mounier, Anais; Wolfe, Cody M; Lin, Qishan; Lefterov, Iliya; Homanics, Gregg E

2018-01-01

While the risks of maternal alcohol abuse during pregnancy are well-established, several preclinical studies suggest that chronic preconception alcohol consumption by either parent may also have significance consequences for offspring health and development. Notably, since isogenic male mice used in these studies are not involved in gestation or rearing of offspring, the cross-generational effects of paternal alcohol exposure suggest a germline-based epigenetic mechanism. Many recent studies have demonstrated that the effects of paternal environmental exposures such as stress or malnutrition can be transmitted to the next generation via alterations to small noncoding RNAs in sperm. Therefore, we used high throughput sequencing to examine the effect of preconception ethanol on small noncoding RNAs in sperm. We found that chronic intermittent ethanol exposure altered several small noncoding RNAs from three of the major small RNA classes in sperm, tRNA-derived small RNA (tDR), mitochondrial small RNA, and microRNA. Six of the ethanol-responsive small noncoding RNAs were evaluated with RT-qPCR on a separate cohort of mice and five of the six were confirmed to be altered by chronic ethanol exposure, supporting the validity of the sequencing results. In addition to altered sperm RNA abundance, chronic ethanol exposure affected post-transcriptional modifications to sperm small noncoding RNAs, increasing two nucleoside modifications previously identified in mitochondrial tRNA. Furthermore, we found that chronic ethanol reduced epididymal expression of a tRNA methyltransferase, Nsun2 , known to directly regulate tDR biogenesis. Finally, ethanol-responsive sperm tDR are similarly altered in extracellular vesicles of the epididymis (i.e., epididymosomes), supporting the hypothesis that alterations to sperm tDR emerge in the epididymis and that epididymosomes are the primary source of small noncoding RNAs in sperm. These results add chronic ethanol to the growing list of paternal exposures that can affect small noncoding RNA abundance and nucleoside modifications in sperm. As small noncoding RNAs in sperm have been shown to causally induce heritable phenotypes in offspring, additional research is warranted to understand the potential effects of ethanol-responsive sperm small noncoding RNAs on offspring health and development.
Heavy Chronic Intermittent Ethanol Exposure Alters Small Noncoding RNAs in Mouse Sperm and Epididymosomes

PubMed Central

Rompala, Gregory R.; Mounier, Anais; Wolfe, Cody M.; Lin, Qishan; Lefterov, Iliya; Homanics, Gregg E.

2018-01-01

While the risks of maternal alcohol abuse during pregnancy are well-established, several preclinical studies suggest that chronic preconception alcohol consumption by either parent may also have significance consequences for offspring health and development. Notably, since isogenic male mice used in these studies are not involved in gestation or rearing of offspring, the cross-generational effects of paternal alcohol exposure suggest a germline-based epigenetic mechanism. Many recent studies have demonstrated that the effects of paternal environmental exposures such as stress or malnutrition can be transmitted to the next generation via alterations to small noncoding RNAs in sperm. Therefore, we used high throughput sequencing to examine the effect of preconception ethanol on small noncoding RNAs in sperm. We found that chronic intermittent ethanol exposure altered several small noncoding RNAs from three of the major small RNA classes in sperm, tRNA-derived small RNA (tDR), mitochondrial small RNA, and microRNA. Six of the ethanol-responsive small noncoding RNAs were evaluated with RT-qPCR on a separate cohort of mice and five of the six were confirmed to be altered by chronic ethanol exposure, supporting the validity of the sequencing results. In addition to altered sperm RNA abundance, chronic ethanol exposure affected post-transcriptional modifications to sperm small noncoding RNAs, increasing two nucleoside modifications previously identified in mitochondrial tRNA. Furthermore, we found that chronic ethanol reduced epididymal expression of a tRNA methyltransferase, Nsun2, known to directly regulate tDR biogenesis. Finally, ethanol-responsive sperm tDR are similarly altered in extracellular vesicles of the epididymis (i.e., epididymosomes), supporting the hypothesis that alterations to sperm tDR emerge in the epididymis and that epididymosomes are the primary source of small noncoding RNAs in sperm. These results add chronic ethanol to the growing list of paternal exposures that can affect small noncoding RNA abundance and nucleoside modifications in sperm. As small noncoding RNAs in sperm have been shown to causally induce heritable phenotypes in offspring, additional research is warranted to understand the potential effects of ethanol-responsive sperm small noncoding RNAs on offspring health and development. PMID:29472946
cncRNAs: Bi-functional RNAs with protein coding and non-coding functions

PubMed Central

Kumari, Pooja; Sampath, Karuna

2015-01-01

For many decades, the major function of mRNA was thought to be to provide protein-coding information embedded in the genome. The advent of high-throughput sequencing has led to the discovery of pervasive transcription of eukaryotic genomes and opened the world of RNA-mediated gene regulation. Many regulatory RNAs have been found to be incapable of protein coding and are hence termed as non-coding RNAs (ncRNAs). However, studies in recent years have shown that several previously annotated non-coding RNAs have the potential to encode proteins, and conversely, some coding RNAs have regulatory functions independent of the protein they encode. Such bi-functional RNAs, with both protein coding and non-coding functions, which we term as ‘cncRNAs’, have emerged as new players in cellular systems. Here, we describe the functions of some cncRNAs identified from bacteria to humans. Because the functions of many RNAs across genomes remains unclear, we propose that RNAs be classified as coding, non-coding or both only after careful analysis of their functions. PMID:26498036
Standing your Ground to Exoribonucleases: Function of Flavivirus Long Non-coding RNAs

PubMed Central

Charley, Phillida A.; Wilusz, Jeffrey

2015-01-01

Members of the Flaviviridae (e.g. Dengue virus, West Nile virus, and Hepatitis C virus) contain a positive-sense RNA genome that encodes a large polyprotein. It is now also clear most if not all of these viruses also produce an abundant subgenomic long non-coding RNA. These non-coding RNAs, which are called subgenomicflavivirus RNAs (sfRNAs) or Xrn1-resistant RNAs (xrRNAs), are stable decay intermediates generated from the viral genomic RNA through the stalling of the cellular exoribonuclease Xrn1 at highly structured regions. Several functions of these flavivirus long non-coding RNAs have been revealed in recent years. The generation of these sfRNAs/xrRNAs from viral transcripts results in the repression of Xrn1 and the dysregulation of cellular mRNA stability. The abundant sfRNAs also serve directly as a decoy for important cellular protein regulators of the interferon and RNA interference antiviral pathways. Thus the generation of long non-coding RNAs from flaviviruses, hepaciviruses and pestiviruses likely disrupts aspects of innate immunity and may directly contribute to viral replication, cytopathology and pathogenesis. PMID:26368052
Differential expression and emerging functions of non-coding RNAs in cold adaptation.

PubMed

Frigault, Jacques J; Morin, Mathieu D; Morin, Pier Jr

2017-01-01

Several species undergo substantial physiological and biochemical changes to confront the harsh conditions associated with winter. Small mammalian hibernators and cold-hardy insects are examples of natural models of cold adaptation that have been amply explored. While the molecular picture associated with cold adaptation has started to become clearer in recent years, notably through the use of high-throughput experimental approaches, the underlying cold-associated functions attributed to several non-coding RNAs, including microRNAs (miRNAs) and long non-coding RNAs (lncRNAs), remain to be better characterized. Nevertheless, key pioneering work has provided clues on the likely relevance of these molecules in cold adaptation. With an emphasis on mammalian hibernation and insect cold hardiness, this work first reviews various molecular changes documented so far in these processes. The cascades leading to miRNA and lncRNA production as well as the mechanisms of action of these non-coding RNAs are subsequently described. Finally, we present examples of differentially expressed non-coding RNAs in models of cold adaptation and elaborate on the potential significance of this modulation with respect to low-temperature adaptation.
Identification of novel non-coding small RNAs from Streptococcus pneumoniae TIGR4 using high-resolution genome tiling arrays

PubMed Central

2010-01-01

Background The identification of non-coding transcripts in human, mouse, and Escherichia coli has revealed their widespread occurrence and functional importance in both eukaryotic and prokaryotic life. In prokaryotes, studies have shown that non-coding transcripts participate in a broad range of cellular functions like gene regulation, stress and virulence. However, very little is known about non-coding transcripts in Streptococcus pneumoniae (pneumococcus), an obligate human respiratory pathogen responsible for significant worldwide morbidity and mortality. Tiling microarrays enable genome wide mRNA profiling as well as identification of novel transcripts at a high-resolution. Results Here, we describe a high-resolution transcription map of the S. pneumoniae clinical isolate TIGR4 using genomic tiling arrays. Our results indicate that approximately 66% of the genome is expressed under our experimental conditions. We identified a total of 50 non-coding small RNAs (sRNAs) from the intergenic regions, of which 36 had no predicted function. Half of the identified sRNA sequences were found to be unique to S. pneumoniae genome. We identified eight overrepresented sequence motifs among sRNA sequences that correspond to sRNAs in different functional categories. Tiling arrays also identified approximately 202 operon structures in the genome. Conclusions In summary, the pneumococcal operon structures and novel sRNAs identified in this study enhance our understanding of the complexity and extent of the pneumococcal 'expressed' genome. Furthermore, the results of this study open up new avenues of research for understanding the complex RNA regulatory network governing S. pneumoniae physiology and virulence. PMID:20525227
Long Non-Coding RNAs: A Novel Paradigm for Toxicology

PubMed Central

Dempsey, Joseph L.; Cui, Julia Yue

2017-01-01

Long non-coding RNAs (lncRNAs) are over 200 nucleotides in length and are transcribed from the mammalian genome in a tissue-specific and developmentally regulated pattern. There is growing recognition that lncRNAs are novel biomarkers and/or key regulators of toxicological responses in humans and animal models. Lacking protein-coding capacity, the numerous types of lncRNAs possess a myriad of transcriptional regulatory functions that include cis and trans gene expression, transcription factor activity, chromatin remodeling, imprinting, and enhancer up-regulation. LncRNAs also influence mRNA processing, post-transcriptional regulation, and protein trafficking. Dysregulation of lncRNAs has been implicated in various human health outcomes such as various cancers, Alzheimer’s disease, cardiovascular disease, autoimmune diseases, as well as intermediary metabolism such as glucose, lipid, and bile acid homeostasis. Interestingly, emerging evidence in the literature over the past five years has shown that lncRNA regulation is impacted by exposures to various chemicals such as polycyclic aromatic hydrocarbons, benzene, cadmium, chlorpyrifos-methyl, bisphenol A, phthalates, phenols, and bile acids. Recent technological advancements, including next-generation sequencing technologies and novel computational algorithms, have enabled the profiling and functional characterizations of lncRNAs on a genomic scale. In this review, we summarize the biogenesis and general biological functions of lncRNAs, highlight the important roles of lncRNAs in human diseases and especially during the toxicological responses to various xenobiotics, evaluate current methods for identifying aberrant lncRNA expression and molecular target interactions, and discuss the potential to implement these tools to address fundamental questions in toxicology. PMID:27864543
Kinetic models of gene expression including non-coding RNAs

NASA Astrophysics Data System (ADS)

Zhdanov, Vladimir P.

2011-03-01

In cells, genes are transcribed into mRNAs, and the latter are translated into proteins. Due to the feedbacks between these processes, the kinetics of gene expression may be complex even in the simplest genetic networks. The corresponding models have already been reviewed in the literature. A new avenue in this field is related to the recognition that the conventional scenario of gene expression is fully applicable only to prokaryotes whose genomes consist of tightly packed protein-coding sequences. In eukaryotic cells, in contrast, such sequences are relatively rare, and the rest of the genome includes numerous transcript units representing non-coding RNAs (ncRNAs). During the past decade, it has become clear that such RNAs play a crucial role in gene expression and accordingly influence a multitude of cellular processes both in the normal state and during diseases. The numerous biological functions of ncRNAs are based primarily on their abilities to silence genes via pairing with a target mRNA and subsequently preventing its translation or facilitating degradation of the mRNA-ncRNA complex. Many other abilities of ncRNAs have been discovered as well. Our review is focused on the available kinetic models describing the mRNA, ncRNA and protein interplay. In particular, we systematically present the simplest models without kinetic feedbacks, models containing feedbacks and predicting bistability and oscillations in simple genetic networks, and models describing the effect of ncRNAs on complex genetic networks. Mathematically, the presentation is based primarily on temporal mean-field kinetic equations. The stochastic and spatio-temporal effects are also briefly discussed.
Genomic positional conservation identifies topological anchor point RNAs linked to developmental loci.

PubMed

Amaral, Paulo P; Leonardi, Tommaso; Han, Namshik; Viré, Emmanuelle; Gascoigne, Dennis K; Arias-Carrasco, Raúl; Büscher, Magdalena; Pandolfini, Luca; Zhang, Anda; Pluchino, Stefano; Maracaja-Coutinho, Vinicius; Nakaya, Helder I; Hemberg, Martin; Shiekhattar, Ramin; Enright, Anton J; Kouzarides, Tony

2018-03-15

The mammalian genome is transcribed into large numbers of long noncoding RNAs (lncRNAs), but the definition of functional lncRNA groups has proven difficult, partly due to their low sequence conservation and lack of identified shared properties. Here we consider promoter conservation and positional conservation as indicators of functional commonality. We identify 665 conserved lncRNA promoters in mouse and human that are preserved in genomic position relative to orthologous coding genes. These positionally conserved lncRNA genes are primarily associated with developmental transcription factor loci with which they are coexpressed in a tissue-specific manner. Over half of positionally conserved RNAs in this set are linked to chromatin organization structures, overlapping binding sites for the CTCF chromatin organiser and located at chromatin loop anchor points and borders of topologically associating domains (TADs). We define these RNAs as topological anchor point RNAs (tapRNAs). Characterization of these noncoding RNAs and their associated coding genes shows that they are functionally connected: they regulate each other's expression and influence the metastatic phenotype of cancer cells in vitro in a similar fashion. Furthermore, we find that tapRNAs contain conserved sequence domains that are enriched in motifs for zinc finger domain-containing RNA-binding proteins and transcription factors, whose binding sites are found mutated in cancers. This work leverages positional conservation to identify lncRNAs with potential importance in genome organization, development and disease. The evidence that many developmental transcription factors are physically and functionally connected to lncRNAs represents an exciting stepping-stone to further our understanding of genome regulation.
Measles virus minigenomes encoding two autofluorescent proteins reveal cell-to-cell variation in reporter expression dependent on viral sequences between the transcription units.

PubMed

Rennick, Linda J; Duprex, W Paul; Rima, Bert K

2007-10-01

Transcription from morbillivirus genomes commences at a single promoter in the 3' non-coding terminus, with the six genes being transcribed sequentially. The 3' and 5' untranslated regions (UTRs) of the genes (mRNA sense), together with the intergenic trinucleotide spacer, comprise the non-coding sequences (NCS) of the virus and contain the conserved gene end and gene start signals, respectively. Bicistronic minigenomes containing transcription units (TUs) encoding autofluorescent reporter proteins separated by measles virus (MV) NCS were used to give a direct estimation of gene expression in single, living cells by assessing the relative amounts of each fluorescent protein in each cell. Initially, five minigenomes containing each of the MV NCS were generated. Assays were developed to determine the amount of each fluorescent protein in cells at both cell population and single-cell levels. This revealed significant variations in gene expression between cells expressing the same NCS-containing minigenome. The minigenome containing the M/F NCS produced significantly lower amounts of fluorescent protein from the second TU (TU2), compared with the other minigenomes. A minigenome with a truncated F 5' UTR had increased expression from TU2. This UTR is 524 nt longer than the other MV 5' UTRs. Insertions into the 5' UTR of the enhanced green fluorescent protein gene in the minigenome containing the N/P NCS showed that specific sequences, rather than just the additional length of F 5' UTR, govern this decreased expression from TU2.
Interpreting Mammalian Evolution using Fugu Genome Comparisons

DOE Office of Scientific and Technical Information (OSTI.GOV)

Stubbs, L; Ovcharenko, I; Loots, G G

2004-04-02

Comparative sequence analysis of the human and the pufferfish Fugu rubripes (fugu) genomes has revealed several novel functional coding and noncoding regions in the human genome. In particular, the fugu genome has been extremely valuable for identifying transcriptional regulatory elements in human loci harboring unusually high levels of evolutionary conservation to rodent genomes. In such regions, the large evolutionary distance between human and fishes provides an additional filter through which functional noncoding elements can be detected with high efficiency.

NONCODE v2.0: decoding the non-coding.

PubMed

He, Shunmin; Liu, Changning; Skogerbø, Geir; Zhao, Haitao; Wang, Jie; Liu, Tao; Bai, Baoyan; Zhao, Yi; Chen, Runsheng

2008-01-01

The NONCODE database is an integrated knowledge database designed for the analysis of non-coding RNAs (ncRNAs). Since NONCODE was first released 3 years ago, the number of known ncRNAs has grown rapidly, and there is growing recognition that ncRNAs play important regulatory roles in most organisms. In the updated version of NONCODE (NONCODE v2.0), the number of collected ncRNAs has reached 206 226, including a wide range of microRNAs, Piwi-interacting RNAs and mRNA-like ncRNAs. The improvements brought to the database include not only new and updated ncRNA data sets, but also an incorporation of BLAST alignment search service and access through our custom UCSC Genome Browser. NONCODE can be found under http://www.noncode.org or http://noncode.bioinfo.org.cn.
The emergence of noncoding RNAs as Heracles in autophagy.

PubMed

Zhang, Jian; Wang, Peiyuan; Wan, Lin; Xu, Shouping; Pang, Da

2017-06-03

Macroautophagy/autophagy is a catabolic process that is widely found in nature. Over the past few decades, mounting evidence has indicated that noncoding RNAs, ranging from small noncoding RNAs to long noncoding RNAs (lncRNAs) and even circular RNAs (circRNAs), mediate the transcriptional and post-transcriptional regulation of autophagy-related genes by participating in autophagy regulatory networks. The differential expression of noncoding RNAs affects autophagy levels at different physiological and pathological stages, including embryonic proliferation and differentiation, cellular senescence, and even diseases such as cancer. We summarize the current knowledge regarding noncoding RNA dysregulation in autophagy and investigate the molecular regulatory mechanisms underlying noncoding RNA involvement in autophagy regulatory networks. Then, we integrate public resources to predict autophagy-related noncoding RNAs across species and discuss strategies for and the challenges of identifying autophagy-related noncoding RNAs. This article will deepen our understanding of the relationship between noncoding RNAs and autophagy, and provide new insights to specifically target noncoding RNAs in autophagy-associated therapeutic strategies.
lncRNA requirements for mouse acute myeloid leukemia and normal differentiation

PubMed Central

Knott, Simon RV; Munera Maravilla, Ester; Jackson, Benjamin T; Wild, Sophia A; Kovacevic, Tatjana; Stork, Eva Maria; Zhou, Meng; Erard, Nicolas; Lee, Emily; Kelley, David R; Roth, Mareike; Barbosa, Inês AM; Zuber, Johannes; Rinn, John L

2017-01-01

A substantial fraction of the genome is transcribed in a cell-type-specific manner, producing long non-coding RNAs (lncRNAs), rather than protein-coding transcripts. Here, we systematically characterize transcriptional dynamics during hematopoiesis and in hematological malignancies. Our analysis of annotated and de novo assembled lncRNAs showed many are regulated during differentiation and mis-regulated in disease. We assessed lncRNA function via an in vivo RNAi screen in a model of acute myeloid leukemia. This identified several lncRNAs essential for leukemia maintenance, and found that a number act by promoting leukemia stem cell signatures. Leukemia blasts show a myeloid differentiation phenotype when these lncRNAs were depleted, and our data indicates that this effect is mediated via effects on the MYC oncogene. Bone marrow reconstitutions showed that a lncRNA expressed across all progenitors was required for the myeloid lineage, whereas the other leukemia-induced lncRNAs were dispensable in the normal setting. PMID:28875933
lncRNA requirements for mouse acute myeloid leukemia and normal differentiation.

PubMed

Delás, M Joaquina; Sabin, Leah R; Dolzhenko, Egor; Knott, Simon Rv; Munera Maravilla, Ester; Jackson, Benjamin T; Wild, Sophia A; Kovacevic, Tatjana; Stork, Eva Maria; Zhou, Meng; Erard, Nicolas; Lee, Emily; Kelley, David R; Roth, Mareike; Barbosa, Inês Am; Zuber, Johannes; Rinn, John L; Smith, Andrew D; Hannon, Gregory J

2017-09-06

A substantial fraction of the genome is transcribed in a cell-type-specific manner, producing long non-coding RNAs (lncRNAs), rather than protein-coding transcripts. Here, we systematically characterize transcriptional dynamics during hematopoiesis and in hematological malignancies. Our analysis of annotated and de novo assembled lncRNAs showed many are regulated during differentiation and mis-regulated in disease. We assessed lncRNA function via an in vivo RNAi screen in a model of acute myeloid leukemia. This identified several lncRNAs essential for leukemia maintenance, and found that a number act by promoting leukemia stem cell signatures. Leukemia blasts show a myeloid differentiation phenotype when these lncRNAs were depleted, and our data indicates that this effect is mediated via effects on the MYC oncogene. Bone marrow reconstitutions showed that a lncRNA expressed across all progenitors was required for the myeloid lineage, whereas the other leukemia-induced lncRNAs were dispensable in the normal setting.
Down-regulation of 21A Alu RNA as a tool to boost proliferation maintaining the tissue regeneration potential of progenitor cells

PubMed Central

Gigoni, Arianna; Costa, Delfina; Gaetani, Massimiliano; Tasso, Roberta; Villa, Federico; Florio, Tullio; Pagano, Aldo

2016-01-01

ABSTRACT 21A is an Alu non-coding (nc) RNA transcribed by RNA polymerase (pol) III. While investigating the biological role of 21A ncRNA we documented an inverse correlation between its expression level and the rate of cell proliferation. The downregulation of this ncRNA not only caused a boost in cell proliferation, but was also associated to a transient cell dedifferentiation, suggesting a possible involvement of this RNA in cell dedifferentiation/reprogramming. In this study, we explored the possibility to enhance proliferation and dedifferentiation of cells of interest, by 21A down-regulation, using a mixture of chemically modified Anti-21A RNAs. Our results confirmed the validity of this approach that allows the amplification of specific cell populations, in a controlled manner and without inducing permanent effects. In addition to induce cell proliferation, the procedure did not decrease the tissue regeneration potential of progenitor cells in two different cell systems. PMID:27494068
Down-regulation of 21A Alu RNA as a tool to boost proliferation maintaining the tissue regeneration potential of progenitor cells.

PubMed

Gigoni, Arianna; Costa, Delfina; Gaetani, Massimiliano; Tasso, Roberta; Villa, Federico; Florio, Tullio; Pagano, Aldo

2016-09-16

21A is an Alu non-coding (nc) RNA transcribed by RNA polymerase (pol) III. While investigating the biological role of 21A ncRNA we documented an inverse correlation between its expression level and the rate of cell proliferation. The downregulation of this ncRNA not only caused a boost in cell proliferation, but was also associated to a transient cell dedifferentiation, suggesting a possible involvement of this RNA in cell dedifferentiation/reprogramming. In this study, we explored the possibility to enhance proliferation and dedifferentiation of cells of interest, by 21A down-regulation, using a mixture of chemically modified Anti-21A RNAs. Our results confirmed the validity of this approach that allows the amplification of specific cell populations, in a controlled manner and without inducing permanent effects. In addition to induce cell proliferation, the procedure did not decrease the tissue regeneration potential of progenitor cells in two different cell systems.
Molecular organization of the 5S rDNA gene type II in elasmobranchs.

PubMed

Castro, Sergio I; Hleap, Jose S; Cárdenas, Heiber; Blouin, Christian

2016-01-01

The 5S rDNA gene is a non-coding RNA that can be found in 2 copies (type I and type II) in bony and cartilaginous fish. Previous studies have pointed out that type II gene is a paralog derived from type I. We analyzed the molecular organization of 5S rDNA type II in elasmobranchs. Although the structure of the 5S rDNA is supposed to be highly conserved, our results show that the secondary structure in this group possesses some variability and is different than the consensus secondary structure. One of these differences in Selachii is an internal loop at nucleotides 7 and 112. These mutations observed in the transcribed region suggest an independent origin of the gene among Batoids and Selachii. All promoters were highly conserved with the exception of BoxA, possibly due to its affinity to polymerase III. This latter enzyme recognizes a dT4 sequence as stop signal, however in Rajiformes this signal was doubled in length to dT8. This could be an adaptation toward a higher efficiency in the termination process. Our results suggest that there is no TATA box in elasmobranchs in the NTS region. We also provide some evidence suggesting that the complexity of the microsatellites present in the NTS region play an important role in the 5S rRNA gene since it is significantly correlated with the length of the NTS.
Molecular organization of the 5S rDNA gene type II in elasmobranchs

PubMed Central

Castro, Sergio I.; Hleap, Jose S.; Cárdenas, Heiber; Blouin, Christian

2016-01-01

ABSTRACT The 5S rDNA gene is a non-coding RNA that can be found in 2 copies (type I and type II) in bony and cartilaginous fish. Previous studies have pointed out that type II gene is a paralog derived from type I. We analyzed the molecular organization of 5S rDNA type II in elasmobranchs. Although the structure of the 5S rDNA is supposed to be highly conserved, our results show that the secondary structure in this group possesses some variability and is different than the consensus secondary structure. One of these differences in Selachii is an internal loop at nucleotides 7 and 112. These mutations observed in the transcribed region suggest an independent origin of the gene among Batoids and Selachii. All promoters were highly conserved with the exception of BoxA, possibly due to its affinity to polymerase III. This latter enzyme recognizes a dT4 sequence as stop signal, however in Rajiformes this signal was doubled in length to dT8. This could be an adaptation toward a higher efficiency in the termination process. Our results suggest that there is no TATA box in elasmobranchs in the NTS region. We also provide some evidence suggesting that the complexity of the microsatellites present in the NTS region play an important role in the 5S rRNA gene since it is significantly correlated with the length of the NTS. PMID:26488198
A Unique cis-Encoded Small Noncoding RNA Is Regulating Legionella pneumophila Hfq Expression in a Life Cycle-Dependent Manner.

PubMed

Oliva, Giulia; Sahr, Tobias; Rolando, Monica; Knoth, Maike; Buchrieser, Carmen

2017-01-10

Legionella pneumophila is an environmental bacterium that parasitizes protozoa, but it may also infect humans, thereby causing a severe pneumonia called Legionnaires' disease. To cycle between the environment and a eukaryotic host, L. pneumophila is regulating the expression of virulence factors in a life cycle-dependent manner: replicating bacteria do not express virulence factors, whereas transmissive bacteria are highly motile and infective. Here we show that Hfq is an important regulator in this network. Hfq is highly expressed in transmissive bacteria but is expressed at very low levels in replicating bacteria. A L. pneumophila hfq deletion mutant exhibits reduced abilities to infect and multiply in Acanthamoeba castellanii at environmental temperatures. The life cycle-dependent regulation of Hfq expression depends on a unique cis-encoded small RNA named Anti-hfq that is transcribed antisense of the hfq transcript and overlaps its 5' untranslated region. The Anti-hfq sRNA is highly expressed only in replicating L. pneumophila where it regulates hfq expression through binding to the complementary regions of the hfq transcripts. This results in reduced Hfq protein levels in exponentially growing cells. Both the small noncoding RNA (sRNA) and hfq mRNA are bound and stabilized by the Hfq protein, likely leading to the cleavage of the RNA duplex by the endoribonuclease RNase III. In contrast, after the switch to transmissive bacteria, the sRNA is not expressed, allowing now an efficient expression of the hfq gene and consequently Hfq. Our results place Hfq and its newly identified sRNA anti-hfq in the center of the regulatory network governing L. pneumophila differentiation from nonvirulent to virulent bacteria. The abilities of L. pneumophila to replicate intracellularly and to cause disease depend on its capacity to adapt to different extra- and intracellular environmental conditions. Therefore, a timely and fine-tuned expression of virulence factors and adaptation traits is crucial. Yet, the regulatory circuits governing the life cycle of L. pneumophila from replicating to virulent bacteria are only partly uncovered. Here we show that the life cycle-dependent regulation of the RNA chaperone Hfq relies on a small regulatory RNA encoded antisense to the hfq-encoding gene through a base pairing mechanism. Furthermore, Hfq regulates its own expression in an autoregulatory loop. The discovery of this RNA regulatory mechanism in L. pneumophila is an important step forward in the understanding of how the switch from inoffensive, replicating to highly virulent, transmissive L. pneumophila is regulated. Copyright © 2017 Oliva et al.
Molecular Regulatory Pathways Link Sepsis With Metabolic Syndrome: Non-coding RNA Elements Underlying the Sepsis/Metabolic Cross-Talk.

PubMed

Meydan, Chanan; Bekenstein, Uriya; Soreq, Hermona

2018-01-01

Sepsis and metabolic syndrome (MetS) are both inflammation-related entities with high impact for human health and the consequences of concussions. Both represent imbalanced parasympathetic/cholinergic response to insulting triggers and variably uncontrolled inflammation that indicates shared upstream regulators, including short microRNAs (miRs) and long non-coding RNAs (lncRNAs). These may cross talk across multiple systems, leading to complex molecular and clinical outcomes. Notably, biomedical and RNA-sequencing based analyses both highlight new links between the acquired and inherited pathogenic, cardiac and inflammatory traits of sepsis/MetS. Those include the HOTAIR and MIAT lncRNAs and their targets, such as miR-122, -150, -155, -182, -197, -375, -608 and HLA-DRA. Implicating non-coding RNA regulators in sepsis and MetS may delineate novel high-value biomarkers and targets for intervention.
Molecular phylogeography of the Andean alpine plant, Gunnera magellanica

NASA Astrophysics Data System (ADS)

Shimizu, M.; Fujii, N.; Ito, M.; Asakawa, T.; Nishida, H.; Suyama, C.; Ueda, K.

2015-12-01

To clarify the evolutionary history of Gunnera magellanica (Gunneraceae), an alpine plant of the Andes mountains, we performed molecular phylogeographic analyses based on the sequences of an internal transcribed spacer (ITS) of nuclear ribosomal DNA and four non-coding regions (trnH-psbA, trnL-trnF, atpB-rbcL, rpl16 intron) of chloroplast DNA. We investigated 3, 4, 4 and 11 populations in, Ecuador, Bolivia, Argentina, and Chile, respectively, and detected six ITS genotypes (Types A-F) in G. magellanica. Five genotypes (Types A-E) were observed in the northern Andes population (Ecuador and Bolivia); only one ITS genotype (Type F) was observed in the southern Andes population (Chile and Argentina). Phylogenetic analyses showed that the ITS genotypes of the northern and southern Andes populations form different clades with high bootstrap probability. Furthermore, network analysis, analysis of molecular variance, and spatial analysis of molecular variance showed that there were two major clusters (the northern and southern Andes populations) in this species. Furthermore, in chloroplast DNA analysis, three major clades (northern Andes, Chillan, and southern Andes) were inferred from phylogenetic analyses using four non-coding regions, a finding that was supported by the above three types of analysis. The Chillan clade is the northernmost population in the southern Andes populations. With the exception of the Chillan clade (Chillan population), results of nuclear DNA and chloroplast DNA analyses were consistent. Both markers showed that the northern and southern Andes populations of G. magellanica were genetically different from each other. This type of clear phylogeographical structure was supported by PERMUT analysis according to Pons & Petit (1995, 1996). Moreover, based on our preliminary estimation that is based on the ITS sequences, the northern and southern Andes clades diverged ~0.63-3 million years ago, during a period of upheaval in the Andes. This suggests that the populations of G. magellanica that were distributed along the Andes have been divided into the two local populations of the northern and southern Andes during the uplift of the Andes.
Regulation of an antisense RNA with the transition of neonatal to IIb myosin heavy chain during postnatal development and hypothyroidism in rat skeletal muscle

PubMed Central

Jiang, Weihua; Qin, Anqi X.; Bodell, Paul W.; Baldwin, Kenneth M.; Haddad, Fadia

2012-01-01

Postnatal development of fast skeletal muscle is characterized by a transition in expression of myosin heavy chain (MHC) isoforms, from primarily neonatal MHC at birth to primarily IIb MHC in adults, in a tightly coordinated manner. These isoforms are encoded by distinct genes, which are separated by ∼17 kb on rat chromosome 10. The neonatal-to-IIb MHC transition is inhibited by a hypothyroid state. We examined RNA products [mRNA, pre-mRNA, and natural antisense transcript (NAT)] of developmental and adult-expressed MHC genes (embryonic, neonatal, I, IIa, IIx, and IIb) at 2, 10, 20, and 40 days after birth in normal and thyroid-deficient rat neonates treated with propylthiouracil. We found that a long noncoding antisense-oriented RNA transcript, termed bII NAT, is transcribed from a site within the IIb-Neo intergenic region and across most of the IIb MHC gene. NATs have previously been shown to mediate transcriptional repression of sense-oriented counterparts. The bII NAT is transcriptionally regulated during postnatal development and in response to hypothyroidism. Evidence for a regulatory mechanism is suggested by an inverse relationship between IIb MHC and bII NAT in normal and hypothyroid-treated muscle. Neonatal MHC transcription is coordinately expressed with bII NAT. A comparative phylogenetic analysis also suggests that bII NAT-mediated regulation has been a conserved trait of placental mammals for most of the eutherian evolutionary history. The evidence in support of the regulatory model implicates long noncoding antisense RNA as a mechanism to coordinate the transition between neonatal and IIb MHC during postnatal development. PMID:22262309
Regulation of an antisense RNA with the transition of neonatal to IIb myosin heavy chain during postnatal development and hypothyroidism in rat skeletal muscle.

PubMed

Pandorf, Clay E; Jiang, Weihua; Qin, Anqi X; Bodell, Paul W; Baldwin, Kenneth M; Haddad, Fadia

2012-04-01

Postnatal development of fast skeletal muscle is characterized by a transition in expression of myosin heavy chain (MHC) isoforms, from primarily neonatal MHC at birth to primarily IIb MHC in adults, in a tightly coordinated manner. These isoforms are encoded by distinct genes, which are separated by ∼17 kb on rat chromosome 10. The neonatal-to-IIb MHC transition is inhibited by a hypothyroid state. We examined RNA products [mRNA, pre-mRNA, and natural antisense transcript (NAT)] of developmental and adult-expressed MHC genes (embryonic, neonatal, I, IIa, IIx, and IIb) at 2, 10, 20, and 40 days after birth in normal and thyroid-deficient rat neonates treated with propylthiouracil. We found that a long noncoding antisense-oriented RNA transcript, termed bII NAT, is transcribed from a site within the IIb-Neo intergenic region and across most of the IIb MHC gene. NATs have previously been shown to mediate transcriptional repression of sense-oriented counterparts. The bII NAT is transcriptionally regulated during postnatal development and in response to hypothyroidism. Evidence for a regulatory mechanism is suggested by an inverse relationship between IIb MHC and bII NAT in normal and hypothyroid-treated muscle. Neonatal MHC transcription is coordinately expressed with bII NAT. A comparative phylogenetic analysis also suggests that bII NAT-mediated regulation has been a conserved trait of placental mammals for most of the eutherian evolutionary history. The evidence in support of the regulatory model implicates long noncoding antisense RNA as a mechanism to coordinate the transition between neonatal and IIb MHC during postnatal development.
Long Non-Coding RNAs: A Novel Paradigm for Toxicology.

PubMed

Dempsey, Joseph L; Cui, Julia Yue

2017-01-01

Long non-coding RNAs (lncRNAs) are over 200 nucleotides in length and are transcribed from the mammalian genome in a tissue-specific and developmentally regulated pattern. There is growing recognition that lncRNAs are novel biomarkers and/or key regulators of toxicological responses in humans and animal models. Lacking protein-coding capacity, the numerous types of lncRNAs possess a myriad of transcriptional regulatory functions that include cis and trans gene expression, transcription factor activity, chromatin remodeling, imprinting, and enhancer up-regulation. LncRNAs also influence mRNA processing, post-transcriptional regulation, and protein trafficking. Dysregulation of lncRNAs has been implicated in various human health outcomes such as various cancers, Alzheimer's disease, cardiovascular disease, autoimmune diseases, as well as intermediary metabolism such as glucose, lipid, and bile acid homeostasis. Interestingly, emerging evidence in the literature over the past five years has shown that lncRNA regulation is impacted by exposures to various chemicals such as polycyclic aromatic hydrocarbons, benzene, cadmium, chlorpyrifos-methyl, bisphenol A, phthalates, phenols, and bile acids. Recent technological advancements, including next-generation sequencing technologies and novel computational algorithms, have enabled the profiling and functional characterizations of lncRNAs on a genomic scale. In this review, we summarize the biogenesis and general biological functions of lncRNAs, highlight the important roles of lncRNAs in human diseases and especially during the toxicological responses to various xenobiotics, evaluate current methods for identifying aberrant lncRNA expression and molecular target interactions, and discuss the potential to implement these tools to address fundamental questions in toxicology. © The Author 2016. Published by Oxford University Press on behalf of the Society of Toxicology. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
The domain structure and distribution of Alu elements in long noncoding RNAs and mRNAs

PubMed Central

Kim, Eugene Z.; Wespiser, Adam R.; Caffrey, Daniel R.

2016-01-01

Approximately 75% of the human genome is transcribed and many of these spliced transcripts contain primate-specific Alu elements, the most abundant mobile element in the human genome. The majority of exonized Alu elements are located in long noncoding RNAs (lncRNAs) and the untranslated regions of mRNA, with some performing molecular functions. To further assess the potential for Alu elements to be repurposed as functional RNA domains, we investigated the distribution and evolution of Alu elements in spliced transcripts. Our analysis revealed that Alu elements are underrepresented in mRNAs and lncRNAs, suggesting that most exonized Alu elements arising in the population are rare or deleterious to RNA function. When mRNAs and lncRNAs retain exonized Alu elements, they have a clear preference for Alu dimers, left monomers, and right monomers. mRNAs often acquire Alu elements when their genes are duplicated within Alu-rich regions. In lncRNAs, reverse-oriented Alu elements are significantly enriched and are not restricted to the 3′ and 5′ ends. Both lncRNAs and mRNAs primarily contain the Alu J and S subfamilies that were amplified relatively early in primate evolution. Alu J subfamilies are typically overrepresented in lncRNAs, whereas the Alu S dimer is overrepresented in mRNAs. The sequences of Alu dimers tend to be constrained in both lncRNAs and mRNAs, whereas the left and right monomers are constrained within particular Alu subfamilies and classes of RNA. Collectively, these findings suggest that Alu-containing RNAs are capable of forming stable structures and that some of these Alu domains might have novel biological functions. PMID:26654912
A long noncoding RNA from the HBS1L-MYB intergenic region on chr6q23 regulates human fetal hemoglobin expression.

PubMed

Morrison, Tasha A; Wilcox, Ibifiri; Luo, Hong-Yuan; Farrell, John J; Kurita, Ryo; Nakamura, Yukio; Murphy, George J; Cui, Shuaiying; Steinberg, Martin H; Chui, David H K

2018-03-01

The HBS1L-MYB intergenic region (chr6q23) regulates erythroid cell proliferation, maturation, and fetal hemoglobin (HbF) expression. An enhancer element within this locus, highlighted by a 3-bp deletion polymorphism (rs66650371), is known to interact with the promoter of the neighboring gene, MYB, to increase its expression, thereby regulating HbF production. RNA polymerase II binding and a 50-bp transcript from this enhancer region reported in ENCODE datasets suggested the presence of a long noncoding RNA (lncRNA). We characterized a novel 1283bp transcript (HMI-LNCRNA; chr6:135,096,362-135,097,644; hg38) that was transcribed from the enhancer region of MYB. Within erythroid cells, HMI-LNCRNA was almost exclusively present in nucleus, and was much less abundant than the mRNA for MYB. HMI-LNCRNA expression was significantly higher in erythroblasts derived from cultured adult peripheral blood CD34 + cells which expressed more HBB, compared to erythroblasts from cultured cord blood CD34 + cells which expressed much more HBG. Down-regulation of HMI-LNCRNA in HUDEP-2 cells, which expressed mostly HBB, significantly upregulated HBG expression both at the mRNA (200-fold) and protein levels, and promoted erythroid maturation. No change was found in the expression of BCL11A and other key transcription factors known to modulate HBG expression. HMI-LNCRNA plays an important role in regulating HBG expression, and its downregulation can result in a significant increase in HbF. HMI-LNCRNA might be a potential therapeutic target for HbF induction treatment in sickle cell disease and β-thalassemia. Copyright © 2017 Elsevier Inc. All rights reserved.
An imprinted non-coding genomic cluster at 14q32 defines clinically relevant molecular subtypes in osteosarcoma across multiple independent datasets.

PubMed

Hill, Katherine E; Kelly, Andrew D; Kuijjer, Marieke L; Barry, William; Rattani, Ahmed; Garbutt, Cassandra C; Kissick, Haydn; Janeway, Katherine; Perez-Atayde, Antonio; Goldsmith, Jeffrey; Gebhardt, Mark C; Arredouani, Mohamed S; Cote, Greg; Hornicek, Francis; Choy, Edwin; Duan, Zhenfeng; Quackenbush, John; Haibe-Kains, Benjamin; Spentzos, Dimitrios

2017-05-15

A microRNA (miRNA) collection on the imprinted 14q32 MEG3 region has been associated with outcome in osteosarcoma. We assessed the clinical utility of this miRNA set and their association with methylation status. We integrated coding and non-coding RNA data from three independent annotated clinical osteosarcoma cohorts (n = 65, n = 27, and n = 25) and miRNA and methylation data from one in vitro (19 cell lines) and one clinical (NCI Therapeutically Applicable Research to Generate Effective Treatments (TARGET) osteosarcoma dataset, n = 80) dataset. We used time-dependent receiver operating characteristic (tdROC) analysis to evaluate the clinical value of candidate miRNA profiles and machine learning approaches to compare the coding and non-coding transcriptional programs of high- and low-risk osteosarcoma tumors and high- versus low-aggressiveness cell lines. In the cell line and TARGET datasets, we also studied the methylation patterns of the MEG3 imprinting control region on 14q32 and their association with miRNA expression and tumor aggressiveness. In the tdROC analysis, miRNA sets on 14q32 showed strong discriminatory power for recurrence and survival in the three clinical datasets. High- or low-risk tumor classification was robust to using different microRNA sets or classification methods. Machine learning approaches showed that genome-wide miRNA profiles and miRNA regulatory networks were quite different between the two outcome groups and mRNA profiles categorized the samples in a manner concordant with the miRNAs, suggesting potential molecular subtypes. Further, miRNA expression patterns were reproducible in comparing high-aggressiveness versus low-aggressiveness cell lines. Methylation patterns in the MEG3 differentially methylated region (DMR) also distinguished high-aggressiveness from low-aggressiveness cell lines and were associated with expression of several 14q32 miRNAs in both the cell lines and the large TARGET clinical dataset. Within the limits of available CpG array coverage, we observed a potential methylation-sensitive regulation of the non-coding RNA cluster by CTCF, a known enhancer-blocking factor. Loss of imprinting/methylation changes in the 14q32 non-coding region defines reproducible previously unrecognized osteosarcoma subtypes with distinct transcriptional programs and biologic and clinical behavior. Future studies will define the precise relationship between 14q32 imprinting, non-coding RNA expression, genomic enhancer binding, and tumor aggressiveness, with possible therapeutic implications for both early- and advanced-stage patients.
An in silico model for identification of small RNAs in whole bacterial genomes: characterization of antisense RNAs in pathogenic Escherichia coli and Streptococcus agalactiae strains.

PubMed

Pichon, Christophe; du Merle, Laurence; Caliot, Marie Elise; Trieu-Cuot, Patrick; Le Bouguénec, Chantal

2012-04-01

Characterization of small non-coding ribonucleic acids (sRNA) among the large volume of data generated by high-throughput RNA-seq or tiling microarray analyses remains a challenge. Thus, there is still a need for accurate in silico prediction methods to identify sRNAs within a given bacterial species. After years of effort, dedicated software were developed based on comparative genomic analyses or mathematical/statistical models. Although these genomic analyses enabled sRNAs in intergenic regions to be efficiently identified, they all failed to predict antisense sRNA genes (asRNA), i.e. RNA genes located on the DNA strand complementary to that which encodes the protein. The statistical models enabled any genomic region to be analyzed theorically but not efficiently. We present a new model for in silico identification of sRNA and asRNA candidates within an entire bacterial genome. This model was successfully used to analyze the Gram-negative Escherichia coli and Gram-positive Streptococcus agalactiae. In both bacteria, numerous asRNAs are transcribed from the complementary strand of genes located in pathogenicity islands, strongly suggesting that these asRNAs are regulators of the virulence expression. In particular, we characterized an asRNA that acted as an enhancer-like regulator of the type 1 fimbriae production involved in the virulence of extra-intestinal pathogenic E. coli.
An in silico model for identification of small RNAs in whole bacterial genomes: characterization of antisense RNAs in pathogenic Escherichia coli and Streptococcus agalactiae strains

PubMed Central

Pichon, Christophe; du Merle, Laurence; Caliot, Marie Elise; Trieu-Cuot, Patrick; Le Bouguénec, Chantal

2012-01-01

Characterization of small non-coding ribonucleic acids (sRNA) among the large volume of data generated by high-throughput RNA-seq or tiling microarray analyses remains a challenge. Thus, there is still a need for accurate in silico prediction methods to identify sRNAs within a given bacterial species. After years of effort, dedicated software were developed based on comparative genomic analyses or mathematical/statistical models. Although these genomic analyses enabled sRNAs in intergenic regions to be efficiently identified, they all failed to predict antisense sRNA genes (asRNA), i.e. RNA genes located on the DNA strand complementary to that which encodes the protein. The statistical models enabled any genomic region to be analyzed theorically but not efficiently. We present a new model for in silico identification of sRNA and asRNA candidates within an entire bacterial genome. This model was successfully used to analyze the Gram-negative Escherichia coli and Gram-positive Streptococcus agalactiae. In both bacteria, numerous asRNAs are transcribed from the complementary strand of genes located in pathogenicity islands, strongly suggesting that these asRNAs are regulators of the virulence expression. In particular, we characterized an asRNA that acted as an enhancer-like regulator of the type 1 fimbriae production involved in the virulence of extra-intestinal pathogenic E. coli. PMID:22139924
Development and utilization of novel intron length polymorphic markers in foxtail millet (Setaria italica (L.) P. Beauv.).

PubMed

Gupta, Sarika; Kumari, Kajal; Das, Jyotirmoy; Lata, Charu; Puranik, Swati; Prasad, Manoj

2011-07-01

Introns are noncoding sequences in a gene that are transcribed to precursor mRNA but spliced out during mRNA maturation and are abundant in eukaryotic genomes. The availability of codominant molecular markers and saturated genetic linkage maps have been limited in foxtail millet (Setaria italica (L.) P. Beauv.). Here, we describe the development of 98 novel intron length polymorphic (ILP) markers in foxtail millet using sequence information of the model plant rice. A total of 575 nonredundant expressed sequence tag (EST) sequences were obtained, of which 327 and 248 unique sequences were from dehydration- and salinity-stressed suppression subtractive hybridization libraries, respectively. The BLAST analysis of 98 EST sequences suggests a nearly defined function for about 64% of them, and they were grouped into 11 different functional categories. All 98 ILP primer pairs showed a high level of cross-species amplification in two millets and two nonmillets species ranging from 90% to 100%, with a mean of ∼97%. The mean observed heterozygosity and Nei's average gene diversity 0.016 and 0.171, respectively, established the efficiency of the ILP markers for distinguishing the foxtail millet accessions. Based on 26 ILP markers, a reasonable dendrogram of 45 foxtail millet accessions was constructed, demonstrating the utility of ILP markers in germplasm characterizations and genomic relationships in millets and nonmillets species.

Non coding RNAs in vascular disease - from basic science to clinical applications: Scientific update from the Working Group of Myocardial Function of the European Society of Cardiology

PubMed

Fiedler, Jan; Baker, Andrew H; Dimmeler, Stefanie; Heymans, Stephane; Mayr, Manuel; Thum, Thomas

2018-05-23

Non-coding RNAs are increasingly recognized not only as regulators of various biological functions but also as targets for a new generation of RNA therapeutics and biomarkers. We hereby review recent insights relating to non-coding RNAs including microRNAs (e.g. miR-126, miR-146a), long non-coding RNAs (e.g. MIR503HG, GATA6-AS, SMILR) and circular RNAs (e.g. cZNF292) and their role in vascular diseases. This includes identification and therapeutic use of hypoxia-regulated non-coding RNAs and endogenous non-coding RNAs that regulate intrinsic smooth muscle cell signalling, age-related non-coding RNAs and non-coding RNAs involved in the regulation of mitochondrial biology and metabolic control. Finally, we discuss non-coding RNA species with biomarker potential.
Novel insights into the response of Atlantic salmon (Salmo salar) to Piscirickettsia salmonis: Interplay of coding genes and lncRNAs during bacterial infection.

PubMed

Valenzuela-Miranda, Diego; Gallardo-Escárate, Cristian

2016-12-01

Despite the high prevalence and impact to Chilean salmon aquaculture of the intracellular bacterium Piscirickettsia salmonis, the molecular underpinnings of host-pathogen interactions remain unclear. Herein, the interplay of coding and non-coding transcripts has been proposed as a key mechanism involved in immune response. Therefore, the aim of this study was to evidence how coding and non-coding transcripts are modulated during the infection process of Atlantic salmon with P. salmonis. For this, RNA-seq was conducted in brain, spleen, and head kidney samples, revealing different transcriptional profiles according to bacterial load. Additionally, while most of the regulated genes annotated for diverse biological processes during infection, a common response associated with clathrin-mediated endocytosis and iron homeostasis was present in all tissues. Interestingly, while endocytosis-promoting factors and clathrin inductions were upregulated, endocytic receptors were mainly downregulated. Furthermore, the regulation of genes related to iron homeostasis suggested an intracellular accumulation of iron, a process in which heme biosynthesis/degradation pathways might play an important role. Regarding the non-coding response, 918 putative long non-coding RNAs were identified, where 425 were newly characterized for S. salar. Finally, co-localization and co-expression analyses revealed a strong correlation between the modulations of long non-coding RNAs and genes associated with endocytosis and iron homeostasis. These results represent the first comprehensive study of putative interplaying mechanisms of coding and non-coding RNAs during bacterial infection in salmonids. Copyright Â© 2016 Elsevier Ltd. All rights reserved.
Identification of Novel Long Non-coding and Circular RNAs in Human Papillomavirus-Mediated Cervical Cancer

PubMed Central

Wang, Hongbo; Zhao, Yingchao; Chen, Mingyue; Cui, Jie

2017-01-01

Cervical cancer is the third most common cancer worldwide and the fourth leading cause of cancer-associated mortality in women. Accumulating evidence indicates that long non-coding RNAs (lncRNAs) and circular RNAs (circRNAs) may play key roles in the carcinogenesis of different cancers; however, little is known about the mechanisms of lncRNAs and circRNAs in the progression and metastasis of cervical cancer. In this study, we explored the expression profiles of lncRNAs, circRNAs, miRNAs, and mRNAs in HPV16 (human papillomavirus genotype 16) mediated cervical squamous cell carcinoma and matched adjacent non-tumor (ATN) tissues from three patients with high-throughput RNA sequencing (RNA-seq). In total, we identified 19 lncRNAs, 99 circRNAs, 28 miRNAs, and 304 mRNAs that were commonly differentially expressed (DE) in different patients. Among the non-coding RNAs, 3 lncRNAs and 44 circRNAs are novel to our knowledge. Functional enrichment analysis showed that DE lncRNAs, miRNAs, and mRNAs were enriched in pathways crucial to cancer as well as other gene ontology (GO) terms. Furthermore, the co-expression network and function prediction suggested that all 19 DE lncRNAs could play different roles in the carcinogenesis and development of cervical cancer. The competing endogenous RNA (ceRNA) network based on DE coding and non-coding RNAs showed that each miRNA targeted a number of lncRNAs and circRNAs. The link between part of the miRNAs in the network and cervical cancer has been validated in previous studies, and these miRNAs targeted the majority of the novel non-coding RNAs, thus suggesting that these novel non-coding RNAs may be involved in cervical cancer. Taken together, our study shows that DE non-coding RNAs could be further developed as diagnostic and therapeutic biomarkers of cervical cancer. The complex ceRNA network also lays the foundation for future research of the roles of coding and non-coding RNAs in cervical cancer. PMID:28970820
Chromatin structure of the LCR in the human β-globin locus transcribing the adult δ- and β-globin genes.

PubMed

Kim, Seoyeon; Kim, Yea Woon; Shim, Sung Han; Kim, Chul Geun; Kim, Aeri

2012-03-01

The β-like globin genes are transcribed in a developmental stage specific fashion in erythroid cells. The specific transcription of globin genes is conferred by the locus control region (LCR), but the chromatin structure of the LCR in the human adult β-globin locus transcribing the δ- and β-globin genes is not clear. Here, we employed hybrid MEL cells that contain a human chromosome 11. The δ- and β-globin genes were highly transcribed in hybrid MEL/ch11 cells after transcriptional induction. LCR HS3 and HS2 were strongly occupied by erythroid specific transcriptional activators and co-factors in the induced locus. These HSs, but not HS4 and HS1, were in close proximity with the active globin genes as revealed by high resolution 3C experiments. The active features at HS3 were markedly established after transcriptional induction, while HS2 was in a relatively active conformation before the induction. Unexpectedly, HS1 did not show notable active features except histone hyperacetylation. Taken together, the LCR of the human β-globin locus transcribing the adult δ- and β-globin genes has HS specific chromatin structure. The structure at each HS, which is different from the locus transcribing the fetal globin genes, might relate to its role in transcribing the adult genes. Copyright © 2011 Elsevier Ltd. All rights reserved.
Identification and role of regulatory non-coding RNAs in Listeria monocytogenes.

PubMed

Izar, Benjamin; Mraheil, Mobarak Abu; Hain, Torsten

2011-01-01

Bacterial regulatory non-coding RNAs control numerous mRNA targets that direct a plethora of biological processes, such as the adaption to environmental changes, growth and virulence. Recently developed high-throughput techniques, such as genomic tiling arrays and RNA-Seq have allowed investigating prokaryotic cis- and trans-acting regulatory RNAs, including sRNAs, asRNAs, untranslated regions (UTR) and riboswitches. As a result, we obtained a more comprehensive view on the complexity and plasticity of the prokaryotic genome biology. Listeria monocytogenes was utilized as a model system for intracellular pathogenic bacteria in several studies, which revealed the presence of about 180 regulatory RNAs in the listerial genome. A regulatory role of non-coding RNAs in survival, virulence and adaptation mechanisms of L. monocytogenes was confirmed in subsequent experiments, thus, providing insight into a multifaceted modulatory function of RNA/mRNA interference. In this review, we discuss the identification of regulatory RNAs by high-throughput techniques and in their functional role in L. monocytogenes.
Long Non-Coding RNAs Differentially Expressed between Normal versus Primary Breast Tumor Tissues Disclose Converse Changes to Breast Cancer-Related Protein-Coding Genes

PubMed Central

Reiche, Kristin; Kasack, Katharina; Schreiber, Stephan; Lüders, Torben; Due, Eldri U.; Naume, Bjørn; Riis, Margit; Kristensen, Vessela N.; Horn, Friedemann; Børresen-Dale, Anne-Lise; Hackermüller, Jörg; Baumbusch, Lars O.

2014-01-01

Breast cancer, the second leading cause of cancer death in women, is a highly heterogeneous disease, characterized by distinct genomic and transcriptomic profiles. Transcriptome analyses prevalently assessed protein-coding genes; however, the majority of the mammalian genome is expressed in numerous non-coding transcripts. Emerging evidence supports that many of these non-coding RNAs are specifically expressed during development, tumorigenesis, and metastasis. The focus of this study was to investigate the expression features and molecular characteristics of long non-coding RNAs (lncRNAs) in breast cancer. We investigated 26 breast tumor and 5 normal tissue samples utilizing a custom expression microarray enclosing probes for mRNAs as well as novel and previously identified lncRNAs. We identified more than 19,000 unique regions significantly differentially expressed between normal versus breast tumor tissue, half of these regions were non-coding without any evidence for functional open reading frames or sequence similarity to known proteins. The identified non-coding regions were primarily located in introns (53%) or in the intergenic space (33%), frequently orientated in antisense-direction of protein-coding genes (14%), and commonly distributed at promoter-, transcription factor binding-, or enhancer-sites. Analyzing the most diverse mRNA breast cancer subtypes Basal-like versus Luminal A and B resulted in 3,025 significantly differentially expressed unique loci, including 682 (23%) for non-coding transcripts. A notable number of differentially expressed protein-coding genes displayed non-synonymous expression changes compared to their nearest differentially expressed lncRNA, including an antisense lncRNA strongly anticorrelated to the mRNA coding for histone deacetylase 3 (HDAC3), which was investigated in more detail. Previously identified chromatin-associated lncRNAs (CARs) were predominantly downregulated in breast tumor samples, including CARs located in the protein-coding genes for CALD1, FTX, and HNRNPH1. In conclusion, a number of differentially expressed lncRNAs have been identified with relation to cancer-related protein-coding genes. PMID:25264628
Long non-coding RNAs differentially expressed between normal versus primary breast tumor tissues disclose converse changes to breast cancer-related protein-coding genes.

PubMed

Reiche, Kristin; Kasack, Katharina; Schreiber, Stephan; Lüders, Torben; Due, Eldri U; Naume, Bjørn; Riis, Margit; Kristensen, Vessela N; Horn, Friedemann; Børresen-Dale, Anne-Lise; Hackermüller, Jörg; Baumbusch, Lars O

2014-01-01

Breast cancer, the second leading cause of cancer death in women, is a highly heterogeneous disease, characterized by distinct genomic and transcriptomic profiles. Transcriptome analyses prevalently assessed protein-coding genes; however, the majority of the mammalian genome is expressed in numerous non-coding transcripts. Emerging evidence supports that many of these non-coding RNAs are specifically expressed during development, tumorigenesis, and metastasis. The focus of this study was to investigate the expression features and molecular characteristics of long non-coding RNAs (lncRNAs) in breast cancer. We investigated 26 breast tumor and 5 normal tissue samples utilizing a custom expression microarray enclosing probes for mRNAs as well as novel and previously identified lncRNAs. We identified more than 19,000 unique regions significantly differentially expressed between normal versus breast tumor tissue, half of these regions were non-coding without any evidence for functional open reading frames or sequence similarity to known proteins. The identified non-coding regions were primarily located in introns (53%) or in the intergenic space (33%), frequently orientated in antisense-direction of protein-coding genes (14%), and commonly distributed at promoter-, transcription factor binding-, or enhancer-sites. Analyzing the most diverse mRNA breast cancer subtypes Basal-like versus Luminal A and B resulted in 3,025 significantly differentially expressed unique loci, including 682 (23%) for non-coding transcripts. A notable number of differentially expressed protein-coding genes displayed non-synonymous expression changes compared to their nearest differentially expressed lncRNA, including an antisense lncRNA strongly anticorrelated to the mRNA coding for histone deacetylase 3 (HDAC3), which was investigated in more detail. Previously identified chromatin-associated lncRNAs (CARs) were predominantly downregulated in breast tumor samples, including CARs located in the protein-coding genes for CALD1, FTX, and HNRNPH1. In conclusion, a number of differentially expressed lncRNAs have been identified with relation to cancer-related protein-coding genes.
Non-coding RNAs: new biomarkers and therapeutic targets for esophageal cancer

PubMed Central

Ren, Zhipeng; Zhang, Guoliang

2017-01-01

Esophageal cancer is one of the most common gastrointestinal malignant diseases and there is still no effective treatment. The incidence of esophageal cancer in the world is relatively high and on the increase year by year. Thus, the elaboration on the carcinogenesis of esophageal cancer and the identification of new biomarkers and therapeutic targets is quite beneficial to optimizing the current therapeutic regimen for treating such deadly disease. More and more evidence has shown that non-coding RNAs play an important role in the development and progression of multiple human cancers, including esophageal cancer. microRNAs (miRNAs) and long non-coding RNAs (lncRNAs) are two functional kinds of non-coding RNAs that have been well investigated. They exert tumor suppressive or promoting effect by specifically regulating the expression of certain downstream target genes, which is tumor specific. It is also proved that miRNAs and lncRNAs level in tissue and plasma from esophageal cancer patients are closely correlated with the survival and disease progression, which could be used as a prognostic factor and therapeutic target for esophageal cancer. PMID:28388588
Non-coding RNAs: new biomarkers and therapeutic targets for esophageal cancer.

PubMed

Hou, Xiaobin; Wen, Jiaxin; Ren, Zhipeng; Zhang, Guoliang

2017-06-27

Esophageal cancer is one of the most common gastrointestinal malignant diseases and there is still no effective treatment. The incidence of esophageal cancer in the world is relatively high and on the increase year by year. Thus, the elaboration on the carcinogenesis of esophageal cancer and the identification of new biomarkers and therapeutic targets is quite beneficial to optimizing the current therapeutic regimen for treating such deadly disease. More and more evidence has shown that non-coding RNAs play an important role in the development and progression of multiple human cancers, including esophageal cancer. microRNAs (miRNAs) and long non-coding RNAs (lncRNAs) are two functional kinds of non-coding RNAs that have been well investigated. They exert tumor suppressive or promoting effect by specifically regulating the expression of certain downstream target genes, which is tumor specific. It is also proved that miRNAs and lncRNAs level in tissue and plasma from esophageal cancer patients are closely correlated with the survival and disease progression, which could be used as a prognostic factor and therapeutic target for esophageal cancer.
Long Noncoding RNA-Associated Transcriptomic Changes in Resiliency or Susceptibility to Depression and Response to Antidepressant Treatment

PubMed Central

Roy, Bhaskar; Wang, Qingzhong; Dwivedi, Yogesh

2018-01-01

Abstract Background Recent emergence of long noncoding RNAs in regulating gene expression and thereby modulating physiological functions in brain has manifested their possible role in psychiatric disorders. In this study, the roles of long noncoding RNAs in susceptibility and resiliency to develop stress-induced depression and their response to antidepressant treatment were examined. Methods Microarray-based transcriptome-wide changes in long noncoding RNAs were determined in hippocampus of male Holtzman rats who showed susceptibility (learned helplessness) or resiliency (nonlearned helplessness) to develop depression. Changes in long noncoding RNA expression were also ascertained after subchronic administration of fluoxetine to learned helplessness rats. Bioinformatic and target prediction analyses (cis- and trans-acting) and qPCR-based assays were performed to decipher the functional role of altered long noncoding RNAs. Results Group-wise comparison showed an overrepresented class of long noncoding RNAs that were uniquely associated with nonlearned helplessness or learned helplessness behavior. Chromosomal mapping within the 5-kbp flank region of the top 20 dysregulated long noncoding RNAs in the learned helplessness group showed several target genes that were regulated through cis- or trans-actions, including Zbtb20 and Zfp385b from zinc finger binding protein family. Genomic context of differentially expressed long noncoding RNAs showed an overall blunted response in the learned helplessness group regardless of the long noncoding RNA classes analyzed. Gene ontology exhibited the functional clustering for anatomical structure development, cellular architecture modulation, protein metabolism, and cellular communications. Fluoxetine treatment reversed learned helplessness-induced changes in many long noncoding RNAs and target genes. Conclusions The involvement of specific classes of long noncoding RNAs with distinctive roles in modulating target gene expression could confer the role of long noncoding RNAs in resiliency or susceptibility to develop depression with a reciprocal response to antidepressant treatment. PMID:29390069
Unit-length line-1 transcripts in human teratocarcinoma cells.

PubMed Central

Skowronski, J; Fanning, T G; Singer, M F

1988-01-01

We have characterized the approximately 6.5-kilobase cytoplasmic poly(A)+ Line-1 (L1) RNA present in a human teratocarcinoma cell line, NTera2D1, by primer extension and by analysis of cloned cDNAs. The bulk of the RNA begins (5' end) at the residue previously identified as the 5' terminus of the longest known primate genomic L1 elements, presumed to represent "unit" length. Several of the cDNA clones are close to 6 kilobase pairs, that is, close to full length. The partial sequences of 18 cDNA clones and full sequence of one (5,975 base pairs) indicate that many different genomic L1 elements contribute transcripts to the 6.5-kilobase cytoplasmic poly(A)+ RNA in NTera2D1 cells because no 2 of the 19 cDNAs analyzed had identical sequences. The transcribed elements appear to represent a subset of the total genomic L1s, a subset that has a characteristic consensus sequence in the 3' noncoding region and a high degree of sequence conservation throughout. Two open reading frames (ORFs) of 1,122 (ORF1) and 3,852 (ORF2) bases, flanked by about 800 and 200 bases of sequence at the 5' and 3' ends, respectively, can be identified in the cDNAs. Both ORFs are in the same frame, and they are separated by 33 bases bracketed by two conserved in-frame stop codons. ORF 2 is interrupted by at least one randomly positioned stop codon in the majority of the cDNAs. The data support proposals suggesting that the human L1 family includes one or more functional genes as well as an extraordinarily large number of pseudogenes whose ORFs are broken by stop codons. The cDNA structures suggest that both genes and pseudogenes are transcribed. At least one of the cDNAs (cD11), which was sequenced in its entirety, could, in principle, represent an mRNA for production of the ORF1 polypeptide. The similarity of mammalian L1s to several recently described invertebrate movable elements defines a new widely distributed class of elements which we term class II retrotransposons. Images PMID:2454389
DNA barcode and identification of the varieties and provenances of Taiwan's domestic and imported made teas using ribosomal internal transcribed spacer 2 sequences.

PubMed

Lee, Shih-Chieh; Wang, Chia-Hsiang; Yen, Cheng-En; Chang, Chieh

2017-04-01

The major aim of made tea identification is to identify the variety and provenance of the tea plant. The present experiment used 113 tea plants [Camellia sinensis (L.) O. Kuntze] housed at the Tea Research and Extension Substation, from which 113 internal transcribed spacer 2 (ITS2) fragments, 104 trnL intron, and 98 trnL-trnF intergenic sequence region DNA sequences were successfully sequenced. The similarity of the ITS2 nucleotide sequences between tea plants housed at the Tea Research and Extension Substation was 0.379-0.994. In this polymerase chain reaction-amplified noncoding region, no varieties possessed identical sequences. Compared with the trnL intron and trnL-trnF intergenic sequence fragments of chloroplast cpDNA, the proportion of ITS2 nucleotide sequence variation was large and is more suitable for establishing a DNA barcode database to identify tea plant varieties. After establishing the database, 30 imported teas and 35 domestic made teas were used in this model system to explore the feasibility of using ITS2 sequences to identify the varieties and provenances of made teas. A phylogenetic tree was constructed using ITS2 sequences with the unweighted pair group method with arithmetic mean, which indicated that the same variety of tea plant is likely to be successfully categorized into one cluster, but contamination from other tea plants was also detected. This result provides molecular evidence that the similarity between important tea varieties in Taiwan remains high. We suggest a direct, wide collection of made tea and original samples of tea plants to establish an ITS2 sequence molecular barcode identification database to identify the varieties and provenances of tea plants. The DNA barcode comparison method can satisfy the need for a rapid, low-cost, frontline differentiation of the large amount of made teas from Taiwan and abroad, and can provide molecular evidence of their varieties and provenances. Copyright © 2016. Published by Elsevier B.V.
A universal genomic coordinate translator for comparative genomics

PubMed Central

2014-01-01

Background Genomic duplications constitute major events in the evolution of species, allowing paralogous copies of genes to take on fine-tuned biological roles. Unambiguously identifying the orthology relationship between copies across multiple genomes can be resolved by synteny, i.e. the conserved order of genomic sequences. However, a comprehensive analysis of duplication events and their contributions to evolution would require all-to-all genome alignments, which increases at N2 with the number of available genomes, N. Results Here, we introduce Kraken, software that omits the all-to-all requirement by recursively traversing a graph of pairwise alignments and dynamically re-computing orthology. Kraken scales linearly with the number of targeted genomes, N, which allows for including large numbers of genomes in analyses. We first evaluated the method on the set of 12 Drosophila genomes, finding that orthologous correspondence computed indirectly through a graph of multiple synteny maps comes at minimal cost in terms of sensitivity, but reduces overall computational runtime by an order of magnitude. We then used the method on three well-annotated mammalian genomes, human, mouse, and rat, and show that up to 93% of protein coding transcripts have unambiguous pairwise orthologous relationships across the genomes. On a nucleotide level, 70 to 83% of exons match exactly at both splice junctions, and up to 97% on at least one junction. We last applied Kraken to an RNA-sequencing dataset from multiple vertebrates and diverse tissues, where we confirmed that brain-specific gene family members, i.e. one-to-many or many-to-many homologs, are more highly correlated across species than single-copy (i.e. one-to-one homologous) genes. Not limited to protein coding genes, Kraken also identifies thousands of newly identified transcribed loci, likely non-coding RNAs that are consistently transcribed in human, chimpanzee and gorilla, and maintain significant correlation of expression levels across species. Conclusions Kraken is a computational genome coordinate translator that facilitates cross-species comparisons, distinguishes orthologs from paralogs, and does not require costly all-to-all whole genome mappings. Kraken is freely available under LPGL from http://github.com/nedaz/kraken. PMID:24976580
A universal genomic coordinate translator for comparative genomics.

PubMed

Zamani, Neda; Sundström, Görel; Meadows, Jennifer R S; Höppner, Marc P; Dainat, Jacques; Lantz, Henrik; Haas, Brian J; Grabherr, Manfred G

2014-06-30

Genomic duplications constitute major events in the evolution of species, allowing paralogous copies of genes to take on fine-tuned biological roles. Unambiguously identifying the orthology relationship between copies across multiple genomes can be resolved by synteny, i.e. the conserved order of genomic sequences. However, a comprehensive analysis of duplication events and their contributions to evolution would require all-to-all genome alignments, which increases at N2 with the number of available genomes, N. Here, we introduce Kraken, software that omits the all-to-all requirement by recursively traversing a graph of pairwise alignments and dynamically re-computing orthology. Kraken scales linearly with the number of targeted genomes, N, which allows for including large numbers of genomes in analyses. We first evaluated the method on the set of 12 Drosophila genomes, finding that orthologous correspondence computed indirectly through a graph of multiple synteny maps comes at minimal cost in terms of sensitivity, but reduces overall computational runtime by an order of magnitude. We then used the method on three well-annotated mammalian genomes, human, mouse, and rat, and show that up to 93% of protein coding transcripts have unambiguous pairwise orthologous relationships across the genomes. On a nucleotide level, 70 to 83% of exons match exactly at both splice junctions, and up to 97% on at least one junction. We last applied Kraken to an RNA-sequencing dataset from multiple vertebrates and diverse tissues, where we confirmed that brain-specific gene family members, i.e. one-to-many or many-to-many homologs, are more highly correlated across species than single-copy (i.e. one-to-one homologous) genes. Not limited to protein coding genes, Kraken also identifies thousands of newly identified transcribed loci, likely non-coding RNAs that are consistently transcribed in human, chimpanzee and gorilla, and maintain significant correlation of expression levels across species. Kraken is a computational genome coordinate translator that facilitates cross-species comparisons, distinguishes orthologs from paralogs, and does not require costly all-to-all whole genome mappings. Kraken is freely available under LPGL from http://github.com/nedaz/kraken.
Conserved expression of transposon-derived non-coding transcripts in primate stem cells.

PubMed

Ramsay, LeeAnn; Marchetto, Maria C; Caron, Maxime; Chen, Shu-Huang; Busche, Stephan; Kwan, Tony; Pastinen, Tomi; Gage, Fred H; Bourque, Guillaume

2017-02-28

A significant portion of expressed non-coding RNAs in human cells is derived from transposable elements (TEs). Moreover, it has been shown that various long non-coding RNAs (lncRNAs), which come from the human endogenous retrovirus subfamily H (HERVH), are not only expressed but required for pluripotency in human embryonic stem cells (hESCs). To identify additional TE-derived functional non-coding transcripts, we generated RNA-seq data from induced pluripotent stem cells (iPSCs) of four primate species (human, chimpanzee, gorilla, and rhesus) and searched for transcripts whose expression was conserved. We observed that about 30% of TE instances expressed in human iPSCs had orthologous TE instances that were also expressed in chimpanzee and gorilla. Notably, our analysis revealed a number of repeat families with highly conserved expression profiles including HERVH but also MER53, which is known to be the source of a placental-specific family of microRNAs (miRNAs). We also identified a number of repeat families from all classes of TEs, including MLT1-type and Tigger families, that contributed a significant amount of sequence to primate lncRNAs whose expression was conserved. Together, these results describe TE families and TE-derived lncRNAs whose conserved expression patterns can be used to identify what are likely functional TE-derived non-coding transcripts in primate iPSCs.
Noncoding copy-number variations are associated with congenital limb malformation.

PubMed

Flöttmann, Ricarda; Kragesteen, Bjørt K; Geuer, Sinje; Socha, Magdalena; Allou, Lila; Sowińska-Seidler, Anna; Bosquillon de Jarcy, Laure; Wagner, Johannes; Jamsheer, Aleksander; Oehl-Jaschkowitz, Barbara; Wittler, Lars; de Silva, Deepthi; Kurth, Ingo; Maya, Idit; Santos-Simarro, Fernando; Hülsemann, Wiebke; Klopocki, Eva; Mountford, Roger; Fryer, Alan; Borck, Guntram; Horn, Denise; Lapunzina, Pablo; Wilson, Meredith; Mascrez, Bénédicte; Duboule, Denis; Mundlos, Stefan; Spielmann, Malte

2017-10-12

PurposeCopy-number variants (CNVs) are generally interpreted by linking the effects of gene dosage with phenotypes. The clinical interpretation of noncoding CNVs remains challenging. We investigated the percentage of disease-associated CNVs in patients with congenital limb malformations that affect noncoding cis-regulatory sequences versus genes sensitive to gene dosage effects.MethodsWe applied high-resolution copy-number analysis to 340 unrelated individuals with isolated limb malformation. To investigate novel candidate CNVs, we re-engineered human CNVs in mice using clustered regularly interspaced short palindromic repeats (CRISPR)-based genome editing.ResultsOf the individuals studied, 10% harbored CNVs segregating with the phenotype in the affected families. We identified 31 CNVs previously associated with congenital limb malformations and four novel candidate CNVs. Most of the disease-associated CNVs (57%) affected the noncoding cis-regulatory genome, while only 43% included a known disease gene and were likely to result from gene dosage effects. In transgenic mice harboring four novel candidate CNVs, we observed altered gene expression in all cases, indicating that the CNVs had a regulatory effect either by changing the enhancer dosage or altering the topological associating domain architecture of the genome.ConclusionOur findings suggest that CNVs affecting noncoding regulatory elements are a major cause of congenital limb malformations.Genetics in Medicine advance online publication, 12 October 2017; doi:10.1038/gim.2017.154.
Noncoding sequence classification based on wavelet transform analysis: part I

NASA Astrophysics Data System (ADS)

Paredes, O.; Strojnik, M.; Romo-Vázquez, R.; Vélez Pérez, H.; Ranta, R.; Garcia-Torales, G.; Scholl, M. K.; Morales, J. A.

2017-09-01

DNA sequences in human genome can be divided into the coding and noncoding ones. Coding sequences are those that are read during the transcription. The identification of coding sequences has been widely reported in literature due to its much-studied periodicity. Noncoding sequences represent the majority of the human genome. They play an important role in gene regulation and differentiation among the cells. However, noncoding sequences do not exhibit periodicities that correlate to their functions. The ENCODE (Encyclopedia of DNA elements) and Epigenomic Roadmap Project projects have cataloged the human noncoding sequences into specific functions. We study characteristics of noncoding sequences with wavelet analysis of genomic signals.
Rapid identification of fungal pathogens in BacT/ALERT, BACTEC, and BBL MGIT media using polymerase chain reaction and DNA sequencing of the internal transcribed spacer regions.

PubMed

Pryce, Todd M; Palladino, Silvano; Price, Diane M; Gardam, Dianne J; Campbell, Peter B; Christiansen, Keryn J; Murray, Ronan J

2006-04-01

We report a direct polymerase chain reaction/sequence (d-PCRS)-based method for the rapid identification of clinically significant fungi from 5 different types of commercial broth enrichment media inoculated with clinical specimens. Media including BacT/ALERT FA (BioMérieux, Marcy l'Etoile, France) (n = 87), BACTEC Plus Aerobic/F (Becton Dickinson, Microbiology Systems, Sparks, MD) (n = 16), BACTEC Peds Plus/F (Becton Dickinson) (n = 15), BACTEC Lytic/10 Anaerobic/F (Becton Dickinson) (n = 11) bottles, and BBL MGIT (Becton Dickinson) (n = 11) were inoculated with specimens from 138 patients. A universal DNA extraction method was used combining a novel pretreatment step to remove PCR inhibitors with a column-based DNA extraction kit. Target sequences in the noncoding internal transcribed spacer regions of the rRNA gene were amplified by PCR and sequenced using a rapid (24 h) automated capillary electrophoresis system. Using sequence alignment software, fungi were identified by sequence similarity with sequences derived from isolates identified by upper-level reference laboratories or isolates defined as ex-type strains. We identified Candida albicans (n = 14), Candida parapsilosis (n = 8), Candida glabrata (n = 7), Candida krusei (n = 2), Scedosporium prolificans (n = 4), and 1 each of Candida orthopsilosis, Candida dubliniensis, Candida kefyr, Candida tropicalis, Candida guilliermondii, Saccharomyces cerevisiae, Cryptococcus neoformans, Aspergillus fumigatus, Histoplasma capsulatum, and Malassezia pachydermatis by d-PCRS analysis. All d-PCRS identifications from positive broths were in agreement with the final species identification of the isolates grown from subculture. Earlier identification of fungi using d-PCRS may facilitate prompt and more appropriate antifungal therapy.
Transcriptome profiling of Nasonia vitripennis testis reveals novel transcripts expressed from the selfish B chromosome, paternal sex ratio.

PubMed

Akbari, Omar S; Antoshechkin, Igor; Hay, Bruce A; Ferree, Patrick M

2013-09-04

A widespread phenomenon in nature is sex ratio distortion of arthropod populations caused by microbial and genetic parasites. Currently little is known about how these agents alter host developmental processes to favor one sex or the other. The paternal sex ratio (PSR) chromosome is a nonessential, paternally transmitted centric fragment that segregates in natural populations of the jewel wasp, Nasonia vitripennis. To persist, PSR is thought to modify the hereditary material of the developing sperm, with the result that all nuclear DNA other than the PSR chromosome is destroyed shortly after fertilization. This results in the conversion of a fertilized embryo--normally a female--into a male, thereby insuring transmission of the "selfish" PSR chromosome, and simultaneously leading to wasp populations that are male-biased. To begin to understand this system at the mechanistic level, we carried out transcriptional profiling of testis from WT and PSR-carrying males. We identified a number of transcripts that are differentially expressed between these conditions. We also discovered nine transcripts that are uniquely expressed from the PSR chromosome. Four of these PSR-specific transcripts encode putative proteins, whereas the others have very short open reading frames and no homology to known proteins, suggesting that they are long noncoding RNAs. We propose several different models for how these transcripts could facilitate PSR-dependent effects. Our analyses also revealed 15.71 MB of novel transcribed regions in the N. vitripennis genome, thus increasing the current annotation of total transcribed regions by 53.4%. Finally, we detected expression of multiple meiosis-related genes in the wasp testis, despite the lack of conventional meiosis in the male sex.
Transcriptome Profiling of Nasonia vitripennis Testis Reveals Novel Transcripts Expressed from the Selfish B Chromosome, Paternal Sex Ratio

PubMed Central

Akbari, Omar S.; Antoshechkin, Igor; Hay, Bruce A.; Ferree, Patrick M.

2013-01-01

A widespread phenomenon in nature is sex ratio distortion of arthropod populations caused by microbial and genetic parasites. Currently little is known about how these agents alter host developmental processes to favor one sex or the other. The paternal sex ratio (PSR) chromosome is a nonessential, paternally transmitted centric fragment that segregates in natural populations of the jewel wasp, Nasonia vitripennis. To persist, PSR is thought to modify the hereditary material of the developing sperm, with the result that all nuclear DNA other than the PSR chromosome is destroyed shortly after fertilization. This results in the conversion of a fertilized embryo—normally a female—into a male, thereby insuring transmission of the “selfish” PSR chromosome, and simultaneously leading to wasp populations that are male-biased. To begin to understand this system at the mechanistic level, we carried out transcriptional profiling of testis from WT and PSR-carrying males. We identified a number of transcripts that are differentially expressed between these conditions. We also discovered nine transcripts that are uniquely expressed from the PSR chromosome. Four of these PSR-specific transcripts encode putative proteins, whereas the others have very short open reading frames and no homology to known proteins, suggesting that they are long noncoding RNAs. We propose several different models for how these transcripts could facilitate PSR-dependent effects. Our analyses also revealed 15.71 MB of novel transcribed regions in the N. vitripennis genome, thus increasing the current annotation of total transcribed regions by 53.4%. Finally, we detected expression of multiple meiosis-related genes in the wasp testis, despite the lack of conventional meiosis in the male sex. PMID:23893741

Systematic identification of non-coding RNA 2,2,7-trimethylguanosine cap structures in Caenorhabditis elegans

PubMed Central

Jia, Dong; Cai, Lun; He, Housheng; Skogerbø, Geir; Li, Tiantian; Aftab, Muhammad Nauman; Chen, Runsheng

2007-01-01

Background The 2,2,7-trimethylguanosine (TMG) cap structure is an important functional characteristic of ncRNAs with critical cellular roles, such as some snRNAs. Here we used immunoprecipitation with both K121 and R1131 anti-TMG antibodies to systematically identify the TMG cap structures for all presently characterized ncRNAs in C. elegans. Results The two anti-TMG antibodies precipitated a similar group of the C. elegans ncRNAs. All snRNAs known to have a TMG cap structure were found in the precipitate, indicating that our identification system was efficient. Other ncRNA families related to splicing, such as SL RNAs and Sm Y RNAs, were also found in the precipitate, as were 7 C/D box snoRNAs. Further analysis showed that the SL RNAs and the Sm Y RNAs shared a very similar Sm binding site element (AAU4–5GGA), which sequence composition differed somewhat from those of other U snRNAs. There were also 16 ncRNAs without an Sm binding site element in the precipitate, suggesting that for these ncRNAs, TMG formation may occur independently of Sm proteins. Conclusion Our results showed that most ncRNAs predicted to be transcribed by RNA polymerase II had a TMG cap, while those predicted to be transcribed by RNA plymerase III or located in introns did not have a TMG cap structure. Compared to ncRNAs without a TMG cap, TMG-capped ncRNAs tended to have higher expression levels. Five functionally non-annotated ncRNAs also have a TMG cap structure, which might be helpful for identifying the cellular roles of these ncRNAs. PMID:17903271
Systematic identification of non-coding RNA 2,2,7-trimethylguanosine cap structures in Caenorhabditis elegans.

PubMed

Jia, Dong; Cai, Lun; He, Housheng; Skogerbø, Geir; Li, Tiantian; Aftab, Muhammad Nauman; Chen, Runsheng

2007-09-29

The 2,2,7-trimethylguanosine (TMG) cap structure is an important functional characteristic of ncRNAs with critical cellular roles, such as some snRNAs. Here we used immunoprecipitation with both K121 and R1131 anti-TMG antibodies to systematically identify the TMG cap structures for all presently characterized ncRNAs in C. elegans. The two anti-TMG antibodies precipitated a similar group of the C. elegans ncRNAs. All snRNAs known to have a TMG cap structure were found in the precipitate, indicating that our identification system was efficient. Other ncRNA families related to splicing, such as SL RNAs and Sm Y RNAs, were also found in the precipitate, as were 7 C/D box snoRNAs. Further analysis showed that the SL RNAs and the Sm Y RNAs shared a very similar Sm binding site element (AAU4-5GGA), which sequence composition differed somewhat from those of other U snRNAs. There were also 16 ncRNAs without an Sm binding site element in the precipitate, suggesting that for these ncRNAs, TMG formation may occur independently of Sm proteins. Our results showed that most ncRNAs predicted to be transcribed by RNA polymerase II had a TMG cap, while those predicted to be transcribed by RNA plymerase III or located in introns did not have a TMG cap structure. Compared to ncRNAs without a TMG cap, TMG-capped ncRNAs tended to have higher expression levels. Five functionally non-annotated ncRNAs also have a TMG cap structure, which might be helpful for identifying the cellular roles of these ncRNAs.
Characteristics and significance of intergenic polyadenylated RNA transcription in Arabidopsis.

PubMed

Moghe, Gaurav D; Lehti-Shiu, Melissa D; Seddon, Alex E; Yin, Shan; Chen, Yani; Juntawong, Piyada; Brandizzi, Federica; Bailey-Serres, Julia; Shiu, Shin-Han

2013-01-01

The Arabidopsis (Arabidopsis thaliana) genome is the most well-annotated plant genome. However, transcriptome sequencing in Arabidopsis continues to suggest the presence of polyadenylated (polyA) transcripts originating from presumed intergenic regions. It is not clear whether these transcripts represent novel noncoding or protein-coding genes. To understand the nature of intergenic polyA transcription, we first assessed its abundance using multiple messenger RNA sequencing data sets. We found 6,545 intergenic transcribed fragments (ITFs) occupying 3.6% of Arabidopsis intergenic space. In contrast to transcribed fragments that map to protein-coding and RNA genes, most ITFs are significantly shorter, are expressed at significantly lower levels, and tend to be more data set specific. A surprisingly large number of ITFs (32.1%) may be protein coding based on evidence of translation. However, our results indicate that these "translated" ITFs tend to be close to and are likely associated with known genes. To investigate if ITFs are under selection and are functional, we assessed ITF conservation through cross-species as well as within-species comparisons. Our analysis reveals that 237 ITFs, including 49 with translation evidence, are under strong selective constraint and relatively distant from annotated features. These ITFs are likely parts of novel genes. However, the selective pressure imposed on most ITFs is similar to that of randomly selected, untranscribed intergenic sequences. Our findings indicate that despite the prevalence of ITFs, apart from the possibility of genomic contamination, many may be background or noisy transcripts derived from "junk" DNA, whose production may be inherent to the process of transcription and which, on rare occasions, may act as catalysts for the creation of novel genes.
Selective Degradation of Host RNA Polymerase II Transcripts by Influenza A Virus PA-X Host Shutoff Protein

PubMed Central

Larkins-Ford, Jonah; McCormick, Craig; Gaglia, Marta M.

2016-01-01

Influenza A viruses (IAVs) inhibit host gene expression by a process known as host shutoff. Host shutoff limits host innate immune responses and may also redirect the translation apparatus to the production of viral proteins. Multiple IAV proteins regulate host shutoff, including PA-X, a ribonuclease that remains incompletely characterized. We report that PA-X selectively targets host RNA polymerase II (Pol II) transcribed mRNAs, while sparing products of Pol I and Pol III. Interestingly, we show that PA-X can also target Pol II-transcribed RNAs in the nucleus, including non-coding RNAs that are not destined to be translated, and reporter transcripts with RNA hairpin structures that block ribosome loading. Transcript degradation likely occurs in the nucleus, as PA-X is enriched in the nucleus and its nuclear localization correlates with reduction in target RNA levels. Complete degradation of host mRNAs following PA-X-mediated endonucleolytic cleavage is dependent on the host 5’->3’-exonuclease Xrn1. IAV mRNAs are structurally similar to host mRNAs, but are synthesized and modified at the 3’ end by the action of the viral RNA-dependent RNA polymerase complex. Infection of cells with wild-type IAV or a recombinant PA-X-deficient virus revealed that IAV mRNAs resist PA-X-mediated degradation during infection. At the same time, loss of PA-X resulted in changes in the synthesis of select viral mRNAs and a decrease in viral protein accumulation. Collectively, these results significantly advance our understanding of IAV host shutoff, and suggest that the PA-X causes selective degradation of host mRNAs by discriminating some aspect of Pol II-dependent RNA biogenesis in the nucleus. PMID:26849127
Metformin-Induced Changes of the Coding Transcriptome and Non-Coding RNAs in the Livers of Non-Alcoholic Fatty Liver Disease Mice.

PubMed

Guo, Jun; Zhou, Yuan; Cheng, Yafen; Fang, Weiwei; Hu, Gang; Wei, Jie; Lin, Yajun; Man, Yong; Guo, Lixin; Sun, Mingxiao; Cui, Qinghua; Li, Jian

2018-01-01

Recent studies have suggested that changes in non-coding mRNA play a key role in the progression of non-alcoholic fatty liver disease (NAFLD). Metformin is now recommended and effective for the treatment of NAFLD. We hope the current analyses of the non-coding mRNA transcriptome will provide a better presentation of the potential roles of mRNAs and long non-coding RNAs (lncRNAs) that underlie NAFLD and metformin intervention. The present study mainly analysed changes in the coding transcriptome and non-coding RNAs after the application of a five-week metformin intervention. Liver samples from three groups of mice were harvested for transcriptome profiling, which covered mRNA, lncRNA, microRNA (miRNA) and circular RNA (circRNA), using a microarray technique. A systematic alleviation of high-fat diet (HFD)-induced transcriptome alterations by metformin was observed. The metformin treatment largely reversed the correlations with diabetes-related pathways. Our analysis also suggested interaction networks between differentially expressed lncRNAs and known hepatic disease genes and interactions between circRNA and their disease-related miRNA partners. Eight HFD-responsive lncRNAs and three metformin-responsive lncRNAs were noted due to their widespread associations with disease genes. Moreover, seven miRNAs that interacted with multiple differentially expressed circRNAs were highlighted because they were likely to be associated with metabolic or liver diseases. The present study identified novel changes in the coding transcriptome and non-coding RNAs in the livers of NAFLD mice after metformin treatment that might shed light on the underlying mechanism by which metformin impedes the progression of NAFLD. © 2018 The Author(s). Published by S. Karger AG, Basel.
Developmental roles of 21 Drosophila transcription factors are determined by quantitative differences in binding to an overlapping set of thousands of genomic regions

DOE Office of Scientific and Technical Information (OSTI.GOV)

MacArthur, Stewart; Li, Xiao-Yong; Li, Jingyi

2009-05-15

BACKGROUND: We previously established that six sequence-specific transcription factors that initiate anterior/posterior patterning in Drosophila bind to overlapping sets of thousands of genomic regions in blastoderm embryos. While regions bound at high levels include known and probable functional targets, more poorly bound regions are preferentially associated with housekeeping genes and/or genes not transcribed in the blastoderm, and are frequently found in protein coding sequences or in less conserved non-coding DNA, suggesting that many are likely non-functional. RESULTS: Here we show that an additional 15 transcription factors that regulate other aspects of embryo patterning show a similar quantitative continuum of functionmore » and binding to thousands of genomic regions in vivo. Collectively, the 21 regulators show a surprisingly high overlap in the regions they bind given that they belong to 11 DNA binding domain families, specify distinct developmental fates, and can act via different cis-regulatory modules. We demonstrate, however, that quantitative differences in relative levels of binding to shared targets correlate with the known biological and transcriptional regulatory specificities of these factors. CONCLUSIONS: It is likely that the overlap in binding of biochemically and functionally unrelated transcription factors arises from the high concentrations of these proteins in nuclei, which, coupled with their broad DNA binding specificities, directs them to regions of open chromatin. We suggest that most animal transcription factors will be found to show a similar broad overlapping pattern of binding in vivo, with specificity achieved by modulating the amount, rather than the identity, of bound factor.« less
Conserved sequence-specific lincRNA-steroid receptor interactions drive transcriptional repression and direct cell fate

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hudson, William H.; Pickard, Mark R.; de Vera, Ian Mitchelle S.

2014-12-23

The majority of the eukaryotic genome is transcribed, generating a significant number of long intergenic noncoding RNAs (lincRNAs). Although lincRNAs represent the most poorly understood product of transcription, recent work has shown lincRNAs fulfill important cellular functions. In addition to low sequence conservation, poor understanding of structural mechanisms driving lincRNA biology hinders systematic prediction of their function. Here we report the molecular requirements for the recognition of steroid receptors (SRs) by the lincRNA growth arrest-specific 5 (Gas5), which regulates steroid-mediated transcriptional regulation, growth arrest and apoptosis. We identify the functional Gas5-SR interface and generate point mutations that ablate the SR-Gas5more » lincRNA interaction, altering Gas5-driven apoptosis in cancer cell lines. Further, we find that the Gas5 SR-recognition sequence is conserved among haplorhines, with its evolutionary origin as a splice acceptor site. This study demonstrates that lincRNAs can recognize protein targets in a conserved, sequence-specific manner in order to affect critical cell functions.« less
Expression of Telomere-Associated Proteins is Interdependent to Stabilize Native Telomere Structure and Telomere Dysfunction by G-Quadruplex Ligand Causes TERRA Upregulation.

PubMed

Sadhukhan, Ratan; Chowdhury, Priyanka; Ghosh, Sourav; Ghosh, Utpal

2018-06-01

Telomere DNA can form specialized nucleoprotein structure with telomere-associated proteins to hide free DNA ends or G-quadruplex structures under certain conditions especially in presence of G-quadruplex ligand. Telomere DNA is transcribed to form non-coding telomere repeat-containing RNA (TERRA) whose biogenesis and function is poorly understood. Our aim was to find the role of telomere-associated proteins and telomere structures in TERRA transcription. We silenced four [two shelterin (TRF1, TRF2) and two non-shelterin (PARP-1, SLX4)] telomere-associated genes using siRNA and verified depletion in protein level. Knocking down of one gene modulated expression of other telomere-associated genes and increased TERRA from 10q, 15q, XpYp and XqYq chromosomes in A549 cells. Telomere was destabilized or damaged by G-quadruplex ligand pyridostatin (PDS) and bleomycin. Telomere dysfunction-induced foci (TIFs) were observed for each case of depletion of proteins, treatment with PDS or bleomycin. TERRA level was elevated by PDS and bleomycin treatment alone or in combination with depletion of telomere-associated proteins.
Stc1: A Critical Link between RNAi and Chromatin Modification Required for Heterochromatin Integrity

PubMed Central

Bayne, Elizabeth H.; White, Sharon A.; Kagansky, Alexander; Bijos, Dominika A.; Sanchez-Pulido, Luis; Hoe, Kwang-Lae; Kim, Dong-Uk; Park, Han-Oh; Ponting, Chris P.; Rappsilber, Juri; Allshire, Robin C.

2010-01-01

Summary In fission yeast, RNAi directs heterochromatin formation at centromeres, telomeres, and the mating type locus. Noncoding RNAs transcribed from repeat elements generate siRNAs that are incorporated into the Argonaute-containing RITS complex and direct it to nascent homologous transcripts. This leads to recruitment of the CLRC complex, including the histone methyltransferase Clr4, promoting H3K9 methylation and heterochromatin formation. A key question is what mediates the recruitment of Clr4/CLRC to transcript-bound RITS. We have identified a LIM domain protein, Stc1, that is required for centromeric heterochromatin integrity. Our analyses show that Stc1 is specifically required to establish H3K9 methylation via RNAi, and interacts both with the RNAi effector Ago1, and with the chromatin-modifying CLRC complex. Moreover, tethering Stc1 to a euchromatic locus is sufficient to induce silencing and heterochromatin formation independently of RNAi. We conclude that Stc1 associates with RITS on centromeric transcripts and recruits CLRC, thereby coupling RNAi to chromatin modification. PMID:20211136
Prevalence of transcription promoters within archaeal operons and coding sequences

PubMed Central

Koide, Tie; Reiss, David J; Bare, J Christopher; Pang, Wyming Lee; Facciotti, Marc T; Schmid, Amy K; Pan, Min; Marzolf, Bruz; Van, Phu T; Lo, Fang-Yin; Pratap, Abhishek; Deutsch, Eric W; Peterson, Amelia; Martin, Dan; Baliga, Nitin S

2009-01-01

Despite the knowledge of complex prokaryotic-transcription mechanisms, generalized rules, such as the simplified organization of genes into operons with well-defined promoters and terminators, have had a significant role in systems analysis of regulatory logic in both bacteria and archaea. Here, we have investigated the prevalence of alternate regulatory mechanisms through genome-wide characterization of transcript structures of ∼64% of all genes, including putative non-coding RNAs in Halobacterium salinarum NRC-1. Our integrative analysis of transcriptome dynamics and protein–DNA interaction data sets showed widespread environment-dependent modulation of operon architectures, transcription initiation and termination inside coding sequences, and extensive overlap in 3′ ends of transcripts for many convergently transcribed genes. A significant fraction of these alternate transcriptional events correlate to binding locations of 11 transcription factors and regulators (TFs) inside operons and annotated genes—events usually considered spurious or non-functional. Using experimental validation, we illustrate the prevalence of overlapping genomic signals in archaeal transcription, casting doubt on the general perception of rigid boundaries between coding sequences and regulatory elements. PMID:19536208
Prevalence of transcription promoters within archaeal operons and coding sequences.

PubMed

Koide, Tie; Reiss, David J; Bare, J Christopher; Pang, Wyming Lee; Facciotti, Marc T; Schmid, Amy K; Pan, Min; Marzolf, Bruz; Van, Phu T; Lo, Fang-Yin; Pratap, Abhishek; Deutsch, Eric W; Peterson, Amelia; Martin, Dan; Baliga, Nitin S

2009-01-01

Despite the knowledge of complex prokaryotic-transcription mechanisms, generalized rules, such as the simplified organization of genes into operons with well-defined promoters and terminators, have had a significant role in systems analysis of regulatory logic in both bacteria and archaea. Here, we have investigated the prevalence of alternate regulatory mechanisms through genome-wide characterization of transcript structures of approximately 64% of all genes, including putative non-coding RNAs in Halobacterium salinarum NRC-1. Our integrative analysis of transcriptome dynamics and protein-DNA interaction data sets showed widespread environment-dependent modulation of operon architectures, transcription initiation and termination inside coding sequences, and extensive overlap in 3' ends of transcripts for many convergently transcribed genes. A significant fraction of these alternate transcriptional events correlate to binding locations of 11 transcription factors and regulators (TFs) inside operons and annotated genes-events usually considered spurious or non-functional. Using experimental validation, we illustrate the prevalence of overlapping genomic signals in archaeal transcription, casting doubt on the general perception of rigid boundaries between coding sequences and regulatory elements.
Fas-Antisense Long Noncoding RNA and Acute Myeloid Leukemia: Is There any Relation?

PubMed

Sayad, Arezou; Hajifathali, Abbas; Hamidieh, Amir Ali; Esfandi, Farbod; Taheri, Mohammad

2018-01-27

In recent years, lncRNAs have been considered as potential predictive biomarkers for prognosis of different human cancers. One example is the FAS antisense RNA 1 (FAS-AS1) located in the 10q23.31 region which is transcribed from the opposite strand of the FAS gene. FAS has an important role in regulation of apoptotic pathways and there is an inverse correlation between FAS-AS1 expression level and production of the soluble form of Fas, so that it might have potential as a therapeutic target to improve chemotherapy effectiveness. In the present study we therefore evaluated FAS-AS1 expression in blood samples of de novo AML patients and healthy controls using real-time quantitative reverse transcription-PCR (qRT-PCR). Our results indicated that the expression level of FAS-AS1 lncRNA demonstrated no significant difference between AML patients and healthy individuals. We conclude from the obtained data that FAS-AS1 is not an informative and reliable biomarker for AML diagnosis, although our results need to be confirmed in further studies. Creative Commons Attribution License
Probing the Structures of Viral RNA Regulatory Elements with SHAPE and Related Methodologies

PubMed Central

Rausch, Jason W.; Sztuba-Solinska, Joanna; Le Grice, Stuart F. J.

2018-01-01

Viral RNAs were selected by evolution to possess maximum functionality in a minimal sequence. Depending on the classification of the virus and the type of RNA in question, viral RNAs must alternately be replicated, spliced, transcribed, transported from the nucleus into the cytoplasm, translated and/or packaged into nascent virions, and in most cases, provide the sequence and structural determinants to facilitate these processes. One consequence of this compact multifunctionality is that viral RNA structures can be exquisitely complex, often involving intermolecular interactions with RNA or protein, intramolecular interactions between sequence segments separated by several thousands of nucleotides, or specialized motifs such as pseudoknots or kissing loops. The fluidity of viral RNA structure can also present a challenge when attempting to characterize it, as genomic RNAs especially are likely to sample numerous conformations at various stages of the virus life cycle. Here we review advances in chemoenzymatic structure probing that have made it possible to address such challenges with respect to cis-acting elements, full-length viral genomes and long non-coding RNAs that play a major role in regulating viral gene expression. PMID:29375504
Possible roles for products of polymorphic MHC and linked olfactory receptor genes during selection processes in reproduction.

PubMed

Ziegler, Andreas; Dohr, Gotrfried; Uchanska-Ziegler, Barbara

2002-07-01

Polymorphic genes of the human major histocompatibility complex [MHC; human leukocyte antigen (HLA)] are probably important in determining resistance to parasites and avoidance of inbreeding. We investigated whether HLA-associated sexual selection could also involve HLA-linked olfactory receptor (OR) genes, which might not only participate in olfaction-guided mate choice, but also in selection processes within the testis. The testicular expression status of HLA class I molecules (by immunohistology) and HLA-linked OR genes (by transcriptional analysis) was determined. Various HLA class I heavy chains, but not beta2-microglobulin (beta2m), were expressed, mainly at the spermatocyte I stage. Of 17 HLA-linked OR genes analyzed, eight were found to be transcribed in the testis. They exhibited varying numbers of 5'- or 3'-non-coding exons as well as differential splicing. We suggest that testis-expressed polymorphic HLA and OR proteins are functionally connected and serve the selection of spermatozoa, enabling them to distinguish 'self from 'non-self [the sperm-receptor-selection (SRS) hypothesis].
Regulation of Global Transcription in Escherichia coli by Rsd and 6S RNA

PubMed Central

Lal, Avantika; Krishna, Sandeep; Seshasayee, Aswin Sai Narain

2018-01-01

In Escherichia coli, the sigma factor σ70 directs RNA polymerase to transcribe growth-related genes, while σ38 directs transcription of stress response genes during stationary phase. Two molecules hypothesized to regulate RNA polymerase are the protein Rsd, which binds to σ70, and the non-coding 6S RNA which binds to the RNA polymerase-σ70 holoenzyme. Despite multiple studies, the functions of Rsd and 6S RNA remain controversial. Here we use RNA-Seq in five phases of growth to elucidate their function on a genome-wide scale. We show that Rsd and 6S RNA facilitate σ38 activity throughout bacterial growth, while 6S RNA also regulates widely different genes depending upon growth phase. We discover novel interactions between 6S RNA and Rsd and show widespread expression changes in a strain lacking both regulators. Finally, we present a mathematical model of transcription which highlights the crosstalk between Rsd and 6S RNA as a crucial factor in controlling sigma factor competition and global gene expression. PMID:29686109
Construction of Infectious cDNA Clone of a Chrysanthemum stunt viroid Korean Isolate

PubMed Central

Yoon, Ju-Yeon; Cho, In-Sook; Choi, Gug-Seoun; Choi, Seung-Kook

2014-01-01

Chrysanthemum stunt viroid (CSVd), a noncoding infectious RNA molecule, causes seriously economic losses of chrysanthemum for 3 or 4 years after its first infection. Monomeric cDNA clones of CSVd isolate SK1 (CSVd-SK1) were constructed in the plasmids pGEM-T easy vector and pUC19 vector. Linear positive-sense transcripts synthesized in vitro from the full-length monomeric cDNA clones of CSVd-SK1 could infect systemically tomato seedlings and chrysanthemum plants, suggesting that the linear CSVd RNA transcribed from the cDNA clones could be replicated as efficiently as circular CSVd in host species. However, direct inoculation of plasmid cDNA clones containing full-length monomeric cDNA of CSVd-SK1 failed to infect tomato and chrysanthemum and linear negative-sense transcripts from the plasmid DNAs were not infectious in the two plant species. The cDNA sequences of progeny viroid in systemically infected tomato and chrysanthemum showed a few substitutions at a specific nucleotide position, but there were no deletions and insertions in the sequences of the CSVd progeny from tomato and chrysanthemum plants. PMID:25288987
Regulation of Global Transcription in Escherichia coli by Rsd and 6S RNA.

PubMed

Lal, Avantika; Krishna, Sandeep; Seshasayee, Aswin Sai Narain

2018-05-31

In Escherichia coli , the sigma factor σ 70 directs RNA polymerase to transcribe growth-related genes, while σ 38 directs transcription of stress response genes during stationary phase. Two molecules hypothesized to regulate RNA polymerase are the protein Rsd, which binds to σ 70 , and the non-coding 6S RNA which binds to the RNA polymerase-σ 70 holoenzyme. Despite multiple studies, the functions of Rsd and 6S RNA remain controversial. Here we use RNA-Seq in five phases of growth to elucidate their function on a genome-wide scale. We show that Rsd and 6S RNA facilitate σ 38 activity throughout bacterial growth, while 6S RNA also regulates widely different genes depending upon growth phase. We discover novel interactions between 6S RNA and Rsd and show widespread expression changes in a strain lacking both regulators. Finally, we present a mathematical model of transcription which highlights the crosstalk between Rsd and 6S RNA as a crucial factor in controlling sigma factor competition and global gene expression. Copyright © 2018 Lal et al.
YY1 binding association with sex-biased transcription revealed through X-linked transcript levels and allelic binding analyses.

PubMed

Chen, Chih-Yu; Shi, Wenqiang; Balaton, Bradley P; Matthews, Allison M; Li, Yifeng; Arenillas, David J; Mathelier, Anthony; Itoh, Masayoshi; Kawaji, Hideya; Lassmann, Timo; Hayashizaki, Yoshihide; Carninci, Piero; Forrest, Alistair R R; Brown, Carolyn J; Wasserman, Wyeth W

2016-11-18

Sex differences in susceptibility and progression have been reported in numerous diseases. Female cells have two copies of the X chromosome with X-chromosome inactivation imparting mono-allelic gene silencing for dosage compensation. However, a subset of genes, named escapees, escape silencing and are transcribed bi-allelically resulting in sexual dimorphism. Here we conducted in silico analyses of the sexes using human datasets to gain perspectives into such regulation. We identified transcription start sites of escapees (escTSSs) based on higher transcription levels in female cells using FANTOM5 CAGE data. Significant over-representations of YY1 transcription factor binding motif and ChIP-seq peaks around escTSSs highlighted its positive association with escapees. Furthermore, YY1 occupancy is significantly biased towards the inactive X (Xi) at long non-coding RNA loci that are frequent contacts of Xi-specific superloops. Our study suggests a role for YY1 in transcriptional activity on Xi in general through sequence-specific binding, and its involvement at superloop anchors.
YY1 binding association with sex-biased transcription revealed through X-linked transcript levels and allelic binding analyses

PubMed Central

Chen, Chih-yu; Shi, Wenqiang; Balaton, Bradley P.; Matthews, Allison M.; Li, Yifeng; Arenillas, David J.; Mathelier, Anthony; Itoh, Masayoshi; Kawaji, Hideya; Lassmann, Timo; Hayashizaki, Yoshihide; Carninci, Piero; Forrest, Alistair R. R.; Brown, Carolyn J.; Wasserman, Wyeth W.

2016-01-01

Sex differences in susceptibility and progression have been reported in numerous diseases. Female cells have two copies of the X chromosome with X-chromosome inactivation imparting mono-allelic gene silencing for dosage compensation. However, a subset of genes, named escapees, escape silencing and are transcribed bi-allelically resulting in sexual dimorphism. Here we conducted in silico analyses of the sexes using human datasets to gain perspectives into such regulation. We identified transcription start sites of escapees (escTSSs) based on higher transcription levels in female cells using FANTOM5 CAGE data. Significant over-representations of YY1 transcription factor binding motif and ChIP-seq peaks around escTSSs highlighted its positive association with escapees. Furthermore, YY1 occupancy is significantly biased towards the inactive X (Xi) at long non-coding RNA loci that are frequent contacts of Xi-specific superloops. Our study suggests a role for YY1 in transcriptional activity on Xi in general through sequence-specific binding, and its involvement at superloop anchors. PMID:27857184
RNA expression in a cartilaginous fish cell line reveals ancient 3′ noncoding regions highly conserved in vertebrates

PubMed Central

Forest, David; Nishikawa, Ryuhei; Kobayashi, Hiroshi; Parton, Angela; Bayne, Christopher J.; Barnes, David W.

2007-01-01

We have established a cartilaginous fish cell line [Squalus acanthias embryo cell line (SAE)], a mesenchymal stem cell line derived from the embryo of an elasmobranch, the spiny dogfish shark S. acanthias. Elasmobranchs (sharks and rays) first appeared >400 million years ago, and existing species provide useful models for comparative vertebrate cell biology, physiology, and genomics. Comparative vertebrate genomics among evolutionarily distant organisms can provide sequence conservation information that facilitates identification of critical coding and noncoding regions. Although these genomic analyses are informative, experimental verification of functions of genomic sequences depends heavily on cell culture approaches. Using ESTs defining mRNAs derived from the SAE cell line, we identified lengthy and highly conserved gene-specific nucleotide sequences in the noncoding 3′ UTRs of eight genes involved in the regulation of cell growth and proliferation. Conserved noncoding 3′ mRNA regions detected by using the shark nucleotide sequences as a starting point were found in a range of other vertebrate orders, including bony fish, birds, amphibians, and mammals. Nucleotide identity of shark and human in these regions was remarkably well conserved. Our results indicate that highly conserved gene sequences dating from the appearance of jawed vertebrates and representing potential cis-regulatory elements can be identified through the use of cartilaginous fish as a baseline. Because the expression of genes in the SAE cell line was prerequisite for their identification, this cartilaginous fish culture system also provides a physiologically valid tool to test functional hypotheses on the role of these ancient conserved sequences in comparative cell biology. PMID:17227856

The anti-tumor drug bleomycin preferentially cleaves at the transcription start sites of actively transcribed genes in human cells.

PubMed

Murray, Vincent; Chen, Jon K; Galea, Anne M

2014-04-01

The genome-wide pattern of DNA cleavage at transcription start sites (TSSs) for the anti-tumor drug bleomycin was examined in human HeLa cells using next-generation DNA sequencing. It was found that actively transcribed genes were preferentially cleaved compared with non-transcribed genes. The 143,600 identified human TSSs were split into non-transcribed genes (82,596) and transcribed genes (61,004) for HeLa cells. These transcribed genes were further split into quintiles of 12,201 genes comprising the top 20, 20-40, 40-60, 60-80, and 80-100 % of expressed genes. The bleomycin cleavage pattern at highly transcribed gene TSSs was greatly enhanced compared with purified DNA and non-transcribed gene TSSs. The top 20 and 20-40 % quintiles had a very similar enhanced cleavage pattern, the 40-60 % quintile was intermediate, while the 60-80 and 80-100 % quintiles were close to the non-transcribed and purified DNA profiles. The pattern of bleomycin enhanced cleavage had peaks that were approximately 200 bp apart, and this indicated that bleomycin was identifying the presence of phased nucleosomes at TSSs. Hence bleomycin can be utilized to detect chromatin structures that are present at actively transcribed genes. In this study, for the first time, the pattern of DNA damage by a clinically utilized cancer chemotherapeutic agent was performed on a human genome-wide scale at the nucleotide level.
An expanding universe of noncoding RNAs between the poles of basic science and clinical investigations.

PubMed

Weil, Patrick P; Hensel, Kai O; Weber, David; Postberg, Jan

2016-03-01

The Keystone Symposium 'MicroRNAs and Noncoding RNAs in Cancer', Keystone, CO, USA, 7-12 June 2015 Since the discovery of RNAi, great efforts have been undertaken to unleash the potential biomedical applicability of small noncoding RNAs, mainly miRNAs, involving their use as biomarkers for personalized diagnostics or their usability as active agents or therapy targets. The research's focus on the noncoding RNA world is now slowly moving from a phase of basic discoveries into a new phase, where every single molecule out of many hundreds of cataloged noncoding RNAs becomes dissected in order to investigate these molecules' biomedical relevance. In addition, RNA classes neglected before, such as long noncoding RNAs or circular RNAs attract more attention. Numerous timely results and hypotheses were presented at the 2015 Keystone Symposium 'MicroRNAs and Noncoding RNAs in Cancer'.
Facts and updates about cardiovascular non-coding RNAs in heart failure.

PubMed

Thum, Thomas

2015-09-01

About 11% of all deaths include heart failure as a contributing cause. The annual cost of heart failure amounts to US $34,000,000,000 in the United States alone. With the exception of heart transplantation, there is no curative therapy available. Only occasionally there are new areas in science that develop into completely new research fields. The topic on non-coding RNAs, including microRNAs, long non-coding RNAs, and circular RNAs, is such a field. In this short review, we will discuss the latest developments about non-coding RNAs in cardiovascular disease. MicroRNAs are short regulatory non-coding endogenous RNA species that are involved in virtually all cellular processes. Long non-coding RNAs also regulate gene and protein levels; however, by much more complicated and diverse mechanisms. In general, non-coding RNAs have been shown to be of great value as therapeutic targets in adverse cardiac remodelling and also as diagnostic and prognostic biomarkers for heart failure. In the future, non-coding RNA-based therapeutics are likely to enter the clinical reality offering a new treatment approach of heart failure.
Noncoding RNAs in human intervertebral disc degeneration: An integrated microarray study.

PubMed

Liu, Xu; Che, Lu; Xie, Yan-Ke; Hu, Qing-Jie; Ma, Chi-Jiao; Pei, Yan-Jun; Wu, Zhi-Gang; Liu, Zhi-Heng; Fan, Li-Ying; Wang, Hai-Qiang

2015-09-01

Accumulating evidence indicates that noncoding RNAs play important roles in a multitude of biological processes. The striking findings of miRNAs (microRNAs) and lncRNAs (long noncoding RNAs) as members of noncoding RNAs open up an exciting era in the studies of gene regulation. More recently, the reports of circRNAs (circular RNAs) add fuel to the noncoding RNAs research. Human intervertebral disc degeneration (IDD) is a main cause of low back pain as a disabling spinal disease. We have addressed the expression profiles if miRNAs, lncRNAs and mRNAs in IDD (Wang et al., J Pathology, 2011 and Wan et al., Arthritis Res Ther, 2014). Furthermore, we thoroughly analysed noncoding RNAs, including miRNAs, lncRNAs and circRNAs in IDD using the very same samples. Here we delineate in detail the contents of the aforementioned microarray analyses. Microarray and sample annotation data were deposited in GEO under accession number GSE67567 as SuperSeries. The integrated analyses of these noncoding RNAs will shed a novel light on coding-noncoding regulatory machinery.
Long non-coding RNAs in hepatocellular carcinoma: Potential roles and clinical implications

PubMed Central

Niu, Zhao-Shan; Niu, Xiao-Jun; Wang, Wen-Hong

2017-01-01

Long non-coding RNAs (lncRNAs) are a subgroup of non-coding RNA transcripts greater than 200 nucleotides in length with little or no protein-coding potential. Emerging evidence indicates that lncRNAs may play important regulatory roles in the pathogenesis and progression of human cancers, including hepatocellular carcinoma (HCC). Certain lncRNAs may be used as diagnostic or prognostic markers for HCC, a serious malignancy with increasing morbidity and high mortality rates worldwide. Therefore, elucidating the functional roles of lncRNAs in tumors can contribute to a better understanding of the molecular mechanisms of HCC and may help in developing novel therapeutic targets. In this review, we summarize the recent progress regarding the functional roles of lncRNAs in HCC and explore their clinical implications as diagnostic or prognostic biomarkers and molecular therapeutic targets for HCC. PMID:28932078
Expression of the cervical carcinoma expressed PCNA regulatory (CCEPR) long noncoding RNA is driven by the human papillomavirus E6 protein and modulates cell proliferation independent of PCNA.

PubMed

Sharma, Surendra; Munger, Karl

2018-05-01

Modulation of expression of noncoding RNAs is an important aspect of the oncogenic activities of high-risk human papillomavirus (HPV) E6 and E7 proteins. While HPV E6/E7-mediated alterations of microRNAs (miRNAs) has been studied in detail there are fewer reports on HPV-mediated dysregulation of long noncoding RNAs (lncRNAs). The cervical carcinoma expressed PCNA regulatory (CCEPR) lncRNA is highly expressed in cervical cancers and expression correlates with tumor size and patient outcome. We report that CCEPR is a nuclear lncRNA and that HPV16 E6 oncogene expression causes increased CCEPR expression through a mechanism that is not directly dependent on TP53 inactivation. CCEPR depletion in cervical carcinoma cell lines reduces viability, while overexpression enhances viability. In contrast to what was published and inspired its designation, there is no evidence for PCNA mRNA stabilization, and hence CCEPR likely functions through a different mechanism. Copyright © 2018 Elsevier Inc. All rights reserved.
Targeting noncoding RNAs in disease

PubMed Central

Parsons, Christine; Walker, Lisa; Zhang, Wen Cai; Slack, Frank J.

2017-01-01

Many RNA species have been identified as important players in the development of chronic diseases, including cancer. Over the past decade, numerous studies have highlighted how regulatory RNAs such as microRNAs (miRNAs) and long noncoding RNAs (lncRNAs) play crucial roles in the development of a disease state. It is clear that the aberrant expression of miRNAs promotes tumor initiation and progression, is linked with cardiac dysfunction, allows for the improper physiological response in maintaining glucose and insulin levels, and can prevent the appropriate integration of neuronal networks, resulting in neurodegenerative disorders. Because of this, there has been a major effort to therapeutically target these noncoding RNAs. In just the past 5 years, over 100 antisense oligonucleotide–based therapies have been tested in phase I clinical trials, a quarter of which have reached phase II/III. Most notable are fomivirsen and mipomersen, which have received FDA approval to treat cytomegalovirus retinitis and high blood cholesterol, respectively. The continued improvement of innovative RNA modifications and delivery entities, such as nanoparticles, will aid in the development of future RNA-based therapeutics for a broader range of chronic diseases. Here we summarize the latest promises and challenges of targeting noncoding RNAs in disease. PMID:28248199
Analysis of conserved noncoding DNA in Drosophila reveals similar constraints in intergenic and intronic sequences.

PubMed

Bergman, C M; Kreitman, M

2001-08-01

Comparative genomic approaches to gene and cis-regulatory prediction are based on the principle that differential DNA sequence conservation reflects variation in functional constraint. Using this principle, we analyze noncoding sequence conservation in Drosophila for 40 loci with known or suspected cis-regulatory function encompassing >100 kb of DNA. We estimate the fraction of noncoding DNA conserved in both intergenic and intronic regions and describe the length distribution of ungapped conserved noncoding blocks. On average, 22%-26% of noncoding sequences surveyed are conserved in Drosophila, with median block length approximately 19 bp. We show that point substitution in conserved noncoding blocks exhibits transition bias as well as lineage effects in base composition, and occurs more than an order of magnitude more frequently than insertion/deletion (indel) substitution. Overall, patterns of noncoding DNA structure and evolution differ remarkably little between intergenic and intronic conserved blocks, suggesting that the effects of transcription per se contribute minimally to the constraints operating on these sequences. The results of this study have implications for the development of alignment and prediction algorithms specific to noncoding DNA, as well as for models of cis-regulatory DNA sequence evolution.
Widespread anti-sense transcription in apple is correlated with siRNA production and indicates a large potential for transcriptional and/or post-transcriptional control.

PubMed

Celton, Jean-Marc; Gaillard, Sylvain; Bruneau, Maryline; Pelletier, Sandra; Aubourg, Sébastien; Martin-Magniette, Marie-Laure; Navarro, Lionel; Laurens, François; Renou, Jean-Pierre

2014-07-01

Characterizing the transcriptome of eukaryotic organisms is essential for studying gene regulation and its impact on phenotype. The realization that anti-sense (AS) and noncoding RNA transcription is pervasive in many genomes has emphasized our limited understanding of gene transcription and post-transcriptional regulation. Numerous mechanisms including convergent transcription, anti-correlated expression of sense and AS transcripts, and RNAi remain ill-defined. Here, we have combined microarray analysis and high-throughput sequencing of small RNAs (sRNAs) to unravel the complexity of transcriptional and potential post-transcriptional regulation in eight organs of apple (Malus × domestica). The percentage of AS transcript expression is higher than that identified in annual plants such as rice and Arabidopsis thaliana. Furthermore, we show that a majority of AS transcripts are transcribed beyond 3'UTR regions, and may cover a significant portion of the predicted sense transcripts. Finally we demonstrate at a genome-wide scale that anti-sense transcript expression is correlated with the presence of both short (21-23 nt) and long (> 30 nt) siRNAs, and that the sRNA coverage depth varies with the level of AS transcript expression. Our study provides a new insight on the functional role of anti-sense transcripts at the genome-wide level, and a new basis for the understanding of sRNA biogenesis in plants. © 2014 INRA. New Phytologist © 2014 New Phytologist Trust.
Computational and transcriptional evidence for microRNAs in the honey bee genome

PubMed Central

Weaver, Daniel B; Anzola, Juan M; Evans, Jay D; Reid, Jeffrey G; Reese, Justin T; Childs, Kevin L; Zdobnov, Evgeny M; Samanta, Manoj P; Miller, Jonathan; Elsik, Christine G

2007-01-01

Background Non-coding microRNAs (miRNAs) are key regulators of gene expression in eukaryotes. Insect miRNAs help regulate the levels of proteins involved with development, metabolism, and other life history traits. The recently sequenced honey bee genome provides an opportunity to detect novel miRNAs in both this species and others, and to begin to infer the roles of miRNAs in honey bee development. Results Three independent computational surveys of the assembled honey bee genome identified a total of 65 non-redundant candidate miRNAs, several of which appear to have previously unrecognized orthologs in the Drosophila genome. A subset of these candidate miRNAs were screened for expression by quantitative RT-PCR and/or genome tiling arrays and most predicted miRNAs were confirmed as being expressed in at least one honey bee tissue. Interestingly, the transcript abundance for several known and novel miRNAs displayed caste or age-related differences in honey bees. Genes in proximity to miRNAs in the bee genome are disproportionately associated with the Gene Ontology terms 'physiological process', 'nucleus' and 'response to stress'. Conclusion Computational approaches successfully identified miRNAs in the honey bee and indicated previously unrecognized miRNAs in the well-studied Drosophila melanogaster genome despite the 280 million year distance between these insects. Differentially transcribed miRNAs are likely to be involved in regulating honey bee development, and arguably in the extreme developmental switch between sterile worker bees and highly fertile queens. PMID:17543122
An RpoS-dependent sRNA regulates the expression of a chaperone involved in protein folding

PubMed Central

Silva, Inês Jesus; Ortega, Álvaro Darío; Viegas, Sandra Cristina; García-del Portillo, Francisco; Arraiano, Cecília Maria

2013-01-01

Small noncoding RNAs (sRNAs) are usually expressed in the cell to face a variety of stresses. In this report we disclose the first target for SraL (also known as RyjA), a sRNA present in many bacteria, which is highly induced in stationary phase. We also demonstrate that this sRNA is directly transcribed by the major stress σ factor σS (RpoS) in Salmonella enterica serovar Typhimurium. We show that SraL sRNA down-regulates the expression of the chaperone Trigger Factor (TF), encoded by the tig gene. TF is one of the three major chaperones that cooperate in the folding of the newly synthesized cytosolic proteins and is the only ribosome-associated chaperone known in bacteria. By use of bioinformatic tools and mutagenesis experiments, SraL was shown to directly interact with the 5′ UTR of the tig mRNA a few nucleotides upstream of the Shine-Dalgarno region. Namely, point mutations in the sRNA (SraL*) abolished the repression of tig mRNA and could only down-regulate a tig transcript target with the respective compensatory mutations. We have also validated in vitro that SraL forms a stable duplex with the tig mRNA. This work constitutes the first report of a small RNA affecting protein folding. Taking into account that both SraL and TF are very well conserved in enterobacteria, this work will have important repercussions in the field. PMID:23893734
An RpoS-dependent sRNA regulates the expression of a chaperone involved in protein folding.

PubMed

Silva, Inês Jesus; Ortega, Alvaro Darío; Viegas, Sandra Cristina; García-Del Portillo, Francisco; Arraiano, Cecília Maria

2013-09-01

Small noncoding RNAs (sRNAs) are usually expressed in the cell to face a variety of stresses. In this report we disclose the first target for SraL (also known as RyjA), a sRNA present in many bacteria, which is highly induced in stationary phase. We also demonstrate that this sRNA is directly transcribed by the major stress σ factor σ(S) (RpoS) in Salmonella enterica serovar Typhimurium. We show that SraL sRNA down-regulates the expression of the chaperone Trigger Factor (TF), encoded by the tig gene. TF is one of the three major chaperones that cooperate in the folding of the newly synthesized cytosolic proteins and is the only ribosome-associated chaperone known in bacteria. By use of bioinformatic tools and mutagenesis experiments, SraL was shown to directly interact with the 5' UTR of the tig mRNA a few nucleotides upstream of the Shine-Dalgarno region. Namely, point mutations in the sRNA (SraL*) abolished the repression of tig mRNA and could only down-regulate a tig transcript target with the respective compensatory mutations. We have also validated in vitro that SraL forms a stable duplex with the tig mRNA. This work constitutes the first report of a small RNA affecting protein folding. Taking into account that both SraL and TF are very well conserved in enterobacteria, this work will have important repercussions in the field.
Fast, scalable prediction of deleterious noncoding variants from functional and population genomic data.

PubMed

Huang, Yi-Fei; Gulko, Brad; Siepel, Adam

2017-04-01

Many genetic variants that influence phenotypes of interest are located outside of protein-coding genes, yet existing methods for identifying such variants have poor predictive power. Here we introduce a new computational method, called LINSIGHT, that substantially improves the prediction of noncoding nucleotide sites at which mutations are likely to have deleterious fitness consequences, and which, therefore, are likely to be phenotypically important. LINSIGHT combines a generalized linear model for functional genomic data with a probabilistic model of molecular evolution. The method is fast and highly scalable, enabling it to exploit the 'big data' available in modern genomics. We show that LINSIGHT outperforms the best available methods in identifying human noncoding variants associated with inherited diseases. In addition, we apply LINSIGHT to an atlas of human enhancers and show that the fitness consequences at enhancers depend on cell type, tissue specificity, and constraints at associated promoters.
Evolution of coding and non-coding genes in HOX clusters of a marsupial.

PubMed

Yu, Hongshi; Lindsay, James; Feng, Zhi-Ping; Frankenberg, Stephen; Hu, Yanqiu; Carone, Dawn; Shaw, Geoff; Pask, Andrew J; O'Neill, Rachel; Papenfuss, Anthony T; Renfree, Marilyn B

2012-06-18

The HOX gene clusters are thought to be highly conserved amongst mammals and other vertebrates, but the long non-coding RNAs have only been studied in detail in human and mouse. The sequencing of the kangaroo genome provides an opportunity to use comparative analyses to compare the HOX clusters of a mammal with a distinct body plan to those of other mammals. Here we report a comparative analysis of HOX gene clusters between an Australian marsupial of the kangaroo family and the eutherians. There was a strikingly high level of conservation of HOX gene sequence and structure and non-protein coding genes including the microRNAs miR-196a, miR-196b, miR-10a and miR-10b and the long non-coding RNAs HOTAIR, HOTAIRM1 and HOXA11AS that play critical roles in regulating gene expression and controlling development. By microRNA deep sequencing and comparative genomic analyses, two conserved microRNAs (miR-10a and miR-10b) were identified and one new candidate microRNA with typical hairpin precursor structure that is expressed in both fibroblasts and testes was found. The prediction of microRNA target analysis showed that several known microRNA targets, such as miR-10, miR-414 and miR-464, were found in the tammar HOX clusters. In addition, several novel and putative miRNAs were identified that originated from elsewhere in the tammar genome and that target the tammar HOXB and HOXD clusters. This study confirms that the emergence of known long non-coding RNAs in the HOX clusters clearly predate the marsupial-eutherian divergence 160 Ma ago. It also identified a new potentially functional microRNA as well as conserved miRNAs. These non-coding RNAs may participate in the regulation of HOX genes to influence the body plan of this marsupial.
Evolution of coding and non-coding genes in HOX clusters of a marsupial

PubMed Central

2012-01-01

Background The HOX gene clusters are thought to be highly conserved amongst mammals and other vertebrates, but the long non-coding RNAs have only been studied in detail in human and mouse. The sequencing of the kangaroo genome provides an opportunity to use comparative analyses to compare the HOX clusters of a mammal with a distinct body plan to those of other mammals. Results Here we report a comparative analysis of HOX gene clusters between an Australian marsupial of the kangaroo family and the eutherians. There was a strikingly high level of conservation of HOX gene sequence and structure and non-protein coding genes including the microRNAs miR-196a, miR-196b, miR-10a and miR-10b and the long non-coding RNAs HOTAIR, HOTAIRM1 and HOXA11AS that play critical roles in regulating gene expression and controlling development. By microRNA deep sequencing and comparative genomic analyses, two conserved microRNAs (miR-10a and miR-10b) were identified and one new candidate microRNA with typical hairpin precursor structure that is expressed in both fibroblasts and testes was found. The prediction of microRNA target analysis showed that several known microRNA targets, such as miR-10, miR-414 and miR-464, were found in the tammar HOX clusters. In addition, several novel and putative miRNAs were identified that originated from elsewhere in the tammar genome and that target the tammar HOXB and HOXD clusters. Conclusions This study confirms that the emergence of known long non-coding RNAs in the HOX clusters clearly predate the marsupial-eutherian divergence 160 Ma ago. It also identified a new potentially functional microRNA as well as conserved miRNAs. These non-coding RNAs may participate in the regulation of HOX genes to influence the body plan of this marsupial. PMID:22708672
Noncoding sequence classification based on wavelet transform analysis: part II

NASA Astrophysics Data System (ADS)

Paredes, O.; Strojnik, M.; Romo-Vázquez, R.; Vélez-Pérez, H.; Ranta, R.; Garcia-Torales, G.; Scholl, M. K.; Morales, J. A.

2017-09-01

DNA sequences in human genome can be divided into the coding and noncoding ones. We hypothesize that the characteristic periodicities of the noncoding sequences are related to their function. We describe the procedure to identify these characteristic periodicities using the wavelet analysis. Our results show that three groups of noncoding sequences, each one with different biological function, may be differentiated by their wavelet coefficients within specific frequency range.
Directional RNA-seq reveals highly complex condition-dependent transcriptomes in E. coli K12 through accurate full-length transcripts assembling.

PubMed

Li, Shan; Dong, Xia; Su, Zhengchang

2013-07-30

Although prokaryotic gene transcription has been studied over decades, many aspects of the process remain poorly understood. Particularly, recent studies have revealed that transcriptomes in many prokaryotes are far more complex than previously thought. Genes in an operon are often alternatively and dynamically transcribed under different conditions, and a large portion of genes and intergenic regions have antisense RNA (asRNA) and non-coding RNA (ncRNA) transcripts, respectively. Ironically, similar studies have not been conducted in the model bacterium E coli K12, thus it is unknown whether or not the bacterium possesses similar complex transcriptomes. Furthermore, although RNA-seq becomes the major method for analyzing the complexity of prokaryotic transcriptome, it is still a challenging task to accurately assemble full length transcripts using short RNA-seq reads. To fill these gaps, we have profiled the transcriptomes of E. coli K12 under different culture conditions and growth phases using a highly specific directional RNA-seq technique that can capture various types of transcripts in the bacterial cells, combined with a highly accurate and robust algorithm and tool TruHMM (http://bioinfolab.uncc.edu/TruHmm_package/) for assembling full length transcripts. We found that 46.9 ~ 63.4% of expressed operons were utilized in their putative alternative forms, 72.23 ~ 89.54% genes had putative asRNA transcripts and 51.37 ~ 72.74% intergenic regions had putative ncRNA transcripts under different culture conditions and growth phases. As has been demonstrated in many other prokaryotes, E. coli K12 also has a highly complex and dynamic transcriptomes under different culture conditions and growth phases. Such complex and dynamic transcriptomes might play important roles in the physiology of the bacterium. TruHMM is a highly accurate and robust algorithm for assembling full-length transcripts in prokaryotes using directional RNA-seq short reads.
Directional RNA-seq reveals highly complex condition-dependent transcriptomes in E. coli K12 through accurate full-length transcripts assembling

PubMed Central

2013-01-01

Background Although prokaryotic gene transcription has been studied over decades, many aspects of the process remain poorly understood. Particularly, recent studies have revealed that transcriptomes in many prokaryotes are far more complex than previously thought. Genes in an operon are often alternatively and dynamically transcribed under different conditions, and a large portion of genes and intergenic regions have antisense RNA (asRNA) and non-coding RNA (ncRNA) transcripts, respectively. Ironically, similar studies have not been conducted in the model bacterium E coli K12, thus it is unknown whether or not the bacterium possesses similar complex transcriptomes. Furthermore, although RNA-seq becomes the major method for analyzing the complexity of prokaryotic transcriptome, it is still a challenging task to accurately assemble full length transcripts using short RNA-seq reads. Results To fill these gaps, we have profiled the transcriptomes of E. coli K12 under different culture conditions and growth phases using a highly specific directional RNA-seq technique that can capture various types of transcripts in the bacterial cells, combined with a highly accurate and robust algorithm and tool TruHMM (http://bioinfolab.uncc.edu/TruHmm_package/) for assembling full length transcripts. We found that 46.9 ~ 63.4% of expressed operons were utilized in their putative alternative forms, 72.23 ~ 89.54% genes had putative asRNA transcripts and 51.37 ~ 72.74% intergenic regions had putative ncRNA transcripts under different culture conditions and growth phases. Conclusions As has been demonstrated in many other prokaryotes, E. coli K12 also has a highly complex and dynamic transcriptomes under different culture conditions and growth phases. Such complex and dynamic transcriptomes might play important roles in the physiology of the bacterium. TruHMM is a highly accurate and robust algorithm for assembling full-length transcripts in prokaryotes using directional RNA-seq short reads. PMID:23899370
Identification of coding and non-coding mutational hotspots in cancer genomes.

PubMed

Piraino, Scott W; Furney, Simon J

2017-01-05

The identification of mutations that play a causal role in tumour development, so called "driver" mutations, is of critical importance for understanding how cancers form and how they might be treated. Several large cancer sequencing projects have identified genes that are recurrently mutated in cancer patients, suggesting a role in tumourigenesis. While the landscape of coding drivers has been extensively studied and many of the most prominent driver genes are well characterised, comparatively less is known about the role of mutations in the non-coding regions of the genome in cancer development. The continuing fall in genome sequencing costs has resulted in a concomitant increase in the number of cancer whole genome sequences being produced, facilitating systematic interrogation of both the coding and non-coding regions of cancer genomes. To examine the mutational landscapes of tumour genomes we have developed a novel method to identify mutational hotspots in tumour genomes using both mutational data and information on evolutionary conservation. We have applied our methodology to over 1300 whole cancer genomes and show that it identifies prominent coding and non-coding regions that are known or highly suspected to play a role in cancer. Importantly, we applied our method to the entire genome, rather than relying on predefined annotations (e.g. promoter regions) and we highlight recurrently mutated regions that may have resulted from increased exposure to mutational processes rather than selection, some of which have been identified previously as targets of selection. Finally, we implicate several pan-cancer and cancer-specific candidate non-coding regions, which could be involved in tumourigenesis. We have developed a framework to identify mutational hotspots in cancer genomes, which is applicable to the entire genome. This framework identifies known and novel coding and non-coding mutional hotspots and can be used to differentiate candidate driver regions from likely passenger regions susceptible to somatic mutation.
Variation in conserved non-coding sequences on chromosome 5q andsusceptibility to asthma and atopy

DOE Office of Scientific and Technical Information (OSTI.GOV)

Donfack, Joseph; Schneider, Daniel H.; Tan, Zheng

2005-09-10

Background: Evolutionarily conserved sequences likely havebiological function. Methods: To determine whether variation in conservedsequences in non-coding DNA contributes to risk for human disease, westudied six conserved non-coding elements in the Th2 cytokine cluster onhuman chromosome 5q31 in a large Hutterite pedigree and in samples ofoutbred European American and African American asthma cases and controls.Results: Among six conserved non-coding elements (>100 bp,>70percent identity; human-mouse comparison), we identified one singlenucleotide polymorphism (SNP) in each of two conserved elements and sixSNPs in the flanking regions of three conserved elements. We genotypedour samples for four of these SNPs and an additional three SNPs eachmore » inthe IL13 and IL4 genes. While there was only modest evidence forassociation with single SNPs in the Hutterite and European Americansamples (P<0.05), there were highly significant associations inEuropean Americans between asthma and haplotypes comprised of SNPs in theIL4 gene (P<0.001), including a SNP in a conserved non-codingelement. Furthermore, variation in the IL13 gene was strongly associatedwith total IgE (P = 0.00022) and allergic sensitization to mold allergens(P = 0.00076) in the Hutterites, and more modestly associated withsensitization to molds in the European Americans and African Americans (P<0.01). Conclusion: These results indicate that there is overalllittle variation in the conserved non-coding elements on 5q31, butvariation in IL4 and IL13, including possibly one SNP in a conservedelement, influence asthma and atopic phenotypes in diversepopulations.« less

Trichodesmium genome maintains abundant, widespread noncoding DNA in situ, despite oligotrophic lifestyle

DOE PAGES

Walworth, Nathan; Pfreundt, Ulrike; Nelson, William C.; ...

2015-03-23

Understanding the evolution of the free-living, cyanobacterial, diazotroph Trichodesmium is of great importance because of its critical role in oceanic biogeochemistry and primary production. Unlike the other >150 available genomes of free-living cyanobacteria, only 63.8% of the Trichodesmium erythraeum (strain IMS101) genome is predicted to encode protein, which is 20–25% less than the average for other cyanobacteria and nonpathogenic, free-living bacteria. In this paper, we use distinctive isolates and metagenomic data to show that low coding density observed in IMS101 is a common feature of the Trichodesmium genus, both in culture and in situ. Transcriptome analysis indicates that 86% ofmore » the noncoding space is expressed, although the function of these transcripts is unclear. The density of noncoding, possible regulatory elements predicted in Trichodesmium, when normalized per intergenic kilobase, was comparable and twofold higher than that found in the gene-dense genomes of the sympatric cyanobacterial genera Synechococcus and Prochlorococcus, respectively. Conserved Trichodesmium noncoding RNA secondary structures were predicted between most culture and metagenomic sequences, lending support to the structural conservation. Conservation of these intergenic regions in spatiotemporally separated Trichodesmium populations suggests possible genus-wide selection for their maintenance. These large intergenic spacers may have developed during intervals of strong genetic drift caused by periodic blooms of a subset of genotypes, which may have reduced effective population size. Finally, our data suggest that transposition of selfish DNA, low effective population size, and high-fidelity replication allowed the unusual “inflation” of noncoding sequence observed in Trichodesmium despite its oligotrophic lifestyle.« less
Crosstalk between the Notch signaling pathway and non-coding RNAs in gastrointestinal cancers

PubMed Central

Pan, Yangyang; Mao, Yuyan; Jin, Rong; Jiang, Lei

2018-01-01

The Notch signaling pathway is one of the main signaling pathways that mediates direct contact between cells, and is essential for normal development. It regulates various cellular processes, including cell proliferation, apoptosis, migration, invasion, angiogenesis and metastasis. It additionally serves an important function in tumor progression. Non-coding RNAs mainly include small microRNAs, long non-coding RNAs and circular RNAs. At present, a large body of literature supports the biological significance of non-coding RNAs in tumor progression. It is also becoming increasingly evident that cross-talk exists between Notch signaling and non-coding RNAs. The present review summarizes the current knowledge of Notch-mediated gastrointestinal cancer cell processes, and the effect of the crosstalk between the three major types of non-coding RNAs and the Notch signaling pathway on the fate of gastrointestinal cancer cells. PMID:29285185
Transcriptomic signatures in cartilage ageing

PubMed Central

2013-01-01

Introduction Age is an important factor in the development of osteoarthritis. Microarray studies provide insight into cartilage aging but do not reveal the full transcriptomic phenotype of chondrocytes such as small noncoding RNAs, pseudogenes, and microRNAs. RNA-Seq is a powerful technique for the interrogation of large numbers of transcripts including nonprotein coding RNAs. The aim of the study was to characterise molecular mechanisms associated with age-related changes in gene signatures. Methods RNA for gene expression analysis using RNA-Seq and real-time PCR analysis was isolated from macroscopically normal cartilage of the metacarpophalangeal joints of eight horses; four young donors (4 years old) and four old donors (>15 years old). RNA sequence libraries were prepared following ribosomal RNA depletion and sequencing was undertaken using the Illumina HiSeq 2000 platform. Differentially expressed genes were defined using Benjamini-Hochberg false discovery rate correction with a generalised linear model likelihood ratio test (P < 0.05, expression ratios ± 1.4 log2 fold-change). Ingenuity pathway analysis enabled networks, functional analyses and canonical pathways from differentially expressed genes to be determined. Results In total, the expression of 396 transcribed elements including mRNAs, small noncoding RNAs, pseudogenes, and a single microRNA was significantly different in old compared with young cartilage (± 1.4 log2 fold-change, P < 0.05). Of these, 93 were at higher levels in the older cartilage and 303 were at lower levels in the older cartilage. There was an over-representation of genes with reduced expression relating to extracellular matrix, degradative proteases, matrix synthetic enzymes, cytokines and growth factors in cartilage derived from older donors compared with young donors. In addition, there was a reduction in Wnt signalling in ageing cartilage. Conclusion There was an age-related dysregulation of matrix, anabolic and catabolic cartilage factors. This study has increased our knowledge of transcriptional networks in cartilage ageing by providing a global view of the transcriptome. PMID:23971731
An intergenic non-coding rRNA correlated with expression of the rRNA and frequency of an rRNA single nucleotide polymorphism in lung cancer cells.

PubMed

Shiao, Yih-Horng; Lupascu, Sorin T; Gu, Yuhan D; Kasprzak, Wojciech; Hwang, Christopher J; Fields, Janet R; Leighty, Robert M; Quiñones, Octavio; Shapiro, Bruce A; Alvord, W Gregory; Anderson, Lucy M

2009-10-19

Ribosomal RNA (rRNA) is a central regulator of cell growth and may control cancer development. A cis noncoding rRNA (nc-rRNA) upstream from the 45S rRNA transcription start site has recently been implicated in control of rRNA transcription in mouse fibroblasts. We investigated whether a similar nc-rRNA might be expressed in human cancer epithelial cells, and related to any genomic characteristics. Using quantitative rRNA measurement, we demonstrated that a nc-rRNA is transcribed in human lung epithelial and lung cancer cells, starting from approximately -1000 nucleotides upstream of the rRNA transcription start site (+1) and extending at least to +203. This nc-rRNA was significantly more abundant in the majority of lung cancer cell lines, relative to a nontransformed lung epithelial cell line. Its abundance correlated negatively with total 45S rRNA in 12 of 13 cell lines (P = 0.014). During sequence analysis from -388 to +306, we observed diverse, frequent intercopy single nucleotide polymorphisms (SNPs) in rRNA, with a frequency greater than predicted by chance at 12 sites. A SNP at +139 (U/C) in the 5' leader sequence varied among the cell lines and correlated negatively with level of the nc-rRNA (P = 0.014). Modelling of the secondary structure of the rRNA 5'-leader sequence indicated a small increase in structural stability due to the +139 U/C SNP and a minor shift in local configuration occurrences. The results demonstrate occurrence of a sense nc-rRNA in human lung epithelial and cancer cells, and imply a role in regulation of the rRNA gene, which may be affected by a +139 SNP in the 5' leader sequence of the primary rRNA transcript.
Conservation of σ28-Dependent Non-Coding RNA Paralogs and Predicted σ54-Dependent Targets in Thermophilic Campylobacter Species

PubMed Central

Le, My Thanh; van Veldhuizen, Mart; Porcelli, Ida; Bongaerts, Roy J.; Gaskin, Duncan J. H.; Pearson, Bruce M.; van Vliet, Arnoud H. M.

2015-01-01

Assembly of flagella requires strict hierarchical and temporal control via flagellar sigma and anti-sigma factors, regulatory proteins and the assembly complex itself, but to date non-coding RNAs (ncRNAs) have not been described to regulate genes directly involved in flagellar assembly. In this study we have investigated the possible role of two ncRNA paralogs (CjNC1, CjNC4) in flagellar assembly and gene regulation of the diarrhoeal pathogen Campylobacter jejuni. CjNC1 and CjNC4 are 37/44 nt identical and predicted to target the 5' untranslated region (5' UTR) of genes transcribed from the flagellar sigma factor σ54. Orthologs of the σ54-dependent 5' UTRs and ncRNAs are present in the genomes of other thermophilic Campylobacter species, and transcription of CjNC1 and CNC4 is dependent on the flagellar sigma factor σ28. Surprisingly, inactivation and overexpression of CjNC1 and CjNC4 did not affect growth, motility or flagella-associated phenotypes such as autoagglutination. However, CjNC1 and CjNC4 were able to mediate sequence-dependent, but Hfq-independent, partial repression of fluorescence of predicted target 5' UTRs in an Escherichia coli-based GFP reporter gene system. This hints towards a subtle role for the CjNC1 and CjNC4 ncRNAs in post-transcriptional gene regulation in thermophilic Campylobacter species, and suggests that the currently used phenotypic methodologies are insufficiently sensitive to detect such subtle phenotypes. The lack of a role of Hfq in the E. coli GFP-based system indicates that the CjNC1 and CjNC4 ncRNAs may mediate post-transcriptional gene regulation in ways that do not conform to the paradigms obtained from the Enterobacteriaceae. PMID:26512728
Systematic analysis and evolution of 5S ribosomal DNA in metazoans.

PubMed

Vierna, J; Wehner, S; Höner zu Siederdissen, C; Martínez-Lage, A; Marz, M

2013-11-01

Several studies on 5S ribosomal DNA (5S rDNA) have been focused on a subset of the following features in mostly one organism: number of copies, pseudogenes, secondary structure, promoter and terminator characteristics, genomic arrangements, types of non-transcribed spacers and evolution. In this work, we systematically analyzed 5S rDNA sequence diversity in available metazoan genomes, and showed organism-specific and evolutionary-conserved features. Putatively functional sequences (12,766) from 97 organisms allowed us to identify general features of this multigene family in animals. Interestingly, we show that each mammal species has a highly conserved (housekeeping) 5S rRNA type and many variable ones. The genomic organization of 5S rDNA is still under debate. Here, we report the occurrence of several paralog 5S rRNA sequences in 58 of the examined species, and a flexible genome organization of 5S rDNA in animals. We found heterogeneous 5S rDNA clusters in several species, supporting the hypothesis of an exchange of 5S rDNA from one locus to another. A rather high degree of variation of upstream, internal and downstream putative regulatory regions appears to characterize metazoan 5S rDNA. We systematically studied the internal promoters and described three different types of termination signals, as well as variable distances between the coding region and the typical termination signal. Finally, we present a statistical method for detection of linkage among noncoding RNA (ncRNA) gene families. This method showed no evolutionary-conserved linkage among 5S rDNAs and any other ncRNA genes within Metazoa, even though we found 5S rDNA to be linked to various ncRNAs in several clades.
Systematic analysis and evolution of 5S ribosomal DNA in metazoans

PubMed Central

Vierna, J; Wehner, S; Höner zu Siederdissen, C; Martínez-Lage, A; Marz, M

2013-01-01

Several studies on 5S ribosomal DNA (5S rDNA) have been focused on a subset of the following features in mostly one organism: number of copies, pseudogenes, secondary structure, promoter and terminator characteristics, genomic arrangements, types of non-transcribed spacers and evolution. In this work, we systematically analyzed 5S rDNA sequence diversity in available metazoan genomes, and showed organism-specific and evolutionary-conserved features. Putatively functional sequences (12 766) from 97 organisms allowed us to identify general features of this multigene family in animals. Interestingly, we show that each mammal species has a highly conserved (housekeeping) 5S rRNA type and many variable ones. The genomic organization of 5S rDNA is still under debate. Here, we report the occurrence of several paralog 5S rRNA sequences in 58 of the examined species, and a flexible genome organization of 5S rDNA in animals. We found heterogeneous 5S rDNA clusters in several species, supporting the hypothesis of an exchange of 5S rDNA from one locus to another. A rather high degree of variation of upstream, internal and downstream putative regulatory regions appears to characterize metazoan 5S rDNA. We systematically studied the internal promoters and described three different types of termination signals, as well as variable distances between the coding region and the typical termination signal. Finally, we present a statistical method for detection of linkage among noncoding RNA (ncRNA) gene families. This method showed no evolutionary-conserved linkage among 5S rDNAs and any other ncRNA genes within Metazoa, even though we found 5S rDNA to be linked to various ncRNAs in several clades. PMID:23838690
Separating the wheat from the chaff: systematic identification of functionally relevant noncoding variants in ADHD.

PubMed

Tong, J H S; Hawi, Z; Dark, C; Cummins, T D R; Johnson, B P; Newman, D P; Lau, R; Vance, A; Heussler, H S; Matthews, N; Bellgrove, M A; Pang, K C

2016-11-01

Attention deficit hyperactivity disorder (ADHD) is a highly heritable psychiatric condition with negative lifetime outcomes. Uncovering its genetic architecture should yield important insights into the neurobiology of ADHD and assist development of novel treatment strategies. Twenty years of candidate gene investigations and more recently genome-wide association studies have identified an array of potential association signals. In this context, separating the likely true from false associations ('the wheat' from 'the chaff') will be crucial for uncovering the functional biology of ADHD. Here, we defined a set of 2070 DNA variants that showed evidence of association with ADHD (or were in linkage disequilibrium). More than 97% of these variants were noncoding, and were prioritised for further exploration using two tools-genome-wide annotation of variants (GWAVA) and Combined Annotation-Dependent Depletion (CADD)-that were recently developed to rank variants based upon their likely pathogenicity. Capitalising on recent efforts such as the Encyclopaedia of DNA Elements and US National Institutes of Health Roadmap Epigenomics Projects to improve understanding of the noncoding genome, we subsequently identified 65 variants to which we assigned functional annotations, based upon their likely impact on alternative splicing, transcription factor binding and translational regulation. We propose that these 65 variants, which possess not only a high likelihood of pathogenicity but also readily testable functional hypotheses, represent a tractable shortlist for future experimental validation in ADHD. Taken together, this study brings into sharp focus the likely relevance of noncoding variants for the genetic risk associated with ADHD, and more broadly suggests a bioinformatics approach that should be relevant to other psychiatric disorders.
Functional annotation of the vlinc class of non-coding RNAs using systems biology approach

PubMed Central

Laurent, Georges St.; Vyatkin, Yuri; Antonets, Denis; Ri, Maxim; Qi, Yao; Saik, Olga; Shtokalo, Dmitry; de Hoon, Michiel J.L.; Kawaji, Hideya; Itoh, Masayoshi; Lassmann, Timo; Arner, Erik; Forrest, Alistair R.R.; Nicolas, Estelle; McCaffrey, Timothy A.; Carninci, Piero; Hayashizaki, Yoshihide; Wahlestedt, Claes; Kapranov, Philipp

2016-01-01

Functionality of the non-coding transcripts encoded by the human genome is the coveted goal of the modern genomics research. While commonly relied on the classical methods of forward genetics, integration of different genomics datasets in a global Systems Biology fashion presents a more productive avenue of achieving this very complex aim. Here we report application of a Systems Biology-based approach to dissect functionality of a newly identified vast class of very long intergenic non-coding (vlinc) RNAs. Using highly quantitative FANTOM5 CAGE dataset, we show that these RNAs could be grouped into 1542 novel human genes based on analysis of insulators that we show here indeed function as genomic barrier elements. We show that vlincRNAs genes likely function in cis to activate nearby genes. This effect while most pronounced in closely spaced vlincRNA–gene pairs can be detected over relatively large genomic distances. Furthermore, we identified 101 vlincRNA genes likely involved in early embryogenesis based on patterns of their expression and regulation. We also found another 109 such genes potentially involved in cellular functions also happening at early stages of development such as proliferation, migration and apoptosis. Overall, we show that Systems Biology-based methods have great promise for functional annotation of non-coding RNAs. PMID:27001520
T cells are influenced by a long non-coding RNA in the autoimmune associated PTPN2 locus.

PubMed

Houtman, Miranda; Shchetynsky, Klementy; Chemin, Karine; Hensvold, Aase Haj; Ramsköld, Daniel; Tandre, Karolina; Eloranta, Maija-Leena; Rönnblom, Lars; Uebe, Steffen; Catrina, Anca Irinel; Malmström, Vivianne; Padyukov, Leonid

2018-06-01

Non-coding SNPs in the protein tyrosine phosphatase non-receptor type 2 (PTPN2) locus have been linked with several autoimmune diseases, including rheumatoid arthritis, type I diabetes, and inflammatory bowel disease. However, the functional consequences of these SNPs are poorly characterized. Herein, we show in blood cells that SNPs in the PTPN2 locus are highly correlated with DNA methylation levels at four CpG sites downstream of PTPN2 and expression levels of the long non-coding RNA (lncRNA) LINC01882 downstream of these CpG sites. We observed that LINC01882 is mainly expressed in T cells and that anti-CD3/CD28 activated naïve CD4 + T cells downregulate the expression of LINC01882. RNA sequencing analysis of LINC01882 knockdown in Jurkat T cells, using a combination of antisense oligonucleotides and RNA interference, revealed the upregulation of the transcription factor ZEB1 and kinase MAP2K4, both involved in IL-2 regulation. Overall, our data suggests the involvement of LINC01882 in T cell activation and hints towards an auxiliary role of these non-coding SNPs in autoimmunity associated with the PTPN2 locus. Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.
Comparisons between Arabidopsis thaliana and Drosophila melanogaster in relation to Coding and Noncoding Sequence Length and Gene Expression

PubMed Central

Caldwell, Rachel; Lin, Yan-Xia; Zhang, Ren

2015-01-01

There is a continuing interest in the analysis of gene architecture and gene expression to determine the relationship that may exist. Advances in high-quality sequencing technologies and large-scale resource datasets have increased the understanding of relationships and cross-referencing of expression data to the large genome data. Although a negative correlation between expression level and gene (especially transcript) length has been generally accepted, there have been some conflicting results arising from the literature concerning the impacts of different regions of genes, and the underlying reason is not well understood. The research aims to apply quantile regression techniques for statistical analysis of coding and noncoding sequence length and gene expression data in the plant, Arabidopsis thaliana, and fruit fly, Drosophila melanogaster, to determine if a relationship exists and if there is any variation or similarities between these species. The quantile regression analysis found that the coding sequence length and gene expression correlations varied, and similarities emerged for the noncoding sequence length (5′ and 3′ UTRs) between animal and plant species. In conclusion, the information described in this study provides the basis for further exploration into gene regulation with regard to coding and noncoding sequence length. PMID:26114098
Chromosome preference of disease genes and vectorization for the prediction of non-coding disease genes.

PubMed

Peng, Hui; Lan, Chaowang; Liu, Yuansheng; Liu, Tao; Blumenstein, Michael; Li, Jinyan

2017-10-03

Disease-related protein-coding genes have been widely studied, but disease-related non-coding genes remain largely unknown. This work introduces a new vector to represent diseases, and applies the newly vectorized data for a positive-unlabeled learning algorithm to predict and rank disease-related long non-coding RNA (lncRNA) genes. This novel vector representation for diseases consists of two sub-vectors, one is composed of 45 elements, characterizing the information entropies of the disease genes distribution over 45 chromosome substructures. This idea is supported by our observation that some substructures (e.g., the chromosome 6 p-arm) are highly preferred by disease-related protein coding genes, while some (e.g., the 21 p-arm) are not favored at all. The second sub-vector is 30-dimensional, characterizing the distribution of disease gene enriched KEGG pathways in comparison with our manually created pathway groups. The second sub-vector complements with the first one to differentiate between various diseases. Our prediction method outperforms the state-of-the-art methods on benchmark datasets for prioritizing disease related lncRNA genes. The method also works well when only the sequence information of an lncRNA gene is known, or even when a given disease has no currently recognized long non-coding genes.
Chromosome preference of disease genes and vectorization for the prediction of non-coding disease genes

PubMed Central

Peng, Hui; Lan, Chaowang; Liu, Yuansheng; Liu, Tao; Blumenstein, Michael; Li, Jinyan

2017-01-01

Disease-related protein-coding genes have been widely studied, but disease-related non-coding genes remain largely unknown. This work introduces a new vector to represent diseases, and applies the newly vectorized data for a positive-unlabeled learning algorithm to predict and rank disease-related long non-coding RNA (lncRNA) genes. This novel vector representation for diseases consists of two sub-vectors, one is composed of 45 elements, characterizing the information entropies of the disease genes distribution over 45 chromosome substructures. This idea is supported by our observation that some substructures (e.g., the chromosome 6 p-arm) are highly preferred by disease-related protein coding genes, while some (e.g., the 21 p-arm) are not favored at all. The second sub-vector is 30-dimensional, characterizing the distribution of disease gene enriched KEGG pathways in comparison with our manually created pathway groups. The second sub-vector complements with the first one to differentiate between various diseases. Our prediction method outperforms the state-of-the-art methods on benchmark datasets for prioritizing disease related lncRNA genes. The method also works well when only the sequence information of an lncRNA gene is known, or even when a given disease has no currently recognized long non-coding genes. PMID:29108274
Regulation of mammalian cell differentiation by long non-coding RNAs

PubMed Central

Hu, Wenqian; Alvarez-Dominguez, Juan R; Lodish, Harvey F

2012-01-01

Differentiation of specialized cell types from stem and progenitor cells is tightly regulated at several levels, both during development and during somatic tissue homeostasis. Many long non-coding RNAs have been recognized as an additional layer of regulation in the specification of cellular identities; these non-coding species can modulate gene-expression programmes in various biological contexts through diverse mechanisms at the transcriptional, translational or messenger RNA stability levels. Here, we summarize findings that implicate long non-coding RNAs in the control of mammalian cell differentiation. We focus on several representative differentiation systems and discuss how specific long non-coding RNAs contribute to the regulation of mammalian development. PMID:23070366
The Most Deeply Conserved Noncoding Sequences in Plants Serve Similar Functions to Those in Vertebrates Despite Large Differences in Evolutionary Rates[W

PubMed Central

Burgess, Diane; Freeling, Michael

2014-01-01

In vertebrates, conserved noncoding elements (CNEs) are functionally constrained sequences that can show striking conservation over >400 million years of evolutionary distance and frequently are located megabases away from target developmental genes. Conserved noncoding sequences (CNSs) in plants are much shorter, and it has been difficult to detect conservation among distantly related genomes. In this article, we show not only that CNS sequences can be detected throughout the eudicot clade of flowering plants, but also that a subset of 37 CNSs can be found in all flowering plants (diverging ∼170 million years ago). These CNSs are functionally similar to vertebrate CNEs, being highly associated with transcription factor and development genes and enriched in transcription factor binding sites. Some of the most highly conserved sequences occur in genes encoding RNA binding proteins, particularly the RNA splicing–associated SR genes. Differences in sequence conservation between plants and animals are likely to reflect differences in the biology of the organisms, with plants being much more able to tolerate genomic deletions and whole-genome duplication events due, in part, to their far greater fecundity compared with vertebrates. PMID:24681619
Long Noncoding RNA H19 Inhibits Cell Viability, Migration, and Invasion Via Downregulation of IRS-1 in Thyroid Cancer Cells

PubMed Central

Wang, Peng; Xu, Weimin; Liu, Haixia; Bu, Qingao; Sun, Diwen

2017-01-01

Thyroid cancer is a common endocrine gland malignancy which exhibited rapid increased incidence worldwide in recent decades. This study was aimed to investigate the role of long noncoding RNA H19 in thyroid cancer. Long noncoding RNA H19 was overexpressed or knockdown in thyroid cancer cells SW579 and TPC-1, and the expression of long noncoding RNA H19 was detected by real-time polymerase chain reaction. The cell viability, migration, and invasion were determined by 3-(4, 5-dimethyl-2-thiazolyl)-2, 5-diphenyl-2-H-tetrazolium bromide assay, Transwell assay, and wound healing assay, respectively. Furthermore, cell apoptosis was analyzed by flow cytometry, and expressions of some factors that were related to phosphatidyl inositide 3-kinases/protein kinase B and nuclear factor κB signal pathway were measured by Western blotting. This study revealed that cell viability and migration/invasion of SW579 and TPC-1 were significantly decreased by long noncoding RNA H19 overexpression compared with the control group (P < .05), whereas cell apoptosis was statistically increased (P < .001). Meanwhile, cell viability and migration/invasion were significantly increased after long noncoding RNA H19 knockdown (P < .05). Furthermore, long noncoding RNA H19 negatively regulated the expression of insulin receptor substrate 1 and thus effect on cell proliferation and apoptosis. Insulin receptor substrate 1 regulated the activation of phosphatidyl inositide 3-kinases/AKT and nuclear factor κB signal pathways. In conclusion, long noncoding RNA H19 could suppress cell viability, migration, and invasion via downregulation of insulin receptor substrate 1 in SW579 and TPC-1 cells. These results suggested the important role of long noncoding RNA H19 in thyroid cancer, and long noncoding RNA H19 might be a potential target of thyroid cancer treatment. PMID:29332545
Non-coding cancer driver candidates identified with a sample- and position-specific model of the somatic mutation rate

PubMed Central

Juul, Malene; Bertl, Johanna; Guo, Qianyun; Nielsen, Morten Muhlig; Świtnicki, Michał; Hornshøj, Henrik; Madsen, Tobias; Hobolth, Asger; Pedersen, Jakob Skou

2017-01-01

Non-coding mutations may drive cancer development. Statistical detection of non-coding driver regions is challenged by a varying mutation rate and uncertainty of functional impact. Here, we develop a statistically founded non-coding driver-detection method, ncdDetect, which includes sample-specific mutational signatures, long-range mutation rate variation, and position-specific impact measures. Using ncdDetect, we screened non-coding regulatory regions of protein-coding genes across a pan-cancer set of whole-genomes (n = 505), which top-ranked known drivers and identified new candidates. For individual candidates, presence of non-coding mutations associates with altered expression or decreased patient survival across an independent pan-cancer sample set (n = 5454). This includes an antigen-presenting gene (CD1A), where 5’UTR mutations correlate significantly with decreased survival in melanoma. Additionally, mutations in a base-excision-repair gene (SMUG1) correlate with a C-to-T mutational-signature. Overall, we find that a rich model of mutational heterogeneity facilitates non-coding driver identification and integrative analysis points to candidates of potential clinical relevance. DOI: http://dx.doi.org/10.7554/eLife.21778.001 PMID:28362259
The Inescapable Influence of Noncoding RNAs in Cancer

PubMed Central

Adams, Brian D.; Anastasiadou, Eleni; Esteller, Manel; He, Lin; Slack, Frank J.

2015-01-01

This report summarizes information presented at the 2015 Keystone Symposium on “MicroRNAs and Noncoding RNAs in Cancer”. Nearly two decades after the discovery of the first microRNA (miRNA), the role of noncoding RNAs in developmental processes and the mechanisms behind their dysregulation in cancer has been steadily elucidated. Excitingly, miRNAs have begun making their way into the clinic to combat disease such a hepatitis C, and various forms of cancer. Therefore, at this Keystone meeting novel findings were presented that enhance our view on how small and long noncoding RNAs control developmental timing and oncogenic processes. Recurring themes included, 1) how miRNAs can be differentially processed, degraded, and regulated by ribonucleoprotein (RNP) complexes, 2) how particular miRNA genetic networks that control developmental process, when disrupted, can result in cancer disease, 3) the technologies available to therapeutically deliver RNA to combat diseases such as cancer, and 4) the elucidation of the mechanism of actions for long noncoding RNAs, currently a poorly understood class of noncoding RNA. During the meeting there was an emphasis on presenting unpublished findings, and the breadth of topics covered reflected how inescapable the influence of noncoding RNAs are in development and cancer. PMID:26567137
The Intolerance of Regulatory Sequence to Genetic Variation Predicts Gene Dosage Sensitivity

PubMed Central

Wang, Quanli; Halvorsen, Matt; Han, Yujun; Weir, William H.; Allen, Andrew S.; Goldstein, David B.

2015-01-01

Noncoding sequence contains pathogenic mutations. Yet, compared with mutations in protein-coding sequence, pathogenic regulatory mutations are notoriously difficult to recognize. Most fundamentally, we are not yet adept at recognizing the sequence stretches in the human genome that are most important in regulating the expression of genes. For this reason, it is difficult to apply to the regulatory regions the same kinds of analytical paradigms that are being successfully applied to identify mutations among protein-coding regions that influence risk. To determine whether dosage sensitive genes have distinct patterns among their noncoding sequence, we present two primary approaches that focus solely on a gene’s proximal noncoding regulatory sequence. The first approach is a regulatory sequence analogue of the recently introduced residual variation intolerance score (RVIS), termed noncoding RVIS, or ncRVIS. The ncRVIS compares observed and predicted levels of standing variation in the regulatory sequence of human genes. The second approach, termed ncGERP, reflects the phylogenetic conservation of a gene’s regulatory sequence using GERP++. We assess how well these two approaches correlate with four gene lists that use different ways to identify genes known or likely to cause disease through changes in expression: 1) genes that are known to cause disease through haploinsufficiency, 2) genes curated as dosage sensitive in ClinGen’s Genome Dosage Map, 3) genes judged likely to be under purifying selection for mutations that change expression levels because they are statistically depleted of loss-of-function variants in the general population, and 4) genes judged unlikely to cause disease based on the presence of copy number variants in the general population. We find that both noncoding scores are highly predictive of dosage sensitivity using any of these criteria. In a similar way to ncGERP, we assess two ensemble-based predictors of regional noncoding importance, ncCADD and ncGWAVA, and find both scores are significantly predictive of human dosage sensitive genes and appear to carry information beyond conservation, as assessed by ncGERP. These results highlight that the intolerance of noncoding sequence stretches in the human genome can provide a critical complementary tool to other genome annotation approaches to help identify the parts of the human genome increasingly likely to harbor mutations that influence risk of disease. PMID:26332131
TAS3 miR390-dependent loci in non-vascular land plants: towards a comprehensive reconstruction of the gene evolutionary history.

PubMed

Morozov, Sergey Y; Milyutina, Irina A; Erokhina, Tatiana N; Ozerova, Liudmila V; Troitsky, Alexey V; Solovyev, Andrey G

2018-01-01

Trans-acting small interfering RNAs (ta-siRNAs) are transcribed from protein non-coding genomic TAS loci and belong to a plant-specific class of endogenous small RNAs. These siRNAs have been found to regulate gene expression in most taxa including seed plants, gymnosperms, ferns and mosses. In this study, bioinformatic and experimental PCR-based approaches were used as tools to analyze TAS3 and TAS6 loci in transcriptomes and genomic DNAs from representatives of evolutionary distant non-vascular plant taxa such as Bryophyta, Marchantiophyta and Anthocerotophyta. We revealed previously undiscovered TAS3 loci in plant classes Sphagnopsida and Anthocerotopsida, as well as TAS6 loci in Bryophyta classes Tetraphidiopsida, Polytrichopsida, Andreaeopsida and Takakiopsida. These data further unveil the evolutionary pathway of the miR390-dependent TAS3 loci in land plants. We also identified charophyte alga sequences coding for SUPPRESSOR OF GENE SILENCING 3 (SGS3), which is required for generation of ta-siRNAs in plants, and hypothesized that the appearance of TAS3-related sequences could take place at a very early step in evolutionary transition from charophyte algae to an earliest common ancestor of land plants.

Heat shock represses rRNA synthesis by inactivation of TIF-IA and lncRNA-dependent changes in nucleosome positioning.

PubMed

Zhao, Zhongliang; Dammert, Marcel A; Hoppe, Sven; Bierhoff, Holger; Grummt, Ingrid

2016-09-30

Attenuation of ribosome biogenesis in suboptimal growth environments is crucial for cellular homeostasis and genetic integrity. Here, we show that shutdown of rRNA synthesis in response to elevated temperature is brought about by mechanisms that target both the RNA polymerase I (Pol I) transcription machinery and the epigenetic signature of the rDNA promoter. Upon heat shock, the basal transcription factor TIF-IA is inactivated by inhibition of CK2-dependent phosphorylations at Ser170/172. Attenuation of pre-rRNA synthesis in response to heat stress is accompanied by upregulation of PAPAS, a long non-coding RNA (lncRNA) that is transcribed in antisense orientation to pre-rRNA. PAPAS interacts with CHD4, the adenosine triphosphatase subunit of NuRD, leading to deacetylation of histones and movement of the promoter-bound nucleosome into a position that is refractory to transcription initiation. The results exemplify how stress-induced inactivation of TIF-IA and lncRNA-dependent changes of chromatin structure ensure repression of rRNA synthesis in response to thermo-stress. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
The RNAs of RNA-directed DNA methylation

PubMed Central

Wendte, Jered M.; Pikaard, Craig S.

2016-01-01

Summary RNA-directed chromatin modification that includes cytosine methylation silences transposable elements in both plants and mammals, contributing to genome defense and stability. In Arabidopsis thaliana, most RNA-directed DNA methylation (RdDM) is guided by small RNAs derived from double-stranded precursors synthesized at cytosine-methylated loci by nuclear multisubunit RNA Polymerase IV (Pol IV), in close partnership with the RNA-dependent RNA polymerase, RDR2. These small RNAs help keep transposons transcriptionally inactive. However, if transposons escape silencing, and are transcribed by multisubunit RNA polymerase II (Pol II), their mRNAs can be recognized and degraded, generating small RNAs that can also guide initial DNA methylation, thereby enabling subsequent Pol IV-RDR2 recruitment. In both pathways, the small RNAs find their target sites by interacting with longer noncoding RNAs synthesized by multisubunit RNA Polymerase V (Pol V). Despite a decade of progress, numerous questions remain concerning the initiation, synthesis, processing, size and features of the RNAs that drive RdDM. Here, we review recent insights, questions and controversies concerning RNAs produced by Pols IV and V, and their functions in RdDM. We also provide new data concerning Pol V transcript 5’ and 3’ ends. PMID:27521981
TERRA and the histone methyltransferase Dot1 cooperate to regulate senescence in budding yeast

PubMed Central

Wanat, Jennifer J.; Logsdon, Glennis A.; Driskill, Jordan H.; Deng, Zhong; Lieberman, Paul M.

2018-01-01

The events underlying senescence induced by critical telomere shortening are not fully understood. Here we provide evidence that TERRA, a non-coding RNA transcribed from subtelomeres, contributes to senescence in yeast lacking telomerase (tlc1Δ). Levels of TERRA expressed from multiple telomere ends appear elevated at senescence, and expression of an artificial RNA complementary to TERRA (anti-TERRA) binds TERRA in vivo and delays senescence. Anti-TERRA acts independently from several other mechanisms known to delay senescence, including those elicited by deletions of EXO1, TEL1, SAS2, and genes encoding RNase H enzymes. Further, it acts independently of the senescence delay provided by RAD52-dependent recombination. However, anti-TERRA delays senescence in a fashion epistatic to inactivation of the conserved histone methyltransferase Dot1. Dot1 associates with TERRA, and anti-TERRA disrupts this interaction in vitro and in vivo. Surprisingly, the anti-TERRA delay is independent of the C-terminal methyltransferase domain of Dot1 and instead requires only its N-terminus, which was previously found to facilitate release of telomeres from the nuclear periphery. Together, these data suggest that TERRA and Dot1 cooperate to drive senescence. PMID:29649255
Noncoding Genomics in Gastric Cancer and the Gastric Precancerous Cascade: Pathogenesis and Biomarkers

PubMed Central

Garcia-Bloj, Benjamin; Fry, Jacqueline; Wichmann, Ignacio

2015-01-01

Gastric cancer is the fifth most common cancer and the third leading cause of cancer-related death, whose patterns vary among geographical regions and ethnicities. It is a multifactorial disease, and its development depends on infection by Helicobacter pylori (H. pylori) and Epstein-Barr virus (EBV), host genetic factors, and environmental factors. The heterogeneity of the disease has begun to be unraveled by a comprehensive mutational evaluation of primary tumors. The low-abundance of mutations suggests that other mechanisms participate in the evolution of the disease, such as those found through analyses of noncoding genomics. Noncoding genomics includes single nucleotide polymorphisms (SNPs), regulation of gene expression through DNA methylation of promoter sites, miRNAs, other noncoding RNAs in regulatory regions, and other topics. These processes and molecules ultimately control gene expression. Potential biomarkers are appearing from analyses of noncoding genomics. This review focuses on noncoding genomics and potential biomarkers in the context of gastric cancer and the gastric precancerous cascade. PMID:26379360
Pan-cancer transcriptomic analysis associates long non-coding RNAs with key mutational driver events

PubMed Central

Ashouri, Arghavan; Sayin, Volkan I.; Van den Eynden, Jimmy; Singh, Simranjit X.; Papagiannakopoulos, Thales; Larsson, Erik

2016-01-01

Thousands of long non-coding RNAs (lncRNAs) lie interspersed with coding genes across the genome, and a small subset has been implicated as downstream effectors in oncogenic pathways. Here we make use of transcriptome and exome sequencing data from thousands of tumours across 19 cancer types, to identify lncRNAs that are induced or repressed in relation to somatic mutations in key oncogenic driver genes. Our screen confirms known coding and non-coding effectors and also associates many new lncRNAs to relevant pathways. The associations are often highly reproducible across cancer types, and while many lncRNAs are co-expressed with their protein-coding hosts or neighbours, some are intergenic and independent. We highlight lncRNAs with possible functions downstream of the tumour suppressor TP53 and the master antioxidant transcription factor NFE2L2. Our study provides a comprehensive overview of lncRNA transcriptional alterations in relation to key driver mutational events in human cancers. PMID:28959951
Long noncoding RNA DANCR promotes colorectal cancer proliferation and metastasis via miR-577 sponging.

PubMed

Wang, Yong; Lu, Zhi; Wang, Ningnin; Feng, Jianzhou; Zhang, Junjie; Luan, Lan; Zhao, Wei; Zeng, Xiandong

2018-05-01

Long non-coding RNAs (lncRNAs) play key roles in various malignant tumors, including colorectal cancer (CRC). Long non-coding RNA differentiation antagonizing non-protein coding RNA (DANCR) is overexpressed in CRC patients, but whether it affects CRC proliferation and metastasis via regulation of heat shock protein 27 (HSP27) remains unclear. In the present study, we found that DANCR was highly expressed and correlated with proliferation and metastasis in CRC. In addition, we demonstrated that DANCR and HSP27 were both targets of microRNA-577 (miR-577) and shared the same binding site. Furthermore, we revealed that DANCR promoted HSP27 expression and its mediation of proliferation/metastasis via miR-577 sponging. Finally, using an in vivo study, we confirmed that overexpression of DANCR promoted CRC tumor growth and liver metastasis. The present study demonstrated the function of DANCR in CRC and might provide a new target in the treatment of CRC.
Long Noncoding RNAs as a Key Player in Hepatocellular Carcinoma

PubMed Central

Mehra, Mrigaya; Chauhan, Ranjit

2017-01-01

Hepatocellular carcinoma (HCC) is a major malignancy in the liver and has emerged as one of the main cancers in the world with a high mortality rate. However, the molecular mechanisms of HCC are still poorly understood. Long noncoding RNAs (lncRNAs) have recently come to the forefront as functional non–protein-coding RNAs that are involved in a variety of cellular processes ranging from maintaining the structural integrity of chromosomes to gene expression regulation in a spatiotemporal manner. Many recent studies have reported the involvement of lncRNAs in HCC which has led to a better understanding of the underlying molecular mechanisms operating in HCC. Long noncoding RNAs have been shown to regulate development and progression of HCC, and thus, lncRNAs have both diagnostic and therapeutic potentials. In this review, we present an overview of the lncRNAs involved in different stages of HCC and their potential in clinical applications which have been studied so far. PMID:29147078
Insights into inner ear-specific gene regulation: epigenetics and non-coding RNAs in inner ear development and regeneration

PubMed Central

Avraham, Karen B.

2016-01-01

The vertebrate inner ear houses highly specialized sensory organs, tuned to detect and encode sound, head motion and gravity. Gene expression programs under the control of transcription factors orchestrate the formation and specialization of the non-sensory inner ear labyrinth and its sensory constituents. More recently, epigenetic factors and non-coding RNAs emerged as an additional layer of gene regulation, both in inner ear development and disease. In this review, we provide an overview on how epigenetic modifications and non-coding RNAs, in particular microRNAs (miRNAs), influence gene expression and summarize recent discoveries that highlight their critical role in the proper formation of the inner ear labyrinth and its sensory organs. In contrast to non-mammalian vertebrates, adult mammals lack the ability to regenerate inner ear mechano-sensory hair cells. Finally, we discuss recent insights into how epigenetic factors and miRNAs may facilitate, or in the case of mammals, restrict sensory hair cell regeneration. PMID:27836639
Non-coding glucometers among pediatric patients with diabetes: looking for the target population and an accuracy evaluation of no-coding personal glucometer.

PubMed

Fendler, Wojciech; Hogendorf, Anna; Szadkowska, Agnieszka; Młynarski, Wojciech

2011-01-01

Self-monitoring of blood glucose (SMBG) is one of the cornerstones of diabetes management. To evaluate the potential for miscoding of a personal glucometer, to define a target population among pediatric patients with diabetes for a non-coding glucometer and the accuracy of the Contour TS non-coding system. Potential for miscoding during self-monitoring of blood glucose was evaluated by means of an anonymous questionnaire, with worst and best case scenarios evaluated depending on the responses pattern. Testing of the Contour TS system was performed according to guidelines set by the national committee for clinical laboratory standards. Estimated frequency of individuals prone to non-coding ranged from 68.21% (95% 60.70- 75.72%) to 7.95% (95%CI 3.86-12.31%) for the worse and best case scenarios respectively. Factors associated with increased likelihood of non-coding were: a smaller number of tests per day, a greater number of individuals involved in testing and self-testing by the patient with diabetes. The Contour TS device showed intra- and inter-assay accuracy -95%, linear association with laboratory measurements (R2=0.99, p <0.0001) and consistent, but small bias of -1.12% (95% Confidence Interval -3.27 to 1.02%). Clarke error grid analysis showed 4% of values within the benign error zone (B) with the other measurements yielding an acceptably accurate result (zone A). The Contour TS system showed sufficient accuracy to be safely used in monitoring of pediatric diabetic patients. Patients from families with a high throughput of test-strips or multiple individuals involved in SMBG using the same meter are candidates for clinical use of such devices due to an increased risk of calibration errors.
Trans-packaging of human immunodeficiency virus type 1 genome into Gag virus-like particles in Saccharomyces cerevisiae.

PubMed

Tomo, Naoki; Goto, Toshiyuki; Morikawa, Yuko

2013-03-26

Yeast is recognized as a generally safe microorganism and is utilized for the production of pharmaceutical products, including vaccines. We previously showed that expression of human immunodeficiency virus type 1 (HIV-1) Gag protein in Saccharomyces cerevisiae spheroplasts released Gag virus-like particles (VLPs) extracellularly, suggesting that the production system could be used in vaccine development. In this study, we further establish HIV-1 genome packaging into Gag VLPs in a yeast cell system. The nearly full-length HIV-1 genome containing the entire 5' long terminal repeat, U3-R-U5, did not transcribe gag mRNA in yeast. Co-expression of HIV-1 Tat, a transcription activator, did not support the transcription. When the HIV-1 promoter U3 was replaced with the promoter for the yeast glyceraldehyde-3-phosphate dehydrogenase gene, gag mRNA transcription was restored, but no Gag protein expression was observed. Co-expression of HIV-1 Rev, a factor that facilitates nuclear export of gag mRNA, did not support the protein synthesis. Progressive deletions of R-U5 and its downstream stem-loop-rich region (SL) to the gag start ATG codon restored Gag protein expression, suggesting that a highly structured noncoding RNA generated from the R-U5-SL region had an inhibitory effect on gag mRNA translation. When a plasmid containing the HIV-1 genome with the R-U5-SL region was coexpressed with an expression plasmid for Gag protein, the HIV-1 genomic RNA was transcribed and incorporated into Gag VLPs formed by Gag protein assembly, indicative of the trans-packaging of HIV-1 genomic RNA into Gag VLPs in a yeast cell system. The concentration of HIV-1 genomic RNA in Gag VLPs released from yeast was approximately 500-fold higher than that in yeast cytoplasm. The deletion of R-U5 to the gag gene resulted in the failure of HIV-1 RNA packaging into Gag VLPs, indicating that the packaging signal of HIV-1 genomic RNA present in the R-U5 to gag region functions similarly in yeast cells. Our data indicate that selective trans-packaging of HIV-1 genomic RNA into Gag VLPs occurs in a yeast cell system, analogous to a mammalian cell system, suggesting that yeast may provide an alternative packaging system for lentiviral RNA.
Long non-coding RNA nuclear paraspeckle assembly transcript 1 inhibits the apoptosis of retina Müller cells after diabetic retinopathy through regulating miR-497/brain-derived neurotrophic factor axis.

PubMed

Li, Xiu-Juan

2018-05-01

The role of long non-coding RNA in diabetic retinopathy, a serious complication of diabetes mellitus, has attracted increasing attention in recent years. The purpose of this study was to explore whether long non-coding RNA nuclear paraspeckle assembly transcript 1 was involved in the context of diabetic retinopathy and its underlying mechanisms. Our results revealed that nuclear paraspeckle assembly transcript 1 was significantly downregulated in the retina of diabetes mellitus rats. Meanwhile, miR-497 was significantly increased in diabetes mellitus rats' retina and high glucose-treated Müller cells, but brain-derived neurotrophic factor was increased. We also found that high glucose-induced apoptosis of Müller cells was accompanied by the significant downregulation of nuclear paraspeckle assembly transcript 1 in vitro. Further study demonstrated that high glucose-promoted Müller cells apoptosis through downregulating nuclear paraspeckle assembly transcript 1 and downregulated nuclear paraspeckle assembly transcript 1 mediated this effect via negative regulating miR-497. Moreover, brain-derived neurotrophic factor was negatively regulated by miR-497 and associated with the apoptosis of Müller cells under high glucose. Our results suggested that under diabetic conditions, downregulated nuclear paraspeckle assembly transcript 1 decreased the expression of brain-derived neurotrophic factor through elevating miR-497, thereby promoting Müller cells apoptosis and aggravating diabetic retinopathy.
Whole-genome sequencing identifies EN1 as a determinant of bone density and fracture

PubMed Central

Zheng, Hou-Feng; Forgetta, Vincenzo; Hsu, Yi-Hsiang; Estrada, Karol; Rosello-Diez, Alberto; Leo, Paul J; Dahia, Chitra L; Park-Min, Kyung Hyun; Tobias, Jonathan H; Kooperberg, Charles; Kleinman, Aaron; Styrkarsdottir, Unnur; Liu, Ching-Ti; Uggla, Charlotta; Evans, Daniel S; Nielson, Carrie M; Walter, Klaudia; Pettersson-Kymmer, Ulrika; McCarthy, Shane; Eriksson, Joel; Kwan, Tony; Jhamai, Mila; Trajanoska, Katerina; Memari, Yasin; Min, Josine; Huang, Jie; Danecek, Petr; Wilmot, Beth; Li, Rui; Chou, Wen-Chi; Mokry, Lauren E; Moayyeri, Alireza; Claussnitzer, Melina; Cheng, Chia-Ho; Cheung, Warren; Medina-Gómez, Carolina; Ge, Bing; Chen, Shu-Huang; Choi, Kwangbom; Oei, Ling; Fraser, James; Kraaij, Robert; Hibbs, Matthew A; Gregson, Celia L; Paquette, Denis; Hofman, Albert; Wibom, Carl; Tranah, Gregory J; Marshall, Mhairi; Gardiner, Brooke B; Cremin, Katie; Auer, Paul; Hsu, Li; Ring, Sue; Tung, Joyce Y; Thorleifsson, Gudmar; Enneman, Anke W; van Schoor, Natasja M; de Groot, Lisette C.P.G.M.; van der Velde, Nathalie; Melin, Beatrice; Kemp, John P; Christiansen, Claus; Sayers, Adrian; Zhou, Yanhua; Calderari, Sophie; van Rooij, Jeroen; Carlson, Chris; Peters, Ulrike; Berlivet, Soizik; Dostie, Josée; Uitterlinden, Andre G; Williams, Stephen R.; Farber, Charles; Grinberg, Daniel; LaCroix, Andrea Z; Haessler, Jeff; Chasman, Daniel I; Giulianini, Franco; Rose, Lynda M; Ridker, Paul M; Eisman, John A; Nguyen, Tuan V; Center, Jacqueline R; Nogues, Xavier; Garcia-Giralt, Natalia; Launer, Lenore L; Gudnason, Vilmunder; Mellström, Dan; Vandenput, Liesbeth; Karlsson, Magnus K; Ljunggren, Östen; Svensson, Olle; Hallmans, Göran; Rousseau, François; Giroux, Sylvie; Bussière, Johanne; Arp, Pascal P; Koromani, Fjorda; Prince, Richard L; Lewis, Joshua R; Langdahl, Bente L; Hermann, A Pernille; Jensen, Jens-Erik B; Kaptoge, Stephen; Khaw, Kay-Tee; Reeve, Jonathan; Formosa, Melissa M; Xuereb-Anastasi, Angela; Åkesson, Kristina; McGuigan, Fiona E; Garg, Gaurav; Olmos, Jose M; Zarrabeitia, Maria T; Riancho, Jose A; Ralston, Stuart H; Alonso, Nerea; Jiang, Xi; Goltzman, David; Pastinen, Tomi; Grundberg, Elin; Gauguier, Dominique; Orwoll, Eric S; Karasik, David; Davey-Smith, George; Smith, Albert V; Siggeirsdottir, Kristin; Harris, Tamara B; Zillikens, M Carola; van Meurs, Joyce BJ; Thorsteinsdottir, Unnur; Maurano, Matthew T; Timpson, Nicholas J; Soranzo, Nicole; Durbin, Richard; Wilson, Scott G; Ntzani, Evangelia E; Brown, Matthew A; Stefansson, Kari; Hinds, David A; Spector, Tim; Cupples, L Adrienne; Ohlsson, Claes; Greenwood, Celia MT; Jackson, Rebecca D; Rowe, David W; Loomis, Cynthia A; Evans, David M; Ackert-Bicknell, Cheryl L; Joyner, Alexandra L; Duncan, Emma L; Kiel, Douglas P; Rivadeneira, Fernando; Richards, J Brent

2016-01-01

SUMMARY The extent to which low-frequency (minor allele frequency [MAF] between 1–5%) and rare (MAF ≤ 1%) variants contribute to complex traits and disease in the general population is largely unknown. Bone mineral density (BMD) is highly heritable, is a major predictor of osteoporotic fractures and has been previously associated with common genetic variants1–8, and rare, population-specific, coding variants9. Here we identify novel non-coding genetic variants with large effects on BMD (ntotal = 53,236) and fracture (ntotal = 508,253) in individuals of European ancestry from the general population. Associations for BMD were derived from whole-genome sequencing (n=2,882 from UK10K), whole-exome sequencing (n= 3,549), deep imputation of genotyped samples using a combined UK10K/1000Genomes reference panel (n=26,534), and de-novo replication genotyping (n= 20,271). We identified a low-frequency non-coding variant near a novel locus, EN1, with an effect size 4-fold larger than the mean of previously reported common variants for lumbar spine BMD8 (rs11692564[T], MAF = 1.7%, replication effect size = +0.20 standard deviations [SD], Pmeta = 2×10−14), which was also associated with a decreased risk of fracture (OR = 0.85; P = 2×10−11; ncases = 98,742 and ncontrols = 409,511). Using an En1Cre/flox mouse model, we observed that conditional loss of En1 results in low bone mass, likely as a consequence of high bone turn-over. We also identified a novel low-frequency non-coding variant with large effects on BMD near WNT16 (rs148771817[T], MAF = 1.1%, replication effect size = +0.39 SD, Pmeta = 1×10−11). In general, there was an excess of association signals arising from deleterious coding and conserved non-coding variants. These findings provide evidence that low-frequency non-coding variants have large effects on BMD and fracture, thereby providing rationale for whole-genome sequencing and improved imputation reference panels to study the genetic architecture of complex traits and disease in the general population. PMID:26367794
Identification of novel mRNAs and lncRNAs associated with mouse experimental colitis and human inflammatory bowel disease.

PubMed

Rankin, Carl Robert; Theodorou, Evangelos; Law, Ivy Ka Man; Rowe, Lorraine; Kokkotou, Efi; Pekow, Joel; Wang, Jiafang; Martin, Martin G; Pothoulakis, Charalabos; Padua, David Miguel

2018-06-28

Inflammatory bowel disease (IBD) is a complex disorder that is associated with significant morbidity. While many recent advances have been made with new diagnostic and therapeutic tools, a deeper understanding of its basic pathophysiology is needed to continue this trend towards improving treatments. By utilizing an unbiased, high-throughput transcriptomic analysis of two well-established mouse models of colitis, we set out to uncover novel coding and non-coding RNAs that are differentially expressed in the setting of colonic inflammation. RNA-seq analysis was performed using colonic tissue from two mouse models of colitis, a dextran sodium sulfate induced model and a genetic-induced model in mice lacking IL-10. We identified 81 coding RNAs that were commonly altered in both experimental models. Of these coding RNAs, 12 of the human orthologs were differentially expressed in a transcriptomic analysis of IBD patients. Interestingly, 5 of the 12 of human differentially expressed genes have not been previously identified as IBD-associated genes, including ubiquitin D. Our analysis also identified 15 non-coding RNAs that were differentially expressed in either mouse model. Surprisingly, only three non-coding RNAs were commonly dysregulated in both of these models. The discovery of these new coding and non-coding RNAs expands our transcriptional knowledge of mouse models of IBD and offers additional targets to deepen our understanding of the pathophysiology of IBD.
Functional annotation of the vlinc class of non-coding RNAs using systems biology approach.

PubMed

St Laurent, Georges; Vyatkin, Yuri; Antonets, Denis; Ri, Maxim; Qi, Yao; Saik, Olga; Shtokalo, Dmitry; de Hoon, Michiel J L; Kawaji, Hideya; Itoh, Masayoshi; Lassmann, Timo; Arner, Erik; Forrest, Alistair R R; Nicolas, Estelle; McCaffrey, Timothy A; Carninci, Piero; Hayashizaki, Yoshihide; Wahlestedt, Claes; Kapranov, Philipp

2016-04-20

Functionality of the non-coding transcripts encoded by the human genome is the coveted goal of the modern genomics research. While commonly relied on the classical methods of forward genetics, integration of different genomics datasets in a global Systems Biology fashion presents a more productive avenue of achieving this very complex aim. Here we report application of a Systems Biology-based approach to dissect functionality of a newly identified vast class of very long intergenic non-coding (vlinc) RNAs. Using highly quantitative FANTOM5 CAGE dataset, we show that these RNAs could be grouped into 1542 novel human genes based on analysis of insulators that we show here indeed function as genomic barrier elements. We show that vlinc RNAs genes likely function in cisto activate nearby genes. This effect while most pronounced in closely spaced vlinc RNA-gene pairs can be detected over relatively large genomic distances. Furthermore, we identified 101 vlinc RNA genes likely involved in early embryogenesis based on patterns of their expression and regulation. We also found another 109 such genes potentially involved in cellular functions also happening at early stages of development such as proliferation, migration and apoptosis. Overall, we show that Systems Biology-based methods have great promise for functional annotation of non-coding RNAs. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Behind the curtain of non-coding RNAs; long non-coding RNAs regulating hepatocarcinogenesis

PubMed Central

El Khodiry, Aya; Afify, Menna; El Tayebi, Hend M

2018-01-01

Hepatocellular carcinoma (HCC) is one of the most common and aggressive cancers worldwide. HCC is the fifth common malignancy in the world and the second leading cause of cancer death in Asia. Long non-coding RNAs (lncRNAs) are RNAs with a length greater than 200 nucleotides that do not encode proteins. lncRNAs can regulate gene expression and protein synthesis in several ways by interacting with DNA, RNA and proteins in a sequence specific manner. They could regulate cellular and developmental processes through either gene inhibition or gene activation. Many studies have shown that dysregulation of lncRNAs is related to many human diseases such as cardiovascular diseases, genetic disorders, neurological diseases, immune mediated disorders and cancers. However, the study of lncRNAs is challenging as they are poorly conserved between species, their expression levels aren’t as high as that of mRNAs and have great interpatient variations. The study of lncRNAs expression in cancers have been a breakthrough as it unveils potential biomarkers and drug targets for cancer therapy and helps understand the mechanism of pathogenesis. This review discusses many long non-coding RNAs and their contribution in HCC, their role in development, metastasis, and prognosis of HCC and how to regulate and target these lncRNAs as a therapeutic tool in HCC treatment in the future. PMID:29434445
Long Non-Coding RNA CASC2 Improves Diabetic Nephropathy by Inhibiting JNK Pathway.

PubMed

Yang, Huihui; Kan, Quan E; Su, Yong; Man, Hua

2018-06-11

It's known that long non-coding RNA CASC2 overexpression inhibit the JNK pathway in some disease models, while JNK pathway activation exacerbates diabetic nephropathy. Therefore we speculate that long non-coding RNA CASC2 can improve diabetic nephropathy by inhibiting JNK pathway. Thus, our study was carried out to investigate the involvement of CASC2 in diabetic nephropathy. We found that serum level of CASC2 was significantly lower in diabetic nephropathy patients than in normal people, and serum level of CASC2 showed no significant correlations with age, gender, alcohol consumption and smoking habits, but was correlated with course of disease. ROC curve analysis showed that serum level of CASC2 could be used to accurately predict diabetic nephropathy. Diabetes mellitus has many complications. This study also included a series of complications of diabetes, such as diabetic retinopathy, diabetic ketoacidosis, diabetic foot infections and diabetic cardiopathy, while serum level of CASC2 was specifically reduced in diabetic nephropathy. CASC2 expression level decreased, while JNK1 phosphorylation level increased in mouse podocyte cells treated with high glucose. CASC2 overexpression inhibited apoptosis of podocyte cells and reduced phosphorylation level of JNK1. We conclude that long non-coding RNA CASC2 may improve diabetic nephropathy by inhibiting JNK pathway. © Georg Thieme Verlag KG Stuttgart · New York.
PLncPRO for prediction of long non-coding RNAs (lncRNAs) in plants and its application for discovery of abiotic stress-responsive lncRNAs in rice and chickpea

PubMed Central

Singh, Urminder; Rajkumar, Mohan Singh; Garg, Rohini

2017-01-01

Abstract Long non-coding RNAs (lncRNAs) make up a significant portion of non-coding RNAs and are involved in a variety of biological processes. Accurate identification/annotation of lncRNAs is the primary step for gaining deeper insights into their functions. In this study, we report a novel tool, PLncPRO, for prediction of lncRNAs in plants using transcriptome data. PLncPRO is based on machine learning and uses random forest algorithm to classify coding and long non-coding transcripts. PLncPRO has better prediction accuracy as compared to other existing tools and is particularly well-suited for plants. We developed consensus models for dicots and monocots to facilitate prediction of lncRNAs in non-model/orphan plants. The performance of PLncPRO was quite better with vertebrate transcriptome data as well. Using PLncPRO, we discovered 3714 and 3457 high-confidence lncRNAs in rice and chickpea, respectively, under drought or salinity stress conditions. We investigated different characteristics and differential expression under drought/salinity stress conditions, and validated lncRNAs via RT-qPCR. Overall, we developed a new tool for the prediction of lncRNAs in plants and showed its utility via identification of lncRNAs in rice and chickpea. PMID:29036354
A global transcriptional analysis of Plasmodium falciparum malaria reveals a novel family of telomere-associated lncRNAs

PubMed Central

2011-01-01

Background Mounting evidence suggests a major role for epigenetic feedback in Plasmodium falciparum transcriptional regulation. Long non-coding RNAs (lncRNAs) have recently emerged as a new paradigm in epigenetic remodeling. We therefore set out to investigate putative roles for lncRNAs in P. falciparum transcriptional regulation. Results We used a high-resolution DNA tiling microarray to survey transcriptional activity across 22.6% of the P. falciparum strain 3D7 genome. We identified 872 protein-coding genes and 60 putative P. falciparum lncRNAs under developmental regulation during the parasite's pathogenic human blood stage. Further characterization of lncRNA candidates led to the discovery of an intriguing family of lncRNA telomere-associated repetitive element transcripts, termed lncRNA-TARE. We have quantified lncRNA-TARE expression at 15 distinct chromosome ends and mapped putative transcriptional start and termination sites of lncRNA-TARE loci. Remarkably, we observed coordinated and stage-specific expression of lncRNA-TARE on all chromosome ends tested, and two dominant transcripts of approximately 1.5 kb and 3.1 kb transcribed towards the telomere. Conclusions We have characterized a family of 22 telomere-associated lncRNAs in P. falciparum. Homologous lncRNA-TARE loci are coordinately expressed after parasite DNA replication, and are poised to play an important role in P. falciparum telomere maintenance, virulence gene regulation, and potentially other processes of parasite chromosome end biology. Further study of lncRNA-TARE and other promising lncRNA candidates may provide mechanistic insight into P. falciparum transcriptional regulation. PMID:21689454
MicroRNA: Biogenesis, Function and Role in Cancer

PubMed Central

MacFarlane, Leigh-Ann; Murphy, Paul R.

2010-01-01

MicroRNAs are small, highly conserved non-coding RNA molecules involved in the regulation of gene expression. MicroRNAs are transcribed by RNA polymerases II and III, generating precursors that undergo a series of cleavage events to form mature microRNA. The conventional biogenesis pathway consists of two cleavage events, one nuclear and one cytoplasmic. However, alternative biogenesis pathways exist that differ in the number of cleavage events and enzymes responsible. How microRNA precursors are sorted to the different pathways is unclear but appears to be determined by the site of origin of the microRNA, its sequence and thermodynamic stability. The regulatory functions of microRNAs are accomplished through the RNA-induced silencing complex (RISC). MicroRNA assembles into RISC, activating the complex to target messenger RNA (mRNA) specified by the microRNA. Various RISC assembly models have been proposed and research continues to explore the mechanism(s) of RISC loading and activation. The degree and nature of the complementarity between the microRNA and target determine the gene silencing mechanism, slicer-dependent mRNA degradation or slicer-independent translation inhibition. Recent evidence indicates that P-bodies are essential for microRNA-mediated gene silencing and that RISC assembly and silencing occurs primarily within P-bodies. The P-body model outlines microRNA sorting and shuttling between specialized P-body compartments that house enzymes required for slicer –dependent and –independent silencing, addressing the reversibility of these silencing mechanisms. Detailed knowledge of the microRNA pathways is essential for understanding their physiological role and the implications associated with dysfunction and dysregulation. PMID:21532838
Deciphering the Regulatory Logic of an Ancient, Ultraconserved Nuclear Receptor Enhancer Module

PubMed Central

Bagamasbad, Pia D.; Bonett, Ronald M.; Sachs, Laurent; Buisine, Nicolas; Raj, Samhitha; Knoedler, Joseph R.; Kyono, Yasuhiro; Ruan, Yijun; Ruan, Xiaoan

2015-01-01

Cooperative, synergistic gene regulation by nuclear hormone receptors can increase sensitivity and amplify cellular responses to hormones. We investigated thyroid hormone (TH) and glucocorticoid (GC) synergy on the Krüppel-like factor 9 (Klf9) gene, which codes for a zinc finger transcription factor involved in development and homeostasis of diverse tissues. We identified regions of the Xenopus and mouse Klf9 genes 5–6 kb upstream of the transcription start sites that supported synergistic transactivation by TH plus GC. Within these regions, we found an orthologous sequence of approximately 180 bp that is highly conserved among tetrapods, but absent in other chordates, and possesses chromatin marks characteristic of an enhancer element. The Xenopus and mouse approximately 180-bp DNA element conferred synergistic transactivation by hormones in transient transfection assays, so we designate this the Klf9 synergy module (KSM). We identified binding sites within the mouse KSM for TH receptor, GC receptor, and nuclear factor κB. TH strongly increased recruitment of liganded GC receptor and serine 5 phosphorylated (initiating) RNA polymerase II to chromatin at the KSM, suggesting a mechanism for transcriptional synergy. The KSM is transcribed to generate long noncoding RNAs, which are also synergistically induced by combined hormone treatment, and the KSM interacts with the Klf9 promoter and a far upstream region through chromosomal looping. Our findings support that the KSM plays a central role in hormone regulation of vertebrate Klf9 genes, it evolved in the tetrapod lineage, and has been maintained by strong stabilizing selection. PMID:25866873

High-throughput RNA sequencing reveals structural differences of orthologous brain-expressed genes between western lowland gorillas and humans.

PubMed

Lipovich, Leonard; Hou, Zhuo-Cheng; Jia, Hui; Sinkler, Christopher; McGowen, Michael; Sterner, Kirstin N; Weckle, Amy; Sugalski, Amara B; Pipes, Lenore; Gatti, Domenico L; Mason, Christopher E; Sherwood, Chet C; Hof, Patrick R; Kuzawa, Christopher W; Grossman, Lawrence I; Goodman, Morris; Wildman, Derek E

2016-02-01

The human brain and human cognitive abilities are strikingly different from those of other great apes despite relatively modest genome sequence divergence. However, little is presently known about the interspecies divergence in gene structure and transcription that might contribute to these phenotypic differences. To date, most comparative studies of gene structure in the brain have examined humans, chimpanzees, and macaque monkeys. To add to this body of knowledge, we analyze here the brain transcriptome of the western lowland gorilla (Gorilla gorilla gorilla), an African great ape species that is phylogenetically closely related to humans, but with a brain that is approximately one-third the size. Manual transcriptome curation from a sample of the planum temporale region of the neocortex revealed 12 protein-coding genes and one noncoding-RNA gene with exons in the gorilla unmatched by public transcriptome data from the orthologous human loci. These interspecies gene structure differences accounted for a total of 134 amino acids in proteins found in the gorilla that were absent from protein products of the orthologous human genes. Proteins varying in structure between human and gorilla were involved in immunity and energy metabolism, suggesting their relevance to phenotypic differences. This gorilla neocortical transcriptome comprises an empirical, not homology- or prediction-driven, resource for orthologous gene comparisons between human and gorilla. These findings provide a unique repository of the sequences and structures of thousands of genes transcribed in the gorilla brain, pointing to candidate genes that may contribute to the traits distinguishing humans from other closely related great apes. © 2015 Wiley Periodicals, Inc.
Statistical properties of DNA sequences

NASA Technical Reports Server (NTRS)

Peng, C. K.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Mantegna, R. N.; Simons, M.; Stanley, H. E.

1995-01-01

We review evidence supporting the idea that the DNA sequence in genes containing non-coding regions is correlated, and that the correlation is remarkably long range--indeed, nucleotides thousands of base pairs distant are correlated. We do not find such a long-range correlation in the coding regions of the gene. We resolve the problem of the "non-stationarity" feature of the sequence of base pairs by applying a new algorithm called detrended fluctuation analysis (DFA). We address the claim of Voss that there is no difference in the statistical properties of coding and non-coding regions of DNA by systematically applying the DFA algorithm, as well as standard FFT analysis, to every DNA sequence (33301 coding and 29453 non-coding) in the entire GenBank database. Finally, we describe briefly some recent work showing that the non-coding sequences have certain statistical features in common with natural and artificial languages. Specifically, we adapt to DNA the Zipf approach to analyzing linguistic texts. These statistical properties of non-coding sequences support the possibility that non-coding regions of DNA may carry biological information.
The expanding universe of noncoding RNAs.

PubMed

Hannon, G J; Rivas, F V; Murchison, E P; Steitz, J A

2006-01-01

The 71st Cold Spring Harbor Symposium on Quantitative Biology celebrated the numerous and expanding roles of regulatory RNAs in systems ranging from bacteria to mammals. It was clearly evident that noncoding RNAs are undergoing a renaissance, with reports of their involvement in nearly every cellular process. Previously known classes of longer noncoding RNAs were shown to function by every possible means-acting catalytically, sensing physiological states through adoption of complex secondary and tertiary structures, or using their primary sequences for recognition of target sites. The many recently discovered classes of small noncoding RNAs, generally less than 35 nucleotides in length, most often exert their effects by guiding regulatory complexes to targets via base-pairing. With the ability to analyze the RNA products of the genome in ever greater depth, it has become clear that the universe of noncoding RNAs may extend far beyond the boundaries we had previously imagined. Thus, as much as the Symposium highlighted exciting progress in the field, it also revealed how much farther we must go to understand fully the biological impact of noncoding RNAs.
The majority of total nuclear-encoded non-ribosomal RNA in a human cell is 'dark matter' un-annotated RNA.

PubMed

Kapranov, Philipp; St Laurent, Georges; Raz, Tal; Ozsolak, Fatih; Reynolds, C Patrick; Sorensen, Poul H B; Reaman, Gregory; Milos, Patrice; Arceci, Robert J; Thompson, John F; Triche, Timothy J

2010-12-21

Discovery that the transcriptional output of the human genome is far more complex than predicted by the current set of protein-coding annotations and that most RNAs produced do not appear to encode proteins has transformed our understanding of genome complexity and suggests new paradigms of genome regulation. However, the fraction of all cellular RNA whose function we do not understand and the fraction of the genome that is utilized to produce that RNA remain controversial. This is not simply a bookkeeping issue because the degree to which this un-annotated transcription is present has important implications with respect to its biologic function and to the general architecture of genome regulation. For example, efforts to elucidate how non-coding RNAs (ncRNAs) regulate genome function will be compromised if that class of RNAs is dismissed as simply 'transcriptional noise'. We show that the relative mass of RNA whose function and/or structure we do not understand (the so called 'dark matter' RNAs), as a proportion of all non-ribosomal, non-mitochondrial human RNA (mt-RNA), can be greater than that of protein-encoding transcripts. This observation is obscured in studies that focus only on polyA-selected RNA, a method that enriches for protein coding RNAs and at the same time discards the vast majority of RNA prior to analysis. We further show the presence of a large number of very long, abundantly-transcribed regions (100's of kb) in intergenic space and further show that expression of these regions is associated with neoplastic transformation. These overlap some regions found previously in normal human embryonic tissues and raises an interesting hypothesis as to the function of these ncRNAs in both early development and neoplastic transformation. We conclude that 'dark matter' RNA can constitute the majority of non-ribosomal, non-mitochondrial-RNA and a significant fraction arises from numerous very long, intergenic transcribed regions that could be involved in neoplastic transformation.
Dnmt2 mediates intergenerational transmission of paternally acquired metabolic disorders through sperm small non-coding RNAs.

PubMed

Zhang, Yunfang; Zhang, Xudong; Shi, Junchao; Tuorto, Francesca; Li, Xin; Liu, Yusheng; Liebers, Reinhard; Zhang, Liwen; Qu, Yongcun; Qian, Jingjing; Pahima, Maya; Liu, Ying; Yan, Menghong; Cao, Zhonghong; Lei, Xiaohua; Cao, Yujing; Peng, Hongying; Liu, Shichao; Wang, Yue; Zheng, Huili; Woolsey, Rebekah; Quilici, David; Zhai, Qiwei; Li, Lei; Zhou, Tong; Yan, Wei; Lyko, Frank; Zhang, Ying; Zhou, Qi; Duan, Enkui; Chen, Qi

2018-05-01

The discovery of RNAs (for example, messenger RNAs, non-coding RNAs) in sperm has opened the possibility that sperm may function by delivering additional paternal information aside from solely providing the DNA 1 . Increasing evidence now suggests that sperm small non-coding RNAs (sncRNAs) can mediate intergenerational transmission of paternally acquired phenotypes, including mental stress 2,3 and metabolic disorders 4-6 . How sperm sncRNAs encode paternal information remains unclear, but the mechanism may involve RNA modifications. Here we show that deletion of a mouse tRNA methyltransferase, DNMT2, abolished sperm sncRNA-mediated transmission of high-fat-diet-induced metabolic disorders to offspring. Dnmt2 deletion prevented the elevation of RNA modifications (m 5 C, m 2 G) in sperm 30-40 nt RNA fractions that are induced by a high-fat diet. Also, Dnmt2 deletion altered the sperm small RNA expression profile, including levels of tRNA-derived small RNAs and rRNA-derived small RNAs, which might be essential in composing a sperm RNA 'coding signature' that is needed for paternal epigenetic memory. Finally, we show that Dnmt2-mediated m 5 C contributes to the secondary structure and biological properties of sncRNAs, implicating sperm RNA modifications as an additional layer of paternal hereditary information.
Transcription Factor Binding Profiles Reveal Cyclic Expression of Human Protein-coding Genes and Non-coding RNAs

PubMed Central

Cheng, Chao; Ung, Matthew; Grant, Gavin D.; Whitfield, Michael L.

2013-01-01

Cell cycle is a complex and highly supervised process that must proceed with regulatory precision to achieve successful cellular division. Despite the wide application, microarray time course experiments have several limitations in identifying cell cycle genes. We thus propose a computational model to predict human cell cycle genes based on transcription factor (TF) binding and regulatory motif information in their promoters. We utilize ENCODE ChIP-seq data and motif information as predictors to discriminate cell cycle against non-cell cycle genes. Our results show that both the trans- TF features and the cis- motif features are predictive of cell cycle genes, and a combination of the two types of features can further improve prediction accuracy. We apply our model to a complete list of GENCODE promoters to predict novel cell cycle driving promoters for both protein-coding genes and non-coding RNAs such as lincRNAs. We find that a similar percentage of lincRNAs are cell cycle regulated as protein-coding genes, suggesting the importance of non-coding RNAs in cell cycle division. The model we propose here provides not only a practical tool for identifying novel cell cycle genes with high accuracy, but also new insights on cell cycle regulation by TFs and cis-regulatory elements. PMID:23874175
Regulatory elements of Caenorhabditis elegans ribosomal protein genes

PubMed Central

2012-01-01

Background Ribosomal protein genes (RPGs) are essential, tightly regulated, and highly expressed during embryonic development and cell growth. Even though their protein sequences are strongly conserved, their mechanism of regulation is not conserved across yeast, Drosophila, and vertebrates. A recent investigation of genomic sequences conserved across both nematode species and associated with different gene groups indicated the existence of several elements in the upstream regions of C. elegans RPGs, providing a new insight regarding the regulation of these genes in C. elegans. Results In this study, we performed an in-depth examination of C. elegans RPG regulation and found nine highly conserved motifs in the upstream regions of C. elegans RPGs using the motif discovery algorithm DME. Four motifs were partially similar to transcription factor binding sites from C. elegans, Drosophila, yeast, and human. One pair of these motifs was found to co-occur in the upstream regions of 250 transcripts including 22 RPGs. The distance between the two motifs displayed a complex frequency pattern that was related to their relative orientation. We tested the impact of three of these motifs on the expression of rpl-2 using a series of reporter gene constructs and showed that all three motifs are necessary to maintain the high natural expression level of this gene. One of the motifs was similar to the binding site of an orthologue of POP-1, and we showed that RNAi knockdown of pop-1 impacts the expression of rpl-2. We further determined the transcription start site of rpl-2 by 5’ RACE and found that the motifs lie 40–90 bases upstream of the start site. We also found evidence that a noncoding RNA, contained within the outron of rpl-2, is co-transcribed with rpl-2 and cleaved during trans-splicing. Conclusions Our results indicate that C. elegans RPGs are regulated by a complex novel series of regulatory elements that is evolutionarily distinct from those of all other species examined up until now. PMID:22928635
The protective function of noncoding DNA in genome defense of eukaryotic male germ cells.

PubMed

Qiu, Guo-Hua; Huang, Cuiqin; Zheng, Xintian; Yang, Xiaoyan

2018-04-01

Peripheral and abundant noncoding DNA has been hypothesized to protect the genome and the central protein-coding sequences against DNA damage in somatic genome. In the cytosol, invading exogenous nucleic acids may first be deactivated by small RNAs encoded by noncoding DNA via mechanisms similar to the prokaryotic CRISPR-Cas system. In the nucleus, the radicals generated by radiation in the cytosol, radiation energy and invading exogenous nucleic acids are absorbed, blocked and/or reduced by peripheral heterochromatin, and damaged DNA in heterochromatin is removed and excluded from the nucleus to the cytoplasm through nuclear pore complexes. To further strengthen the hypothesis, this review summarizes the experimental evidence supporting the protective function of noncoding DNA in the genome of male germ cells. Based on these data, this review provides evidence supporting the protective role of noncoding DNA in the genome defense of sperm genome through similar mechanisms to those of the somatic genome.
Expression of Antisense Long Noncoding RNAs as Potential Regulators in Rainbow Trout with Different Tolerance to Plant-Based Diets.

PubMed

Abernathy, Jason; Overturf, Ken

2018-01-04

Reformulation of aquafeeds in salmonid diets to include more plant proteins is critical for sustainable aquaculture. However, increasing plant proteins can lead to stunted growth and enteritis. Toward an understanding of the regulatory mechanisms behind plant protein utilization, directional RNA sequencing of liver tissues from a rainbow trout strain selected for growth on an all plant-protein diet and a control strain, both fed a plant diet for 12 weeks, were utilized to construct long noncoding RNAs. Antisense long noncoding RNAs were selected for differential expression and functional analyses since they have been shown to have regulatory actions within a genome. A total of 142 unique antisense long noncoding RNAs were differentially expressed between strains, 60 of which could be mapped to a gene. Genes underlying these noncoding RNAs are indicated in lipid metabolism and immunity. Six noncoding transcripts were also found to overlap with differentially expressed protein-coding genes, all of which were co-expressed. Associating variation in regulatory elements between rainbow trout strains with differing tolerance to plant-protein diets will assist in future studies toward increased gains throughout carnivorous aquaculture.
Interplay between cardiac transcription factors and non-coding RNAs in predisposing to atrial fibrillation.

PubMed

Mikhailov, Alexander T; Torrado, Mario

2018-05-12

There is growing evidence that putative gene regulatory networks including cardio-enriched transcription factors, such as PITX2, TBX5, ZFHX3, and SHOX2, and their effector/target genes along with downstream non-coding RNAs can play a potentially important role in the process of adaptive and maladaptive atrial rhythm remodeling. In turn, expression of atrial fibrillation-associated transcription factors is under the control of upstream regulatory non-coding RNAs. This review broadly explores gene regulatory mechanisms associated with susceptibility to atrial fibrillation-with key examples from both animal models and patients-within the context of both cardiac transcription factors and non-coding RNAs. These two systems appear to have multiple levels of cross-regulation and act coordinately to achieve effective control of atrial rhythm effector gene expression. Perturbations of a dynamic expression balance between transcription factors and corresponding non-coding RNAs can provoke the development or promote the progression of atrial fibrillation. We also outline deficiencies in current models and discuss ongoing studies to clarify remaining mechanistic questions. An understanding of the function of transcription factors and non-coding RNAs in gene regulatory networks associated with atrial fibrillation risk will enable the development of innovative therapeutic strategies.
Unexpected high intragenomic variation in two of three major pest thrips species does not affect ribosomal internal transcribed spacer 2 (ITS2)utility for thrips identification

USDA-ARS?s Scientific Manuscript database

The mitochondrial gene mtCO1 and the internal transcribed spacer (ITS) region of the ribosomal DNA are among the most widely used molecular markers for insect taxonomic characterization. Three economically important species of thrips, Scirtothrips dorsalis, Thrips palmi, and Frankliniella occidental...
mRNA changes in nucleus accumbens related to methamphetamine addiction in mice

NASA Astrophysics Data System (ADS)

Zhu, Li; Li, Jiaqi; Dong, Nan; Guan, Fanglin; Liu, Yufeng; Ma, Dongliang; Goh, Eyleen L. K.; Chen, Teng

2016-11-01

Methamphetamine (METH) is a highly addictive psychostimulant that elicits aberrant changes in the expression of microRNAs (miRNAs) and long non-coding RNAs (lncRNAs) in the nucleus accumbens of mice, indicating a potential role of METH in post-transcriptional regulations. To decipher the potential consequences of these post-transcriptional regulations in response to METH, we performed strand-specific RNA sequencing (ssRNA-Seq) to identify alterations in mRNA expression and their alternative splicing in the nucleus accumbens of mice following exposure to METH. METH-mediated changes in mRNAs were analyzed and correlated with previously reported changes in non-coding RNAs (miRNAs and lncRNAs) to determine the potential functions of these mRNA changes observed here and how non-coding RNAs are involved. A total of 2171 mRNAs were differentially expressed in response to METH with functions involved in synaptic plasticity, mitochondrial energy metabolism and immune response. 309 and 589 of these mRNAs are potential targets of miRNAs and lncRNAs respectively. In addition, METH treatment decreases mRNA alternative splicing, and there are 818 METH-specific events not observed in saline-treated mice. Our results suggest that METH-mediated addiction could be attributed by changes in miRNAs and lncRNAs and consequently, changes in mRNA alternative splicing and expression. In conclusion, our study reported a methamphetamine-modified nucleus accumbens transcriptome and provided non-coding RNA-mRNA interaction networks possibly involved in METH addiction.
nRC: non-coding RNA Classifier based on structural features.

PubMed

Fiannaca, Antonino; La Rosa, Massimo; La Paglia, Laura; Rizzo, Riccardo; Urso, Alfonso

2017-01-01

Non-coding RNA (ncRNA) are small non-coding sequences involved in gene expression regulation of many biological processes and diseases. The recent discovery of a large set of different ncRNAs with biologically relevant roles has opened the way to develop methods able to discriminate between the different ncRNA classes. Moreover, the lack of knowledge about the complete mechanisms in regulative processes, together with the development of high-throughput technologies, has required the help of bioinformatics tools in addressing biologists and clinicians with a deeper comprehension of the functional roles of ncRNAs. In this work, we introduce a new ncRNA classification tool, nRC (non-coding RNA Classifier). Our approach is based on features extraction from the ncRNA secondary structure together with a supervised classification algorithm implementing a deep learning architecture based on convolutional neural networks. We tested our approach for the classification of 13 different ncRNA classes. We obtained classification scores, using the most common statistical measures. In particular, we reach an accuracy and sensitivity score of about 74%. The proposed method outperforms other similar classification methods based on secondary structure features and machine learning algorithms, including the RNAcon tool that, to date, is the reference classifier. nRC tool is freely available as a docker image at https://hub.docker.com/r/tblab/nrc/. The source code of nRC tool is also available at https://github.com/IcarPA-TBlab/nrc.
Quantification of non-coding RNA target localization diversity and its application in cancers.

PubMed

Cheng, Lixin; Leung, Kwong-Sak

2018-04-01

Subcellular localization is pivotal for RNAs and proteins to implement biological functions. The localization diversity of protein interactions has been studied as a crucial feature of proteins, considering that the protein-protein interactions take place in various subcellular locations. Nevertheless, the localization diversity of non-coding RNA (ncRNA) target proteins has not been systematically studied, especially its characteristics in cancers. In this study, we provide a new algorithm, non-coding RNA target localization coefficient (ncTALENT), to quantify the target localization diversity of ncRNAs based on the ncRNA-protein interaction and protein subcellular localization data. ncTALENT can be used to calculate the target localization coefficient of ncRNAs and measure how diversely their targets are distributed among the subcellular locations in various scenarios. We focus our study on long non-coding RNAs (lncRNAs), and our observations reveal that the target localization diversity is a primary characteristic of lncRNAs in different biotypes. Moreover, we found that lncRNAs in multiple cancers, differentially expressed cancer lncRNAs, and lncRNAs with multiple cancer target proteins are prone to have high target localization diversity. Furthermore, the analysis of gastric cancer helps us to obtain a better understanding that the target localization diversity of lncRNAs is an important feature closely related to clinical prognosis. Overall, we systematically studied the target localization diversity of the lncRNAs and uncovered its association with cancer.
ChIPBase: a database for decoding the transcriptional regulation of long non-coding RNA and microRNA genes from ChIP-Seq data.

PubMed

Yang, Jian-Hua; Li, Jun-Hao; Jiang, Shan; Zhou, Hui; Qu, Liang-Hu

2013-01-01

Long non-coding RNAs (lncRNAs) and microRNAs (miRNAs) represent two classes of important non-coding RNAs in eukaryotes. Although these non-coding RNAs have been implicated in organismal development and in various human diseases, surprisingly little is known about their transcriptional regulation. Recent advances in chromatin immunoprecipitation with next-generation DNA sequencing (ChIP-Seq) have provided methods of detecting transcription factor binding sites (TFBSs) with unprecedented sensitivity. In this study, we describe ChIPBase (http://deepbase.sysu.edu.cn/chipbase/), a novel database that we have developed to facilitate the comprehensive annotation and discovery of transcription factor binding maps and transcriptional regulatory relationships of lncRNAs and miRNAs from ChIP-Seq data. The current release of ChIPBase includes high-throughput sequencing data that were generated by 543 ChIP-Seq experiments in diverse tissues and cell lines from six organisms. By analysing millions of TFBSs, we identified tens of thousands of TF-lncRNA and TF-miRNA regulatory relationships. Furthermore, two web-based servers were developed to annotate and discover transcriptional regulatory relationships of lncRNAs and miRNAs from ChIP-Seq data. In addition, we developed two genome browsers, deepView and genomeView, to provide integrated views of multidimensional data. Moreover, our web implementation supports diverse query types and the exploration of TFs, lncRNAs, miRNAs, gene ontologies and pathways.
Long non-coding RNA-mediated regulation of signaling pathways in gastric cancer.

PubMed

Zong, Wei; Ju, Shaoqing; Jing, Rongrong; Cui, Ming

2018-05-28

Gastric cancer (GC) is one of the most common cancers globally. Because of the high frequency of tumor recurrence, or metastasis, after surgical resection, the prognosis of patients with GC is poor. Therefore, exploring the mechanisms underlying GC is of great importance. Recently, accumulating evidence has begun to show that dysregulated long non-coding RNAs (lncRNAs) participate in the progression of GC via several typical signaling pathways, such as the AKT and MAPK signaling pathways. Moreover, the interactions between lncRNAs and microRNAs appear to represent a novel mechanism in the pathogenesis of GC. This review provides a synopsis of the latest research relating to lncRNAs and associated signaling pathways in GC.
Increasing the Yield in Targeted Next-Generation Sequencing by Implicating CNV Analysis, Non-Coding Exons and the Overall Variant Load: The Example of Retinal Dystrophies

PubMed Central

Eisenberger, Tobias; Neuhaus, Christine; Khan, Arif O.; Decker, Christian; Preising, Markus N.; Friedburg, Christoph; Bieg, Anika; Gliem, Martin; Issa, Peter Charbel; Holz, Frank G.; Baig, Shahid M.; Hellenbroich, Yorck; Galvez, Alberto; Platzer, Konrad; Wollnik, Bernd; Laddach, Nadja; Ghaffari, Saeed Reza; Rafati, Maryam; Botzenhart, Elke; Tinschert, Sigrid; Börger, Doris; Bohring, Axel; Schreml, Julia; Körtge-Jung, Stefani; Schell-Apacik, Chayim; Bakur, Khadijah; Al-Aama, Jumana Y.; Neuhann, Teresa; Herkenrath, Peter; Nürnberg, Gudrun; Nürnberg, Peter; Davis, John S.; Gal, Andreas; Bergmann, Carsten; Lorenz, Birgit; Bolz, Hanno J.

2013-01-01

Retinitis pigmentosa (RP) and Leber congenital amaurosis (LCA) are major causes of blindness. They result from mutations in many genes which has long hampered comprehensive genetic analysis. Recently, targeted next-generation sequencing (NGS) has proven useful to overcome this limitation. To uncover “hidden mutations” such as copy number variations (CNVs) and mutations in non-coding regions, we extended the use of NGS data by quantitative readout for the exons of 55 RP and LCA genes in 126 patients, and by including non-coding 5′ exons. We detected several causative CNVs which were key to the diagnosis in hitherto unsolved constellations, e.g. hemizygous point mutations in consanguineous families, and CNVs complemented apparently monoallelic recessive alleles. Mutations of non-coding exon 1 of EYS revealed its contribution to disease. In view of the high carrier frequency for retinal disease gene mutations in the general population, we considered the overall variant load in each patient to assess if a mutation was causative or reflected accidental carriership in patients with mutations in several genes or with single recessive alleles. For example, truncating mutations in RP1, a gene implicated in both recessive and dominant RP, were causative in biallelic constellations, unrelated to disease when heterozygous on a biallelic mutation background of another gene, or even non-pathogenic if close to the C-terminus. Patients with mutations in several loci were common, but without evidence for di- or oligogenic inheritance. Although the number of targeted genes was low compared to previous studies, the mutation detection rate was highest (70%) which likely results from completeness and depth of coverage, and quantitative data analysis. CNV analysis should routinely be applied in targeted NGS, and mutations in non-coding exons give reason to systematically include 5′-UTRs in disease gene or exome panels. Consideration of all variants is indispensable because even truncating mutations may be misleading. PMID:24265693
Increasing the yield in targeted next-generation sequencing by implicating CNV analysis, non-coding exons and the overall variant load: the example of retinal dystrophies.

PubMed

Eisenberger, Tobias; Neuhaus, Christine; Khan, Arif O; Decker, Christian; Preising, Markus N; Friedburg, Christoph; Bieg, Anika; Gliem, Martin; Charbel Issa, Peter; Holz, Frank G; Baig, Shahid M; Hellenbroich, Yorck; Galvez, Alberto; Platzer, Konrad; Wollnik, Bernd; Laddach, Nadja; Ghaffari, Saeed Reza; Rafati, Maryam; Botzenhart, Elke; Tinschert, Sigrid; Börger, Doris; Bohring, Axel; Schreml, Julia; Körtge-Jung, Stefani; Schell-Apacik, Chayim; Bakur, Khadijah; Al-Aama, Jumana Y; Neuhann, Teresa; Herkenrath, Peter; Nürnberg, Gudrun; Nürnberg, Peter; Davis, John S; Gal, Andreas; Bergmann, Carsten; Lorenz, Birgit; Bolz, Hanno J

2013-01-01

Retinitis pigmentosa (RP) and Leber congenital amaurosis (LCA) are major causes of blindness. They result from mutations in many genes which has long hampered comprehensive genetic analysis. Recently, targeted next-generation sequencing (NGS) has proven useful to overcome this limitation. To uncover "hidden mutations" such as copy number variations (CNVs) and mutations in non-coding regions, we extended the use of NGS data by quantitative readout for the exons of 55 RP and LCA genes in 126 patients, and by including non-coding 5' exons. We detected several causative CNVs which were key to the diagnosis in hitherto unsolved constellations, e.g. hemizygous point mutations in consanguineous families, and CNVs complemented apparently monoallelic recessive alleles. Mutations of non-coding exon 1 of EYS revealed its contribution to disease. In view of the high carrier frequency for retinal disease gene mutations in the general population, we considered the overall variant load in each patient to assess if a mutation was causative or reflected accidental carriership in patients with mutations in several genes or with single recessive alleles. For example, truncating mutations in RP1, a gene implicated in both recessive and dominant RP, were causative in biallelic constellations, unrelated to disease when heterozygous on a biallelic mutation background of another gene, or even non-pathogenic if close to the C-terminus. Patients with mutations in several loci were common, but without evidence for di- or oligogenic inheritance. Although the number of targeted genes was low compared to previous studies, the mutation detection rate was highest (70%) which likely results from completeness and depth of coverage, and quantitative data analysis. CNV analysis should routinely be applied in targeted NGS, and mutations in non-coding exons give reason to systematically include 5'-UTRs in disease gene or exome panels. Consideration of all variants is indispensable because even truncating mutations may be misleading.
Diabetes Mellitus-Induced Long Noncoding RNA Dnm3os Regulates Macrophage Functions and Inflammation via Nuclear Mechanisms.

PubMed

Das, Sadhan; Reddy, Marpadga A; Senapati, Parijat; Stapleton, Kenneth; Lanting, Linda; Wang, Mei; Amaram, Vishnu; Ganguly, Rituparna; Zhang, Lingxiao; Devaraj, Sridevi; Schones, Dustin E; Natarajan, Rama

2018-06-21

Macrophages play key roles in inflammation and diabetic vascular complications. Emerging evidence implicates long noncoding RNAs in inflammation, but their role in macrophage dysfunction associated with inflammatory diabetic complications is unclear and was therefore investigated in this study. RNA-sequencing and real-time quantitative PCR demonstrated that a long noncoding RNA Dnm3os (dynamin 3 opposite strand) is upregulated in bone marrow-derived macrophages from type 2 diabetic db/db mice, diet-induced insulin-resistant mice, and diabetic ApoE -/ - mice, as well as in monocytes from type 2 diabetic patients relative to controls. Diabetic conditions (high glucose and palmitic acid) induced Dnm3os in mouse and human macrophages. Promoter reporter analysis and chromatin immunoprecipitation assays demonstrated that diabetic conditions induce Dnm3os via NF-κB activation. RNA fluorescence in situ hybridization and real-time quantitative PCRs of subcellular fractions demonstrated nuclear localization and chromatin enrichment of Dnm3os in macrophages. Stable overexpression of Dnm3os in macrophages altered global histone modifications and upregulated inflammation and immune response genes and phagocytosis. Conversely, RNAi-mediated knockdown of Dnm3os attenuated these responses. RNA pull-down assays with macrophage nuclear lysates identified nucleolin and ILF-2 (interleukin enhancer-binding factor 2) as protein binding partners of Dnm3os , which was further confirmed by RNA immunoprecipitation and RNA fluorescence in situ hybridization immunofluorescence. Furthermore, nucleolin levels were decreased in diabetic conditions, and its knockdown enhanced Dnm3os -induced inflammatory gene expression and histone H3K9-acetylation at their promoters. These results demonstrate novel mechanisms involving upregulation of long noncoding RNA Dnm3os , disruption of its interaction with nucleolin, and epigenetic modifications at target genes that promote macrophage inflammatory phenotype in diabetes mellitus. The data could lead to long noncoding RNA-based therapies for inflammatory diabetes mellitus complications. © 2018 American Heart Association, Inc.
Gene regulation by noncoding RNAs

PubMed Central

Patil, Veena S.; Zhou, Rui; Rana, Tariq M.

2015-01-01

The past two decades have seen an explosion in research on noncoding RNAs and their physiological and pathological functions. Several classes of small (20–30 nucleotides) and long (>200 nucleotides) noncoding RNAs have been firmly established as key regulators of gene expression in myriad processes ranging from embryonic development to innate immunity. In this review, we focus on our current understanding of the molecular mechanisms underlying the biogenesis and function of small interfering RNAs (siRNAs), microRNAs (miRNAs), and Piwi-interacting RNAs (piRNAs). In addition, we briefly review the relevance of small and long noncoding RNAs to human physiology and pathology and their potential to be exploited as therapeutic agents. PMID:24164576

Integrating non-coding RNAs in JAK-STAT regulatory networks

PubMed Central

Witte, Steven; Muljo, Stefan A

2014-01-01

Being a well-characterized pathway, JAK-STAT signaling serves as a valuable paradigm for studying the architecture of gene regulatory networks. The discovery of untranslated or non-coding RNAs, namely microRNAs and long non-coding RNAs, provides an opportunity to elucidate their roles in such networks. In principle, these regulatory RNAs can act as downstream effectors of the JAK-STAT pathway and/or affect signaling by regulating the expression of JAK-STAT components. Examples of interactions between signaling pathways and non-coding RNAs have already emerged in basic cell biology and human diseases such as cancer, and can potentially guide the identification of novel biomarkers or drug targets for medicine. PMID:24778925
An integrative approach to predicting the functional effects of small indels in non-coding regions of the human genome

PubMed Central

Ferlaino, Michael; Rogers, Mark F.; Shihab, Hashem A.; Mort, Matthew; Cooper, David N.; Gaunt, Tom R.; Campbell, Colin

2018-01-01

Background Small insertions and deletions (indels) have a significant influence in human disease and, in terms of frequency, they are second only to single nucleotide variants as pathogenic mutations. As the majority of mutations associated with complex traits are located outside the exome, it is crucial to investigate the potential pathogenic impact of indels in non-coding regions of the human genome. Results We present FATHMM-indel, an integrative approach to predict the functional effect, pathogenic or neutral, of indels in non-coding regions of the human genome. Our method exploits various genomic annotations in addition to sequence data. When validated on benchmark data, FATHMM-indel significantly outperforms CADD and GAVIN, state of the art models in assessing the pathogenic impact of non-coding variants. FATHMM-indel is available via a web server at indels.biocompute.org.uk. Conclusions FATHMM-indel can accurately predict the functional impact and prioritise small indels throughout the whole non-coding genome. PMID:28985712
DNA rearrangements directed by non-coding RNAs in ciliates

PubMed Central

Mochizuki, Kazufumi

2013-01-01

Extensive programmed rearrangement of DNA, including DNA elimination, chromosome fragmentation, and DNA descrambling, takes place in the newly developed macronucleus during the sexual reproduction of ciliated protozoa. Recent studies have revealed that two distant classes of ciliates use distinct types of non-coding RNAs to regulate such DNA rearrangement events. DNA elimination in Tetrahymena is regulated by small non-coding RNAs that are produced and utilized in an RNAi-related process. It has been proposed that the small RNAs produced from the micronuclear genome are used to identify eliminated DNA sequences by whole-genome comparison between the parental macronucleus and the micronucleus. In contrast, DNA descrambling in Oxytricha is guided by long non-coding RNAs that are produced from the parental macronuclear genome. These long RNAs are proposed to act as templates for the direct descrambling events that occur in the developing macronucleus. Both cases provide useful examples to study epigenetic chromatin regulation by non-coding RNAs. PMID:21956937
Bleomycin Can Cleave an Oncogenic Noncoding RNA.

PubMed

Angelbello, Alicia J; Disney, Matthew D

2018-01-04

Noncoding RNAs are pervasive in cells and contribute to diseases such as cancer. A question in biomedical research is whether noncoding RNAs are targets of medicines. Bleomycin is a natural product that cleaves DNA; however, it is known to cleave RNA in vitro. Herein, an in-depth analysis of the RNA cleavage preferences of bleomycin A5 is presented. Bleomycin A5 prefers to cleave RNAs with stretches of AU base pairs. Based on these preferences and bioinformatic analysis, the microRNA-10b hairpin precursor was identified as a potential substrate for bleomycin A5. Both in vitro and cellular experiments demonstrated cleavage. Importantly, chemical cleavage by bleomycin A5 in the microRNA-10b hairpin precursors occurred near the Drosha and Dicer enzymatic processing sites and led to destruction of the microRNA. Evidently, oncogenic noncoding RNAs can be considered targets of cancer medicines and might elicit their pharmacological effects by targeting noncoding RNA. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
An integrative approach to predicting the functional effects of small indels in non-coding regions of the human genome.

PubMed

Ferlaino, Michael; Rogers, Mark F; Shihab, Hashem A; Mort, Matthew; Cooper, David N; Gaunt, Tom R; Campbell, Colin

2017-10-06

Small insertions and deletions (indels) have a significant influence in human disease and, in terms of frequency, they are second only to single nucleotide variants as pathogenic mutations. As the majority of mutations associated with complex traits are located outside the exome, it is crucial to investigate the potential pathogenic impact of indels in non-coding regions of the human genome. We present FATHMM-indel, an integrative approach to predict the functional effect, pathogenic or neutral, of indels in non-coding regions of the human genome. Our method exploits various genomic annotations in addition to sequence data. When validated on benchmark data, FATHMM-indel significantly outperforms CADD and GAVIN, state of the art models in assessing the pathogenic impact of non-coding variants. FATHMM-indel is available via a web server at indels.biocompute.org.uk. FATHMM-indel can accurately predict the functional impact and prioritise small indels throughout the whole non-coding genome.
Long Noncoding RNAs: New Players in the Osteogenic Differentiation of Bone Marrow- and Adipose-Derived Mesenchymal Stem Cells.

PubMed

Yang, Qiaolin; Jia, Lingfei; Li, Xiaobei; Guo, Runzhi; Huang, Yiping; Zheng, Yunfei; Li, Weiran

2018-06-01

Mesenchymal stem cells (MSCs) are an important population of multipotent stem cells that differentiate into multiple lineages and display great potential in bone regeneration and repair. Although the role of protein-coding genes in the osteogenic differentiation of MSCs has been extensively studied, the functions of noncoding RNAs in the osteogenic differentiation of MSCs are unclear. The recent application of next-generation sequencing to MSC transcriptomes has revealed that long noncoding RNAs (lncRNAs) are associated with the osteogenic differentiation of MSCs. LncRNAs are a class of non-coding transcripts of more than 200 nucleotides in length. Noncoding RNAs are thought to play a key role in osteoblast differentiation through various regulatory mechanisms including chromatin modification, transcription factor binding, competent endogenous mechanism, and other post-transcriptional mechanisms. Here, we review the roles of lncRNAs in the osteogenic differentiation of bone marrow- and adipose-derived stem cells and provide a theoretical foundation for future research.
Saprolegniaceae identified on amphibian eggs throughout the Pacific Northwest, USA, by internal transcribed spacer sequences and phylogenetic analysis

Treesearch

Jill E. Petrisko; Christopher A. Pearl; David S. Pilliod; Peter P. Sheridan; Charles F. Williams; Charles R. Peterson; R. Bruce Bury

2008-01-01

We assessed the diversity and phylogeny of Saprolegniaceae on amphibian eggs from the Pacific Northwest, with particular focus on Saprolegnia ferax, a species implicated in high egg mortality. We identified isolates from eggs of six amphibians with the internal transcribed spacer (ITS) and 5.8S gene regions and BLAST of the GenBank database. We...
Evolutionary analysis reveals regulatory and functional landscape of coding and non-coding RNA editing.

PubMed

Zhang, Rui; Deng, Patricia; Jacobson, Dionna; Li, Jin Billy

2017-02-01

Adenosine-to-inosine RNA editing diversifies the transcriptome and promotes functional diversity, particularly in the brain. A plethora of editing sites has been recently identified; however, how they are selected and regulated and which are functionally important are largely unknown. Here we show the cis-regulation and stepwise selection of RNA editing during Drosophila evolution and pinpoint a large number of functional editing sites. We found that the establishment of editing and variation in editing levels across Drosophila species are largely explained and predicted by cis-regulatory elements. Furthermore, editing events that arose early in the species tree tend to be more highly edited in clusters and enriched in slowly-evolved neuronal genes, thus suggesting that the main role of RNA editing is for fine-tuning neurological functions. While nonsynonymous editing events have been long recognized as playing a functional role, in addition to nonsynonymous editing sites, a large fraction of 3'UTR editing sites is evolutionarily constrained, highly edited, and thus likely functional. We find that these 3'UTR editing events can alter mRNA stability and affect miRNA binding and thus highlight the functional roles of noncoding RNA editing. Our work, through evolutionary analyses of RNA editing in Drosophila, uncovers novel insights of RNA editing regulation as well as its functions in both coding and non-coding regions.
Evolutionary analysis reveals regulatory and functional landscape of coding and non-coding RNA editing

PubMed Central

Jacobson, Dionna

2017-01-01

Adenosine-to-inosine RNA editing diversifies the transcriptome and promotes functional diversity, particularly in the brain. A plethora of editing sites has been recently identified; however, how they are selected and regulated and which are functionally important are largely unknown. Here we show the cis-regulation and stepwise selection of RNA editing during Drosophila evolution and pinpoint a large number of functional editing sites. We found that the establishment of editing and variation in editing levels across Drosophila species are largely explained and predicted by cis-regulatory elements. Furthermore, editing events that arose early in the species tree tend to be more highly edited in clusters and enriched in slowly-evolved neuronal genes, thus suggesting that the main role of RNA editing is for fine-tuning neurological functions. While nonsynonymous editing events have been long recognized as playing a functional role, in addition to nonsynonymous editing sites, a large fraction of 3’UTR editing sites is evolutionarily constrained, highly edited, and thus likely functional. We find that these 3’UTR editing events can alter mRNA stability and affect miRNA binding and thus highlight the functional roles of noncoding RNA editing. Our work, through evolutionary analyses of RNA editing in Drosophila, uncovers novel insights of RNA editing regulation as well as its functions in both coding and non-coding regions. PMID:28166241
Emergence of the Noncoding Cancer Genome: A Target of Genetic and Epigenetic Alterations.

PubMed

Zhou, Stanley; Treloar, Aislinn E; Lupien, Mathieu

2016-11-01

The emergence of whole-genome annotation approaches is paving the way for the comprehensive annotation of the human genome across diverse cell and tissue types exposed to various environmental conditions. This has already unmasked the positions of thousands of functional cis-regulatory elements integral to transcriptional regulation, such as enhancers, promoters, and anchors of chromatin interactions that populate the noncoding genome. Recent studies have shown that cis-regulatory elements are commonly the targets of genetic and epigenetic alterations associated with aberrant gene expression in cancer. Here, we review these findings to showcase the contribution of the noncoding genome and its alteration in the development and progression of cancer. We also highlight the opportunities to translate the biological characterization of genetic and epigenetic alterations in the noncoding cancer genome into novel approaches to treat or monitor disease. The majority of genetic and epigenetic alterations accumulate in the noncoding genome throughout oncogenesis. Discriminating driver from passenger events is a challenge that holds great promise to improve our understanding of the etiology of different cancer types. Advancing our understanding of the noncoding cancer genome may thus identify new therapeutic opportunities and accelerate our capacity to find improved biomarkers to monitor various stages of cancer development. Cancer Discov; 6(11); 1215-29. ©2016 AACR. ©2016 American Association for Cancer Research.
Systematic analysis of coding and noncoding DNA sequences using methods of statistical linguistics

NASA Technical Reports Server (NTRS)

Mantegna, R. N.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Peng, C. K.; Simons, M.; Stanley, H. E.

1995-01-01

We compare the statistical properties of coding and noncoding regions in eukaryotic and viral DNA sequences by adapting two tests developed for the analysis of natural languages and symbolic sequences. The data set comprises all 30 sequences of length above 50 000 base pairs in GenBank Release No. 81.0, as well as the recently published sequences of C. elegans chromosome III (2.2 Mbp) and yeast chromosome XI (661 Kbp). We find that for the three chromosomes we studied the statistical properties of noncoding regions appear to be closer to those observed in natural languages than those of coding regions. In particular, (i) a n-tuple Zipf analysis of noncoding regions reveals a regime close to power-law behavior while the coding regions show logarithmic behavior over a wide interval, while (ii) an n-gram entropy measurement shows that the noncoding regions have a lower n-gram entropy (and hence a larger "n-gram redundancy") than the coding regions. In contrast to the three chromosomes, we find that for vertebrates such as primates and rodents and for viral DNA, the difference between the statistical properties of coding and noncoding regions is not pronounced and therefore the results of the analyses of the investigated sequences are less conclusive. After noting the intrinsic limitations of the n-gram redundancy analysis, we also briefly discuss the failure of the zeroth- and first-order Markovian models or simple nucleotide repeats to account fully for these "linguistic" features of DNA. Finally, we emphasize that our results by no means prove the existence of a "language" in noncoding DNA.
Genome defense against exogenous nucleic acids in eukaryotes by non-coding DNA occurs through CRISPR-like mechanisms in the cytosol and the bodyguard protection in the nucleus.

PubMed

Qiu, Guo-Hua

2016-01-01

In this review, the protective function of the abundant non-coding DNA in the eukaryotic genome is discussed from the perspective of genome defense against exogenous nucleic acids. Peripheral non-coding DNA has been proposed to act as a bodyguard that protects the genome and the central protein-coding sequences from ionizing radiation-induced DNA damage. In the proposed mechanism of protection, the radicals generated by water radiolysis in the cytosol and IR energy are absorbed, blocked and/or reduced by peripheral heterochromatin; then, the DNA damage sites in the heterochromatin are removed and expelled from the nucleus to the cytoplasm through nuclear pore complexes, most likely through the formation of extrachromosomal circular DNA. To strengthen this hypothesis, this review summarizes the experimental evidence supporting the protective function of non-coding DNA against exogenous nucleic acids. Based on these data, I hypothesize herein about the presence of an additional line of defense formed by small RNAs in the cytosol in addition to their bodyguard protection mechanism in the nucleus. Therefore, exogenous nucleic acids may be initially inactivated in the cytosol by small RNAs generated from non-coding DNA via mechanisms similar to the prokaryotic CRISPR-Cas system. Exogenous nucleic acids may enter the nucleus, where some are absorbed and/or blocked by heterochromatin and others integrate into chromosomes. The integrated fragments and the sites of DNA damage are removed by repetitive non-coding DNA elements in the heterochromatin and excluded from the nucleus. Therefore, the normal eukaryotic genome and the central protein-coding sequences are triply protected by non-coding DNA against invasion by exogenous nucleic acids. This review provides evidence supporting the protective role of non-coding DNA in genome defense. Copyright © 2016 Elsevier B.V. All rights reserved.
Genome Size, Molecular Phylogeny, and Evolutionary History of the Tribe Aquilarieae (Thymelaeaceae), the Natural Source of Agarwood

PubMed Central

Farah, Azman H.; Lee, Shiou Yih; Gao, Zhihui; Yao, Tze Leong; Madon, Maria; Mohamed, Rozi

2018-01-01

The tribe Aquilarieae of the family Thymelaeaceae consists of two genera, Aquilaria and Gyrinops, with a total of 30 species, distributed from northeast India, through southeast Asia and the south of China, to Papua New Guinea. They are an important botanical resource for fragrant agarwood, a prized product derived from injured or infected stems of these species. The aim of this study was to estimate the genome size of selected Aquilaria species and comprehend the evolutionary history of Aquilarieae speciation through molecular phylogeny. Five non-coding chloroplast DNA regions and a nuclear region were sequenced from 12 Aquilaria and three Gyrinops species. Phylogenetic trees constructed using combined chloroplast DNA sequences revealed relationships of the studied 15 members in Aquilarieae, while nuclear ribosomal DNA internal transcribed spacer (ITS) sequences showed a paraphyletic relationship between Aquilaria species from Indochina and Malesian. We exposed, for the first time, the estimated divergence time for Aquilarieae speciation, which was speculated to happen during the Miocene Epoch. The ancestral split and biogeographic pattern of studied species were discussed. Results showed no large variation in the 2C-values for the five Aquilaria species (1.35–2.23 pg). Further investigation into the genome size may provide additional information regarding ancestral traits and its evolution history. PMID:29896211
The PAPI-1 pathogenicity island-encoded small RNA PesA influences Pseudomonas aeruginosa virulence and modulates pyocin S3 production

PubMed Central

Ferrara, Silvia; Falcone, Marilena; Macchi, Raffaella; Bragonzi, Alessandra; Girelli, Daniela; Cariani, Lisa; Cigana, Cristina

2017-01-01

Small non-coding RNAs (sRNAs) are post-transcriptional regulators of gene expression that have been recognized as key contributors to bacterial virulence and pathogenic mechanisms. In this study, we characterized the sRNA PesA of the opportunistic human pathogen Pseudomonas aeruginosa. We show that PesA, which is transcribed within the pathogenicity island PAPI-1 of P. aeruginosa strain PA14, contributes to P. aeruginosa PA14 virulence. In fact, pesA gene deletion resulted in a less pathogenic strain, showing higher survival of cystic fibrosis human bronchial epithelial cells after infection. Moreover, we show that PesA influences positively the expression of pyocin S3 whose genetic locus comprises two structural genes, pyoS3A and pyoS3I, encoding the killing S3A and the immunity S3I proteins, respectively. Interestingly, the deletion of pesA gene results in increased sensitivity to UV irradiation and to the fluoroquinolone antibiotic ciprofloxacin. The degree of UV sensitivity displayed by the PA14 strain lacking PesA is comparable to that of a strain deleted for pyoS3A-I. These results suggest an involvement of pyocin S3 in DNA damage repair and a regulatory role of PesA on this function. PMID:28665976
The humankind genome: from genetic diversity to the origin of human diseases.

PubMed

Belizário, Jose E

2013-12-01

Genome-wide association studies have failed to establish common variant risk for the majority of common human diseases. The underlying reasons for this failure are explained by recent studies of resequencing and comparison of over 1200 human genomes and 10 000 exomes, together with the delineation of DNA methylation patterns (epigenome) and full characterization of coding and noncoding RNAs (transcriptome) being transcribed. These studies have provided the most comprehensive catalogues of functional elements and genetic variants that are now available for global integrative analysis and experimental validation in prospective cohort studies. With these datasets, researchers will have unparalleled opportunities for the alignment, mining, and testing of hypotheses for the roles of specific genetic variants, including copy number variations, single nucleotide polymorphisms, and indels as the cause of specific phenotypes and diseases. Through the use of next-generation sequencing technologies for genotyping and standardized ontological annotation to systematically analyze the effects of genomic variation on humans and model organism phenotypes, we will be able to find candidate genes and new clues for disease's etiology and treatment. This article describes essential concepts in genetics and genomic technologies as well as the emerging computational framework to comprehensively search websites and platforms available for the analysis and interpretation of genomic data.
TAS3 miR390-dependent loci in non-vascular land plants: towards a comprehensive reconstruction of the gene evolutionary history

PubMed Central

Milyutina, Irina A.; Erokhina, Tatiana N.; Ozerova, Liudmila V.; Troitsky, Alexey V.; Solovyev, Andrey G.

2018-01-01

Trans-acting small interfering RNAs (ta-siRNAs) are transcribed from protein non-coding genomic TAS loci and belong to a plant-specific class of endogenous small RNAs. These siRNAs have been found to regulate gene expression in most taxa including seed plants, gymnosperms, ferns and mosses. In this study, bioinformatic and experimental PCR-based approaches were used as tools to analyze TAS3 and TAS6 loci in transcriptomes and genomic DNAs from representatives of evolutionary distant non-vascular plant taxa such as Bryophyta, Marchantiophyta and Anthocerotophyta. We revealed previously undiscovered TAS3 loci in plant classes Sphagnopsida and Anthocerotopsida, as well as TAS6 loci in Bryophyta classes Tetraphidiopsida, Polytrichopsida, Andreaeopsida and Takakiopsida. These data further unveil the evolutionary pathway of the miR390-dependent TAS3 loci in land plants. We also identified charophyte alga sequences coding for SUPPRESSOR OF GENE SILENCING 3 (SGS3), which is required for generation of ta-siRNAs in plants, and hypothesized that the appearance of TAS3-related sequences could take place at a very early step in evolutionary transition from charophyte algae to an earliest common ancestor of land plants. PMID:29682420
DNA-RNA hybrid formation mediates RNAi-directed heterochromatin formation.

PubMed

Nakama, Mina; Kawakami, Kei; Kajitani, Takuya; Urano, Takeshi; Murakami, Yota

2012-03-01

Certain noncoding RNAs (ncRNAs) implicated in the regulation of chromatin structure associate with chromatin. During the formation of RNAi-directed heterochromatin in fission yeast, ncRNAs transcribed from heterochromatin are thought to recruit the RNAi machinery to chromatin for the formation of heterochromatin; however, the molecular details of this association are not clear. Here, using RNA immunoprecipitation assay, we showed that the heterochromatic ncRNA was associated with chromatin via the formation of a DNA-RNA hybrid and bound to the RNA-induced transcriptional silencing (RITS) complex. The presence of DNA-RNA hybrid in the cell was also confirmed by immunofluorescence analysis using anti-DNA-RNA hybrid antibody. Over-expression and depletion of RNase H in vivo decreased and increased the amount of DNA-RNA hybrid formed, respectively, and both disturbed heterochromatin. Moreover, DNA-RNA hybrid was formed on, and over-expression of RNase H inhibited the formation of, artificial heterochromatin induced by tethering of RITS to mRNA. These results indicate that heterochromatic ncRNAs are retained on chromatin via the formation of DNA-RNA hybrids and provide a platform for the RNAi-directed heterochromatin assembly and suggest that DNA-RNA hybrid formation plays a role in chromatic ncRNA function. © 2012 The Authors. Journal compilation © 2012 by the Molecular Biology Society of Japan/Blackwell Publishing Ltd.
DSAP: deep-sequencing small RNA analysis pipeline.

PubMed

Huang, Po-Jung; Liu, Yi-Chung; Lee, Chi-Ching; Lin, Wei-Chen; Gan, Richie Ruei-Chi; Lyu, Ping-Chiang; Tang, Petrus

2010-07-01

DSAP is an automated multiple-task web service designed to provide a total solution to analyzing deep-sequencing small RNA datasets generated by next-generation sequencing technology. DSAP uses a tab-delimited file as an input format, which holds the unique sequence reads (tags) and their corresponding number of copies generated by the Solexa sequencing platform. The input data will go through four analysis steps in DSAP: (i) cleanup: removal of adaptors and poly-A/T/C/G/N nucleotides; (ii) clustering: grouping of cleaned sequence tags into unique sequence clusters; (iii) non-coding RNA (ncRNA) matching: sequence homology mapping against a transcribed sequence library from the ncRNA database Rfam (http://rfam.sanger.ac.uk/); and (iv) known miRNA matching: detection of known miRNAs in miRBase (http://www.mirbase.org/) based on sequence homology. The expression levels corresponding to matched ncRNAs and miRNAs are summarized in multi-color clickable bar charts linked to external databases. DSAP is also capable of displaying miRNA expression levels from different jobs using a log(2)-scaled color matrix. Furthermore, a cross-species comparative function is also provided to show the distribution of identified miRNAs in different species as deposited in miRBase. DSAP is available at http://dsap.cgu.edu.tw.
The mitochondrial genome of Priapulus caudatus Lamarck (Priapulida: Priapulidae).

PubMed

Webster, Bonnie L; Mackenzie-Dodds, Jacqueline A; Telford, Maximilian J; Littlewood, D Timothy J

2007-03-01

We sequenced and annotated the complete mitochondrial (mt) genome of the priapulid Priapulus caudatus in order to provide a source of phylogenetic characters including an assessment of gene order arrangement. The genome was 14,919 bp in its entirety with few, short non-coding regions. A number of protein-coding and tRNA genes overlapped, making the genome relatively compact. The gene order was: cox1, cox2, trnK, trnD, atp8, atp6, cox3, trnG, nad3, trnA, trnR, trnN, rrnS, trnV, rrnL, trnL(yaa), trnL(nag), nad1, -trnS(nga), -cob, -nad6, trnP, -trnT, nad4L, nad4, trnH, nad5, trnF, -trnE, -trnS(nct), trnI, -trnQ, trnM, nad2, trnW, -trnC, -trnY; where '-' indicates genes transcribed on the opposite strand. The gene order, although unique amongst Metazoa, shared the greatest number of gene boundaries and the longest contiguous fragments with the chelicerate Limulus polyphemus. The mt genomes of these taxa differed only by a single inversion of 18 contiguous genes bounded by rrnS and trnS(nct). Other arthropods and nematodes shared fewer gene boundaries but considerably more than the most similar non-ecdysozoan.
A molecular phylogeny of Raddia and its allies within the tribe Olyreae (Poaceae, Bambusoideae) based on noncoding plastid and nuclear spacers.

PubMed

Oliveira, Reyjane P; Clark, Lynn G; Schnadelbach, Alessandra S; Monteiro, Silvana H N; Borba, Eduardo L; Longhi-Wagner, Hilda M; van den Berg, Cassio

2014-09-01

The plastid spacer trnD-trnT and the nuclear ribosomal internal transcribed spacer (ITS) were sequenced for 37 samples of herbaceous bamboos (Poaceae: Olyreae), including all Raddia species and allied genera, as well as two members of the woody bamboos (tribes Bambuseae and Arundinarieae), in order to examine their relationships. The sequences were analyzed using maximum parsimony and Bayesian inference. Both the individual and combined analyses of ITS and trnD-trnT supported Olyreae as a monophyletic group. All species of Raddia also formed a well-supported monophyletic group, and combined datasets allowed us to outline some relationships within this group. Individual analyses indicated incongruence regarding the sister group of Raddia, with ITS data weakly indicating Raddiella malmeana whereas trnD-trnT data supported Sucrea maculata in this position. However, the combined analysis supported Sucrea as sister to Raddia, although the monophyly of Sucrea is not well supported. Parodiolyra is paraphyletic to Raddiella in all analyses; Olyra is also paraphyletic, with species of Lithachne, Arberella and Cryptochloa nested within it. Eremitis and Pariana appeared as an isolated clade within Olyreae, and the position of the New Guinean Buergersiochloa remains uncertain within this tribe. Copyright © 2014 Elsevier Inc. All rights reserved.

Identification of a novel antisense long non-coding RNA PLA2G16-AS that regulates the expression of PLA2G16 in pigs.

PubMed

Liu, Pengliang; Jin, Long; Zhao, Lirui; Long, Keren; Song, Yang; Tang, Qianzi; Ma, Jideng; Wang, Xun; Tang, Guoqing; Jiang, Yanzhi; Zhu, Li; Li, Xuewei; Li, Mingzhou

2018-05-31

Natural antisense transcripts (NATs) are widely present in mammalian genomes and act as pivotal regulator molecules to control gene expression. However, studies on the NATs of pigs are relatively rare. Here, we identified a novel antisense transcript, designated PLA2G16-AS, transcribed from the phospholipase A2 group XVI locus (PLA2G16) in the porcine genome, which is a well-known regulatory molecule of fat deposition. PLA2G16-AS and PLA2G16 were dominantly expressed in porcine adipose tissue, and were differentially expressed between Tibetan pigs and Rongchang pigs. In addition, PLA2G16-AS has a weak sequence conservation among different vertebrates. PLA2G16-AS was also shown to form an RNA-RNA duplex with PLA2G16, and to regulate PLA2G16 expression at the mRNA level. Moreover, the overexpression of PLA2G16-AS increased the stability of PLA2G16 mRNA in porcine cells. We envision that our findings of a NAT for a regulatory gene associated with lipolysis might further our understanding of the molecular regulation of fat deposition. Copyright © 2017. Published by Elsevier B.V.
Genetics Home Reference: isolated Pierre Robin sequence

MedlinePlus

... PG, Fitzpatrick DR, Lyonnet S. Highly conserved non-coding elements on either side of SOX9 associated with Pierre ... Citation on PubMed or Free article on PubMed Central Jakobsen LP, Ullmann R, Christensen SB, Jensen KE, ...
Long non-coding RNA gastric carcinoma highly expressed transcript 1 promotes cell proliferation and invasion in human head and neck cancer.

PubMed

Liu, Hui; Wu, Yu

2018-05-01

Recent evidence indicates that the long non-coding RNA gastric carcinoma highly expressed transcript 1 (GHET1) is involved in the development and carcinogenesis of several tumor types; however, the exact roles of GHET1 and its underlying mechanisms in head and neck cancer (HNC) remain largely unknown. In the present study, the expression patterns of GHET1 in HNC were determined and its clinical significance was assessed. The expression level of GHET1 was significantly increased in HNC tissues, compared with paired adjacent normal tissues. High GHET1 expression was significantly associated with advanced Tumor-Node-Metastasis stages and poor prognosis. Furthermore, inhibition of GHET1 suppressed cell proliferation, induced cell apoptosis and caused cell cycle arrest in vitro . In addition, GHET1 silencing inhibited cell migration and invasion. Taken together, the results of the present study indicated that GHET1 acts as an oncogene in HNC and may represent a novel therapeutic target.
Disruption of long-distance highly conserved noncoding elements in neurocristopathies.

PubMed

Amiel, Jeanne; Benko, Sabina; Gordon, Christopher T; Lyonnet, Stanislas

2010-12-01

One of the key discoveries of vertebrate genome sequencing projects has been the identification of highly conserved noncoding elements (CNEs). Some characteristics of CNEs include their high frequency in mammalian genomes, their potential regulatory role in gene expression, and their enrichment in gene deserts nearby master developmental genes. The abnormal development of neural crest cells (NCCs) leads to a broad spectrum of congenital malformation(s), termed neurocristopathies, and/or tumor predisposition. Here we review recent findings that disruptions of CNEs, within or at long distance from the coding sequences of key genes involved in NCC development, result in neurocristopathies via the alteration of tissue- or stage-specific long-distance regulation of gene expression. While most studies on human genetic disorders have focused on protein-coding sequences, these examples suggest that investigation of genomic alterations of CNEs will provide a broader understanding of the molecular etiology of both rare and common human congenital malformations. © 2010 New York Academy of Sciences.
Aberrant expression of long noncoding RNAs in cumulus cells isolated from PCOS patients.

PubMed

Huang, Xin; Hao, Cuifang; Bao, Hongchu; Wang, Meimei; Dai, Huangguan

2016-01-01

To describe the long noncoding RNA (lncRNA) profiles in cumulus cells isolated from polycystic ovary syndrome (PCOS) patients by employing a microarray and in-depth bioinformatics analysis. This information will help us understand the occurrence and development of PCOS. In this study, we used a microarray to describe lncRNA profiles in cumulus cells isolated from ten patients (five PCOS and five normal women). Several differentially expressed lncRNAs were chosen to validate the microarray results by quantitative RT-PCR (qRT-PCR). Then, the differentially expressed lncRNAs were classified into three subgroups (HOX loci lncRNA, enhancer-like lncRNA, and lincRNA) to deduce their potential features. Furthermore, a lncRNA/mRNA co-expression network was constructed by using the Cytoscape software (V2.8.3, http://www.cytoscape.org/ ). We observed that 623 lncRNAs and 260 messenger RNAs (mRNAs) were significantly up- or down-regulated (≥2-fold change), and these differences could be used to discriminate cumulus cells of PCOS from those of normal patients. Five differentially expressed lncRNAs (XLOC_011402, ENST00000454271, ENST00000433673, ENST00000450294, and ENST00000432431) were selected to validate the microarray results using quantitative RT-PCR (qRT-PCR). The qRT-PCR results were consistent with the microarray data. Further analysis indicated that many differentially expressed lncRNAs were transcribed from chromosome 2 and may act as enhancers to regulate their neighboring protein-coding genes. Forty-three lncRNAs and 29 mRNAs were used to construct the coding-non-coding gene co-expression network. Most pairs positively correlated, and one mRNA correlated with one or more lncRNAs. Our study is the first to determine genome-wide lncRNA expression patterns in cumulus cells isolated from PCOS patients by microarray. The results show that clusters of lncRNAs were aberrantly expressed in cumulus cells of PCOS patients compared with those of normal women, which revealed that lncRNAs differentially expressed in PCOS and normal women may contribute to the occurrence of PCOS and affect oocyte development.
A deep learning method for lincRNA detection using auto-encoder algorithm.

PubMed

Yu, Ning; Yu, Zeng; Pan, Yi

2017-12-06

RNA sequencing technique (RNA-seq) enables scientists to develop novel data-driven methods for discovering more unidentified lincRNAs. Meantime, knowledge-based technologies are experiencing a potential revolution ignited by the new deep learning methods. By scanning the newly found data set from RNA-seq, scientists have found that: (1) the expression of lincRNAs appears to be regulated, that is, the relevance exists along the DNA sequences; (2) lincRNAs contain some conversed patterns/motifs tethered together by non-conserved regions. The two evidences give the reasoning for adopting knowledge-based deep learning methods in lincRNA detection. Similar to coding region transcription, non-coding regions are split at transcriptional sites. However, regulatory RNAs rather than message RNAs are generated. That is, the transcribed RNAs participate the biological process as regulatory units instead of generating proteins. Identifying these transcriptional regions from non-coding regions is the first step towards lincRNA recognition. The auto-encoder method achieves 100% and 92.4% prediction accuracy on transcription sites over the putative data sets. The experimental results also show the excellent performance of predictive deep neural network on the lincRNA data sets compared with support vector machine and traditional neural network. In addition, it is validated through the newly discovered lincRNA data set and one unreported transcription site is found by feeding the whole annotated sequences through the deep learning machine, which indicates that deep learning method has the extensive ability for lincRNA prediction. The transcriptional sequences of lincRNAs are collected from the annotated human DNA genome data. Subsequently, a two-layer deep neural network is developed for the lincRNA detection, which adopts the auto-encoder algorithm and utilizes different encoding schemes to obtain the best performance over intergenic DNA sequence data. Driven by those newly annotated lincRNA data, deep learning methods based on auto-encoder algorithm can exert their capability in knowledge learning in order to capture the useful features and the information correlation along DNA genome sequences for lincRNA detection. As our knowledge, this is the first application to adopt the deep learning techniques for identifying lincRNA transcription sequences.
Transcriptome Sequence and Plasmid Copy Number Analysis of the Brewery Isolate Pediococcus claussenii ATCC BAA-344T during Growth in Beer

PubMed Central

Pittet, Vanessa; Phister, Trevor G.; Ziola, Barry

2013-01-01

Growth of specific lactic acid bacteria in beer leads to spoiled product and economic loss for the brewing industry. Microbial growth is typically inhibited by the combined stresses found in beer (e.g., ethanol, hops, low pH, minimal nutrients); however, certain bacteria have adapted to grow in this harsh environment. Considering little is known about the mechanisms used by bacteria to grow in and spoil beer, transcriptome sequencing was performed on a variant of the beer-spoilage organism Pediococcus claussenii ATCC BAA-344T (Pc344-358). Illumina sequencing was used to compare the transcript levels in Pc344-358 growing mid-exponentially in beer to those in nutrient-rich MRS broth. Various operons demonstrated high gene expression in beer, several of which are involved in nutrient acquisition and overcoming the inhibitory effects of hop compounds. As well, genes functioning in cell membrane modification and biosynthesis demonstrated significantly higher transcript levels in Pc344-358 growing in beer. Three plasmids had the majority of their genes showing increased transcript levels in beer, whereas the two cryptic plasmids showed slightly decreased gene expression. Follow-up analysis of plasmid copy number in both growth environments revealed similar trends, where more copies of the three non-cryptic plasmids were found in Pc344-358 growing in beer. Transcriptome sequencing also enabled the addition of several genes to the P . claussenii ATCC BAA-344T genome annotation, some of which are putatively transcribed as non-coding RNAs. The sequencing results not only provide the first transcriptome description of a beer-spoilage organism while growing in beer, but they also highlight several targets for future exploration, including genes that may have a role in the general stress response of lactic acid bacteria. PMID:24040005
Comparative Genomics in Drosophila.

PubMed

Oti, Martin; Pane, Attilio; Sammeth, Michael

2018-01-01

Since the pioneering studies of Thomas Hunt Morgan and coworkers at the dawn of the twentieth century, Drosophila melanogaster and its sister species have tremendously contributed to unveil the rules underlying animal genetics, development, behavior, evolution, and human disease. Recent advances in DNA sequencing technologies launched Drosophila into the post-genomic era and paved the way for unprecedented comparative genomics investigations. The complete sequencing and systematic comparison of the genomes from 12 Drosophila species represents a milestone achievement in modern biology, which allowed a plethora of different studies ranging from the annotation of known and novel genomic features to the evolution of chromosomes and, ultimately, of entire genomes. Despite the efforts of countless laboratories worldwide, the vast amount of data that were produced over the past 15 years is far from being fully explored.In this chapter, we will review some of the bioinformatic approaches that were developed to interrogate the genomes of the 12 Drosophila species. Setting off from alignments of the entire genomic sequences, the degree of conservation can be separately evaluated for every region of the genome, providing already first hints about elements that are under purifying selection and therefore likely functional. Furthermore, the careful analysis of repeated sequences sheds light on the evolutionary dynamics of transposons, an enigmatic and fascinating class of mobile elements housed in the genomes of animals and plants. Comparative genomics also aids in the computational identification of the transcriptionally active part of the genome, first and foremost of protein-coding loci, but also of transcribed nevertheless apparently noncoding regions, which were once considered "junk" DNA. Eventually, the synergy between functional and comparative genomics also facilitates in silico and in vivo studies on cis-acting regulatory elements, like transcription factor binding sites, that due to the high degree of sequence variability usually impose increased challenges for bioinformatics approaches.
The Evolution of Dark Matter in the Mitogenome of Seed Beetles

PubMed Central

Sayadi, Ahmed; Immonen, Elina; Tellgren-Roth, Christian

2017-01-01

Abstract Animal mitogenomes are generally thought of as being economic and optimized for rapid replication and transcription. We use long-read sequencing technology to assemble the remarkable mitogenomes of four species of seed beetles. These are the largest circular mitogenomes ever assembled in insects, ranging from 24,496 to 26,613 bp in total length, and are exceptional in that some 40% consists of non-coding DNA. The size expansion is due to two very long intergenic spacers (LIGSs), rich in tandem repeats. The two LIGSs are present in all species but vary greatly in length (114–10,408 bp), show very low sequence similarity, divergent tandem repeat motifs, a very high AT content and concerted length evolution. The LIGSs have been retained for at least some 45 my but must have undergone repeated reductions and expansions, despite strong purifying selection on protein coding mtDNA genes. The LIGSs are located in two intergenic sites where a few recent studies of insects have also reported shorter LIGSs (>200 bp). These sites may represent spaces that tolerate neutral repeat array expansions or, alternatively, the LIGSs may function to allow a more economic translational machinery. Mitochondrial respiration in adult seed beetles is based almost exclusively on fatty acids, which reduces the need for building complex I of the oxidative phosphorylation pathway (NADH dehydrogenase). One possibility is thus that the LIGSs may allow depressed transcription of NAD genes. RNA sequencing showed that LIGSs are partly transcribed and transcriptional profiling suggested that all seven mtDNA NAD genes indeed show low levels of transcription and co-regulation of transcription across sexes and tissues. PMID:29048527
Genome-wide identification of miRNAs and lncRNAs in Cajanus cajan.

PubMed

Nithin, Chandran; Thomas, Amal; Basak, Jolly; Bahadur, Ranjit Prasad

2017-11-15

Non-coding RNAs (ncRNAs) are important players in the post transcriptional regulation of gene expression (PTGR). On one hand, microRNAs (miRNAs) are an abundant class of small ncRNAs (~22nt long) that negatively regulate gene expression at the levels of messenger RNAs stability and translation inhibition, on the other hand, long ncRNAs (lncRNAs) are a large and diverse class of transcribed non-protein coding RNA molecules (> 200nt) that play both up-regulatory as well as down-regulatory roles at the transcriptional level. Cajanus cajan, a leguminosae pulse crop grown in tropical and subtropical areas of the world, is a source of high value protein to vegetarians or very poor populations globally. Hence, genome-wide identification of miRNAs and lncRNAs in C. cajan is extremely important to understand their role in PTGR with a possible implication to generate improve variety of crops. We have identified 616 mature miRNAs in C. cajan belonging to 118 families, of which 578 are novel and not reported in MirBase21. A total of 1373 target sequences were identified for 180 miRNAs. Of these, 298 targets were characterized at the protein level. Besides, we have also predicted 3919 lncRNAs. Additionally, we have identified 87 of the predicted lncRNAs to be targeted by 66 miRNAs. miRNA and lncRNAs in plants are known to control a variety of traits including yield, quality and stress tolerance. Owing to its agricultural importance and medicinal value, the identified miRNA, lncRNA and their targets in C. cajan may be useful for genome editing to improve better quality crop. A thorough understanding of ncRNA-based cellular regulatory networks will aid in the improvement of C. cajan agricultural traits.
Distinct patterns of alteration of myc genes associated with integration of human papillomavirus type 16 or type 45 DNA in two genital tumours.

PubMed

Sastre-Garau, X; Favre, M; Couturier, J; Orth, G

2000-08-01

We previously described two genital carcinomas (IC2, IC4) containing human papillomavirus type 16 (HPV-16)- or HPV-18-related sequences integrated in chromosomal bands containing the c-myc (8q24) or N-myc (2p24) gene, respectively. The c-myc gene was rearranged and amplified in IC2 cells without evidence of overexpression. The N-myc gene was amplified and highly transcribed in IC4 cells. Here, the sequence of an 8039 bp IC4 DNA fragment containing the integrated viral sequences and the cellular junctions is reported. A 3948 bp segment of the genome of HPV-45 encompassing the upstream regulatory region and the E6 and E7 ORFs was integrated into the untranslated part of N-myc exon 3, upstream of the N-myc polyadenylation signal. Both N-myc and HPV-45 sequences were amplified 10- to 20-fold. The 3' ends of the major N-myc transcript were mapped upstream of the 5' junction. A minor N-myc/HPV-45 fusion transcript was also identified, as well as two abundant transcripts from the HPV-45 E6-E7 region. Large amounts of N-myc protein were detected in IC4 cells. A major alteration of c-myc sequences in IC2 cells involved the insertion of a non-coding sequence into the second intron and their co-amplification with the third exon, without any evidence for the integration of HPV-16 sequences within or close to the gene. Different patterns of myc gene alterations may thus be associated with integration of HPV DNA in genital tumours, including the activation of the protooncogene via a mechanism of insertional mutagenesis and/or gene amplification.
A two-locus global DNA barcode for land plants: the coding rbcL gene complements the non-coding trnH-psbA spacer region.

PubMed

Kress, W John; Erickson, David L

2007-06-06

A useful DNA barcode requires sufficient sequence variation to distinguish between species and ease of application across a broad range of taxa. Discovery of a DNA barcode for land plants has been limited by intrinsically lower rates of sequence evolution in plant genomes than that observed in animals. This low rate has complicated the trade-off in finding a locus that is universal and readily sequenced and has sufficiently high sequence divergence at the species-level. Here, a global plant DNA barcode system is evaluated by comparing universal application and degree of sequence divergence for nine putative barcode loci, including coding and non-coding regions, singly and in pairs across a phylogenetically diverse set of 48 genera (two species per genus). No single locus could discriminate among species in a pair in more than 79% of genera, whereas discrimination increased to nearly 88% when the non-coding trnH-psbA spacer was paired with one of three coding loci, including rbcL. In silico trials were conducted in which DNA sequences from GenBank were used to further evaluate the discriminatory power of a subset of these loci. These trials supported the earlier observation that trnH-psbA coupled with rbcL can correctly identify and discriminate among related species. A combination of the non-coding trnH-psbA spacer region and a portion of the coding rbcL gene is recommended as a two-locus global land plant barcode that provides the necessary universality and species discrimination.
Trichodesmium genome maintains abundant, widespread noncoding DNA in situ, despite oligotrophic lifestyle

DOE PAGES

Walworth, Nathan G.; Pfreundt, Ulrike; Nelson, William C.; ...

2015-04-07

Understanding the evolution of the free-living, cyanobacterial, diazotroph Trichodesmium is of great importance due to its critical role in oceanic biogeochemistry and primary production. Unlike the other >150 available genomes of free-living cyanobacteria, only 63.8% of the Trichodesmium erythraeum (strain IMS101) genome is predicted to encode protein, which is 20-25% less than the average for other cyanobacteria and non-pathogenic, free-living bacteria. We use distinctive isolates and metagenomic data to show that low coding density observed in IMS101 is a common feature of the Trichodesmium genus both in culture and in situ. Transcriptome analysis indicates that 86% of the non-coding spacemore » is expressed, although the function of these transcripts is unclear. The density of noncoding, possible regulatory elements predicted in Trichodesmium, when normalized per intergenic kilobase, was comparable and two fold higher than that found in the gene dense genomes of the sympatric cyanobacterial genera Synechococcus and Prochlorococcus, respectively. Conserved Trichodesmium ncRNA secondary structures were predicted between most culture and metagenomic sequences lending support to the structural conservation. Conservation of these intergenic regions in spatiotemporally separated Trichodesmium populations suggests possible genus-wide selection for their maintenance. These large intergenic spacers may have developed during intervals of strong genetic drift caused by periodic blooms of a subset of genotypes, which may have reduced effective population size. Our data suggest that transposition of selfish DNA, low effective population size, and high fidelity replication allowed the unusual ‘inflation’ of noncoding sequence observed in Trichodesmium despite its oligotrophic lifestyle.« less
Faster-X evolution of gene expression is driven by recessive adaptive cis-regulatory variation in Drosophila.

PubMed

Llopart, Ana

2018-05-01

The hemizygosity of the X (Z) chromosome fully exposes the fitness effects of mutations on that chromosome and has evolutionary consequences on the relative rates of evolution of X and autosomes. Specifically, several population genetics models predict increased rates of evolution in X-linked loci relative to autosomal loci. This prediction of faster-X evolution has been evaluated and confirmed for both protein coding sequences and gene expression. In the case of faster-X evolution for gene expression divergence, it is often assumed that variation in 5' noncoding sequences is associated with variation in transcript abundance between species but a formal, genomewide test of this hypothesis is still missing. Here, I use whole genome sequence data in Drosophila yakuba and D. santomea to evaluate this hypothesis and report positive correlations between sequence divergence at 5' noncoding sequences and gene expression divergence. I also examine polymorphism and divergence in 9,279 noncoding sequences located at the 5' end of annotated genes and detected multiple signals of positive selection. Notably, I used the traditional synonymous sites as neutral reference to test for adaptive evolution, but I also used bases 8-30 of introns <65 bp, which have been proposed to be a better neutral choice. X-linked genes with high degree of male-biased expression show the most extreme adaptive pattern at 5' noncoding regions, in agreement with faster-X evolution for gene expression divergence and a higher incidence of positively selected recessive mutations. © 2018 The Authors. Molecular Ecology Published by John Wiley & Sons Ltd.
Statistical and linguistic features of DNA sequences

NASA Technical Reports Server (NTRS)

Havlin, S.; Buldyrev, S. V.; Goldberger, A. L.; Mantegna, R. N.; Peng, C. K.; Simons, M.; Stanley, H. E.

1995-01-01

We present evidence supporting the idea that the DNA sequence in genes containing noncoding regions is correlated, and that the correlation is remarkably long range--indeed, base pairs thousands of base pairs distant are correlated. We do not find such a long-range correlation in the coding regions of the gene. We resolve the problem of the "non-stationary" feature of the sequence of base pairs by applying a new algorithm called Detrended Fluctuation Analysis (DFA). We address the claim of Voss that there is no difference in the statistical properties of coding and noncoding regions of DNA by systematically applying the DFA algorithm, as well as standard FFT analysis, to all eukaryotic DNA sequences (33 301 coding and 29 453 noncoding) in the entire GenBank database. We describe a simple model to account for the presence of long-range power-law correlations which is based upon a generalization of the classic Levy walk. Finally, we describe briefly some recent work showing that the noncoding sequences have certain statistical features in common with natural languages. Specifically, we adapt to DNA the Zipf approach to analyzing linguistic texts, and the Shannon approach to quantifying the "redundancy" of a linguistic text in terms of a measurable entropy function. We suggest that noncoding regions in plants and invertebrates may display a smaller entropy and larger redundancy than coding regions, further supporting the possibility that noncoding regions of DNA may carry biological information.
LncRNA-DANCR: A valuable cancer related long non-coding RNA for human cancers.

PubMed

Thin, Khaing Zar; Liu, Xuefang; Feng, Xiaobo; Raveendran, Sudheesh; Tu, Jian Cheng

2018-06-01

Long noncoding RNAs (lncRNA) are a type of noncoding RNA that comprise of longer than 200 nucleotides sequences. They can regulate chromosome structure, gene expression and play an essential role in the pathophysiology of human diseases, especially in tumorigenesis and progression. Nowadays, they are being targeted as potential biomarkers for various cancer types. And many research studies have proven that lncRNAs might bring a new era to cancer diagnosis and support treatment management. The purpose of this review was to inspect the molecular mechanism and clinical significance of long non-coding RNA- differentiation antagonizing nonprotein coding RNA(DANCR) in various types of human cancers. In this review, we summarize and figure out recent research studies concerning the expression and biological mechanisms of lncRNA-DANCR in tumour development. The related studies were obtained through a systematic search of PubMed, Embase and Cochrane Library. Long non-coding RNAs-DANCR is a valuable cancer-related lncRNA that its dysregulated expression was found in a variety of malignancies, including hepatocellular carcinoma, breast cancer, glioma, colorectal cancer, gastric cancer, and lung cancer. The aberrant expressions of DANCR have been shown to contribute to proliferation, migration and invasion of cancer cells. Long non-coding RNAs-DANCR likely serves as a useful disease biomarker or therapeutic cancer target. Copyright © 2018 Elsevier GmbH. All rights reserved.
A Positive Regulatory Loop between a Wnt-Regulated Non-coding RNA and ASCL2 Controls Intestinal Stem Cell Fate.

PubMed

Giakountis, Antonis; Moulos, Panagiotis; Zarkou, Vasiliki; Oikonomou, Christina; Harokopos, Vaggelis; Hatzigeorgiou, Artemis G; Reczko, Martin; Hatzis, Pantelis

2016-06-21

The canonical Wnt pathway plays a central role in stem cell maintenance, differentiation, and proliferation in the intestinal epithelium. Constitutive, aberrant activity of the TCF4/β-catenin transcriptional complex is the primary transforming factor in colorectal cancer. We identify a nuclear long non-coding RNA, termed WiNTRLINC1, as a direct target of TCF4/β-catenin in colorectal cancer cells. WiNTRLINC1 positively regulates the expression of its genomic neighbor ASCL2, a transcription factor that controls intestinal stem cell fate. WiNTRLINC1 interacts with TCF4/β-catenin to mediate the juxtaposition of its promoter with the regulatory regions of ASCL2. ASCL2, in turn, regulates WiNTRLINC1 transcriptionally, closing a feedforward regulatory loop that controls stem cell-related gene expression. This regulatory circuitry is highly amplified in colorectal cancer and correlates with increased metastatic potential and decreased patient survival. Our results uncover the interplay between non-coding RNA-mediated regulation and Wnt signaling and point to the diagnostic and therapeutic potential of WiNTRLINC1. Copyright © 2016 The Author(s). Published by Elsevier Inc. All rights reserved.
Long non-coding RNA and Polycomb: an intricate partnership in cancer biology.

PubMed

Achour, Cyrinne; Aguilo, Francesca

2018-06-01

High-throughput analyses have revealed that the vast majority of the transcriptome does not code for proteins. These non-translated transcripts, when larger than 200 nucleotides, are termed long non-coding RNAs (lncRNAs), and play fundamental roles in diverse cellular processes. LncRNAs are subject to dynamic chemical modification, adding another layer of complexity to our understanding of the potential roles that lncRNAs play in health and disease. Many lncRNAs regulate transcriptional programs by influencing the epigenetic state through direct interactions with chromatin-modifying proteins. Among these proteins, Polycomb repressive complexes 1 and 2 (PRC1 and PRC2) have been shown to be recruited by lncRNAs to silence target genes. Aberrant expression, deficiency or mutation of both lncRNA and Polycomb have been associated with numerous human diseases, including cancer. In this review, we have highlighted recent findings regarding the concerted mechanism of action of Polycomb group proteins (PcG), acting together with some classically defined lncRNAs including X-inactive specific transcript ( XIST ), antisense non-coding RNA in the INK4 locus ( ANRIL ), metastasis associated lung adenocarcinoma transcript 1 ( MALAT1 ), and HOX transcript antisense RNA ( HOTAIR ).
Identification and characterization of a class of MALAT1 -like genomic loci

DOE PAGES

Zhang, Bin; Mao, Yuntao S.; Diermeier, Sarah D.; ...

2017-05-23

The MALAT1 (Metastasis-Associated Lung Adenocarcinoma Transcript 1) gene encodes a noncoding RNA that is processed into a long nuclear retained transcript ( MALAT1) and a small cytoplasmic tRNA-like transcript (mascRNA). Using an RNA sequence- and structure-based covariance model, we identified more than 130 genomic loci in vertebrate genomes containing the MALAT1 3' end triple-helix structure and its immediate downstream tRNA-like structure, including 44 in the green lizard Anolis carolinensis. Structural and computational analyses revealed a co-occurrence of components of the 3' end module. MALAT1-like genes in Anolis carolinensis are highly expressed in adult testis, thus we named them testis-abundant longmore » noncoding RNAs (tancRNAs). MALAT1-like loci also produce multiple small RNA species, including PIWI-interacting RNAs (piRNAs), from the antisense strand. The 3' ends of tancRNAs serve as potential targets for the PIWI-piRNA complex. Furthermore, we have identified an evolutionarily conserved class of long noncoding RNAs (lncRNAs) with similar structural constraints, post-transcriptional processing, and subcellular localization and a distinct function in spermatocytes.« less
Identification and characterization of a class of MALAT1 -like genomic loci

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhang, Bin; Mao, Yuntao S.; Diermeier, Sarah D.

The MALAT1 (Metastasis-Associated Lung Adenocarcinoma Transcript 1) gene encodes a noncoding RNA that is processed into a long nuclear retained transcript ( MALAT1) and a small cytoplasmic tRNA-like transcript (mascRNA). Using an RNA sequence- and structure-based covariance model, we identified more than 130 genomic loci in vertebrate genomes containing the MALAT1 3' end triple-helix structure and its immediate downstream tRNA-like structure, including 44 in the green lizard Anolis carolinensis. Structural and computational analyses revealed a co-occurrence of components of the 3' end module. MALAT1-like genes in Anolis carolinensis are highly expressed in adult testis, thus we named them testis-abundant longmore » noncoding RNAs (tancRNAs). MALAT1-like loci also produce multiple small RNA species, including PIWI-interacting RNAs (piRNAs), from the antisense strand. The 3' ends of tancRNAs serve as potential targets for the PIWI-piRNA complex. Furthermore, we have identified an evolutionarily conserved class of long noncoding RNAs (lncRNAs) with similar structural constraints, post-transcriptional processing, and subcellular localization and a distinct function in spermatocytes.« less

Non-coding RNAs in cancer brain metastasis

PubMed Central

Wu, Kerui; Sharma, Sambad; Venkat, Suresh; Liu, Keqin; Zhou, Xiaobo; Watabe, Kounosuke

2017-01-01

More than 90% of cancer death is attributed to metastatic disease, and the brain is one of the major metastatic sites of melanoma, colon, renal, lung and breast cancers. Despite the recent advancement of targeted therapy for cancer, the incidence of brain metastasis is increasing. One reason is that most therapeutic drugs can’t penetrate blood-brain-barrier and tumor cells find the brain as sanctuary site. In this review, we describe the pathophysiology of brain metastases to introduce the latest understandings of metastatic brain malignancies. This review also particularly focuses on non-coding RNAs and their roles in cancer brain metastasis. Furthermore, we discuss the roles of the extracellular vesicles as they are known to transport information between cells to initiate cancer cell-microenvironment communication. The potential clinical translation of non-coding RNAs as a tool for diagnosis and for treatment is also discussed in this review. At the end, the computational aspects of non-coding RNA detection, the sequence and structure calculation and epigenetic regulation of non-coding RNA in brain metastasis are discussed. PMID:26709907
Functional interrogation of non-coding DNA through CRISPR genome editing

PubMed Central

Canver, Matthew C.; Bauer, Daniel E.; Orkin, Stuart H.

2017-01-01

Methodologies to interrogate non-coding regions have lagged behind coding regions despite comprising the vast majority of the genome. However, the rapid evolution of clustered regularly interspaced short palindromic repeats (CRISPR)-based genome editing has provided a multitude of novel techniques for laboratory investigation including significant contributions to the toolbox for studying non-coding DNA. CRISPR-mediated loss-of-function strategies rely on direct disruption of the underlying sequence or repression of transcription without modifying the targeted DNA sequence. CRISPR-mediated gain-of-function approaches similarly benefit from methods to alter the targeted sequence through integration of customized sequence into the genome as well as methods to activate transcription. Here we review CRISPR-based loss- and gain-of-function techniques for the interrogation of non-coding DNA. PMID:28288828
Next generation sequencing analysis reveals a relationship between rDNA unit diversity and locus number in Nicotiana diploids

PubMed Central

2012-01-01

Background Tandemly arranged nuclear ribosomal DNA (rDNA), encoding 18S, 5.8S and 26S ribosomal RNA (rRNA), exhibit concerted evolution, a pattern thought to result from the homogenisation of rDNA arrays. However rDNA homogeneity at the single nucleotide polymorphism (SNP) level has not been detailed in organisms with more than a few hundred copies of the rDNA unit. Here we study rDNA complexity in species with arrays consisting of thousands of units. Methods We examined homogeneity of genic (18S) and non-coding internally transcribed spacer (ITS1) regions of rDNA using Roche 454 and/or Illumina platforms in four angiosperm species, Nicotiana sylvestris, N. tomentosiformis, N. otophora and N. kawakamii. We compared the data with Southern blot hybridisation revealing the structure of intergenic spacer (IGS) sequences and with the number and distribution of rDNA loci. Results and Conclusions In all four species the intragenomic homogeneity of the 18S gene was high; a single ribotype makes up over 90% of the genes. However greater variation was observed in the ITS1 region, particularly in species with two or more rDNA loci, where >55% of rDNA units were a single ribotype, with the second most abundant variant accounted for >18% of units. IGS heterogeneity was high in all species. The increased number of ribotypes in ITS1 compared with 18S sequences may reflect rounds of incomplete homogenisation with strong selection for functional genic regions and relaxed selection on ITS1 variants. The relationship between the number of ITS1 ribotypes and the number of rDNA loci leads us to propose that rDNA evolution and complexity is influenced by locus number and/or amplification of orphaned rDNA units at new chromosomal locations. PMID:23259460
Investigation of a Quadruplex-Forming Repeat Sequence Highly Enriched in Xanthomonas and Nostoc sp.

PubMed

Rehm, Charlotte; Wurmthaler, Lena A; Li, Yuanhao; Frickey, Tancred; Hartig, Jörg S

2015-01-01

In prokaryotes simple sequence repeats (SSRs) with unit sizes of 1-5 nucleotides (nt) are causative for phase and antigenic variation. Although an increased abundance of heptameric repeats was noticed in bacteria, reports about SSRs of 6-9 nt are rare. In particular G-rich repeat sequences with the propensity to fold into G-quadruplex (G4) structures have received little attention. In silico analysis of prokaryotic genomes show putative G4 forming sequences to be abundant. This report focuses on a surprisingly enriched G-rich repeat of the type GGGNATC in Xanthomonas and cyanobacteria such as Nostoc. We studied in detail the genomes of Xanthomonas campestris pv. campestris ATCC 33913 (Xcc), Xanthomonas axonopodis pv. citri str. 306 (Xac), and Nostoc sp. strain PCC7120 (Ana). In all three organisms repeats are spread all over the genome with an over-representation in non-coding regions. Extensive variation of the number of repetitive units was observed with repeat numbers ranging from two up to 26 units. However a clear preference for four units was detected. The strong bias for four units coincides with the requirement of four consecutive G-tracts for G4 formation. Evidence for G4 formation of the consensus repeat sequences was found in biophysical studies utilizing CD spectroscopy. The G-rich repeats are preferably located between aligned open reading frames (ORFs) and are under-represented in coding regions or between divergent ORFs. The G-rich repeats are preferentially located within a distance of 50 bp upstream of an ORF on the anti-sense strand or within 50 bp from the stop codon on the sense strand. Analysis of whole transcriptome sequence data showed that the majority of repeat sequences are transcribed. The genetic loci in the vicinity of repeat regions show increased genomic stability. In conclusion, we introduce and characterize a special class of highly abundant and wide-spread quadruplex-forming repeat sequences in bacteria.
Investigation of a Quadruplex-Forming Repeat Sequence Highly Enriched in Xanthomonas and Nostoc sp.

PubMed Central

Rehm, Charlotte; Wurmthaler, Lena A.; Li, Yuanhao; Frickey, Tancred; Hartig, Jörg S.

2015-01-01

In prokaryotes simple sequence repeats (SSRs) with unit sizes of 1–5 nucleotides (nt) are causative for phase and antigenic variation. Although an increased abundance of heptameric repeats was noticed in bacteria, reports about SSRs of 6–9 nt are rare. In particular G-rich repeat sequences with the propensity to fold into G-quadruplex (G4) structures have received little attention. In silico analysis of prokaryotic genomes show putative G4 forming sequences to be abundant. This report focuses on a surprisingly enriched G-rich repeat of the type GGGNATC in Xanthomonas and cyanobacteria such as Nostoc. We studied in detail the genomes of Xanthomonas campestris pv. campestris ATCC 33913 (Xcc), Xanthomonas axonopodis pv. citri str. 306 (Xac), and Nostoc sp. strain PCC7120 (Ana). In all three organisms repeats are spread all over the genome with an over-representation in non-coding regions. Extensive variation of the number of repetitive units was observed with repeat numbers ranging from two up to 26 units. However a clear preference for four units was detected. The strong bias for four units coincides with the requirement of four consecutive G-tracts for G4 formation. Evidence for G4 formation of the consensus repeat sequences was found in biophysical studies utilizing CD spectroscopy. The G-rich repeats are preferably located between aligned open reading frames (ORFs) and are under-represented in coding regions or between divergent ORFs. The G-rich repeats are preferentially located within a distance of 50 bp upstream of an ORF on the anti-sense strand or within 50 bp from the stop codon on the sense strand. Analysis of whole transcriptome sequence data showed that the majority of repeat sequences are transcribed. The genetic loci in the vicinity of repeat regions show increased genomic stability. In conclusion, we introduce and characterize a special class of highly abundant and wide-spread quadruplex-forming repeat sequences in bacteria. PMID:26695179
The complete mitochondrial genomes for three Toxocara species of human and animal health significance.

PubMed

Li, Ming-Wei; Lin, Rui-Qing; Song, Hui-Qun; Wu, Xiang-Yun; Zhu, Xing-Quan

2008-05-16

Studying mitochondrial (mt) genomics has important implications for various fundamental areas, including mt biochemistry, physiology and molecular biology. In addition, mt genome sequences have provided useful markers for investigating population genetic structures, systematics and phylogenetics of organisms. Toxocara canis, Toxocara cati and Toxocara malaysiensis cause significant health problems in animals and humans. Although they are of importance in human and animal health, no information on the mt genomes for any of Toxocara species is available. The sizes of the entire mt genome are 14,322 bp for T. canis, 14029 bp for T. cati and 14266 bp for T. malaysiensis, respectively. These circular genomes are amongst the largest reported to date for all secernentean nematodes. Their relatively large sizes relate mainly to an increased length in the AT-rich region. The mt genomes of the three Toxocara species all encode 12 proteins, two ribosomal RNAs and 22 transfer RNA genes, but lack the ATP synthetase subunit 8 gene, which is consistent with all other species of Nematode studied to date, with the exception of Trichinella spiralis. All genes are transcribed in the same direction and have a nucleotide composition high in A and T, but low in G and C. The contents of A+T of the complete genomes are 68.57% for T. canis, 69.95% for T. cati and 68.86% for T. malaysiensis, among which the A+T for T. canis is the lowest among all nematodes studied to date. The AT bias had a significant effect on both the codon usage pattern and amino acid composition of proteins. The mt genome structures for three Toxocara species, including genes and non-coding regions, are in the same order as for Ascaris suum and Anisakis simplex, but differ from Ancylostoma duodenale, Necator americanus and Caenorhabditis elegans only in the location of the AT-rich region, whereas there are substantial differences when compared with Onchocerca volvulus,Dirofiliria immitis and Strongyloides stercoralis. Phylogenetic analyses based on concatenated amino acid sequences of 12 protein-coding genes revealed that the newly described species T. malaysiensis was more closely related to T. cati than to T. canis, consistent with results of a previous study using sequences of nuclear internal transcribed spacers as genetic markers. The present study determined the complete mt genome sequences for three roundworms of human and animal health significance, which provides mtDNA evidence for the validity of T. malaysiensis and also provides a foundation for studying the systematics, population genetics and ecology of these and other nematodes of socio-economic importance.
Whole transcriptome analysis reveals dysregulated oncogenic lncRNAs in natural killer/T-cell lymphoma and establishes MIR155HG as a target of PRDM1.

PubMed

Baytak, Esra; Gong, Qiang; Akman, Burcu; Yuan, Hongling; Chan, Wing C; Küçük, Can

2017-05-01

Natural killer/T-cell lymphoma is a rare but aggressive neoplasm with poor prognosis. Despite previous reports that showed potential tumor suppressors, such as PRDM1 or oncogenes associated with the etiology of this malignancy, the role of long non-coding RNAs in natural killer/T-cell lymphoma pathobiology has not been addressed to date. Here, we aim to identify cancer-associated dysregulated long non-coding RNAs and signaling pathways or biological processes associated with these long non-coding RNAs in natural killer/T-cell lymphoma cases and to identify the long non-coding RNAs transcriptionally regulated by PRDM1. RNA-Seq analysis revealed 166 and 66 long non-coding RNAs to be significantly overexpressed or underexpressed, respectively, in natural killer/T-cell lymphoma cases compared with resting or activated normal natural killer cells. Novel long non-coding RNAs as well as the cancer-associated ones such as SNHG5, ZFAS1, or MIR155HG were dysregulated. Interestingly, antisense transcripts of many growth-regulating genes appeared to be transcriptionally deregulated. Expression of ZFAS1, which is upregulated in natural killer/T-cell lymphoma cases, showed association with growth-regulating pathways such as stabilization of P53, regulation of apoptosis, cell cycle, or nuclear factor-kappa B signaling in normal and neoplastic natural killer cell samples. Consistent with the tumor suppressive role of PRDM1, we identified MIR155HG and TERC to be transcriptionally downregulated by PRDM1 in two PRDM1-null NK-cell lines when it is ectopically expressed. In conclusion, this is the first study that identified long non-coding RNAs whose expression is dysregulated in natural killer/T-cell lymphoma cases. These findings suggest that ZFAS1 and other dysregulated long non-coding RNAs may be involved in natural killer/T-cell lymphoma pathobiology through regulation of cancer-related genes, and loss-of-PRDM1 expression in natural killer/T-cell lymphomas may contribute to overexpression of MIR155HG; thereby promoting tumorigenesis.
Non-coding RNAs and plant male sterility: current knowledge and future prospects.

PubMed

Mishra, Ankita; Bohra, Abhishek

2018-02-01

Latest outcomes assign functional role to non-coding (nc) RNA molecules in regulatory networks that confer male sterility to plants. Male sterility in plants offers great opportunity for improving crop performance through application of hybrid technology. In this respect, cytoplasmic male sterility (CMS) and sterility induced by photoperiod (PGMS)/temperature (TGMS) have greatly facilitated development of high-yielding hybrids in crops. Participation of non-coding (nc) RNA molecules in plant reproductive development is increasingly becoming evident. Recent breakthroughs in rice definitively associate ncRNAs with PGMS and TGMS. In case of CMS, the exact mechanism through which the mitochondrial ORFs exert influence on the development of male gametophyte remains obscure in several crops. High-throughput sequencing has enabled genome-wide discovery and validation of these regulatory molecules and their target genes, describing their potential roles performed in relation to CMS. Discovery of ncRNA localized in plant mtDNA with its possible implication in CMS induction is intriguing in this respect. Still, conclusive evidences linking ncRNA with CMS phenotypes are currently unavailable, demanding complementing genetic approaches like transgenics to substantiate the preliminary findings. Here, we review the recent literature on the contribution of ncRNAs in conferring male sterility to plants, with an emphasis on microRNAs. Also, we present a perspective on improved understanding about ncRNA-mediated regulatory pathways that control male sterility in plants. A refined understanding of plant male sterility would strengthen crop hybrid industry to deliver hybrids with improved performance.
Systematic analysis of transcribed loci in ENCODE regions using RACE sequencing reveals extensive transcription in the human genome.

PubMed

Wu, Jia Qian; Du, Jiang; Rozowsky, Joel; Zhang, Zhengdong; Urban, Alexander E; Euskirchen, Ghia; Weissman, Sherman; Gerstein, Mark; Snyder, Michael

2008-01-03

Recent studies of the mammalian transcriptome have revealed a large number of additional transcribed regions and extraordinary complexity in transcript diversity. However, there is still much uncertainty regarding precisely what portion of the genome is transcribed, the exact structures of these novel transcripts, and the levels of the transcripts produced. We have interrogated the transcribed loci in 420 selected ENCyclopedia Of DNA Elements (ENCODE) regions using rapid amplification of cDNA ends (RACE) sequencing. We analyzed annotated known gene regions, but primarily we focused on novel transcriptionally active regions (TARs), which were previously identified by high-density oligonucleotide tiling arrays and on random regions that were not believed to be transcribed. We found RACE sequencing to be very sensitive and were able to detect low levels of transcripts in specific cell types that were not detectable by microarrays. We also observed many instances of sense-antisense transcripts; further analysis suggests that many of the antisense transcripts (but not all) may be artifacts generated from the reverse transcription reaction. Our results show that the majority of the novel TARs analyzed (60%) are connected to other novel TARs or known exons. Of previously unannotated random regions, 17% were shown to produce overlapping transcripts. Furthermore, it is estimated that 9% of the novel transcripts encode proteins. We conclude that RACE sequencing is an efficient, sensitive, and highly accurate method for characterization of the transcriptome of specific cell/tissue types. Using this method, it appears that much of the genome is represented in polyA+ RNA. Moreover, a fraction of the novel RNAs can encode protein and are likely to be functional.
Functional interrogation of non-coding DNA through CRISPR genome editing.

PubMed

Canver, Matthew C; Bauer, Daniel E; Orkin, Stuart H

2017-05-15

Methodologies to interrogate non-coding regions have lagged behind coding regions despite comprising the vast majority of the genome. However, the rapid evolution of clustered regularly interspaced short palindromic repeats (CRISPR)-based genome editing has provided a multitude of novel techniques for laboratory investigation including significant contributions to the toolbox for studying non-coding DNA. CRISPR-mediated loss-of-function strategies rely on direct disruption of the underlying sequence or repression of transcription without modifying the targeted DNA sequence. CRISPR-mediated gain-of-function approaches similarly benefit from methods to alter the targeted sequence through integration of customized sequence into the genome as well as methods to activate transcription. Here we review CRISPR-based loss- and gain-of-function techniques for the interrogation of non-coding DNA. Copyright © 2017 Elsevier Inc. All rights reserved.
The role of epigenetics and long noncoding RNA MIAT in neuroendocrine prostate cancer.

PubMed

Crea, Francesco; Venalainen, Erik; Ci, Xinpei; Cheng, Hongwei; Pikor, Larissa; Parolia, Abhijit; Xue, Hui; Nur Saidy, Nur Ridzwan; Lin, Dong; Lam, Wan; Collins, Colin; Wang, Yuzhuo

2016-05-01

Neuroendocrine prostate cancer (NEPC) is the most lethal prostatic neoplasm. NEPC is thought to originate from the transdifferentiation of AR-positive adenocarcinoma cells. We have previously shown that an epigenetic/noncoding interactome (ENI) orchestrates cancer cells' plasticity, thereby allowing the emergence of metastatic, drug-resistant neoplasms. The primary objective of this manuscript is to discuss evidence indicating that some components of the ENI (Polycomb genes, miRNAs) play a key role in NEPC initiation and progression. Long noncoding RNAs represent vast and largely unexplored component of the ENI. Their role in NEPC has not been investigated. We show preliminary evidence indicating that a lncRNA (MIAT) is selectively upregulated in NEPCs and might interact with Polycomb genes. Our results indicate that long noncoding RNAs can be exploited as new biomarkers and therapeutic targets for NEPC.
Transcriptomes of six mutants in the Sen1 pathway reveal combinatorial control of transcription termination across the Saccharomyces cerevisiae genome

PubMed Central

Carver, Melissa N.; Müller, Ulrika; Bekiranov, Stefan; Auble, David T.

2017-01-01

Transcriptome studies on eukaryotic cells have revealed an unexpected abundance and diversity of noncoding RNAs synthesized by RNA polymerase II (Pol II), some of which influence the expression of protein-coding genes. Yet, much less is known about biogenesis of Pol II non-coding RNA than mRNAs. In the budding yeast Saccharomyces cerevisiae, initiation of non-coding transcripts by Pol II appears to be similar to that of mRNAs, but a distinct pathway is utilized for termination of most non-coding RNAs: the Sen1-dependent or “NNS” pathway. Here, we examine the effect on the S. cerevisiae transcriptome of conditional mutations in the genes encoding six different essential proteins that influence Sen1-dependent termination: Sen1, Nrd1, Nab3, Ssu72, Rpb11, and Hrp1. We observe surprisingly diverse effects on transcript abundance for the different proteins that cannot be explained simply by differing severity of the mutations. Rather, we infer from our results that termination of Pol II transcription of non-coding RNA genes is subject to complex combinatorial control that likely involves proteins beyond those studied here. Furthermore, we identify new targets and functions of Sen1-dependent termination, including a role in repression of meiotic genes in vegetative cells. In combination with other recent whole-genome studies on termination of non-coding RNAs, our results provide promising directions for further investigation. PMID:28665995
Complete nucleotide sequences of the coat protein messenger RNAs of brome mosaic virus and cowpea chlorotic mottle virus.

PubMed Central

Dasgupta, R; Kaesberg, P

1982-01-01

The nucleotide sequences of the subgenomic coat protein messengers (RNA4's) of two related bromoviruses, brome mosaic virus (BMV) and cowpea chlorotic mottle virus (CCMV), have been determined by direct RNA and CDNA sequencing without cloning. BMV RNA4 is 876 b long including a 5' noncoding region of nine nucleotides and a 3' noncoding region of 300 nucleotides. CCMV RNA 4 is 824 b long, including a 5' noncoding region of 10 nucleotides and a 3' noncoding region of 244 nucleotides. The encoded coat proteins are similar in length (188 amino acids for BMV and 189 amino acids for CCMV) and display about 70% homology in their amino acid sequences. Length difference between the two RNAs is due mostly to a single deletion, in CCMV with respect to BMV, of about 57 b immediately following the coding region. Allowing for this deletion the RNAs are indicate that mutations leading to divergence were constrained in the coding region primarily by the requirement of maintaining a favorable coat protein structure and in the 3' noncoding region primarily by the requirement of maintaining a favorable RNA spatial configuration. PMID:6895941
Non-coding variants contribute to the clinical heterogeneity of TTR amyloidosis.

PubMed

Iorio, Andrea; De Lillo, Antonella; De Angelis, Flavio; Di Girolamo, Marco; Luigetti, Marco; Sabatelli, Mario; Pradotto, Luca; Mauro, Alessandro; Mazzeo, Anna; Stancanelli, Claudia; Perfetto, Federico; Frusconi, Sabrina; My, Filomena; Manfellotto, Dario; Fuciarelli, Maria; Polimanti, Renato

2017-09-01

Coding mutations in TTR gene cause a rare hereditary form of systemic amyloidosis, which has a complex genotype-phenotype correlation. We investigated the role of non-coding variants in regulating TTR gene expression and consequently amyloidosis symptoms. We evaluated the genotype-phenotype correlation considering the clinical information of 129 Italian patients with TTR amyloidosis. Then, we conducted a re-sequencing of TTR gene to investigate how non-coding variants affect TTR expression and, consequently, phenotypic presentation in carriers of amyloidogenic mutations. Polygenic scores for genetically determined TTR expression were constructed using data from our re-sequencing analysis and the GTEx (Genotype-Tissue Expression) project. We confirmed a strong phenotypic heterogeneity across coding mutations causing TTR amyloidosis. Considering the effects of non-coding variants on TTR expression, we identified three patient clusters with specific expression patterns associated with certain phenotypic presentations, including late onset, autonomic neurological involvement, and gastrointestinal symptoms. This study provides novel data regarding the role of non-coding variation and the gene expression profiles in patients affected by TTR amyloidosis, also putting forth an approach that could be used to investigate the mechanisms at the basis of the genotype-phenotype correlation of the disease.
Progressive changes in non-coding RNA profile in leucocytes with age

PubMed Central

Muñoz-Culla, Maider; Irizar, Haritz; Gorostidi, Ana; Alberro, Ainhoa; Osorio-Querejeta, Iñaki; Ruiz-Martínez, Javier; Olascoaga, Javier; de Munain, Adolfo López; Otaegui, David

2017-01-01

It has been observed that immune cell deterioration occurs in the elderly, as well as a chronic low-grade inflammation called inflammaging. These cellular changes must be driven by numerous changes in gene expression and in fact, both protein-coding and non-coding RNA expression alterations have been observed in peripheral blood mononuclear cells from elder people. In the present work we have studied the expression of small non-coding RNA (microRNA and small nucleolar RNA -snoRNA-) from healthy individuals from 24 to 79 years old. We have observed that the expression of 69 non-coding RNAs (56 microRNAs and 13 snoRNAs) changes progressively with chronological age. According to our results, the age range from 47 to 54 is critical given that it is the period when the expression trend (increasing or decreasing) of age-related small non-coding RNAs is more pronounced. Furthermore, age-related miRNAs regulate genes that are involved in immune, cell cycle and cancer-related processes, which had already been associated to human aging. Therefore, human aging could be studied as a result of progressive molecular changes, and different age ranges should be analysed to cover the whole aging process. PMID:28448962
Non-Coding RNAs in Castration-Resistant Prostate Cancer: Regulation of Androgen Receptor Signaling and Cancer Metabolism.

PubMed

Shih, Jing-Wen; Wang, Ling-Yu; Hung, Chiu-Lien; Kung, Hsing-Jien; Hsieh, Chia-Ling

2015-12-04

Hormone-refractory prostate cancer frequently relapses from therapy and inevitably progresses to a bone-metastatic status with no cure. Understanding of the molecular mechanisms conferring resistance to androgen deprivation therapy has the potential to lead to the discovery of novel therapeutic targets for type of prostate cancer with poor prognosis. Progression to castration-resistant prostate cancer (CRPC) is characterized by aberrant androgen receptor (AR) expression and persistent AR signaling activity. Alterations in metabolic activity regulated by oncogenic pathways, such as c-Myc, were found to promote prostate cancer growth during the development of CRPC. Non-coding RNAs represent a diverse family of regulatory transcripts that drive tumorigenesis of prostate cancer and various other cancers by their hyperactivity or diminished function. A number of studies have examined differentially expressed non-coding RNAs in each stage of prostate cancer. Herein, we highlight the emerging impacts of microRNAs and long non-coding RNAs linked to reactivation of the AR signaling axis and reprogramming of the cellular metabolism in prostate cancer. The translational implications of non-coding RNA research for developing new biomarkers and therapeutic strategies for CRPC are also discussed.
Non-Coding RNAs in Castration-Resistant Prostate Cancer: Regulation of Androgen Receptor Signaling and Cancer Metabolism

PubMed Central

Shih, Jing-Wen; Wang, Ling-Yu; Hung, Chiu-Lien; Kung, Hsing-Jien; Hsieh, Chia-Ling

2015-01-01

Hormone-refractory prostate cancer frequently relapses from therapy and inevitably progresses to a bone-metastatic status with no cure. Understanding of the molecular mechanisms conferring resistance to androgen deprivation therapy has the potential to lead to the discovery of novel therapeutic targets for type of prostate cancer with poor prognosis. Progression to castration-resistant prostate cancer (CRPC) is characterized by aberrant androgen receptor (AR) expression and persistent AR signaling activity. Alterations in metabolic activity regulated by oncogenic pathways, such as c-Myc, were found to promote prostate cancer growth during the development of CRPC. Non-coding RNAs represent a diverse family of regulatory transcripts that drive tumorigenesis of prostate cancer and various other cancers by their hyperactivity or diminished function. A number of studies have examined differentially expressed non-coding RNAs in each stage of prostate cancer. Herein, we highlight the emerging impacts of microRNAs and long non-coding RNAs linked to reactivation of the AR signaling axis and reprogramming of the cellular metabolism in prostate cancer. The translational implications of non-coding RNA research for developing new biomarkers and therapeutic strategies for CRPC are also discussed. PMID:26690121
A Two-Locus Global DNA Barcode for Land Plants: The Coding rbcL Gene Complements the Non-Coding trnH-psbA Spacer Region

PubMed Central

Kress, W. John; Erickson, David L.

2007-01-01

Background A useful DNA barcode requires sufficient sequence variation to distinguish between species and ease of application across a broad range of taxa. Discovery of a DNA barcode for land plants has been limited by intrinsically lower rates of sequence evolution in plant genomes than that observed in animals. This low rate has complicated the trade-off in finding a locus that is universal and readily sequenced and has sufficiently high sequence divergence at the species-level. Methodology/Principal Findings Here, a global plant DNA barcode system is evaluated by comparing universal application and degree of sequence divergence for nine putative barcode loci, including coding and non-coding regions, singly and in pairs across a phylogenetically diverse set of 48 genera (two species per genus). No single locus could discriminate among species in a pair in more than 79% of genera, whereas discrimination increased to nearly 88% when the non-coding trnH-psbA spacer was paired with one of three coding loci, including rbcL. In silico trials were conducted in which DNA sequences from GenBank were used to further evaluate the discriminatory power of a subset of these loci. These trials supported the earlier observation that trnH-psbA coupled with rbcL can correctly identify and discriminate among related species. Conclusions/Significance A combination of the non-coding trnH-psbA spacer region and a portion of the coding rbcL gene is recommended as a two-locus global land plant barcode that provides the necessary universality and species discrimination. PMID:17551588
Transposable elements (TEs) contribute to stress-related long intergenic noncoding RNAs in plants.

PubMed

Wang, Dong; Qu, Zhipeng; Yang, Lan; Zhang, Qingzhu; Liu, Zhi-Hong; Do, Trung; Adelson, David L; Wang, Zhen-Yu; Searle, Iain; Zhu, Jian-Kang

2017-04-01

Noncoding RNAs have been extensively described in plant and animal transcriptomes by using high-throughput sequencing technology. Of these noncoding RNAs, a growing number of long intergenic noncoding RNAs (lincRNAs) have been described in multicellular organisms, however the origins and functions of many lincRNAs remain to be explored. In many eukaryotic genomes, transposable elements (TEs) are widely distributed and often account for large fractions of plant and animal genomes yet the contribution of TEs to lincRNAs is largely unknown. By using strand-specific RNA-sequencing, we profiled the expression patterns of lincRNAs in Arabidopsis, rice and maize, and identified 47 611 and 398 TE-associated lincRNAs (TE-lincRNAs), respectively. TE-lincRNAs were more often derived from retrotransposons than DNA transposons and as retrotransposon copy number in both rice and maize genomes so did TE-lincRNAs. We validated the expression of these TE-lincRNAs by strand-specific RT-PCR and also demonstrated tissue-specific transcription and stress-induced TE-lincRNAs either after salt, abscisic acid (ABA) or cold treatments. For Arabidopsis TE-lincRNA11195, mutants had reduced sensitivity to ABA as demonstrated by longer roots and higher shoot biomass when compared to wild-type. Finally, by altering the chromatin state in the Arabidopsis chromatin remodelling mutant ddm1, unique lincRNAs including TE-lincRNAs were generated from the preceding untranscribed regions and interestingly inherited in a wild-type background in subsequent generations. Our findings not only demonstrate that TE-associated lincRNAs play important roles in plant abiotic stress responses but lincRNAs and TE-lincRNAs might act as an adaptive reservoir in eukaryotes. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.
Conserved Nonexonic Elements: A Novel Class of Marker for Phylogenomics.

PubMed

Edwards, Scott V; Cloutier, Alison; Baker, Allan J

2017-11-01

Noncoding markers have a particular appeal as tools for phylogenomic analysis because, at least in vertebrates, they appear less subject to strong variation in GC content among lineages. Thus far, ultraconserved elements (UCEs) and introns have been the most widely used noncoding markers. Here we analyze and study the evolutionary properties of a new type of noncoding marker, conserved nonexonic elements (CNEEs), which consists of noncoding elements that are estimated to evolve slower than the neutral rate across a set of species. Although they often include UCEs, CNEEs are distinct from UCEs because they are not ultraconserved, and, most importantly, the core region alone is analyzed, rather than both the core and its flanking regions. Using a data set of 16 birds plus an alligator outgroup, and ∼3600-∼3800 loci per marker type, we found that although CNEEs were less variable than bioinformatically derived UCEs or introns and in some cases exhibited a slower approach to branch resolution as determined by phylogenomic subsampling, the quality of CNEE alignments was superior to those of the other markers, with fewer gaps and missing species. Phylogenetic resolution using coalescent approaches was comparable among the three marker types, with most nodes being fully and congruently resolved. Comparison of phylogenetic results across the three marker types indicated that one branch, the sister group to the passerine + falcon clade, was resolved differently and with moderate (>70%) bootstrap support between CNEEs and UCEs or introns. Overall, CNEEs appear to be promising as phylogenomic markers, yielding phylogenetic resolution as high as for UCEs and introns but with fewer gaps, less ambiguity in alignments and with patterns of nucleotide substitution more consistent with the assumptions of commonly used methods of phylogenetic analysis. © The Author(s) 2017. Published by Oxford University Press on behalf of the Systematic Biologists.

Conserved Nonexonic Elements: A Novel Class of Marker for Phylogenomics

PubMed Central

Cloutier, Alison; Baker, Allan J.

2017-01-01

Abstract Noncoding markers have a particular appeal as tools for phylogenomic analysis because, at least in vertebrates, they appear less subject to strong variation in GC content among lineages. Thus far, ultraconserved elements (UCEs) and introns have been the most widely used noncoding markers. Here we analyze and study the evolutionary properties of a new type of noncoding marker, conserved nonexonic elements (CNEEs), which consists of noncoding elements that are estimated to evolve slower than the neutral rate across a set of species. Although they often include UCEs, CNEEs are distinct from UCEs because they are not ultraconserved, and, most importantly, the core region alone is analyzed, rather than both the core and its flanking regions. Using a data set of 16 birds plus an alligator outgroup, and ∼3600–∼3800 loci per marker type, we found that although CNEEs were less variable than bioinformatically derived UCEs or introns and in some cases exhibited a slower approach to branch resolution as determined by phylogenomic subsampling, the quality of CNEE alignments was superior to those of the other markers, with fewer gaps and missing species. Phylogenetic resolution using coalescent approaches was comparable among the three marker types, with most nodes being fully and congruently resolved. Comparison of phylogenetic results across the three marker types indicated that one branch, the sister group to the passerine + falcon clade, was resolved differently and with moderate (>70%) bootstrap support between CNEEs and UCEs or introns. Overall, CNEEs appear to be promising as phylogenomic markers, yielding phylogenetic resolution as high as for UCEs and introns but with fewer gaps, less ambiguity in alignments and with patterns of nucleotide substitution more consistent with the assumptions of commonly used methods of phylogenetic analysis. PMID:28637293
Understanding Neurodevelopmental Disorders: The Promise of Regulatory Variation in the 3'UTRome.

PubMed

Wanke, Kai A; Devanna, Paolo; Vernes, Sonja C

2018-04-01

Neurodevelopmental disorders have a strong genetic component, but despite widespread efforts, the specific genetic factors underlying these disorders remain undefined for a large proportion of affected individuals. Given the accessibility of exome sequencing, this problem has thus far been addressed from a protein-centric standpoint; however, protein-coding regions only make up ∼1% to 2% of the human genome. With the advent of whole genome sequencing we are in the midst of a paradigm shift as it is now possible to interrogate the entire sequence of the human genome (coding and noncoding) to fill in the missing heritability of complex disorders. These new technologies bring new challenges, as the number of noncoding variants identified per individual can be overwhelming, making it prudent to focus on noncoding regions of known function, for which the effects of variation can be predicted and directly tested to assess pathogenicity. The 3'UTRome is a region of the noncoding genome that perfectly fulfills these criteria and is of high interest when searching for pathogenic variation related to complex neurodevelopmental disorders. Herein, we review the regulatory roles of the 3'UTRome as binding sites for microRNAs or RNA binding proteins, or during alternative polyadenylation. We detail existing evidence that these regions contribute to neurodevelopmental disorders and outline strategies for identification and validation of novel putatively pathogenic variation in these regions. This evidence suggests that studying the 3'UTRome will lead to the identification of new risk factors, new candidate disease genes, and a better understanding of the molecular mechanisms contributing to neurodevelopmental disorders. Copyright © 2017 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.
A long noncoding RNA, lincRNA-Tnfaip3, acts as a coregulator of NF-κB to modulate inflammatory gene transcription in mouse macrophages.

PubMed

Ma, Shibin; Ming, Zhenping; Gong, Ai-Yu; Wang, Yang; Chen, Xiqiang; Hu, Guoku; Zhou, Rui; Shibata, Annemarie; Swanson, Patrick C; Chen, Xian-Ming

2017-03-01

Long intergenic noncoding RNAs (lincRNAs) are long noncoding transcripts (>200 nt) from the intergenic regions of annotated protein-coding genes. We report here that the lincRNA gene lincRNA-Tnfaip3 , located at mouse chromosome 10 proximal to the tumor necrosis factor α-induced protein 3 ( Tnfaip3 ) gene, is an early-primary response gene controlled by nuclear factor-κB (NF-κB) signaling in murine macrophages. Functionally, lincRNA- Tnfaip3 appears to mediate both the activation and repression of distinct classes of inflammatory genes in macrophages. Specifically, induction of lincRNA-Tnfaip3 is required for the transactivation of NF-κB-regulated inflammatory genes in response to bacterial LPSs stimulation. LincRNA-Tnfaip3 physically interacts with the high-mobility group box 1 (Hmgb1), assembling a NF-κB/Hmgb1/lincRNA-Tnfaip3 complex in macrophages after LPS stimulation. This resultant NF-κB/Hmgb1/lincRNA-Tnfaip3 complex can modulate Hmgb1-associated histone modifications and, ultimately, transactivation of inflammatory genes in mouse macrophages in response to microbial challenge. Therefore, our data indicate a new regulatory role of NF-κB-induced lincRNA-Tnfaip3 to act as a coactivator of NF-κB for the transcription of inflammatory genes in innate immune cells through modulation of epigenetic chromatin remodeling.-Ma, S., Ming, Z., Gong, A.-Y., Wang, Y., Chen, X., Hu, G., Zhou, R., Shibata, A., Swanson, P. C., Chen, X.-M. A long noncoding RNA, LincRNA-Tnfaip3, acts as a coregulator of NF-κB to modulate inflammatory gene transcription in mouse macrophages. © FASEB.
CSTminer: a web tool for the identification of coding and noncoding conserved sequence tags through cross-species genome comparison

PubMed Central

Castrignanò, Tiziana; Canali, Alessandro; Grillo, Giorgio; Liuni, Sabino; Mignone, Flavio; Pesole, Graziano

2004-01-01

The identification and characterization of genome tracts that are highly conserved across species during evolution may contribute significantly to the functional annotation of whole-genome sequences. Indeed, such sequences are likely to correspond to known or unknown coding exons or regulatory motifs. Here, we present a web server implementing a previously developed algorithm that, by comparing user-submitted genome sequences, is able to identify statistically significant conserved blocks and assess their coding or noncoding nature through the measure of a coding potential score. The web tool, available at http://www.caspur.it/CSTminer/, is dynamically interconnected with the Ensembl genome resources and produces a graphical output showing a map of detected conserved sequences and annotated gene features. PMID:15215464
Long non-coding RNA-CTD-2108O9.1 represses breast cancer metastasis by influencing leukemia inhibitory factor receptor.

PubMed

Wang, Mozhi; Wang, Mengshen; Wang, Zhenning; Yu, Xueting; Song, Yongxi; Wang, Chong; Xu, Yujie; Wei, Fengheng; Zhao, Yi; Xu, Yingying

2018-06-01

Breast cancer (BC) is an aggressive malignant disease in women worldwide with a high tendency to metastasize. However, important biomarkers for BC metastasis remain largely undefined. In the present study, we identified that long non-coding RNA-CTD-2108O9.1 is downregulated in BC tissues and cells and acts as a metastatic inhibitor of BC. Mechanistic investigation determined that lncRNA-CTD-2108O9.1 represses metastasis by targeting leukemia inhibitory factor receptor (LIFR), which is designated as a metastasis suppressor in BC. Our study characterizes a significant tumor suppressor active in BC metastasis repression through the known metastasis inhibitor LIFR. © 2018 The Authors. Cancer Science published by John Wiley & Sons Australia, Ltd on behalf of Japanese Cancer Association.
Long Noncoding RNA H19 in Digestive System Cancers: A Meta-Analysis of Its Association with Pathological Features.

PubMed

Lin, Yang; Xu, Lijian; Wei, Wei; Zhang, Xiaohui; Ying, Rongchao

2016-01-01

Long noncoding RNA (lncRNA) H19 has been reported to be upregulated in malignant digestive tumors, but its clinical relevance is not yet established. The meta-analysis was to investigate the association between H19 expression and pathological features of digestive system cancers. The databases of PubMed, EMBase, Web of Science, CNKI, and WanFang were searched for the related studies. A total of 478 patients from 6 studies were finally included. The meta-analysis showed that the patient group of high H19 expression had a higher risk of poorly differentiated grade, deep tumor invasion (T2 stage or more), lymph node metastasis, and advanced TNM stage than the group of low H19 expression, although there was no difference between them in terms of distant metastasis. Therefore, the high expression of lncRNA H19 might predict poor oncological outcomes of patients with digestive system cancers.
High-throughput annotation of full-length long noncoding RNAs with capture long-read sequencing.

PubMed

Lagarde, Julien; Uszczynska-Ratajczak, Barbara; Carbonell, Silvia; Pérez-Lluch, Sílvia; Abad, Amaya; Davis, Carrie; Gingeras, Thomas R; Frankish, Adam; Harrow, Jennifer; Guigo, Roderic; Johnson, Rory

2017-12-01

Accurate annotation of genes and their transcripts is a foundation of genomics, but currently no annotation technique combines throughput and accuracy. As a result, reference gene collections remain incomplete-many gene models are fragmentary, and thousands more remain uncataloged, particularly for long noncoding RNAs (lncRNAs). To accelerate lncRNA annotation, the GENCODE consortium has developed RNA Capture Long Seq (CLS), which combines targeted RNA capture with third-generation long-read sequencing. Here we present an experimental reannotation of the GENCODE intergenic lncRNA populations in matched human and mouse tissues that resulted in novel transcript models for 3,574 and 561 gene loci, respectively. CLS approximately doubled the annotated complexity of targeted loci, outperforming existing short-read techniques. Full-length transcript models produced by CLS enabled us to definitively characterize the genomic features of lncRNAs, including promoter and gene structure, and protein-coding potential. Thus, CLS removes a long-standing bottleneck in transcriptome annotation and generates manual-quality full-length transcript models at high-throughput scales.
Probing RNA Native Conformational Ensembles with Structural Constraints.

PubMed

Fonseca, Rasmus; van den Bedem, Henry; Bernauer, Julie

2016-05-01

Noncoding ribonucleic acids (RNA) play a critical role in a wide variety of cellular processes, ranging from regulating gene expression to post-translational modification and protein synthesis. Their activity is modulated by highly dynamic exchanges between three-dimensional conformational substates, which are difficult to characterize experimentally and computationally. Here, we present an innovative, entirely kinematic computational procedure to efficiently explore the native ensemble of RNA molecules. Our procedure projects degrees of freedom onto a subspace of conformation space defined by distance constraints in the tertiary structure. The dimensionality reduction enables efficient exploration of conformational space. We show that the conformational distributions obtained with our method broadly sample the conformational landscape observed in NMR experiments. Compared to normal mode analysis-based exploration, our procedure diffuses faster through the experimental ensemble while also accessing conformational substates to greater precision. Our results suggest that conformational sampling with a highly reduced but fully atomistic representation of noncoding RNA expresses key features of their dynamic nature.
Dissecting non-coding RNA mechanisms in cellulo by single-molecule high-resolution localization and counting

PubMed Central

Pitchiaya, Sethuramasundaram; Krishnan, Vishalakshi; Custer, Thomas C.; Walter, Nils G.

2013-01-01

Non-coding RNAs (ncRNAs) recently were discovered to outnumber their protein-coding counterparts, yet their diverse functions are still poorly understood. Here we report on a method for the intracellular Single-molecule High Resolution Localization and Counting (iSHiRLoC) of microRNAs (miRNAs), a conserved, ubiquitous class of regulatory ncRNAs that controls the expression of over 60% of all mammalian protein coding genes post-transcriptionally, by a mechanism shrouded by seemingly contradictory observations. We present protocols to execute single particle tracking (SPT) and single-molecule counting of functional microinjected, fluorophore-labeled miRNAs and thereby extract diffusion coefficients and molecular stoichiometries of micro-ribonucleoprotein (miRNP) complexes from living and fixed cells, respectively. This probing of miRNAs at the single molecule level sheds new light on the intracellular assembly/disassembly of miRNPs, thus beginning to unravel the dynamic nature of this important gene regulatory pathway and facilitating the development of a parsimonious model for their obscured mechanism of action. PMID:23820309
The Ftx Noncoding Locus Controls X Chromosome Inactivation Independently of Its RNA Products.

PubMed

Furlan, Giulia; Gutierrez Hernandez, Nancy; Huret, Christophe; Galupa, Rafael; van Bemmel, Joke Gerarda; Romito, Antonio; Heard, Edith; Morey, Céline; Rougeulle, Claire

2018-05-03

Accumulation of the Xist long noncoding RNA (lncRNA) on one X chromosome is the trigger for X chromosome inactivation (XCI) in female mammals. Xist expression, which needs to be tightly controlled, involves a cis-acting region, the X-inactivation center (Xic), containing many lncRNA genes that evolved concomitantly to Xist from protein-coding ancestors through pseudogeneization and loss of coding potential. Here, we uncover an essential role for the Xic-linked noncoding gene Ftx in the regulation of Xist expression. We show that Ftx is required in cis to promote Xist transcriptional activation and establishment of XCI. Importantly, we demonstrate that this function depends on Ftx transcription and not on the RNA products. Our findings illustrate the multiplicity of layers operating in the establishment of XCI and highlight the diversity in the modus operandi of the noncoding players. Copyright © 2018 Elsevier Inc. All rights reserved.
The Landscape of long non-coding RNA classification

PubMed Central

St Laurent, Georges; Wahlestedt, Claes; Kapranov, Philipp

2015-01-01

Advances in the depth and quality of transcriptome sequencing have revealed many new classes of long non-coding RNAs (lncRNAs). lncRNA classification has mushroomed to accommodate these new findings, even though the real dimensions and complexity of the non-coding transcriptome remain unknown. Although evidence of functionality of specific lncRNAs continues to accumulate, conflicting, confusing, and overlapping terminology has fostered ambiguity and lack of clarity in the field in general. The lack of fundamental conceptual un-ambiguous classification framework results in a number of challenges in the annotation and interpretation of non-coding transcriptome data. It also might undermine integration of the new genomic methods and datasets in an effort to unravel function of lncRNA. Here, we review existing lncRNA classifications, nomenclature, and terminology. Then we describe the conceptual guidelines that have emerged for their classification and functional annotation based on expanding and more comprehensive use of large systems biology-based datasets. PMID:25869999
The Hippo pathway in hepatocellular carcinoma: Non-coding RNAs in action.

PubMed

Shi, Xuan; Zhu, Hai-Rong; Liu, Tao-Tao; Shen, Xi-Zhong; Zhu, Ji-Min

2017-08-01

Hepatocellular carcinoma (HCC) is the sixth most common cancer and the third leading cause of cancer-related death worldwide. However, current strategies curing HCC are far from satisfaction. The Hippo pathway is an evolutionarily conserved tumor suppressive pathway that plays crucial roles in organ size control and tissue homeostasis. Its dysregulation is commonly observed in various types of cancer including HCC. Recently, the prominent role of non-coding RNAs in the Hippo pathway during normal development and neoplastic progression is also emerging in liver. Thus, further investigation into the regulatory network between non-coding RNAs and the Hippo pathway and their connections with HCC may provide new therapeutic avenues towards developing an effective preventative or perhaps curative treatment for HCC. Herein we summarize the role of non-coding RNAs in the Hippo pathway, with an emphasis on their contribution to carcinogenesis, diagnosis, treatment and prognosis of HCC. Copyright © 2017 Elsevier B.V. All rights reserved.
Nucleotide sequence determination of guinea-pig casein B mRNA reveals homology with bovine and rat alpha s1 caseins and conservation of the non-coding regions of the mRNA.

PubMed Central

Hall, L; Laird, J E; Craig, R K

1984-01-01

Nucleotide sequence analysis of cloned guinea-pig casein B cDNA sequences has identified two casein B variants related to the bovine and rat alpha s1 caseins. Amino acid homology was largely confined to the known bovine or predicted rat phosphorylation sites and within the 'signal' precursor sequence. Comparison of the deduced nucleotide sequence of the guinea-pig and rat alpha s1 casein mRNA species showed greater sequence conservation in the non-coding than in the coding regions, suggesting a functional and possibly regulatory role for the non-coding regions of casein mRNA. The results provide insight into the evolution of the casein genes, and raise questions as to the role of conserved nucleotide sequences within the non-coding regions of mRNA species. Images Fig. 1. PMID:6548375
Transcriptional Regulation in Ebola Virus: Effects of Gene Border Structure and Regulatory Elements on Gene Expression and Polymerase Scanning Behavior

PubMed Central

Brauburger, Kristina; Boehmann, Yannik; Krähling, Verena

2015-01-01

ABSTRACT The highly pathogenic Ebola virus (EBOV) has a nonsegmented negative-strand (NNS) RNA genome containing seven genes. The viral genes either are separated by intergenic regions (IRs) of variable length or overlap. The structure of the EBOV gene overlaps is conserved throughout all filovirus genomes and is distinct from that of the overlaps found in other NNS RNA viruses. Here, we analyzed how diverse gene borders and noncoding regions surrounding the gene borders influence transcript levels and govern polymerase behavior during viral transcription. Transcription of overlapping genes in EBOV bicistronic minigenomes followed the stop-start mechanism, similar to that followed by IR-containing gene borders. When the gene overlaps were extended, the EBOV polymerase was able to scan the template in an upstream direction. This polymerase feature seems to be generally conserved among NNS RNA virus polymerases. Analysis of IR-containing gene borders showed that the IR sequence plays only a minor role in transcription regulation. Changes in IR length were generally well tolerated, but specific IR lengths led to a strong decrease in downstream gene expression. Correlation analysis revealed that these effects were largely independent of the surrounding gene borders. Each EBOV gene contains exceptionally long untranslated regions (UTRs) flanking the open reading frame. Our data suggest that the UTRs adjacent to the gene borders are the main regulators of transcript levels. A highly complex interplay between the different cis-acting elements to modulate transcription was revealed for specific combinations of IRs and UTRs, emphasizing the importance of the noncoding regions in EBOV gene expression control. IMPORTANCE Our data extend those from previous analyses investigating the implication of noncoding regions at the EBOV gene borders for gene expression control. We show that EBOV transcription is regulated in a highly complex yet not easily predictable manner by a set of interacting cis-active elements. These findings are important not only for the design of recombinant filoviruses but also for the design of other replicon systems widely used as surrogate systems to study the filovirus replication cycle under low biosafety levels. Insights into the complex regulation of EBOV transcription conveyed by noncoding sequences will also help to interpret the importance of mutations that have been detected within these regions, including in isolates of the current outbreak. PMID:26656691
Transcriptional Regulation in Ebola Virus: Effects of Gene Border Structure and Regulatory Elements on Gene Expression and Polymerase Scanning Behavior.

PubMed

Brauburger, Kristina; Boehmann, Yannik; Krähling, Verena; Mühlberger, Elke

2016-02-15

The highly pathogenic Ebola virus (EBOV) has a nonsegmented negative-strand (NNS) RNA genome containing seven genes. The viral genes either are separated by intergenic regions (IRs) of variable length or overlap. The structure of the EBOV gene overlaps is conserved throughout all filovirus genomes and is distinct from that of the overlaps found in other NNS RNA viruses. Here, we analyzed how diverse gene borders and noncoding regions surrounding the gene borders influence transcript levels and govern polymerase behavior during viral transcription. Transcription of overlapping genes in EBOV bicistronic minigenomes followed the stop-start mechanism, similar to that followed by IR-containing gene borders. When the gene overlaps were extended, the EBOV polymerase was able to scan the template in an upstream direction. This polymerase feature seems to be generally conserved among NNS RNA virus polymerases. Analysis of IR-containing gene borders showed that the IR sequence plays only a minor role in transcription regulation. Changes in IR length were generally well tolerated, but specific IR lengths led to a strong decrease in downstream gene expression. Correlation analysis revealed that these effects were largely independent of the surrounding gene borders. Each EBOV gene contains exceptionally long untranslated regions (UTRs) flanking the open reading frame. Our data suggest that the UTRs adjacent to the gene borders are the main regulators of transcript levels. A highly complex interplay between the different cis-acting elements to modulate transcription was revealed for specific combinations of IRs and UTRs, emphasizing the importance of the noncoding regions in EBOV gene expression control. Our data extend those from previous analyses investigating the implication of noncoding regions at the EBOV gene borders for gene expression control. We show that EBOV transcription is regulated in a highly complex yet not easily predictable manner by a set of interacting cis-active elements. These findings are important not only for the design of recombinant filoviruses but also for the design of other replicon systems widely used as surrogate systems to study the filovirus replication cycle under low biosafety levels. Insights into the complex regulation of EBOV transcription conveyed by noncoding sequences will also help to interpret the importance of mutations that have been detected within these regions, including in isolates of the current outbreak. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Present Scenario of Long Non-Coding RNAs in Plants

PubMed Central

Bhatia, Garima; Goyal, Neetu; Sharma, Shailesh; Upadhyay, Santosh Kumar; Singh, Kashmir

2017-01-01

Small non-coding RNAs have been extensively studied in plants over the last decade. In contrast, genome-wide identification of plant long non-coding RNAs (lncRNAs) has recently gained momentum. LncRNAs are now being recognized as important players in gene regulation, and their potent regulatory roles are being studied comprehensively in eukaryotes. LncRNAs were first reported in humans in 1992. Since then, research in animals, particularly in humans, has rapidly progressed, and a vast amount of data has been generated, collected, and organized using computational approaches. Additionally, numerous studies have been conducted to understand the roles of these long RNA species in several diseases. However, the status of lncRNA investigation in plants lags behind that in animals (especially humans). Efforts are being made in this direction using computational tools and high-throughput sequencing technologies, such as the lncRNA microarray technique, RNA-sequencing (RNA-seq), RNA capture sequencing, (RNA CaptureSeq), etc. Given the current scenario, significant amounts of data have been produced regarding plant lncRNAs, and this amount is likely to increase in the subsequent years. In this review we have documented brief information about lncRNAs and their status of research in plants, along with the plant-specific resources/databases for information retrieval on lncRNAs. PMID:29657289
Identification and Characterization of Small Noncoding RNAs in Genome Sequences of the Edible Fungus Pleurotus ostreatus

PubMed Central

Zhao, Mengran; Hsiang, Tom; Feng, Xiaoxing

2016-01-01

Noncoding RNAs (ncRNAs) have been identified in many fungi. However, no genome-scale identification of ncRNAs has been inventoried for basidiomycetes. In this research, we detected 254 small noncoding RNAs (sncRNAs) in a genome assembly of an isolate (CCEF00389) of Pleurotus ostreatus, which is a widely cultivated edible basidiomycetous fungus worldwide. The identified sncRNAs include snRNAs, snoRNAs, tRNAs, and miRNAs. SnRNA U1 was not found in CCEF00389 genome assembly and some other basidiomycetous genomes by BLASTn. This implies that if snRNA U1 of basidiomycetes exists, it has a sequence that varies significantly from other organisms. By analyzing the distribution of sncRNA loci, we found that snRNAs and most tRNAs (88.6%) were located in pseudo-UTR regions, while miRNAs are commonly found in introns. To analyze the evolutionary conservation of the sncRNAs in P. ostreatus, we aligned all 254 sncRNAs to the genome assemblies of some other Agaricomycotina fungi. The results suggest that most sncRNAs (77.56%) were highly conserved in P. ostreatus, and 20% were conserved in Agaricomycotina fungi. These findings indicate that most sncRNAs of P. ostreatus were not conserved across Agaricomycotina fungi. PMID:27703969
Long Non-Coding RNAs: Key Regulators of Epithelial-Mesenchymal Transition, Tumour Drug Resistance and Cancer Stem Cells

PubMed Central

Heery, Richard; Finn, Stephen P.; Cuffe, Sinead; Gray, Steven G.

2017-01-01

Epithelial mesenchymal transition (EMT), the adoption by epithelial cells of a mesenchymal-like phenotype, is a process co-opted by carcinoma cells in order to initiate invasion and metastasis. In addition, it is becoming clear that is instrumental to both the development of drug resistance by tumour cells and in the generation and maintenance of cancer stem cells. EMT is thus a pivotal process during tumour progression and poses a major barrier to the successful treatment of cancer. Non-coding RNAs (ncRNA) often utilize epigenetic programs to regulate both gene expression and chromatin structure. One type of ncRNA, called long non-coding RNAs (lncRNAs), has become increasingly recognized as being both highly dysregulated in cancer and to play a variety of different roles in tumourigenesis. Indeed, over the last few years, lncRNAs have rapidly emerged as key regulators of EMT in cancer. In this review, we discuss the lncRNAs that have been associated with the EMT process in cancer and the variety of molecular mechanisms and signalling pathways through which they regulate EMT, and finally discuss how these EMT-regulating lncRNAs impact on both anti-cancer drug resistance and the cancer stem cell phenotype. PMID:28430163
Long-range correlation properties of coding and noncoding DNA sequences: GenBank analysis.

PubMed

Buldyrev, S V; Goldberger, A L; Havlin, S; Mantegna, R N; Matsa, M E; Peng, C K; Simons, M; Stanley, H E

1995-05-01

An open question in computational molecular biology is whether long-range correlations are present in both coding and noncoding DNA or only in the latter. To answer this question, we consider all 33301 coding and all 29453 noncoding eukaryotic sequences--each of length larger than 512 base pairs (bp)--in the present release of the GenBank to dtermine whether there is any statistically significant distinction in their long-range correlation properties. Standard fast Fourier transform (FFT) analysis indicates that coding sequences have practically no correlations in the range from 10 bp to 100 bp (spectral exponent beta=0.00 +/- 0.04, where the uncertainty is two standard deviations). In contrast, for noncoding sequences, the average value of the spectral exponent beta is positive (0.16 +/- 0.05) which unambiguously shows the presence of long-range correlations. We also separately analyze the 874 coding and the 1157 noncoding sequences that have more than 4096 bp and find a larger region of power-law behavior. We calculate the probability that these two data sets (coding and noncoding) were drawn from the same distribution and we find that it is less than 10(-10). We obtain independent confirmation of these findings using the method of detrended fluctuation analysis (DFA), which is designed to treat sequences with statistical heterogeneity, such as DNA's known mosaic structure ("patchiness") arising from the nonstationarity of nucleotide concentration. The near-perfect agreement between the two independent analysis methods, FFT and DFA, increases the confidence in the reliability of our conclusion.
Long-range correlation properties of coding and noncoding DNA sequences: GenBank analysis

NASA Technical Reports Server (NTRS)

Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Mantegna, R. N.; Matsa, M. E.; Peng, C. K.; Simons, M.; Stanley, H. E.

1995-01-01

An open question in computational molecular biology is whether long-range correlations are present in both coding and noncoding DNA or only in the latter. To answer this question, we consider all 33301 coding and all 29453 noncoding eukaryotic sequences--each of length larger than 512 base pairs (bp)--in the present release of the GenBank to dtermine whether there is any statistically significant distinction in their long-range correlation properties. Standard fast Fourier transform (FFT) analysis indicates that coding sequences have practically no correlations in the range from 10 bp to 100 bp (spectral exponent beta=0.00 +/- 0.04, where the uncertainty is two standard deviations). In contrast, for noncoding sequences, the average value of the spectral exponent beta is positive (0.16 +/- 0.05) which unambiguously shows the presence of long-range correlations. We also separately analyze the 874 coding and the 1157 noncoding sequences that have more than 4096 bp and find a larger region of power-law behavior. We calculate the probability that these two data sets (coding and noncoding) were drawn from the same distribution and we find that it is less than 10(-10). We obtain independent confirmation of these findings using the method of detrended fluctuation analysis (DFA), which is designed to treat sequences with statistical heterogeneity, such as DNA's known mosaic structure ("patchiness") arising from the nonstationarity of nucleotide concentration. The near-perfect agreement between the two independent analysis methods, FFT and DFA, increases the confidence in the reliability of our conclusion.

Potential miRNA regulators of differential HPG axis gene expression between low egg producing and high egg producing turkey hens

USDA-ARS?s Scientific Manuscript database

Expression differences exist in key genes of the hypothalamo-pituitary-gonadal (HPG) axis in low egg producing hens (LEPH) and high egg producing hens (HEPH); however, regulation of these differences is unknown. MicroRNAs (miRNAs) are small non-coding RNAs that play a role in post-transcriptional re...
Complex organisation and structure of the ghrelin antisense strand gene GHRLOS, a candidate non-coding RNA gene

PubMed Central

Seim, Inge; Carter, Shea L; Herington, Adrian C; Chopin, Lisa K

2008-01-01

Background The peptide hormone ghrelin has many important physiological and pathophysiological roles, including the stimulation of growth hormone (GH) release, appetite regulation, gut motility and proliferation of cancer cells. We previously identified a gene on the opposite strand of the ghrelin gene, ghrelinOS (GHRLOS), which spans the promoter and untranslated regions of the ghrelin gene (GHRL). Here we further characterise GHRLOS. Results We have described GHRLOS mRNA isoforms that extend over 1.4 kb of the promoter region and 106 nucleotides of exon 4 of the ghrelin gene, GHRL. These GHRLOS transcripts initiate 4.8 kb downstream of the terminal exon 4 of GHRL and are present in the 3' untranslated exon of the adjacent gene TATDN2 (TatD DNase domain containing 2). Interestingly, we have also identified a putative non-coding TATDN2-GHRLOS chimaeric transcript, indicating that GHRLOS RNA biogenesis is extremely complex. Moreover, we have discovered that the 3' region of GHRLOS is also antisense, in a tail-to-tail fashion to a novel terminal exon of the neighbouring SEC13 gene, which is important in protein transport. Sequence analyses revealed that GHRLOS is riddled with stop codons, and that there is little nucleotide and amino-acid sequence conservation of the GHRLOS gene between vertebrates. The gene spans 44 kb on 3p25.3, is extensively spliced and harbours multiple variable exons. We have also investigated the expression of GHRLOS and found evidence of differential tissue expression. It is highly expressed in tissues which are emerging as major sites of non-coding RNA expression (the thymus, brain, and testis), as well as in the ovary and uterus. In contrast, very low levels were found in the stomach where sense, GHRL derived RNAs are highly expressed. Conclusion GHRLOS RNA transcripts display several distinctive features of non-coding (ncRNA) genes, including 5' capping, polyadenylation, extensive splicing and short open reading frames. The gene is also non-conserved, with differential and tissue-restricted expression. The overlapping genomic arrangement of GHRLOS with the ghrelin gene indicates that it is likely to have interesting regulatory and functional roles in the ghrelin axis. PMID:18954468
Complex organisation and structure of the ghrelin antisense strand gene GHRLOS, a candidate non-coding RNA gene.

PubMed

Seim, Inge; Carter, Shea L; Herington, Adrian C; Chopin, Lisa K

2008-10-28

The peptide hormone ghrelin has many important physiological and pathophysiological roles, including the stimulation of growth hormone (GH) release, appetite regulation, gut motility and proliferation of cancer cells. We previously identified a gene on the opposite strand of the ghrelin gene, ghrelinOS (GHRLOS), which spans the promoter and untranslated regions of the ghrelin gene (GHRL). Here we further characterise GHRLOS. We have described GHRLOS mRNA isoforms that extend over 1.4 kb of the promoter region and 106 nucleotides of exon 4 of the ghrelin gene, GHRL. These GHRLOS transcripts initiate 4.8 kb downstream of the terminal exon 4 of GHRL and are present in the 3' untranslated exon of the adjacent gene TATDN2 (TatD DNase domain containing 2). Interestingly, we have also identified a putative non-coding TATDN2-GHRLOS chimaeric transcript, indicating that GHRLOS RNA biogenesis is extremely complex. Moreover, we have discovered that the 3' region of GHRLOS is also antisense, in a tail-to-tail fashion to a novel terminal exon of the neighbouring SEC13 gene, which is important in protein transport. Sequence analyses revealed that GHRLOS is riddled with stop codons, and that there is little nucleotide and amino-acid sequence conservation of the GHRLOS gene between vertebrates. The gene spans 44 kb on 3p25.3, is extensively spliced and harbours multiple variable exons. We have also investigated the expression of GHRLOS and found evidence of differential tissue expression. It is highly expressed in tissues which are emerging as major sites of non-coding RNA expression (the thymus, brain, and testis), as well as in the ovary and uterus. In contrast, very low levels were found in the stomach where sense, GHRL derived RNAs are highly expressed. GHRLOS RNA transcripts display several distinctive features of non-coding (ncRNA) genes, including 5' capping, polyadenylation, extensive splicing and short open reading frames. The gene is also non-conserved, with differential and tissue-restricted expression. The overlapping genomic arrangement of GHRLOS with the ghrelin gene indicates that it is likely to have interesting regulatory and functional roles in the ghrelin axis.
Nonneutral GC3 and retroelement codon mimicry in Phytophthora.

PubMed

Jiang, Rays H Y; Govers, Francine

2006-10-01

Phytophthora is a genus entirely comprised of destructive plant pathogens. It belongs to the Stramenopila, a unique branch of eukaryotes, phylogenetically distinct from plants, animals, or fungi. Phytophthora genes show a strong preference for usage of codons ending with G or C (high GC3). The presence of high GC3 in genes can be utilized to differentiate coding regions from noncoding regions in the genome. We found that both selective pressure and mutation bias drive codon bias in Phytophthora. Indicative for selection pressure is the higher GC3 value of highly expressed genes in different Phytophthora species. Lineage specific GC increase of noncoding regions is reminiscent of whole-genome mutation bias, whereas the elevated Phytophthora GC3 is primarily a result of translation efficiency-driven selection. Heterogeneous retrotransposons exist in Phytophthora genomes and many of them vary in their GC content. Interestingly, the most widespread groups of retroelements in Phytophthora show high GC3 and a codon bias that is similar to host genes. Apparently, selection pressure has been exerted on the retroelement's codon usage, and such mimicry of host codon bias might be beneficial for the propagation of retrotransposons.
A gene family for acidic ribosomal proteins in Schizosaccharomyces pombe: two essential and two nonessential genes.

PubMed Central

Beltrame, M; Bianchi, M E

1990-01-01

We have cloned the genes for small acidic ribosomal proteins (A-proteins) of the fission yeast Schizosaccharomyces pombe. S. pombe contains four transcribed genes for small A-proteins per haploid genome, as is the case for Saccharomyces cerevisiae. In contrast, multicellular eucaryotes contain two transcribed genes per haploid genome. The four proteins of S. pombe, besides sharing a high overall similarity, form two couples of nearly identical sequences. Their corresponding genes have a very conserved structure and are transcribed to a similar level. Surprisingly, of each couple of genes coding for nearly identical proteins, one is essential for cell growth, whereas the other is not. We suggest that the unequal importance of the four small A-proteins for cell survival is related to their physical organization in 60S ribosomal subunits. Images PMID:2325655
Phylogeographically concordant chloroplast DNA divergence in sympatric Nothofagus s.s. How deep can it be?

PubMed

Premoli, Andrea C; Mathiasen, Paula; Acosta, M Cristina; Ramos, Victor A

2012-01-01

• Here, we performed phylogenetic analyses and estimated the divergence times on mostly sympatric populations of five species within subgenus Nothofagus. We aimed to investigate whether phylogenetic relationships by nuclear internal transcribed spacer (ITS) and phylogeographic patterns by chloroplast DNA (cpDNA) mirror an ancient evolutionary history that was not erased by glacial eras. Extant species are restricted to Patagonia and share a pollen type that was formerly widespread in all southern land masses. Weak reproductive barriers exist among them. • Fifteen cpDNA haplotypes resulted from the analysis of three noncoding regions on 330 individuals with a total alignment of 1794 bp. Nuclear ITS data consisted of 822 bp. We found a deep cpDNA divergence dated 32 Ma at mid-latitudes of Patagonia that predates the phylogenetic divergence of extant taxa. Other more recent breaks by cpDNA occurred towards the north. • Complex paleogeographic features explain the genetic discontinuities. Long-lasting paleobasins and marine ingressions have impeded transoceanic dispersal during range expansion towards lower latitudes under cooler trends since the Oligocene. • Cycles of hybridization-introgression among extant and extinct taxa have resulted in widespread chloroplast capture events. Our data suggest that Nothofagus biogeography will be resolved only if thorough phylogeographic analyses and molecular dating methods are applied using distinct genetic markers. © 2011 The Authors. New Phytologist © 2011 New Phytologist Trust.
Live-cell imaging reveals the dynamics and function of single-telomere TERRA molecules in cancer cells.

PubMed

Avogaro, Laura; Querido, Emmanuelle; Dalachi, Myriam; Jantsch, Michael F; Chartrand, Pascal; Cusanelli, Emilio

2018-04-16

Telomeres cap the ends of eukaryotic chromosomes, protecting them from degradation and erroneous recombination events which may lead to genome instability. Telomeres are transcribed giving rise to telomeric repeat-containing RNAs, called TERRA. The TERRA long noncoding RNAs have been proposed to play important roles in telomere biology, including heterochromatin formation and telomere length homeostasis. While TERRA RNAs are predominantly nuclear and localize at telomeres, little is known about the dynamics and function of TERRA molecules expressed from individual telomeres. Herein, we developed an assay to image endogenous TERRA molecules expressed from a single telomere in living human cancer cells. We show that single-telomere TERRA can be detected as TERRA RNA single particles which freely diffuse within the nucleus. Furthermore, TERRA molecules aggregate forming TERRA clusters. Three-dimensional size distribution and single particle tracking analyses revealed distinct sizes and dynamics for TERRA RNA single particles and clusters. Simultaneous time lapse confocal imaging of TERRA particles and telomeres showed that TERRA clusters transiently co-localize with telomeres. Finally, we used chemically modified antisense oligonucleotides to deplete TERRA molecules expressed from a single telomere. Single-telomere TERRA depletion resulted in increased DNA damage at telomeres and elsewhere in the genome. These results suggest that single-telomere TERRA transcripts participate in the maintenance of genomic integrity in human cancer cells.
The inhibition of 45A ncRNA expression reduces tumor formation, affecting tumor nodules compactness and metastatic potential in neuroblastoma cells

PubMed Central

Russo, Debora; Poggi, Alessandro; Villa, Federico; Brizzolara, Antonella; Canale, Claudio; Mescola, Andrea; Daga, Antonio; Russo, Claudio; Nizzari, Mario; Florio, Tullio; Menichini, Paola; Pagano, Aldo

2017-01-01

We recently reported the in vitro over-expression of 45A, a RNA polymerase III-transcribed non-coding (nc)RNA, that perturbs the intracellular content of FE65L1 affecting cell proliferation rate, short-term response to genotoxic stress, substrate adhesion capacity and, ultimately, increasing the tumorigenic potential of human neuroblastoma cells. In this work, to deeply explore the mechanism by which 45A ncRNA contributes to cancer development, we targeted in vitro and in vivo 45A levels by the stable overexpression of antisense 45A RNA. 45A downregulation leads to deep modifications of cytoskeleton organization, adhesion and migration of neuroblastoma cells. These effects are correlated with alterations in the expression of several genes including GTSE1 (G2 and S phase-expressed-1), a crucial regulator of tumor cell migration and metastatic potential. Interestingly, the downregulation of 45A ncRNA strongly affects the in vivo tumorigenic potential of SKNBE2 neuroblastoma cells, increasing tumor nodule compactness and reducing GTSE1 protein expression in a subcutaneous neuroblastoma mouse model. Moreover, intracardiac injection of neuroblastoma cells showed that downregulation of 45A ncRNA also influences tumor metastatic ability. In conclusion, our data highlight a key role of 45A ncRNA in cancer development and suggest that its modulation might represent a possible novel anticancer therapeutic approach. PMID:28029658
The inhibition of 45A ncRNA expression reduces tumor formation, affecting tumor nodules compactness and metastatic potential in neuroblastoma cells.

PubMed

Penna, Ilaria; Gigoni, Arianna; Costa, Delfina; Vella, Serena; Russo, Debora; Poggi, Alessandro; Villa, Federico; Brizzolara, Antonella; Canale, Claudio; Mescola, Andrea; Daga, Antonio; Russo, Claudio; Nizzari, Mario; Florio, Tullio; Menichini, Paola; Pagano, Aldo

2017-01-31

We recently reported the in vitro over-expression of 45A, a RNA polymerase III-transcribed non-coding (nc)RNA, that perturbs the intracellular content of FE65L1 affecting cell proliferation rate, short-term response to genotoxic stress, substrate adhesion capacity and, ultimately, increasing the tumorigenic potential of human neuroblastoma cells. In this work, to deeply explore the mechanism by which 45A ncRNA contributes to cancer development, we targeted in vitro and in vivo 45A levels by the stable overexpression of antisense 45A RNA.45A downregulation leads to deep modifications of cytoskeleton organization, adhesion and migration of neuroblastoma cells. These effects are correlated with alterations in the expression of several genes including GTSE1 (G2 and S phase-expressed-1), a crucial regulator of tumor cell migration and metastatic potential. Interestingly, the downregulation of 45A ncRNA strongly affects the in vivo tumorigenic potential of SKNBE2 neuroblastoma cells, increasing tumor nodule compactness and reducing GTSE1 protein expression in a subcutaneous neuroblastoma mouse model. Moreover, intracardiac injection of neuroblastoma cells showed that downregulation of 45A ncRNA also influences tumor metastatic ability. In conclusion, our data highlight a key role of 45A ncRNA in cancer development and suggest that its modulation might represent a possible novel anticancer therapeutic approach.
Allopolyploidization and evolution of species with reduced floral structures in Lepidium L. (Brassicaceae)

PubMed Central

Lee, Ji-Young; Mummenhoff, Klaus; Bowman, John L.

2002-01-01

Understanding the pattern of speciation in a group of plants is critical for understanding its morphological evolution. Lepidium is the genus with the largest variation in floral structure in Brassicaceae, a family in which the floral ground plan is remarkably stable. However, flowers in more than half of Lepidium species have reduced stamen numbers, and most of these also have reduced petals. The species with reduced flowers are geographically biased, distributed mostly in the Americas and Australia/ New Zealand. Previous phylogenetic studies using noncoding regions of chloroplast DNA and rDNA internal transcribed spacer were incongruent in most New World species relationships. These data, combined with the presence of many polyploid Lepidium species, implied a reticulate history of the genus but did not provide enough information to infer the evolutionary pattern of flower structures. To address this question more thoroughly, sequences of the first intron of a single copy nuclear gene, PISTILLATA, were determined from 43 species. Phylogenetic analysis of the PI intron suggests that many species in the New World have originated from allopolyploidization, and that this is correlated with floral reduction. Interspecific hybrids were generated to understand why allopolyploidization is associated with reduced flowers. The phenotypes of F1 flowers indicate allelic dominance of the absence of lateral stamens, suggesting that propagation of dominant alleles through interspecific hybridization could account for the abundance of the allopolyploid species without lateral stamens. PMID:12481035
Relative stability of DNA as a generic criterion for promoter prediction: whole genome annotation of microbial genomes with varying nucleotide base composition.

PubMed

Rangannan, Vetriselvi; Bansal, Manju

2009-12-01

The rapid increase in genome sequence information has necessitated the annotation of their functional elements, particularly those occurring in the non-coding regions, in the genomic context. Promoter region is the key regulatory region, which enables the gene to be transcribed or repressed, but it is difficult to determine experimentally. Hence an in silico identification of promoters is crucial in order to guide experimental work and to pin point the key region that controls the transcription initiation of a gene. In this analysis, we demonstrate that while the promoter regions are in general less stable than the flanking regions, their average free energy varies depending on the GC composition of the flanking genomic sequence. We have therefore obtained a set of free energy threshold values, for genomic DNA with varying GC content and used them as generic criteria for predicting promoter regions in several microbial genomes, using an in-house developed tool PromPredict. On applying it to predict promoter regions corresponding to the 1144 and 612 experimentally validated TSSs in E. coli (50.8% GC) and B. subtilis (43.5% GC) sensitivity of 99% and 95% and precision values of 58% and 60%, respectively, were achieved. For the limited data set of 81 TSSs available for M. tuberculosis (65.6% GC) a sensitivity of 100% and precision of 49% was obtained.
The circRNA interactome–innovative hallmarks of the intra- and extracellular radiation response

PubMed Central

O'Leary, Valerie Bríd; Smida, Jan; Matjanovski, Martina; Brockhaus, Corinna; Winkler, Klaudia; Moertl, Simone; Ovsepian, Saak Victor; Atkinson, Michael John

2017-01-01

Generated by Quaking (QKI), circular RNAs (circRNAs) are newly recognised non-coding RNA (ncRNA) members characterised by tissue specificity, increased stability and enrichment within exosomes. Studies have shown that ionizing radiation (IR) can influence ncRNA transcription. However, it is unknown whether circRNAs or indeed QKI are regulated by IR. Microarray circRNA profiling and next generation sequencing revealed that circRNA expression was altered by low and medium dose exposure sourced predominantly from genes influencing the p53 pathway. CircRNAs KIRKOS-71 and KIRKOS-73 transcribed from the WWOX (WW Domain Containing Oxidoreductase) tumor suppressor (a p53 regulator) responded within hours to IR. KIRKOS-71 and KIRKOS-73 were present in exosomes yet exhibited differential transcript clearance between irradiated cell lines. Dual-quasar labelled probes and in-situ hybridization demonstrated the intercellular distribution of KIRKOS-71 and KIRKOS-73 predominantly within the perinucleus. QKI knockdown removed nuclear expression of these circRNAs with no significant effect on cytosolic KIRKOS-71 and KIRKOS-73. Distinct QKI transcription between cell lines and its augmented interaction with KIRKOS-71 and KIRKOS-73 was noted post IR. This foremost study provides evidence that QKI and circRNAs partake in the cellular irradiation response. KIRKOS-71 and KIRKOS-73 as stable secreted circRNAs may afford vital characteristics worth syphoning as promising diagnostic radiotherapy biomarkers. PMID:29108237
Unique signatures of long noncoding RNA expression in response to virus infection and altered innate immune signaling.

PubMed

Peng, Xinxia; Gralinski, Lisa; Armour, Christopher D; Ferris, Martin T; Thomas, Matthew J; Proll, Sean; Bradel-Tretheway, Birgit G; Korth, Marcus J; Castle, John C; Biery, Matthew C; Bouzek, Heather K; Haynor, David R; Frieman, Matthew B; Heise, Mark; Raymond, Christopher K; Baric, Ralph S; Katze, Michael G

2010-10-26

Studies of the host response to virus infection typically focus on protein-coding genes. However, non-protein-coding RNAs (ncRNAs) are transcribed in mammalian cells, and the roles of many of these ncRNAs remain enigmas. Using next-generation sequencing, we performed a whole-transcriptome analysis of the host response to severe acute respiratory syndrome coronavirus (SARS-CoV) infection across four founder mouse strains of the Collaborative Cross. We observed differential expression of approximately 500 annotated, long ncRNAs and 1,000 nonannotated genomic regions during infection. Moreover, studies of a subset of these ncRNAs and genomic regions showed the following. (i) Most were similarly regulated in response to influenza virus infection. (ii) They had distinctive kinetic expression profiles in type I interferon receptor and STAT1 knockout mice during SARS-CoV infection, including unique signatures of ncRNA expression associated with lethal infection. (iii) Over 40% were similarly regulated in vitro in response to both influenza virus infection and interferon treatment. These findings represent the first discovery of the widespread differential expression of long ncRNAs in response to virus infection and suggest that ncRNAs are involved in regulating the host response, including innate immunity. At the same time, virus infection models provide a unique platform for studying the biology and regulation of ncRNAs.
Unique Signatures of Long Noncoding RNA Expression in Response to Virus Infection and Altered Innate Immune Signaling

PubMed Central

Peng, Xinxia; Gralinski, Lisa; Armour, Christopher D.; Ferris, Martin T.; Thomas, Matthew J.; Proll, Sean; Bradel-Tretheway, Birgit G.; Korth, Marcus J.; Castle, John C.; Biery, Matthew C.; Bouzek, Heather K.; Haynor, David R.; Frieman, Matthew B.; Heise, Mark; Raymond, Christopher K.; Baric, Ralph S.; Katze, Michael G.

2010-01-01

Studies of the host response to virus infection typically focus on protein-coding genes. However, non-protein-coding RNAs (ncRNAs) are transcribed in mammalian cells, and the roles of many of these ncRNAs remain enigmas. Using next-generation sequencing, we performed a whole-transcriptome analysis of the host response to severe acute respiratory syndrome coronavirus (SARS-CoV) infection across four founder mouse strains of the Collaborative Cross. We observed differential expression of approximately 500 annotated, long ncRNAs and 1,000 nonannotated genomic regions during infection. Moreover, studies of a subset of these ncRNAs and genomic regions showed the following. (i) Most were similarly regulated in response to influenza virus infection. (ii) They had distinctive kinetic expression profiles in type I interferon receptor and STAT1 knockout mice during SARS-CoV infection, including unique signatures of ncRNA expression associated with lethal infection. (iii) Over 40% were similarly regulated in vitro in response to both influenza virus infection and interferon treatment. These findings represent the first discovery of the widespread differential expression of long ncRNAs in response to virus infection and suggest that ncRNAs are involved in regulating the host response, including innate immunity. At the same time, virus infection models provide a unique platform for studying the biology and regulation of ncRNAs. PMID:20978541
xRRM

PubMed Central

Singh, Mahavir; Choi, Charles P.; Feigon, Juli

2013-01-01

Genuine La and La-related proteins group 7 (LARP7) bind to the non-coding RNAs transcribed by RNA polymerase III (RNAPIII), which end in UUU-3′OH. The La motif and RRM1 of these proteins (the La module) cooperate to bind the UUU-3′OH, protecting the RNA from degradation, while other domains may be important for RNA folding or other functions. Among the RNAPIII transcripts is ciliate telomerase RNA (TER). p65, a member of the LARP7 family, is an integral Tetrahymena thermophila telomerase holoenzyme protein required for TER biogenesis and telomerase RNP assembly. p65, together with TER and telomerase reverse transcriptase (TERT), form the Tetrahymena telomerase RNP catalytic core. p65 has an N-terminal domain followed by a La module and a C-terminal domain, which binds to the TER stem 4. We recently showed that the p65 C-terminal domain harbors a cryptic, atypical RRM, which uses a unique mode of single- and double-strand RNA binding and is required for telomerase RNP catalytic core assembly. This domain, which we named xRRM, appears to be present in and unique to genuine La and LARP7 proteins. Here we review the structure of the xRRM, discuss how this domain could recognize diverse substrates of La and LARP7 proteins and discuss the functional implications of the xRRM as an RNP chaperone. PMID:23328630
A resource for functional profiling of noncoding RNA in the yeast Saccharomyces cerevisiae.

PubMed

Parker, Steven; Fraczek, Marcin G; Wu, Jian; Shamsah, Sara; Manousaki, Alkisti; Dungrattanalert, Kobchai; de Almeida, Rogerio Alves; Estrada-Rivadeneyra, Diego; Omara, Walid; Delneri, Daniela; O'Keefe, Raymond T

2017-08-01

Eukaryotic genomes are extensively transcribed, generating many different RNAs with no known function. We have constructed 1502 molecular barcoded ncRNA gene deletion strains encompassing 443 ncRNAs in the yeast Saccharomyces cerevisiae as tools for ncRNA functional analysis. This resource includes deletions of small nuclear RNAs (snRNAs), transfer RNAs (tRNAs), small nucleolar RNAs (snoRNAs), and other annotated ncRNAs as well as the more recently identified stable unannotated transcripts (SUTs) and cryptic unstable transcripts (CUTs) whose functions are largely unknown. Specifically, deletions have been constructed for ncRNAs found in the intergenic regions, not overlapping genes or their promoters (i.e., at least 200 bp minimum distance from the closest gene start codon). The deletion strains carry molecular barcodes designed to be complementary with the protein gene deletion collection enabling parallel analysis experiments. These strains will be useful for the numerous genomic and molecular techniques that utilize deletion strains, including genome-wide phenotypic screens under different growth conditions, pooled chemogenomic screens with drugs or chemicals, synthetic genetic array analysis to uncover novel genetic interactions, and synthetic dosage lethality screens to analyze gene dosage. Overall, we created a valuable resource for the RNA community and for future ncRNA research. © 2017 Parker et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Sost, independent of the non-coding enhancer ECR5, is required for bone mechanoadaptation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Robling, Alexander G.; Kang, Kyung Shin; Bullock, Whitney A.

Here, sclerostin ( Sost) is a negative regulator of bone formation that acts upon the Wnt signaling pathway. Sost is mechanically regulated at both mRNA and protein level such that loading represses and unloading enhances Sost expression, in osteocytes and in circulation. The non-coding evolutionarily conserved enhancer ECR5 has been previously reported as a transcriptional regulatory element required for modulating Sost expression in osteocytes. Here we explored the mechanisms by which ECR5, or several other putative transcriptional enhancers regulate Sost expression, in response to mechanical stimulation. We found that in vivo ulna loading is equally osteoanabolic in wildtype and Sostmore » –/– mice, although Sost is required for proper distribution of load-induced bone formation to regions of high strain. Using Luciferase reporters carrying the ECR5 non-coding enhancer and heterologous or homologous h SOST promoters, we found that ECR5 is mechanosensitive in vitro and that ECR5-driven Luciferase activity decreases in osteoblasts exposed to oscillatory fluid flow. Yet, ECR5–/– mice showed similar magnitude of load-induced bone formation and similar periosteal distribution of bone formation to high-strain regions compared to wildtype mice. Further, we found that in contrast to Sost–/– mice, which are resistant to disuse-induced bone loss, ECR5–/– mice lose bone upon unloading to a degree similar to wildtype control mice. ECR5 deletion did not abrogate positive effects of unloading on Sost, suggesting that additional transcriptional regulators and regulatory elements contribute to load-induced regulation of Sost.« less
Sost, independent of the non-coding enhancer ECR5, is required for bone mechanoadaptation

DOE PAGES

Robling, Alexander G.; Kang, Kyung Shin; Bullock, Whitney A.; ...

2016-09-04

Here, sclerostin ( Sost) is a negative regulator of bone formation that acts upon the Wnt signaling pathway. Sost is mechanically regulated at both mRNA and protein level such that loading represses and unloading enhances Sost expression, in osteocytes and in circulation. The non-coding evolutionarily conserved enhancer ECR5 has been previously reported as a transcriptional regulatory element required for modulating Sost expression in osteocytes. Here we explored the mechanisms by which ECR5, or several other putative transcriptional enhancers regulate Sost expression, in response to mechanical stimulation. We found that in vivo ulna loading is equally osteoanabolic in wildtype and Sostmore » –/– mice, although Sost is required for proper distribution of load-induced bone formation to regions of high strain. Using Luciferase reporters carrying the ECR5 non-coding enhancer and heterologous or homologous h SOST promoters, we found that ECR5 is mechanosensitive in vitro and that ECR5-driven Luciferase activity decreases in osteoblasts exposed to oscillatory fluid flow. Yet, ECR5–/– mice showed similar magnitude of load-induced bone formation and similar periosteal distribution of bone formation to high-strain regions compared to wildtype mice. Further, we found that in contrast to Sost–/– mice, which are resistant to disuse-induced bone loss, ECR5–/– mice lose bone upon unloading to a degree similar to wildtype control mice. ECR5 deletion did not abrogate positive effects of unloading on Sost, suggesting that additional transcriptional regulators and regulatory elements contribute to load-induced regulation of Sost.« less
NPInter v3.0: an upgraded database of noncoding RNA-associated interactions

PubMed Central

Hao, Yajing; Wu, Wei; Li, Hui; Yuan, Jiao; Luo, Jianjun; Zhao, Yi; Chen, Runsheng

2016-01-01

Despite the fact that a large quantity of noncoding RNAs (ncRNAs) have been identified, their functions remain unclear. To enable researchers to have a better understanding of ncRNAs’ functions, we updated the NPInter database to version 3.0, which contains experimentally verified interactions between ncRNAs (excluding tRNAs and rRNAs), especially long noncoding RNAs (lncRNAs) and other biomolecules (proteins, mRNAs, miRNAs and genomic DNAs). In NPInter v3.0, interactions pertaining to ncRNAs are not only manually curated from scientific literature but also curated from high-throughput technologies. In addition, we also curated lncRNA–miRNA interactions from in silico predictions supported by AGO CLIP-seq data. When compared with NPInter v2.0, the interactions are more informative (with additional information on tissues or cell lines, binding sites, conservation, co-expression values and other features) and more organized (with divisions on data sets by data sources, tissues or cell lines, experiments and other criteria). NPInter v3.0 expands the data set to 491,416 interactions in 188 tissues (or cell lines) from 68 kinds of experimental technologies. NPInter v3.0 also improves the user interface and adds new web services, including a local UCSC Genome Browser to visualize binding sites. Additionally, NPInter v3.0 defined a high-confidence set of interactions and predicted the functions of lncRNAs in human and mouse based on the interactions curated in the database. NPInter v3.0 is available at http://www.bioinfo.org/NPInter/. Database URL: http://www.bioinfo.org/NPInter/ PMID:27087310
Upregulation of the long non-coding RNA SNHG1 predicts poor prognosis, promotes cell proliferation and invasion, and reduces apoptosis in glioma.

PubMed

Wang, Qiang; Li, Qing; Zhou, Peng; Deng, Danni; Xue, Lian; Shao, Naiyuan; Peng, Ya; Zhi, Feng

2017-07-01

Long non-coding RNAs (lncRNAs), which are non-coding RNAs with a length above 200 nucleotides, have emerged as novel and important gene expression modulators in carcinogenesis. Recent evidence indicates that the lncRNA small nucleolar RNA host gene 1 (SNHG1) functions as an oncogene in several types of human cancers. However, its function in the development of glioma remains unknown. The aim of this research was to investigate the clinical aspects and biological mechanisms of SNHG1 in glioma. SNHG1 expression was measured in glioma tissues and cell lines by quantitative real-time PCR (qRT-PCR). The association between SNHG1 expression in tissues and clinicopathological characteristics and prognosis in glioma patients was also explored. Gain-of-function and loss-of-function studies using SNHG1 cDNA and siRNA, respectively, were used to investigate the role of SNHG1 in cell proliferation, invasion and apoptosis in glioma. SNHG1 was highly expressed in glioma tissues, and its upregulation was closely related to old age. Kaplan-Meier analysis showed that high expression of SNHG1 was significantly associated with poor overall survival (OS). Functionally, ectopic expression of SNHG1 enhanced cell proliferation and cell invasion and reduced cell apoptosis in vitro, while SNHG1 knockdown reversed these effects. Taken together, our findings indicate that SNHG1 functions as an oncogene in glioma and may serve as a novel therapeutic target in future treatments. Copyright © 2017 Elsevier Masson SAS. All rights reserved.

Understanding the Role of Non-Coding RNAs in Bladder Cancer: From Dark Matter to Valuable Therapeutic Targets

PubMed Central

Pop-Bica, Cecilia; Gulei, Diana; Cojocneanu-Petric, Roxana; Braicu, Cornelia; Petrut, Bogdan; Berindan-Neagoe, Ioana

2017-01-01

The mortality and morbidity that characterize bladder cancer compel this malignancy into the category of hot topics in terms of biomolecular research. Therefore, a better knowledge of the specific molecular mechanisms that underlie the development and progression of bladder cancer is demanded. Tumor heterogeneity among patients with similar diagnosis, as well as intratumor heterogeneity, generates difficulties in terms of targeted therapy. Furthermore, late diagnosis represents an ongoing issue, significantly reducing the response to therapy and, inevitably, the overall survival. The role of non-coding RNAs in bladder cancer emerged in the last decade, revealing that microRNAs (miRNAs) may act as tumor suppressor genes, respectively oncogenes, but also as biomarkers for early diagnosis. Regarding other types of non-coding RNAs, especially long non-coding RNAs (lncRNAs) which are extensively reviewed in this article, their exact roles in tumorigenesis are—for the time being—not as evident as in the case of miRNAs, but, still, clearly suggested. Therefore, this review covers the non-coding RNA expression profile of bladder cancer patients and their validated target genes in bladder cancer cell lines, with repercussions on processes such as proliferation, invasiveness, apoptosis, cell cycle arrest, and other molecular pathways which are specific for the malignant transformation of cells. PMID:28703782
Understanding the Role of Non-Coding RNAs in Bladder Cancer: From Dark Matter to Valuable Therapeutic Targets.

PubMed

Pop-Bica, Cecilia; Gulei, Diana; Cojocneanu-Petric, Roxana; Braicu, Cornelia; Petrut, Bogdan; Berindan-Neagoe, Ioana

2017-07-13

The mortality and morbidity that characterize bladder cancer compel this malignancy into the category of hot topics in terms of biomolecular research. Therefore, a better knowledge of the specific molecular mechanisms that underlie the development and progression of bladder cancer is demanded. Tumor heterogeneity among patients with similar diagnosis, as well as intratumor heterogeneity, generates difficulties in terms of targeted therapy. Furthermore, late diagnosis represents an ongoing issue, significantly reducing the response to therapy and, inevitably, the overall survival. The role of non-coding RNAs in bladder cancer emerged in the last decade, revealing that microRNAs (miRNAs) may act as tumor suppressor genes, respectively oncogenes, but also as biomarkers for early diagnosis. Regarding other types of non-coding RNAs, especially long non-coding RNAs (lncRNAs) which are extensively reviewed in this article, their exact roles in tumorigenesis are-for the time being-not as evident as in the case of miRNAs, but, still, clearly suggested. Therefore, this review covers the non-coding RNA expression profile of bladder cancer patients and their validated target genes in bladder cancer cell lines, with repercussions on processes such as proliferation, invasiveness, apoptosis, cell cycle arrest, and other molecular pathways which are specific for the malignant transformation of cells.
Molecular interplay of pro-inflammatory transcription factors and non-coding RNAs in esophageal squamous cell carcinoma.

PubMed

Sundaram, Gopinath M; Veera Bramhachari, Pallaval

2017-06-01

Esophageal squamous cell carcinoma is the sixth most common cancer in the developing world. The aggressive nature of esophageal squamous cell carcinoma, its tendency for relapse, and the poor survival prospects of patients diagnosed at advanced stages, represent a pressing need for the development of new therapies for this disease. Chronic inflammation is known to have a causal link to cancer pre-disposition. Nuclear factor kappa B and signal transducer and activator of transcription 3 are transcription factors which regulate immunity and inflammation and are emerging as key regulators of tumor initiation, progression, and metastasis. Although these pro-inflammatory factors in esophageal squamous cell carcinoma have been well-characterized with reference to protein-coding targets, their functional interactions with non-coding RNAs have only recently been gaining attention. Non-coding RNAs, especially microRNAs and long non-coding RNAs demonstrate potential as biomarkers and alternative therapeutic targets. In this review, we summarize the recent literature and concepts on non-coding RNAs that are regulated by/regulate nuclear factor kappa B and signal transducer and activator of transcription 3 in esophageal cancer progression. We also discuss how these recent discoveries can pave way for future therapeutic options to treat esophageal squamous cell carcinoma.
Predicted stem-loop structures and variation in nucleotide sequence of 3' noncoding regions among animal calicivirus genomes.

PubMed

Seal, B S; Neill, J D; Ridpath, J F

1994-07-01

Caliciviruses are nonenveloped with a polyadenylated genome of approximately 7.6 kb and a single capsid protein. The "RNA Fold" computer program was used to analyze 3'-terminal noncoding sequences of five feline calicivirus (FCV), rabbit hemorrhagic disease virus (RHDV), and two San Miguel sea lion virus (SMSV) isolates. The FCV 3'-terminal sequences are 40-46 nucleotides in length and 72-91% similar. The FCV sequences were predicted to contain two possible duplex structures and one stem-loop structure with free energies of -2.1 to -18.2 kcal/mole. The RHDV genomic 3'-terminal RNA sequences are 54 nucleotides in length and share 49% sequence similarity to homologous regions of the FCV genome. The RHDV sequence was predicted to form two duplex structures in the 3'-terminal noncoding region with a single stem-loop structure, resembling that of FCV. In contrast, the SMSV 1 and 4 genomic 3'-terminal noncoding sequences were 185 and 182 nucleotides in length, respectively. Ten possible duplex structures were predicted with an average structural free energy of -35 kcal/mole. Sequence similarity between the two SMSV isolates was 75%. Furthermore, extensive cloverleaflike structures are predicted in the 3' noncoding region of the SMSV genome, in contrast to the predicted single stem-loop structures of FCV or RHDV.
Long Noncoding RNAs in Lung Cancer.

PubMed

Roth, Anna; Diederichs, Sven

2016-01-01

Despite great progress in research and treatment options, lung cancer remains the leading cause of cancer-related deaths worldwide. Oncogenic driver mutations in protein-encoding genes were defined and allow for personalized therapies based on genetic diagnoses. Nonetheless, diagnosis of lung cancer mostly occurs at late stages, and chronic treatment is followed by a fast onset of chemoresistance. Hence, there is an urgent need for reliable biomarkers and alternative treatment options. With the era of whole genome and transcriptome sequencing technologies, long noncoding RNAs emerged as a novel class of versatile, functional RNA molecules. Although for most of them the mechanism of action remains to be defined, accumulating evidence confirms their involvement in various aspects of lung tumorigenesis. They are functional on the epigenetic, transcriptional, and posttranscriptional level and are regulators of pathophysiological key pathways including cell growth, apoptosis, and metastasis. Long noncoding RNAs are gaining increasing attention as potential biomarkers and a novel class of druggable molecules. It has become clear that we are only beginning to understand the complexity of tumorigenic processes. The clinical integration of long noncoding RNAs in terms of prognostic and predictive biomarker signatures and additional cancer targets could provide a chance to increase the therapeutic benefit. Here, we review the current knowledge about the expression, regulation, biological function, and clinical relevance of long noncoding RNAs in lung cancer.
Transcriptional dissection of melanoma identifies a high-risk subtype underlying TP53 family genes and epigenome deregulation

PubMed Central

Badal, Brateil; Solovyov, Alexander; Di Cecilia, Serena; Chan, Joseph Minhow; Chang, Li-Wei; Iqbal, Ramiz; Aydin, Iraz T.; Rajan, Geena S.; Chen, Chen; Abbate, Franco; Arora, Kshitij S.; Tanne, Antoine; Gruber, Stephen B.; Johnson, Timothy M.; Fullen, Douglas R.; Phelps, Robert; Bhardwaj, Nina; Bernstein, Emily; Ting, David T.; Brunner, Georg; Schadt, Eric E.; Greenbaum, Benjamin D.; Celebi, Julide Tok

2017-01-01

BACKGROUND. Melanoma is a heterogeneous malignancy. We set out to identify the molecular underpinnings of high-risk melanomas, those that are likely to progress rapidly, metastasize, and result in poor outcomes. METHODS. We examined transcriptome changes from benign states to early-, intermediate-, and late-stage tumors using a set of 78 treatment-naive melanocytic tumors consisting of primary melanomas of the skin and benign melanocytic lesions. We utilized a next-generation sequencing platform that enabled a comprehensive analysis of protein-coding and -noncoding RNA transcripts. RESULTS. Gene expression changes unequivocally discriminated between benign and malignant states, and a dual epigenetic and immune signature emerged defining this transition. To our knowledge, we discovered previously unrecognized melanoma subtypes. A high-risk primary melanoma subset was distinguished by a 122-epigenetic gene signature (“epigenetic” cluster) and TP53 family gene deregulation (TP53, TP63, and TP73). This subtype associated with poor overall survival and showed enrichment of cell cycle genes. Noncoding repetitive element transcripts (LINEs, SINEs, and ERVs) that can result in immunostimulatory signals recapitulating a state of “viral mimicry” were significantly repressed. The high-risk subtype and its poor predictive characteristics were validated in several independent cohorts. Additionally, primary melanomas distinguished by specific immune signatures (“immune” clusters) were identified. CONCLUSION. The TP53 family of genes and genes regulating the epigenetic machinery demonstrate strong prognostic and biological relevance during progression of early disease. Gene expression profiling of protein-coding and -noncoding RNA transcripts may be a better predictor for disease course in melanoma. This study outlines the transcriptional interplay of the cancer cell’s epigenome with the immune milieu with potential for future therapeutic targeting. FUNDING. National Institutes of Health (CA154683, CA158557, CA177940, CA087497-13), Tisch Cancer Institute, Melanoma Research Foundation, the Dow Family Charitable Foundation, and the Icahn School of Medicine at Mount Sinai. PMID:28469092
Regulation of Mammalian Gene Dosage by Long Noncoding RNAs

PubMed Central

Hung, Ko-Hsuan; Wang, Yang; Zhao, Jing Crystal

2013-01-01

Recent transcriptome studies suggest that long noncoding RNAs (lncRNAs) are key components of the mammalian genome, and their study has become a new frontier in biomedical research. In fact, lncRNAs in the mammalian genome were identified and studied at particular epigenetic loci, including imprinted loci and X-chromosome inactivation center, at least two decades ago—long before development of high throughput sequencing technology. Since then, researchers have found that lncRNAs play essential roles in various biological processes, mostly during development. Since much of our understanding of lncRNAs originates from our knowledge of these well-established lncRNAs, in this review we will focus on lncRNAs from the X-chromosome inactivation center and the Dlk1-Dio3 imprinted cluster as examples of lncRNA mechanisms functioning in the epigenetic regulation of mammalian genes. PMID:24970160
Survey of diagnostic tools for detection of viroids and impacts of test results on the seed industry

USDA-ARS?s Scientific Manuscript database

Viroids are unencapsidated, single-stranded, covalently closed circular, highly structured noncoding RNAs of 239 – 401 nucleotides that are replicated by host enzymes and cause disease in several economically important crop plants. Although viroids are primarily and easily transmitted mechanically t...
The Long Noncoding RNA Transcriptome of Dictyostelium discoideum Development.

PubMed

Rosengarten, Rafael D; Santhanam, Balaji; Kokosar, Janez; Shaulsky, Gad

2017-02-09

Dictyostelium discoideum live in the soil as single cells, engulfing bacteria and growing vegetatively. Upon starvation, tens of thousands of amoebae enter a developmental program that includes aggregation, multicellular differentiation, and sporulation. Major shifts across the protein-coding transcriptome accompany these developmental changes. However, no study has presented a global survey of long noncoding RNAs (ncRNAs) in D. discoideum To characterize the antisense and long intergenic noncoding RNA (lncRNA) transcriptome, we analyzed previously published developmental time course samples using an RNA-sequencing (RNA-seq) library preparation method that selectively depletes ribosomal RNAs (rRNAs). We detected the accumulation of transcripts for 9833 protein-coding messenger RNAs (mRNAs), 621 lncRNAs, and 162 putative antisense RNAs (asRNAs). The noncoding RNAs were interspersed throughout the genome, and were distinct in expression level, length, and nucleotide composition. The noncoding transcriptome displayed a temporal profile similar to the coding transcriptome, with stages of gradual change interspersed with larger leaps. The transcription profiles of some noncoding RNAs were strongly correlated with known differentially expressed coding RNAs, hinting at a functional role for these molecules during development. Examining the mitochondrial transcriptome, we modeled two novel antisense transcripts. We applied yet another ribosomal depletion method to a subset of the samples to better retain transfer RNA (tRNA) transcripts. We observed polymorphisms in tRNA anticodons that suggested a post-transcriptional means by which D. discoideum compensates for codons missing in the genomic complement of tRNAs. We concluded that the prevalence and characteristics of long ncRNAs indicate that these molecules are relevant to the progression of molecular and cellular phenotypes during development. Copyright © 2017 Rosengarten et al.
Short-lived non-coding transcripts (SLiTs): Clues to regulatory long non-coding RNA.

PubMed

Tani, Hidenori

2017-03-22

Whole transcriptome analyses have revealed a large number of novel long non-coding RNAs (lncRNAs). Although the importance of lncRNAs has been documented in previous reports, the biological and physiological functions of lncRNAs remain largely unknown. The role of lncRNAs seems an elusive problem. Here, I propose a clue to the identification of regulatory lncRNAs. The key point is RNA half-life. RNAs with a long half-life (t 1/2 > 4 h) contain a significant proportion of ncRNAs, as well as mRNAs involved in housekeeping functions, whereas RNAs with a short half-life (t 1/2 < 4 h) include known regulatory ncRNAs and regulatory mRNAs. This novel class of ncRNAs with a short half-life can be categorized as Short-Lived non-coding Transcripts (SLiTs). I consider that SLiTs are likely to be rich in functionally uncharacterized regulatory RNAs. This review describes recent progress in research into SLiTs.
Regulatory variation: an emerging vantage point for cancer biology.

PubMed

Li, Luolan; Lorzadeh, Alireza; Hirst, Martin

2014-01-01

Transcriptional regulation involves complex and interdependent interactions of noncoding and coding regions of the genome with proteins that interact and modify them. Genetic variation/mutation in coding and noncoding regions of the genome can drive aberrant transcription and disease. In spite of accounting for nearly 98% of the genome comparatively little is known about the contribution of noncoding DNA elements to disease. Genome-wide association studies of complex human diseases including cancer have revealed enrichment for variants in the noncoding genome. A striking finding of recent cancer genome re-sequencing efforts has been the previously underappreciated frequency of mutations in epigenetic modifiers across a wide range of cancer types. Taken together these results point to the importance of dysregulation in transcriptional regulatory control in genesis of cancer. Powered by recent technological advancements in functional genomic profiling, exploration of normal and transformed regulatory networks will provide novel insight into the initiation and progression of cancer and open new windows to future prognostic and diagnostic tools. © 2013 Wiley Periodicals, Inc.
Functional Interplay between Small Non-Coding RNAs and RNA Modification in the Brain.

PubMed

Leighton, Laura J; Bredy, Timothy W

2018-06-07

Small non-coding RNAs are essential for transcription, translation and gene regulation in all cell types, but are particularly important in neurons, with known roles in neurodevelopment, neuroplasticity and neurological disease. Many small non-coding RNAs are directly involved in the post-transcriptional modification of other RNA species, while others are themselves substrates for modification, or are functionally modulated by modification of their target RNAs. In this review, we explore the known and potential functions of several distinct classes of small non-coding RNAs in the mammalian brain, focusing on the newly recognised interplay between the epitranscriptome and the activity of small RNAs. We discuss the potential for this relationship to influence the spatial and temporal dynamics of gene activation in the brain, and predict that further research in the field of epitranscriptomics will identify interactions between small RNAs and RNA modifications which are essential for higher order brain functions such as learning and memory.
Decoding the Emerging Patterns Exhibited in Non-coding RNAs Characteristic of Lung Cancer with Regard to their Clinical Significance.

PubMed

Sonea, Laura; Buse, Mihail; Gulei, Diana; Onaciu, Anca; Simon, Ioan; Braicu, Cornelia; Berindan-Neagoe, Ioana

2018-05-01

Lung cancer continues to be the leading topic concerning global mortality rate caused by can-cer; it needs to be further investigated to reduce these dramatic unfavorable statistic data. Non-coding RNAs (ncRNAs) have been shown to be important cellular regulatory factors and the alteration of their expression levels has become correlated to extensive number of pathologies. Specifically, their expres-sion profiles are correlated with development and progression of lung cancer, generating great interest for further investigation. This review focuses on the complex role of non-coding RNAs, namely miR-NAs, piwi-interacting RNAs, small nucleolar RNAs, long non-coding RNAs and circular RNAs in the process of developing novel biomarkers for diagnostic and prognostic factors that can then be utilized for personalized therapies toward this devastating disease. To support the concept of personalized medi-cine, we will focus on the roles of miRNAs in lung cancer tumorigenesis, their use as diagnostic and prognostic biomarkers and their application for patient therapy.
Long Non-Coding RNAs Regulating Immunity in Insects

PubMed Central

Satyavathi, Valluri; Ghosh, Rupam; Subramanian, Srividya

2017-01-01

Recent advances in modern technology have led to the understanding that not all genetic information is coded into protein and that the genomes of each and every organism including insects produce non-coding RNAs that can control different biological processes. Among RNAs identified in the last decade, long non-coding RNAs (lncRNAs) represent a repertoire of a hidden layer of internal signals that can regulate gene expression in physiological, pathological, and immunological processes. Evidence shows the importance of lncRNAs in the regulation of host–pathogen interactions. In this review, an attempt has been made to view the role of lncRNAs regulating immune responses in insects. PMID:29657286
GAS5 long non-coding RNA in malignant pleural mesothelioma.

PubMed

Renganathan, Arun; Kresoja-Rakic, Jelena; Echeverry, Nohemy; Ziltener, Gabriela; Vrugt, Bart; Opitz, Isabelle; Stahel, Rolf A; Felley-Bosco, Emanuela

2014-05-23

Malignant pleural mesothelioma (MPM) is an aggressive cancer with short overall survival. Long non-coding RNAs (lncRNA) are a class of RNAs more than 200 nucleotides long that do not code for protein and are part of the 90% of the human genome that is transcribed. Earlier experimental studies in mice showed GAS5 (growth arrest specific transcript 5) gene deletion in asbestos driven mesothelioma. GAS5 encodes for a lncRNA whose function is not well known, but it has been shown to act as glucocorticoid receptor decoy and microRNA "sponge". Our aim was to investigate the possible role of the GAS5 in the growth of MPM. Primary MPM cultures grown in serum-free condition in 3% oxygen or MPM cell lines grown in serum-containing medium were used to investigate the modulation of GAS5 by growth arrest after inhibition of Hedgehog or PI3K/mTOR signalling. Cell cycle length was determined by EdU incorporation assay in doxycycline inducible short hairpinGAS5 clones generated from ZL55SPT cells. Gene expression was quantified by quantitative PCR. To investigate the GAS5 promoter, a 0.77 kb sequence was inserted into a pGL3 reporter vector and luciferase activity was determined after transfection into MPM cells. Localization of GAS5 lncRNA was identified by in situ hybridization. To characterize cells expressing GAS5, expression of podoplanin and Ki-67 was assessed by immunohistochemistry. GAS5 expression was lower in MPM cell lines compared to normal mesothelial cells. GAS5 was upregulated upon growth arrest induced by inhibition of Hedgehog and PI3K/mTOR signalling in in vitro MPM models. The increase in GAS5 lncRNA was accompanied by increased promoter activity. Silencing of GAS5 increased the expression of glucocorticoid responsive genes glucocorticoid inducible leucine-zipper and serum/glucocorticoid-regulated kinase-1 and shortened the length of the cell cycle. Drug induced growth arrest was associated with GAS5 accumulation in the nuclei. GAS5 was abundant in tumoral quiescent cells and it was correlated to podoplanin expression. The observations that GAS5 levels modify cell proliferation in vitro, and that GAS5 expression in MPM tissue is associated with cell quiescence and podoplanin expression support a role of GAS5 in MPM biology.
Two distinct promoters drive transcription of the human D1A dopamine receptor gene.

PubMed

Lee, S H; Minowa, M T; Mouradian, M M

1996-10-11

The human D1A dopamine receptor gene has a GC-rich, TATA-less promoter located upstream of a small, noncoding exon 1, which is separated from the coding exon 2 by a 116-base pair (bp)-long intron. Serial 3'-deletions of the 5'-noncoding region of this gene, including the intron and 5'-end of exon 2, resulted in 80 and 40% decrease in transcriptional activity of the upstream promoter in two D1A-expressing neuroblastoma cell lines, SK-N-MC and NS20Y, respectively. To investigate the function of this region, the intron and 245 bp at the 5'-end of exon 2 were investigated. Transient expression analyses using various chloramphenicol acetyltransferase constructs showed that the transcriptional activity of the intron is higher than that of the upstream promoter by 12-fold in SK-N-MC cells and by 5.5-fold in NS20Y cells in an orientation-dependent manner, indicating that the D1A intron is a strong promoter. Primer extension and ribonuclease protection assays revealed that transcription driven by the intron promoter is initiated at the junction of intron and exon 2 and at a cluster of nucleotides located 50 bp downstream from this junction. The same transcription start sites are utilized by the chloramphenicol acetyltransferase constructs employed in transfections as well as by the D1A gene expressed within the human caudate. The relative abundance of D1A transcripts originating from the upstream promoter compared with those transcribed from the intron promoter is 1.5-2.9 times in SK-N-MC cells and 2 times in the human caudate. Transcript stability studies in SK-N-MC cells revealed that longer D1A mRNA molecules containing exon 1 are degraded 1.8 times faster than shorter transcripts lacking exon 1. Although gel mobility shift assay could not detect DNA-protein interaction at the D1A intron, competitive co-transfection using the intron as competitor confirmed the presence of trans-acting factors at the intron. These data taken together indicate that the human D1A gene has two functional TATA-less promoters, both in D1A expressing cultured neuroblastoma cells and in the human striatum.
c-Myc Represses Transcription of Epstein-Barr Virus Latent Membrane Protein 1 Early after Primary B Cell Infection.

PubMed

Price, Alexander M; Messinger, Joshua E; Luftig, Micah A

2018-01-15

Recent evidence has shown that the Epstein-Barr virus (EBV) oncogene LMP1 is not expressed at high levels early after EBV infection of primary B cells, despite its being essential for the long-term outgrowth of immortalized lymphoblastoid cell lines (LCLs). In this study, we found that expression of LMP1 increased 50-fold between 7 days postinfection and the LCL state. Metabolic labeling of nascent transcribed mRNA indicated that this was primarily a transcription-mediated event. EBNA2, the key viral transcription factor regulating LMP1, and CTCF, an important chromatin insulator, were recruited to the LMP1 locus similarly early and late after infection. However, the activating histone H3K9Ac mark was enriched at the LMP1 promoter in LCLs relative to that in infected B cells early after infection. We found that high c-Myc activity in EBV-infected lymphoma cells as well as overexpression of c-Myc in an LCL model system repressed LMP1 transcription. Finally, we found that chemical inhibition of c-Myc both in LCLs and early after primary B cell infection increased LMP1 expression. These data support a model in which high levels of endogenous c-Myc activity induced early after primary B cell infection directly repress LMP1 transcription. IMPORTANCE EBV is a highly successful pathogen that latently infects more than 90% of adults worldwide and is also causally associated with a number of B cell malignancies. During the latent life cycle, EBV expresses a set of viral oncoproteins and noncoding RNAs with the potential to promote cancer. Critical among these is the viral latent membrane protein LMP1. Prior work suggests that LMP1 is essential for EBV to immortalize B cells, but our recent work indicates that LMP1 is not produced at high levels during the first few weeks after infection. Here we show that transcription of the LMP1 gene can be negatively regulated by a host transcription factor, c-Myc. Ultimately, understanding the regulation of EBV oncogenes will allow us to better treat cancers that rely on these viral products for survival. Copyright © 2018 American Society for Microbiology.
CsrB, a noncoding regulatory RNA, is required for BarA-dependent expression of biocontrol traits in Rahnella aquatilis HX2.

PubMed

Mei, Li; Xu, Sanger; Lu, Peng; Lin, Haiping; Guo, Yanbin; Wang, Yongjun

2017-01-01

Rahnella aquatilis is ubiquitous and its certain strains have the applicative potent as a plant growth-promoting rhizobacteria. R. aquatilis HX2 is a biocontrol agent to produce antibacterial substance (ABS) and showed efficient biocontrol against crown gall caused by Agrobacterium vitis on sunflower and grapevine plants. The regulatory network of the ABS production and biocontrol activity is still limited known. In this study, a transposon-mediated mutagenesis strategy was used to investigate the regulators that involved in the biocontrol activity of R. aquatilis HX2. A 366-nt noncoding RNA CsrB was identified in vitro and in vivo, which regulated ABS production and biocontrol activity against crown gall on sunflower plants, respectively. The predicted product of noncoding RNA CsrB contains 14 stem-loop structures and an additional ρ-independent terminator harpin, with 23 characteristic GGA motifs in the loops and other unpaired regions. CsrB is required for ABS production and biocontrol activity in the biocontrol regulation by a two-component regulatory system BarA/UvrY in R. aquatilis HX2. The noncoding RNA CsrB regulates BarA-dependent ABS production and biocontrol activity in R. aquatilis HX2. To the best of our knowledge, this is the first report of noncoding RNA as a regulator for biocontrol function in R. aquatilis.
CsrB, a noncoding regulatory RNA, is required for BarA-dependent expression of biocontrol traits in Rahnella aquatilis HX2

PubMed Central

Lu, Peng; Lin, Haiping; Guo, Yanbin

2017-01-01

Background Rahnella aquatilis is ubiquitous and its certain strains have the applicative potent as a plant growth-promoting rhizobacteria. R. aquatilis HX2 is a biocontrol agent to produce antibacterial substance (ABS) and showed efficient biocontrol against crown gall caused by Agrobacterium vitis on sunflower and grapevine plants. The regulatory network of the ABS production and biocontrol activity is still limited known. Methodology/Principal findings In this study, a transposon-mediated mutagenesis strategy was used to investigate the regulators that involved in the biocontrol activity of R. aquatilis HX2. A 366-nt noncoding RNA CsrB was identified in vitro and in vivo, which regulated ABS production and biocontrol activity against crown gall on sunflower plants, respectively. The predicted product of noncoding RNA CsrB contains 14 stem-loop structures and an additional ρ-independent terminator harpin, with 23 characteristic GGA motifs in the loops and other unpaired regions. CsrB is required for ABS production and biocontrol activity in the biocontrol regulation by a two-component regulatory system BarA/UvrY in R. aquatilis HX2. Conclusion/Significance The noncoding RNA CsrB regulates BarA-dependent ABS production and biocontrol activity in R. aquatilis HX2. To the best of our knowledge, this is the first report of noncoding RNA as a regulator for biocontrol function in R. aquatilis. PMID:29091941
Variations in the non-coding transcriptome as a driver of inter-strain divergence and physiological adaptation in bacteria.

PubMed

Kopf, Matthias; Klähn, Stephan; Scholz, Ingeborg; Hess, Wolfgang R; Voß, Björn

2015-04-22

In all studied organisms, a substantial portion of the transcriptome consists of non-coding RNAs that frequently execute regulatory functions. Here, we have compared the primary transcriptomes of the cyanobacteria Synechocystis sp. PCC 6714 and PCC 6803 under 10 different conditions. These strains share 2854 protein-coding genes and a 16S rRNA identity of 99.4%, indicating their close relatedness. Conserved major transcriptional start sites (TSSs) give rise to non-coding transcripts within the sigB gene, from the 5'UTRs of cmpA and isiA, and 168 loci in antisense orientation. Distinct differences include single nucleotide polymorphisms rendering promoters inactive in one of the strains, e.g., for cmpR and for the asRNA PsbA2R. Based on the genome-wide mapped location, regulation and classification of TSSs, non-coding transcripts were identified as the most dynamic component of the transcriptome. We identified a class of mRNAs that originate by read-through from an sRNA that accumulates as a discrete and abundant transcript while also serving as the 5'UTR. Such an sRNA/mRNA structure, which we name 'actuaton', represents another way for bacteria to remodel their transcriptional network. Our findings support the hypothesis that variations in the non-coding transcriptome constitute a major evolutionary element of inter-strain divergence and capability for physiological adaptation.

Identification of differentially expressed lncRNAs involved in transient regeneration of the neonatal C57BL/6J mouse heart by next-generation high-throughput RNA sequencing.

PubMed

Chen, Yu-Mei; Li, Hua; Fan, Yi; Zhang, Qi-Jun; Li, Xing; Wu, Li-Jie; Chen, Zi-Jie; Zhu, Chun; Qian, Ling-Mei

2017-04-25

Previous studies have shown that mammalian cardiac tissue has a regenerative capacity. Remarkably, neonatal mice can regenerate their cardiac tissue for up to 6 days after birth, but this capacity is lost by day 7. In this study, we aimed to explore the expression pattern of long noncoding RNA (lncRNA) during this period and examine the mechanisms underlying this process. We found that 685 lncRNAs and 1833 mRNAs were differentially expressed at P1 and P7 by the next-generation high-throughput RNA sequencing. The coding genes associated with differentially expressed lncRNAs were mainly involved in metabolic processes and cell proliferation, and also were potentially associated with several key regeneration signalling pathways, including PI3K-Akt, MAPK, Hippo and Wnt. In addition, we identified some correlated targets of highly-dysregulated lncRNAs such as Igfbp3, Trnp1, Itgb6, and Pim3 by the coding-noncoding gene co-expression network. These data may offer a reference resource for further investigation about the mechanisms by which lncRNAs regulate cardiac regeneration.
Discovery and Characterization of PRCAT47: A Novel Prostate Lineage and Cancer-Specific Long Noncoding RNA

DTIC Science & Technology

2017-07-01

targeting PRCAT47. ASOs were able to knock down PRCAT47 at high efficacy. Genes regulated upon ASO-mediated knockdown are highly correlated with...PRCAT47. ASOs were able to knock down PRCAT47 at high efficacy. Genes regulated upon ASO-mediated knockdown are highly correlated with that of siRNA...The following section will highlight the progress made in each sub-aims/ tasks proposed in the grant, including a detailed description of methods
Coding and small non-coding transcriptional landscape of tuberous sclerosis complex cortical tubers: implications for pathophysiology and treatment.

PubMed

Mills, James D; Iyer, Anand M; van Scheppingen, Jackelien; Bongaarts, Anika; Anink, Jasper J; Janssen, Bart; Zimmer, Till S; Spliet, Wim G; van Rijen, Peter C; Jansen, Floor E; Feucht, Martha; Hainfellner, Johannes A; Krsek, Pavel; Zamecnik, Josef; Kotulska, Katarzyna; Jozwiak, Sergiusz; Jansen, Anna; Lagae, Lieven; Curatolo, Paolo; Kwiatkowski, David J; Pasterkamp, R Jeroen; Senthilkumar, Ketharini; von Oerthel, Lars; Hoekman, Marco F; Gorter, Jan A; Crino, Peter B; Mühlebner, Angelika; Scicluna, Brendon P; Aronica, Eleonora

2017-08-14

Tuberous Sclerosis Complex (TSC) is a rare genetic disorder that results from a mutation in the TSC1 or TSC2 genes leading to constitutive activation of the mechanistic target of rapamycin complex 1 (mTORC1). TSC is associated with autism, intellectual disability and severe epilepsy. Cortical tubers are believed to represent the neuropathological substrates of these disabling manifestations in TSC. In the presented study we used high-throughput RNA sequencing in combination with systems-based computational approaches to investigate the complexity of the TSC molecular network. Overall we detected 438 differentially expressed genes and 991 differentially expressed small non-coding RNAs in cortical tubers compared to autopsy control brain tissue. We observed increased expression of genes associated with inflammatory, innate and adaptive immune responses. In contrast, we observed a down-regulation of genes associated with neurogenesis and glutamate receptor signaling. MicroRNAs represented the largest class of over-expressed small non-coding RNA species in tubers. In particular, our analysis revealed that the miR-34 family (including miR-34a, miR-34b and miR-34c) was significantly over-expressed. Functional studies demonstrated the ability of miR-34b to modulate neurite outgrowth in mouse primary hippocampal neuronal cultures. This study provides new insights into the TSC transcriptomic network along with the identification of potential new treatment targets.
Re-annotation, improved large-scale assembly and establishment of a catalogue of noncoding loci for the genome of the model brown alga Ectocarpus.

PubMed

Cormier, Alexandre; Avia, Komlan; Sterck, Lieven; Derrien, Thomas; Wucher, Valentin; Andres, Gwendoline; Monsoor, Misharl; Godfroy, Olivier; Lipinska, Agnieszka; Perrineau, Marie-Mathilde; Van De Peer, Yves; Hitte, Christophe; Corre, Erwan; Coelho, Susana M; Cock, J Mark

2017-04-01

The genome of the filamentous brown alga Ectocarpus was the first to be completely sequenced from within the brown algal group and has served as a key reference genome both for this lineage and for the stramenopiles. We present a complete structural and functional reannotation of the Ectocarpus genome. The large-scale assembly of the Ectocarpus genome was significantly improved and genome-wide gene re-annotation using extensive RNA-seq data improved the structure of 11 108 existing protein-coding genes and added 2030 new loci. A genome-wide analysis of splicing isoforms identified an average of 1.6 transcripts per locus. A large number of previously undescribed noncoding genes were identified and annotated, including 717 loci that produce long noncoding RNAs. Conservation of lncRNAs between Ectocarpus and another brown alga, the kelp Saccharina japonica, suggests that at least a proportion of these loci serve a function. Finally, a large collection of single nucleotide polymorphism-based markers was developed for genetic analyses. These resources are available through an updated and improved genome database. This study significantly improves the utility of the Ectocarpus genome as a high-quality reference for the study of many important aspects of brown algal biology and as a reference for genomic analyses across the stramenopiles. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.
The long non-coding RNA PARTICLE is associated with WWOX and the absence of FRA16D breakage in osteosarcoma patients.

PubMed

O'Leary, Valerie Bríd; Maugg, Doris; Smida, Jan; Baumhoer, Daniel; Nathrath, Michaela; Ovsepian, Saak Victor; Atkinson, Michael John

2017-10-20

Breakage of the fragile site FRA16D disrupts the WWOX (WW Domain Containing Oxidoreductase) tumor suppressor gene in osteosarcoma. However, the frequency of breakage is not sufficient to explain the rate of WWOX loss in pathogenesis. The involvement of non-coding RNA transcripts is proposed due to their accumulation at fragile sites, where they are advocated to influence specific chromosomal regions associated with malignancy. The long ncRNA PARTICLE (promoter of MAT2A antisense radiation-induced circulating long non-coding RNA) is transiently elevated in response to irradiation and influences epigenetic silencing modification within WWOX . It now emerges that elevated PARTICLE levels are significantly associated with FRA16D non-breakage in OS patients. Although not associated with overall survival, high PARTICLE levels were found to be significantly linked to metastasis free outcome. The transcription of both PARTICLE and WWOX are transiently responsive to exposure to low doses of radiation in osteosarcoma cell lines. Herein, a relationship between WWOX and PARTICLE transcription is suggested in human osteosarcoma cell lines representing alternative genetic backgrounds. PARTICLE over-expression ameliorated WWOX promoter activity in U2OS harboring FRA16D non-breakage. It can be concluded that the lncRNA PARTICLE influences the WWOX tumor suppressor and in the absence of WWOX FRA16D breakage, it is associated with OS metastasis-free survival.
Disease-Causing 7.4 kb Cis-Regulatory Deletion Disrupting Conserved Non-Coding Sequences and Their Interaction with the FOXL2 Promotor: Implications for Mutation Screening

PubMed Central

Dostie, Josée; Lemire, Edmond; Bouchard, Philippe; Field, Michael; Jones, Kristie; Lorenz, Birgit; Menten, Björn; Buysse, Karen; Pattyn, Filip; Friedli, Marc; Ucla, Catherine; Rossier, Colette; Wyss, Carine; Speleman, Frank; De Paepe, Anne; Dekker, Job; Antonarakis, Stylianos E.; De Baere, Elfride

2009-01-01

To date, the contribution of disrupted potentially cis-regulatory conserved non-coding sequences (CNCs) to human disease is most likely underestimated, as no systematic screens for putative deleterious variations in CNCs have been conducted. As a model for monogenic disease we studied the involvement of genetic changes of CNCs in the cis-regulatory domain of FOXL2 in blepharophimosis syndrome (BPES). Fifty-seven molecularly unsolved BPES patients underwent high-resolution copy number screening and targeted sequencing of CNCs. Apart from three larger distant deletions, a de novo deletion as small as 7.4 kb was found at 283 kb 5′ to FOXL2. The deletion appeared to be triggered by an H-DNA-induced double-stranded break (DSB). In addition, it disrupts a novel long non-coding RNA (ncRNA) PISRT1 and 8 CNCs. The regulatory potential of the deleted CNCs was substantiated by in vitro luciferase assays. Interestingly, Chromosome Conformation Capture (3C) of a 625 kb region surrounding FOXL2 in expressing cellular systems revealed physical interactions of three upstream fragments and the FOXL2 core promoter. Importantly, one of these contains the 7.4 kb deleted fragment. Overall, this study revealed the smallest distant deletion causing monogenic disease and impacts upon the concept of mutation screening in human disease and developmental disorders in particular. PMID:19543368
Studying the genetic basis of speciation in high gene flow marine invertebrates

PubMed Central

2016-01-01

A growing number of genes responsible for reproductive incompatibilities between species (barrier loci) exhibit the signals of positive selection. However, the possibility that genes experiencing positive selection diverge early in speciation and commonly cause reproductive incompatibilities has not been systematically investigated on a genome-wide scale. Here, I outline a research program for studying the genetic basis of speciation in broadcast spawning marine invertebrates that uses a priori genome-wide information on a large, unbiased sample of genes tested for positive selection. A targeted sequence capture approach is proposed that scores single-nucleotide polymorphisms (SNPs) in widely separated species populations at an early stage of allopatric divergence. The targeted capture of both coding and non-coding sequences enables SNPs to be characterized at known locations across the genome and at genes with known selective or neutral histories. The neutral coding and non-coding SNPs provide robust background distributions for identifying FST-outliers within genes that can, in principle, identify specific mutations experiencing diversifying selection. If natural hybridization occurs between species, the neutral coding and non-coding SNPs can provide a neutral admixture model for genomic clines analyses aimed at finding genes exhibiting strong blocks to introgression. Strongylocentrotid sea urchins are used as a model system to outline the approach but it can be used for any group that has a complete reference genome available. PMID:29491951
Deep sequencing reveals unique small RNA repertoire that is regulated during head regeneration in Hydra magnipapillata.

PubMed

Krishna, Srikar; Nair, Aparna; Cheedipudi, Sirisha; Poduval, Deepak; Dhawan, Jyotsna; Palakodeti, Dasaradhi; Ghanekar, Yashoda

2013-01-07

Small non-coding RNAs such as miRNAs, piRNAs and endo-siRNAs fine-tune gene expression through post-transcriptional regulation, modulating important processes in development, differentiation, homeostasis and regeneration. Using deep sequencing, we have profiled small non-coding RNAs in Hydra magnipapillata and investigated changes in small RNA expression pattern during head regeneration. Our results reveal a unique repertoire of small RNAs in hydra. We have identified 126 miRNA loci; 123 of these miRNAs are unique to hydra. Less than 50% are conserved across two different strains of Hydra vulgaris tested in this study, indicating a highly diverse nature of hydra miRNAs in contrast to bilaterian miRNAs. We also identified siRNAs derived from precursors with perfect stem-loop structure and that arise from inverted repeats. piRNAs were the most abundant small RNAs in hydra, mapping to transposable elements, the annotated transcriptome and unique non-coding regions on the genome. piRNAs that map to transposable elements and the annotated transcriptome display a ping-pong signature. Further, we have identified several miRNAs and piRNAs whose expression is regulated during hydra head regeneration. Our study defines different classes of small RNAs in this cnidarian model system, which may play a role in orchestrating gene expression essential for hydra regeneration.
Deep sequencing reveals unique small RNA repertoire that is regulated during head regeneration in Hydra magnipapillata

PubMed Central

Krishna, Srikar; Nair, Aparna; Cheedipudi, Sirisha; Poduval, Deepak; Dhawan, Jyotsna; Palakodeti, Dasaradhi; Ghanekar, Yashoda

2013-01-01

Small non-coding RNAs such as miRNAs, piRNAs and endo-siRNAs fine-tune gene expression through post-transcriptional regulation, modulating important processes in development, differentiation, homeostasis and regeneration. Using deep sequencing, we have profiled small non-coding RNAs in Hydra magnipapillata and investigated changes in small RNA expression pattern during head regeneration. Our results reveal a unique repertoire of small RNAs in hydra. We have identified 126 miRNA loci; 123 of these miRNAs are unique to hydra. Less than 50% are conserved across two different strains of Hydra vulgaris tested in this study, indicating a highly diverse nature of hydra miRNAs in contrast to bilaterian miRNAs. We also identified siRNAs derived from precursors with perfect stem–loop structure and that arise from inverted repeats. piRNAs were the most abundant small RNAs in hydra, mapping to transposable elements, the annotated transcriptome and unique non-coding regions on the genome. piRNAs that map to transposable elements and the annotated transcriptome display a ping–pong signature. Further, we have identified several miRNAs and piRNAs whose expression is regulated during hydra head regeneration. Our study defines different classes of small RNAs in this cnidarian model system, which may play a role in orchestrating gene expression essential for hydra regeneration. PMID:23166307
Arabidopsis intragenomic conserved noncoding sequence

PubMed Central

Thomas, Brian C.; Rapaka, Lakshmi; Lyons, Eric; Pedersen, Brent; Freeling, Michael

2007-01-01

After the most recent tetraploidy in the Arabidopsis lineage, most gene pairs lost one, but not both, of their duplicates. We manually inspected the 3,179 retained gene pairs and their surrounding gene space still present in the genome using a custom-made viewer application. The display of these pairs allowed us to define intragenic conserved noncoding sequences (CNSs), identify exon annotation errors, and discover potentially new genes. Using a strict algorithm to sort high-scoring pair sequences from the bl2seq data, we created a database of 14,944 intragenomic Arabidopsis CNSs. The mean CNS length is 31 bp, ranging from 15 to 285 bp. There are ≈1.7 CNSs associated with a typical gene, and Arabidopsis CNSs are found in all areas around exons, most frequently in the 5′ upstream region. Gene ontology classifications related to transcription, regulation, or “response to …” external or endogenous stimuli, especially hormones, tend to be significantly overrepresented among genes containing a large number of CNSs, whereas protein localization, transport, and metabolism are common among genes with no CNSs. There is a 1.5% overlap between these CNSs and the 218,982 putative RNAs in the Arabidopsis Small RNA Project database, allowing for two mismatches. These CNSs provide a unique set of noncoding sequences enriched for function. CNS function is implied by evolutionary conservation and independently supported because CNS-richness predicts regulatory gene ontology categories. PMID:17301222
Transcriptomics Profiling of Alzheimer’s Disease Reveal Neurovascular Defects, Altered Amyloid-β Homeostasis, and Deregulated Expression of Long Noncoding RNAs

PubMed Central

Magistri, Marco; Velmeshev, Dmitry; Makhmutova, Madina; Faghihi, Mohammad Ali

2015-01-01

Abstract The underlying genetic variations of late-onset Alzheimer’s disease (LOAD) cases remain largely unknown. A combination of genetic variations with variable penetrance and lifetime epigenetic factors may converge on transcriptomic alterations that drive LOAD pathological process. Transcriptome profiling using deep sequencing technology offers insight into common altered pathways regardless of underpinning genetic or epigenetic factors and thus represents an ideal tool to investigate molecular mechanisms related to the pathophysiology of LOAD. We performed directional RNA sequencing on high quality RNA samples extracted from hippocampi of LOAD and age-matched controls. We further validated our data using qRT-PCR on a larger set of postmortem brain tissues, confirming downregulation of the gene encoding substance P (TAC1) and upregulation of the gene encoding the plasminogen activator inhibitor-1 (SERPINE1). Pathway analysis indicates dysregulation in neural communication, cerebral vasculature, and amyloid-β clearance. Beside protein coding genes, we identified several annotated and non-annotated long noncoding RNAs that are differentially expressed in LOAD brain tissues, three of them are activity-dependent regulated and one is induced by Aβ1 - 42 exposure of human neural cells. Our data provide a comprehensive list of transcriptomics alterations in LOAD hippocampi and warrant holistic approach including both coding and non-coding RNAs in functional studies aimed to understand the pathophysiology of LOAD. PMID:26402107
Identification of long non-coding RNAs in two anthozoan species and their possible implications for coral bleaching.

PubMed

Huang, Chen; Morlighem, Jean-Étienne R L; Cai, Jing; Liao, Qiwen; Perez, Carlos Daniel; Gomes, Paula Braga; Guo, Min; Rádis-Baptista, Gandhi; Lee, Simon Ming-Yuen

2017-07-13

Long non-coding RNAs (lncRNAs) have been shown to play regulatory roles in a diverse range of biological processes and are associated with the outcomes of various diseases. The majority of studies about lncRNAs focus on model organisms, with lessened investigation in non-model organisms to date. Herein, we have undertaken an investigation on lncRNA in two zoanthids (cnidarian): Protolpalythoa varibilis and Palythoa caribaeorum. A total of 11,206 and 13,240 lncRNAs were detected in P. variabilis and P. caribaeorum transcriptome, respectively. Comparison using NONCODE database indicated that the majority of these lncRNAs is taxonomically species-restricted with no identifiable orthologs. Even so, we found cases in which short regions of P. caribaeorum's lncRNAs were similar to vertebrate species' lncRNAs, and could be associated with lncRNA conserved regulatory functions. Consequently, some high-confidence lncRNA-mRNA interactions were predicted based on such conserved regions, therefore revealing possible involvement of lncRNAs in posttranscriptional processing and regulation in anthozoans. Moreover, investigation of differentially expressed lncRNAs, in healthy colonies and colonial individuals undergoing natural bleaching, indicated that some up-regulated lncRNAs in P. caribaeorum could posttranscriptionally regulate the mRNAs encoding proteins of Ras-mediated signal transduction pathway and components of innate immune-system, which could contribute to the molecular response of coral bleaching.
RNASeq-based genome annotation and identification of long-noncoding RNAs in the grapevine cultivar 'Riesling'

USDA-ARS?s Scientific Manuscript database

The technological advances of RNA-seq and de novo transcriptome assembly have enabled genome annotation and transcriptome profiling in heterozygous species. This is a promising approach to improving the annotation of the reference genome sequence of grapevine (Vitis vinifera L.), a species of high-l...
Genome-wide identification of microRNAs in pomegranate (Punica granatum L.) by high-throughput sequencing

USDA-ARS?s Scientific Manuscript database

Background: MicroRNAs (miRNAs), a class of small non-coding endogenous RNAs that regulate gene expression post-transcriptionally, play multiple key roles in plant growth and development and in biotic and abiotic stress response. Knowledge and roles of miRNAs in pomegranate fruit development have not...
The 'dark matter' in the plant genomes: non-coding and unannotated DNA sequences associated with open chromatin.

PubMed

Jiang, Jiming

2015-04-01

Sequencing of complete plant genomes has become increasingly more routine since the advent of the next-generation sequencing technology. Identification and annotation of large amounts of noncoding but functional DNA sequences, including cis-regulatory DNA elements (CREs), have become a new frontier in plant genome research. Genomic regions containing active CREs bound to regulatory proteins are hypersensitive to DNase I digestion and are called DNase I hypersensitive sites (DHSs). Several recent DHS studies in plants illustrate that DHS datasets produced by DNase I digestion followed by next-generation sequencing (DNase-seq) are highly valuable for the identification and characterization of CREs associated with plant development and responses to environmental cues. DHS-based genomic profiling has opened a door to identify and annotate the 'dark matter' in sequenced plant genomes. Copyright © 2015 Elsevier Ltd. All rights reserved.
RNA regulatory networks in animals and plants: a long noncoding RNA perspective.

PubMed

Bai, Youhuang; Dai, Xiaozhuan; Harrison, Andrew P; Chen, Ming

2015-03-01

A recent highlight of genomics research has been the discovery of many families of transcripts which have function but do not code for proteins. An important group is long noncoding RNAs (lncRNAs), which are typically longer than 200 nt, and whose members originate from thousands of loci across genomes. We review progress in understanding the biogenesis and regulatory mechanisms of lncRNAs. We describe diverse computational and high throughput technologies for identifying and studying lncRNAs. We discuss the current knowledge of functional elements embedded in lncRNAs as well as insights into the lncRNA-based regulatory network in animals. We also describe genome-wide studies of large amount of lncRNAs in plants, as well as knowledge of selected plant lncRNAs with a focus on biotic/abiotic stress-responsive lncRNAs. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Secondary structure of the 3'-noncoding region of flavivirus genomes: comparative analysis of base pairing probabilities.

PubMed

Rauscher, S; Flamm, C; Mandl, C W; Heinz, F X; Stadler, P F

1997-07-01

The prediction of the complete matrix of base pairing probabilities was applied to the 3' noncoding region (NCR) of flavivirus genomes. This approach identifies not only well-defined secondary structure elements, but also regions of high structural flexibility. Flaviviruses, many of which are important human pathogens, have a common genomic organization, but exhibit a significant degree of RNA sequence diversity in the functionally important 3'-NCR. We demonstrate the presence of secondary structures shared by all flaviviruses, as well as structural features that are characteristic for groups of viruses within the genus reflecting the established classification scheme. The significance of most of the predicted structures is corroborated by compensatory mutations. The availability of infectious clones for several flaviviruses will allow the assessment of these structural elements in processes of the viral life cycle, such as replication and assembly.
The plastid genomes of nonphotosynthetic algae are not so small after all

PubMed Central

Figueroa-Martinez, Francisco; Nedelcu, Aurora M.; Reyes-Prieto, Adrian

2017-01-01

ABSTRACT The thing about plastid genomes in nonphotosynthetic plants and algae is that they are usually very small and highly compact. This is not surprising: a heterotrophic existence means that genes for photosynthesis can be easily discarded. But the loss of photosynthesis cannot explain why the plastomes of heterotrophs are so often depauperate in noncoding DNA. If plastid genomes from photosynthetic taxa can span the gamut of compactness, why can't those of nonphotosynthetic species? Well, recently we showed that they can. The free-living, heterotrophic green alga Polytoma uvella has a plastid genome boasting more than 165 kilobases of noncoding DNA, making it the most bloated plastome yet found in a heterotroph. In this addendum to the primary study, we elaborate on why the P. uvella plastome is so inflated, discussing the potential impact of a free-living vs. parasitic lifestyle on plastid genome expansion in nonphotosynthetic lineages. PMID:28377793
Identification of small non-coding RNA classes expressed in swine whole blood during HP-PRRSV infection.

PubMed

Fleming, Damarius S; Miller, Laura C

2018-04-01

It has been established that reduced susceptibility to porcine reproductive and respiratory syndrome virus (PRRSV) has a genetic component. This genetic component may take the form of small non-coding RNAs (sncRNA), which are molecules that function as regulators of gene expression. Various sncRNAs have emerged as having an important role in the immune system in humans. The study uses transcriptomic read counts to profile the type and quantity of both well and lesser characterized sncRNAs, such as microRNAs and small nucleolar RNAs to identify and quantify the classes of sncRNA expressed in whole blood between healthy and highly pathogenic PRRSV-infected pigs. Our results returned evidence on nine classes of sncRNA, four of which were consistently statistically significantly different based on Fisher's Exact Test, that can be detected and possibly interrogated for their effect on host dysregulation during PRRSV infections. Published by Elsevier Inc.
Fluorogenic RNA Mango aptamers for imaging small non-coding RNAs in mammalian cells.

PubMed

Autour, Alexis; C Y Jeng, Sunny; D Cawte, Adam; Abdolahzadeh, Amir; Galli, Angela; Panchapakesan, Shanker S S; Rueda, David; Ryckelynck, Michael; Unrau, Peter J

2018-02-13

Despite having many key roles in cellular biology, directly imaging biologically important RNAs has been hindered by a lack of fluorescent tools equivalent to the fluorescent proteins available to study cellular proteins. Ideal RNA labelling systems must preserve biological function, have photophysical properties similar to existing fluorescent proteins, and be compatible with established live and fixed cell protein labelling strategies. Here, we report a microfluidics-based selection of three new high-affinity RNA Mango fluorogenic aptamers. Two of these are as bright or brighter than enhanced GFP when bound to TO1-Biotin. Furthermore, we show that the new Mangos can accurately image the subcellular localization of three small non-coding RNAs (5S, U6, and a box C/D scaRNA) in fixed and live mammalian cells. These new aptamers have many potential applications to study RNA function and dynamics both in vitro and in mammalian cells.

High OCT4A levels drive tumorigenicity and metastatic potential of medulloblastoma cells

PubMed Central

Gonçalves da Silva, Patrícia Benites; Teixeira dos Santos, Márcia Cristina; Rodini, Carolina Oliveira; Kaid, Carolini; Leite Pereira, Márcia Cristina; Furukawa, Gabriela; Gimenes da Cruz, Daniel Sanzio; Goldfeder, Mauricio Barbugiani; Reily Rocha, Clarissa Ribeiro; Rosenberg, Carla; Okamoto, Oswaldo Keith

2017-01-01

Medulloblastoma is a highly aggressive pediatric brain tumor, in which sporadic expression of the pluripotency factor OCT4 has been recently correlated with poor patient survival. However the contribution of specific OCT4 isoforms to tumor aggressiveness is still poorly understood. Here, we report that medulloblastoma cells stably overexpressing the OCT4A isoform displayed enhanced clonogenic, tumorsphere generation, and invasion capabilities. Moreover, in an orthotopic metastatic model of medulloblastoma, OCT4A overexpressing cells generated more developed, aggressive and infiltrative tumors, with tumor-bearing mice attaining advanced metastatic disease and shorter survival rates. Pro-oncogenic OCT4A effects were expression-level dependent and accompanied by distinct chromosomal aberrations. OCT4A overexpression in medulloblastoma cells also induced a marked differential expression of non-coding RNAs, including poorly characterized long non-coding RNAs and small nucleolar RNAs. Altogether, our findings support the relevance of pluripotency-related factors in the aggravation of medulloblastoma traits classically associated with poor clinical outcome, and underscore the prognostic and therapeutic value of OCT4A in this challenging type of pediatric brain cancer. PMID:28186969
High OCT4A levels drive tumorigenicity and metastatic potential of medulloblastoma cells.

PubMed

da Silva, Patrícia Benites Gonçalves; Teixeira Dos Santos, Márcia Cristina; Rodini, Carolina Oliveira; Kaid, Carolini; Pereira, Márcia Cristina Leite; Furukawa, Gabriela; da Cruz, Daniel Sanzio Gimenes; Goldfeder, Mauricio Barbugiani; Rocha, Clarissa Ribeiro Reily; Rosenberg, Carla; Okamoto, Oswaldo Keith

2017-03-21

Medulloblastoma is a highly aggressive pediatric brain tumor, in which sporadic expression of the pluripotency factor OCT4 has been recently correlated with poor patient survival. However the contribution of specific OCT4 isoforms to tumor aggressiveness is still poorly understood. Here, we report that medulloblastoma cells stably overexpressing the OCT4A isoform displayed enhanced clonogenic, tumorsphere generation, and invasion capabilities. Moreover, in an orthotopic metastatic model of medulloblastoma, OCT4A overexpressing cells generated more developed, aggressive and infiltrative tumors, with tumor-bearing mice attaining advanced metastatic disease and shorter survival rates. Pro-oncogenic OCT4A effects were expression-level dependent and accompanied by distinct chromosomal aberrations. OCT4A overexpression in medulloblastoma cells also induced a marked differential expression of non-coding RNAs, including poorly characterized long non-coding RNAs and small nucleolar RNAs. Altogether, our findings support the relevance of pluripotency-related factors in the aggravation of medulloblastoma traits classically associated with poor clinical outcome, and underscore the prognostic and therapeutic value of OCT4A in this challenging type of pediatric brain cancer.
Efficient CRISPR/Cas9-Mediated Versatile, Predictable, and Donor-Free Gene Knockout in Human Pluripotent Stem Cells.

PubMed

Liu, Zhongliang; Hui, Yi; Shi, Lei; Chen, Zhenyu; Xu, Xiangjie; Chi, Liankai; Fan, Beibei; Fang, Yujiang; Liu, Yang; Ma, Lin; Wang, Yiran; Xiao, Lei; Zhang, Quanbin; Jin, Guohua; Liu, Ling; Zhang, Xiaoqing

2016-09-13

Loss-of-function studies in human pluripotent stem cells (hPSCs) require efficient methodologies for lesion of genes of interest. Here, we introduce a donor-free paired gRNA-guided CRISPR/Cas9 knockout strategy (paired-KO) for efficient and rapid gene ablation in hPSCs. Through paired-KO, we succeeded in targeting all genes of interest with high biallelic targeting efficiencies. More importantly, during paired-KO, the cleaved DNA was repaired mostly through direct end joining without insertions/deletions (precise ligation), and thus makes the lesion product predictable. The paired-KO remained highly efficient for one-step targeting of multiple genes and was also efficient for targeting of microRNA, while for long non-coding RNA over 8 kb, cleavage of a short fragment of the core promoter region was sufficient to eradicate downstream gene transcription. This work suggests that the paired-KO strategy is a simple and robust system for loss-of-function studies for both coding and non-coding genes in hPSCs. Copyright © 2016 The Author(s). Published by Elsevier Inc. All rights reserved.
Simultaneous sequencing of coding and noncoding RNA reveals a human transcriptome dominated by a small number of highly expressed noncoding genes.

PubMed

Boivin, Vincent; Deschamps-Francoeur, Gabrielle; Couture, Sonia; Nottingham, Ryan M; Bouchard-Bourelle, Philia; Lambowitz, Alan M; Scott, Michelle S; Abou-Elela, Sherif

2018-07-01

Comparing the abundance of one RNA molecule to another is crucial for understanding cellular functions but most sequencing techniques can target only specific subsets of RNA. In this study, we used a new fragmented ribodepleted TGIRT sequencing method that uses a thermostable group II intron reverse transcriptase (TGIRT) to generate a portrait of the human transcriptome depicting the quantitative relationship of all classes of nonribosomal RNA longer than 60 nt. Comparison between different sequencing methods indicated that FRT is more accurate in ranking both mRNA and noncoding RNA than viral reverse transcriptase-based sequencing methods, even those that specifically target these species. Measurements of RNA abundance in different cell lines using this method correlate with biochemical estimates, confirming tRNA as the most abundant nonribosomal RNA biotype. However, the single most abundant transcript is 7SL RNA, a component of the signal recognition particle. S tructured n on c oding RNAs (sncRNAs) associated with the same biological process are expressed at similar levels, with the exception of RNAs with multiple functions like U1 snRNA. In general, sncRNAs forming RNPs are hundreds to thousands of times more abundant than their mRNA counterparts. Surprisingly, only 50 sncRNA genes produce half of the non-rRNA transcripts detected in two different cell lines. Together the results indicate that the human transcriptome is dominated by a small number of highly expressed sncRNAs specializing in functions related to translation and splicing. © 2018 Boivin et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Multiplexed direct genomic selection (MDiGS): a pooled BAC capture approach for highly accurate CNV and SNP/INDEL detection.

PubMed

Alvarado, David M; Yang, Ping; Druley, Todd E; Lovett, Michael; Gurnett, Christina A

2014-06-01

Despite declining sequencing costs, few methods are available for cost-effective single-nucleotide polymorphism (SNP), insertion/deletion (INDEL) and copy number variation (CNV) discovery in a single assay. Commercially available methods require a high investment to a specific region and are only cost-effective for large samples. Here, we introduce a novel, flexible approach for multiplexed targeted sequencing and CNV analysis of large genomic regions called multiplexed direct genomic selection (MDiGS). MDiGS combines biotinylated bacterial artificial chromosome (BAC) capture and multiplexed pooled capture for SNP/INDEL and CNV detection of 96 multiplexed samples on a single MiSeq run. MDiGS is advantageous over other methods for CNV detection because pooled sample capture and hybridization to large contiguous BAC baits reduces sample and probe hybridization variability inherent in other methods. We performed MDiGS capture for three chromosomal regions consisting of ∼ 550 kb of coding and non-coding sequence with DNA from 253 patients with congenital lower limb disorders. PITX1 nonsense and HOXC11 S191F missense mutations were identified that segregate in clubfoot families. Using a novel pooled-capture reference strategy, we identified recurrent chromosome chr17q23.1q23.2 duplications and small HOXC 5' cluster deletions (51 kb and 12 kb). Given the current interest in coding and non-coding variants in human disease, MDiGS fulfills a niche for comprehensive and low-cost evaluation of CNVs, coding, and non-coding variants across candidate regions of interest. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Allele-Selective Transcriptome Recruitment to Polysomes Primed for Translation: Protein-Coding and Noncoding RNAs, and RNA Isoforms.

PubMed

Mascarenhas, Roshan; Pietrzak, Maciej; Smith, Ryan M; Webb, Amy; Wang, Danxin; Papp, Audrey C; Pinsonneault, Julia K; Seweryn, Michal; Rempala, Grzegorz; Sadee, Wolfgang

2015-01-01

mRNA translation into proteins is highly regulated, but the role of mRNA isoforms, noncoding RNAs (ncRNAs), and genetic variants remains poorly understood. mRNA levels on polysomes have been shown to correlate well with expressed protein levels, pointing to polysomal loading as a critical factor. To study regulation and genetic factors of protein translation we measured levels and allelic ratios of mRNAs and ncRNAs (including microRNAs) in lymphoblast cell lines (LCL) and in polysomal fractions. We first used targeted assays to measure polysomal loading of mRNA alleles, confirming reported genetic effects on translation of OPRM1 and NAT1, and detecting no effect of rs1045642 (3435C>T) in ABCB1 (MDR1) on polysomal loading while supporting previous results showing increased mRNA turnover of the 3435T allele. Use of high-throughput sequencing of complete transcript profiles (RNA-Seq) in three LCLs revealed significant differences in polysomal loading of individual RNA classes and isoforms. Correlated polysomal distribution between protein-coding and non-coding RNAs suggests interactions between them. Allele-selective polysome recruitment revealed strong genetic influence for multiple RNAs, attributable either to differential expression of RNA isoforms or to differential loading onto polysomes, the latter defining a direct genetic effect on translation. Genes identified by different allelic RNA ratios between cytosol and polysomes were enriched with published expression quantitative trait loci (eQTLs) affecting RNA functions, and associations with clinical phenotypes. Polysomal RNA-Seq combined with allelic ratio analysis provides a powerful approach to study polysomal RNA recruitment and regulatory variants affecting protein translation.
The Voice Transcription Technique: Use of Voice Recognition Software to Transcribe Digital Interview Data in Qualitative Research

ERIC Educational Resources Information Center

Matheson, Jennifer L.

2007-01-01

Transcribing interview data is a time-consuming task that most qualitative researchers dislike. Transcribing is even more difficult for people with physical limitations because traditional transcribing requires manual dexterity and the ability to sit at a computer for long stretches of time. Researchers have begun to explore using an automated…
TFIIS-Dependent Non-coding Transcription Regulates Developmental Genome Rearrangements

PubMed Central

Maliszewska-Olejniczak, Kamila; Gruchota, Julita; Gromadka, Robert; Denby Wilkes, Cyril; Arnaiz, Olivier; Mathy, Nathalie; Duharcourt, Sandra; Bétermier, Mireille; Nowak, Jacek K.

2015-01-01

Because of their nuclear dimorphism, ciliates provide a unique opportunity to study the role of non-coding RNAs (ncRNAs) in the communication between germline and somatic lineages. In these unicellular eukaryotes, a new somatic nucleus develops at each sexual cycle from a copy of the zygotic (germline) nucleus, while the old somatic nucleus degenerates. In the ciliate Paramecium tetraurelia, the genome is massively rearranged during this process through the reproducible elimination of repeated sequences and the precise excision of over 45,000 short, single-copy Internal Eliminated Sequences (IESs). Different types of ncRNAs resulting from genome-wide transcription were shown to be involved in the epigenetic regulation of genome rearrangements. To understand how ncRNAs are produced from the entire genome, we have focused on a homolog of the TFIIS elongation factor, which regulates RNA polymerase II transcriptional pausing. Six TFIIS-paralogs, representing four distinct families, can be found in P. tetraurelia genome. Using RNA interference, we showed that TFIIS4, which encodes a development-specific TFIIS protein, is essential for the formation of a functional somatic genome. Molecular analyses and high-throughput DNA sequencing upon TFIIS4 RNAi demonstrated that TFIIS4 is involved in all kinds of genome rearrangements, including excision of ~48% of IESs. Localization of a GFP-TFIIS4 fusion revealed that TFIIS4 appears specifically in the new somatic nucleus at an early developmental stage, before IES excision. RT-PCR experiments showed that TFIIS4 is necessary for the synthesis of IES-containing non-coding transcripts. We propose that these IES+ transcripts originate from the developing somatic nucleus and serve as pairing substrates for germline-specific short RNAs that target elimination of their homologous sequences. Our study, therefore, connects the onset of zygotic non coding transcription to the control of genome plasticity in Paramecium, and establishes for the first time a specific role of TFIIS in non-coding transcription in eukaryotes. PMID:26177014
DOE Office of Scientific and Technical Information (OSTI.GOV)

Helfenbein, Kevin G.; Brown, Wesley M.; Boore, Jeffrey L.

We have sequenced the complete mitochondrial DNA (mtDNA) of the articulate brachiopod Terebratalia transversa. The circular genome is 14,291 bp in size, relatively small compared to other published metazoan mtDNAs. The 37 genes commonly found in animal mtDNA are present; the size decrease is due to the truncation of several tRNA, rRNA, and protein genes, to some nucleotide overlaps, and to a paucity of non-coding nucleotides. Although the gene arrangement differs radically from those reported for other metazoans, some gene junctions are shared with two other articulate brachiopods, Laqueus rubellus and Terebratulina retusa. All genes in the T. transversa mtDNA,more » unlike those in most metazoan mtDNAs reported, are encoded by the same strand. The A+T content (59.1 percent) is low for a metazoan mtDNA, and there is a high propensity for homopolymer runs and a strong base-compositional strand bias. The coding strand is quite G+T-rich, a skew that is shared by the confamilial (laqueid) specie s L. rubellus, but opposite to that found in T. retusa, a cancellothyridid. These compositional skews are strongly reflected in the codon usage patterns and the amino acid compositions of the mitochondrial proteins, with markedly different usage observed between T. retusa and the two laqueids. This observation, plus the similarity of the laqueid non-coding regions to the reverse complement of the non-coding region of the cancellothyridid, suggest that an inversion that resulted in a reversal in the direction of first-strand replication has occurred in one of the two lineages. In addition to the presence of one non-coding region in T. transversa that is comparable to those in the other brachiopod mtDNAs, there are two others with the potential to form secondary structures; one or both of these may be involved in the process of transcript cleavage.« less
MicroRNAs and other non-coding RNAs as targets for anticancer drug development

PubMed Central

Ling, Hui; Fabbri, Muller; Calin, George A.

2015-01-01

With the first cancer-targeted microRNA drug, MRX34, a liposome-based miR-34 mimic, entering phase I clinical trial in patients with advanced hepatocellular carcinoma in April 2013, miRNA therapeutics are attracting special attention from both academia and biotechnology companies. Although to date the most studied non-coding RNAs (ncRNAs) are miRNAs, the importance of long non-coding RNAs (lncRNAs) is increasingly being recognized. Here we summarize the roles of miRNAs and lncRNAs in cancer, with a focus on the recently identified novel mechanisms of action, and discuss the current strategies in designing ncRNA-targeting therapeutics, as well as the associated challenges. PMID:24172333
Paraspeckles: nuclear bodies built on long noncoding RNA

PubMed Central

Bond, Charles S.

2009-01-01

Paraspeckles are ribonucleoprotein bodies found in the interchromatin space of mammalian cell nuclei. These structures play a role in regulating the expression of certain genes in differentiated cells by nuclear retention of RNA. The core paraspeckle proteins (PSF/SFPQ, P54NRB/NONO, and PSPC1 [paraspeckle protein 1]) are members of the DBHS (Drosophila melanogaster behavior, human splicing) family. These proteins, together with the long nonprotein-coding RNA NEAT1 (MEN-ϵ/β), associate to form paraspeckles and maintain their integrity. Given the large numbers of long noncoding transcripts currently being discovered through whole transcriptome analysis, paraspeckles may be a paradigm for a class of subnuclear bodies formed around long noncoding RNA. PMID:19720872
A Tale of Two RNAs during Viral Infection: How Viruses Antagonize mRNAs and Small Non-Coding RNAs in The Host Cell

PubMed Central

Herbert, Kristina M.; Nag, Anita

2016-01-01

Viral infection initiates an array of changes in host gene expression. Many viruses dampen host protein expression and attempt to evade the host anti-viral defense machinery. Host gene expression is suppressed at several stages of host messenger RNA (mRNA) formation including selective degradation of translationally competent messenger RNAs. Besides mRNAs, host cells also express a variety of noncoding RNAs, including small RNAs, that may also be subject to inhibition upon viral infection. In this review we focused on different ways viruses antagonize coding and noncoding RNAs in the host cell to its advantage. PMID:27271653
Post-transcriptional Regulation of Genes Related to Biological Behaviors of Gastric Cancer by Long Noncoding RNAs and MicroRNAs

PubMed Central

Liu, Wenjing; Ma, Rui; Yuan, Yuan

2017-01-01

Noncoding RNAs play critical roles in regulating protein-coding genes and comprise two major classes: long noncoding RNAs (lncRNAs) and microRNAs (miRNAs). LncRNAs regulate gene expression at transcriptional, post-transcriptional, and epigenetic levels via multiple action modes. LncRNAs can also function as endogenous competitive RNAs for miRNAs and indirectly regulate gene expression post-transcriptionally. By binding to the 3'-untranslated regions (3'-UTR) of target genes, miRNAs post-transcriptionally regulate gene expression. Herein, we conducted a review of post-transcriptional regulation by lncRNAs and miRNAs of genes associated with biological behaviors of gastric cancer. PMID:29187891
The complete mitochondrial genomes for three Toxocara species of human and animal health significance

PubMed Central

Li, Ming-Wei; Lin, Rui-Qing; Song, Hui-Qun; Wu, Xiang-Yun; Zhu, Xing-Quan

2008-01-01

Background Studying mitochondrial (mt) genomics has important implications for various fundamental areas, including mt biochemistry, physiology and molecular biology. In addition, mt genome sequences have provided useful markers for investigating population genetic structures, systematics and phylogenetics of organisms. Toxocara canis, Toxocara cati and Toxocara malaysiensis cause significant health problems in animals and humans. Although they are of importance in human and animal health, no information on the mt genomes for any of Toxocara species is available. Results The sizes of the entire mt genome are 14,322 bp for T. canis, 14029 bp for T. cati and 14266 bp for T. malaysiensis, respectively. These circular genomes are amongst the largest reported to date for all secernentean nematodes. Their relatively large sizes relate mainly to an increased length in the AT-rich region. The mt genomes of the three Toxocara species all encode 12 proteins, two ribosomal RNAs and 22 transfer RNA genes, but lack the ATP synthetase subunit 8 gene, which is consistent with all other species of Nematode studied to date, with the exception of Trichinella spiralis. All genes are transcribed in the same direction and have a nucleotide composition high in A and T, but low in G and C. The contents of A+T of the complete genomes are 68.57% for T. canis, 69.95% for T. cati and 68.86% for T. malaysiensis, among which the A+T for T. canis is the lowest among all nematodes studied to date. The AT bias had a significant effect on both the codon usage pattern and amino acid composition of proteins. The mt genome structures for three Toxocara species, including genes and non-coding regions, are in the same order as for Ascaris suum and Anisakis simplex, but differ from Ancylostoma duodenale, Necator americanus and Caenorhabditis elegans only in the location of the AT-rich region, whereas there are substantial differences when compared with Onchocerca volvulus,Dirofiliria immitis and Strongyloides stercoralis. Phylogenetic analyses based on concatenated amino acid sequences of 12 protein-coding genes revealed that the newly described species T. malaysiensis was more closely related to T. cati than to T. canis, consistent with results of a previous study using sequences of nuclear internal transcribed spacers as genetic markers. Conclusion The present study determined the complete mt genome sequences for three roundworms of human and animal health significance, which provides mtDNA evidence for the validity of T. malaysiensis and also provides a foundation for studying the systematics, population genetics and ecology of these and other nematodes of socio-economic importance. PMID:18482460
Identification of small non-coding RNA classes expressed in swine whole blood during HP-PRRSV infection

USDA-ARS?s Scientific Manuscript database

It has been established that reduced susceptibility to porcine reproductive and respiratory syndrome virus (PRRSV) has a genetic component. This genetic component may take the form of small non-coding RNAs (sncRNA), which are molecules that function as regulators of gene expression. Various sncRNAs ...
Specificity Protein (Sp) Transcription Factors and Metformin Regulate Expression of the Long Non-coding RNA HULC

EPA Science Inventory

There is evidence that specificity protein 1 (Sp1) transcription factor (TF) regulates expression of long non-coding RNAs (lncRNAs) in hepatocellular carcinoma (HCC) cells. RNA interference (RNAi) studies showed that among several lncRNAs expressed in HepG2, SNU-449 and SK-Hep-1...
Identification and characterization of long non-coding RNAs in rainbow trout eggs

USDA-ARS?s Scientific Manuscript database

Long non-coding RNAs (lncRNAs) are in general considered as a diverse class of transcripts longer than 200 nucleotides that structurally resemble mRNAs but do not encode proteins. Recent advances in RNA sequencing (RNA-Seq) and bioinformatics methods have provided an opportunity to indentify and ana...
Dynamic interplay and function of multiple noncoding genes governing X chromosome inactivation

PubMed Central

Yue, Minghui; Richard, John Lalith Charles

2015-01-01

There is increasing evidence for the emergence of long noncoding RNAs (IncRNAs) as important components, especially in the regulation of gene expression. In the event of X chromosome inactivation, robust epigenetic marks are established in a long noncoding Xist RNA-dependent manner, giving rise to a distinct epigenetic landscape on the inactive X chromosome (Xi). The X inactivation center (Xic is essential for induction of X chromosome inactivation and harbors two topologically associated domains (TADs) to regulate monoallelic Xist expression: one at the noncoding Xist gene and its upstream region, and the other at the antisense Tsix and its upstream region. The monoallelic expression of Xist is tightly regulated by these two functionally distinct TADs as well as their constituting IncRNAs and proteins. In this review, we summarize recent updates in our knowledge of IncRNAs found at the Xic and discuss their overall mechanisms of action. We also discuss our current understanding of the molecular mechanism behind Xist RNA-mediated induction of the repressive epigenetic landscape at the Xi. PMID:26260844
LncRNApred: Classification of Long Non-Coding RNAs and Protein-Coding Transcripts by the Ensemble Algorithm with a New Hybrid Feature.

PubMed

Pian, Cong; Zhang, Guangle; Chen, Zhi; Chen, Yuanyuan; Zhang, Jin; Yang, Tao; Zhang, Liangyun

2016-01-01

As a novel class of noncoding RNAs, long noncoding RNAs (lncRNAs) have been verified to be associated with various diseases. As large scale transcripts are generated every year, it is significant to accurately and quickly identify lncRNAs from thousands of assembled transcripts. To accurately discover new lncRNAs, we develop a classification tool of random forest (RF) named LncRNApred based on a new hybrid feature. This hybrid feature set includes three new proposed features, which are MaxORF, RMaxORF and SNR. LncRNApred is effective for classifying lncRNAs and protein coding transcripts accurately and quickly. Moreover,our RF model only requests the training using data on human coding and non-coding transcripts. Other species can also be predicted by using LncRNApred. The result shows that our method is more effective compared with the Coding Potential Calculate (CPC). The web server of LncRNApred is available for free at http://mm20132014.wicp.net:57203/LncRNApred/home.jsp.
Long non-coding RNA CASC2 regulates cell biological behaviour through the MAPK signalling pathway in hepatocellular carcinoma.

PubMed

Gan, Yuanyuan; Han, Nana; He, Xiaoqin; Yu, Jiajun; Zhang, Meixia; Zhou, Yujie; Liang, Huiling; Deng, Junjian; Zheng, Yongfa; Ge, Wei; Long, Zhixiong; Xu, Ximing

2017-06-01

Long non-coding RNAs have previously been demonstrated to play important roles in regulating human diseases, especially cancer. However, the biological functions and molecular mechanisms of long non-coding RNAs in hepatocellular carcinoma have not been extensively studied. The long non-coding RNA CASC2 (cancer susceptibility candidate 2) has been characterised as a tumour suppressor in endometrial cancer and gliomas. However, the role and function of CASC2 in hepatocellular carcinoma remain unknown. In this study, using quantitative real-time polymerase chain reaction, we confirmed that CASC2 expression was downregulated in 50 hepatocellular carcinoma cases (62%) and in hepatocellular carcinoma cell lines compared with the paired adjacent tissues and normal liver cells. In vitro experiments further demonstrated that overexpressed CASC2 decreased hepatocellular carcinoma cell proliferation, migration and invasion as well as promoted apoptosis via inactivating the mitogen-activated protein kinase signalling pathway. Our findings demonstrate that CASC2 could be a useful tumour suppressor factor and a promising therapeutic target for hepatocellular carcinoma.

The Long Non-Coding RNA Transcriptome Landscape in CHO Cells Under Batch and Fed-Batch Conditions.

PubMed

Vito, Davide; Smales, C Mark

2018-05-21

The role of non-coding RNAs in determining growth, productivity and recombinant product quality attributes in Chinese hamster ovary (CHO) cells has received much attention in recent years, exemplified by studies into microRNAs in particular. However, other classes of non-coding RNAs have received less attention. One such class are the non-coding RNAs known collectively as long non-coding RNAs (lncRNAs). We have undertaken the first landscape analysis of the lncRNA transcriptome in CHO using a mouse based microarray that also provided for the surveillance of the coding transcriptome. We report on those lncRNAs present in a model host CHO cell line under batch and fed-batch conditions on two different days and relate the expression of different lncRNAs to each other. We demonstrate that the mouse microarray was suitable for the detection and analysis of thousands of CHO lncRNAs and validated a number of these by qRT-PCR. We then further analysed the data to identify those lncRNAs whose expression changed the most between growth and stationary phases of culture or between batch and fed-batch culture to identify potential lncRNA targets for further functional studies with regard to their role in controlling growth of CHO cells. We discuss the implications for the publication of this rich dataset and how this may be used by the community. This article is protected by copyright. All rights reserved.
Long Noncoding RNA in Digestive Tract Cancers: Function, Mechanism, and Potential Biomarker

PubMed Central

Zeng, Shuo; Xiao, Yu-Feng; Tang, Bo; Hu, Chang-Jiang; Xie, Rei; Yang, Shi-Ming

2015-01-01

Digestive tract cancers (DTCs) are a leading cause of cancer-related death worldwide. Current therapeutic tools for advanced stage DTCs have limitations, and patients with early stage DTCs frequently have a missed diagnosis due to shortage of efficient biomarkers. Consequently, it is necessary to develop novel biomarkers for early diagnosis and novel therapeutic targets for treatment of DTCs. In recent years, long noncoding RNAs (lncRNAs), a class of noncoding RNAs with >200 nucleotides, have been shown to be aberrantly expressed in DTCs and to have an important role in DTC development: the expression profiles of lncRNAs strongly correlated with poor survival of patients with DTCs, and lncRNAs acted as oncogenes or tumor suppressor genes in DTC progression. In this review, we summarized the functional lncRNAs and expounded on their regulatory mechanisms in DTCs. Implications for Practice: Digestive tract cancers (DTCs) are a leading cause of cancer-related death worldwide. It is necessary to exploit novel biomarkers for early diagnosis and novel therapeutic targets for treatment of DTCs. Long noncoding RNAs (lncRNAs), a class of noncoding RNAs with approximately 200 nucleotides to 100,000 bases, participate in the progression of a variety of diseases. This review summarizes functional lncRNAs, which were shown to serve as novel biomarkers for diagnosis and prognosis of DTCs and to act as oncogenes or tumor suppressor genes in DTC development. In addition, the potential mechanism of functional lncRNAs in DTCs is highlighted. PMID:26156325
Variations in the non-coding transcriptome as a driver of inter-strain divergence and physiological adaptation in bacteria

PubMed Central

Kopf, Matthias; Klähn, Stephan; Scholz, Ingeborg; Hess, Wolfgang R.; Voß, Björn

2015-01-01

In all studied organisms, a substantial portion of the transcriptome consists of non-coding RNAs that frequently execute regulatory functions. Here, we have compared the primary transcriptomes of the cyanobacteria Synechocystis sp. PCC 6714 and PCC 6803 under 10 different conditions. These strains share 2854 protein-coding genes and a 16S rRNA identity of 99.4%, indicating their close relatedness. Conserved major transcriptional start sites (TSSs) give rise to non-coding transcripts within the sigB gene, from the 5′UTRs of cmpA and isiA, and 168 loci in antisense orientation. Distinct differences include single nucleotide polymorphisms rendering promoters inactive in one of the strains, e.g., for cmpR and for the asRNA PsbA2R. Based on the genome-wide mapped location, regulation and classification of TSSs, non-coding transcripts were identified as the most dynamic component of the transcriptome. We identified a class of mRNAs that originate by read-through from an sRNA that accumulates as a discrete and abundant transcript while also serving as the 5′UTR. Such an sRNA/mRNA structure, which we name ‘actuaton’, represents another way for bacteria to remodel their transcriptional network. Our findings support the hypothesis that variations in the non-coding transcriptome constitute a major evolutionary element of inter-strain divergence and capability for physiological adaptation. PMID:25902393
Spatiotemporal clustering of the epigenome reveals rules of dynamic gene regulation

PubMed Central

Yu, Pengfei; Xiao, Shu; Xin, Xiaoyun; Song, Chun-Xiao; Huang, Wei; McDee, Darina; Tanaka, Tetsuya; Wang, Ting; He, Chuan; Zhong, Sheng

2013-01-01

Spatial organization of different epigenomic marks was used to infer functions of the epigenome. It remains unclear what can be learned from the temporal changes of the epigenome. Here, we developed a probabilistic model to cluster genomic sequences based on the similarity of temporal changes of multiple epigenomic marks during a cellular differentiation process. We differentiated mouse embryonic stem (ES) cells into mesendoderm cells. At three time points during this differentiation process, we used high-throughput sequencing to measure seven histone modifications and variants—H3K4me1/2/3, H3K27ac, H3K27me3, H3K36me3, and H2A.Z; two DNA modifications—5-mC and 5-hmC; and transcribed mRNAs and noncoding RNAs (ncRNAs). Genomic sequences were clustered based on the spatiotemporal epigenomic information. These clusters not only clearly distinguished gene bodies, promoters, and enhancers, but also were predictive of bidirectional promoters, miRNA promoters, and piRNAs. This suggests specific epigenomic patterns exist on piRNA genes much earlier than germ cell development. Temporal changes of H3K4me2, unmethylated CpG, and H2A.Z were predictive of 5-hmC changes, suggesting unmethylated CpG and H3K4me2 as potential upstream signals guiding TETs to specific sequences. Several rules on combinatorial epigenomic changes and their effects on mRNA expression and ncRNA expression were derived, including a simple rule governing the relationship between 5-hmC and gene expression levels. A Sox17 enhancer containing a FOXA2 binding site and a Foxa2 enhancer containing a SOX17 binding site were identified, suggesting a positive feedback loop between the two mesendoderm transcription factors. These data illustrate the power of using epigenome dynamics to investigate regulatory functions. PMID:23033340
MiRNA-124 is a link between measles virus persistent infection and cell division of human neuroblastoma cells.

PubMed

Naaman, Hila; Rall, Glenn; Matullo, Christine; Veksler-Lublinsky, Isana; Shemer-Avni, Yonat; Gopas, Jacob

2017-01-01

Measles virus (MV) infects a variety of lymphoid and non-lymphoid peripheral organs. However, in rare cases, the virus can persistently infect cells within the central nervous system. Although some of the factors that allow MV to persist are known, the contribution of host cell-encoded microRNAs (miRNA) have not been described. MiRNAs are a class of noncoding RNAs transcribed from genomes of all multicellular organisms and some viruses, which regulate gene expression in a sequence-specific manner. We have studied the contribution of host cell-encoded miRNAs to the establishment of MV persistent infection in human neuroblastoma cells. Persistent MV infection was accompanied by differences in the expression profile and levels of several host cell-encoded microRNAs as compared to uninfected cells. MV persistence infection of a human neuroblastoma cell line (UKF-NB-MV), exhibit high miRNA-124 expression, and reduced expression of cyclin dependent kinase 6 (CDK6), a known target of miRNA-124, resulting in slower cell division but not cell death. By contrast, acute MV infection of UKF-NB cells did not result in increased miRNA-124 levels or CDK6 reduction. Ectopic overexpression of miRNA-124 affected cell viability only in UKF-NB-MV cells, causing cell death; implying that miRNA-124 over expression can sensitize cells to death only in the presence of MV persistent infection. To determine if miRNA-124 directly contributes to the establishment of MV persistence, UKF-NB cells overexpressing miRNA-124 were acutely infected, resulting in establishment of persistently infected colonies. We propose that miRNA-124 triggers a CDK6-dependent decrease in cell proliferation, which facilitates the establishment of MV persistence in neuroblastoma cells. To our knowledge, this is the first report to describe the role of a specific miRNA in MV persistence.
The lncRNA myocardial infarction associated transcript-centric competing endogenous RNA network in non-small-cell lung cancer.

PubMed

Zheng, Chang; Li, Xuelian; Qian, Biyun; Feng, Nannan; Gao, Sumeng; Zhao, Yuxia; Zhou, Baosen

2018-01-01

The leading cause of death for cancer is lung cancer, of which the majority subtype is non-small cell lung cancer (NSCLC). Recent studies have shown long non-coding RNAs are transcribed and contribute to cancer. Previous study has shown that a few single nucleotide polymorphisms (SNPs) in myocardial infarction associated transcript (MIAT) were associated with some diseases or function as competing endogenous RNA (ceRNA) in some cancer. We performed bioinformatic methods for analyzing RNA-seq and miRNA-seq data of NSCLC from The Cancer Genome Atlas database. 1352 NSCLC patients and 1320 cancer-free controls for genotyping, and dual luciferase reporter assay, real-time PCR are performed in A549 and H1975 lung cancer cell lines. Results are analyzed by SPSS v16.0. In the present study, we focus on the role of over-expression MIAT in NSCLC. We confirmed that rs1061451 T>C (allele odds ratio = 0.22; P < 0.01) was associated with NSCLC. Furthermore, we constructed MIAT-centric ceRNA network, and three mRNAs ( MYO1B , SGK1 and WNT9A ) was identified as targets by MIAT via miR-133a-5p. C-containing genotypes of MIAT rs1061451 were protective factor of NSCLC, and MIAT, which may act as ceRNA via miR-133a-5p, modulated MYO1B , SGK1 and WNT9A expression level.
Ultra-deep sequencing of ribosome-associated poly-adenylated RNA in early Drosophila embryos reveals hundreds of conserved translated sORFs.

PubMed

Li, Hongmei; Hu, Chuansheng; Bai, Ling; Li, Hua; Li, Mingfa; Zhao, Xiaodong; Czajkowsky, Daniel M; Shao, Zhifeng

2016-12-01

There is growing recognition that small open reading frames (sORFs) encoding peptides shorter than 100 amino acids are an important class of functional elements in the eukaryotic genome, with several already identified to play critical roles in growth, development, and disease. However, our understanding of their biological importance has been hindered owing to the significant technical challenges limiting their annotation. Here we combined ultra-deep sequencing of ribosome-associated poly-adenylated RNAs with rigorous conservation analysis to identify a comprehensive population of translated sORFs during early Drosophila embryogenesis. In total, we identify 399 sORFs, including those previously annotated but without evidence of translational capacity, those found within transcripts previously classified as non-coding, and those not previously known to be transcribed. Further, we find, for the first time, evidence for translation of many sORFs with different isoforms, suggesting their regulation is as complex as longer ORFs. Furthermore, many sORFs are found not associated with ribosomes in late-stage Drosophila S2 cells, suggesting that many of the translated sORFs may have stage-specific functions during embryogenesis. These results thus provide the first comprehensive annotation of the sORFs present during early Drosophila embryogenesis, a necessary basis for a detailed delineation of their function in embryogenesis and other biological processes. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
NDM29, a RNA polymerase III-dependent non coding RNA, promotes amyloidogenic processing of APP and amyloid β secretion.

PubMed

Massone, Sara; Ciarlo, Eleonora; Vella, Serena; Nizzari, Mario; Florio, Tullio; Russo, Claudio; Cancedda, Ranieri; Pagano, Aldo

2012-07-01

Neuroblastoma Differentiation Marker 29 (NDM29) is a RNA polymerase (pol) III-transcribed non-coding (nc) RNA whose synthesis drives neuroblastoma (NB) cell differentiation to a nonmalignant neuron-like phenotype. Since in this process a complex pattern of molecular changes is associated to plasma membrane protein repertoire we hypothesized that the expression of NDM29 might influence also key players of neurodegenerative pathways. In this work we show that the NDM29-dependent cell maturation induces amyloid precursor protein (APP) synthesis, leading to the increase of amyloid β peptide (Aβ) secretion and the concomitant increment of Aβ x-42/Aβ x-40 ratio. We also demonstrate that the expression of NDM29 RNA, and the consequent increase of Aβ formation, can be promoted by inflammatory stimuli (and repressed by anti-inflammatory drugs). Moreover, NDM29 expression was detected in normal human brains although an abnormal increased synthesis of this ncRNA is induced in patients affected by neurodegenerative diseases. Therefore, the complex of events triggered by NDM29 expression induces a condition that favors the formation of Aβ peptides in the extracellular space, as it may occur in Alzheimer's Disease (AD). In addition, these data unexpectedly show that a pol III-dependent small RNA can act as key regulator of brain physiology and/or pathology suggesting that a better knowledge of this portion of the human transcriptome might provide hints for neurodegeneration studies. Copyright © 2012 Elsevier B.V. All rights reserved.
The lncRNA myocardial infarction associated transcript-centric competing endogenous RNA network in non-small-cell lung cancer

PubMed Central

Zheng, Chang; Li, Xuelian; Qian, Biyun; Feng, Nannan; Gao, Sumeng; Zhao, Yuxia; Zhou, Baosen

2018-01-01

Background The leading cause of death for cancer is lung cancer, of which the majority subtype is non-small cell lung cancer (NSCLC). Recent studies have shown long non-coding RNAs are transcribed and contribute to cancer. Previous study has shown that a few single nucleotide polymorphisms (SNPs) in myocardial infarction associated transcript (MIAT) were associated with some diseases or function as competing endogenous RNA (ceRNA) in some cancer. Patients and methods We performed bioinformatic methods for analyzing RNA-seq and miRNA-seq data of NSCLC from The Cancer Genome Atlas database. 1352 NSCLC patients and 1320 cancer-free controls for genotyping, and dual luciferase reporter assay, real-time PCR are performed in A549 and H1975 lung cancer cell lines. Results are analyzed by SPSS v16.0. Results In the present study, we focus on the role of over-expression MIAT in NSCLC. We confirmed that rs1061451 T>C (allele odds ratio = 0.22; P < 0.01) was associated with NSCLC. Furthermore, we constructed MIAT-centric ceRNA network, and three mRNAs (MYO1B, SGK1 and WNT9A) was identified as targets by MIAT via miR-133a-5p. Conclusion C-containing genotypes of MIAT rs1061451 were protective factor of NSCLC, and MIAT, which may act as ceRNA via miR-133a-5p, modulated MYO1B, SGK1 and WNT9A expression level. PMID:29795987
Quick Fluorescent In Situ Hybridization Protocol for Xist RNA Combined with Immunofluorescence of Histone Modification in X-chromosome Inactivation

PubMed Central

Yamada, Norishige; Ogawa, Akiyo; Ogawa, Yuya

2014-01-01

Combining RNA fluorescent in situ hybridization (FISH) with immunofluorescence (immuno-FISH) creates a technique that can be employed at the single cell level to detect the spatial dynamics of RNA localization with simultaneous insight into the localization of proteins, epigenetic modifications and other details which can be highlighted by immunofluorescence. X-chromosome inactivation is a paradigm for long non-coding RNA (lncRNA)-mediated gene silencing. X-inactive specific transcript (Xist) lncRNA accumulation (called an Xist cloud) on one of the two X-chromosomes in mammalian females is a critical step to initiate X-chromosome inactivation. Xist RNA directly or indirectly interacts with various chromatin-modifying enzymes and introduces distinct epigenetic landscapes to the inactive X-chromosome (Xi). One known epigenetic hallmark of the Xi is the Histone H3 trimethyl-lysine 27 (H3K27me3) modification. Here, we describe a simple and quick immuno-FISH protocol for detecting Xist RNA using RNA FISH with multiple oligonucleotide probes coupled with immunofluorescence of H3K27me3 to examine the localization of Xist RNA and associated epigenetic modifications. Using oligonucleotide probes results in a shorter incubation time and more sensitive detection of Xist RNA compared to in vitro transcribed RNA probes (riboprobes). This protocol provides a powerful tool for understanding the dynamics of lncRNAs and its associated epigenetic modification, chromatin structure, nuclear organization and transcriptional regulation. PMID:25489864
Independent activity of the homologous small regulatory RNAs AbcR1 and AbcR2 in the legume symbiont Sinorhizobium meliloti.

PubMed

Torres-Quesada, Omar; Millán, Vicenta; Nisa-Martínez, Rafael; Bardou, Florian; Crespi, Martín; Toro, Nicolás; Jiménez-Zurdo, José I

2013-01-01

The legume symbiont Sinorhizobium meliloti expresses a plethora of small noncoding RNAs (sRNAs) whose function is mostly unknown. Here, we have functionally characterized two tandemly encoded S. meliloti Rm1021 sRNAs that are similar in sequence and structure. Homologous sRNAs (designated AbcR1 and AbcR2) have been shown to regulate several ABC transporters in the related α-proteobacteria Agrobacterium tumefaciens and Brucella abortus. In Rm1021, AbcR1 and AbcR2 exhibit divergent unlinked regulation and are stabilized by the RNA chaperone Hfq. AbcR1 is transcribed in actively dividing bacteria, either in culture, rhizosphere or within the invasion zone of mature alfalfa nodules. Conversely, AbcR2 expression is induced upon entry into stationary phase and under abiotic stress. Only deletion of AbcR1 resulted into a discrete growth delay in rich medium, but both are dispensable for symbiosis. Periplasmic proteome profiling revealed down-regulation of the branched-chain amino acid binding protein LivK by AbcR1, but not by AbcR2. A double-plasmid reporter assay confirmed the predicted specific targeting of the 5'-untranslated region of the livK mRNA by AbcR1 in vivo. Our findings provide evidences of independent regulatory functions of these sRNAs, probably to fine-tune nutrient uptake in free-living and undifferentiated symbiotic rhizobia.
Independent Activity of the Homologous Small Regulatory RNAs AbcR1 and AbcR2 in the Legume Symbiont Sinorhizobium meliloti

PubMed Central

Torres-Quesada, Omar; Millán, Vicenta; Nisa-Martínez, Rafael; Bardou, Florian; Crespi, Martín; Toro, Nicolás; Jiménez-Zurdo, José I.

2013-01-01

The legume symbiont Sinorhizobium meliloti expresses a plethora of small noncoding RNAs (sRNAs) whose function is mostly unknown. Here, we have functionally characterized two tandemly encoded S. meliloti Rm1021 sRNAs that are similar in sequence and structure. Homologous sRNAs (designated AbcR1 and AbcR2) have been shown to regulate several ABC transporters in the related α-proteobacteria Agrobacterium tumefaciens and Brucella abortus. In Rm1021, AbcR1 and AbcR2 exhibit divergent unlinked regulation and are stabilized by the RNA chaperone Hfq. AbcR1 is transcribed in actively dividing bacteria, either in culture, rhizosphere or within the invasion zone of mature alfalfa nodules. Conversely, AbcR2 expression is induced upon entry into stationary phase and under abiotic stress. Only deletion of AbcR1 resulted into a discrete growth delay in rich medium, but both are dispensable for symbiosis. Periplasmic proteome profiling revealed down-regulation of the branched-chain amino acid binding protein LivK by AbcR1, but not by AbcR2. A double-plasmid reporter assay confirmed the predicted specific targeting of the 5′-untranslated region of the livK mRNA by AbcR1 in vivo. Our findings provide evidences of independent regulatory functions of these sRNAs, probably to fine-tune nutrient uptake in free-living and undifferentiated symbiotic rhizobia. PMID:23869210
Isolation and Identification of Post-Transcriptional Gene Silencing-Related Micro-RNAs by Functionalized Silicon Nanowire Field-effect Transistor

NASA Astrophysics Data System (ADS)

Chen, Kuan-I.; Pan, Chien-Yuan; Li, Keng-Hui; Huang, Ying-Chih; Lu, Chia-Wei; Tang, Chuan-Yi; Su, Ya-Wen; Tseng, Ling-Wei; Tseng, Kun-Chang; Lin, Chi-Yun; Chen, Chii-Dong; Lin, Shih-Shun; Chen, Yit-Tsong

2015-11-01

Many transcribed RNAs are non-coding RNAs, including microRNAs (miRNAs), which bind to complementary sequences on messenger RNAs to regulate the translation efficacy. Therefore, identifying the miRNAs expressed in cells/organisms aids in understanding genetic control in cells/organisms. In this report, we determined the binding of oligonucleotides to a receptor-modified silicon nanowire field-effect transistor (SiNW-FET) by monitoring the changes in conductance of the SiNW-FET. We first modified a SiNW-FET with a DNA probe to directly and selectively detect the complementary miRNA in cell lysates. This SiNW-FET device has 7-fold higher sensitivity than reverse transcription-quantitative polymerase chain reaction in detecting the corresponding miRNA. Next, we anchored viral p19 proteins, which bind the double-strand small RNAs (ds-sRNAs), on the SiNW-FET. By perfusing the device with synthesized ds-sRNAs of different pairing statuses, the dissociation constants revealed that the nucleotides at the 3‧-overhangs and pairings at the terminus are important for the interactions. After perfusing the total RNA mixture extracted from Nicotiana benthamiana across the device, this device could enrich the ds-sRNAs for sequence analysis. Finally, this bionanoelectronic SiNW-FET, which is able to isolate and identify the interacting protein-RNA, adds an additional tool in genomic technology for the future study of direct biomolecular interactions.
Trypanosoma brucei RAP1 maintains telomere and subtelomere integrity by suppressing TERRA and telomeric RNA:DNA hybrids.

PubMed

Nanavaty, Vishal; Sandhu, Ranjodh; Jehi, Sanaa E; Pandya, Unnati M; Li, Bibo

2017-06-02

Trypanosoma brucei causes human African trypanosomiasis and regularly switches its major surface antigen, VSG, thereby evading the host's immune response. VSGs are monoallelically expressed from subtelomeric expression sites (ESs), and VSG switching exploits subtelomere plasticity. However, subtelomere integrity is essential for T. brucei viability. The telomeric transcript, TERRA, was detected in T. brucei previously. We now show that the active ES-adjacent telomere is transcribed. We find that TbRAP1, a telomere protein essential for VSG silencing, suppresses VSG gene conversion-mediated switching. Importantly, TbRAP1 depletion increases the TERRA level, which appears to result from longer read-through into the telomere downstream of the active ES. Depletion of TbRAP1 also results in more telomeric RNA:DNA hybrids and more double strand breaks (DSBs) at telomeres and subtelomeres. In TbRAP1-depleted cells, expression of excessive TbRNaseH1, which cleaves the RNA strand of the RNA:DNA hybrid, brought telomeric RNA:DNA hybrids, telomeric/subtelomeric DSBs and VSG switching frequency back to WT levels. Therefore, TbRAP1-regulated appropriate levels of TERRA and telomeric RNA:DNA hybrid are fundamental to subtelomere/telomere integrity. Our study revealed for the first time an important role of a long, non-coding RNA in antigenic variation and demonstrated a link between telomeric silencing and subtelomere/telomere integrity through TbRAP1-regulated telomere transcription. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Role of histone modifications and early termination in pervasive transcription and antisense-mediated gene silencing in yeast.

PubMed

Castelnuovo, Manuele; Zaugg, Judith B; Guffanti, Elisa; Maffioletti, Andrea; Camblong, Jurgi; Xu, Zhenyu; Clauder-Münster, Sandra; Steinmetz, Lars M; Luscombe, Nicholas M; Stutz, Françoise

2014-04-01

Most genomes, including yeast Saccharomyces cerevisiae, are pervasively transcribed producing numerous non-coding RNAs, many of which are unstable and eliminated by nuclear or cytoplasmic surveillance pathways. We previously showed that accumulation of PHO84 antisense RNA (asRNA), in cells lacking the nuclear exosome component Rrp6, is paralleled by repression of sense transcription in a process dependent on the Hda1 histone deacetylase (HDAC) and the H3K4 histone methyl transferase Set1. Here we investigate this process genome-wide and measure the whole transcriptome of various histone modification mutants in a Δrrp6 strain using tiling arrays. We confirm widespread occurrence of potentially antisense-dependent gene regulation and identify three functionally distinct classes of genes that accumulate asRNAs in the absence of Rrp6. These classes differ in whether the genes are silenced by the asRNA and whether the silencing is HDACs and histone methyl transferase-dependent. Among the distinguishing features of asRNAs with regulatory potential, we identify weak early termination by Nrd1/Nab3/Sen1, extension of the asRNA into the open reading frame promoter and dependence of the silencing capacity on Set1 and the HDACs Hda1 and Rpd3 particularly at promoters undergoing extensive chromatin remodelling. Finally, depending on the efficiency of Nrd1/Nab3/Sen1 early termination, asRNA levels are modulated and their capability of silencing is changed.
Role of histone modifications and early termination in pervasive transcription and antisense-mediated gene silencing in yeast

PubMed Central

Castelnuovo, Manuele; Zaugg, Judith B.; Guffanti, Elisa; Maffioletti, Andrea; Camblong, Jurgi; Xu, Zhenyu; Clauder-Münster, Sandra; Steinmetz, Lars M.; Luscombe, Nicholas M.; Stutz, Françoise

2014-01-01

Most genomes, including yeast Saccharomyces cerevisiae, are pervasively transcribed producing numerous non-coding RNAs, many of which are unstable and eliminated by nuclear or cytoplasmic surveillance pathways. We previously showed that accumulation of PHO84 antisense RNA (asRNA), in cells lacking the nuclear exosome component Rrp6, is paralleled by repression of sense transcription in a process dependent on the Hda1 histone deacetylase (HDAC) and the H3K4 histone methyl transferase Set1. Here we investigate this process genome-wide and measure the whole transcriptome of various histone modification mutants in a Δrrp6 strain using tiling arrays. We confirm widespread occurrence of potentially antisense-dependent gene regulation and identify three functionally distinct classes of genes that accumulate asRNAs in the absence of Rrp6. These classes differ in whether the genes are silenced by the asRNA and whether the silencing is HDACs and histone methyl transferase-dependent. Among the distinguishing features of asRNAs with regulatory potential, we identify weak early termination by Nrd1/Nab3/Sen1, extension of the asRNA into the open reading frame promoter and dependence of the silencing capacity on Set1 and the HDACs Hda1 and Rpd3 particularly at promoters undergoing extensive chromatin remodelling. Finally, depending on the efficiency of Nrd1/Nab3/Sen1 early termination, asRNA levels are modulated and their capability of silencing is changed. PMID:24497191
xRRM: a new class of RRM found in the telomerase La family protein p65.

PubMed

Singh, Mahavir; Choi, Charles P; Feigon, Juli

2013-03-01

Genuine La and La-related proteins group 7 (LARP7) bind to the non-coding RNAs transcribed by RNA polymerase III (RNAPIII), which end in UUU-3'OH. The La motif and RRM1 of these proteins (the La module) cooperate to bind the UUU-3'OH, protecting the RNA from degradation, while other domains may be important for RNA folding or other functions. Among the RNAPIII transcripts is ciliate telomerase RNA (TER). p65, a member of the LARP7 family, is an integral Tetrahymena thermophila telomerase holoenzyme protein required for TER biogenesis and telomerase RNP assembly. p65, together with TER and telomerase reverse transcriptase (TERT), form the Tetrahymena telomerase RNP catalytic core. p65 has an N-terminal domain followed by a La module and a C-terminal domain, which binds to the TER stem 4. We recently showed that the p65 C-terminal domain harbors a cryptic, atypical RRM, which uses a unique mode of single- and double-strand RNA binding and is required for telomerase RNP catalytic core assembly. This domain, which we named xRRM, appears to be present in and unique to genuine La and LARP7 proteins. Here we review the structure of the xRRM, discuss how this domain could recognize diverse substrates of La and LARP7 proteins and discuss the functional implications of the xRRM as an RNP chaperone.
Solving Mendelian Mysteries: The Non-coding Genome May Hold the Key.

PubMed

Valente, Enza Maria; Bhatia, Kailash P

2018-02-22

Despite revolutionary advances in sequencing approaches, many mendelian disorders have remained unexplained. In this issue of Cell, Aneichyk et al. combine genomic and cell-type-specific transcriptomic data to causally link a non-coding mutation in the ubiquitous TAF1 gene to X-linked dystonia-parkinsonism. Copyright © 2018 Elsevier Inc. All rights reserved.
Long non-coding RNAs in cancer metabolism.

PubMed

Xiao, Zhen-Dong; Zhuang, Li; Gan, Boyi

2016-10-01

Altered cellular metabolism is an emerging hallmark of cancer. Accumulating recent evidence links long non-coding RNAs (lncRNAs), a still poorly understood class of non-coding RNAs, to cancer metabolism. Here we review the emerging findings on the functions of lncRNAs in cancer metabolism, with particular emphasis on how lncRNAs regulate glucose and glutamine metabolism in cancer cells, discuss how lncRNAs regulate various aspects of cancer metabolism through their cross-talk with other macromolecules, explore the mechanistic conceptual framework of lncRNAs in reprogramming metabolism in cancers, and highlight the challenges in this field. A more in-depth understanding of lncRNAs in cancer metabolism may enable the development of novel and effective therapeutic strategies targeting cancer metabolism. © 2016 WILEY Periodicals, Inc.
Identification and Characterization of Novel MicroRNAs from Schistosoma japonicum

PubMed Central

Xue, Xiangyang; Sun, Jun; Zhang, Qingfeng; Wang, Zhangxun; Huang, Yufu; Pan, Weiqing

2008-01-01

Background Schistosomiasis japonica remains a major public health problem in China. Its pathogen, Schistosoma japonicum has a complex life cycle and a unique repertoire of genes expressed at different life cycle stages. Exploring schistosome gene regulation will yield the best prospects for new drug targets and vaccine candidates. MicroRNAs (miRNAs) are a highly conserved class of noncoding RNA that control many biological processes by sequence-specific inhibition of gene expression. Although a large number of miRNAs have been identified from plants to mammals, it remains no experimental proof whether schistosome exist miRNAs. Methodology and Results We have identified novel miRNAs from Schistosoma japonicum by cloning and sequencing a small (18–26 nt) RNA cDNA library from the adult worms. Five novel miRNAs were identified from 227 cloned RNA sequences and verified by Northern blot. Alignments of the miRNAs with corresponding family members indicated that four of them belong to a metazoan miRNA family: let-7, miR-71, bantam and miR-125. The fifth potentially new (non conserved) miRNA appears to belong to a previously undescribed family in the genus Schistosome. The novel miRNAs were designated as sja-let-7, sja-miR-71, sja-bantam, sja-miR-125 and sja-miR-new1, respectively. Expression of sja-let-7, sja-miR-71 and sja-bantam were analyzed in six stages of the life cycle, i.e. egg, miracidium, sporocyst, cercaria, schistosomulum, and adult worm, by a modified stem-loop reverse transcribed polymerase chain reaction (RT-PCR) method developed in our laboratory. The expression patterns of these miRNAs were highly stage-specific. In particular, sja-miR-71 and sja-bantam expression reach their peaks in the cercaria stage and then drop quickly to the nadirs in the schistosomulum stage, following penetration of cercaria into a mammalian host. Conclusions Authentic miRNAs were identified for the first time in S. japonicum, including a new schistosome family member. The different expression patterns of the novel miRNAs over the life stages of S. japonicum suggest that they may mediate important roles in Schistosome growth and development. PMID:19107204

The Role of the Y-Chromosome in the Establishment of Murine Hybrid Dysgenesis and in the Analysis of the Nucleotide Sequence Organization, Genetic Transmission and Evolution of Repeated Sequences.

NASA Astrophysics Data System (ADS)

Nallaseth, Ferez Soli

The Y-chromosome presents a unique cytogenetic framework for the evolution of nucleotide sequences. Alignment of nine Y-chromosomal fragments in their increasing Y-specific/non Y-specific (male/female) sequence divergence ratios was directly and inversely related to their interspersion on these two respective genomic fractions. Sequence analysis confirmed a direct relationship between divergence ratios and the Alu, LINE-1, Satellite and their derivative oligonucleotide contents. Thus their relocation on the Y-chromosome is followed by sequence divergence rather than the well documented concerted evolution of these non-coding progenitor repeated sequences. Five of the nine Y-chromosomal fragments are non-pseudoautosomal and transcribed into heterogeneous PolyA^+ RNA and thus can be retrotransposed. Evolutionary and computer analysis identified homologous oligonucleotide tracts in several human loci suggesting common and random mechanistic origins. Dysgenic genomes represent the accelerated evolution driving sequence divergence (McClintock, 1984). Sex reversal and sterility characterizing dysgenesis occurs in C57BL/6JY ^{rm Pos} but not in 129/SvY^{rm Pos} derivative strains. High frequency, random, multi-locus deletion products of the feral Y^{ rm Pos}-chromosome are generated in the germlines of F1(C57BL/6J X 129/SvY^{ rm Pos})(male) and C57BL/6JY ^{rm Pos}(male) but not in 129/SvY^{rm Pos}(male). Equal, 10^{-1}, 10^ {-2}, and 0 copies (relative to males) of Y^{rm Pos}-specific deletion products respectively characterize C57BL/6JY ^{rm Pos} (HC), (LC), (T) and (F) females. The testes determining loci of inactive Y^{rm Pos}-chromosomes in C57BL/6JY^{rm Pos} HC females are the preferentially deleted/rearranged Y ^{rm Pos}-sequences. Disruption of regulation of plasma testosterone and hepatic MUP-A mRNA levels, TRD of a 4.7 Kbp EcoR1 fragment suggest disruption of autosomal/X-chromosomal sequences. These data and the highly repeated progenitor (Alu, GATA, LINE-1) sequence content of deletion products confirmed the previously unidentified loss of genetic control of mammalian chromosome biology and hybrid dysgenesis.
Molecular diversity of tuliposide B-converting enzyme in tulip (Tulipa gesneriana): identification of the root-specific isozyme.

PubMed

Nomura, Taiji; Ueno, Ayaka; Ogita, Shinjiro; Kato, Yasuo

2017-06-01

6-Tuliposide B (PosB) is a glucose ester accumulated in tulip (Tulipa gesneriana) as a major secondary metabolite. PosB serves as the precursor of the antimicrobial lactone tulipalin B (PaB), which is formed by PosB-converting enzyme (TCEB). The gene TgTCEB1, encoding a TCEB, is transcribed in tulip pollen but scarcely transcribed in other tissues (e.g. roots) even though those tissues show high TCEB activity. This led to the prediction of the presence of a TCEB isozyme with distinct tissue specificity. Herein, we describe the identification of the TgTCEB-R gene from roots via native enzyme purification; this gene is a paralog of TgTCEB1. Recombinant enzyme characterization verified that TgTCEB-R encodes a TCEB. Moreover, TgTCEB-R was localized in tulip plastids, as found for pollen TgTCEB1. TgTCEB-R is transcribed almost exclusively in roots, indicating a tissue preference for the transcription of TCEB isozyme genes.
Divergent evolutionary rates in vertebrate and mammalian specific conserved non-coding elements (CNEs) in echolocating mammals.

PubMed

Davies, Kalina T J; Tsagkogeorga, Georgia; Rossiter, Stephen J

2014-12-19

The majority of DNA contained within vertebrate genomes is non-coding, with a certain proportion of this thought to play regulatory roles during development. Conserved Non-coding Elements (CNEs) are an abundant group of putative regulatory sequences that are highly conserved across divergent groups and thus assumed to be under strong selective constraint. Many CNEs may contain regulatory factor binding sites, and their frequent spatial association with key developmental genes - such as those regulating sensory system development - suggests crucial roles in regulating gene expression and cellular patterning. Yet surprisingly little is known about the molecular evolution of CNEs across diverse mammalian taxa or their role in specific phenotypic adaptations. We examined 3,110 vertebrate-specific and ~82,000 mammalian-specific CNEs across 19 and 9 mammalian orders respectively, and tested for changes in the rate of evolution of CNEs located in the proximity of genes underlying the development or functioning of auditory systems. As we focused on CNEs putatively associated with genes underlying the development/functioning of auditory systems, we incorporated echolocating taxa in our dataset because of their highly specialised and derived auditory systems. Phylogenetic reconstructions of concatenated CNEs broadly recovered accepted mammal relationships despite high levels of sequence conservation. We found that CNE substitution rates were highest in rodents and lowest in primates, consistent with previous findings. Comparisons of CNE substitution rates from several genomic regions containing genes linked to auditory system development and hearing revealed differences between echolocating and non-echolocating taxa. Wider taxonomic sampling of four CNEs associated with the homeobox genes Hmx2 and Hmx3 - which are required for inner ear development - revealed family-wise variation across diverse bat species. Specifically within one family of echolocating bats that utilise frequency-modulated echolocation calls varying widely in frequency and intensity high levels of sequence divergence were found. Levels of selective constraint acting on CNEs differed both across genomic locations and taxa, with observed variation in substitution rates of CNEs among bat species. More work is needed to determine whether this variation can be linked to echolocation, and wider taxonomic sampling is necessary to fully document levels of conservation in CNEs across diverse taxa.
Non coding extremities of the seven influenza virus type C vRNA segments: effect on transcription and replication by the type C and type A polymerase complexes

PubMed Central

Crescenzo-Chaigne, Bernadette; Barbezange, Cyril; van der Werf, Sylvie

2008-01-01

Background The transcription/replication of the influenza viruses implicate the terminal nucleotide sequences of viral RNA, which comprise sequences at the extremities conserved among the genomic segments as well as variable 3' and 5' non-coding (NC) regions. The plasmid-based system for the in vivo reconstitution of functional ribonucleoproteins, upon expression of viral-like RNAs together with the nucleoprotein and polymerase proteins has been widely used to analyze transcription/replication of influenza viruses. It was thus shown that the type A polymerase could transcribe and replicate type A, B, or C vRNA templates whereas neither type B nor type C polymerases were able to transcribe and replicate type A templates efficiently. Here we studied the importance of the NC regions from the seven segments of type C influenza virus for efficient transcription/replication by the type A and C polymerases. Results The NC sequences of the seven genomic segments of the type C influenza virus C/Johannesburg/1/66 strain were found to be more variable in length than those of the type A and B viruses. The levels of transcription/replication of viral-like vRNAs harboring the NC sequences of the respective type C virus segments flanking the CAT reporter gene were comparable in the presence of either type C or type A polymerase complexes except for the NS and PB2-like vRNAs. For the NS-like vRNA, the transcription/replication level was higher after introduction of a U residue at position 6 in the 5' NC region as for all other segments. For the PB2-like vRNA the CAT expression level was particularly reduced with the type C polymerase. Analysis of mutants of the 5' NC sequence in the PB2-like vRNA, the shortest 5' NC sequence among the seven segments, showed that additional sequences within the PB2 ORF were essential for the efficiency of transcription but not replication by the type C polymerase complex. Conclusion In the context of a PB2-like reporter vRNA template, the sequence upstream the polyU stretch plays a role in the transcription/replication process by the type C polymerase complex. PMID:18973655
Comprehensive Analysis of Human Endogenous Retrovirus Group HERV-W Locus Transcription in Multiple Sclerosis Brain Lesions by High-Throughput Amplicon Sequencing

PubMed Central

Schmitt, Katja; Richter, Christin; Backes, Christina; Meese, Eckart; Ruprecht, Klemens

2013-01-01

Human endogenous retroviruses (HERVs) of the HERV-W group comprise hundreds of loci in the human genome. Deregulated HERV-W expression and HERV-W locus ERVWE1-encoded Syncytin-1 protein have been implicated in the pathogenesis of multiple sclerosis (MS). However, the actual transcription of HERV-W loci in the MS context has not been comprehensively analyzed. We investigated transcription of HERV-W in MS brain lesions and white matter brain tissue from healthy controls by employing next-generation amplicon sequencing of HERV-W env-specific reverse transcriptase (RT) PCR products, thus revealing transcribed HERV-W loci and the relative transcript levels of those loci. We identified more than 100 HERV-W loci that were transcribed in the human brain, with a limited number of loci being predominantly transcribed. Importantly, relative transcript levels of HERV-W loci were very similar between MS and healthy brain tissue samples, refuting deregulated transcription of HERV-W env in MS brain lesions, including the high-level-transcribed ERVWE1 locus encoding Syncytin-1. Quantitative RT-PCR likewise did not reveal differences in MS regarding HERV-W env general transcript or ERVWE1- and ERVWE2-specific transcript levels. However, we obtained evidence for interindividual differences in HERV-W transcript levels. Reporter gene assays indicated promoter activity of many HERV-W long terminal repeats (LTRs), including structurally incomplete LTRs. Our comprehensive analysis of HERV-W transcription in the human brain thus provides important information on the biology of HERV-W in MS lesions and normal human brain, implications for study design, and mechanisms by which HERV-W may (or may not) be involved in MS. PMID:24109235
Circulating microRNAs and long non-coding RNAs in gastric cancer diagnosis: An update and review

PubMed Central

Huang, Ya-Kai; Yu, Jian-Chun

2015-01-01

Gastric cancer (GC) is the fourth most common cancer and the third leading cause of cancer mortality worldwide. MicroRNAs (miRNAs) and long non-coding RNAs (lncRNAs) are the most popular non-coding RNAs in cancer research. To date, the roles of miRNAs and lncRNAs have been extensively studied in GC, suggesting that miRNAs and lncRNAs represent a vital component of tumor biology. Furthermore, circulating miRNAs and lncRNAs are found to be dysregulated in patients with GC compared with healthy individuals. Circulating miRNAs and lncRNAs may function as promising biomarkers to improve the early detection of GC. Multiple possibilities for miRNA secretion have been elucidated, including active secretion by microvesicles, exosomes, apoptotic bodies, high-density lipoproteins and protein complexes as well as passive leakage from cells. However, the mechanism underlying lncRNA secretion and the functions of circulating miRNAs and lncRNAs have not been fully illuminated. Concurrently, to standardize results of global investigations of circulating miRNAs and lncRNAs biomarker studies, several recommendations for pre-analytic considerations are put forward. In this review, we summarize the known circulating miRNAs and lncRNAs for GC diagnosis. The possible mechanism of miRNA and lncRNA secretion as well as methodologies for identification of circulating miRNAs and lncRNAs are also discussed. The topics covered here highlight new insights into GC diagnosis and screening. PMID:26379393
DIANA-LncBase v2: indexing microRNA targets on non-coding transcripts

PubMed Central

Paraskevopoulou, Maria D.; Vlachos, Ioannis S.; Karagkouni, Dimitra; Georgakilas, Georgios; Kanellos, Ilias; Vergoulis, Thanasis; Zagganas, Konstantinos; Tsanakas, Panayiotis; Floros, Evangelos; Dalamagas, Theodore; Hatzigeorgiou, Artemis G.

2016-01-01

microRNAs (miRNAs) are short non-coding RNAs (ncRNAs) that act as post-transcriptional regulators of coding gene expression. Long non-coding RNAs (lncRNAs) have been recently reported to interact with miRNAs. The sponge-like function of lncRNAs introduces an extra layer of complexity in the miRNA interactome. DIANA-LncBase v1 provided a database of experimentally supported and in silico predicted miRNA Recognition Elements (MREs) on lncRNAs. The second version of LncBase (www.microrna.gr/LncBase) presents an extensive collection of miRNA:lncRNA interactions. The significantly enhanced database includes more than 70 000 low and high-throughput, (in)direct miRNA:lncRNA experimentally supported interactions, derived from manually curated publications and the analysis of 153 AGO CLIP-Seq libraries. The new experimental module presents a 14-fold increase compared to the previous release. LncBase v2 hosts in silico predicted miRNA targets on lncRNAs, identified with the DIANA-microT algorithm. The relevant module provides millions of predicted miRNA binding sites, accompanied with detailed metadata and MRE conservation metrics. LncBase v2 caters information regarding cell type specific miRNA:lncRNA regulation and enables users to easily identify interactions in 66 different cell types, spanning 36 tissues for human and mouse. Database entries are also supported by accurate lncRNA expression information, derived from the analysis of more than 6 billion RNA-Seq reads. PMID:26612864
Identification and characterization of moonlighting long non-coding RNAs based on RNA and protein interactome.

PubMed

Cheng, Lixin; Leung, Kwong-Sak

2018-05-16

Moonlighting proteins are a class of proteins having multiple distinct functions, which play essential roles in a variety of cellular and enzymatic functioning systems. Although there have long been calls for computational algorithms for the identification of moonlighting proteins, research on approaches to identify moonlighting long non-coding RNAs (lncRNAs) has never been undertaken. Here, we introduce a novel methodology, MoonFinder, for the identification of moonlighting lncRNAs. MoonFinder is a statistical algorithm identifying moonlighting lncRNAs without a priori knowledge through the integration of protein interactome, RNA-protein interactions, and functional annotation of proteins. We identify 155 moonlighting lncRNA candidates and uncover that they are a distinct class of lncRNAs characterized by specific sequence and cellular localization features. The non-coding genes that transcript moonlighting lncRNAs tend to have shorter but more exons and the moonlighting lncRNAs have a variable localization pattern with a high chance of residing in the cytoplasmic compartment in comparison to the other lncRNAs. Moreover, moonlighting lncRNAs and moonlighting proteins are rather mutually exclusive in terms of both their direct interactions and interacting partners. Our results also shed light on how the moonlighting candidates and their interacting proteins implicated in the formation and development of cancers and other diseases. The code implementing MoonFinder is supplied as an R package in the supplementary material. lxcheng@cse.cuhk.edu.hk or ksleung@cse.cuhk.edu.hk. Supplementary data are available at Bioinformatics online.
Long non-coding RNA colon cancer-associated transcript 1 functions as a competing endogenous RNA to regulate cyclin-dependent kinase 1 expression by sponging miR-490-3p in hepatocellular carcinoma progression.

PubMed

Dou, Chunqing; Sun, Liyuan; Jin, Xin; Han, Mingming; Zhang, Bao; Li, Tao

2017-04-01

Hepatocellular carcinoma is an aggressive neoplasm and is one of the most common human cancers. Recently, long non-coding RNAs have been demonstrated to participate in pathogenesis of many diseases including the progression in several cancers. In this study, we found that the long non-coding RNA colon cancer-associated transcript 1 was upregulated in hepatocellular carcinoma tissues (p < 0.05), and high colon cancer-associated transcript 1 expression level was positively associated with tumor volume (p < 0.05) and American Joint Committee on Cancer stage (p < 0.05) in hepatocellular carcinoma patients. Luciferase reporter assays and RNA-pulldown assays showed that colon cancer-associated transcript 1 is a target of miR-490-3p. Real-time quantitative polymerase chain reaction and Western blot analysis indicated that colon cancer-associated transcript 1 regulated cyclin-dependent kinase 1 expression as a competing endogenous RNA by sponging miR-490-3p in hepatocellular carcinoma cells. Furthermore, colon cancer-associated transcript 1 silencing decreased hepatocellular carcinoma cells proliferation and invasion and overexpression promoted cell proliferation and invasion in vitro. These data demonstrated that the colon cancer-associated transcript 1/miR-490-3p/cyclin-dependent kinase 1 regulatory pathway promotes the progression of hepatocellular carcinoma. Inhibition of colon cancer-associated transcript 1 expression may be a novel therapeutic strategy for hepatocellular carcinoma.
Circular RNA profiling reveals that circular RNAs from ANXA2 can be used as new biomarkers for multiple sclerosis.

PubMed

Iparraguirre, Leire; Muñoz-Culla, Maider; Prada-Luengo, Iñigo; Castillo-Triviño, Tamara; Olascoaga, Javier; Otaegui, David

2017-09-15

Multiple sclerosis is an autoimmune disease, with higher prevalence in women, in whom the immune system is dysregulated. This dysregulation has been shown to correlate with changes in transcriptome expression as well as in gene-expression regulators, such as non-coding RNAs (e.g. microRNAs). Indeed, some of these have been suggested as biomarkers for multiple sclerosis even though few biomarkers have reached the clinical practice. Recently, a novel family of non-coding RNAs, circular RNAs, has emerged as a new player in the complex network of gene-expression regulation. MicroRNA regulation function through a 'sponge system' and a RNA splicing regulation function have been proposed for the circular RNAs. This regulating role together with their high stability in biofluids makes them seemingly good candidates as biomarkers. Given the dysregulation of both protein-coding and non-coding transcriptome that have been reported in multiple sclerosis patients, we hypothesised that circular RNA expression may also be altered. Therefore, we carried out expression profiling of 13.617 circular RNAs in peripheral blood leucocytes from multiple sclerosis patients and healthy controls finding 406 differentially expressed (P-value < 0.05, Fold change > 1.5) and demonstrate after validation that, circ_0005402 and circ_0035560 are underexpressed in multiple sclerosis patients and could be used as biomarkers of the disease. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Control of seed dormancy in Arabidopsis by a cis-acting noncoding antisense transcript.

PubMed

Fedak, Halina; Palusinska, Malgorzata; Krzyczmonik, Katarzyna; Brzezniak, Lien; Yatusevich, Ruslan; Pietras, Zbigniew; Kaczanowski, Szymon; Swiezewski, Szymon

2016-11-29

Seed dormancy is one of the most crucial process transitions in a plant's life cycle. Its timing is tightly controlled by the expression level of the Delay of Germination 1 gene (DOG1). DOG1 is the major quantitative trait locus for seed dormancy in Arabidopsis and has been shown to control dormancy in many other plant species. This is reflected by the evolutionary conservation of the functional short alternatively polyadenylated form of the DOG1 mRNA. Notably, the 3' region of DOG1, including the last exon that is not included in this transcript isoform, shows a high level of conservation at the DNA level, but the encoded polypeptide is poorly conserved. Here, we demonstrate that this region of DOG1 contains a promoter for the transcription of a noncoding antisense RNA, asDOG1, that is 5' capped, polyadenylated, and relatively stable. This promoter is autonomous and asDOG1 has an expression profile that is different from known DOG1 transcripts. Using several approaches we show that asDOG1 strongly suppresses DOG1 expression during seed maturation in cis, but is unable to do so in trans Therefore, the negative regulation of seed dormancy by asDOG1 in cis results in allele-specific suppression of DOG1 expression and promotes germination. Given the evolutionary conservation of the asDOG1 promoter, we propose that this cis-constrained noncoding RNA-mediated mechanism limiting the duration of seed dormancy functions across the Brassicaceae.
The Mitochondrial Cytochrome Oxidase Subunit I Gene Occurs on a Minichromosome with Extensive Heteroplasmy in Two Species of Chewing Lice, Geomydoecus aurei and Thomomydoecus minor

PubMed Central

Pietan, Lucas L.; Spradling, Theresa A.

2016-01-01

In animals, mitochondrial DNA (mtDNA) typically occurs as a single circular chromosome with 13 protein-coding genes and 22 tRNA genes. The various species of lice examined previously, however, have shown mitochondrial genome rearrangements with a range of chromosome sizes and numbers. Our research demonstrates that the mitochondrial genomes of two species of chewing lice found on pocket gophers, Geomydoecus aurei and Thomomydoecus minor, are fragmented with the 1,536 base-pair (bp) cytochrome-oxidase subunit I (cox1) gene occurring as the only protein-coding gene on a 1,916–1,964 bp minicircular chromosome in the two species, respectively. The cox1 gene of T. minor begins with an atypical start codon, while that of G. aurei does not. Components of the non-protein coding sequence of G. aurei and T. minor include a tRNA (isoleucine) gene, inverted repeat sequences consistent with origins of replication, and an additional non-coding region that is smaller than the non-coding sequence of other lice with such fragmented mitochondrial genomes. Sequences of cox1 minichromosome clones for each species reveal extensive length and sequence heteroplasmy in both coding and noncoding regions. The highly variable non-gene regions of G. aurei and T. minor have little sequence similarity with one another except for a 19-bp region of phylogenetically conserved sequence with unknown function. PMID:27589589
DDM1 represses noncoding RNA expression and RNA-directed DNA methylation in heterochromatin.

PubMed

Tan, Feng; Lu, Yue; Jiang, Wei; Zhao, Yu; Wu, Tian; Zhang, Ruoyu; Zhou, Dao-Xiu

2018-05-24

Cytosine methylation of DNA, which occurs at CG, CHG, and CHH (H=A, C, or T) sequences in plants, is a hallmark for epigenetic repression of repetitive sequences. The chromatin remodeling factor DECREASE IN DNA METHYLATION1 (DDM1) is essential for DNA methylation, especially at CG and CHG sequences. However, its potential role in RNA-directed DNA methylation (RdDM) and in chromatin function is not completely understood in rice (Oryza sativa). In this work, we used high-throughput approaches to study the function of rice DDM1 (OsDDM1) in RdDM and the expression of non-coding RNA (ncRNA). We show that loss of function of OsDDM1 results in ectopic CHH methylation of transposable elements and repeats. The ectopic CHH methylation was dependent on rice DOMAINS REARRANGED METHYLTRANSFERASE2 (OsDRM2), a DNA methyltransferase involved in RdDM. Mutations in OsDDM1 lead to decreases of histone H3K9me2 and increases in the levels of heterochromatic small RNA (sRNA) and long noncoding RNA (lncRNA). In particular, OsDDM1 was found to be essential to repress transcription of the two repetitive sequences, Centromeric Retrotransposons of Rice1 (CRR1) and the dominant centromeric CentO repeats. These results suggest that OsDDM1 antagonizes RdDM at heterochromatin and represses tissue-specific expression of ncRNA from repetitive sequences in the rice genome. {copyright, serif} 2018 American Society of Plant Biologists. All rights reserved.
Decoding the usefulness of non-coding RNAs as breast cancer markers.

PubMed

Amorim, Maria; Salta, Sofia; Henrique, Rui; Jerónimo, Carmen

2016-09-15

Although important advances in the management of breast cancer (BC) have been recently accomplished, it still constitutes the leading cause of cancer death in women worldwide. BC is a heterogeneous and complex disease, making clinical prediction of outcome a very challenging task. In recent years, gene expression profiling emerged as a tool to assist in clinical decision, enabling the identification of genetic signatures that better predict prognosis and response to therapy. Nevertheless, translation to routine practice has been limited by economical and technical reasons and, thus, novel biomarkers, especially those requiring non-invasive or minimally invasive collection procedures, while retaining high sensitivity and specificity might represent a significant development in this field. An increasing amount of evidence demonstrates that non-coding RNAs (ncRNAs), particularly microRNAs (miRNAs) and long noncoding RNAs (lncRNAs), are aberrantly expressed in several cancers, including BC. miRNAs are of particular interest as new, easily accessible, cost-effective and non-invasive tools for precise management of BC patients because they circulate in bodily fluids (e.g., serum and plasma) in a very stable manner, enabling BC assessment and monitoring through liquid biopsies. This review focus on how ncRNAs have the potential to answer present clinical needs in the personalized management of patients with BC and comprehensively describes the state of the art on the role of ncRNAs in the diagnosis, prognosis and prediction of response to therapy in BC.
Effects of GWAS-Associated Genetic Variants on lncRNAs within IBD and T1D Candidate Loci

PubMed Central

Brorsson, Caroline A.; Pociot, Flemming

2014-01-01

Long non-coding RNAs are a new class of non-coding RNAs that are at the crosshairs in many human diseases such as cancers, cardiovascular disorders, inflammatory and autoimmune disease like Inflammatory Bowel Disease (IBD) and Type 1 Diabetes (T1D). Nearly 90% of the phenotype-associated single-nucleotide polymorphisms (SNPs) identified by genome-wide association studies (GWAS) lie outside of the protein coding regions, and map to the non-coding intervals. However, the relationship between phenotype-associated loci and the non-coding regions including the long non-coding RNAs (lncRNAs) is poorly understood. Here, we systemically identified all annotated IBD and T1D loci-associated lncRNAs, and mapped nominally significant GWAS/ImmunoChip SNPs for IBD and T1D within these lncRNAs. Additionally, we identified tissue-specific cis-eQTLs, and strong linkage disequilibrium (LD) signals associated with these SNPs. We explored sequence and structure based attributes of these lncRNAs, and also predicted the structural effects of mapped SNPs within them. We also identified lncRNAs in IBD and T1D that are under recent positive selection. Our analysis identified putative lncRNA secondary structure-disruptive SNPs within and in close proximity (+/−5 kb flanking regions) of IBD and T1D loci-associated candidate genes, suggesting that these RNA conformation-altering polymorphisms might be associated with diseased-phenotype. Disruption of lncRNA secondary structure due to presence of GWAS SNPs provides valuable information that could be potentially useful for future structure-function studies on lncRNAs. PMID:25144376
Natural Selection and Functional Potentials of Human Noncoding Elements Revealed by Analysis of Next Generation Sequencing Data

PubMed Central

Xu, Shuhua

2015-01-01

Noncoding DNA sequences (NCS) have attracted much attention recently due to their functional potentials. Here we attempted to reveal the functional roles of noncoding sequences from the point of view of natural selection that typically indicates the functional potentials of certain genomic elements. We analyzed nearly 37 million single nucleotide polymorphisms (SNPs) of Phase I data of the 1000 Genomes Project. We estimated a series of key parameters of population genetics and molecular evolution to characterize sequence variations of the noncoding genome within and between populations, and identified the natural selection footprints in NCS in worldwide human populations. Our results showed that purifying selection is prevalent and there is substantial constraint of variations in NCS, while positive selectionis more likely to be specific to some particular genomic regions and regional populations. Intriguingly, we observed larger fraction of non-conserved NCS variants with lower derived allele frequency in the genome, indicating possible functional gain of non-conserved NCS. Notably, NCS elements are enriched for potentially functional markers such as eQTLs, TF motif, and DNase I footprints in the genome. More interestingly, some NCS variants associated with diseases such as Alzheimer's disease, Type 1 diabetes, and immune-related bowel disorder (IBD) showed signatures of positive selection, although the majority of NCS variants, reported as risk alleles by genome-wide association studies, showed signatures of negative selection. Our analyses provided compelling evidence of natural selection forces on noncoding sequences in the human genome and advanced our understanding of their functional potentials that play important roles in disease etiology and human evolution. PMID:26053627
Dysregulation of non-coding RNAs in gastric cancer

PubMed Central

Yang, Qing; Zhang, Ren-Wen; Sui, Peng-Cheng; He, Hai-Tao; Ding, Lei

2015-01-01

Gastric cancer (GC) is one of the most common cancers in the world and a significant threat to the health of patients, especially those from China and Japan. The prognosis for patients with late stage GC receiving the standard of care treatment, including surgery, chemotherapy and radiotherapy, remains poor. Developing novel treatment strategies, identifying new molecules for targeted therapy, and devising screening techniques to detect this cancer in its early stages are needed for GC patients. The discovery of non-coding RNAs (ncRNAs), primarily microRNAs (miRNAs) and long non-coding RNAs (lncRNAs), helped to elucidate the mechanisms of tumorigenesis, diagnosis and treatment of GC. Recently, significant research has been conducted on non-coding RNAs and how the regulatory dysfunction of these RNAs impacts the tumorigenesis of GC. In this study, we review papers published in the last five years concerning the dysregulation of non-coding RNAs, especially miRNAs and lncRNAs, in GC. We summarize instances of aberrant expression of the ncRNAs in GC and their effect on survival-related events, including cell cycle regulation, AKT signaling, apoptosis and drug resistance. Additionally, we evaluate how ncRNA dysregulation affects the metastatic process, including the epithelial-mesenchymal transition, stem cells, transcription factor activity, and oncogene and tumor suppressor expression. Lastly, we determine how ncRNAs affect angiogenesis in the microenvironment of GC. We further discuss the use of ncRNAs as potential biomarkers for use in clinical screening, early diagnosis and prognosis of GC. At present, no ideal ncRNAs have been identified as targets for the treatment of GC. PMID:26494954
Identification and characterization of long non-coding RNAs in subcutaneous adipose tissue from castrated and intact full-sib pair Huainan male pigs

USDA-ARS?s Scientific Manuscript database

Testosterone deficiency is associated with obesity in humans. It has been proven that long non-coding RNAs (lncRNAs) regulate adipose tissue metabolism; therefore, we first study the role of lncRNAs on testosterone deficiency-induced fat deposition using castrated male pigs as the model animal. The ...
Dengue Non-coding RNA: TRIMmed for Transmission.

PubMed

Göertz, Giel P; Pijlman, Gorben P

2015-08-12

Dengue virus RNA is trimmed by the 5'→3' exoribonuclease XRN1 to produce an abundant, non-coding subgenomic flavivirus RNA (sfRNA) in infected cells. In a recent paper in Science, Manokaran et al. (2015) report that sfRNA binds TRIM25 to evade innate immune sensing of viral RNA by RIG-I. Copyright © 2015 Elsevier Inc. All rights reserved.
RRE: a tool for the extraction of non-coding regions surrounding annotated genes from genomic datasets.

PubMed

Lazzarato, F; Franceschinis, G; Botta, M; Cordero, F; Calogero, R A

2004-11-01

RRE allows the extraction of non-coding regions surrounding a coding sequence [i.e. gene upstream region, 5'-untranslated region (5'-UTR), introns, 3'-UTR, downstream region] from annotated genomic datasets available at NCBI. RRE parser and web-based interface are accessible at http://www.bioinformatica.unito.it/bioinformatics/rre/rre.html

Some links on this page may take you to non-federal websites. Their policies may differ from this site.