noncoding dna elements: Topics by Science.gov

Sample records for noncoding dna elements

A Surrogate Approach to Study the Evolution of Noncoding DNA Elements That Organize Eukaryotic Genomes

PubMed Central

Vermaak, Danielle; Bayes, Joshua J.

2009-01-01

Comparative genomics provides a facile way to address issues of evolutionary constraint acting on different elements of the genome. However, several important DNA elements have not reaped the benefits of this new approach. Some have proved intractable to current day sequencing technology. These include centromeric and heterochromatic DNA, which are essential for chromosome segregation as well as gene regulation, but the highly repetitive nature of the DNA sequences in these regions make them difficult to assemble into longer contigs. Other sequences, like dosage compensation X chromosomal sites, origins of DNA replication, or heterochromatic sequences that encode piwi-associated RNAs, have proved difficult to study because they do not have recognizable DNA features that allow them to be described functionally or computationally. We have employed an alternate approach to the direct study of these DNA elements. By using proteins that specifically bind these noncoding DNAs as surrogates, we can indirectly assay the evolutionary constraints acting on these important DNA elements. We review the impact that such “surrogate strategies” have had on our understanding of the evolutionary constraints shaping centromeres, origins of DNA replication, and dosage compensation X chromosomal sites. These have begun to reveal that in contrast to the view that such structural DNA elements are either highly constrained (under purifying selection) or free to drift (under neutral evolution), some of them may instead be shaped by adaptive evolution and genetic conflicts (these are not mutually exclusive). These insights also help to explain why the same elements (e.g., centromeres and replication origins), which are so complex in some eukaryotic genomes, can be simple and well defined in other where similar conflicts do not exist. PMID:19635763
Characterization of noncoding regulatory DNA in the human genome.

PubMed

Elkon, Ran; Agami, Reuven

2017-08-08

Genetic variants associated with common diseases are usually located in noncoding parts of the human genome. Delineation of the full repertoire of functional noncoding elements, together with efficient methods for probing their biological roles, is therefore of crucial importance. Over the past decade, DNA accessibility and various epigenetic modifications have been associated with regulatory functions. Mapping these features across the genome has enabled researchers to begin to document the full complement of putative regulatory elements. High-throughput reporter assays to probe the functions of regulatory regions have also been developed but these methods separate putative regulatory elements from the chromosome so that any effects of chromatin context and long-range regulatory interactions are lost. Definitive assignment of function(s) to putative cis-regulatory elements requires perturbation of these elements. Genome-editing technologies are now transforming our ability to perturb regulatory elements across entire genomes. Interpretation of high-throughput genetic screens that incorporate genome editors might enable the construction of an unbiased map of functional noncoding elements in the human genome.
DNA rearrangements directed by non-coding RNAs in ciliates

PubMed Central

Mochizuki, Kazufumi

2013-01-01

Extensive programmed rearrangement of DNA, including DNA elimination, chromosome fragmentation, and DNA descrambling, takes place in the newly developed macronucleus during the sexual reproduction of ciliated protozoa. Recent studies have revealed that two distant classes of ciliates use distinct types of non-coding RNAs to regulate such DNA rearrangement events. DNA elimination in Tetrahymena is regulated by small non-coding RNAs that are produced and utilized in an RNAi-related process. It has been proposed that the small RNAs produced from the micronuclear genome are used to identify eliminated DNA sequences by whole-genome comparison between the parental macronucleus and the micronucleus. In contrast, DNA descrambling in Oxytricha is guided by long non-coding RNAs that are produced from the parental macronuclear genome. These long RNAs are proposed to act as templates for the direct descrambling events that occur in the developing macronucleus. Both cases provide useful examples to study epigenetic chromatin regulation by non-coding RNAs. PMID:21956937
Scaling features of noncoding DNA

NASA Technical Reports Server (NTRS)

Stanley, H. E.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Peng, C. K.; Simons, M.

1999-01-01

We review evidence supporting the idea that the DNA sequence in genes containing noncoding regions is correlated, and that the correlation is remarkably long range--indeed, base pairs thousands of base pairs distant are correlated. We do not find such a long-range correlation in the coding regions of the gene, and utilize this fact to build a Coding Sequence Finder Algorithm, which uses statistical ideas to locate the coding regions of an unknown DNA sequence. Finally, we describe briefly some recent work adapting to DNA the Zipf approach to analyzing linguistic texts, and the Shannon approach to quantifying the "redundancy" of a linguistic text in terms of a measurable entropy function, and reporting that noncoding regions in eukaryotes display a larger redundancy than coding regions. Specifically, we consider the possibility that this result is solely a consequence of nucleotide concentration differences as first noted by Bonhoeffer and his collaborators. We find that cytosine-guanine (CG) concentration does have a strong "background" effect on redundancy. However, we find that for the purine-pyrimidine binary mapping rule, which is not affected by the difference in CG concentration, the Shannon redundancy for the set of analyzed sequences is larger for noncoding regions compared to coding regions.
Junk DNA and the long non-coding RNA twist in cancer genetics

PubMed Central

Ling, Hui; Vincent, Kimberly; Pichler, Martin; Fodde, Riccardo; Berindan-Neagoe, Ioana; Slack, Frank J.; Calin, George A

2015-01-01

The central dogma of molecular biology states that the flow of genetic information moves from DNA to RNA to protein. However, in the last decade this dogma has been challenged by new findings on non-coding RNAs (ncRNAs) such as microRNAs (miRNAs). More recently, long non-coding RNAs (lncRNAs) have attracted much attention due to their large number and biological significance. Many lncRNAs have been identified as mapping to regulatory elements including gene promoters and enhancers, ultraconserved regions, and intergenic regions of protein-coding genes. Yet, the biological function and molecular mechanisms of lncRNA in human diseases in general and cancer in particular remain largely unknown. Data from the literature suggest that lncRNA, often via interaction with proteins, functions in specific genomic loci or use their own transcription loci for regulatory activity. In this review, we summarize recent findings supporting the importance of DNA loci in lncRNA function, and the underlying molecular mechanisms via cis or trans regulation, and discuss their implications in cancer. In addition, we use the 8q24 genomic locus, a region containing interactive SNPs, DNA regulatory elements and lncRNAs, as an example to illustrate how single nucleotide polymorphism (SNP) located within lncRNAs may be functionally associated with the individual’s susceptibility to cancer. PMID:25619839
Functional interrogation of non-coding DNA through CRISPR genome editing.

PubMed

Canver, Matthew C; Bauer, Daniel E; Orkin, Stuart H

2017-05-15

Methodologies to interrogate non-coding regions have lagged behind coding regions despite comprising the vast majority of the genome. However, the rapid evolution of clustered regularly interspaced short palindromic repeats (CRISPR)-based genome editing has provided a multitude of novel techniques for laboratory investigation including significant contributions to the toolbox for studying non-coding DNA. CRISPR-mediated loss-of-function strategies rely on direct disruption of the underlying sequence or repression of transcription without modifying the targeted DNA sequence. CRISPR-mediated gain-of-function approaches similarly benefit from methods to alter the targeted sequence through integration of customized sequence into the genome as well as methods to activate transcription. Here we review CRISPR-based loss- and gain-of-function techniques for the interrogation of non-coding DNA. Copyright © 2017 Elsevier Inc. All rights reserved.
Functional interrogation of non-coding DNA through CRISPR genome editing

PubMed Central

Canver, Matthew C.; Bauer, Daniel E.; Orkin, Stuart H.

2017-01-01

Methodologies to interrogate non-coding regions have lagged behind coding regions despite comprising the vast majority of the genome. However, the rapid evolution of clustered regularly interspaced short palindromic repeats (CRISPR)-based genome editing has provided a multitude of novel techniques for laboratory investigation including significant contributions to the toolbox for studying non-coding DNA. CRISPR-mediated loss-of-function strategies rely on direct disruption of the underlying sequence or repression of transcription without modifying the targeted DNA sequence. CRISPR-mediated gain-of-function approaches similarly benefit from methods to alter the targeted sequence through integration of customized sequence into the genome as well as methods to activate transcription. Here we review CRISPR-based loss- and gain-of-function techniques for the interrogation of non-coding DNA. PMID:28288828
Analysis of conserved noncoding DNA in Drosophila reveals similar constraints in intergenic and intronic sequences.

PubMed

Bergman, C M; Kreitman, M

2001-08-01

Comparative genomic approaches to gene and cis-regulatory prediction are based on the principle that differential DNA sequence conservation reflects variation in functional constraint. Using this principle, we analyze noncoding sequence conservation in Drosophila for 40 loci with known or suspected cis-regulatory function encompassing >100 kb of DNA. We estimate the fraction of noncoding DNA conserved in both intergenic and intronic regions and describe the length distribution of ungapped conserved noncoding blocks. On average, 22%-26% of noncoding sequences surveyed are conserved in Drosophila, with median block length approximately 19 bp. We show that point substitution in conserved noncoding blocks exhibits transition bias as well as lineage effects in base composition, and occurs more than an order of magnitude more frequently than insertion/deletion (indel) substitution. Overall, patterns of noncoding DNA structure and evolution differ remarkably little between intergenic and intronic conserved blocks, suggesting that the effects of transcription per se contribute minimally to the constraints operating on these sequences. The results of this study have implications for the development of alignment and prediction algorithms specific to noncoding DNA, as well as for models of cis-regulatory DNA sequence evolution.
The protective function of noncoding DNA in genome defense of eukaryotic male germ cells.

PubMed

Qiu, Guo-Hua; Huang, Cuiqin; Zheng, Xintian; Yang, Xiaoyan

2018-04-01

Peripheral and abundant noncoding DNA has been hypothesized to protect the genome and the central protein-coding sequences against DNA damage in somatic genome. In the cytosol, invading exogenous nucleic acids may first be deactivated by small RNAs encoded by noncoding DNA via mechanisms similar to the prokaryotic CRISPR-Cas system. In the nucleus, the radicals generated by radiation in the cytosol, radiation energy and invading exogenous nucleic acids are absorbed, blocked and/or reduced by peripheral heterochromatin, and damaged DNA in heterochromatin is removed and excluded from the nucleus to the cytoplasm through nuclear pore complexes. To further strengthen the hypothesis, this review summarizes the experimental evidence supporting the protective function of noncoding DNA in the genome of male germ cells. Based on these data, this review provides evidence supporting the protective role of noncoding DNA in the genome defense of sperm genome through similar mechanisms to those of the somatic genome.
Noncoding sequence classification based on wavelet transform analysis: part I

NASA Astrophysics Data System (ADS)

Paredes, O.; Strojnik, M.; Romo-Vázquez, R.; Vélez Pérez, H.; Ranta, R.; Garcia-Torales, G.; Scholl, M. K.; Morales, J. A.

2017-09-01

DNA sequences in human genome can be divided into the coding and noncoding ones. Coding sequences are those that are read during the transcription. The identification of coding sequences has been widely reported in literature due to its much-studied periodicity. Noncoding sequences represent the majority of the human genome. They play an important role in gene regulation and differentiation among the cells. However, noncoding sequences do not exhibit periodicities that correlate to their functions. The ENCODE (Encyclopedia of DNA elements) and Epigenomic Roadmap Project projects have cataloged the human noncoding sequences into specific functions. We study characteristics of noncoding sequences with wavelet analysis of genomic signals.
Genome defense against exogenous nucleic acids in eukaryotes by non-coding DNA occurs through CRISPR-like mechanisms in the cytosol and the bodyguard protection in the nucleus.

PubMed

Qiu, Guo-Hua

2016-01-01

In this review, the protective function of the abundant non-coding DNA in the eukaryotic genome is discussed from the perspective of genome defense against exogenous nucleic acids. Peripheral non-coding DNA has been proposed to act as a bodyguard that protects the genome and the central protein-coding sequences from ionizing radiation-induced DNA damage. In the proposed mechanism of protection, the radicals generated by water radiolysis in the cytosol and IR energy are absorbed, blocked and/or reduced by peripheral heterochromatin; then, the DNA damage sites in the heterochromatin are removed and expelled from the nucleus to the cytoplasm through nuclear pore complexes, most likely through the formation of extrachromosomal circular DNA. To strengthen this hypothesis, this review summarizes the experimental evidence supporting the protective function of non-coding DNA against exogenous nucleic acids. Based on these data, I hypothesize herein about the presence of an additional line of defense formed by small RNAs in the cytosol in addition to their bodyguard protection mechanism in the nucleus. Therefore, exogenous nucleic acids may be initially inactivated in the cytosol by small RNAs generated from non-coding DNA via mechanisms similar to the prokaryotic CRISPR-Cas system. Exogenous nucleic acids may enter the nucleus, where some are absorbed and/or blocked by heterochromatin and others integrate into chromosomes. The integrated fragments and the sites of DNA damage are removed by repetitive non-coding DNA elements in the heterochromatin and excluded from the nucleus. Therefore, the normal eukaryotic genome and the central protein-coding sequences are triply protected by non-coding DNA against invasion by exogenous nucleic acids. This review provides evidence supporting the protective role of non-coding DNA in genome defense. Copyright © 2016 Elsevier B.V. All rights reserved.
Systematic analysis of coding and noncoding DNA sequences using methods of statistical linguistics

NASA Technical Reports Server (NTRS)

Mantegna, R. N.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Peng, C. K.; Simons, M.; Stanley, H. E.

1995-01-01

We compare the statistical properties of coding and noncoding regions in eukaryotic and viral DNA sequences by adapting two tests developed for the analysis of natural languages and symbolic sequences. The data set comprises all 30 sequences of length above 50 000 base pairs in GenBank Release No. 81.0, as well as the recently published sequences of C. elegans chromosome III (2.2 Mbp) and yeast chromosome XI (661 Kbp). We find that for the three chromosomes we studied the statistical properties of noncoding regions appear to be closer to those observed in natural languages than those of coding regions. In particular, (i) a n-tuple Zipf analysis of noncoding regions reveals a regime close to power-law behavior while the coding regions show logarithmic behavior over a wide interval, while (ii) an n-gram entropy measurement shows that the noncoding regions have a lower n-gram entropy (and hence a larger "n-gram redundancy") than the coding regions. In contrast to the three chromosomes, we find that for vertebrates such as primates and rodents and for viral DNA, the difference between the statistical properties of coding and noncoding regions is not pronounced and therefore the results of the analyses of the investigated sequences are less conclusive. After noting the intrinsic limitations of the n-gram redundancy analysis, we also briefly discuss the failure of the zeroth- and first-order Markovian models or simple nucleotide repeats to account fully for these "linguistic" features of DNA. Finally, we emphasize that our results by no means prove the existence of a "language" in noncoding DNA.
DDM1 represses noncoding RNA expression and RNA-directed DNA methylation in heterochromatin.

PubMed

Tan, Feng; Lu, Yue; Jiang, Wei; Zhao, Yu; Wu, Tian; Zhang, Ruoyu; Zhou, Dao-Xiu

2018-05-24

Cytosine methylation of DNA, which occurs at CG, CHG, and CHH (H=A, C, or T) sequences in plants, is a hallmark for epigenetic repression of repetitive sequences. The chromatin remodeling factor DECREASE IN DNA METHYLATION1 (DDM1) is essential for DNA methylation, especially at CG and CHG sequences. However, its potential role in RNA-directed DNA methylation (RdDM) and in chromatin function is not completely understood in rice (Oryza sativa). In this work, we used high-throughput approaches to study the function of rice DDM1 (OsDDM1) in RdDM and the expression of non-coding RNA (ncRNA). We show that loss of function of OsDDM1 results in ectopic CHH methylation of transposable elements and repeats. The ectopic CHH methylation was dependent on rice DOMAINS REARRANGED METHYLTRANSFERASE2 (OsDRM2), a DNA methyltransferase involved in RdDM. Mutations in OsDDM1 lead to decreases of histone H3K9me2 and increases in the levels of heterochromatic small RNA (sRNA) and long noncoding RNA (lncRNA). In particular, OsDDM1 was found to be essential to repress transcription of the two repetitive sequences, Centromeric Retrotransposons of Rice1 (CRR1) and the dominant centromeric CentO repeats. These results suggest that OsDDM1 antagonizes RdDM at heterochromatin and represses tissue-specific expression of ncRNA from repetitive sequences in the rice genome. {copyright, serif} 2018 American Society of Plant Biologists. All rights reserved.
Trichodesmium genome maintains abundant, widespread noncoding DNA in situ, despite oligotrophic lifestyle

DOE PAGES

Walworth, Nathan; Pfreundt, Ulrike; Nelson, William C.; ...

2015-03-23

Understanding the evolution of the free-living, cyanobacterial, diazotroph Trichodesmium is of great importance because of its critical role in oceanic biogeochemistry and primary production. Unlike the other >150 available genomes of free-living cyanobacteria, only 63.8% of the Trichodesmium erythraeum (strain IMS101) genome is predicted to encode protein, which is 20–25% less than the average for other cyanobacteria and nonpathogenic, free-living bacteria. In this paper, we use distinctive isolates and metagenomic data to show that low coding density observed in IMS101 is a common feature of the Trichodesmium genus, both in culture and in situ. Transcriptome analysis indicates that 86% ofmore » the noncoding space is expressed, although the function of these transcripts is unclear. The density of noncoding, possible regulatory elements predicted in Trichodesmium, when normalized per intergenic kilobase, was comparable and twofold higher than that found in the gene-dense genomes of the sympatric cyanobacterial genera Synechococcus and Prochlorococcus, respectively. Conserved Trichodesmium noncoding RNA secondary structures were predicted between most culture and metagenomic sequences, lending support to the structural conservation. Conservation of these intergenic regions in spatiotemporally separated Trichodesmium populations suggests possible genus-wide selection for their maintenance. These large intergenic spacers may have developed during intervals of strong genetic drift caused by periodic blooms of a subset of genotypes, which may have reduced effective population size. Finally, our data suggest that transposition of selfish DNA, low effective population size, and high-fidelity replication allowed the unusual “inflation” of noncoding sequence observed in Trichodesmium despite its oligotrophic lifestyle.« less
Natural Selection and Functional Potentials of Human Noncoding Elements Revealed by Analysis of Next Generation Sequencing Data

PubMed Central

Xu, Shuhua

2015-01-01

Noncoding DNA sequences (NCS) have attracted much attention recently due to their functional potentials. Here we attempted to reveal the functional roles of noncoding sequences from the point of view of natural selection that typically indicates the functional potentials of certain genomic elements. We analyzed nearly 37 million single nucleotide polymorphisms (SNPs) of Phase I data of the 1000 Genomes Project. We estimated a series of key parameters of population genetics and molecular evolution to characterize sequence variations of the noncoding genome within and between populations, and identified the natural selection footprints in NCS in worldwide human populations. Our results showed that purifying selection is prevalent and there is substantial constraint of variations in NCS, while positive selectionis more likely to be specific to some particular genomic regions and regional populations. Intriguingly, we observed larger fraction of non-conserved NCS variants with lower derived allele frequency in the genome, indicating possible functional gain of non-conserved NCS. Notably, NCS elements are enriched for potentially functional markers such as eQTLs, TF motif, and DNase I footprints in the genome. More interestingly, some NCS variants associated with diseases such as Alzheimer's disease, Type 1 diabetes, and immune-related bowel disorder (IBD) showed signatures of positive selection, although the majority of NCS variants, reported as risk alleles by genome-wide association studies, showed signatures of negative selection. Our analyses provided compelling evidence of natural selection forces on noncoding sequences in the human genome and advanced our understanding of their functional potentials that play important roles in disease etiology and human evolution. PMID:26053627
DNA topoisomerase 1α promotes transcriptional silencing of transposable elements through DNA methylation and histone lysine 9 dimethylation in Arabidopsis.

PubMed

Dinh, Thanh Theresa; Gao, Lei; Liu, Xigang; Li, Dongming; Li, Shengben; Zhao, Yuanyuan; O'Leary, Michael; Le, Brandon; Schmitz, Robert J; Manavella, Pablo A; Manavella, Pablo; Li, Shaofang; Weigel, Detlef; Pontes, Olga; Ecker, Joseph R; Chen, Xuemei

2014-07-01

RNA-directed DNA methylation (RdDM) and histone H3 lysine 9 dimethylation (H3K9me2) are related transcriptional silencing mechanisms that target transposable elements (TEs) and repeats to maintain genome stability in plants. RdDM is mediated by small and long noncoding RNAs produced by the plant-specific RNA polymerases Pol IV and Pol V, respectively. Through a chemical genetics screen with a luciferase-based DNA methylation reporter, LUCL, we found that camptothecin, a compound with anti-cancer properties that targets DNA topoisomerase 1α (TOP1α) was able to de-repress LUCL by reducing its DNA methylation and H3K9me2 levels. Further studies with Arabidopsis top1α mutants showed that TOP1α silences endogenous RdDM loci by facilitating the production of Pol V-dependent long non-coding RNAs, AGONAUTE4 recruitment and H3K9me2 deposition at TEs and repeats. This study assigned a new role in epigenetic silencing to an enzyme that affects DNA topology.
Trichodesmium genome maintains abundant, widespread noncoding DNA in situ, despite oligotrophic lifestyle

DOE PAGES

Walworth, Nathan G.; Pfreundt, Ulrike; Nelson, William C.; ...

2015-04-07

Understanding the evolution of the free-living, cyanobacterial, diazotroph Trichodesmium is of great importance due to its critical role in oceanic biogeochemistry and primary production. Unlike the other >150 available genomes of free-living cyanobacteria, only 63.8% of the Trichodesmium erythraeum (strain IMS101) genome is predicted to encode protein, which is 20-25% less than the average for other cyanobacteria and non-pathogenic, free-living bacteria. We use distinctive isolates and metagenomic data to show that low coding density observed in IMS101 is a common feature of the Trichodesmium genus both in culture and in situ. Transcriptome analysis indicates that 86% of the non-coding spacemore » is expressed, although the function of these transcripts is unclear. The density of noncoding, possible regulatory elements predicted in Trichodesmium, when normalized per intergenic kilobase, was comparable and two fold higher than that found in the gene dense genomes of the sympatric cyanobacterial genera Synechococcus and Prochlorococcus, respectively. Conserved Trichodesmium ncRNA secondary structures were predicted between most culture and metagenomic sequences lending support to the structural conservation. Conservation of these intergenic regions in spatiotemporally separated Trichodesmium populations suggests possible genus-wide selection for their maintenance. These large intergenic spacers may have developed during intervals of strong genetic drift caused by periodic blooms of a subset of genotypes, which may have reduced effective population size. Our data suggest that transposition of selfish DNA, low effective population size, and high fidelity replication allowed the unusual ‘inflation’ of noncoding sequence observed in Trichodesmium despite its oligotrophic lifestyle.« less
Long-range correlation properties of coding and noncoding DNA sequences: GenBank analysis.

PubMed

Buldyrev, S V; Goldberger, A L; Havlin, S; Mantegna, R N; Matsa, M E; Peng, C K; Simons, M; Stanley, H E

1995-05-01

An open question in computational molecular biology is whether long-range correlations are present in both coding and noncoding DNA or only in the latter. To answer this question, we consider all 33301 coding and all 29453 noncoding eukaryotic sequences--each of length larger than 512 base pairs (bp)--in the present release of the GenBank to dtermine whether there is any statistically significant distinction in their long-range correlation properties. Standard fast Fourier transform (FFT) analysis indicates that coding sequences have practically no correlations in the range from 10 bp to 100 bp (spectral exponent beta=0.00 +/- 0.04, where the uncertainty is two standard deviations). In contrast, for noncoding sequences, the average value of the spectral exponent beta is positive (0.16 +/- 0.05) which unambiguously shows the presence of long-range correlations. We also separately analyze the 874 coding and the 1157 noncoding sequences that have more than 4096 bp and find a larger region of power-law behavior. We calculate the probability that these two data sets (coding and noncoding) were drawn from the same distribution and we find that it is less than 10(-10). We obtain independent confirmation of these findings using the method of detrended fluctuation analysis (DFA), which is designed to treat sequences with statistical heterogeneity, such as DNA's known mosaic structure ("patchiness") arising from the nonstationarity of nucleotide concentration. The near-perfect agreement between the two independent analysis methods, FFT and DFA, increases the confidence in the reliability of our conclusion.
Long-range correlation properties of coding and noncoding DNA sequences: GenBank analysis

NASA Technical Reports Server (NTRS)

Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Mantegna, R. N.; Matsa, M. E.; Peng, C. K.; Simons, M.; Stanley, H. E.

1995-01-01

An open question in computational molecular biology is whether long-range correlations are present in both coding and noncoding DNA or only in the latter. To answer this question, we consider all 33301 coding and all 29453 noncoding eukaryotic sequences--each of length larger than 512 base pairs (bp)--in the present release of the GenBank to dtermine whether there is any statistically significant distinction in their long-range correlation properties. Standard fast Fourier transform (FFT) analysis indicates that coding sequences have practically no correlations in the range from 10 bp to 100 bp (spectral exponent beta=0.00 +/- 0.04, where the uncertainty is two standard deviations). In contrast, for noncoding sequences, the average value of the spectral exponent beta is positive (0.16 +/- 0.05) which unambiguously shows the presence of long-range correlations. We also separately analyze the 874 coding and the 1157 noncoding sequences that have more than 4096 bp and find a larger region of power-law behavior. We calculate the probability that these two data sets (coding and noncoding) were drawn from the same distribution and we find that it is less than 10(-10). We obtain independent confirmation of these findings using the method of detrended fluctuation analysis (DFA), which is designed to treat sequences with statistical heterogeneity, such as DNA's known mosaic structure ("patchiness") arising from the nonstationarity of nucleotide concentration. The near-perfect agreement between the two independent analysis methods, FFT and DFA, increases the confidence in the reliability of our conclusion.
Defining functional DNA elements in the human genome

PubMed Central

Kellis, Manolis; Wold, Barbara; Snyder, Michael P.; Bernstein, Bradley E.; Kundaje, Anshul; Marinov, Georgi K.; Ward, Lucas D.; Birney, Ewan; Crawford, Gregory E.; Dekker, Job; Dunham, Ian; Elnitski, Laura L.; Farnham, Peggy J.; Feingold, Elise A.; Gerstein, Mark; Giddings, Morgan C.; Gilbert, David M.; Gingeras, Thomas R.; Green, Eric D.; Guigo, Roderic; Hubbard, Tim; Kent, Jim; Lieb, Jason D.; Myers, Richard M.; Pazin, Michael J.; Ren, Bing; Stamatoyannopoulos, John A.; Weng, Zhiping; White, Kevin P.; Hardison, Ross C.

2014-01-01

With the completion of the human genome sequence, attention turned to identifying and annotating its functional DNA elements. As a complement to genetic and comparative genomics approaches, the Encyclopedia of DNA Elements Project was launched to contribute maps of RNA transcripts, transcriptional regulator binding sites, and chromatin states in many cell types. The resulting genome-wide data reveal sites of biochemical activity with high positional resolution and cell type specificity that facilitate studies of gene regulation and interpretation of noncoding variants associated with human disease. However, the biochemically active regions cover a much larger fraction of the genome than do evolutionarily conserved regions, raising the question of whether nonconserved but biochemically active regions are truly functional. Here, we review the strengths and limitations of biochemical, evolutionary, and genetic approaches for defining functional DNA segments, potential sources for the observed differences in estimated genomic coverage, and the biological implications of these discrepancies. We also analyze the relationship between signal intensity, genomic coverage, and evolutionary conservation. Our results reinforce the principle that each approach provides complementary information and that we need to use combinations of all three to elucidate genome function in human biology and disease. PMID:24753594

Conserved Noncoding Elements in the Most Distant Genera of Cephalochordates: The Goldilocks Principle

PubMed Central

Yue, Jia-Xing; Kozmikova, Iryna; Ono, Hiroki; Nossa, Carlos W.; Kozmik, Zbynek; Putnam, Nicholas H.; Yu, Jr-Kai; Holland, Linda Z.

2016-01-01

Cephalochordates, the sister group of vertebrates + tunicates, are evolving particularly slowly. Therefore, genome comparisons between two congeners of Branchiostoma revealed so many conserved noncoding elements (CNEs), that it was not clear how many are functional regulatory elements. To more effectively identify CNEs with potential regulatory functions, we compared noncoding sequences of genomes of the most phylogenetically distant cephalochordate genera, Asymmetron and Branchiostoma, which diverged approximately 120–160 million years ago. We found 113,070 noncoding elements conserved between the two species, amounting to 3.3% of the genome. The genomic distribution, target gene ontology, and enriched motifs of these CNEs all suggest that many of them are probably cis-regulatory elements. More than 90% of previously verified amphioxus regulatory elements were re-captured in this study. A search of the cephalochordate CNEs around 50 developmental genes in several vertebrate genomes revealed eight CNEs conserved between cephalochordates and vertebrates, indicating sequence conservation over >500 million years of divergence. The function of five CNEs was tested in reporter assays in zebrafish, and one was also tested in amphioxus. All five CNEs proved to be tissue-specific enhancers. Taken together, these findings indicate that even though Branchiostoma and Asymmetron are distantly related, as they are evolving slowly, comparisons between them are likely optimal for identifying most of their tissue-specific cis-regulatory elements laying the foundation for functional characterizations and a better understanding of the evolution of developmental regulation in cephalochordates. PMID:27412606
Transcriptional regulatory elements in the noncoding region of human papillomavirus type 6

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wu, Tzyy-Choou.

1989-01-01

The structure and function of the transcriptional regulatory region of human papillomavirus type 6 (HPV-6) has been investigated. To investigate tissue specific gene expression, a sensitive method to detect and localize HPV-6 viral DNA, mRNA and protein in plastic-embedded tissue sections of genital and respiratory tract papillomata by using in situ hybridization and immunoperoxidase assays has been developed. This method, using ultrathin sections and strand-specific {sup 3}H labeled riboprobes, offers the advantages of superior morphological preservation and detection of viral genomes at low copy number with good resolution, and the modified immunocytochemistry provides better sensitivity. The results suggest that genitalmore » tract epithelium is more permissive for HPV-6 replication than respiratory tract epithelium. To study the tissue tropism of HPV-6 at the level of regulation of viral gene expression, the polymerase chain reaction was used to isolate the noncoding region (NCR) of HPV-6 in independent isolates. Nucleotide sequence analysis of molecularly cloned DNA identified base substitutions, deletions/insertions and tandem duplications. Transcriptional regulatory elements in the NCR were assayed in recombinant plasmids containing the bacterial gene for chloramphenicol acetyl transferase.« less
RNA-DNA Triplex Formation by Long Noncoding RNAs.

PubMed

Li, Yue; Syed, Junetha; Sugiyama, Hiroshi

2016-11-17

Long noncoding RNAs (lncRNAs) play a pivotal role in the regulation of biological processes through various mechanisms that are not fully understood. Proposed mechanisms include regulation based on RNA-protein interactions, as well as RNA-RNA interactions and RNA-DNA interactions. Here, we focus on one possible mechanism that lncRNA might be using to impact biological function, the RNA-DNA triplex formation. We summarize currently available examples of lncRNA triplex formation and discuss the details surrounding orientation of triplex formation as one of the key properties guiding this process. We propose that symmetrical triplex-forming motifs, especially those in cis-acting lncRNAs, favor triplex formation. We also consider the effects of lncRNA structures, protein or ligand binding, and chromatin structures on the lncRNAs triplex formation. Copyright © 2016 Elsevier Ltd. All rights reserved.
Noncoding RNAs in DNA Repair and Genome Integrity

PubMed Central

Wan, Guohui; Liu, Yunhua; Han, Cecil; Zhang, Xinna

2014-01-01

Abstract Significance: The well-studied sequences in the human genome are those of protein-coding genes, which account for only 1%–2% of the total genome. However, with the advent of high-throughput transcriptome sequencing technology, we now know that about 90% of our genome is extensively transcribed and that the vast majority of them are transcribed into noncoding RNAs (ncRNAs). It is of great interest and importance to decipher the functions of these ncRNAs in humans. Recent Advances: In the last decade, it has become apparent that ncRNAs play a crucial role in regulating gene expression in normal development, in stress responses to internal and environmental stimuli, and in human diseases. Critical Issues: In addition to those constitutively expressed structural RNA, such as ribosomal and transfer RNAs, regulatory ncRNAs can be classified as microRNAs (miRNAs), Piwi-interacting RNAs (piRNAs), small interfering RNAs (siRNAs), small nucleolar RNAs (snoRNAs), and long noncoding RNAs (lncRNAs). However, little is known about the biological features and functional roles of these ncRNAs in DNA repair and genome instability, although a number of miRNAs and lncRNAs are regulated in the DNA damage response. Future Directions: A major goal of modern biology is to identify and characterize the full profile of ncRNAs with regard to normal physiological functions and roles in human disorders. Clinically relevant ncRNAs will also be evaluated and targeted in therapeutic applications. Antioxid. Redox Signal. 20, 655–677. PMID:23879367
Transposable elements (TEs) contribute to stress-related long intergenic noncoding RNAs in plants.

PubMed

Wang, Dong; Qu, Zhipeng; Yang, Lan; Zhang, Qingzhu; Liu, Zhi-Hong; Do, Trung; Adelson, David L; Wang, Zhen-Yu; Searle, Iain; Zhu, Jian-Kang

2017-04-01

Noncoding RNAs have been extensively described in plant and animal transcriptomes by using high-throughput sequencing technology. Of these noncoding RNAs, a growing number of long intergenic noncoding RNAs (lincRNAs) have been described in multicellular organisms, however the origins and functions of many lincRNAs remain to be explored. In many eukaryotic genomes, transposable elements (TEs) are widely distributed and often account for large fractions of plant and animal genomes yet the contribution of TEs to lincRNAs is largely unknown. By using strand-specific RNA-sequencing, we profiled the expression patterns of lincRNAs in Arabidopsis, rice and maize, and identified 47 611 and 398 TE-associated lincRNAs (TE-lincRNAs), respectively. TE-lincRNAs were more often derived from retrotransposons than DNA transposons and as retrotransposon copy number in both rice and maize genomes so did TE-lincRNAs. We validated the expression of these TE-lincRNAs by strand-specific RT-PCR and also demonstrated tissue-specific transcription and stress-induced TE-lincRNAs either after salt, abscisic acid (ABA) or cold treatments. For Arabidopsis TE-lincRNA11195, mutants had reduced sensitivity to ABA as demonstrated by longer roots and higher shoot biomass when compared to wild-type. Finally, by altering the chromatin state in the Arabidopsis chromatin remodelling mutant ddm1, unique lincRNAs including TE-lincRNAs were generated from the preceding untranscribed regions and interestingly inherited in a wild-type background in subsequent generations. Our findings not only demonstrate that TE-associated lincRNAs play important roles in plant abiotic stress responses but lincRNAs and TE-lincRNAs might act as an adaptive reservoir in eukaryotes. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.
The 'dark matter' in the plant genomes: non-coding and unannotated DNA sequences associated with open chromatin.

PubMed

Jiang, Jiming

2015-04-01

Sequencing of complete plant genomes has become increasingly more routine since the advent of the next-generation sequencing technology. Identification and annotation of large amounts of noncoding but functional DNA sequences, including cis-regulatory DNA elements (CREs), have become a new frontier in plant genome research. Genomic regions containing active CREs bound to regulatory proteins are hypersensitive to DNase I digestion and are called DNase I hypersensitive sites (DHSs). Several recent DHS studies in plants illustrate that DHS datasets produced by DNase I digestion followed by next-generation sequencing (DNase-seq) are highly valuable for the identification and characterization of CREs associated with plant development and responses to environmental cues. DHS-based genomic profiling has opened a door to identify and annotate the 'dark matter' in sequenced plant genomes. Copyright © 2015 Elsevier Ltd. All rights reserved.
Transposable Elements and DNA Methylation Create in Embryonic Stem Cells Human-Specific Regulatory Sequences Associated with Distal Enhancers and Noncoding RNAs

PubMed Central

Glinsky, Gennadi V.

2015-01-01

Despite significant progress in the structural and functional characterization of the human genome, understanding of the mechanisms underlying the genetic basis of human phenotypic uniqueness remains limited. Here, I report that transposable element-derived sequences, most notably LTR7/HERV-H, LTR5_Hs, and L1HS, harbor 99.8% of the candidate human-specific regulatory loci (HSRL) with putative transcription factor-binding sites in the genome of human embryonic stem cells (hESC). A total of 4,094 candidate HSRL display selective and site-specific binding of critical regulators (NANOG [Nanog homeobox], POU5F1 [POU class 5 homeobox 1], CCCTC-binding factor [CTCF], Lamin B1), and are preferentially located within the matrix of transcriptionally active DNA segments that are hypermethylated in hESC. hESC-specific NANOG-binding sites are enriched near the protein-coding genes regulating brain size, pluripotency long noncoding RNAs, hESC enhancers, and 5-hydroxymethylcytosine-harboring regions immediately adjacent to binding sites. Sequences of only 4.3% of hESC-specific NANOG-binding sites are present in Neanderthals’ genome, suggesting that a majority of these regulatory elements emerged in Modern Humans. Comparisons of estimated creation rates of novel TF-binding sites revealed that there was 49.7-fold acceleration of creation rates of NANOG-binding sites in genomes of Chimpanzees compared with the mouse genomes and further 5.7-fold acceleration in genomes of Modern Humans compared with the Chimpanzees genomes. Preliminary estimates suggest that emergence of one novel NANOG-binding site detectable in hESC required 466 years of evolution. Pathway analysis of coding genes that have hESC-specific NANOG-binding sites within gene bodies or near gene boundaries revealed their association with physiological development and functions of nervous and cardiovascular systems, embryonic development, behavior, as well as development of a diverse spectrum of pathological conditions
Exploring the read-write genome: mobile DNA and mammalian adaptation.

PubMed

Shapiro, James A

2017-02-01

The read-write genome idea predicts that mobile DNA elements will act in evolution to generate adaptive changes in organismal DNA. This prediction was examined in the context of mammalian adaptations involving regulatory non-coding RNAs, viviparous reproduction, early embryonic and stem cell development, the nervous system, and innate immunity. The evidence shows that mobile elements have played specific and sometimes major roles in mammalian adaptive evolution by generating regulatory sites in the DNA and providing interaction motifs in non-coding RNA. Endogenous retroviruses and retrotransposons have been the predominant mobile elements in mammalian adaptive evolution, with the notable exception of bats, where DNA transposons are the major agents of RW genome inscriptions. A few examples of independent but convergent exaptation of mobile DNA elements for similar regulatory rewiring functions are noted.
Genome-scale deletion screening of human long non-coding RNAs using a paired-guide RNA CRISPR library

PubMed Central

Zhu, Shiyou; Li, Wei; Liu, Jingze; Chen, Chen-Hao; Liao, Qi; Xu, Ping; Xu, Han; Xiao, Tengfei; Cao, Zhongzheng; Peng, Jingyu; Yuan, Pengfei; Brown, Myles; Liu, Xiaole Shirley; Wei, Wensheng

2017-01-01

CRISPR/Cas9 screens have been widely adopted to analyse coding gene functions, but high throughput screening of non-coding elements using this method is more challenging, because indels caused by a single cut in non-coding regions are unlikely to produce a functional knockout. A high-throughput method to produce deletions of non-coding DNA is needed. Herein, we report a high throughput genomic deletion strategy to screen for functional long non-coding RNAs (lncRNAs) that is based on a lentiviral paired-guide RNA (pgRNA) library. Applying our screening method, we identified 51 lncRNAs that can positively or negatively regulate human cancer cell growth. We individually validated 9 lncRNAs using CRISPR/Cas9-mediated genomic deletion and functional rescue, CRISPR activation or inhibition, and gene expression profiling. Our high-throughput pgRNA genome deletion method should enable rapid identification of functional mammalian non-coding elements. PMID:27798563
Detecting the borders between coding and non-coding DNA regions in prokaryotes based on recursive segmentation and nucleotide doublets statistics

PubMed Central

2012-01-01

Background Detecting the borders between coding and non-coding regions is an essential step in the genome annotation. And information entropy measures are useful for describing the signals in genome sequence. However, the accuracies of previous methods of finding borders based on entropy segmentation method still need to be improved. Methods In this study, we first applied a new recursive entropic segmentation method on DNA sequences to get preliminary significant cuts. A 22-symbol alphabet is used to capture the differential composition of nucleotide doublets and stop codon patterns along three phases in both DNA strands. This process requires no prior training datasets. Results Comparing with the previous segmentation methods, the experimental results on three bacteria genomes, Rickettsia prowazekii, Borrelia burgdorferi and E.coli, show that our approach improves the accuracy for finding the borders between coding and non-coding regions in DNA sequences. Conclusions This paper presents a new segmentation method in prokaryotes based on Jensen-Rényi divergence with a 22-symbol alphabet. For three bacteria genomes, comparing to A12_JR method, our method raised the accuracy of finding the borders between protein coding and non-coding regions in DNA sequences. PMID:23282225
Long non-coding RNA produced by RNA polymerase V determines boundaries of heterochromatin

PubMed Central

Böhmdorfer, Gudrun; Sethuraman, Shriya; Rowley, M Jordan; Krzyszton, Michal; Rothi, M Hafiz; Bouzit, Lilia; Wierzbicki, Andrzej T

2016-01-01

RNA-mediated transcriptional gene silencing is a conserved process where small RNAs target transposons and other sequences for repression by establishing chromatin modifications. A central element of this process are long non-coding RNAs (lncRNA), which in Arabidopsis thaliana are produced by a specialized RNA polymerase known as Pol V. Here we show that non-coding transcription by Pol V is controlled by preexisting chromatin modifications located within the transcribed regions. Most Pol V transcripts are associated with AGO4 but are not sliced by AGO4. Pol V-dependent DNA methylation is established on both strands of DNA and is tightly restricted to Pol V-transcribed regions. This indicates that chromatin modifications are established in close proximity to Pol V. Finally, Pol V transcription is preferentially enriched on edges of silenced transposable elements, where Pol V transcribes into TEs. We propose that Pol V may play an important role in the determination of heterochromatin boundaries. DOI: http://dx.doi.org/10.7554/eLife.19092.001 PMID:27779094
Translational efficiency of poliovirus mRNA: mapping inhibitory cis-acting elements within the 5' noncoding region.

PubMed Central

Pelletier, J; Kaplan, G; Racaniello, V R; Sonenberg, N

1988-01-01

Poliovirus mRNA contains a long 5' noncoding region of about 750 nucleotides (the exact number varies among the three virus serotypes), which contains several AUG codons upstream of the major initiator AUG. Unlike most eucaryotic mRNAs, poliovirus does not contain a m7GpppX (where X is any nucleotide) cap structure at its 5' end and is translated by a cap-independent mechanism. To study the manner by which poliovirus mRNA is expressed, we examined the translational efficiencies of a series of deletion mutants within the 5' noncoding region of the mRNA. In this paper we report striking translation system-specific differences in the ability of the altered mRNAs to be translated. The results suggest the existence of an inhibitory cis-acting element(s) within the 5' noncoding region of poliovirus (between nucleotides 70 and 381) which restricts mRNA translation in reticulocyte lysate, wheat germ extract, and Xenopus oocytes, but not in HeLa cell extracts. In addition, we show that HeLa cell extracts contain a trans-acting factor(s) that overcomes this restriction. Images PMID:2836606
Transposable elements at the center of the crossroads between embryogenesis, embryonic stem cells, reprogramming, and long non-coding RNAs.

PubMed

Hutchins, Andrew Paul; Pei, Duanqing

Transposable elements (TEs) are mobile genomic sequences of DNA capable of autonomous and non-autonomous duplication. TEs have been highly successful, and nearly half of the human genome now consists of various families of TEs. Originally thought to be non-functional, these elements have been co-opted by animal genomes to perform a variety of physiological functions ranging from TE-derived proteins acting directly in normal biological functions, to innovations in transcription factor logic and influence on epigenetic control of gene expression. During embryonic development, when the genome is epigenetically reprogrammed and DNA-demethylated, TEs are released from repression and show embryonic stage-specific expression, and in human and mouse embryos, intact TE-derived endogenous viral particles can even be detected. A similar process occurs during the reprogramming of somatic cells to pluripotent cells: When the somatic DNA is demethylated, TEs are released from repression. In embryonic stem cells (ESCs), where DNA is hypomethylated, an elaborate system of epigenetic control is employed to suppress TEs, a system that often overlaps with normal epigenetic control of ESC gene expression. Finally, many long non-coding RNAs (lncRNAs) involved in normal ESC function and those assisting or impairing reprogramming contain multiple TEs in their RNA. These TEs may act as regulatory units to recruit RNA-binding proteins and epigenetic modifiers. This review covers how TEs are interlinked with the epigenetic machinery and lncRNAs, and how these links influence each other to modulate aspects of ESCs, embryogenesis, and somatic cell reprogramming.
WordCluster: detecting clusters of DNA words and genomic elements

PubMed Central

2011-01-01

Background Many k-mers (or DNA words) and genomic elements are known to be spatially clustered in the genome. Well established examples are the genes, TFBSs, CpG dinucleotides, microRNA genes and ultra-conserved non-coding regions. Currently, no algorithm exists to find these clusters in a statistically comprehensible way. The detection of clustering often relies on densities and sliding-window approaches or arbitrarily chosen distance thresholds. Results We introduce here an algorithm to detect clusters of DNA words (k-mers), or any other genomic element, based on the distance between consecutive copies and an assigned statistical significance. We implemented the method into a web server connected to a MySQL backend, which also determines the co-localization with gene annotations. We demonstrate the usefulness of this approach by detecting the clusters of CAG/CTG (cytosine contexts that can be methylated in undifferentiated cells), showing that the degree of methylation vary drastically between inside and outside of the clusters. As another example, we used WordCluster to search for statistically significant clusters of olfactory receptor (OR) genes in the human genome. Conclusions WordCluster seems to predict biological meaningful clusters of DNA words (k-mers) and genomic entities. The implementation of the method into a web server is available at http://bioinfo2.ugr.es/wordCluster/wordCluster.php including additional features like the detection of co-localization with gene regions or the annotation enrichment tool for functional analysis of overlapped genes. PMID:21261981
Variation in conserved non-coding sequences on chromosome 5q andsusceptibility to asthma and atopy

DOE Office of Scientific and Technical Information (OSTI.GOV)

Donfack, Joseph; Schneider, Daniel H.; Tan, Zheng

2005-09-10

Background: Evolutionarily conserved sequences likely havebiological function. Methods: To determine whether variation in conservedsequences in non-coding DNA contributes to risk for human disease, westudied six conserved non-coding elements in the Th2 cytokine cluster onhuman chromosome 5q31 in a large Hutterite pedigree and in samples ofoutbred European American and African American asthma cases and controls.Results: Among six conserved non-coding elements (>100 bp,>70percent identity; human-mouse comparison), we identified one singlenucleotide polymorphism (SNP) in each of two conserved elements and sixSNPs in the flanking regions of three conserved elements. We genotypedour samples for four of these SNPs and an additional three SNPs eachmore » inthe IL13 and IL4 genes. While there was only modest evidence forassociation with single SNPs in the Hutterite and European Americansamples (P<0.05), there were highly significant associations inEuropean Americans between asthma and haplotypes comprised of SNPs in theIL4 gene (P<0.001), including a SNP in a conserved non-codingelement. Furthermore, variation in the IL13 gene was strongly associatedwith total IgE (P = 0.00022) and allergic sensitization to mold allergens(P = 0.00076) in the Hutterites, and more modestly associated withsensitization to molds in the European Americans and African Americans (P<0.01). Conclusion: These results indicate that there is overalllittle variation in the conserved non-coding elements on 5q31, butvariation in IL4 and IL13, including possibly one SNP in a conservedelement, influence asthma and atopic phenotypes in diversepopulations.« less
Hiding in Plain Sight: Rediscovering the Importance of Noncoding RNA in Human Malignancy.

PubMed

Feeley, Kyle P; Edmonds, Mick D

2018-05-01

At the time of its construction in the 1950s, the central dogma of molecular biology was a useful model that represented the current state of knowledge for the flow of genetic information after a period of prolific scientific discovery. Unknowingly, it also biased many of our assumptions going forward. Whether intentional or not, genomic elements not fitting into this paradigm were deemed unimportant and emphasis on the study of protein-coding genes prevailed for decades. The phrase "Junk DNA," first popularized in the 1960s, is still used with alarming frequency to describe the entirety of noncoding DNA. It has since become apparent that RNA molecules not coding for protein are vitally important in both normal development and human malignancy. Cancer researchers have been pioneers in determining noncoding RNA function and developing new technologies to study these molecules. In this review, we will discuss well known and newly emerging species of noncoding RNAs, their functions in cancer, and new technologies being utilized to understand their mechanisms of action in cancer. Cancer Res; 78(9); 2149-58. ©2018 AACR . ©2018 American Association for Cancer Research.
Noncoding transcripts in sense and antisense orientation regulate the epigenetic state of ribosomal RNA genes.

PubMed

Bierhoff, H; Schmitz, K; Maass, F; Ye, J; Grummt, I

2010-01-01

Alternative transcription of the same gene in sense and antisense orientation regulates expression of protein-coding genes. Here we show that noncoding RNA (ncRNA) in sense and antisense orientation also controls transcription of rRNA genes (rDNA). rDNA exists in two types of chromatin--a euchromatic conformation that is permissive to transcription and a heterochromatic conformation that is transcriptionally silent. Silencing of rDNA is mediated by NoRC, a chromatin-remodeling complex that triggers heterochromatin formation. NoRC function requires RNA that is complementary to the rDNA promoter (pRNA). pRNA forms a DNA:RNA triplex with a regulatory element in the rDNA promoter, and this triplex structure is recognized by DNMT3b. The results imply that triplex-mediated targeting of DNMT3b to specific sequences may be a common pathway in epigenetic regulation. We also show that rDNA is transcribed in antisense orientation. The level of antisense RNA (asRNA) is down-regulated in cancer cells and up-regulated in senescent cells. Ectopic asRNA triggers trimethylation of histone H4 at lysine 20 (H4K20me3), suggesting that antisense transcripts guide the histone methyltransferase Suv4-20 to rDNA. The results reveal that noncoding RNAs in sense and antisense orientation are important determinants of the epigenetic state of rDNA.
Cap-independent translation of poliovirus mRNA is conferred by sequence elements within the 5' noncoding region

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pelletier, J.; Kaplan, G.; Racaniello, V.R.

1988-03-01

Poliovirus polysomal RNA is naturally uncapped, and as such, its translation must bypass any 5' cap-dependent ribosome recognition event. To elucidate the manner by which poliovirus mRNA is translated, the authors determined the translational efficiencies of a series of deletion mutants within the 5' noncoding region of the mRNA. They found striking differences in translatability among the altered mRNAs when assayed in mock-infected and poliovirus-infected HeLa cell extracts. The results identify a functional cis-acting element within the 5' noncoding region of the poliovirus mRNA which enables it to translate in a cap-independent fashion. The major determinant of this element mapsmore » between nucleotides 320 and 631 of the 5' end of the poliovirus mRNA. They also show that this region (320 to 631), when fused to a heterologous mRNA, can function in cis to render the mRNA cap independent in translation.« less
Stress induced gene expression drives transient DNA methylation changes at adjacent repetitive elements.

PubMed

Secco, David; Wang, Chuang; Shou, Huixia; Schultz, Matthew D; Chiarenza, Serge; Nussaume, Laurent; Ecker, Joseph R; Whelan, James; Lister, Ryan

2015-07-21

Cytosine DNA methylation (mC) is a genome modification that can regulate the expression of coding and non-coding genetic elements. However, little is known about the involvement of mC in response to environmental cues. Using whole genome bisulfite sequencing to assess the spatio-temporal dynamics of mC in rice grown under phosphate starvation and recovery conditions, we identified widespread phosphate starvation-induced changes in mC, preferentially localized in transposable elements (TEs) close to highly induced genes. These changes in mC occurred after changes in nearby gene transcription, were mostly DCL3a-independent, and could partially be propagated through mitosis, however no evidence of meiotic transmission was observed. Similar analyses performed in Arabidopsis revealed a very limited effect of phosphate starvation on mC, suggesting a species-specific mechanism. Overall, this suggests that TEs in proximity to environmentally induced genes are silenced via hypermethylation, and establishes the temporal hierarchy of transcriptional and epigenomic changes in response to stress.
A transcriptional serenAID: the role of noncoding RNAs in class switch recombination

PubMed Central

Yewdell, William T.; Chaudhuri, Jayanta

2017-01-01

Abstract During an immune response, activated B cells may undergo class switch recombination (CSR), a molecular rearrangement that allows B cells to switch from expressing IgM and IgD to a secondary antibody heavy chain isotype such as IgG, IgA or IgE. Secondary antibody isotypes provide the adaptive immune system with distinct effector functions to optimally combat various pathogens. CSR occurs between repetitive DNA elements within the immunoglobulin heavy chain (Igh) locus, termed switch (S) regions and requires the DNA-modifying enzyme activation-induced cytidine deaminase (AID). AID-mediated DNA deamination within S regions initiates the formation of DNA double-strand breaks, which serve as biochemical beacons for downstream DNA repair pathways that coordinate the ligation of DNA breaks. Myriad factors contribute to optimal AID targeting; however, many of these factors also localize to genomic regions outside of the Igh locus. Thus, a current challenge is to explain the specific targeting of AID to the Igh locus. Recent studies have implicated noncoding RNAs in CSR, suggesting a provocative mechanism that incorporates Igh-specific factors to enable precise AID targeting. Here, we chronologically recount the rich history of noncoding RNAs functioning in CSR to provide a comprehensive context for recent and future discoveries. We present a model for the RNA-guided targeting of AID that attempts to integrate historical and recent findings, and highlight potential caveats. Lastly, we discuss testable hypotheses ripe for current experimentation, and explore promising ideas for future investigations. PMID:28535205

Noncoding sequence classification based on wavelet transform analysis: part II

NASA Astrophysics Data System (ADS)

Paredes, O.; Strojnik, M.; Romo-Vázquez, R.; Vélez-Pérez, H.; Ranta, R.; Garcia-Torales, G.; Scholl, M. K.; Morales, J. A.

2017-09-01

DNA sequences in human genome can be divided into the coding and noncoding ones. We hypothesize that the characteristic periodicities of the noncoding sequences are related to their function. We describe the procedure to identify these characteristic periodicities using the wavelet analysis. Our results show that three groups of noncoding sequences, each one with different biological function, may be differentiated by their wavelet coefficients within specific frequency range.
Authentication of Botanical Origin in Herbal Teas by Plastid Noncoding DNA Length Polymorphisms.

PubMed

Uncu, Ali Tevfik; Uncu, Ayse Ozgur; Frary, Anne; Doganlar, Sami

2015-07-01

The aim of this study was to develop a DNA barcode assay to authenticate the botanical origin of herbal teas. To reach this aim, we tested the efficiency of a PCR-capillary electrophoresis (PCR-CE) approach on commercial herbal tea samples using two noncoding plastid barcodes, the trnL intron and the intergenic spacer between trnL and trnF. Barcode DNA length polymorphisms proved successful in authenticating the species origin of herbal teas. We verified the validity of our approach by sequencing species-specific barcode amplicons from herbal tea samples. Moreover, we displayed the utility of PCR-CE assays coupled with sequencing to identify the origin of undeclared plant material in herbal tea samples. The PCR-CE assays proposed in this work can be applied as routine tests for the verification of botanical origin in herbal teas and can be extended to authenticate all types of herbal foodstuffs.
Genome-wide DNA methylation patterns in LSH mutant reveals de-repression of repeat elements and redundant epigenetic silencing pathways

PubMed Central

Yu, Weishi; McIntosh, Carl; Lister, Ryan; Zhu, Iris; Han, Yixing; Ren, Jianke; Landsman, David; Lee, Eunice; Briones, Victorino; Terashima, Minoru; Leighty, Robert; Ecker, Joseph R.

2014-01-01

Cytosine methylation is critical in mammalian development and plays a role in diverse biologic processes such as genomic imprinting, X chromosome inactivation, and silencing of repeat elements. Several factors regulate DNA methylation in early embryogenesis, but their precise role in the establishment of DNA methylation at a given site remains unclear. We have generated a comprehensive methylation map in fibroblasts derived from the murine DNA methylation mutant Hells−/− (helicase, lymphoid specific, also known as LSH). It has been previously shown that HELLS can influence de novo methylation of retroviral sequences and endogenous genes. Here, we describe that HELLS controls cytosine methylation in a nuclear compartment that is in part defined by lamin B1 attachment regions. Despite widespread loss of cytosine methylation at regulatory sequences, including promoter regions of protein-coding genes and noncoding RNA genes, overall relative transcript abundance levels in the absence of HELLS are similar to those in wild-type cells. A subset of promoter regions shows increases of the histone modification H3K27me3, suggesting redundancy of epigenetic silencing mechanisms. Furthermore, HELLS modulates CG methylation at all classes of repeat elements and is critical for repression of a subset of repeat elements. Overall, we provide a detailed analysis of gene expression changes in relation to DNA methylation alterations, which contributes to our understanding of the biological role of cytosine methylation. PMID:25170028
A family of long intergenic non-coding RNA genes in human chromosomal region 22q11.2 carry a DNA translocation breakpoint/AT-rich sequence

PubMed Central

2018-01-01

FAM230C, a long intergenic non-coding RNA (lincRNA) gene in human chromosome 13 (chr13) is a member of lincRNA genes termed family with sequence similarity 230. An analysis using bioinformatics search tools and alignment programs was undertaken to determine properties of FAM230C and its related genes. Results reveal that the DNA translocation element, the Translocation Breakpoint Type A (TBTA) sequence, which consists of satellite DNA, Alu elements, and AT-rich sequences is embedded in the FAM230C gene. Eight lincRNA genes related to FAM230C also carry the TBTA sequences. These genes were formed from a large segment of the 3’ half of the FAM230C sequence duplicated in chr22, and are specifically in regions of low copy repeats (LCR22)s, in or close to the 22q.11.2 region. 22q11.2 is a chromosomal segment that undergoes a high rate of DNA translocation and is prone to genetic deletions. FAM230C-related genes present in other chromosomes do not carry the TBTA motif and were formed from the 5’ half region of the FAM230C sequence. These findings identify a high specificity in lincRNA gene formation by gene sequence duplication in different chromosomes. PMID:29668722
Statistical properties of DNA sequences

NASA Technical Reports Server (NTRS)

Peng, C. K.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Mantegna, R. N.; Simons, M.; Stanley, H. E.

1995-01-01

We review evidence supporting the idea that the DNA sequence in genes containing non-coding regions is correlated, and that the correlation is remarkably long range--indeed, nucleotides thousands of base pairs distant are correlated. We do not find such a long-range correlation in the coding regions of the gene. We resolve the problem of the "non-stationarity" feature of the sequence of base pairs by applying a new algorithm called detrended fluctuation analysis (DFA). We address the claim of Voss that there is no difference in the statistical properties of coding and non-coding regions of DNA by systematically applying the DFA algorithm, as well as standard FFT analysis, to every DNA sequence (33301 coding and 29453 non-coding) in the entire GenBank database. Finally, we describe briefly some recent work showing that the non-coding sequences have certain statistical features in common with natural and artificial languages. Specifically, we adapt to DNA the Zipf approach to analyzing linguistic texts. These statistical properties of non-coding sequences support the possibility that non-coding regions of DNA may carry biological information.
A Two-Locus Global DNA Barcode for Land Plants: The Coding rbcL Gene Complements the Non-Coding trnH-psbA Spacer Region

PubMed Central

Kress, W. John; Erickson, David L.

2007-01-01

Background A useful DNA barcode requires sufficient sequence variation to distinguish between species and ease of application across a broad range of taxa. Discovery of a DNA barcode for land plants has been limited by intrinsically lower rates of sequence evolution in plant genomes than that observed in animals. This low rate has complicated the trade-off in finding a locus that is universal and readily sequenced and has sufficiently high sequence divergence at the species-level. Methodology/Principal Findings Here, a global plant DNA barcode system is evaluated by comparing universal application and degree of sequence divergence for nine putative barcode loci, including coding and non-coding regions, singly and in pairs across a phylogenetically diverse set of 48 genera (two species per genus). No single locus could discriminate among species in a pair in more than 79% of genera, whereas discrimination increased to nearly 88% when the non-coding trnH-psbA spacer was paired with one of three coding loci, including rbcL. In silico trials were conducted in which DNA sequences from GenBank were used to further evaluate the discriminatory power of a subset of these loci. These trials supported the earlier observation that trnH-psbA coupled with rbcL can correctly identify and discriminate among related species. Conclusions/Significance A combination of the non-coding trnH-psbA spacer region and a portion of the coding rbcL gene is recommended as a two-locus global land plant barcode that provides the necessary universality and species discrimination. PMID:17551588
A two-locus global DNA barcode for land plants: the coding rbcL gene complements the non-coding trnH-psbA spacer region.

PubMed

Kress, W John; Erickson, David L

2007-06-06

A useful DNA barcode requires sufficient sequence variation to distinguish between species and ease of application across a broad range of taxa. Discovery of a DNA barcode for land plants has been limited by intrinsically lower rates of sequence evolution in plant genomes than that observed in animals. This low rate has complicated the trade-off in finding a locus that is universal and readily sequenced and has sufficiently high sequence divergence at the species-level. Here, a global plant DNA barcode system is evaluated by comparing universal application and degree of sequence divergence for nine putative barcode loci, including coding and non-coding regions, singly and in pairs across a phylogenetically diverse set of 48 genera (two species per genus). No single locus could discriminate among species in a pair in more than 79% of genera, whereas discrimination increased to nearly 88% when the non-coding trnH-psbA spacer was paired with one of three coding loci, including rbcL. In silico trials were conducted in which DNA sequences from GenBank were used to further evaluate the discriminatory power of a subset of these loci. These trials supported the earlier observation that trnH-psbA coupled with rbcL can correctly identify and discriminate among related species. A combination of the non-coding trnH-psbA spacer region and a portion of the coding rbcL gene is recommended as a two-locus global land plant barcode that provides the necessary universality and species discrimination.
Stress induced gene expression drives transient DNA methylation changes at adjacent repetitive elements

PubMed Central

Secco, David; Wang, Chuang; Shou, Huixia; Schultz, Matthew D; Chiarenza, Serge; Nussaume, Laurent; Ecker, Joseph R; Whelan, James; Lister, Ryan

2015-01-01

Cytosine DNA methylation (mC) is a genome modification that can regulate the expression of coding and non-coding genetic elements. However, little is known about the involvement of mC in response to environmental cues. Using whole genome bisulfite sequencing to assess the spatio-temporal dynamics of mC in rice grown under phosphate starvation and recovery conditions, we identified widespread phosphate starvation-induced changes in mC, preferentially localized in transposable elements (TEs) close to highly induced genes. These changes in mC occurred after changes in nearby gene transcription, were mostly DCL3a-independent, and could partially be propagated through mitosis, however no evidence of meiotic transmission was observed. Similar analyses performed in Arabidopsis revealed a very limited effect of phosphate starvation on mC, suggesting a species-specific mechanism. Overall, this suggests that TEs in proximity to environmentally induced genes are silenced via hypermethylation, and establishes the temporal hierarchy of transcriptional and epigenomic changes in response to stress. DOI: http://dx.doi.org/10.7554/eLife.09343.001 PMID:26196146
NONCODE v2.0: decoding the non-coding.

PubMed

He, Shunmin; Liu, Changning; Skogerbø, Geir; Zhao, Haitao; Wang, Jie; Liu, Tao; Bai, Baoyan; Zhao, Yi; Chen, Runsheng

2008-01-01

The NONCODE database is an integrated knowledge database designed for the analysis of non-coding RNAs (ncRNAs). Since NONCODE was first released 3 years ago, the number of known ncRNAs has grown rapidly, and there is growing recognition that ncRNAs play important regulatory roles in most organisms. In the updated version of NONCODE (NONCODE v2.0), the number of collected ncRNAs has reached 206 226, including a wide range of microRNAs, Piwi-interacting RNAs and mRNA-like ncRNAs. The improvements brought to the database include not only new and updated ncRNA data sets, but also an incorporation of BLAST alignment search service and access through our custom UCSC Genome Browser. NONCODE can be found under http://www.noncode.org or http://noncode.bioinfo.org.cn.
Noncoding RNAs of the Ultrabithorax Domain of the Drosophila Bithorax Complex

PubMed Central

Pease, Benjamin; Borges, Ana C.; Bender, Welcome

2013-01-01

RNA transcripts without obvious coding potential are widespread in many creatures, including the fruit fly, Drosophila melanogaster. Several noncoding RNAs have been identified within the Drosophila bithorax complex. These first appear in blastoderm stage embryos, and their expression patterns indicate that they are transcribed only from active domains of the bithorax complex. It has been suggested that these noncoding RNAs have a role in establishing active domains, perhaps by setting the state of Polycomb Response Elements A comprehensive survey across the proximal half of the bithorax complex has now revealed nine distinct noncoding RNA transcripts, including four within the Ultrabithorax transcription unit. At the blastoderm stage, the noncoding transcripts collectively span ∼75% of the 135 kb surveyed. Recombination-mediated cassette exchange was used to invert the promoter of one of the noncoding RNAs, a 23-kb transcript from the bxd domain of the bithorax complex. The resulting animals fail to make the normal bxd noncoding RNA and show no transcription across the bxd Polycomb Response Element in early embryos. The mutant flies look normal; the regulation of the bxd domain appears unaffected. Thus, the bxd noncoding RNA has no apparent function. PMID:24077301
Disruption of long-distance highly conserved noncoding elements in neurocristopathies.

PubMed

Amiel, Jeanne; Benko, Sabina; Gordon, Christopher T; Lyonnet, Stanislas

2010-12-01

One of the key discoveries of vertebrate genome sequencing projects has been the identification of highly conserved noncoding elements (CNEs). Some characteristics of CNEs include their high frequency in mammalian genomes, their potential regulatory role in gene expression, and their enrichment in gene deserts nearby master developmental genes. The abnormal development of neural crest cells (NCCs) leads to a broad spectrum of congenital malformation(s), termed neurocristopathies, and/or tumor predisposition. Here we review recent findings that disruptions of CNEs, within or at long distance from the coding sequences of key genes involved in NCC development, result in neurocristopathies via the alteration of tissue- or stage-specific long-distance regulation of gene expression. While most studies on human genetic disorders have focused on protein-coding sequences, these examples suggest that investigation of genomic alterations of CNEs will provide a broader understanding of the molecular etiology of both rare and common human congenital malformations. © 2010 New York Academy of Sciences.
Study characterizes long non-coding RNA’s response to DNA damage in colon cancer cells | Center for Cancer Research

Cancer.gov

Researchers led by Ashish Lal, Ph.D., Investigator in the Genetics Branch, have shown that when the DNA in human colon cancer cells is damaged, a long non-coding RNA (lncRNA) regulates the expression of genes that halt growth, which allows the cells to repair the damage and promote survival. Their findings suggest an important pro-survival function of a lncRNA in cancer
A new method for species identification via protein-coding and non-coding DNA barcodes by combining machine learning with bioinformatic methods.

PubMed

Zhang, Ai-bing; Feng, Jie; Ward, Robert D; Wan, Ping; Gao, Qiang; Wu, Jun; Zhao, Wei-zhong

2012-01-01

Species identification via DNA barcodes is contributing greatly to current bioinventory efforts. The initial, and widely accepted, proposal was to use the protein-coding cytochrome c oxidase subunit I (COI) region as the standard barcode for animals, but recently non-coding internal transcribed spacer (ITS) genes have been proposed as candidate barcodes for both animals and plants. However, achieving a robust alignment for non-coding regions can be problematic. Here we propose two new methods (DV-RBF and FJ-RBF) to address this issue for species assignment by both coding and non-coding sequences that take advantage of the power of machine learning and bioinformatics. We demonstrate the value of the new methods with four empirical datasets, two representing typical protein-coding COI barcode datasets (neotropical bats and marine fish) and two representing non-coding ITS barcodes (rust fungi and brown algae). Using two random sub-sampling approaches, we demonstrate that the new methods significantly outperformed existing Neighbor-joining (NJ) and Maximum likelihood (ML) methods for both coding and non-coding barcodes when there was complete species coverage in the reference dataset. The new methods also out-performed NJ and ML methods for non-coding sequences in circumstances of potentially incomplete species coverage, although then the NJ and ML methods performed slightly better than the new methods for protein-coding barcodes. A 100% success rate of species identification was achieved with the two new methods for 4,122 bat queries and 5,134 fish queries using COI barcodes, with 95% confidence intervals (CI) of 99.75-100%. The new methods also obtained a 96.29% success rate (95%CI: 91.62-98.40%) for 484 rust fungi queries and a 98.50% success rate (95%CI: 96.60-99.37%) for 1094 brown algae queries, both using ITS barcodes.
Bleomycin Can Cleave an Oncogenic Noncoding RNA.

PubMed

Angelbello, Alicia J; Disney, Matthew D

2018-01-04

Noncoding RNAs are pervasive in cells and contribute to diseases such as cancer. A question in biomedical research is whether noncoding RNAs are targets of medicines. Bleomycin is a natural product that cleaves DNA; however, it is known to cleave RNA in vitro. Herein, an in-depth analysis of the RNA cleavage preferences of bleomycin A5 is presented. Bleomycin A5 prefers to cleave RNAs with stretches of AU base pairs. Based on these preferences and bioinformatic analysis, the microRNA-10b hairpin precursor was identified as a potential substrate for bleomycin A5. Both in vitro and cellular experiments demonstrated cleavage. Importantly, chemical cleavage by bleomycin A5 in the microRNA-10b hairpin precursors occurred near the Drosha and Dicer enzymatic processing sites and led to destruction of the microRNA. Evidently, oncogenic noncoding RNAs can be considered targets of cancer medicines and might elicit their pharmacological effects by targeting noncoding RNA. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Study characterizes long non-coding RNA’s response to DNA damage in colon cancer cells | Center for Cancer Research

Cancer.gov

Researchers led by Ashish Lal, Ph.D., Investigator in the Genetics Branch, have shown that when the DNA in human colon cancer cells is damaged, a long non-coding RNA (lncRNA) regulates the expression of genes that halt growth, which allows the cells to repair the damage and promote survival. Their findings suggest an important pro-survival function of a lncRNA in cancer cells. Read more...
Determination of the Optimal Chromosomal Location(s) for a DNA Element in Escherichia coli Using a Novel Transposon-mediated Approach.

PubMed

Frimodt-Møller, Jakob; Charbon, Godefroid; Krogfelt, Karen A; Løbner-Olesen, Anders

2017-09-11

The optimal chromosomal position(s) of a given DNA element was/were determined by transposon-mediated random insertion followed by fitness selection. In bacteria, the impact of the genetic context on the function of a genetic element can be difficult to assess. Several mechanisms, including topological effects, transcriptional interference from neighboring genes, and/or replication-associated gene dosage, may affect the function of a given genetic element. Here, we describe a method that permits the random integration of a DNA element into the chromosome of Escherichia coli and select the most favorable locations using a simple growth competition experiment. The method takes advantage of a well-described transposon-based system of random insertion, coupled with a selection of the fittest clone(s) by growth advantage, a procedure that is easily adjustable to experimental needs. The nature of the fittest clone(s) can be determined by whole-genome sequencing on a complex multi-clonal population or by easy gene walking for the rapid identification of selected clones. Here, the non-coding DNA region DARS2, which controls the initiation of chromosome replication in E. coli, was used as an example. The function of DARS2 is known to be affected by replication-associated gene dosage; the closer DARS2 gets to the origin of DNA replication, the more active it becomes. DARS2 was randomly inserted into the chromosome of a DARS2-deleted strain. The resultant clones containing individual insertions were pooled and competed against one another for hundreds of generations. Finally, the fittest clones were characterized and found to contain DARS2 inserted in close proximity to the original DARS2 location.
Noncoding RNAs in Neurodegenerative Diseases

PubMed Central

Rege, Shraddha D.; Geetha, Thangiah; Pondugula, Satyanarayana R.; Zizza, Claire A.; Wernette, Catherine M.

2013-01-01

Noncoding RNAs are widely known for their various essential roles in the development of central nervous system. It involves neurogenesis, neural stem cells generation, maintenance and maturation, neurotransmission, neural network plasticity, formation of synapses, and even brain aging and DNA damage responses. In this review, we will discuss the biogenesis of microRNA, various functions of noncoding RNA's specifically microRNAs (miRNAs) that act as the chief regulators of gene expression, and focus in particular on misregulation of miRNAs which leads to several neurodegenerative diseases as well as its therapeutic outcome. Recent evidences has shown that miRNAs expression levels are changed in patients with neurodegenerative diseases; hence, miRNA can be used as a potential diagnostic biomarker and serve as an effective therapeutic tool in overcoming various neurodegenerative disease processes. PMID:23738143
Long non-coding RNAs as novel expression signatures modulate DNA damage and repair in cadmium toxicology

NASA Astrophysics Data System (ADS)

Zhou, Zhiheng; Liu, Haibai; Wang, Caixia; Lu, Qian; Huang, Qinhai; Zheng, Chanjiao; Lei, Yixiong

2015-10-01

Increasing evidence suggests that long non-coding RNAs (lncRNAs) are involved in a variety of physiological and pathophysiological processes. Our study was to investigate whether lncRNAs as novel expression signatures are able to modulate DNA damage and repair in cadmium(Cd) toxicity. There were aberrant expression profiles of lncRNAs in 35th Cd-induced cells as compared to untreated 16HBE cells. siRNA-mediated knockdown of ENST00000414355 inhibited the growth of DNA-damaged cells and decreased the expressions of DNA-damage related genes (ATM, ATR and ATRIP), while increased the expressions of DNA-repair related genes (DDB1, DDB2, OGG1, ERCC1, MSH2, RAD50, XRCC1 and BARD1). Cadmium increased ENST00000414355 expression in the lung of Cd-exposed rats in a dose-dependent manner. A significant positive correlation was observed between blood ENST00000414355 expression and urinary/blood Cd concentrations, and there were significant correlations of lncRNA-ENST00000414355 expression with the expressions of target genes in the lung of Cd-exposed rats and the blood of Cd exposed workers. These results indicate that some lncRNAs are aberrantly expressed in Cd-treated 16HBE cells. lncRNA-ENST00000414355 may serve as a signature for DNA damage and repair related to the epigenetic mechanisms underlying the cadmium toxicity and become a novel biomarker of cadmium toxicity.
Non-coding RNAs in lung cancer

PubMed Central

Ricciuti, Biagio; Mecca, Carmen; Crinò, Lucio; Baglivo, Sara; Cenci, Matteo; Metro, Giulio

2014-01-01

The discovery that protein-coding genes represent less than 2% of all human genome, and the evidence that more than 90% of it is actively transcribed, changed the classical point of view of the central dogma of molecular biology, which was always based on the assumption that RNA functions mainly as an intermediate bridge between DNA sequences and protein synthesis machinery. Accumulating data indicates that non-coding RNAs are involved in different physiological processes, providing for the maintenance of cellular homeostasis. They are important regulators of gene expression, cellular differentiation, proliferation, migration, apoptosis, and stem cell maintenance. Alterations and disruptions of their expression or activity have increasingly been associated with pathological changes of cancer cells, this evidence and the prospect of using these molecules as diagnostic markers and therapeutic targets, make currently non-coding RNAs among the most relevant molecules in cancer research. In this paper we will provide an overview of non-coding RNA function and disruption in lung cancer biology, also focusing on their potential as diagnostic, prognostic and predictive biomarkers. PMID:25593996
Differential DNA methylation profiles of coding and non-coding genes define hippocampal sclerosis in human temporal lobe epilepsy

PubMed Central

Miller-Delaney, Suzanne F.C.; Bryan, Kenneth; Das, Sudipto; McKiernan, Ross C.; Bray, Isabella M.; Reynolds, James P.; Gwinn, Ryder; Stallings, Raymond L.

2015-01-01

Temporal lobe epilepsy is associated with large-scale, wide-ranging changes in gene expression in the hippocampus. Epigenetic changes to DNA are attractive mechanisms to explain the sustained hyperexcitability of chronic epilepsy. Here, through methylation analysis of all annotated C-phosphate-G islands and promoter regions in the human genome, we report a pilot study of the methylation profiles of temporal lobe epilepsy with or without hippocampal sclerosis. Furthermore, by comparative analysis of expression and promoter methylation, we identify methylation sensitive non-coding RNA in human temporal lobe epilepsy. A total of 146 protein-coding genes exhibited altered DNA methylation in temporal lobe epilepsy hippocampus (n = 9) when compared to control (n = 5), with 81.5% of the promoters of these genes displaying hypermethylation. Unique methylation profiles were evident in temporal lobe epilepsy with or without hippocampal sclerosis, in addition to a common methylation profile regardless of pathology grade. Gene ontology terms associated with development, neuron remodelling and neuron maturation were over-represented in the methylation profile of Watson Grade 1 samples (mild hippocampal sclerosis). In addition to genes associated with neuronal, neurotransmitter/synaptic transmission and cell death functions, differential hypermethylation of genes associated with transcriptional regulation was evident in temporal lobe epilepsy, but overall few genes previously associated with epilepsy were among the differentially methylated. Finally, a panel of 13, methylation-sensitive microRNA were identified in temporal lobe epilepsy including MIR27A, miR-193a-5p (MIR193A) and miR-876-3p (MIR876), and the differential methylation of long non-coding RNA documented for the first time. The present study therefore reports select, genome-wide DNA methylation changes in human temporal lobe epilepsy that may contribute to the molecular architecture of the epileptic brain. PMID

Separating the wheat from the chaff: systematic identification of functionally relevant noncoding variants in ADHD.

PubMed

Tong, J H S; Hawi, Z; Dark, C; Cummins, T D R; Johnson, B P; Newman, D P; Lau, R; Vance, A; Heussler, H S; Matthews, N; Bellgrove, M A; Pang, K C

2016-11-01

Attention deficit hyperactivity disorder (ADHD) is a highly heritable psychiatric condition with negative lifetime outcomes. Uncovering its genetic architecture should yield important insights into the neurobiology of ADHD and assist development of novel treatment strategies. Twenty years of candidate gene investigations and more recently genome-wide association studies have identified an array of potential association signals. In this context, separating the likely true from false associations ('the wheat' from 'the chaff') will be crucial for uncovering the functional biology of ADHD. Here, we defined a set of 2070 DNA variants that showed evidence of association with ADHD (or were in linkage disequilibrium). More than 97% of these variants were noncoding, and were prioritised for further exploration using two tools-genome-wide annotation of variants (GWAVA) and Combined Annotation-Dependent Depletion (CADD)-that were recently developed to rank variants based upon their likely pathogenicity. Capitalising on recent efforts such as the Encyclopaedia of DNA Elements and US National Institutes of Health Roadmap Epigenomics Projects to improve understanding of the noncoding genome, we subsequently identified 65 variants to which we assigned functional annotations, based upon their likely impact on alternative splicing, transcription factor binding and translational regulation. We propose that these 65 variants, which possess not only a high likelihood of pathogenicity but also readily testable functional hypotheses, represent a tractable shortlist for future experimental validation in ADHD. Taken together, this study brings into sharp focus the likely relevance of noncoding variants for the genetic risk associated with ADHD, and more broadly suggests a bioinformatics approach that should be relevant to other psychiatric disorders.
Transcription of tandemly repetitive DNA: functional roles.

PubMed

Biscotti, Maria Assunta; Canapa, Adriana; Forconi, Mariko; Olmo, Ettore; Barucca, Marco

2015-09-01

A considerable fraction of the eukaryotic genome is made up of satellite DNA constituted of tandemly repeated sequences. These elements are mainly located at centromeres, pericentromeres, and telomeres and are major components of constitutive heterochromatin. Although originally satellite DNA was thought silent and inert, an increasing number of studies are providing evidence on its transcriptional activity supporting, on the contrary, an unexpected dynamicity. This review summarizes the multiple structural roles of satellite noncoding RNAs at chromosome level. Indeed, satellite noncoding RNAs play a role in the establishment of a heterochromatic state at centromere and telomere. These highly condensed structures are indispensable to preserve chromosome integrity and genome stability, preventing recombination events, and ensuring the correct chromosome pairing and segregation. Moreover, these RNA molecules seem to be involved also in maintaining centromere identity and in elongation, capping, and replication of telomere. Finally, the abnormal variation of centromeric and pericentromeric DNA transcription across major eukaryotic lineages in stress condition and disease has evidenced the critical role that these transcripts may play and the potentially dire consequences for the organism.
Origin of noncoding DNA sequences: molecular fossils of genome evolution

DOE Office of Scientific and Technical Information (OSTI.GOV)

Naora, H.; Miyahara, K.; Curnow, R.N.

The total amount of noncoding sequences on chromosomes of contemporary organisms varies significantly from species to species. The authors propose a hypothesis for the origin of these noncoding sequences that assumes that (i) an approx. 0.55-kilobase (kb)-long reading frame composed the primordial gene and (ii) a 20-kb-long single-stranded polynucleotide is the longest molecule (as a genome) that was polymerized at random and without a specific template in the primordial soup/cell. The statistical distribution of stop codons allows examination of the probability of generating reading frames of approx. 0.55 kb in this primordial polynucleotide. This analysis reveals that with three stopmore » codons, a run of at least 0.55-kb equivalent length of nonstop codons would occur in 4.6% of 20-kb-long polynucleotide molecules. They attempt to estimate the total amount of noncoding sequences that would be present on the chromosomes of contemporary species assuming that present-day chromosomes retain the prototype primordial genome structure. Theoretical estimates thus obtained for most eukaryotes do not differ significantly from those reported for these specific organisms, with only a few exceptions. Furthermore, analysis of possible stop-codon distributions suggests that life on earth would not exist, at least in its present form, had two or four stop codons been selected early in evolution.« less
Divergent evolutionary rates in vertebrate and mammalian specific conserved non-coding elements (CNEs) in echolocating mammals.

PubMed

Davies, Kalina T J; Tsagkogeorga, Georgia; Rossiter, Stephen J

2014-12-19

The majority of DNA contained within vertebrate genomes is non-coding, with a certain proportion of this thought to play regulatory roles during development. Conserved Non-coding Elements (CNEs) are an abundant group of putative regulatory sequences that are highly conserved across divergent groups and thus assumed to be under strong selective constraint. Many CNEs may contain regulatory factor binding sites, and their frequent spatial association with key developmental genes - such as those regulating sensory system development - suggests crucial roles in regulating gene expression and cellular patterning. Yet surprisingly little is known about the molecular evolution of CNEs across diverse mammalian taxa or their role in specific phenotypic adaptations. We examined 3,110 vertebrate-specific and ~82,000 mammalian-specific CNEs across 19 and 9 mammalian orders respectively, and tested for changes in the rate of evolution of CNEs located in the proximity of genes underlying the development or functioning of auditory systems. As we focused on CNEs putatively associated with genes underlying the development/functioning of auditory systems, we incorporated echolocating taxa in our dataset because of their highly specialised and derived auditory systems. Phylogenetic reconstructions of concatenated CNEs broadly recovered accepted mammal relationships despite high levels of sequence conservation. We found that CNE substitution rates were highest in rodents and lowest in primates, consistent with previous findings. Comparisons of CNE substitution rates from several genomic regions containing genes linked to auditory system development and hearing revealed differences between echolocating and non-echolocating taxa. Wider taxonomic sampling of four CNEs associated with the homeobox genes Hmx2 and Hmx3 - which are required for inner ear development - revealed family-wise variation across diverse bat species. Specifically within one family of echolocating bats that utilise
The development of non-coding RNA ontology.

PubMed

Huang, Jingshan; Eilbeck, Karen; Smith, Barry; Blake, Judith A; Dou, Dejing; Huang, Weili; Natale, Darren A; Ruttenberg, Alan; Huan, Jun; Zimmermann, Michael T; Jiang, Guoqian; Lin, Yu; Wu, Bin; Strachan, Harrison J; de Silva, Nisansa; Kasukurthi, Mohan Vamsi; Jha, Vikash Kumar; He, Yongqun; Zhang, Shaojie; Wang, Xiaowei; Liu, Zixing; Borchert, Glen M; Tan, Ming

2016-01-01

Identification of non-coding RNAs (ncRNAs) has been significantly improved over the past decade. On the other hand, semantic annotation of ncRNA data is facing critical challenges due to the lack of a comprehensive ontology to serve as common data elements and data exchange standards in the field. We developed the Non-Coding RNA Ontology (NCRO) to handle this situation. By providing a formally defined ncRNA controlled vocabulary, the NCRO aims to fill a specific and highly needed niche in semantic annotation of large amounts of ncRNA biological and clinical data.
Small Open Reading Frames, Non-Coding RNAs and Repetitive Elements in Bradyrhizobium japonicum USDA 110

PubMed Central

Hahn, Julia; Tsoy, Olga V.; Thalmann, Sebastian; Čuklina, Jelena; Gelfand, Mikhail S.

2016-01-01

Small open reading frames (sORFs) and genes for non-coding RNAs are poorly investigated components of most genomes. Our analysis of 1391 ORFs recently annotated in the soybean symbiont Bradyrhizobium japonicum USDA 110 revealed that 78% of them contain less than 80 codons. Twenty-one of these sORFs are conserved in or outside Alphaproteobacteria and most of them are similar to genes found in transposable elements, in line with their broad distribution. Stabilizing selection was demonstrated for sORFs with proteomic evidence and bll1319_ISGA which is conserved at the nucleotide level in 16 alphaproteobacterial species, 79 species from other taxa and 49 other Proteobacteria. Further we used Northern blot hybridization to validate ten small RNAs (BjsR1 to BjsR10) belonging to new RNA families. We found that BjsR1 and BjsR3 have homologs outside the genus Bradyrhizobium, and BjsR5, BjsR6, BjsR7, and BjsR10 have up to four imperfect copies in Bradyrhizobium genomes. BjsR8, BjsR9, and BjsR10 are present exclusively in nodules, while the other sRNAs are also expressed in liquid cultures. We also found that the level of BjsR4 decreases after exposure to tellurite and iron, and this down-regulation contributes to survival under high iron conditions. Analysis of additional small RNAs overlapping with 3’-UTRs revealed two new repetitive elements named Br-REP1 and Br-REP2. These REP elements may play roles in the genomic plasticity and gene regulation and could be useful for strain identification by PCR-fingerprinting. Furthermore, we studied two potential toxin genes in the symbiotic island and confirmed toxicity of the yhaV homolog bll1687 but not of the newly annotated higB homolog blr0229_ISGA in E. coli. Finally, we revealed transcription interference resulting in an antisense RNA complementary to blr1853, a gene induced in symbiosis. The presented results expand our knowledge on sORFs, non-coding RNAs and repetitive elements in B. japonicum and related bacteria. PMID
Genome-wide identification of conserved intronic non-coding sequences using a Bayesian segmentation approach.

PubMed

Algama, Manjula; Tasker, Edward; Williams, Caitlin; Parslow, Adam C; Bryson-Richardson, Robert J; Keith, Jonathan M

2017-03-27

Computational identification of non-coding RNAs (ncRNAs) is a challenging problem. We describe a genome-wide analysis using Bayesian segmentation to identify intronic elements highly conserved between three evolutionarily distant vertebrate species: human, mouse and zebrafish. We investigate the extent to which these elements include ncRNAs (or conserved domains of ncRNAs) and regulatory sequences. We identified 655 deeply conserved intronic sequences in a genome-wide analysis. We also performed a pathway-focussed analysis on genes involved in muscle development, detecting 27 intronic elements, of which 22 were not detected in the genome-wide analysis. At least 87% of the genome-wide and 70% of the pathway-focussed elements have existing annotations indicative of conserved RNA secondary structure. The expression of 26 of the pathway-focused elements was examined using RT-PCR, providing confirmation that they include expressed ncRNAs. Consistent with previous studies, these elements are significantly over-represented in the introns of transcription factors. This study demonstrates a novel, highly effective, Bayesian approach to identifying conserved non-coding sequences. Our results complement previous findings that these sequences are enriched in transcription factors. However, in contrast to previous studies which suggest the majority of conserved sequences are regulatory factor binding sites, the majority of conserved sequences identified using our approach contain evidence of conserved RNA secondary structures, and our laboratory results suggest most are expressed. Functional roles at DNA and RNA levels are not mutually exclusive, and many of our elements possess evidence of both. Moreover, ncRNAs play roles in transcriptional and post-transcriptional regulation, and this may contribute to the over-representation of these elements in introns of transcription factors. We attribute the higher sensitivity of the pathway-focussed analysis compared to the genome
Kaposi's Sarcoma-Associated Herpesvirus mRNA Accumulation in Nuclear Foci Is Influenced by Viral DNA Replication and Viral Noncoding Polyadenylated Nuclear RNA.

PubMed

Vallery, Tenaya K; Withers, Johanna B; Andoh, Joana A; Steitz, Joan A

2018-07-01

Kaposi's sarcoma-associated herpesvirus (KSHV), like other herpesviruses, replicates within the nuclei of its human cell host and hijacks host machinery for expression of its genes. The activities that culminate in viral DNA synthesis and assembly of viral proteins into capsids physically concentrate in nuclear areas termed viral replication compartments. We sought to better understand the spatiotemporal regulation of viral RNAs during the KSHV lytic phase by examining and quantifying the subcellular localization of select viral transcripts. We found that viral mRNAs, as expected, localized to the cytoplasm throughout the lytic phase. However, dependent on active viral DNA replication, viral transcripts also accumulated in the nucleus, often in foci in and around replication compartments, independent of the host shutoff effect. Our data point to involvement of the viral long noncoding polyadenylated nuclear (PAN) RNA in the localization of an early, intronless viral mRNA encoding ORF59-58 to nuclear foci that are associated with replication compartments. IMPORTANCE Late in the lytic phase, mRNAs from Kaposi's sarcoma-associated herpesvirus accumulate in the host cell nucleus near viral replication compartments, centers of viral DNA synthesis and virion production. This work contributes spatiotemporal data on herpesviral mRNAs within the lytic host cell and suggests a mechanism for viral RNA accumulation. Our findings indicate that the mechanism is independent of the host shutoff effect and splicing but dependent on active viral DNA synthesis and in part on the viral noncoding RNA, PAN RNA. PAN RNA is essential for the viral life cycle, and its contribution to the nuclear accumulation of viral messages may facilitate propagation of the virus. Copyright © 2018 American Society for Microbiology.
Evolution in the block: common elements of 5S rDNA organization and evolutionary patterns in distant fish genera.

PubMed

Campo, Daniel; García-Vázquez, Eva

2012-01-01

The 5S rDNA is organized in the genome as tandemly repeated copies of a structural unit composed of a coding sequence plus a nontranscribed spacer (NTS). The coding region is highly conserved in the evolution, whereas the NTS vary in both length and sequence. It has been proposed that 5S rRNA genes are members of a gene family that have arisen through concerted evolution. In this study, we describe the molecular organization and evolution of the 5S rDNA in the genera Lepidorhombus and Scophthalmus (Scophthalmidae) and compared it with already known 5S rDNA of the very different genera Merluccius (Merluccidae) and Salmo (Salmoninae), to identify common structural elements or patterns for understanding 5S rDNA evolution in fish. High intra- and interspecific diversity within the 5S rDNA family in all the genera can be explained by a combination of duplications, deletions, and transposition events. Sequence blocks with high similarity in all the 5S rDNA members across species were identified for the four studied genera, with evidences of intense gene conversion within noncoding regions. We propose a model to explain the evolution of the 5S rDNA, in which the evolutionary units are blocks of nucleotides rather than the entire sequences or single nucleotides. This model implies a "two-speed" evolution: slow within blocks (homogenized by recombination) and fast within the gene family (diversified by duplications and deletions).
The domain structure and distribution of Alu elements in long noncoding RNAs and mRNAs

PubMed Central

Kim, Eugene Z.; Wespiser, Adam R.; Caffrey, Daniel R.

2016-01-01

Approximately 75% of the human genome is transcribed and many of these spliced transcripts contain primate-specific Alu elements, the most abundant mobile element in the human genome. The majority of exonized Alu elements are located in long noncoding RNAs (lncRNAs) and the untranslated regions of mRNA, with some performing molecular functions. To further assess the potential for Alu elements to be repurposed as functional RNA domains, we investigated the distribution and evolution of Alu elements in spliced transcripts. Our analysis revealed that Alu elements are underrepresented in mRNAs and lncRNAs, suggesting that most exonized Alu elements arising in the population are rare or deleterious to RNA function. When mRNAs and lncRNAs retain exonized Alu elements, they have a clear preference for Alu dimers, left monomers, and right monomers. mRNAs often acquire Alu elements when their genes are duplicated within Alu-rich regions. In lncRNAs, reverse-oriented Alu elements are significantly enriched and are not restricted to the 3′ and 5′ ends. Both lncRNAs and mRNAs primarily contain the Alu J and S subfamilies that were amplified relatively early in primate evolution. Alu J subfamilies are typically overrepresented in lncRNAs, whereas the Alu S dimer is overrepresented in mRNAs. The sequences of Alu dimers tend to be constrained in both lncRNAs and mRNAs, whereas the left and right monomers are constrained within particular Alu subfamilies and classes of RNA. Collectively, these findings suggest that Alu-containing RNAs are capable of forming stable structures and that some of these Alu domains might have novel biological functions. PMID:26654912
Characterization of Non-coding DNA Satellites Associated with Sweepoviruses (Genus Begomovirus, Geminiviridae) – Definition of a Distinct Class of Begomovirus-Associated Satellites

PubMed Central

Lozano, Gloria; Trenado, Helena P.; Fiallo-Olivé, Elvira; Chirinos, Dorys; Geraud-Pouey, Francis; Briddon, Rob W.; Navas-Castillo, Jesús

2016-01-01

Begomoviruses (family Geminiviridae) are whitefly-transmitted, plant-infecting single-stranded DNA viruses that cause crop losses throughout the warmer parts of the World. Sweepoviruses are a phylogenetically distinct group of begomoviruses that infect plants of the family Convolvulaceae, including sweet potato (Ipomoea batatas). Two classes of subviral molecules are often associated with begomoviruses, particularly in the Old World; the betasatellites and the alphasatellites. An analysis of sweet potato and Ipomoea indica samples from Spain and Merremia dissecta samples from Venezuela identified small non-coding subviral molecules in association with several distinct sweepoviruses. The sequences of 18 clones were obtained and found to be structurally similar to tomato leaf curl virus-satellite (ToLCV-sat, the first DNA satellite identified in association with a begomovirus), with a region with significant sequence identity to the conserved region of betasatellites, an A-rich sequence, a predicted stem–loop structure containing the nonanucleotide TAATATTAC, and a second predicted stem–loop. These sweepovirus-associated satellites join an increasing number of ToLCV-sat-like non-coding satellites identified recently. Although sharing some features with betasatellites, evidence is provided to suggest that the ToLCV-sat-like satellites are distinct from betasatellites and should be considered a separate class of satellites, for which the collective name deltasatellites is proposed. PMID:26925037
Statistical and linguistic features of DNA sequences

NASA Technical Reports Server (NTRS)

Havlin, S.; Buldyrev, S. V.; Goldberger, A. L.; Mantegna, R. N.; Peng, C. K.; Simons, M.; Stanley, H. E.

1995-01-01

We present evidence supporting the idea that the DNA sequence in genes containing noncoding regions is correlated, and that the correlation is remarkably long range--indeed, base pairs thousands of base pairs distant are correlated. We do not find such a long-range correlation in the coding regions of the gene. We resolve the problem of the "non-stationary" feature of the sequence of base pairs by applying a new algorithm called Detrended Fluctuation Analysis (DFA). We address the claim of Voss that there is no difference in the statistical properties of coding and noncoding regions of DNA by systematically applying the DFA algorithm, as well as standard FFT analysis, to all eukaryotic DNA sequences (33 301 coding and 29 453 noncoding) in the entire GenBank database. We describe a simple model to account for the presence of long-range power-law correlations which is based upon a generalization of the classic Levy walk. Finally, we describe briefly some recent work showing that the noncoding sequences have certain statistical features in common with natural languages. Specifically, we adapt to DNA the Zipf approach to analyzing linguistic texts, and the Shannon approach to quantifying the "redundancy" of a linguistic text in terms of a measurable entropy function. We suggest that noncoding regions in plants and invertebrates may display a smaller entropy and larger redundancy than coding regions, further supporting the possibility that noncoding regions of DNA may carry biological information.
Noncoding origins of anthropoid traits and a new null model of transposon functionalization

PubMed Central

del Rosario, Ricardo C.H.; Rayan, Nirmala Arul

2014-01-01

Little is known about novel genetic elements that drove the emergence of anthropoid primates. We exploited the sequencing of the marmoset genome to identify 23,849 anthropoid-specific constrained (ASC) regions and confirmed their robust functional signatures. Of the ASC base pairs, 99.7% were noncoding, suggesting that novel anthropoid functional elements were overwhelmingly cis-regulatory. ASCs were highly enriched in loci associated with fetal brain development, motor coordination, neurotransmission, and vision, thus providing a large set of candidate elements for exploring the molecular basis of hallmark primate traits. We validated ASC192 as a primate-specific enhancer in proliferative zones of the developing brain. Unexpectedly, transposable elements (TEs) contributed to >56% of ASCs, and almost all TE families showed functional potential similar to that of nonrepetitive DNA. Three L1PA repeat-derived ASCs displayed coherent eye-enhancer function, thus demonstrating that the “gene-battery” model of TE functionalization applies to enhancers in vivo. Our study provides fundamental insights into genome evolution and the origins of anthropoid phenotypes and supports an elegantly simple new null model of TE exaptation. PMID:25043600
Noncoding Genomics in Gastric Cancer and the Gastric Precancerous Cascade: Pathogenesis and Biomarkers

PubMed Central

Garcia-Bloj, Benjamin; Fry, Jacqueline; Wichmann, Ignacio

2015-01-01

Gastric cancer is the fifth most common cancer and the third leading cause of cancer-related death, whose patterns vary among geographical regions and ethnicities. It is a multifactorial disease, and its development depends on infection by Helicobacter pylori (H. pylori) and Epstein-Barr virus (EBV), host genetic factors, and environmental factors. The heterogeneity of the disease has begun to be unraveled by a comprehensive mutational evaluation of primary tumors. The low-abundance of mutations suggests that other mechanisms participate in the evolution of the disease, such as those found through analyses of noncoding genomics. Noncoding genomics includes single nucleotide polymorphisms (SNPs), regulation of gene expression through DNA methylation of promoter sites, miRNAs, other noncoding RNAs in regulatory regions, and other topics. These processes and molecules ultimately control gene expression. Potential biomarkers are appearing from analyses of noncoding genomics. This review focuses on noncoding genomics and potential biomarkers in the context of gastric cancer and the gastric precancerous cascade. PMID:26379360
A 5′ Noncoding Exon Containing Engineered Intron Enhances Transgene Expression from Recombinant AAV Vectors in vivo

PubMed Central

Lu, Jiamiao; Williams, James A.; Luke, Jeremy; Zhang, Feijie; Chu, Kirk; Kay, Mark A.

2017-01-01

We previously developed a mini-intronic plasmid (MIP) expression system in which the essential bacterial elements for plasmid replication and selection are placed within an engineered intron contained within a universal 5′ UTR noncoding exon. Like minicircle DNA plasmids (devoid of bacterial backbone sequences), MIP plasmids overcome transcriptional silencing of the transgene. However, in addition MIP plasmids increase transgene expression by 2 and often >10 times higher than minicircle vectors in vivo and in vitro. Based on these findings, we examined the effects of the MIP intronic sequences in a recombinant adeno-associated virus (AAV) vector system. Recombinant AAV vectors containing an intron with a bacterial replication origin and bacterial selectable marker increased transgene expression by 40 to 100 times in vivo when compared with conventional AAV vectors. Therefore, inclusion of this noncoding exon/intron sequence upstream of the coding region can substantially enhance AAV-mediated gene expression in vivo. PMID:27903072
HEXIM1 and NEAT1 Long Non-coding RNA Form a Multi-subunit Complex that Regulates DNA-Mediated Innate Immune Response.

PubMed

Morchikh, Mehdi; Cribier, Alexandra; Raffel, Raoul; Amraoui, Sonia; Cau, Julien; Severac, Dany; Dubois, Emeric; Schwartz, Olivier; Bennasser, Yamina; Benkirane, Monsef

2017-08-03

The DNA-mediated innate immune response underpins anti-microbial defenses and certain autoimmune diseases. Here we used immunoprecipitation, mass spectrometry, and RNA sequencing to identify a ribonuclear complex built around HEXIM1 and the long non-coding RNA NEAT1 that we dubbed the HEXIM1-DNA-PK-paraspeckle components-ribonucleoprotein complex (HDP-RNP). The HDP-RNP contains DNA-PK subunits (DNAPKc, Ku70, and Ku80) and paraspeckle proteins (SFPQ, NONO, PSPC1, RBM14, and MATRIN3). We show that binding of HEXIM1 to NEAT1 is required for its assembly. We further demonstrate that the HDP-RNP is required for the innate immune response to foreign DNA, through the cGAS-STING-IRF3 pathway. The HDP-RNP interacts with cGAS and its partner PQBP1, and their interaction is remodeled by foreign DNA. Remodeling leads to the release of paraspeckle proteins, recruitment of STING, and activation of DNAPKc and IRF3. Our study establishes the HDP-RNP as a key nuclear regulator of DNA-mediated activation of innate immune response through the cGAS-STING pathway. Copyright © 2017 Elsevier Inc. All rights reserved.
Densely ionizing radiation affects DNA methylation of selective LINE-1 elements

DOE Office of Scientific and Technical Information (OSTI.GOV)

Prior, Sara; Miousse, Isabelle R.

Long Interspersed Nucleotide Element 1 (LINE-1) retrotransposons are heavily methylated and are the most abundant transposable elements in mammalian genomes. Here, we investigated the differential DNA methylation within the LINE-1 under normal conditions and in response to environmentally relevant doses of sparsely and densely ionizing radiation. We demonstrate that DNA methylation of LINE-1 elements in the lungs of C57BL6 mice is dependent on their evolutionary age, where the elder age of the element is associated with the lower extent of DNA methylation. Exposure to 5-aza-2′-deoxycytidine and methionine-deficient diet affected DNA methylation of selective LINE-1 elements in an age- and promotermore » type-dependent manner. Exposure to densely IR, but not sparsely IR, resulted in DNA hypermethylation of older LINE-1 elements, while the DNA methylation of evolutionary younger elements remained mostly unchanged. We also demonstrate that exposure to densely IR increased mRNA and protein levels of LINE-1 via the loss of the histone H3K9 dimethylation and an increase in the H3K4 trimethylation at the LINE-1 5′-untranslated region, independently of DNA methylation. Our findings suggest that DNA methylation is important for regulation of LINE-1 expression under normal conditions, but histone modifications may dictate the transcriptional activity of LINE-1 in response to exposure to densely IR. - Highlights: • DNA methylation of LINE-1 elements is dependent on their evolutionary age. • Densely ionizing radiation affects DNA methylation of selective LINE-1 elements. • Radiation-induced reactivation of LINE-1 is DNA methylation-independent. • Histone modifications dictate the transcriptional activity of LINE-1.« less
Conserved noncoding sequences (CNSs) in higher plants.

PubMed

Freeling, Michael; Subramaniam, Shabarinath

2009-04-01

Plant conserved noncoding sequences (CNSs)--a specific category of phylogenetic footprint--have been shown experimentally to function. No plant CNS is conserved to the extent that ultraconserved noncoding sequences are conserved in vertebrates. Plant CNSs are enriched in known transcription factor or other cis-acting binding sites, and are usually clustered around genes. Genes that encode transcription factors and/or those that respond to stimuli are particularly CNS-rich. Only rarely could this function involve small RNA binding. Some transcribed CNSs encode short translation products as a form of negative control. Approximately 4% of Arabidopsis gene content is estimated to be both CNS-rich and occupies a relatively long stretch of chromosome: Bigfoot genes (long phylogenetic footprints). We discuss a 'DNA-templated protein assembly' idea that might help explain Bigfoot gene CNSs.
Identification of Transposable Elements Contributing to Tissue-Specific Expression of Long Non-Coding RNAs

PubMed Central

Chishima, Takafumi; Iwakiri, Junichi

2018-01-01

It has been recently suggested that transposable elements (TEs) are re-used as functional elements of long non-coding RNAs (lncRNAs). This is supported by some examples such as the human endogenous retrovirus subfamily H (HERVH) elements contained within lncRNAs and expressed specifically in human embryonic stem cells (hESCs), as required to maintain hESC identity. There are at least two unanswered questions about all lncRNAs. How many TEs are re-used within lncRNAs? Are there any other TEs that affect tissue specificity of lncRNA expression? To answer these questions, we comprehensively identify TEs that are significantly related to tissue-specific expression levels of lncRNAs. We downloaded lncRNA expression data corresponding to normal human tissue from the Expression Atlas and transformed the data into tissue specificity estimates. Then, Fisher’s exact tests were performed to verify whether the presence or absence of TE-derived sequences influences the tissue specificity of lncRNA expression. Many TE–tissue pairs associated with tissue-specific expression of lncRNAs were detected, indicating that multiple TE families can be re-used as functional domains or regulatory sequences of lncRNAs. In particular, we found that the antisense promoter region of L1PA2, a LINE-1 subfamily, appears to act as a promoter for lncRNAs with placenta-specific expression. PMID:29315213
Genetic evidence for conserved non-coding element function across species–the ears have it

PubMed Central

Turner, Eric E.; Cox, Timothy C.

2014-01-01

Comparison of genomic sequences from diverse vertebrate species has revealed numerous highly conserved regions that do not appear to encode proteins or functional RNAs. Often these “conserved non-coding elements,” or CNEs, can direct gene expression to specific tissues in transgenic models, demonstrating they have regulatory function. CNEs are frequently found near “developmental” genes, particularly transcription factors, implying that these elements have essential regulatory roles in development. However, actual examples demonstrating CNE regulatory functions across species have been few, and recent loss-of-function studies of several CNEs in mice have shown relatively minor effects. In this Perspectives article, we discuss new findings in “fancy” rats and Highland cattle demonstrating that function of a CNE near the Hmx1 gene is crucial for normal external ear development and when disrupted can mimic loss-of function Hmx1 coding mutations in mice and humans. These findings provide important support for conserved developmental roles of CNEs in divergent species, and reinforce the concept that CNEs should be examined systematically in the ongoing search for genetic causes of human developmental disorders in the era of genome-scale sequencing. PMID:24478720

Mapping Fifteen Trace Elements in Human Seminal Plasma and Sperm DNA.

PubMed

Ali, Sazan; Chaspoul, Florence; Anderson, Loundou; Bergé-Lefranc, David; Achard, Vincent; Perrin, Jeanne; Gallice, Philippe; Guichaoua, Marie

2017-02-01

Studies suggest a relationship between semen quality and the concentration of trace elements in serum or seminal plasma. However, trace elements may be linked to DNA and capable of altering the gene expression patterns. Thus, trace element interactions with DNA may contribute to the mechanisms for a trans-generational reproductive effect. We developed an analytical method to determine the amount of trace elements bound to the sperm DNA, and to estimate their affinity for the sperm DNA by the ratio: R = Log [metal concentration in the sperm DNA/metal concentration in seminal plasma]. We then analyzed the concentrations of 15 trace elements (Al, Cd, Cr, Cu, Hg, Mn, Mo, Ni, Pb, Ti, V, Zn, As, Sb, and Se) in the seminal plasma and the sperm DNA in 64 normal and 30 abnormal semen specimens with Inductively Coupled Plasma/Mass Spectrometry (ICP-MS). This study showed all trace elements were detected in the seminal plasma and only metals were detected in the sperm DNA. There was no correlation between the metals' concentrations in the seminal plasma and the sperm DNA. Al had the highest affinity for DNA followed by Pb and Cd. This strong affinity is consistent with the known mutagenic effects of these metals. The lowest affinity was observed for Zn and Ti. We observed a significant increase of Al linked to the sperm DNA of patients with oligozoospermia and teratozoospermia. Al's reproductive toxicity might be due to Al linked to DNA, by altering spermatogenesis and expression patterns of genes involved in the function of reproduction.
Computational Identification and Functional Predictions of Long Noncoding RNA in Zea mays

PubMed Central

Boerner, Susan; McGinnis, Karen M.

2012-01-01

Background Computational analysis of cDNA sequences from multiple organisms suggests that a large portion of transcribed DNA does not code for a functional protein. In mammals, noncoding transcription is abundant, and often results in functional RNA molecules that do not appear to encode proteins. Many long noncoding RNAs (lncRNAs) appear to have epigenetic regulatory function in humans, including HOTAIR and XIST. While epigenetic gene regulation is clearly an essential mechanism in plants, relatively little is known about the presence or function of lncRNAs in plants. Methodology/Principal Findings To explore the connection between lncRNA and epigenetic regulation of gene expression in plants, a computational pipeline using the programming language Python has been developed and applied to maize full length cDNA sequences to identify, classify, and localize potential lncRNAs. The pipeline was used in parallel with an SVM tool for identifying ncRNAs to identify the maximal number of ncRNAs in the dataset. Although the available library of sequences was small and potentially biased toward protein coding transcripts, 15% of the sequences were predicted to be noncoding. Approximately 60% of these sequences appear to act as precursors for small RNA molecules and may function to regulate gene expression via a small RNA dependent mechanism. ncRNAs were predicted to originate from both genic and intergenic loci. Of the lncRNAs that originated from genic loci, ∼20% were antisense to the host gene loci. Conclusions/Significance Consistent with similar studies in other organisms, noncoding transcription appears to be widespread in the maize genome. Computational predictions indicate that maize lncRNAs may function to regulate expression of other genes through multiple RNA mediated mechanisms. PMID:22916204
Interplay between DNA methylation, histone modification and chromatin remodeling in stem cells and during development.

PubMed

Ikegami, Kohta; Ohgane, Jun; Tanaka, Satoshi; Yagi, Shintaro; Shiota, Kunio

2009-01-01

Genes constitute only a small proportion of the mammalian genome, the majority of which is composed of non-genic repetitive elements including interspersed repeats and satellites. A unique feature of the mammalian genome is that there are numerous tissue-dependent, differentially methylated regions (T-DMRs) in the non-repetitive sequences, which include genes and their regulatory elements. The epigenetic status of T-DMRs varies from that of repetitive elements and constitutes the DNA methylation profile genome-wide. Since the DNA methylation profile is specific to each cell and tissue type, much like a fingerprint, it can be used as a means of identification. The formation of DNA methylation profiles is the basis for cell differentiation and development in mammals. The epigenetic status of each T-DMR is regulated by the interplay between DNA methyltransferases, histone modification enzymes, histone subtypes, non-histone nuclear proteins and non-coding RNAs. In this review, we will discuss how these epigenetic factors cooperate to establish cell- and tissue-specific DNA methylation profiles.
Noncoding origins of anthropoid traits and a new null model of transposon functionalization.

PubMed

del Rosario, Ricardo C H; Rayan, Nirmala Arul; Prabhakar, Shyam

2014-09-01

Little is known about novel genetic elements that drove the emergence of anthropoid primates. We exploited the sequencing of the marmoset genome to identify 23,849 anthropoid-specific constrained (ASC) regions and confirmed their robust functional signatures. Of the ASC base pairs, 99.7% were noncoding, suggesting that novel anthropoid functional elements were overwhelmingly cis-regulatory. ASCs were highly enriched in loci associated with fetal brain development, motor coordination, neurotransmission, and vision, thus providing a large set of candidate elements for exploring the molecular basis of hallmark primate traits. We validated ASC192 as a primate-specific enhancer in proliferative zones of the developing brain. Unexpectedly, transposable elements (TEs) contributed to >56% of ASCs, and almost all TE families showed functional potential similar to that of nonrepetitive DNA. Three L1PA repeat-derived ASCs displayed coherent eye-enhancer function, thus demonstrating that the "gene-battery" model of TE functionalization applies to enhancers in vivo. Our study provides fundamental insights into genome evolution and the origins of anthropoid phenotypes and supports an elegantly simple new null model of TE exaptation. © 2014 del Rosario et al.; Published by Cold Spring Harbor Laboratory Press.
Emergence of the Noncoding Cancer Genome: A Target of Genetic and Epigenetic Alterations.

PubMed

Zhou, Stanley; Treloar, Aislinn E; Lupien, Mathieu

2016-11-01

The emergence of whole-genome annotation approaches is paving the way for the comprehensive annotation of the human genome across diverse cell and tissue types exposed to various environmental conditions. This has already unmasked the positions of thousands of functional cis-regulatory elements integral to transcriptional regulation, such as enhancers, promoters, and anchors of chromatin interactions that populate the noncoding genome. Recent studies have shown that cis-regulatory elements are commonly the targets of genetic and epigenetic alterations associated with aberrant gene expression in cancer. Here, we review these findings to showcase the contribution of the noncoding genome and its alteration in the development and progression of cancer. We also highlight the opportunities to translate the biological characterization of genetic and epigenetic alterations in the noncoding cancer genome into novel approaches to treat or monitor disease. The majority of genetic and epigenetic alterations accumulate in the noncoding genome throughout oncogenesis. Discriminating driver from passenger events is a challenge that holds great promise to improve our understanding of the etiology of different cancer types. Advancing our understanding of the noncoding cancer genome may thus identify new therapeutic opportunities and accelerate our capacity to find improved biomarkers to monitor various stages of cancer development. Cancer Discov; 6(11); 1215-29. ©2016 AACR. ©2016 American Association for Cancer Research.
Highly conserved elements discovered in vertebrates are present in non-syntenic loci of tunicates, act as enhancers and can be transcribed during development

PubMed Central

Sanges, Remo; Hadzhiev, Yavor; Gueroult-Bellone, Marion; Roure, Agnes; Ferg, Marco; Meola, Nicola; Amore, Gabriele; Basu, Swaraj; Brown, Euan R.; De Simone, Marco; Petrera, Francesca; Licastro, Danilo; Strähle, Uwe; Banfi, Sandro; Lemaire, Patrick; Birney, Ewan; Müller, Ferenc; Stupka, Elia

2013-01-01

Co-option of cis-regulatory modules has been suggested as a mechanism for the evolution of expression sites during development. However, the extent and mechanisms involved in mobilization of cis-regulatory modules remains elusive. To trace the history of non-coding elements, which may represent candidate ancestral cis-regulatory modules affirmed during chordate evolution, we have searched for conserved elements in tunicate and vertebrate (Olfactores) genomes. We identified, for the first time, 183 non-coding sequences that are highly conserved between the two groups. Our results show that all but one element are conserved in non-syntenic regions between vertebrate and tunicate genomes, while being syntenic among vertebrates. Nevertheless, in all the groups, they are significantly associated with transcription factors showing specific functions fundamental to animal development, such as multicellular organism development and sequence-specific DNA binding. The majority of these regions map onto ultraconserved elements and we demonstrate that they can act as functional enhancers within the organism of origin, as well as in cross-transgenesis experiments, and that they are transcribed in extant species of Olfactores. We refer to the elements as ‘Olfactores conserved non-coding elements’. PMID:23393190
[Long non-coding RNAs in the pathophysiology of atherosclerosis].

PubMed

Novak, Jan; Vašků, Julie Bienertová; Souček, Miroslav

2018-01-01

The human genome contains about 22 000 protein-coding genes that are transcribed to an even larger amount of messenger RNAs (mRNA). Interestingly, the results of the project ENCODE from 2012 show, that despite up to 90 % of our genome being actively transcribed, protein-coding mRNAs make up only 2-3 % of the total amount of the transcribed RNA. The rest of RNA transcripts is not translated to proteins and that is why they are referred to as "non-coding RNAs". Earlier the non-coding RNA was considered "the dark matter of genome", or "the junk", whose genes has accumulated in our DNA during the course of evolution. Today we already know that non-coding RNAs fulfil a variety of regulatory functions in our body - they intervene into epigenetic processes from chromatin remodelling to histone methylation, or into the transcription process itself, or even post-transcription processes. Long non-coding RNAs (lncRNA) are one of the classes of non-coding RNAs that have more than 200 nucleotides in length (non-coding RNAs with less than 200 nucleotides in length are called small non-coding RNAs). lncRNAs represent a widely varied and large group of molecules with diverse regulatory functions. We can identify them in all thinkable cell types or tissues, or even in an extracellular space, which includes blood, specifically plasma. Their levels change during the course of organogenesis, they are specific to different tissues and their changes also occur along with the development of different illnesses, including atherosclerosis. This review article aims to present lncRNAs problematics in general and then focuses on some of their specific representatives in relation to the process of atherosclerosis (i.e. we describe lncRNA involvement in the biology of endothelial cells, vascular smooth muscle cells or immune cells), and we further describe possible clinical potential of lncRNA, whether in diagnostics or therapy of atherosclerosis and its clinical manifestations.Key words
VEZF1 Elements Mediate Protection from DNA Methylation

PubMed Central

Strogantsev, Ruslan; Gaszner, Miklos; Hair, Alan; Felsenfeld, Gary; West, Adam G.

2010-01-01

There is growing consensus that genome organization and long-range gene regulation involves partitioning of the genome into domains of distinct epigenetic chromatin states. Chromatin insulator or barrier elements are key components of these processes as they can establish boundaries between chromatin states. The ability of elements such as the paradigm β-globin HS4 insulator to block the range of enhancers or the spread of repressive histone modifications is well established. Here we have addressed the hypothesis that a barrier element in vertebrates should be capable of defending a gene from silencing by DNA methylation. Using an established stable reporter gene system, we find that HS4 acts specifically to protect a gene promoter from de novo DNA methylation. Notably, protection from methylation can occur in the absence of histone acetylation or transcription. There is a division of labor at HS4; the sequences that mediate protection from methylation are separable from those that mediate CTCF-dependent enhancer blocking and USF-dependent histone modification recruitment. The zinc finger protein VEZF1 was purified as the factor that specifically interacts with the methylation protection elements. VEZF1 is a candidate CpG island protection factor as the G-rich sequences bound by VEZF1 are frequently found at CpG island promoters. Indeed, we show that VEZF1 elements are sufficient to mediate demethylation and protection of the APRT CpG island promoter from DNA methylation. We propose that many barrier elements in vertebrates will prevent DNA methylation in addition to blocking the propagation of repressive histone modifications, as either process is sufficient to direct the establishment of an epigenetically stable silent chromatin state. PMID:20062523
Cancer-linked satellite 2 DNA hypomethylation does not regulate Sat2 non-coding RNA expression and is initiated by heat shock pathway activation.

PubMed

Tilman, Gaëlle; Arnoult, Nausica; Lenglez, Sandrine; Van Beneden, Amandine; Loriot, Axelle; De Smet, Charles; Decottignies, Anabelle

2012-08-01

Epigenetic dysfunctions, including DNA methylation alterations, play major roles in cancer initiation and progression. Although it is well established that gene promoter demethylation activates transcription, it remains unclear whether hypomethylation of repetitive heterochromatin similarly affects expression of non-coding RNA from these loci. Understanding how repetitive non-coding RNAs are transcriptionally regulated is important given that their established upregulation by the heat shock (HS) pathway suggests important functions in cellular response to stress, possibly by promoting heterochromatin reconstruction. We found that, although pericentromeric satellite 2 (Sat2) DNA hypomethylation is detected in a majority of cancer cell lines of various origins, DNA methylation loss does not constitutively hyperactivate Sat2 expression, and also does not facilitate Sat2 transcriptional induction upon heat shock. In melanoma tumor samples, our analysis revealed that the HS response, frequently upregulated in tumors, is probably the main determinant of Sat2 RNA expression in vivo. Next, we tested whether HS pathway hyperactivation may drive Sat2 demethylation. Strikingly, we found that both hyperthermia and hyperactivated RasV12 oncogene, another potent inducer of the HS pathway, reduced Sat2 methylation levels by up to 27% in human fibroblasts recovering from stress. Demethylation occurred locally on Sat2 repeats, resulting in a demethylation signature that was also detected in cancer cell lines with moderate genome-wide hypomethylation. We therefore propose that upregulation of Sat2 transcription in response to HS pathway hyperactivation during tumorigenesis may promote localized demethylation of the locus. This, in turn, may contribute to tumorigenesis, as demethylation of Sat2 was previously reported to favor chromosomal rearrangements.
Densely ionizing radiation affects DNA methylation of selective LINE-1 elements1

PubMed Central

Prior, Sara; Miousse, Isabelle R.; Nzabarushimana, Etienne; Pathak, Rupak; Skinner, Charles; Kutanzi, Kristy R.; Allen, Antiño R.; Raber, Jacob; Tackett, Alan J.; Hauer-Jensen, Martin; Nelson, Gregory A.; Koturbash, Igor

2016-01-01

Long Interspersed Nucleotide Element 1 (LINE-1) retrotransposons are heavily methylated and are the most abundant transposable elements in mammalian genomes. Here, we investigated the differential DNA methylation within the LINE-1 under normal conditions and in response to environmentally relevant doses of sparsely and densely ionizing radiation. We demonstrate that DNA methylation of LINE-1 elements in the lungs of C57BL6 mice is dependent on their evolutionary age, where the elder age of the element is associated with the lower extent of DNA methylation. Exposure to 5-aza-2′-deoxycytidine and methionine-deficient diet affected DNA methylation of selective LINE-1 elements in an age- and promoter type-dependent manner. Exposure to densely IR, but not sparsely IR, resulted in DNA hypermethylation of older LINE-1 elements, while the DNA methylation of evolutionary younger elements remained mostly unchanged. We also demonstrate that exposure to densely IR increased mRNA and protein levels of LINE-1 via the loss of the histone H3K9 dimethylation and an increase in the H3K4 trimethylation at the LINE-1 5′-untranslated region, independently of DNA methylation. Our findings suggest that DNA methylation is important for regulation of LINE-1 expression under normal conditions, but histone modifications may dictate the transcriptional activity of LINE-1 in response to exposure to densely IR. PMID:27419368
Mutation in a primate-conserved retrotransposon reveals a noncoding RNA as a mediator of infantile encephalopathy

PubMed Central

Cartault, François; Munier, Patrick; Benko, Edgar; Desguerre, Isabelle; Hanein, Sylvain; Boddaert, Nathalie; Bandiera, Simonetta; Vellayoudom, Jeanine; Krejbich-Trotot, Pascale; Bintner, Marc; Hoarau, Jean-Jacques; Girard, Muriel; Génin, Emmanuelle; de Lonlay, Pascale; Fourmaintraux, Alain; Naville, Magali; Rodriguez, Diana; Feingold, Josué; Renouil, Michel; Munnich, Arnold; Westhof, Eric; Fähling, Michael; Lyonnet, Stanislas; Henrion-Caude, Alexandra

2012-01-01

The human genome is densely populated with transposons and transposon-like repetitive elements. Although the impact of these transposons and elements on human genome evolution is recognized, the significance of subtle variations in their sequence remains mostly unexplored. Here we report homozygosity mapping of an infantile neurodegenerative disease locus in a genetic isolate. Complete DNA sequencing of the 400-kb linkage locus revealed a point mutation in a primate-specific retrotransposon that was transcribed as part of a unique noncoding RNA, which was expressed in the brain. In vitro knockdown of this RNA increased neuronal apoptosis, consistent with the inappropriate dosage of this RNA in vivo and with the phenotype. Moreover, structural analysis of the sequence revealed a small RNA-like hairpin that was consistent with the putative gain of a functional site when mutated. We show here that a mutation in a unique transposable element-containing RNA is associated with lethal encephalopathy, and we suggest that RNAs that harbor evolutionarily recent repetitive elements may play important roles in human brain development. PMID:22411793
DEPPDB - DNA electrostatic potential properties database. Electrostatic properties of genome DNA elements.

PubMed

Osypov, Alexander A; Krutinin, Gleb G; Krutinina, Eugenia A; Kamzolova, Svetlana G

2012-04-01

Electrostatic properties of genome DNA are important to its interactions with different proteins, in particular, related to transcription. DEPPDB - DNA Electrostatic Potential (and other Physical) Properties Database - provides information on the electrostatic and other physical properties of genome DNA combined with its sequence and annotation of biological and structural properties of genomes and their elements. Genomes are organized on taxonomical basis, supporting comparative and evolutionary studies. Currently, DEPPDB contains all completely sequenced bacterial, viral, mitochondrial, and plastids genomes according to the NCBI RefSeq, and some model eukaryotic genomes. Data for promoters, regulation sites, binding proteins, etc., are incorporated from established DBs and literature. The database is complemented by analytical tools. User sequences calculations are available. Case studies discovered electrostatics complementing DNA bending in E.coli plasmid BNT2 promoter functioning, possibly affecting host-environment metabolic switch. Transcription factors binding sites gravitate to high potential regions, confirming the electrostatics universal importance in protein-DNA interactions beyond the classical promoter-RNA polymerase recognition and regulation. Other genome elements, such as terminators, also show electrostatic peculiarities. Most intriguing are gene starts, exhibiting taxonomic correlations. The necessity of the genome electrostatic properties studies is discussed.
Transposable elements and G-quadruplexes.

PubMed

Kejnovsky, Eduard; Tokan, Viktor; Lexa, Matej

2015-09-01

A significant part of eukaryotic genomes is formed by transposable elements (TEs) containing not only genes but also regulatory sequences. Some of the regulatory sequences located within TEs can form secondary structures like hairpins or three-stranded (triplex DNA) and four-stranded (quadruplex DNA) conformations. This review focuses on recent evidence showing that G-quadruplex-forming sequences in particular are often present in specific parts of TEs in plants and humans. We discuss the potential role of these structures in the TE life cycle as well as the impact of G-quadruplexes on replication, transcription, translation, chromatin status, and recombination. The aim of this review is to emphasize that TEs may serve as vehicles for the genomic spread of G-quadruplexes. These non-canonical DNA structures and their conformational switches may constitute another regulatory system that, together with small and long non-coding RNA molecules and proteins, contribute to the complex cellular network resulting in the large diversity of eukaryotes.
Characterization of the repetitive DNA elements in the genome of fish lymphocystis disease viruses.

PubMed

Schnitzler, P; Darai, G

1989-09-01

The complete DNA nucleotide sequence of the repetitive DNA elements in the genome of fish lymphocystis disease virus (FLDV) isolated from two different species (flounder and dab) was determined. The size of these repetitive DNA elements was found to be 1413 bp which corresponds to the DNA sequences of the 5' terminus of the EcoRI DNA fragment B (0.034 to 0.052 m.u.) and to the EcoRI DNA fragment M (0.718 to 0.736 m.u.) of the FLDV genome causing lymphocystis disease in flounder and plaice. The degree of DNA nucleotide homology between both regions was found to be 99%. The repetitive DNA element in the genome of FLDV isolated from other fish species (dab) was identified and is located within the EcoRI DNA fragment B and J of the viral genome. The DNA nucleotide sequence of one duplicate of this repetition (EcoRI DNA fragment J) was determined (1410 bp) and compared to the DNA nucleotide sequences of the repetitive DNA elements of the genome of FLDV isolated from flounder. It was found that the repetitive DNA elements of the genome of FLDV derived from two different fish species are highly conserved and possess a degree of DNA sequence homology of 94%. The DNA sequences of each strand of the individual repetitive element possess one open reading frame.
Lnc2Meth: a manually curated database of regulatory relationships between long non-coding RNAs and DNA methylation associated with human disease

PubMed Central

Zhi, Hui; Li, Xin; Wang, Peng; Gao, Yue; Gao, Baoqing; Zhou, Dianshuang; Zhang, Yan; Guo, Maoni; Yue, Ming; Shen, Weitao

2018-01-01

Abstract Lnc2Meth (http://www.bio-bigdata.com/Lnc2Meth/), an interactive resource to identify regulatory relationships between human long non-coding RNAs (lncRNAs) and DNA methylation, is not only a manually curated collection and annotation of experimentally supported lncRNAs-DNA methylation associations but also a platform that effectively integrates tools for calculating and identifying the differentially methylated lncRNAs and protein-coding genes (PCGs) in diverse human diseases. The resource provides: (i) advanced search possibilities, e.g. retrieval of the database by searching the lncRNA symbol of interest, DNA methylation patterns, regulatory mechanisms and disease types; (ii) abundant computationally calculated DNA methylation array profiles for the lncRNAs and PCGs; (iii) the prognostic values for each hit transcript calculated from the patients clinical data; (iv) a genome browser to display the DNA methylation landscape of the lncRNA transcripts for a specific type of disease; (v) tools to re-annotate probes to lncRNA loci and identify the differential methylation patterns for lncRNAs and PCGs with user-supplied external datasets; (vi) an R package (LncDM) to complete the differentially methylated lncRNAs identification and visualization with local computers. Lnc2Meth provides a timely and valuable resource that can be applied to significantly expand our understanding of the regulatory relationships between lncRNAs and DNA methylation in various human diseases. PMID:29069510
Conserved Non-Coding Sequences are Associated with Rates of mRNA Decay in Arabidopsis.

PubMed

Spangler, Jacob B; Feltus, Frank Alex

2013-01-01

Steady-state mRNA levels are tightly regulated through a combination of transcriptional and post-transcriptional control mechanisms. The discovery of cis-acting DNA elements that encode these control mechanisms is of high importance. We have investigated the influence of conserved non-coding sequences (CNSs), DNA patterns retained after an ancient whole genome duplication event, on the breadth of gene expression and the rates of mRNA decay in Arabidopsis thaliana. The absence of CNSs near α duplicate genes was associated with a decrease in breadth of gene expression and slower mRNA decay rates while the presence CNSs near α duplicates was associated with an increase in breadth of gene expression and faster mRNA decay rates. The observed difference in mRNA decay rate was fastest in genes with CNSs in both non-transcribed and transcribed regions, albeit through an unknown mechanism. This study supports the notion that some Arabidopsis CNSs regulate the steady-state mRNA levels through post-transcriptional control mechanisms and that CNSs also play a role in controlling the breadth of gene expression.
Conserved Non-Coding Sequences are Associated with Rates of mRNA Decay in Arabidopsis

PubMed Central

Spangler, Jacob B.; Feltus, Frank Alex

2013-01-01

Steady-state mRNA levels are tightly regulated through a combination of transcriptional and post-transcriptional control mechanisms. The discovery of cis-acting DNA elements that encode these control mechanisms is of high importance. We have investigated the influence of conserved non-coding sequences (CNSs), DNA patterns retained after an ancient whole genome duplication event, on the breadth of gene expression and the rates of mRNA decay in Arabidopsis thaliana. The absence of CNSs near α duplicate genes was associated with a decrease in breadth of gene expression and slower mRNA decay rates while the presence CNSs near α duplicates was associated with an increase in breadth of gene expression and faster mRNA decay rates. The observed difference in mRNA decay rate was fastest in genes with CNSs in both non-transcribed and transcribed regions, albeit through an unknown mechanism. This study supports the notion that some Arabidopsis CNSs regulate the steady-state mRNA levels through post-transcriptional control mechanisms and that CNSs also play a role in controlling the breadth of gene expression. PMID:23675377
Behind the curtain of non-coding RNAs; long non-coding RNAs regulating hepatocarcinogenesis

PubMed Central

El Khodiry, Aya; Afify, Menna; El Tayebi, Hend M

2018-01-01

Hepatocellular carcinoma (HCC) is one of the most common and aggressive cancers worldwide. HCC is the fifth common malignancy in the world and the second leading cause of cancer death in Asia. Long non-coding RNAs (lncRNAs) are RNAs with a length greater than 200 nucleotides that do not encode proteins. lncRNAs can regulate gene expression and protein synthesis in several ways by interacting with DNA, RNA and proteins in a sequence specific manner. They could regulate cellular and developmental processes through either gene inhibition or gene activation. Many studies have shown that dysregulation of lncRNAs is related to many human diseases such as cardiovascular diseases, genetic disorders, neurological diseases, immune mediated disorders and cancers. However, the study of lncRNAs is challenging as they are poorly conserved between species, their expression levels aren’t as high as that of mRNAs and have great interpatient variations. The study of lncRNAs expression in cancers have been a breakthrough as it unveils potential biomarkers and drug targets for cancer therapy and helps understand the mechanism of pathogenesis. This review discusses many long non-coding RNAs and their contribution in HCC, their role in development, metastasis, and prognosis of HCC and how to regulate and target these lncRNAs as a therapeutic tool in HCC treatment in the future. PMID:29434445
RNA Helicase Associated with AU-rich Element (RHAU/DHX36) Interacts with the 3′-Tail of the Long Non-coding RNA BC200 (BCYRN1)*

PubMed Central

Booy, Evan P.; McRae, Ewan K. S.; Howard, Ryan; Deo, Soumya R.; Ariyo, Emmanuel O.; Dzananovic, Edis; Meier, Markus; Stetefeld, Jörg; McKenna, Sean A.

2016-01-01

RNA helicase associated with AU-rich element (RHAU) is an ATP-dependent RNA helicase that demonstrates high affinity for quadruplex structures in DNA and RNA. To elucidate the significance of these quadruplex-RHAU interactions, we have performed RNA co-immunoprecipitation screens to identify novel RNAs bound to RHAU and characterize their function. In the course of this study, we have identified the non-coding RNA BC200 (BCYRN1) as specifically enriched upon RHAU immunoprecipitation. Although BC200 does not adopt a quadruplex structure and does not bind the quadruplex-interacting motif of RHAU, it has direct affinity for RHAU in vitro. Specifically designed BC200 truncations and RNase footprinting assays demonstrate that RHAU binds to an adenosine-rich region near the 3′-end of the RNA. RHAU truncations support binding that is dependent upon a region within the C terminus and is specific to RHAU isoform 1. Tests performed to assess whether BC200 interferes with RHAU helicase activity have demonstrated the ability of BC200 to act as an acceptor of unwound quadruplexes via a cytosine-rich region near the 3′-end of the RNA. Furthermore, an interaction between BC200 and the quadruplex-containing telomerase RNA was confirmed by pull-down assays of the endogenous RNAs. This leads to the possibility that RHAU may direct BC200 to bind and exert regulatory functions at quadruplex-containing RNA or DNA sequences. PMID:26740632
[Long non-coding RNAs in plants].

PubMed

Xiaoqing, Huang; Dandan, Li; Juan, Wu

2015-04-01

Long non-coding RNAs (lncRNAs), which are longer than 200 nucleotides in length, widely exist in organisms and function in a variety of biological processes. Currently, most of lncRNAs found in plants are transcribed by RNA polymerase Ⅱ and mediate gene expression through multiple mechanisms, such as target mimicry, transcription interference, histone methylation and DNA methylation, and play important roles in flowering, male sterility, nutrition metabolism, biotic and abiotic stress and other biological processes as regulators in plants. In this review, we summarize the databases, prediction methods, and possible functions of plant lncRNAs discovered in recent years.

Noncoding somatic and inherited single-nucleotide variants converge to promote ESR1 expression in breast cancer.

PubMed

Bailey, Swneke D; Desai, Kinjal; Kron, Ken J; Mazrooei, Parisa; Sinnott-Armstrong, Nicholas A; Treloar, Aislinn E; Dowar, Mark; Thu, Kelsie L; Cescon, David W; Silvester, Jennifer; Yang, S Y Cindy; Wu, Xue; Pezo, Rossanna C; Haibe-Kains, Benjamin; Mak, Tak W; Bedard, Philippe L; Pugh, Trevor J; Sallari, Richard C; Lupien, Mathieu

2016-10-01

Sustained expression of the estrogen receptor-α (ESR1) drives two-thirds of breast cancer and defines the ESR1-positive subtype. ESR1 engages enhancers upon estrogen stimulation to establish an oncogenic expression program. Somatic copy number alterations involving the ESR1 gene occur in approximately 1% of ESR1-positive breast cancers, suggesting that other mechanisms underlie the persistent expression of ESR1. We report significant enrichment of somatic mutations within the set of regulatory elements (SRE) regulating ESR1 in 7% of ESR1-positive breast cancers. These mutations regulate ESR1 expression by modulating transcription factor binding to the DNA. The SRE includes a recurrently mutated enhancer whose activity is also affected by rs9383590, a functional inherited single-nucleotide variant (SNV) that accounts for several breast cancer risk-associated loci. Our work highlights the importance of considering the combinatorial activity of regulatory elements as a single unit to delineate the impact of noncoding genetic alterations on single genes in cancer.
Modular structural elements in the replication origin region of Tetrahymena rDNA.

PubMed Central

Du, C; Sanzgiri, R P; Shaiu, W L; Choi, J K; Hou, Z; Benbow, R M; Dobbs, D L

1995-01-01

Computer analyses of the DNA replication origin region in the amplified rRNA genes of Tetrahymena thermophila identified a potential initiation zone in the 5'NTS [Dobbs, Shaiu and Benbow (1994), Nucleic Acids Res. 22, 2479-2489]. This region consists of a putative DNA unwinding element (DUE) aligned with predicted bent DNA segments, nuclear matrix or scaffold associated region (MAR/SAR) consensus sequences, and other common modular sequence elements previously shown to be clustered in eukaryotic chromosomal origin regions. In this study, two mung bean nuclease-hypersensitive sites in super-coiled plasmid DNA were localized within the major DUE-like element predicted by thermodynamic analyses. Three restriction fragments of the 5'NTS region predicted to contain bent DNA segments exhibited anomalous migration characteristic of bent DNA during electrophoresis on polyacrylamide gels. Restriction fragments containing the 5'NTS region bound Tetrahymena nuclear matrices in an in vitro binding assay, consistent with an association of the replication origin region with the nuclear matrix in vivo. The direct demonstration in a protozoan origin region of elements previously identified in Drosophila, chick and mammalian origin regions suggests that clusters of modular structural elements may be a conserved feature of eukaryotic chromosomal origins of replication. Images PMID:7784181
Detection of non-coding RNA in bacteria and archaea using the DETR'PROK Galaxy pipeline.

PubMed

Toffano-Nioche, Claire; Luo, Yufei; Kuchly, Claire; Wallon, Claire; Steinbach, Delphine; Zytnicki, Matthias; Jacq, Annick; Gautheret, Daniel

2013-09-01

RNA-seq experiments are now routinely used for the large scale sequencing of transcripts. In bacteria or archaea, such deep sequencing experiments typically produce 10-50 million fragments that cover most of the genome, including intergenic regions. In this context, the precise delineation of the non-coding elements is challenging. Non-coding elements include untranslated regions (UTRs) of mRNAs, independent small RNA genes (sRNAs) and transcripts produced from the antisense strand of genes (asRNA). Here we present a computational pipeline (DETR'PROK: detection of ncRNAs in prokaryotes) based on the Galaxy framework that takes as input a mapping of deep sequencing reads and performs successive steps of clustering, comparison with existing annotation and identification of transcribed non-coding fragments classified into putative 5' UTRs, sRNAs and asRNAs. We provide a step-by-step description of the protocol using real-life example data sets from Vibrio splendidus and Escherichia coli. Copyright © 2013 The Authors. Published by Elsevier Inc. All rights reserved.
Lnc2Meth: a manually curated database of regulatory relationships between long non-coding RNAs and DNA methylation associated with human disease.

PubMed

Zhi, Hui; Li, Xin; Wang, Peng; Gao, Yue; Gao, Baoqing; Zhou, Dianshuang; Zhang, Yan; Guo, Maoni; Yue, Ming; Shen, Weitao; Ning, Shangwei; Jin, Lianhong; Li, Xia

2018-01-04

Lnc2Meth (http://www.bio-bigdata.com/Lnc2Meth/), an interactive resource to identify regulatory relationships between human long non-coding RNAs (lncRNAs) and DNA methylation, is not only a manually curated collection and annotation of experimentally supported lncRNAs-DNA methylation associations but also a platform that effectively integrates tools for calculating and identifying the differentially methylated lncRNAs and protein-coding genes (PCGs) in diverse human diseases. The resource provides: (i) advanced search possibilities, e.g. retrieval of the database by searching the lncRNA symbol of interest, DNA methylation patterns, regulatory mechanisms and disease types; (ii) abundant computationally calculated DNA methylation array profiles for the lncRNAs and PCGs; (iii) the prognostic values for each hit transcript calculated from the patients clinical data; (iv) a genome browser to display the DNA methylation landscape of the lncRNA transcripts for a specific type of disease; (v) tools to re-annotate probes to lncRNA loci and identify the differential methylation patterns for lncRNAs and PCGs with user-supplied external datasets; (vi) an R package (LncDM) to complete the differentially methylated lncRNAs identification and visualization with local computers. Lnc2Meth provides a timely and valuable resource that can be applied to significantly expand our understanding of the regulatory relationships between lncRNAs and DNA methylation in various human diseases. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Complex interplay among DNA modification, noncoding RNA expression and protein-coding RNA expression in Salvia miltiorrhiza chloroplast genome.

PubMed

Chen, Haimei; Zhang, Jianhui; Yuan, George; Liu, Chang

2014-01-01

Salvia miltiorrhiza is one of the most widely used medicinal plants. As a first step to develop a chloroplast-based genetic engineering method for the over-production of active components from S. miltiorrhiza, we have analyzed the genome, transcriptome, and base modifications of the S. miltiorrhiza chloroplast. Total genomic DNA and RNA were extracted from fresh leaves and then subjected to strand-specific RNA-Seq and Single-Molecule Real-Time (SMRT) sequencing analyses. Mapping the RNA-Seq reads to the genome assembly allowed us to determine the relative expression levels of 80 protein-coding genes. In addition, we identified 19 polycistronic transcription units and 136 putative antisense and intergenic noncoding RNA (ncRNA) genes. Comparison of the abundance of protein-coding transcripts (cRNA) with and without overlapping antisense ncRNAs (asRNA) suggest that the presence of asRNA is associated with increased cRNA abundance (p<0.05). Using the SMRT Portal software (v1.3.2), 2687 potential DNA modification sites and two potential DNA modification motifs were predicted. The two motifs include a TATA box-like motif (CPGDMM1, "TATANNNATNA"), and an unknown motif (CPGDMM2 "WNYANTGAW"). Specifically, 35 of the 97 CPGDMM1 motifs (36.1%) and 91 of the 369 CPGDMM2 motifs (24.7%) were found to be significantly modified (p<0.01). Analysis of genes downstream of the CPGDMM1 motif revealed the significantly increased abundance of ncRNA genes that are less than 400 bp away from the significantly modified CPGDMM1motif (p<0.01). Taking together, the present study revealed a complex interplay among DNA modifications, ncRNA and cRNA expression in chloroplast genome.
An expanding universe of the non-coding genome in cancer biology.

PubMed

Xue, Bin; He, Lin

2014-06-01

Neoplastic transformation is caused by accumulation of genetic and epigenetic alterations that ultimately convert normal cells into tumor cells with uncontrolled proliferation and survival, unlimited replicative potential and invasive growth [Hanahan,D. et al. (2011) Hallmarks of cancer: the next generation. Cell, 144, 646-674]. Although the majority of the cancer studies have focused on the functions of protein-coding genes, emerging evidence has started to reveal the importance of the vast non-coding genome, which constitutes more than 98% of the human genome. A number of non-coding RNAs (ncRNAs) derived from the 'dark matter' of the human genome exhibit cancer-specific differential expression and/or genomic alterations, and it is increasingly clear that ncRNAs, including small ncRNAs and long ncRNAs (lncRNAs), play an important role in cancer development by regulating protein-coding gene expression through diverse mechanisms. In addition to ncRNAs, nearly half of the mammalian genomes consist of transposable elements, particularly retrotransposons. Once depicted as selfish genomic parasites that propagate at the expense of host fitness, retrotransposon elements could also confer regulatory complexity to the host genomes during development and disease. Reactivation of retrotransposons in cancer, while capable of causing insertional mutagenesis and genome rearrangements to promote oncogenesis, could also alter host gene expression networks to favor tumor development. Taken together, the functional significance of non-coding genome in tumorigenesis has been previously underestimated, and diverse transcripts derived from the non-coding genome could act as integral functional components of the oncogene and tumor suppressor network. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Current Research on Non-Coding Ribonucleic Acid (RNA).

PubMed

Wang, Jing; Samuels, David C; Zhao, Shilin; Xiang, Yu; Zhao, Ying-Yong; Guo, Yan

2017-12-05

Non-coding ribonucleic acid (RNA) has without a doubt captured the interest of biomedical researchers. The ability to screen the entire human genome with high-throughput sequencing technology has greatly enhanced the identification, annotation and prediction of the functionality of non-coding RNAs. In this review, we discuss the current landscape of non-coding RNA research and quantitative analysis. Non-coding RNA will be categorized into two major groups by size: long non-coding RNAs and small RNAs. In long non-coding RNA, we discuss regular long non-coding RNA, pseudogenes and circular RNA. In small RNA, we discuss miRNA, transfer RNA, piwi-interacting RNA, small nucleolar RNA, small nuclear RNA, Y RNA, single recognition particle RNA, and 7SK RNA. We elaborate on the origin, detection method, and potential association with disease, putative functional mechanisms, and public resources for these non-coding RNAs. We aim to provide readers with a complete overview of non-coding RNAs and incite additional interest in non-coding RNA research.
SHOX gene and conserved noncoding element deletions/duplications in Colombian patients with idiopathic short stature.

PubMed

Sandoval, Gloria Tatiana Vinasco; Jaimes, Giovanna Carola; Barrios, Mauricio Coll; Cespedes, Camila; Velasco, Harvy Mauricio

2014-03-01

SHOX gene mutations or haploinsufficiency cause a wide range of phenotypes such as Leri Weill dyschondrosteosis (LWD), Turner syndrome, and disproportionate short stature (DSS). However, this gene has also been found to be mutated in cases of idiopathic short stature (ISS) with a 3-15% frequency. In this study, the multiplex ligation-dependent probe amplification (MLPA) technique was employed to determine the frequency of SHOX gene mutations and their conserved noncoding elements (CNE) in Colombian patients with ISS. Patients were referred from different centers around the county. From a sample of 62 patients, 8.1% deletions and insertions in the intragenic regions and in the CNE were found. This result is similar to others published in other countries. Moreover, an isolated case of CNE 9 duplication and a new intron 6b deletion in another patient, associated with ISS, are described. This is one of the first studies of a Latin American population in which deletions/duplications of the SHOX gene and its CNE are examined in patients with ISS.
SHOX gene and conserved noncoding element deletions/duplications in Colombian patients with idiopathic short stature

PubMed Central

Sandoval, Gloria Tatiana Vinasco; Jaimes, Giovanna Carola; Barrios, Mauricio Coll; Cespedes, Camila; Velasco, Harvy Mauricio

2014-01-01

SHOX gene mutations or haploinsufficiency cause a wide range of phenotypes such as Leri Weill dyschondrosteosis (LWD), Turner syndrome, and disproportionate short stature (DSS). However, this gene has also been found to be mutated in cases of idiopathic short stature (ISS) with a 3–15% frequency. In this study, the multiplex ligation-dependent probe amplification (MLPA) technique was employed to determine the frequency of SHOX gene mutations and their conserved noncoding elements (CNE) in Colombian patients with ISS. Patients were referred from different centers around the county. From a sample of 62 patients, 8.1% deletions and insertions in the intragenic regions and in the CNE were found. This result is similar to others published in other countries. Moreover, an isolated case of CNE 9 duplication and a new intron 6b deletion in another patient, associated with ISS, are described. This is one of the first studies of a Latin American population in which deletions/duplications of the SHOX gene and its CNE are examined in patients with ISS. PMID:24689071
Transposable Elements Are Major Contributors to the Origin, Diversification, and Regulation of Vertebrate Long Noncoding RNAs

PubMed Central

Kapusta, Aurélie; Zhuo, Xiaoyu; Ramsay, LeeAnn; Bourque, Guillaume; Yandell, Mark; Feschotte, Cédric

2013-01-01

Advances in vertebrate genomics have uncovered thousands of loci encoding long noncoding RNAs (lncRNAs). While progress has been made in elucidating the regulatory functions of lncRNAs, little is known about their origins and evolution. Here we explore the contribution of transposable elements (TEs) to the makeup and regulation of lncRNAs in human, mouse, and zebrafish. Surprisingly, TEs occur in more than two thirds of mature lncRNA transcripts and account for a substantial portion of total lncRNA sequence (∼30% in human), whereas they seldom occur in protein-coding transcripts. While TEs contribute less to lncRNA exons than expected, several TE families are strongly enriched in lncRNAs. There is also substantial interspecific variation in the coverage and types of TEs embedded in lncRNAs, partially reflecting differences in the TE landscapes of the genomes surveyed. In human, TE sequences in lncRNAs evolve under greater evolutionary constraint than their non–TE sequences, than their intronic TEs, or than random DNA. Consistent with functional constraint, we found that TEs contribute signals essential for the biogenesis of many lncRNAs, including ∼30,000 unique sites for transcription initiation, splicing, or polyadenylation in human. In addition, we identified ∼35,000 TEs marked as open chromatin located within 10 kb upstream of lncRNA genes. The density of these marks in one cell type correlate with elevated expression of the downstream lncRNA in the same cell type, suggesting that these TEs contribute to cis-regulation. These global trends are recapitulated in several lncRNAs with established functions. Finally a subset of TEs embedded in lncRNAs are subject to RNA editing and predicted to form secondary structures likely important for function. In conclusion, TEs are nearly ubiquitous in lncRNAs and have played an important role in the lineage-specific diversification of vertebrate lncRNA repertoires. PMID:23637635
Complex Interplay among DNA Modification, Noncoding RNA Expression and Protein-Coding RNA Expression in Salvia miltiorrhiza Chloroplast Genome

PubMed Central

Chen, Haimei; Zhang, Jianhui; Yuan, George; Liu, Chang

2014-01-01

Salvia miltiorrhiza is one of the most widely used medicinal plants. As a first step to develop a chloroplast-based genetic engineering method for the over-production of active components from S. miltiorrhiza, we have analyzed the genome, transcriptome, and base modifications of the S. miltiorrhiza chloroplast. Total genomic DNA and RNA were extracted from fresh leaves and then subjected to strand-specific RNA-Seq and Single-Molecule Real-Time (SMRT) sequencing analyses. Mapping the RNA-Seq reads to the genome assembly allowed us to determine the relative expression levels of 80 protein-coding genes. In addition, we identified 19 polycistronic transcription units and 136 putative antisense and intergenic noncoding RNA (ncRNA) genes. Comparison of the abundance of protein-coding transcripts (cRNA) with and without overlapping antisense ncRNAs (asRNA) suggest that the presence of asRNA is associated with increased cRNA abundance (p<0.05). Using the SMRT Portal software (v1.3.2), 2687 potential DNA modification sites and two potential DNA modification motifs were predicted. The two motifs include a TATA box–like motif (CPGDMM1, “TATANNNATNA”), and an unknown motif (CPGDMM2 “WNYANTGAW”). Specifically, 35 of the 97 CPGDMM1 motifs (36.1%) and 91 of the 369 CPGDMM2 motifs (24.7%) were found to be significantly modified (p<0.01). Analysis of genes downstream of the CPGDMM1 motif revealed the significantly increased abundance of ncRNA genes that are less than 400 bp away from the significantly modified CPGDMM1motif (p<0.01). Taking together, the present study revealed a complex interplay among DNA modifications, ncRNA and cRNA expression in chloroplast genome. PMID:24914614
Noncoding copy-number variations are associated with congenital limb malformation.

PubMed

Flöttmann, Ricarda; Kragesteen, Bjørt K; Geuer, Sinje; Socha, Magdalena; Allou, Lila; Sowińska-Seidler, Anna; Bosquillon de Jarcy, Laure; Wagner, Johannes; Jamsheer, Aleksander; Oehl-Jaschkowitz, Barbara; Wittler, Lars; de Silva, Deepthi; Kurth, Ingo; Maya, Idit; Santos-Simarro, Fernando; Hülsemann, Wiebke; Klopocki, Eva; Mountford, Roger; Fryer, Alan; Borck, Guntram; Horn, Denise; Lapunzina, Pablo; Wilson, Meredith; Mascrez, Bénédicte; Duboule, Denis; Mundlos, Stefan; Spielmann, Malte

2017-10-12

PurposeCopy-number variants (CNVs) are generally interpreted by linking the effects of gene dosage with phenotypes. The clinical interpretation of noncoding CNVs remains challenging. We investigated the percentage of disease-associated CNVs in patients with congenital limb malformations that affect noncoding cis-regulatory sequences versus genes sensitive to gene dosage effects.MethodsWe applied high-resolution copy-number analysis to 340 unrelated individuals with isolated limb malformation. To investigate novel candidate CNVs, we re-engineered human CNVs in mice using clustered regularly interspaced short palindromic repeats (CRISPR)-based genome editing.ResultsOf the individuals studied, 10% harbored CNVs segregating with the phenotype in the affected families. We identified 31 CNVs previously associated with congenital limb malformations and four novel candidate CNVs. Most of the disease-associated CNVs (57%) affected the noncoding cis-regulatory genome, while only 43% included a known disease gene and were likely to result from gene dosage effects. In transgenic mice harboring four novel candidate CNVs, we observed altered gene expression in all cases, indicating that the CNVs had a regulatory effect either by changing the enhancer dosage or altering the topological associating domain architecture of the genome.ConclusionOur findings suggest that CNVs affecting noncoding regulatory elements are a major cause of congenital limb malformations.Genetics in Medicine advance online publication, 12 October 2017; doi:10.1038/gim.2017.154.
Transcription and DNA Damage: Holding Hands or Crossing Swords?

PubMed

D'Alessandro, Giuseppina; d'Adda di Fagagna, Fabrizio

2017-10-27

Transcription has classically been considered a potential threat to genome integrity. Collision between transcription and DNA replication machinery, and retention of DNA:RNA hybrids, may result in genome instability. On the other hand, it has been proposed that active genes repair faster and preferentially via homologous recombination. Moreover, while canonical transcription is inhibited in the proximity of DNA double-strand breaks, a growing body of evidence supports active non-canonical transcription at DNA damage sites. Small non-coding RNAs accumulate at DNA double-strand break sites in mammals and other organisms, and are involved in DNA damage signaling and repair. Furthermore, RNA binding proteins are recruited to DNA damage sites and participate in the DNA damage response. Here, we discuss the impact of transcription on genome stability, the role of RNA binding proteins at DNA damage sites, and the function of small non-coding RNAs generated upon damage in the signaling and repair of DNA lesions. Copyright © 2016 Elsevier Ltd. All rights reserved.
Non-coding stem-bulge RNAs are required for cell proliferation and embryonic development in C. elegans

PubMed Central

Kowalski, Madzia P.; Baylis, Howard A.; Krude, Torsten

2015-01-01

ABSTRACT Stem bulge RNAs (sbRNAs) are a family of small non-coding stem-loop RNAs present in Caenorhabditis elegans and other nematodes, the function of which is unknown. Here, we report the first functional characterisation of nematode sbRNAs. We demonstrate that sbRNAs from a range of nematode species are able to reconstitute the initiation of chromosomal DNA replication in the presence of replication proteins in vitro, and that conserved nucleotide sequence motifs are essential for this function. By functionally inactivating sbRNAs with antisense morpholino oligonucleotides, we show that sbRNAs are required for S phase progression, early embryonic development and the viability of C. elegans in vivo. Thus, we demonstrate a new and essential role for sbRNAs during the early development of C. elegans. sbRNAs show limited nucleotide sequence similarity to vertebrate Y RNAs, which are also essential for the initiation of DNA replication. Our results therefore establish that the essential function of small non-coding stem-loop RNAs during DNA replication extends beyond vertebrates. PMID:25908866
Rat L (long interspersed repeated DNA) elements contain guanine-rich homopurine sequences that induce unpairing of contiguous duplex DNA.

PubMed Central

Usdin, K; Furano, A V

1988-01-01

The L family (long interspersed repeated DNA) of mobile genetic elements is a persistent feature of the mammalian genome. In rats, this family contains approximately equal to 40,000 members and accounts for approximately equal to 10% of the haploid genome. We demonstrate here that the guanine-rich homopurine stretches located at the right end of L-DNA induce oligonucleotide uptake by contiguous duplex DNA. The uptake is dependent on negative supercoiling and the length of the homopurine stretch and occurs even when the L-DNA homopurine stretches are introduced into a different DNA environment. The bound oligomer primes DNA synthesis when DNA polymerase and deoxyribonucleoside triphosphates are added, resulting in a faithful copy of the template to which the oligonucleotide had bound. The implications of this property of the L-DNA guanine-rich homopurine stretches in the amplification, recombination, and dispersal of L elements is discussed. Images PMID:2837766
Noncoding somatic and inherited single-nucleotide variants converge to promote ESR1 expression in breast cancer

PubMed Central

Bailey, Swneke D.; Desai, Kinjal; Kron, Ken J.; Mazrooei, Parisa; Sinnott-Armstrong, Nicholas A.; Treloar, Aislinn E.; Dowar, Mark; Thu, Kelsie L.; Cescon, David W.; Silvester, Jennifer; Yang, S. Y. Cindy; Wu, Xue; Pezo, Rossanna C.; Haibe-Kains, Benjamin; Mak, Tak W.; Bedard, Philippe L.; Pugh, Trevor J.; Sallari, Richard C.; Lupien, Mathieu

2016-01-01

Sustained expression of the oestrogen receptor alpha (ESR1) drives two-thirds of breast cancer and defines the ESR1-positive subtype. ESR1 engages enhancers upon oestrogen stimulation to establish an oncogenic expression program1. Somatic copy number alterations involving the ESR1 gene occur in approximately 1% of ESR1-positive breast cancers2–5, implying that other mechanisms underlie the persistent expression of ESR1. We report the significant enrichment of somatic mutations within the set of regulatory elements (SRE) regulating ESR1 in 7% of ESR1-positive breast cancers. These mutations regulate ESR1 expression by modulating transcription factor binding to the DNA. The SRE includes a recurrently mutated enhancer whose activity is also affected by a functional inherited single nucleotide variant (SNV) rs9383590 that accounts for several breast cancer risk-loci. Our work highlights the importance of considering the combinatorial activity of regulatory elements as a single unit to delineate the impact of noncoding genetic alterations on single genes in cancer. PMID:27571262
Genome-wide colonization of gene regulatory elements by G4 DNA motifs

PubMed Central

Du, Zhuo; Zhao, Yiqiang; Li, Ning

2009-01-01

G-quadruplex (or G4 DNA), a stable four-stranded structure found in guanine-rich regions, is implicated in the transcriptional regulation of genes involved in growth and development. Previous studies on the role of G4 DNA in gene regulation mostly focused on genomic regions proximal to transcription start sites (TSSs). To gain a more comprehensive understanding of the regulatory role of G4 DNA, we examined the landscape of potential G4 DNA (PG4Ms) motifs in the human genome and found that G4 motifs, not restricted to those found in the TSS-proximal regions, are bias toward gene-associated regions. Significantly, analyses of G4 motifs in seven types of well-known gene regulatory elements revealed a constitutive enrichment pattern and the clusters of G4 motifs tend to be colocalized with regulatory elements. Considering our analysis from a genome evolutionary perspective, we found evidence that the occurrence and accumulation of certain progenitors and canonical G4 DNA motifs within regulatory regions were progressively favored by natural selection. Our results suggest that G4 DNA motifs are ‘colonized’ in regulatory regions, supporting a likely genome-wide role of G4 DNA in gene regulation. We hypothesize that G4 DNA is a regulatory apparatus situated in regulatory elements, acting as a molecular switch that can modulate the role of the host functional regions, by transition in DNA structure. PMID:19759215
Transcriptional role of androgen receptor in the expression of long non-coding RNA Sox2OT in neurogenesis

PubMed Central

Tosetti, Valentina; Sassone, Jenny; Ferri, Anna L. M.; Taiana, Michela; Bedini, Gloria; Nava, Sara; Brenna, Greta; Di Resta, Chiara; Pareyson, Davide; Di Giulio, Anna Maria; Carelli, Stephana

2017-01-01

The complex architecture of adult brain derives from tightly regulated migration and differentiation of precursor cells generated during embryonic neurogenesis. Changes at transcriptional level of genes that regulate migration and differentiation may lead to neurodevelopmental disorders. Androgen receptor (AR) is a transcription factor that is already expressed during early embryonic days. However, AR role in the regulation of gene expression at early embryonic stage is yet to be determinate. Long non-coding RNA (lncRNA) Sox2 overlapping transcript (Sox2OT) plays a crucial role in gene expression control during development but its transcriptional regulation is still to be clearly defined. Here, using Bicalutamide in order to pharmacologically inactivated AR, we investigated whether AR participates in the regulation of the transcription of the lncRNASox2OTat early embryonic stage. We identified a new DNA binding region upstream of Sox2 locus containing three androgen response elements (ARE), and found that AR binds such a sequence in embryonic neural stem cells and in mouse embryonic brain. Our data suggest that through this binding, AR can promote the RNA polymerase II dependent transcription of Sox2OT. Our findings also suggest that AR participates in embryonic neurogenesis through transcriptional control of the long non-coding RNA Sox2OT. PMID:28704421
Transcriptional role of androgen receptor in the expression of long non-coding RNA Sox2OT in neurogenesis.

PubMed

Tosetti, Valentina; Sassone, Jenny; Ferri, Anna L M; Taiana, Michela; Bedini, Gloria; Nava, Sara; Brenna, Greta; Di Resta, Chiara; Pareyson, Davide; Di Giulio, Anna Maria; Carelli, Stephana; Parati, Eugenio A; Gorio, Alfredo

2017-01-01

The complex architecture of adult brain derives from tightly regulated migration and differentiation of precursor cells generated during embryonic neurogenesis. Changes at transcriptional level of genes that regulate migration and differentiation may lead to neurodevelopmental disorders. Androgen receptor (AR) is a transcription factor that is already expressed during early embryonic days. However, AR role in the regulation of gene expression at early embryonic stage is yet to be determinate. Long non-coding RNA (lncRNA) Sox2 overlapping transcript (Sox2OT) plays a crucial role in gene expression control during development but its transcriptional regulation is still to be clearly defined. Here, using Bicalutamide in order to pharmacologically inactivated AR, we investigated whether AR participates in the regulation of the transcription of the lncRNASox2OTat early embryonic stage. We identified a new DNA binding region upstream of Sox2 locus containing three androgen response elements (ARE), and found that AR binds such a sequence in embryonic neural stem cells and in mouse embryonic brain. Our data suggest that through this binding, AR can promote the RNA polymerase II dependent transcription of Sox2OT. Our findings also suggest that AR participates in embryonic neurogenesis through transcriptional control of the long non-coding RNA Sox2OT.
The emergence of noncoding RNAs as Heracles in autophagy.

PubMed

Zhang, Jian; Wang, Peiyuan; Wan, Lin; Xu, Shouping; Pang, Da

2017-06-03

Macroautophagy/autophagy is a catabolic process that is widely found in nature. Over the past few decades, mounting evidence has indicated that noncoding RNAs, ranging from small noncoding RNAs to long noncoding RNAs (lncRNAs) and even circular RNAs (circRNAs), mediate the transcriptional and post-transcriptional regulation of autophagy-related genes by participating in autophagy regulatory networks. The differential expression of noncoding RNAs affects autophagy levels at different physiological and pathological stages, including embryonic proliferation and differentiation, cellular senescence, and even diseases such as cancer. We summarize the current knowledge regarding noncoding RNA dysregulation in autophagy and investigate the molecular regulatory mechanisms underlying noncoding RNA involvement in autophagy regulatory networks. Then, we integrate public resources to predict autophagy-related noncoding RNAs across species and discuss strategies for and the challenges of identifying autophagy-related noncoding RNAs. This article will deepen our understanding of the relationship between noncoding RNAs and autophagy, and provide new insights to specifically target noncoding RNAs in autophagy-associated therapeutic strategies.

Transcription profiling suggests that mitochondrial topoisomerase IB acts as a topological barrier and regulator of mitochondrial DNA transcription.

PubMed

Dalla Rosa, Ilaria; Zhang, Hongliang; Khiati, Salim; Wu, Xiaolin; Pommier, Yves

2017-12-08

Mitochondrial DNA (mtDNA) is essential for cell viability because it encodes subunits of the respiratory chain complexes. Mitochondrial topoisomerase IB (TOP1MT) facilitates mtDNA replication by removing DNA topological tensions produced during mtDNA transcription, but it appears to be dispensable. To test whether cells lacking TOP1MT have aberrant mtDNA transcription, we performed mitochondrial transcriptome profiling. To that end, we designed and implemented a customized tiling array, which enabled genome-wide, strand-specific, and simultaneous detection of all mitochondrial transcripts. Our technique revealed that Top1mt KO mouse cells process the mitochondrial transcripts normally but that protein-coding mitochondrial transcripts are elevated. Moreover, we found discrete long noncoding RNAs produced by H-strand transcription and encompassing the noncoding regulatory region of mtDNA in human and murine cells and tissues. Of note, these noncoding RNAs were strongly up-regulated in the absence of TOP1MT. In contrast, 7S DNA, produced by mtDNA replication, was reduced in the Top1mt KO cells. We propose that the long noncoding RNA species in the D-loop region are generated by the extension of H-strand transcripts beyond their canonical stop site and that TOP1MT acts as a topological barrier and regulator for mtDNA transcription and D-loop formation.
Primate-specific evolution of noncoding element insertion into PLA2G4C and human preterm birth

PubMed Central

2010-01-01

Background The onset of birth in humans, like other apes, differs from non-primate mammals in its endocrine physiology. We hypothesize that higher primate-specific gene evolution may lead to these differences and target genes involved in human preterm birth, an area of global health significance. Methods We performed a comparative genomics screen of highly conserved noncoding elements and identified PLA2G4C, a phospholipase A isoform involved in prostaglandin biosynthesis as human accelerated. To examine whether this gene demonstrating primate-specific evolution was associated with birth timing, we genotyped and analyzed 8 common single nucleotide polymorphisms (SNPs) in PLA2G4C in US Hispanic (n = 73 preterm, 292 control), US White (n = 147 preterm, 157 control) and US Black (n = 79 preterm, 166 control) mothers. Results Detailed structural and phylogenic analysis of PLA2G4C suggested a short genomic element within the gene duplicated from a paralogous highly conserved element on chromosome 1 specifically in primates. SNPs rs8110925 and rs2307276 in US Hispanics and rs11564620 in US Whites were significant after correcting for multiple tests (p < 0.006). Additionally, rs11564620 (Thr360Pro) was associated with increased metabolite levels of the prostaglandin thromboxane in healthy individuals (p = 0.02), suggesting this variant may affect PLA2G4C activity. Conclusions Our findings suggest that variation in PLA2G4C may influence preterm birth risk by increasing levels of prostaglandins, which are known to regulate labor. PMID:21184677
DNA preservation in skeletal elements from the World Trade Center disaster: recommendations for mass fatality management.

PubMed

Mundorff, Amy Z; Bartelink, Eric J; Mar-Cash, Elaine

2009-07-01

The World Trade Center (WTC) victim identification effort highlights taphonomic influences on the degradation of DNA from victims of mass fatality incidents. This study uses a subset of the WTC-Human Remains Database to evaluate differential preservation of DNA by skeletal element. Recovery location, sex, and victim type (civilian, firefighter, or plane passenger) do not appear to influence DNA preservation. Results indicate that more intact elements, as well as elements encased in soft tissue, produced slightly higher identification rates than more fragmented remains. DNA identification rates by element type conform to previous findings, with higher rates generally found in denser, weight-bearing bones. However, smaller bones including patellae, metatarsals, and foot phalanges yielded rates comparable to both femora and tibiae. These elements can be easily sampled with a disposable scalpel, and thus reduce potential DNA contamination. These findings have implications for DNA sampling guidelines in future mass fatality incidents.
Useful DNA polymorphisms are identified by snapback, a midrepetitive element in Tribolium castaneum.

PubMed

Stuart, J J; De Gortari, M J; Hall, P S; Maxwell, M E; Mocelin, G; Brown, S J; Muir, W M

1996-06-01

The red flour bettle, Tribolium castaneum, is both a pest of stored grain products and an important experimental organism. To improve its facility as a genetic model, we are developing DNA fingerprinting methods for this insect. A Tribolium DNA fragment, snapback-1 (SBI), identified among sequences that reassociate before a Cot of 0.03 mol.s/L, was found to produce a banding pattern in restriction endonuclease digested genomic DNA that is characteristic of a midrepetitive element. DNA fingerprints of individual beetles demonstrated that unvarying inherited DNA polymorphism is revealed, and that polymorphism is inherited in a dominant Mendelian fashion. Linkage between bands was minimal. The sequence of SBI was determined, and hybridization experiments indicated that SBI is a fragment of a larger midrepetitive element. Fingerprinting individuals with known inbreeding coefficients indicated that SBI loci have relatively high mutation rates. The possibility that SBI is a fragment of a transposable element is discussed.
Artificial Intelligence, DNA Mimicry, and Human Health.

PubMed

Stefano, George B; Kream, Richard M

2017-08-14

The molecular evolution of genomic DNA across diverse plant and animal phyla involved dynamic registrations of sequence modifications to maintain existential homeostasis to increasingly complex patterns of environmental stressors. As an essential corollary, driver effects of positive evolutionary pressure are hypothesized to effect concerted modifications of genomic DNA sequences to meet expanded platforms of regulatory controls for successful implementation of advanced physiological requirements. It is also clearly apparent that preservation of updated registries of advantageous modifications of genomic DNA sequences requires coordinate expansion of convergent cellular proofreading/error correction mechanisms that are encoded by reciprocally modified genomic DNA. Computational expansion of operationally defined DNA memory extends to coordinate modification of coding and previously under-emphasized noncoding regions that now appear to represent essential reservoirs of untapped genetic information amenable to evolutionary driven recruitment into the realm of biologically active domains. Additionally, expansion of DNA memory potential via chemical modification and activation of noncoding sequences is targeted to vertical augmentation and integration of an expanded cadre of transcriptional and epigenetic regulatory factors affecting linear coding of protein amino acid sequences within open reading frames.
Transcriptional activation of short interspersed elements by DNA-damaging agents.

PubMed

Rudin, C M; Thompson, C B

2001-01-01

Short interspersed elements (SINEs), typified by the human Alu repeat, are RNA polymerase III (pol III)-transcribed sequences that replicate within the genome through an RNA intermediate. Replication of SINEs has been extensive in mammalian evolution: an estimated 5% of the human genome consists of Alu repeats. The mechanisms regulating transcription, reverse transcription, and reinsertion of SINE elements in genomic DNA are poorly understood. Here we report that expression of murine SINE transcripts of both the B1 and B2 classes is strongly upregulated after prolonged exposure to cisplatin, etoposide, or gamma radiation. A similar induction of Alu transcripts in human cells occurs under these conditions. This induction is not due to a general upregulation of pol III activity in either species. Genotoxic treatment of murine cells containing an exogenous human Alu element induced Alu transcription. Concomitant with the increased expression of SINEs, an increase in cellular reverse transcriptase was observed after exposure to these same DNA-damaging agents. These findings suggest that genomic damage may be an important activator of SINEs, and that SINE mobility may contribute to secondary malignancy after exposure to DNA-damaging chemotherapy.
Transposable Elements in Human Cancer: Causes and Consequences of Deregulation.

PubMed

Anwar, Sumadi Lukman; Wulaningsih, Wahyu; Lehmann, Ulrich

2017-05-04

Transposable elements (TEs) comprise nearly half of the human genome and play an essential role in the maintenance of genomic stability, chromosomal architecture, and transcriptional regulation. TEs are repetitive sequences consisting of RNA transposons, DNA transposons, and endogenous retroviruses that can invade the human genome with a substantial contribution in human evolution and genomic diversity. TEs are therefore firmly regulated from early embryonic development and during the entire course of human life by epigenetic mechanisms, in particular DNA methylation and histone modifications. The deregulation of TEs has been reported in some developmental diseases, as well as for different types of human cancers. To date, the role of TEs, the mechanisms underlying TE reactivation, and the interplay with DNA methylation in human cancers remain largely unexplained. We reviewed the loss of epigenetic regulation and subsequent genomic instability, chromosomal aberrations, transcriptional deregulation, oncogenic activation, and aberrations of non-coding RNAs as the potential mechanisms underlying TE deregulation in human cancers.
Transposable Elements in Human Cancer: Causes and Consequences of Deregulation

PubMed Central

Anwar, Sumadi Lukman; Wulaningsih, Wahyu; Lehmann, Ulrich

2017-01-01

Transposable elements (TEs) comprise nearly half of the human genome and play an essential role in the maintenance of genomic stability, chromosomal architecture, and transcriptional regulation. TEs are repetitive sequences consisting of RNA transposons, DNA transposons, and endogenous retroviruses that can invade the human genome with a substantial contribution in human evolution and genomic diversity. TEs are therefore firmly regulated from early embryonic development and during the entire course of human life by epigenetic mechanisms, in particular DNA methylation and histone modifications. The deregulation of TEs has been reported in some developmental diseases, as well as for different types of human cancers. To date, the role of TEs, the mechanisms underlying TE reactivation, and the interplay with DNA methylation in human cancers remain largely unexplained. We reviewed the loss of epigenetic regulation and subsequent genomic instability, chromosomal aberrations, transcriptional deregulation, oncogenic activation, and aberrations of non-coding RNAs as the potential mechanisms underlying TE deregulation in human cancers. PMID:28471386
The expanding universe of noncoding RNAs.

PubMed

Hannon, G J; Rivas, F V; Murchison, E P; Steitz, J A

2006-01-01

The 71st Cold Spring Harbor Symposium on Quantitative Biology celebrated the numerous and expanding roles of regulatory RNAs in systems ranging from bacteria to mammals. It was clearly evident that noncoding RNAs are undergoing a renaissance, with reports of their involvement in nearly every cellular process. Previously known classes of longer noncoding RNAs were shown to function by every possible means-acting catalytically, sensing physiological states through adoption of complex secondary and tertiary structures, or using their primary sequences for recognition of target sites. The many recently discovered classes of small noncoding RNAs, generally less than 35 nucleotides in length, most often exert their effects by guiding regulatory complexes to targets via base-pairing. With the ability to analyze the RNA products of the genome in ever greater depth, it has become clear that the universe of noncoding RNAs may extend far beyond the boundaries we had previously imagined. Thus, as much as the Symposium highlighted exciting progress in the field, it also revealed how much farther we must go to understand fully the biological impact of noncoding RNAs.
Small RNA-Mediated trans-Nuclear and trans-Element Communications in Tetrahymena DNA Elimination.

PubMed

Noto, Tomoko; Mochizuki, Kazufumi

2018-06-18

Epigenetic inheritance of acquired traits is widespread among eukaryotes, but how and to what extent such information is transgenerationally inherited is still unclear. The patterns of programmed DNA elimination in ciliates are epigenetically and transgenerationally inherited, and it has been proposed that small RNAs, which shuttle between the germline and the soma, regulate this epigenetic inheritance. In this study, we test the existence and role of such small-RNA-mediated communication by epigenetically disturbing the pattern of DNA elimination in Tetrahymena. We show that the pattern of DNA elimination is, indeed, determined by the selective turnover of small RNAs, which is induced by the interaction between germline-derived small RNAs and the somatic genome. In addition, we show that DNA elimination of an element is regulated by small-RNA-mediated communication with other eliminated elements. By contrast, no evidence obtained thus far supports the notion that transfer of epigenetic information from the soma to the germline, if any, regulates DNA elimination. Our results indicate that small-RNA-mediated trans-nuclear and trans-element communication, in addition to unknown information in the germline genome, contributes to determining the pattern of DNA elimination. Copyright © 2018 Elsevier Ltd. All rights reserved.
Surveying DNA Elements within Functional Genes of Heterocyst-Forming Cyanobacteria

PubMed Central

Hilton, Jason A.; Meeks, John C.; Zehr, Jonathan P.

2016-01-01

Some cyanobacteria are capable of differentiating a variety of cell types in response to environmental factors. For instance, in low nitrogen conditions, some cyanobacteria form heterocysts, which are specialized for N2 fixation. Many heterocyst-forming cyanobacteria have DNA elements interrupting key N2 fixation genes, elements that are excised during heterocyst differentiation. While the mechanism for the excision of the element has been well-studied, many questions remain regarding the introduction of the elements into the cyanobacterial lineage and whether they have been retained ever since or have been lost and reintroduced. To examine the evolutionary relationships and possible function of DNA sequences that interrupt genes of heterocyst-forming cyanobacteria, we identified and compared 101 interruption element sequences within genes from 38 heterocyst-forming cyanobacterial genomes. The interruption element lengths ranged from about 1 kb (the minimum able to encode the recombinase responsible for element excision), up to nearly 1 Mb. The recombinase gene sequences served as genetic markers that were common across the interruption elements and were used to track element evolution. Elements were found that interrupted 22 different orthologs, only five of which had been previously observed to be interrupted by an element. Most of the newly identified interrupted orthologs encode proteins that have been shown to have heterocyst-specific activity. However, the presence of interruption elements within genes with no known role in N2 fixation, as well as in three non-heterocyst-forming cyanobacteria, indicates that the processes that trigger the excision of elements may not be limited to heterocyst development or that the elements move randomly within genomes. This comprehensive analysis provides the framework to study the history and behavior of these unique sequences, and offers new insight regarding the frequency and persistence of interruption elements in
Surveying DNA Elements within Functional Genes of Heterocyst-Forming Cyanobacteria.

PubMed

Hilton, Jason A; Meeks, John C; Zehr, Jonathan P

2016-01-01

Some cyanobacteria are capable of differentiating a variety of cell types in response to environmental factors. For instance, in low nitrogen conditions, some cyanobacteria form heterocysts, which are specialized for N2 fixation. Many heterocyst-forming cyanobacteria have DNA elements interrupting key N2 fixation genes, elements that are excised during heterocyst differentiation. While the mechanism for the excision of the element has been well-studied, many questions remain regarding the introduction of the elements into the cyanobacterial lineage and whether they have been retained ever since or have been lost and reintroduced. To examine the evolutionary relationships and possible function of DNA sequences that interrupt genes of heterocyst-forming cyanobacteria, we identified and compared 101 interruption element sequences within genes from 38 heterocyst-forming cyanobacterial genomes. The interruption element lengths ranged from about 1 kb (the minimum able to encode the recombinase responsible for element excision), up to nearly 1 Mb. The recombinase gene sequences served as genetic markers that were common across the interruption elements and were used to track element evolution. Elements were found that interrupted 22 different orthologs, only five of which had been previously observed to be interrupted by an element. Most of the newly identified interrupted orthologs encode proteins that have been shown to have heterocyst-specific activity. However, the presence of interruption elements within genes with no known role in N2 fixation, as well as in three non-heterocyst-forming cyanobacteria, indicates that the processes that trigger the excision of elements may not be limited to heterocyst development or that the elements move randomly within genomes. This comprehensive analysis provides the framework to study the history and behavior of these unique sequences, and offers new insight regarding the frequency and persistence of interruption elements in
Conserved expression of transposon-derived non-coding transcripts in primate stem cells.

PubMed

Ramsay, LeeAnn; Marchetto, Maria C; Caron, Maxime; Chen, Shu-Huang; Busche, Stephan; Kwan, Tony; Pastinen, Tomi; Gage, Fred H; Bourque, Guillaume

2017-02-28

A significant portion of expressed non-coding RNAs in human cells is derived from transposable elements (TEs). Moreover, it has been shown that various long non-coding RNAs (lncRNAs), which come from the human endogenous retrovirus subfamily H (HERVH), are not only expressed but required for pluripotency in human embryonic stem cells (hESCs). To identify additional TE-derived functional non-coding transcripts, we generated RNA-seq data from induced pluripotent stem cells (iPSCs) of four primate species (human, chimpanzee, gorilla, and rhesus) and searched for transcripts whose expression was conserved. We observed that about 30% of TE instances expressed in human iPSCs had orthologous TE instances that were also expressed in chimpanzee and gorilla. Notably, our analysis revealed a number of repeat families with highly conserved expression profiles including HERVH but also MER53, which is known to be the source of a placental-specific family of microRNAs (miRNAs). We also identified a number of repeat families from all classes of TEs, including MLT1-type and Tigger families, that contributed a significant amount of sequence to primate lncRNAs whose expression was conserved. Together, these results describe TE families and TE-derived lncRNAs whose conserved expression patterns can be used to identify what are likely functional TE-derived non-coding transcripts in primate iPSCs.
The primary structures of two yeast enolase genes. Homology between the 5' noncoding flanking regions of yeast enolase and glyceraldehyde-3-phosphate dehydrogenase genes.

PubMed

Holland, M J; Holland, J P; Thill, G P; Jackson, K A

1981-02-10

Segments of yeast genomic DNA containing two enolase structural genes have been isolated by subculture cloning procedures using a cDNA hybridization probe synthesized from purified yeast enolase mRNA. Based on restriction endonuclease and transcriptional maps of these two segments of yeast DNA, each hybrid plasmid contains a region of extensive nucleotide sequence homology which forms hybrids with the cDNA probe. The DNA sequences which flank this homologous region in the two hybrid plasmids are nonhomologous indicating that these sequences are nontandemly repeated in the yeast genome. The complete nucleotide sequence of the coding as well as the flanking noncoding regions of these genes has been determined. The amino acid sequence predicted from one reading frame of both structural genes is extremely similar to that determined for yeast enolase (Chin, C. C. Q., Brewer, J. M., Eckard, E., and Wold, F. (1981) J. Biol. Chem. 256, 1370-1376), confirming that these isolated structural genes encode yeast enolase. The nucleotide sequences of the coding regions of the genes are approximately 95% homologous, and neither gene contains an intervening sequence. Codon utilization in the enolase genes follows the same biased pattern previously described for two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes (Holland, J. P., and Holland, M. J. (1980) J. Biol. Chem. 255, 2596-2605). DNA blotting analysis confirmed that the isolated segments of yeast DNA are colinear with yeast genomic DNA and that there are two nontandemly repeated enolase genes per haploid yeast genome. The noncoding portions of the two enolase genes adjacent to the initiation and termination codons are approximately 70% homologous and contain sequences thought to be involved in the synthesis and processing messenger RNA. Finally there are regions of extensive homology between the two enolase structural genes and two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes within the 5
Extraordinary Structured Noncoding RNAs Revealed by Bacterial Metagenome Analysis

PubMed Central

Weinberg, Zasha; Perreault, Jonathan; Meyer, Michelle M.; Breaker, Ronald R.

2012-01-01

Estimates of the total number of bacterial species1-3 suggest that existing DNA sequence databases carry only a tiny fraction of the total amount of DNA sequence space represented by this division of life. Indeed, environmental DNA samples have been shown to encode many previously unknown classes of proteins4 and RNAs5. Bioinformatics searches6-10 of genomic DNA from bacteria commonly identify novel noncoding RNAs (ncRNAs)10-12 such as riboswitches13,14. In rare instances, RNAs that exhibit more extensive sequence and structural conservation across a wide range of bacteria are encountered15,16. Given that large structured RNAs are known to carry out complex biochemical functions such as protein synthesis and RNA processing reactions, identifying more RNAs of great size and intricate structure is likely to reveal additional biochemical functions that can be achieved by RNA. We applied an updated computational pipeline17 to discover ncRNAs that rival the known large ribozymes in size and structural complexity or that are among the most abundant RNAs in bacteria that encode them. These RNAs would have been difficult or impossible to detect without examining environmental DNA sequences, suggesting that numerous RNAs with extraordinary size, structural complexity, or other exceptional characteristics remain to be discovered in unexplored sequence space. PMID:19956260
Long noncoding RNAs responsive to Fusarium oxysporum infection in Arabidopsis thaliana.

PubMed

Zhu, Qian-Hao; Stephen, Stuart; Taylor, Jennifer; Helliwell, Chris A; Wang, Ming-Bo

2014-01-01

Short noncoding RNAs have been demonstrated to play important roles in regulation of gene expression and stress responses, but the repertoire and functions of long noncoding RNAs (lncRNAs) remain largely unexplored, particularly in plants. To explore the role of lncRNAs in disease resistance, we used a strand-specific RNA-sequencing approach to identify lncRNAs responsive to Fusarium oxysporum infection in Arabidopsis thaliana. Antisense transcription was found in c. 20% of the annotated A. thaliana genes. Several noncoding natural antisense transcripts responsive to F. oxysporum infection were found in genes implicated in disease defense. While the majority of the novel transcriptionally active regions (TARs) were adjacent to annotated genes and could be an extension of the annotated transcripts, 159 novel intergenic TARs, including 20 F. oxysporum-responsive lncTARs, were identified. Ten F. oxysporum-induced lncTARs were functionally characterized using T-DNA insertion or RNA-interference knockdown lines, and five were demonstrated to be related to disease development. Promoter analysis suggests that some of the F. oxysporum-induced lncTARs are direct targets of transcription factor(s) responsive to pathogen attack. Our results demonstrated that strand-specific RNA sequencing is a powerful tool for uncovering hidden levels of transcriptome and that IncRNAs are important components of the antifungal networks in A. thaliana. © 2013 The Authors. New Phytologist © 2013 New Phytologist Trust.
Gene regulation by noncoding RNAs

PubMed Central

Patil, Veena S.; Zhou, Rui; Rana, Tariq M.

2015-01-01

The past two decades have seen an explosion in research on noncoding RNAs and their physiological and pathological functions. Several classes of small (20–30 nucleotides) and long (>200 nucleotides) noncoding RNAs have been firmly established as key regulators of gene expression in myriad processes ranging from embryonic development to innate immunity. In this review, we focus on our current understanding of the molecular mechanisms underlying the biogenesis and function of small interfering RNAs (siRNAs), microRNAs (miRNAs), and Piwi-interacting RNAs (piRNAs). In addition, we briefly review the relevance of small and long noncoding RNAs to human physiology and pathology and their potential to be exploited as therapeutic agents. PMID:24164576
Satellite DNA Modulates Gene Expression in the Beetle Tribolium castaneum after Heat Stress

PubMed Central

Feliciello, Isidoro; Akrap, Ivana; Ugarković, Đurđica

2015-01-01

Non-coding repetitive DNAs have been proposed to perform a gene regulatory role, however for tandemly repeated satellite DNA no such role was defined until now. Here we provide the first evidence for a role of satellite DNA in the modulation of gene expression under specific environmental conditions. The major satellite DNA TCAST1 in the beetle Tribolium castaneum is preferentially located within pericentromeric heterochromatin but is also dispersed as single repeats or short arrays in the vicinity of protein-coding genes within euchromatin. Our results show enhanced suppression of activity of TCAST1-associated genes and slower recovery of their activity after long-term heat stress relative to the same genes without associated TCAST1 satellite DNA elements. The level of gene suppression is not influenced by the distance of TCAST1 elements from the associated genes up to 40 kb from the genes’ transcription start sites, but it does depend on the copy number of TCAST1 repeats within an element, being stronger for the higher number of copies. The enhanced gene suppression correlates with the enrichment of the repressive histone marks H3K9me2/3 at dispersed TCAST1 elements and their flanking regions as well as with increased expression of TCAST1 satellite DNA. The results reveal transient, RNAi based heterochromatin formation at dispersed TCAST1 repeats and their proximal regions as a mechanism responsible for enhanced silencing of TCAST1-associated genes. Differences in the pattern of distribution of TCAST1 elements contribute to gene expression diversity among T. castaneum strains after long-term heat stress and might have an impact on adaptation to different environmental conditions. PMID:26275223
Applications of statistical physics and information theory to the analysis of DNA sequences

NASA Astrophysics Data System (ADS)

Grosse, Ivo

2000-10-01

DNA carries the genetic information of most living organisms, and the of genome projects is to uncover that genetic information. One basic task in the analysis of DNA sequences is the recognition of protein coding genes. Powerful computer programs for gene recognition have been developed, but most of them are based on statistical patterns that vary from species to species. In this thesis I address the question if there exist universal statistical patterns that are different in coding and noncoding DNA of all living species, regardless of their phylogenetic origin. In search for such species-independent patterns I study the mutual information function of genomic DNA sequences, and find that it shows persistent period-three oscillations. To understand the biological origin of the observed period-three oscillations, I compare the mutual information function of genomic DNA sequences to the mutual information function of stochastic model sequences. I find that the pseudo-exon model is able to reproduce the mutual information function of genomic DNA sequences. Moreover, I find that a generalization of the pseudo-exon model can connect the existence and the functional form of long-range correlations to the presence and the length distributions of coding and noncoding regions. Based on these theoretical studies I am able to find an information-theoretical quantity, the average mutual information (AMI), whose probability distributions are significantly different in coding and noncoding DNA, while they are almost identical in all studied species. These findings show that there exist universal statistical patterns that are different in coding and noncoding DNA of all studied species, and they suggest that the AMI may be used to identify genes in different living species, irrespective of their taxonomic origin.
Expression of Antisense Long Noncoding RNAs as Potential Regulators in Rainbow Trout with Different Tolerance to Plant-Based Diets.

PubMed

Abernathy, Jason; Overturf, Ken

2018-01-04

Reformulation of aquafeeds in salmonid diets to include more plant proteins is critical for sustainable aquaculture. However, increasing plant proteins can lead to stunted growth and enteritis. Toward an understanding of the regulatory mechanisms behind plant protein utilization, directional RNA sequencing of liver tissues from a rainbow trout strain selected for growth on an all plant-protein diet and a control strain, both fed a plant diet for 12 weeks, were utilized to construct long noncoding RNAs. Antisense long noncoding RNAs were selected for differential expression and functional analyses since they have been shown to have regulatory actions within a genome. A total of 142 unique antisense long noncoding RNAs were differentially expressed between strains, 60 of which could be mapped to a gene. Genes underlying these noncoding RNAs are indicated in lipid metabolism and immunity. Six noncoding transcripts were also found to overlap with differentially expressed protein-coding genes, all of which were co-expressed. Associating variation in regulatory elements between rainbow trout strains with differing tolerance to plant-protein diets will assist in future studies toward increased gains throughout carnivorous aquaculture.

Normalized cDNA libraries

DOEpatents

Soares, Marcelo B.; Efstratiadis, Argiris

1997-01-01

This invention provides a method to normalize a directional cDNA library constructed in a vector that allows propagation in single-stranded circle form comprising: (a) propagating the directional cDNA library in single-stranded circles; (b) generating fragments complementary to the 3' noncoding sequence of the single-stranded circles in the library to produce partial duplexes; (c) purifying the partial duplexes; (d) melting and reassociating the purified partial duplexes to moderate Cot; and (e) purifying the unassociated single-stranded circles, thereby generating a normalized cDNA library.
Normalized cDNA libraries

DOEpatents

Soares, M.B.; Efstratiadis, A.

1997-06-10

This invention provides a method to normalize a directional cDNA library constructed in a vector that allows propagation in single-stranded circle form comprising: (a) propagating the directional cDNA library in single-stranded circles; (b) generating fragments complementary to the 3{prime} noncoding sequence of the single-stranded circles in the library to produce partial duplexes; (c) purifying the partial duplexes; (d) melting and reassociating the purified partial duplexes to moderate Cot; and (e) purifying the unassociated single-stranded circles, thereby generating a normalized cDNA library. 4 figs.
Evolutionary conservation of regulatory elements in vertebrate HOX gene clusters

DOE Office of Scientific and Technical Information (OSTI.GOV)

Santini, Simona; Boore, Jeffrey L.; Meyer, Axel

2003-12-31

Due to their high degree of conservation, comparisons of DNA sequences among evolutionarily distantly-related genomes permit to identify functional regions in noncoding DNA. Hox genes are optimal candidate sequences for comparative genome analyses, because they are extremely conserved in vertebrates and occur in clusters. We aligned (Pipmaker) the nucleotide sequences of HoxA clusters of tilapia, pufferfish, striped bass, zebrafish, horn shark, human and mouse (over 500 million years of evolutionary distance). We identified several highly conserved intergenic sequences, likely to be important in gene regulation. Only a few of these putative regulatory elements have been previously described as being involvedmore » in the regulation of Hox genes, while several others are new elements that might have regulatory functions. The majority of these newly identified putative regulatory elements contain short fragments that are almost completely conserved and are identical to known binding sites for regulatory proteins (Transfac). The conserved intergenic regions located between the most rostrally expressed genes in the developing embryo are longer and better retained through evolution. We document that presumed regulatory sequences are retained differentially in either A or A clusters resulting from a genome duplication in the fish lineage. This observation supports both the hypothesis that the conserved elements are involved in gene regulation and the Duplication-Deletion-Complementation model.« less
Alterations in sperm DNA methylation, non-coding RNA expression, and histone retention mediate vinclozolin-induced epigenetic transgenerational inheritance of disease.

PubMed

Ben Maamar, Millissia; Sadler-Riggleman, Ingrid; Beck, Daniel; McBirney, Margaux; Nilsson, Eric; Klukovich, Rachel; Xie, Yeming; Tang, Chong; Yan, Wei; Skinner, Michael K

2018-04-01

Epigenetic transgenerational inheritance of disease and phenotypic variation can be induced by several toxicants, such as vinclozolin. This phenomenon can involve DNA methylation, non-coding RNA (ncRNA) and histone retention, and/or modification in the germline (e.g. sperm). These different epigenetic marks are called epimutations and can transmit in part the transgenerational phenotypes. This study was designed to investigate the vinclozolin-induced concurrent alterations of a number of different epigenetic factors, including DNA methylation, ncRNA, and histone retention in rat sperm. Gestating females (F0 generation) were exposed transiently to vinclozolin during fetal gonadal development. The directly exposed F1 generation fetus, the directly exposed germline within the fetus that will generate the F2 generation, and the transgenerational F3 generation sperm were studied. DNA methylation and ncRNA were altered in each generation rat sperm with the direct exposure F1 and F2 generations being distinct from the F3 generation epimutations. Interestingly, an increased number of differential histone retention sites were found in the F3 generation vinclozolin sperm, but not in the F1 or F2 generations. All three different epimutation types were affected in the vinclozolin lineage transgenerational sperm (F3 generation). The direct exposure generations (F1 and F2) epigenetic alterations were distinct from the transgenerational sperm epimutations. The genomic features and gene pathways associated with the epimutations were investigated to help elucidate the integration of these different epigenetic processes. Our results show that the three different types of epimutations are involved and integrated in the mediation of the epigenetic transgenerational inheritance phenomenon.
Primer-Independent DNA Synthesis by a Family B DNA Polymerase from Self-Replicating Mobile Genetic Elements.

PubMed

Redrejo-Rodríguez, Modesto; Ordóñez, Carlos D; Berjón-Otero, Mónica; Moreno-González, Juan; Aparicio-Maldonado, Cristian; Forterre, Patrick; Salas, Margarita; Krupovic, Mart

2017-11-07

Family B DNA polymerases (PolBs) play a central role during replication of viral and cellular chromosomes. Here, we report the discovery of a third major group of PolBs, which we denote primer-independent PolB (piPolB), that might be a link between the previously known protein-primed and RNA/DNA-primed PolBs. PiPolBs are encoded by highly diverse mobile genetic elements, pipolins, integrated in the genomes of diverse bacteria and also present as circular plasmids in mitochondria. Biochemical characterization showed that piPolB displays efficient DNA polymerization activity that can use undamaged and damaged templates and is endowed with proofreading and strand displacement capacities. Remarkably, the protein is also capable of template-dependent de novo DNA synthesis, i.e., DNA-priming activity, thereby breaking the long-standing dogma that replicative DNA polymerases require a pre-existing primer for DNA synthesis. We suggest that piPolBs are involved in self-replication of pipolins and may also contribute to bacterial DNA damage tolerance. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Long noncoding RNA in hematopoiesis and immunity.

PubMed

Satpathy, Ansuman T; Chang, Howard Y

2015-05-19

Dynamic gene expression during cellular differentiation is tightly coordinated by transcriptional and post-transcriptional mechanisms. An emerging theme is the central role of long noncoding RNAs (lncRNAs) in the regulation of this specificity. Recent advances demonstrate that lncRNAs are expressed in a lineage-specific manner and control the development of several cell types in the hematopoietic system. Moreover, specific lncRNAs are induced to modulate innate and adaptive immune responses. lncRNAs can function via RNA-DNA, RNA-RNA, and RNA-protein target interactions. As a result, they affect several stages of gene regulation, including chromatin modification, mRNA biogenesis, and protein signaling. We discuss recent advances, future prospects, and challenges in understanding the roles of lncRNAs in immunity and immune-mediated diseases. Copyright © 2015 Elsevier Inc. All rights reserved.
Divergent genome evolution caused by regional variation in DNA gain and loss between human and mouse

PubMed Central

Kortschak, R. Daniel

2018-01-01

The forces driving the accumulation and removal of non-coding DNA and ultimately the evolution of genome size in complex organisms are intimately linked to genome structure and organisation. Our analysis provides a novel method for capturing the regional variation of lineage-specific DNA gain and loss events in their respective genomic contexts. To further understand this connection we used comparative genomics to identify genome-wide individual DNA gain and loss events in the human and mouse genomes. Focusing on the distribution of DNA gains and losses, relationships to important structural features and potential impact on biological processes, we found that in autosomes, DNA gains and losses both followed separate lineage-specific accumulation patterns. However, in both species chromosome X was particularly enriched for DNA gain, consistent with its high L1 retrotransposon content required for X inactivation. We found that DNA loss was associated with gene-rich open chromatin regions and DNA gain events with gene-poor closed chromatin regions. Additionally, we found that DNA loss events tended to be smaller than DNA gain events suggesting that they were able to accumulate in gene-rich open chromatin regions due to their reduced capacity to interrupt gene regulatory architecture. GO term enrichment showed that mouse loss hotspots were strongly enriched for terms related to developmental processes. However, these genes were also located in regions with a high density of conserved elements, suggesting that despite high levels of DNA loss, gene regulatory architecture remained conserved. This is consistent with a model in which DNA gain and loss results in turnover or “churning” in regulatory element dense regions of open chromatin, where interruption of regulatory elements is selected against. PMID:29677183
A User's Guide to the Encyclopedia of DNA Elements (ENCODE)

PubMed Central

2011-01-01

The mission of the Encyclopedia of DNA Elements (ENCODE) Project is to enable the scientific and medical communities to interpret the human genome sequence and apply it to understand human biology and improve health. The ENCODE Consortium is integrating multiple technologies and approaches in a collective effort to discover and define the functional elements encoded in the human genome, including genes, transcripts, and transcriptional regulatory regions, together with their attendant chromatin states and DNA methylation patterns. In the process, standards to ensure high-quality data have been implemented, and novel algorithms have been developed to facilitate analysis. Data and derived results are made available through a freely accessible database. Here we provide an overview of the project and the resources it is generating and illustrate the application of ENCODE data to interpret the human genome. PMID:21526222
The non-coding RNA landscape of human hematopoiesis and leukemia.

PubMed

Schwarzer, Adrian; Emmrich, Stephan; Schmidt, Franziska; Beck, Dominik; Ng, Michelle; Reimer, Christina; Adams, Felix Ferdinand; Grasedieck, Sarah; Witte, Damian; Käbler, Sebastian; Wong, Jason W H; Shah, Anushi; Huang, Yizhou; Jammal, Razan; Maroz, Aliaksandra; Jongen-Lavrencic, Mojca; Schambach, Axel; Kuchenbauer, Florian; Pimanda, John E; Reinhardt, Dirk; Heckl, Dirk; Klusmann, Jan-Henning

2017-08-09

Non-coding RNAs have emerged as crucial regulators of gene expression and cell fate decisions. However, their expression patterns and regulatory functions during normal and malignant human hematopoiesis are incompletely understood. Here we present a comprehensive resource defining the non-coding RNA landscape of the human hematopoietic system. Based on highly specific non-coding RNA expression portraits per blood cell population, we identify unique fingerprint non-coding RNAs-such as LINC00173 in granulocytes-and assign these to critical regulatory circuits involved in blood homeostasis. Following the incorporation of acute myeloid leukemia samples into the landscape, we further uncover prognostically relevant non-coding RNA stem cell signatures shared between acute myeloid leukemia blasts and healthy hematopoietic stem cells. Our findings highlight the importance of the non-coding transcriptome in the formation and maintenance of the human blood hierarchy.While micro-RNAs are known regulators of haematopoiesis and leukemogenesis, the role of long non-coding RNAs is less clear. Here the authors provide a non-coding RNA expression landscape of the human hematopoietic system, highlighting their role in the formation and maintenance of the human blood hierarchy.
Validation of an entirely in vitro approach for rapid prototyping of DNA regulatory elements for synthetic biology

PubMed Central

Chappell, James; Jensen, Kirsten; Freemont, Paul S.

2013-01-01

A bottleneck in our capacity to rationally and predictably engineer biological systems is the limited number of well-characterized genetic elements from which to build. Current characterization methods are tied to measurements in living systems, the transformation and culturing of which are inherently time-consuming. To address this, we have validated a completely in vitro approach for the characterization of DNA regulatory elements using Escherichia coli extract cell-free systems. Importantly, we demonstrate that characterization in cell-free systems correlates and is reflective of performance in vivo for the most frequently used DNA regulatory elements. Moreover, we devise a rapid and completely in vitro method to generate DNA templates for cell-free systems, bypassing the need for DNA template generation and amplification from living cells. This in vitro approach is significantly quicker than current characterization methods and is amenable to high-throughput techniques, providing a valuable tool for rapidly prototyping libraries of DNA regulatory elements for synthetic biology. PMID:23371936
The contribution of alu elements to mutagenic DNA double-strand break repair.

PubMed

Morales, Maria E; White, Travis B; Streva, Vincent A; DeFreece, Cecily B; Hedges, Dale J; Deininger, Prescott L

2015-03-01

Alu elements make up the largest family of human mobile elements, numbering 1.1 million copies and comprising 11% of the human genome. As a consequence of evolution and genetic drift, Alu elements of various sequence divergence exist throughout the human genome. Alu/Alu recombination has been shown to cause approximately 0.5% of new human genetic diseases and contribute to extensive genomic structural variation. To begin understanding the molecular mechanisms leading to these rearrangements in mammalian cells, we constructed Alu/Alu recombination reporter cell lines containing Alu elements ranging in sequence divergence from 0%-30% that allow detection of both Alu/Alu recombination and large non-homologous end joining (NHEJ) deletions that range from 1.0 to 1.9 kb in size. Introduction of as little as 0.7% sequence divergence between Alu elements resulted in a significant reduction in recombination, which indicates even small degrees of sequence divergence reduce the efficiency of homology-directed DNA double-strand break (DSB) repair. Further reduction in recombination was observed in a sequence divergence-dependent manner for diverged Alu/Alu recombination constructs with up to 10% sequence divergence. With greater levels of sequence divergence (15%-30%), we observed a significant increase in DSB repair due to a shift from Alu/Alu recombination to variable-length NHEJ which removes sequence between the two Alu elements. This increase in NHEJ deletions depends on the presence of Alu sequence homeology (similar but not identical sequences). Analysis of recombination products revealed that Alu/Alu recombination junctions occur more frequently in the first 100 bp of the Alu element within our reporter assay, just as they do in genomic Alu/Alu recombination events. This is the first extensive study characterizing the influence of Alu element sequence divergence on DNA repair, which will inform predictions regarding the effect of Alu element sequence divergence on both
The Inescapable Influence of Noncoding RNAs in Cancer

PubMed Central

Adams, Brian D.; Anastasiadou, Eleni; Esteller, Manel; He, Lin; Slack, Frank J.

2015-01-01

This report summarizes information presented at the 2015 Keystone Symposium on “MicroRNAs and Noncoding RNAs in Cancer”. Nearly two decades after the discovery of the first microRNA (miRNA), the role of noncoding RNAs in developmental processes and the mechanisms behind their dysregulation in cancer has been steadily elucidated. Excitingly, miRNAs have begun making their way into the clinic to combat disease such a hepatitis C, and various forms of cancer. Therefore, at this Keystone meeting novel findings were presented that enhance our view on how small and long noncoding RNAs control developmental timing and oncogenic processes. Recurring themes included, 1) how miRNAs can be differentially processed, degraded, and regulated by ribonucleoprotein (RNP) complexes, 2) how particular miRNA genetic networks that control developmental process, when disrupted, can result in cancer disease, 3) the technologies available to therapeutically deliver RNA to combat diseases such as cancer, and 4) the elucidation of the mechanism of actions for long noncoding RNAs, currently a poorly understood class of noncoding RNA. During the meeting there was an emphasis on presenting unpublished findings, and the breadth of topics covered reflected how inescapable the influence of noncoding RNAs are in development and cancer. PMID:26567137
Nanoparticle-labeled DNA capture elements for detection and identification of biological agents

NASA Astrophysics Data System (ADS)

Kiel, Johnathan L.; Holwitt, Eric A.; Parker, Jill E.; Vivekananda, Jeevalatha; Franz, Veronica

2004-12-01

Aptamers, synthetic DNA capture elements (DCEs), can be made chemically or in genetically engineered bacteria. DNA capture elements are artificial DNA sequences, from a random pool of sequences, selected for their specific binding to potential biological warfare or terrorism agents. These sequences were selected by an affinity method using filters to which the target agent was attached and the DNA isolated and amplified by polymerase chain reaction (PCR) in an iterative, increasingly stringent, process. The probes can then be conjugated to Quantum Dots and super paramagnetic nanoparticles. The former provide intense, bleach-resistant fluorescent detection of bioagent and the latter provide a means to collect the bioagents with a magnet. The fluorescence can be detected in a flow cytometer, in a fluorescence plate reader, or with a fluorescence microscope. To date, we have made DCEs to Bacillus anthracis spores, Shiga toxin, Venezuelan Equine Encephalitis (VEE) virus, and Francisella tularensis. DCEs can easily distinguish Bacillus anthracis from its nearest relatives, Bacillus cereus and Bacillus thuringiensis. Development of a high through-put process is currently being investigated.
piRNA pathway targets active LINE1 elements to establish the repressive H3K9me3 mark in germ cells

PubMed Central

Pezic, Dubravka; Manakov, Sergei A.; Sachidanandam, Ravi; Aravin, Alexei A.

2014-01-01

Transposable elements (TEs) occupy a large fraction of metazoan genomes and pose a constant threat to genomic integrity. This threat is particularly critical in germ cells, as changes in the genome that are induced by TEs will be transmitted to the next generation. Small noncoding piwi-interacting RNAs (piRNAs) recognize and silence a diverse set of TEs in germ cells. In mice, piRNA-guided transposon repression correlates with establishment of CpG DNA methylation on their sequences, yet the mechanism and the spectrum of genomic targets of piRNA silencing are unknown. Here we show that in addition to DNA methylation, the piRNA pathway is required to maintain a high level of the repressive H3K9me3 histone modification on long interspersed nuclear elements (LINEs) in germ cells. piRNA-dependent chromatin repression targets exclusively full-length elements of actively transposing LINE families, demonstrating the remarkable ability of the piRNA pathway to recognize active elements among the large number of genomic transposon fragments. PMID:24939875
DNA capture elements for rapid detection and identification of biological agents

NASA Astrophysics Data System (ADS)

Kiel, Johnathan L.; Parker, Jill E.; Holwitt, Eric A.; Vivekananda, Jeeva

2004-08-01

DNA capture elements (DCEs; aptamers) are artificial DNA sequences, from a random pool of sequences, selected for their specific binding to potential biological warfare agents. These sequences were selected by an affinity method using filters to which the target agent was attached and the DNA isolated and amplified by polymerase chain reaction (PCR) in an iterative, increasingly stringent, process. Reporter molecules were attached to the finished sequences. To date, we have made DCEs to Bacillus anthracis spores, Shiga toxin, Venezuelan Equine Encephalitis (VEE) virus, and Francisella tularensis. These DCEs have demonstrated specificity and sensitivity equal to or better than antibody.
Molecular Regulatory Pathways Link Sepsis With Metabolic Syndrome: Non-coding RNA Elements Underlying the Sepsis/Metabolic Cross-Talk.

PubMed

Meydan, Chanan; Bekenstein, Uriya; Soreq, Hermona

2018-01-01

Sepsis and metabolic syndrome (MetS) are both inflammation-related entities with high impact for human health and the consequences of concussions. Both represent imbalanced parasympathetic/cholinergic response to insulting triggers and variably uncontrolled inflammation that indicates shared upstream regulators, including short microRNAs (miRs) and long non-coding RNAs (lncRNAs). These may cross talk across multiple systems, leading to complex molecular and clinical outcomes. Notably, biomedical and RNA-sequencing based analyses both highlight new links between the acquired and inherited pathogenic, cardiac and inflammatory traits of sepsis/MetS. Those include the HOTAIR and MIAT lncRNAs and their targets, such as miR-122, -150, -155, -182, -197, -375, -608 and HLA-DRA. Implicating non-coding RNA regulators in sepsis and MetS may delineate novel high-value biomarkers and targets for intervention.
[Relevance of long non-coding RNAs in tumour biology].

PubMed

Nagy, Zoltán; Szabó, Diána Rita; Zsippai, Adrienn; Falus, András; Rácz, Károly; Igaz, Péter

2012-09-23

The discovery of the biological relevance of non-coding RNA molecules represents one of the most significant advances in contemporary molecular biology. It has turned out that a major fraction of the non-coding part of the genome is transcribed. Beside small RNAs (including microRNAs) more and more data are disclosed concerning long non-coding RNAs of 200 nucleotides to 100 kb length that are implicated in the regulation of several basic molecular processes (cell proliferation, chromatin functioning, microRNA-mediated effects, etc.). Some of these long non-coding RNAs have been associated with human tumours, including H19, HOTAIR, MALAT1, etc., the different expression of which has been noted in various neoplasms relative to healthy tissues. Long non-coding RNAs may represent novel markers of molecular diagnostics and they might even turn out to be targets of therapeutic intervention.
Cell Type-Specific Chromatin Signatures Underline Regulatory DNA Elements in Human Induced Pluripotent Stem Cells and Somatic Cells.

PubMed

Zhao, Ming-Tao; Shao, Ning-Yi; Hu, Shijun; Ma, Ning; Srinivasan, Rajini; Jahanbani, Fereshteh; Lee, Jaecheol; Zhang, Sophia L; Snyder, Michael P; Wu, Joseph C

2017-11-10

Regulatory DNA elements in the human genome play important roles in determining the transcriptional abundance and spatiotemporal gene expression during embryonic heart development and somatic cell reprogramming. It is not well known how chromatin marks in regulatory DNA elements are modulated to establish cell type-specific gene expression in the human heart. We aimed to decipher the cell type-specific epigenetic signatures in regulatory DNA elements and how they modulate heart-specific gene expression. We profiled genome-wide transcriptional activity and a variety of epigenetic marks in the regulatory DNA elements using massive RNA-seq (n=12) and ChIP-seq (chromatin immunoprecipitation combined with high-throughput sequencing; n=84) in human endothelial cells (CD31 + CD144 + ), cardiac progenitor cells (Sca-1 + ), fibroblasts (DDR2 + ), and their respective induced pluripotent stem cells. We uncovered 2 classes of regulatory DNA elements: class I was identified with ubiquitous enhancer (H3K4me1) and promoter (H3K4me3) marks in all cell types, whereas class II was enriched with H3K4me1 and H3K4me3 in a cell type-specific manner. Both class I and class II regulatory elements exhibited stimulatory roles in nearby gene expression in a given cell type. However, class I promoters displayed more dominant regulatory effects on transcriptional abundance regardless of distal enhancers. Transcription factor network analysis indicated that human induced pluripotent stem cells and somatic cells from the heart selected their preferential regulatory elements to maintain cell type-specific gene expression. In addition, we validated the function of these enhancer elements in transgenic mouse embryos and human cells and identified a few enhancers that could possibly regulate the cardiac-specific gene expression. Given that a large number of genetic variants associated with human diseases are located in regulatory DNA elements, our study provides valuable resources for deciphering
Resurrection of DNA Function In Vivo from an Extinct Genome

PubMed Central

Pask, Andrew J.; Behringer, Richard R.; Renfree, Marilyn B.

2008-01-01

There is a burgeoning repository of information available from ancient DNA that can be used to understand how genomes have evolved and to determine the genetic features that defined a particular species. To assess the functional consequences of changes to a genome, a variety of methods are needed to examine extinct DNA function. We isolated a transcriptional enhancer element from the genome of an extinct marsupial, the Tasmanian tiger (Thylacinus cynocephalus or thylacine), obtained from 100 year-old ethanol-fixed tissues from museum collections. We then examined the function of the enhancer in vivo. Using a transgenic approach, it was possible to resurrect DNA function in transgenic mice. The results demonstrate that the thylacine Col2A1 enhancer directed chondrocyte-specific expression in this extinct mammalian species in the same way as its orthologue does in mice. While other studies have examined extinct coding DNA function in vitro, this is the first example of the restoration of extinct non-coding DNA and examination of its function in vivo. Our method using transgenesis can be used to explore the function of regulatory and protein-coding sequences obtained from any extinct species in an in vivo model system, providing important insights into gene evolution and diversity. PMID:18493600
DNA transposons have colonized the genome of the giant virus Pandoravirus salinus.

PubMed

Sun, Cheng; Feschotte, Cédric; Wu, Zhiqiang; Mueller, Rachel Lockridge

2015-06-12

Transposable elements are mobile DNA sequences that are widely distributed in prokaryotic and eukaryotic genomes, where they represent a major force in genome evolution. However, transposable elements have rarely been documented in viruses, and their contribution to viral genome evolution remains largely unexplored. Pandoraviruses are recently described DNA viruses with genome sizes that exceed those of some prokaryotes, rivaling parasitic eukaryotes. These large genomes appear to include substantial noncoding intergenic spaces, which provide potential locations for transposable element insertions. However, no mobile genetic elements have yet been reported in pandoravirus genomes. Here, we report a family of miniature inverted-repeat transposable elements (MITEs) in the Pandoravirus salinus genome, representing the first description of a virus populated with a canonical transposable element family that proliferated by transposition within the viral genome. The MITE family, which we name Submariner, includes 30 copies with all the hallmarks of MITEs: short length, terminal inverted repeats, TA target site duplication, and no coding capacity. Submariner elements show signs of transposition and are undetectable in the genome of Pandoravirus dulcis, the closest known relative Pandoravirus salinus. We identified a DNA transposon related to Submariner in the genome of Acanthamoeba castellanii, a species thought to host pandoraviruses, which contains remnants of coding sequence for a Tc1/mariner transposase. These observations suggest that the Submariner MITEs of P. salinus belong to the widespread Tc1/mariner superfamily and may have been mobilized by an amoebozoan host. Ten of the 30 MITEs in the P. salinus genome are located within coding regions of predicted genes, while others are close to genes, suggesting that these transposons may have contributed to viral genetic novelty. Our discovery highlights the remarkable ability of DNA transposons to colonize and shape

Comprehensive Reconstruction and Visualization of Non-Coding Regulatory Networks in Human

PubMed Central

Bonnici, Vincenzo; Russo, Francesco; Bombieri, Nicola; Pulvirenti, Alfredo; Giugno, Rosalba

2014-01-01

Research attention has been powered to understand the functional roles of non-coding RNAs (ncRNAs). Many studies have demonstrated their deregulation in cancer and other human disorders. ncRNAs are also present in extracellular human body fluids such as serum and plasma, giving them a great potential as non-invasive biomarkers. However, non-coding RNAs have been relatively recently discovered and a comprehensive database including all of them is still missing. Reconstructing and visualizing the network of ncRNAs interactions are important steps to understand their regulatory mechanism in complex systems. This work presents ncRNA-DB, a NoSQL database that integrates ncRNAs data interactions from a large number of well established on-line repositories. The interactions involve RNA, DNA, proteins, and diseases. ncRNA-DB is available at http://ncrnadb.scienze.univr.it/ncrnadb/. It is equipped with three interfaces: web based, command-line, and a Cytoscape app called ncINetView. By accessing only one resource, users can search for ncRNAs and their interactions, build a network annotated with all known ncRNAs and associated diseases, and use all visual and mining features available in Cytoscape. PMID:25540777
Comprehensive reconstruction and visualization of non-coding regulatory networks in human.

PubMed

Bonnici, Vincenzo; Russo, Francesco; Bombieri, Nicola; Pulvirenti, Alfredo; Giugno, Rosalba

2014-01-01

Research attention has been powered to understand the functional roles of non-coding RNAs (ncRNAs). Many studies have demonstrated their deregulation in cancer and other human disorders. ncRNAs are also present in extracellular human body fluids such as serum and plasma, giving them a great potential as non-invasive biomarkers. However, non-coding RNAs have been relatively recently discovered and a comprehensive database including all of them is still missing. Reconstructing and visualizing the network of ncRNAs interactions are important steps to understand their regulatory mechanism in complex systems. This work presents ncRNA-DB, a NoSQL database that integrates ncRNAs data interactions from a large number of well established on-line repositories. The interactions involve RNA, DNA, proteins, and diseases. ncRNA-DB is available at http://ncrnadb.scienze.univr.it/ncrnadb/. It is equipped with three interfaces: web based, command-line, and a Cytoscape app called ncINetView. By accessing only one resource, users can search for ncRNAs and their interactions, build a network annotated with all known ncRNAs and associated diseases, and use all visual and mining features available in Cytoscape.
Alterations in sperm DNA methylation, non-coding RNA and histone retention associate with DDT-induced epigenetic transgenerational inheritance of disease.

PubMed

Skinner, Michael K; Ben Maamar, Millissia; Sadler-Riggleman, Ingrid; Beck, Daniel; Nilsson, Eric; McBirney, Margaux; Klukovich, Rachel; Xie, Yeming; Tang, Chong; Yan, Wei

2018-02-27

Environmental toxicants such as DDT have been shown to induce the epigenetic transgenerational inheritance of disease (e.g., obesity) through the germline. The current study was designed to investigate the DDT-induced concurrent alterations of a number of different epigenetic processes including DNA methylation, non-coding RNA (ncRNA) and histone retention in sperm. Gestating females were exposed transiently to DDT during fetal gonadal development, and then, the directly exposed F1 generation, the directly exposed germline F2 generation and the transgenerational F3 generation sperm were investigated. DNA methylation and ncRNA were altered in each generation sperm with the direct exposure F1 and F2 generations being predominantly distinct from the F3 generation epimutations. The piRNA and small tRNA were the most predominant classes of ncRNA altered. A highly conserved set of histone retention sites were found in the control lineage generations which was not significantly altered between generations, but a large number of new histone retention sites were found only in the transgenerational generation DDT lineage sperm. Therefore, all three different epigenetic processes were concurrently altered as DDT induced the epigenetic transgenerational inheritance of sperm epimutations. The direct exposure generations sperm epigenetic alterations were distinct from the transgenerational sperm epimutations. The genomic features and gene associations with the epimutations were investigated to help elucidate the integration of these different epigenetic processes. Observations demonstrate all three epigenetic processes are involved in transgenerational inheritance. The different epigenetic processes appear to be integrated in mediating the epigenetic transgenerational inheritance phenomenon.
Regulatory activities of transposable elements: from conflicts to benefits

PubMed Central

Chuong, Edward B.; Elde, Nels C.; Feschotte, Cédric

2017-01-01

Transposable elements (TEs) are a prolific source of tightly regulated, biochemically active non-coding elements, such as transcription factor binding sites and non-coding RNAs. A wealth of recent studies reinvigorates the idea that these elements are pervasively co-opted for the regulation of host genes. We argue that the inherent genetic properties of TEs and conflicting relationships with their hosts facilitate their recruitment for regulatory functions in diverse genomes. We review recent findings supporting the long-standing hypothesis that the waves of TE invasions endured by organisms for eons have catalyzed the evolution of gene regulatory networks. We also discuss the challenges of dissecting and interpreting the phenotypic impact of regulatory activities encoded by TEs in health and disease. PMID:27867194
Evidence for recombination of mtDNA in the marine mussel Mytilus trossulus from the Baltic.

PubMed

Burzyński, Artur; Zbawicka, Małgorzata; Skibinski, David O F; Wenne, Roman

2003-03-01

A number of studies have claimed that recombination occurs in animal mtDNA, although this evidence is controversial. Ladoukakis and Zouros (2001) provided strong evidence for mtDNA recombination in the COIII gene in gonadal tissue in the marine mussel Mytilus galloprovincialis from the Black Sea. The recombinant molecules they reported had not however become established in the population from which experimental animals were sampled. In the present study, we provide further evidence of the generality of mtDNA recombination in Mytilus by reporting recombinant mtDNA molecules in a related mussel species, Mytilus trossulus, from the Baltic. The mtDNA region studied begins in the 16S rRNA gene and terminates in the cytochrome b gene and includes a major noncoding region that may be analogous to the D-loop region observed in other animals. Many bivalve species, including some Mytilus species, are unusual in that they have two mtDNA genomes, one of which is inherited maternally (F genome) the other inherited paternally (M genome). Two recombinant variants reported in the present study have population frequencies of 5% and 36% and appear to be mosaic for F-like and M-like sequences. However, both variants have the noncoding region from the M genome, and both are transmitted to sperm like the M genome. We speculate that acquisition of the noncoding region by the recombinant molecules has conferred a paternal role on mtDNA genomes that otherwise resemble the F genome in sequence.
Movable Genetic Elements: Detection of Changes in Maize DNA at the Shrunken Locus Due to the Intervention of Ds Elements

DOE R&D Accomplishments Database

Burr, B.; Burr, F.A.

1980-05-28

This report describes our initial attempts at the molecular characterization of a maize controlling element. We have prepared a cDNA probe and used it to detect changes at a locus where Ds elements are found. Evidence of their presence are indicated by changes in the restriction patterns, but there is as yet no information on the physical nature of the controlling elements nor on the kinds of rearrangements they cause.
Transcription of highly repetitive tandemly organized DNA in amphibians and birds: A historical overview and modern concepts.

PubMed

Trofimova, Irina; Krasikova, Alla

2016-12-01

Tandemly organized highly repetitive DNA sequences are crucial structural and functional elements of eukaryotic genomes. Despite extensive evidence, satellite DNA remains an enigmatic part of the eukaryotic genome, with biological role and significance of tandem repeat transcripts remaining rather obscure. Data on tandem repeats transcription in amphibian and avian model organisms is fragmentary despite their genomes being thoroughly characterized. Review systematically covers historical and modern data on transcription of amphibian and avian satellite DNA in somatic cells and during meiosis when chromosomes acquire special lampbrush form. We highlight how transcription of tandemly repetitive DNA sequences is organized in interphase nucleus and on lampbrush chromosomes. We offer LTR-activation hypotheses of widespread satellite DNA transcription initiation during oogenesis. Recent explanations are provided for the significance of high-yield production of non-coding RNA derived from tandemly organized highly repetitive DNA. In many cases the data on the transcription of satellite DNA can be extrapolated from lampbrush chromosomes to interphase chromosomes. Lampbrush chromosomes with applied novel technical approaches such as superresolution imaging, chromosome microdissection followed by high-throughput sequencing, dynamic observation in life-like conditions provide amazing opportunities for investigation mechanisms of the satellite DNA transcription.
Transcription of highly repetitive tandemly organized DNA in amphibians and birds: A historical overview and modern concepts

PubMed Central

Krasikova, Alla

2016-01-01

ABSTRACT Tandemly organized highly repetitive DNA sequences are crucial structural and functional elements of eukaryotic genomes. Despite extensive evidence, satellite DNA remains an enigmatic part of the eukaryotic genome, with biological role and significance of tandem repeat transcripts remaining rather obscure. Data on tandem repeats transcription in amphibian and avian model organisms is fragmentary despite their genomes being thoroughly characterized. Review systematically covers historical and modern data on transcription of amphibian and avian satellite DNA in somatic cells and during meiosis when chromosomes acquire special lampbrush form. We highlight how transcription of tandemly repetitive DNA sequences is organized in interphase nucleus and on lampbrush chromosomes. We offer LTR-activation hypotheses of widespread satellite DNA transcription initiation during oogenesis. Recent explanations are provided for the significance of high-yield production of non-coding RNA derived from tandemly organized highly repetitive DNA. In many cases the data on the transcription of satellite DNA can be extrapolated from lampbrush chromosomes to interphase chromosomes. Lampbrush chromosomes with applied novel technical approaches such as superresolution imaging, chromosome microdissection followed by high-throughput sequencing, dynamic observation in life-like conditions provide amazing opportunities for investigation mechanisms of the satellite DNA transcription. PMID:27763817
Non-coding RNAs in cancer brain metastasis

PubMed Central

Wu, Kerui; Sharma, Sambad; Venkat, Suresh; Liu, Keqin; Zhou, Xiaobo; Watabe, Kounosuke

2017-01-01

More than 90% of cancer death is attributed to metastatic disease, and the brain is one of the major metastatic sites of melanoma, colon, renal, lung and breast cancers. Despite the recent advancement of targeted therapy for cancer, the incidence of brain metastasis is increasing. One reason is that most therapeutic drugs can’t penetrate blood-brain-barrier and tumor cells find the brain as sanctuary site. In this review, we describe the pathophysiology of brain metastases to introduce the latest understandings of metastatic brain malignancies. This review also particularly focuses on non-coding RNAs and their roles in cancer brain metastasis. Furthermore, we discuss the roles of the extracellular vesicles as they are known to transport information between cells to initiate cancer cell-microenvironment communication. The potential clinical translation of non-coding RNAs as a tool for diagnosis and for treatment is also discussed in this review. At the end, the computational aspects of non-coding RNA detection, the sequence and structure calculation and epigenetic regulation of non-coding RNA in brain metastasis are discussed. PMID:26709907
Genome-wide mapping of nuclear mitochondrial DNA sequences links DNA replication origins to chromosomal double-strand break formation in Schizosaccharomyces pombe

PubMed Central

Lenglez, Sandrine; Hermand, Damien; Decottignies, Anabelle

2010-01-01

Chromosomal double-strand breaks (DSBs) threaten genome integrity and repair of these lesions is often mutagenic. How and where DSBs are formed is a major question conveniently addressed in simple model organisms like yeast. NUMTs, nuclear DNA sequences of mitochondrial origin, are present in most eukaryotic genomes and probably result from the capture of mitochondrial DNA (mtDNA) fragments into chromosomal breaks. NUMT formation is ongoing and was reported to cause de novo human genetic diseases. Study of NUMTs is likely to contribute to the understanding of naturally occurring chromosomal breaks. We show that Schizosaccharomyces pombe NUMTs are exclusively located in noncoding regions with no preference for gene promoters and, when located into promoters, do not affect gene transcription level. Strikingly, most noncoding regions comprising NUMTs are also associated with a DNA replication origin (ORI). Chromatin immunoprecipitation experiments revealed that chromosomal NUMTs are probably not acting as ORI on their own but that mtDNA insertions occurred directly next to ORIs, suggesting that these loci may be prone to DSB formation. Accordingly, induction of excessive DNA replication origin firing, a phenomenon often associated with human tumor formation, resulted in frequent nucleotide deletion events within ORI3001 subtelomeric chromosomal locus, illustrating a novel aspect of DNA replication-driven genomic instability. How mtDNA is fragmented is another important issue that we addressed by sequencing experimentally induced NUMTs. This highlighted regions of S. pombe mtDNA prone to breaking. Together with an analysis of human NUMTs, we propose that these fragile sites in mtDNA may correspond to replication pause sites. PMID:20688779
Long non-coding RNAs in anti-cancer drug resistance.

PubMed

Chen, Qin-Nan; Wei, Chen-Chen; Wang, Zhao-Xia; Sun, Ming

2017-01-03

Chemotherapy is one of the basic treatments for cancers; however, drug resistance is mainly responsible for the failure of clinical treatment. The mechanism of drug resistance is complicated because of interaction among various factors including drug efflux, DNA damage repair, apoptosis and targets mutation. Long non-coding RNAs (lncRNAs) have been a focus of research in the field of bioscience, and the latest studies have revealed that lncRNAs play essential roles in drug resistance in breast cancer, gastric cancer and lung cancer, et al. Dysregulation of multiple targets and pathways by lncRNAs results in the occurrence of chemoresistance. In this review, we will discuss the mechanisms underlying lncRNA-mediated resistance to chemotherapy and the therapeutic potential of lncRNAs in future cancer treatment.
BcMF11, a novel non-coding RNA gene from Brassica campestris, is required for pollen development and male fertility.

PubMed

Song, Jiang-Hua; Cao, Jia-Shu; Wang, Cheng-Gang

2013-01-01

KEY MESSAGE : BcMF11 as a non-coding RNA gene has an essential role in pollen development, and might be useful for regulating the pollen fertility of crops by antisense RNA technology. We previously identified a 828-bp full-length cDNA of BcMF11, a novel pollen-specific non-coding mRNA-like gene from Chinese cabbage (Brassica campestris L. ssp. chinensis Makino). However, little information is known about the function of BcMF11 in pollen development. To investigate its exact biological roles in pollen development, the BcMF11 cDNA was antisense inhibited in transgenic Chinese cabbage under the control of a tapetum-specific promoter BcA9 and a constitutive promoter CaMV 35S. Antisense RNA transgenic plants displayed decreasing expression of BcMF11 and showed distinct morphological defects. Pollen germination test in vitro and in vivo of the transgenic plants suggested that inhibition of BcMF11 decreased pollen germination efficiency and delayed the pollen tubes' extension in the style. Under scanning electron microscopy, many shrunken and collapsed pollen grains were detected in the antisense BcMF11 transgenic Chinese cabbage. Further cytological observation revealed abnormal pollen development process in transgenic plants, including delayed degradation of tapetum, asynchronous separation of microspore, and aborted development of pollen grain. These results suggest that BcMF11, as a non-coding RNA, plays an essential role in pollen development and male fertility.
Facts and updates about cardiovascular non-coding RNAs in heart failure.

PubMed

Thum, Thomas

2015-09-01

About 11% of all deaths include heart failure as a contributing cause. The annual cost of heart failure amounts to US $34,000,000,000 in the United States alone. With the exception of heart transplantation, there is no curative therapy available. Only occasionally there are new areas in science that develop into completely new research fields. The topic on non-coding RNAs, including microRNAs, long non-coding RNAs, and circular RNAs, is such a field. In this short review, we will discuss the latest developments about non-coding RNAs in cardiovascular disease. MicroRNAs are short regulatory non-coding endogenous RNA species that are involved in virtually all cellular processes. Long non-coding RNAs also regulate gene and protein levels; however, by much more complicated and diverse mechanisms. In general, non-coding RNAs have been shown to be of great value as therapeutic targets in adverse cardiac remodelling and also as diagnostic and prognostic biomarkers for heart failure. In the future, non-coding RNA-based therapeutics are likely to enter the clinical reality offering a new treatment approach of heart failure.
Noncoding RNAs in human intervertebral disc degeneration: An integrated microarray study.

PubMed

Liu, Xu; Che, Lu; Xie, Yan-Ke; Hu, Qing-Jie; Ma, Chi-Jiao; Pei, Yan-Jun; Wu, Zhi-Gang; Liu, Zhi-Heng; Fan, Li-Ying; Wang, Hai-Qiang

2015-09-01

Accumulating evidence indicates that noncoding RNAs play important roles in a multitude of biological processes. The striking findings of miRNAs (microRNAs) and lncRNAs (long noncoding RNAs) as members of noncoding RNAs open up an exciting era in the studies of gene regulation. More recently, the reports of circRNAs (circular RNAs) add fuel to the noncoding RNAs research. Human intervertebral disc degeneration (IDD) is a main cause of low back pain as a disabling spinal disease. We have addressed the expression profiles if miRNAs, lncRNAs and mRNAs in IDD (Wang et al., J Pathology, 2011 and Wan et al., Arthritis Res Ther, 2014). Furthermore, we thoroughly analysed noncoding RNAs, including miRNAs, lncRNAs and circRNAs in IDD using the very same samples. Here we delineate in detail the contents of the aforementioned microarray analyses. Microarray and sample annotation data were deposited in GEO under accession number GSE67567 as SuperSeries. The integrated analyses of these noncoding RNAs will shed a novel light on coding-noncoding regulatory machinery.
Long Noncoding RNAs in Lung Cancer.

PubMed

Roth, Anna; Diederichs, Sven

2016-01-01

Despite great progress in research and treatment options, lung cancer remains the leading cause of cancer-related deaths worldwide. Oncogenic driver mutations in protein-encoding genes were defined and allow for personalized therapies based on genetic diagnoses. Nonetheless, diagnosis of lung cancer mostly occurs at late stages, and chronic treatment is followed by a fast onset of chemoresistance. Hence, there is an urgent need for reliable biomarkers and alternative treatment options. With the era of whole genome and transcriptome sequencing technologies, long noncoding RNAs emerged as a novel class of versatile, functional RNA molecules. Although for most of them the mechanism of action remains to be defined, accumulating evidence confirms their involvement in various aspects of lung tumorigenesis. They are functional on the epigenetic, transcriptional, and posttranscriptional level and are regulators of pathophysiological key pathways including cell growth, apoptosis, and metastasis. Long noncoding RNAs are gaining increasing attention as potential biomarkers and a novel class of druggable molecules. It has become clear that we are only beginning to understand the complexity of tumorigenic processes. The clinical integration of long noncoding RNAs in terms of prognostic and predictive biomarker signatures and additional cancer targets could provide a chance to increase the therapeutic benefit. Here, we review the current knowledge about the expression, regulation, biological function, and clinical relevance of long noncoding RNAs in lung cancer.
Open chromatin defined by DNaseI and FAIRE identifies regulatory elements that shape cell-type identity

PubMed Central

Song, Lingyun; Zhang, Zhancheng; Grasfeder, Linda L.; Boyle, Alan P.; Giresi, Paul G.; Lee, Bum-Kyu; Sheffield, Nathan C.; Gräf, Stefan; Huss, Mikael; Keefe, Damian; Liu, Zheng; London, Darin; McDaniell, Ryan M.; Shibata, Yoichiro; Showers, Kimberly A.; Simon, Jeremy M.; Vales, Teresa; Wang, Tianyuan; Winter, Deborah; Zhang, Zhuzhu; Clarke, Neil D.; Birney, Ewan; Iyer, Vishwanath R.; Crawford, Gregory E.; Lieb, Jason D.; Furey, Terrence S.

2011-01-01

The human body contains thousands of unique cell types, each with specialized functions. Cell identity is governed in large part by gene transcription programs, which are determined by regulatory elements encoded in DNA. To identify regulatory elements active in seven cell lines representative of diverse human cell types, we used DNase-seq and FAIRE-seq (Formaldehyde Assisted Isolation of Regulatory Elements) to map “open chromatin.” Over 870,000 DNaseI or FAIRE sites, which correspond tightly to nucleosome-depleted regions, were identified across the seven cell lines, covering nearly 9% of the genome. The combination of DNaseI and FAIRE is more effective than either assay alone in identifying likely regulatory elements, as judged by coincidence with transcription factor binding locations determined in the same cells. Open chromatin common to all seven cell types tended to be at or near transcription start sites and to be coincident with CTCF binding sites, while open chromatin sites found in only one cell type were typically located away from transcription start sites and contained DNA motifs recognized by regulators of cell-type identity. We show that open chromatin regions bound by CTCF are potent insulators. We identified clusters of open regulatory elements (COREs) that were physically near each other and whose appearance was coordinated among one or more cell types. Gene expression and RNA Pol II binding data support the hypothesis that COREs control gene activity required for the maintenance of cell-type identity. This publicly available atlas of regulatory elements may prove valuable in identifying noncoding DNA sequence variants that are causally linked to human disease. PMID:21750106
α satellite DNA variation and function of the human centromere

PubMed Central

Sullivan, Lori L.; Chew, Kimberline

2017-01-01

ABSTRACT Genomic variation is a source of functional diversity that is typically studied in genic and non-coding regulatory regions. However, the extent of variation within noncoding portions of the human genome, particularly highly repetitive regions, and the functional consequences are not well understood. Satellite DNA, including α satellite DNA found at human centromeres, comprises up to 10% of the genome, but is difficult to study because its repetitive nature hinders contiguous sequence assemblies. We recently described variation within α satellite DNA that affects centromere function. On human chromosome 17 (HSA17), we showed that size and sequence polymorphisms within primary array D17Z1 are associated with chromosome aneuploidy and defective centromere architecture. However, HSA17 can counteract this instability by assembling the centromere at a second, “backup” array lacking variation. Here, we discuss our findings in a broader context of human centromere assembly, and highlight areas of future study to uncover links between genomic and epigenetic features of human centromeres. PMID:28406740
Mosaic organization of DNA nucleotides

NASA Technical Reports Server (NTRS)

Peng, C. K.; Buldyrev, S. V.; Havlin, S.; Simons, M.; Stanley, H. E.; Goldberger, A. L.

1994-01-01

Long-range power-law correlations have been reported recently for DNA sequences containing noncoding regions. We address the question of whether such correlations may be a trivial consequence of the known mosaic structure ("patchiness") of DNA. We analyze two classes of controls consisting of patchy nucleotide sequences generated by different algorithms--one without and one with long-range power-law correlations. Although both types of sequences are highly heterogenous, they are quantitatively distinguishable by an alternative fluctuation analysis method that differentiates local patchiness from long-range correlations. Application of this analysis to selected DNA sequences demonstrates that patchiness is not sufficient to account for long-range correlation properties.
DOMAINS REARRANGED METHYLTRANSFERASE3 controls DNA methylation and regulates RNA polymerase V transcript abundance in Arabidopsis

PubMed Central

Zhong, Xuehua; Hale, Christopher J.; Nguyen, Minh; Ausin, Israel; Groth, Martin; Hetzel, Jonathan; Vashisht, Ajay A.; Henderson, Ian R.; Wohlschlegel, James A.; Jacobsen, Steven E.

2015-01-01

DNA methylation is a mechanism of epigenetic gene regulation and genome defense conserved in many eukaryotic organisms. In Arabidopsis, the DNA methyltransferase DOMAINS REARRANGED METHYLASE 2 (DRM2) controls RNA-directed DNA methylation in a pathway that also involves the plant-specific RNA Polymerase V (Pol V). Additionally, the Arabidopsis genome encodes an evolutionarily conserved but catalytically inactive DNA methyltransferase, DRM3. Here, we show that DRM3 has moderate effects on global DNA methylation and small RNA abundance and that DRM3 physically interacts with Pol V. In Arabidopsis drm3 mutants, we observe a lower level of Pol V-dependent noncoding RNA transcripts even though Pol V chromatin occupancy is increased at many sites in the genome. These findings suggest that DRM3 acts to promote Pol V transcriptional elongation or assist in the stabilization of Pol V transcripts. This work sheds further light on the mechanism by which long noncoding RNAs facilitate RNA-directed DNA methylation. PMID:25561521
TFIIS-Dependent Non-coding Transcription Regulates Developmental Genome Rearrangements

PubMed Central

Maliszewska-Olejniczak, Kamila; Gruchota, Julita; Gromadka, Robert; Denby Wilkes, Cyril; Arnaiz, Olivier; Mathy, Nathalie; Duharcourt, Sandra; Bétermier, Mireille; Nowak, Jacek K.

2015-01-01

Because of their nuclear dimorphism, ciliates provide a unique opportunity to study the role of non-coding RNAs (ncRNAs) in the communication between germline and somatic lineages. In these unicellular eukaryotes, a new somatic nucleus develops at each sexual cycle from a copy of the zygotic (germline) nucleus, while the old somatic nucleus degenerates. In the ciliate Paramecium tetraurelia, the genome is massively rearranged during this process through the reproducible elimination of repeated sequences and the precise excision of over 45,000 short, single-copy Internal Eliminated Sequences (IESs). Different types of ncRNAs resulting from genome-wide transcription were shown to be involved in the epigenetic regulation of genome rearrangements. To understand how ncRNAs are produced from the entire genome, we have focused on a homolog of the TFIIS elongation factor, which regulates RNA polymerase II transcriptional pausing. Six TFIIS-paralogs, representing four distinct families, can be found in P. tetraurelia genome. Using RNA interference, we showed that TFIIS4, which encodes a development-specific TFIIS protein, is essential for the formation of a functional somatic genome. Molecular analyses and high-throughput DNA sequencing upon TFIIS4 RNAi demonstrated that TFIIS4 is involved in all kinds of genome rearrangements, including excision of ~48% of IESs. Localization of a GFP-TFIIS4 fusion revealed that TFIIS4 appears specifically in the new somatic nucleus at an early developmental stage, before IES excision. RT-PCR experiments showed that TFIIS4 is necessary for the synthesis of IES-containing non-coding transcripts. We propose that these IES+ transcripts originate from the developing somatic nucleus and serve as pairing substrates for germline-specific short RNAs that target elimination of their homologous sequences. Our study, therefore, connects the onset of zygotic non coding transcription to the control of genome plasticity in Paramecium, and establishes for

Functional annotation of the vlinc class of non-coding RNAs using systems biology approach

PubMed Central

Laurent, Georges St.; Vyatkin, Yuri; Antonets, Denis; Ri, Maxim; Qi, Yao; Saik, Olga; Shtokalo, Dmitry; de Hoon, Michiel J.L.; Kawaji, Hideya; Itoh, Masayoshi; Lassmann, Timo; Arner, Erik; Forrest, Alistair R.R.; Nicolas, Estelle; McCaffrey, Timothy A.; Carninci, Piero; Hayashizaki, Yoshihide; Wahlestedt, Claes; Kapranov, Philipp

2016-01-01

Functionality of the non-coding transcripts encoded by the human genome is the coveted goal of the modern genomics research. While commonly relied on the classical methods of forward genetics, integration of different genomics datasets in a global Systems Biology fashion presents a more productive avenue of achieving this very complex aim. Here we report application of a Systems Biology-based approach to dissect functionality of a newly identified vast class of very long intergenic non-coding (vlinc) RNAs. Using highly quantitative FANTOM5 CAGE dataset, we show that these RNAs could be grouped into 1542 novel human genes based on analysis of insulators that we show here indeed function as genomic barrier elements. We show that vlincRNAs genes likely function in cis to activate nearby genes. This effect while most pronounced in closely spaced vlincRNA–gene pairs can be detected over relatively large genomic distances. Furthermore, we identified 101 vlincRNA genes likely involved in early embryogenesis based on patterns of their expression and regulation. We also found another 109 such genes potentially involved in cellular functions also happening at early stages of development such as proliferation, migration and apoptosis. Overall, we show that Systems Biology-based methods have great promise for functional annotation of non-coding RNAs. PMID:27001520
A purified transcription factor (TIF-IB) binds to essential sequences of the mouse rDNA promoter.

PubMed Central

Clos, J; Buttgereit, D; Grummt, I

1986-01-01

A transcription factor that is specific for mouse rDNA has been partially purified from Ehrlich ascites cells. This factor [designated transcription initiation factor (TIF)-IB] is required for accurate in vitro synthesis of mouse rRNA in addition to RNA polymerase I and another regulatory factor, TIF-IA. TIF-IB activity is present in extracts both from growing and nongrowing cells in comparable amounts. Prebinding competition experiments with wild-type and mutant templates suggest that TIF-IB interacts with the core control element of the rDNA promoter, which is located immediately upstream of the initiation site. The specific binding of TIF-IB to the RNA polymerase I promoter is demonstrated by exonuclease III protection experiments. The 3' border of the sequences protected by TIF-IB is shown to be on the coding strand at position -21 and on the noncoding strand at position -7. The results suggest that direct binding of TIF-IB to sequences in the core promoter element is the mechanism by which this factor imparts promoter selectivity to RNA polymerase I. Images PMID:3456157
Biological significance of long non-coding RNA FTX expression in human colorectal cancer

PubMed Central

Guo, Xiao-Bo; Hua, Zhu; Li, Chen; Peng, Li-Pan; Wang, Jing-Shen; Wang, Bo; Zhi, Qiao-Ming

2015-01-01

The purpose of this study was to determine the expression of long non-coding RNA (lncRNA) FTX and analyze its prognostic and biological significance in colorectal cancer (CRC). A quantitative reverse transcription PCR was performed to detect the expression of long non-coding RNA FTX in 35 pairs of colorectal cancer and corresponding noncancerous tissues. The expression of long non-coding RNA FTX was detected in 187 colorectal cancer tissues and its correlations with clinicopathological factors of patients were examined. Univariate and multivariate analyses were performed to analyze the prognostic significance of Long Non-coding RNA FTX expression. The effects of long non-coding RNA FTX expression on malignant phenotypes of colorectal cancer cells and its possible biological significances were further determined. Long non-coding RNA FTX was significantly upregulated in colorectal cancer tissues, and low long non-coding RNA FTX expression was significantly correlated with differentiation grade, lymph vascular invasion, and clinical stage. Patients with high long non-coding RNA FTX showed poorer overall survival than those with low long non-coding RNA FTX. Multivariate analyses indicated that status of long non-coding RNA FTX was an independent prognostic factor for patients. Functional analyses showed that upregulation of long non-coding RNA FTX significantly promoted growth, migration, invasion, and increased colony formation in colorectal cancer cells. Therefore, long non-coding RNA FTX may be a potential biomarker for predicting the survival of colorectal cancer patients and might be a molecular target for treatment of human colorectal cancer. PMID:26629053
Biological significance of long non-coding RNA FTX expression in human colorectal cancer.

PubMed

Guo, Xiao-Bo; Hua, Zhu; Li, Chen; Peng, Li-Pan; Wang, Jing-Shen; Wang, Bo; Zhi, Qiao-Ming

2015-01-01

The purpose of this study was to determine the expression of long non-coding RNA (lncRNA) FTX and analyze its prognostic and biological significance in colorectal cancer (CRC). A quantitative reverse transcription PCR was performed to detect the expression of long non-coding RNA FTX in 35 pairs of colorectal cancer and corresponding noncancerous tissues. The expression of long non-coding RNA FTX was detected in 187 colorectal cancer tissues and its correlations with clinicopathological factors of patients were examined. Univariate and multivariate analyses were performed to analyze the prognostic significance of Long Non-coding RNA FTX expression. The effects of long non-coding RNA FTX expression on malignant phenotypes of colorectal cancer cells and its possible biological significances were further determined. Long non-coding RNA FTX was significantly upregulated in colorectal cancer tissues, and low long non-coding RNA FTX expression was significantly correlated with differentiation grade, lymph vascular invasion, and clinical stage. Patients with high long non-coding RNA FTX showed poorer overall survival than those with low long non-coding RNA FTX. Multivariate analyses indicated that status of long non-coding RNA FTX was an independent prognostic factor for patients. Functional analyses showed that upregulation of long non-coding RNA FTX significantly promoted growth, migration, invasion, and increased colony formation in colorectal cancer cells. Therefore, long non-coding RNA FTX may be a potential biomarker for predicting the survival of colorectal cancer patients and might be a molecular target for treatment of human colorectal cancer.
Nucleosome core particles containing a poly(dA.dT) sequence element exhibit a locally distorted DNA structure.

PubMed

Bao, Yunhe; White, Cindy L; Luger, Karolin

2006-08-25

Poly(dA.dT) DNA sequence elements are thought to promote transcription by either excluding nucleosomes or by altering their structural or dynamic properties. Here, the stability and structure of a defined nucleosome core particle containing a 16 base-pair poly(dA.dT) element (A16 NCP) was investigated. The A16 NCP requires a significantly higher temperature for histone octamer sliding in vitro compared to comparable nucleosomes that do not contain a poly(dA.dT) element. Fluorescence resonance energy transfer showed that the interactions between the nucleosomal DNA ends and the histone octamer were destabilized in A16 NCP. The crystal structure of A16 NCP was determined to a resolution of 3.2 A. The overall structure was maintained except for local deviations in DNA conformation. These results are consistent with previous in vivo and in vitro observations that poly(dA.dT) elements cause only modest changes in DNA accessibility and modest increases in steady-state transcription levels.
Regulation of mammalian cell differentiation by long non-coding RNAs

PubMed Central

Hu, Wenqian; Alvarez-Dominguez, Juan R; Lodish, Harvey F

2012-01-01

Differentiation of specialized cell types from stem and progenitor cells is tightly regulated at several levels, both during development and during somatic tissue homeostasis. Many long non-coding RNAs have been recognized as an additional layer of regulation in the specification of cellular identities; these non-coding species can modulate gene-expression programmes in various biological contexts through diverse mechanisms at the transcriptional, translational or messenger RNA stability levels. Here, we summarize findings that implicate long non-coding RNAs in the control of mammalian cell differentiation. We focus on several representative differentiation systems and discuss how specific long non-coding RNAs contribute to the regulation of mammalian development. PMID:23070366
Increased methylation of repetitive elements and DNA repair genes is associated with higher DNA oxidation in children in an urbanized, industrial environment.

PubMed

Alvarado-Cruz, Isabel; Sánchez-Guerra, Marco; Hernández-Cadena, Leticia; De Vizcaya-Ruiz, Andrea; Mugica, Violeta; Pelallo-Martínez, Nadia Azenet; Solís-Heredia, María de Jesús; Byun, Hyang-Min; Baccarelli, Andrea; Quintanilla-Vega, Betzabet

2017-01-01

DNA methylation in DNA repair genes participates in the DNA damage regulation. Particulate matter (PM), which has metals and polycyclic aromatic hydrocarbons (PAHs) adsorbed, among others has been linked to adverse health outcomes and may modify DNA methylation. To evaluate PM exposure impact on repetitive elements and gene-specific DNA methylation and DNA damage, we conducted a cross-sectional study in 150 schoolchildren (7-10 years old) from an urbanized, industrial area of the metropolitan area of Mexico City (MAMC), which frequently exhibits PM concentrations above safety standards. Methylation (5mC) of long interspersed nuclear element-1 (LINE1) and DNA repair gene (OGG1, APEX, and PARP1) was assessed by pyrosequencing in peripheral mononuclear cells, DNA damage by comet assay and DNA oxidation by 8-OHdG content. PAH and metal contents in PM 10 (≤10μm aerodynamic diameter) were determined by HPLC-MS and ICP-AES, respectively. Multiple regression analysis between DNA methylation, DNA damage, and PM 10 exposure showed that PM 10 was significantly associated with oxidative DNA damage; a 1% increase in 5mC at all CpG sites in PARP1 promoter was associated with a 35% increase in 8-OHdG, while a 1% increase at 1, 2, and 3 CpG sites resulted in 38, 9, and 56% increments, respectively. An increase of 10pg/m 3 in benzo[b]fluoranthene content of PM 10 was associated with a 6% increase in LINE1 methylation. Acenaphthene, indene [1,2,3-cd] pyrene, and pyrene concentrations correlated with higher dinucleotide methylation in OGG1, APEX and PARP1 genes, respectively. Vanadium concentration correlated with increased methylation at selected APEX and PARP1 CpG sites. DNA repair gene methylation was significantly correlated with DNA damage and with specific PM 10 -associated PAHs and Vanadium. Data suggest that exposure to PM and its components are associated with differences in DNA methylation of repair genes in children, which may contribute to DNA damage. Copyright © 2016
Dead Element Replicating: Degenerate R2 Element Replication and rDNA Genomic Turnover in the Bacillus rossius Stick Insect (Insecta: Phasmida)

PubMed Central

Martoni, Francesco; Eickbush, Danna G.; Scavariello, Claudia; Luchetti, Andrea; Mantovani, Barbara

2015-01-01

R2 is an extensively investigated non-LTR retrotransposon that specifically inserts into the 28S rRNA gene sequences of a wide range of metazoans, disrupting its functionality. During R2 integration, first strand synthesis can be incomplete so that 5’ end deleted copies are occasionally inserted. While active R2 copies repopulate the locus by retrotransposing, the non-functional truncated elements should frequently be eliminated by molecular drive processes leading to the concerted evolution of the rDNA array(s). Although, multiple R2 lineages have been discovered in the genome of many animals, the rDNA of the stick insect Bacillus rossius exhibits a peculiar situation: it harbors both a canonical, functional R2 element (R2Brfun) as well as a full-length but degenerate element (R2Brdeg). An intensive sequencing survey in the present study reveals that all truncated variants in stick insects are present in multiple copies suggesting they were duplicated by unequal recombination. Sequencing results also demonstrate that all R2Brdeg copies are full-length, i. e. they have no associated 5' end deletions, and functional assays indicate they have lost the active ribozyme necessary for R2 RNA maturation. Although it cannot be completely ruled out, it seems unlikely that the degenerate elements replicate via reverse transcription, exploiting the R2Brfun element enzymatic machinery, but rather via genomic amplification of inserted 28S by unequal recombination. That inactive copies (both R2Brdeg or 5'-truncated elements) are not eliminated in a short term in stick insects contrasts with findings for the Drosophila R2, suggesting a widely different management of rDNA loci and a lower efficiency of the molecular drive while achieving the concerted evolution. PMID:25799008
Roles of Non-Coding RNA in Sugarcane-Microbe Interaction.

PubMed

Thiebaut, Flávia; Rojas, Cristian A; Grativol, Clícia; Calixto, Edmundo P da R; Motta, Mariana R; Ballesteros, Helkin G F; Peixoto, Barbara; de Lima, Berenice N S; Vieira, Lucas M; Walter, Maria Emilia; de Armas, Elvismary M; Entenza, Júlio O P; Lifschitz, Sergio; Farinelli, Laurent; Hemerly, Adriana S; Ferreira, Paulo C G

2017-12-20

Studies have highlighted the importance of non-coding RNA regulation in plant-microbe interaction. However, the roles of sugarcane microRNAs (miRNAs) in the regulation of disease responses have not been investigated. Firstly, we screened the sRNA transcriptome of sugarcane infected with Acidovorax avenae . Conserved and novel miRNAs were identified. Additionally, small interfering RNAs (siRNAs) were aligned to differentially expressed sequences from the sugarcane transcriptome. Interestingly, many siRNAs aligned to a transcript encoding a copper-transporter gene whose expression was induced in the presence of A. avenae , while the siRNAs were repressed in the presence of A. avenae . Moreover, a long intergenic non-coding RNA was identified as a potential target or decoy of miR408. To extend the bioinformatics analysis, we carried out independent inoculations and the expression patterns of six miRNAs were validated by quantitative reverse transcription-PCR (qRT-PCR). Among these miRNAs, miR408-a copper-microRNA-was downregulated. The cleavage of a putative miR408 target, a laccase, was confirmed by a modified 5'RACE (rapid amplification of cDNA ends) assay. MiR408 was also downregulated in samples infected with other pathogens, but it was upregulated in the presence of a beneficial diazotrophic bacteria. Our results suggest that regulation by miR408 is important in sugarcane sensing whether microorganisms are either pathogenic or beneficial, triggering specific miRNA-mediated regulatory mechanisms accordingly.
Roles of Non-Coding RNA in Sugarcane-Microbe Interaction

PubMed Central

Grativol, Clícia; Motta, Mariana R.; Ballesteros, Helkin G. F.; Peixoto, Barbara; Vieira, Lucas M.; Walter, Maria Emilia; de Armas, Elvismary M.; Entenza, Júlio O. P.; Lifschitz, Sergio; Farinelli, Laurent; Hemerly, Adriana S.

2017-01-01

Studies have highlighted the importance of non-coding RNA regulation in plant-microbe interaction. However, the roles of sugarcane microRNAs (miRNAs) in the regulation of disease responses have not been investigated. Firstly, we screened the sRNA transcriptome of sugarcane infected with Acidovorax avenae. Conserved and novel miRNAs were identified. Additionally, small interfering RNAs (siRNAs) were aligned to differentially expressed sequences from the sugarcane transcriptome. Interestingly, many siRNAs aligned to a transcript encoding a copper-transporter gene whose expression was induced in the presence of A. avenae, while the siRNAs were repressed in the presence of A. avenae. Moreover, a long intergenic non-coding RNA was identified as a potential target or decoy of miR408. To extend the bioinformatics analysis, we carried out independent inoculations and the expression patterns of six miRNAs were validated by quantitative reverse transcription-PCR (qRT-PCR). Among these miRNAs, miR408—a copper-microRNA—was downregulated. The cleavage of a putative miR408 target, a laccase, was confirmed by a modified 5′RACE (rapid amplification of cDNA ends) assay. MiR408 was also downregulated in samples infected with other pathogens, but it was upregulated in the presence of a beneficial diazotrophic bacteria. Our results suggest that regulation by miR408 is important in sugarcane sensing whether microorganisms are either pathogenic or beneficial, triggering specific miRNA-mediated regulatory mechanisms accordingly. PMID:29657296
T cells are influenced by a long non-coding RNA in the autoimmune associated PTPN2 locus.

PubMed

Houtman, Miranda; Shchetynsky, Klementy; Chemin, Karine; Hensvold, Aase Haj; Ramsköld, Daniel; Tandre, Karolina; Eloranta, Maija-Leena; Rönnblom, Lars; Uebe, Steffen; Catrina, Anca Irinel; Malmström, Vivianne; Padyukov, Leonid

2018-06-01

Non-coding SNPs in the protein tyrosine phosphatase non-receptor type 2 (PTPN2) locus have been linked with several autoimmune diseases, including rheumatoid arthritis, type I diabetes, and inflammatory bowel disease. However, the functional consequences of these SNPs are poorly characterized. Herein, we show in blood cells that SNPs in the PTPN2 locus are highly correlated with DNA methylation levels at four CpG sites downstream of PTPN2 and expression levels of the long non-coding RNA (lncRNA) LINC01882 downstream of these CpG sites. We observed that LINC01882 is mainly expressed in T cells and that anti-CD3/CD28 activated naïve CD4 + T cells downregulate the expression of LINC01882. RNA sequencing analysis of LINC01882 knockdown in Jurkat T cells, using a combination of antisense oligonucleotides and RNA interference, revealed the upregulation of the transcription factor ZEB1 and kinase MAP2K4, both involved in IL-2 regulation. Overall, our data suggests the involvement of LINC01882 in T cell activation and hints towards an auxiliary role of these non-coding SNPs in autoimmunity associated with the PTPN2 locus. Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.
Automated conserved non-coding sequence (CNS) discovery reveals differences in gene content and promoter evolution among grasses

PubMed Central

Turco, Gina; Schnable, James C.; Pedersen, Brent; Freeling, Michael

2013-01-01

Conserved non-coding sequences (CNS) are islands of non-coding sequence that, like protein coding exons, show less divergence in sequence between related species than functionless DNA. Several CNSs have been demonstrated experimentally to function as cis-regulatory regions. However, the specific functions of most CNSs remain unknown. Previous searches for CNS in plants have either anchored on exons and only identified nearby sequences or required years of painstaking manual annotation. Here we present an open source tool that can accurately identify CNSs between any two related species with sequenced genomes, including both those immediately adjacent to exons and distal sequences separated by >12 kb of non-coding sequence. We have used this tool to characterize new motifs, associate CNSs with additional functions, and identify previously undetected genes encoding RNA and protein in the genomes of five grass species. We provide a list of 15,363 orthologous CNSs conserved across all grasses tested. We were also able to identify regulatory sequences present in the common ancestor of grasses that have been lost in one or more extant grass lineages. Lists of orthologous gene pairs and associated CNSs are provided for reference inbred lines of arabidopsis, Japonica rice, foxtail millet, sorghum, brachypodium, and maize. PMID:23874343
The identification of cis-regulatory elements: A review from a machine learning perspective.

PubMed

Li, Yifeng; Chen, Chih-Yu; Kaye, Alice M; Wasserman, Wyeth W

2015-12-01

The majority of the human genome consists of non-coding regions that have been called junk DNA. However, recent studies have unveiled that these regions contain cis-regulatory elements, such as promoters, enhancers, silencers, insulators, etc. These regulatory elements can play crucial roles in controlling gene expressions in specific cell types, conditions, and developmental stages. Disruption to these regions could contribute to phenotype changes. Precisely identifying regulatory elements is key to deciphering the mechanisms underlying transcriptional regulation. Cis-regulatory events are complex processes that involve chromatin accessibility, transcription factor binding, DNA methylation, histone modifications, and the interactions between them. The development of next-generation sequencing techniques has allowed us to capture these genomic features in depth. Applied analysis of genome sequences for clinical genetics has increased the urgency for detecting these regions. However, the complexity of cis-regulatory events and the deluge of sequencing data require accurate and efficient computational approaches, in particular, machine learning techniques. In this review, we describe machine learning approaches for predicting transcription factor binding sites, enhancers, and promoters, primarily driven by next-generation sequencing data. Data sources are provided in order to facilitate testing of novel methods. The purpose of this review is to attract computational experts and data scientists to advance this field. Crown Copyright © 2015. Published by Elsevier Ireland Ltd. All rights reserved.
Rational Design of Small Molecules Targeting Oncogenic Noncoding RNAs from Sequence.

PubMed

Disney, Matthew D; Angelbello, Alicia J

2016-12-20

The discovery of RNA catalysis in the 1980s and the dissemination of the human genome sequence at the start of this century inspired investigations of the regulatory roles of noncoding RNAs in biology. In fact, the Encyclopedia of DNA Elements (ENCODE) project has shown that only 1-2% of the human genome encodes protein, yet 75% is transcribed into RNA. Functional studies both preceding and following the ENCODE project have shown that these noncoding RNAs have important roles in regulating gene expression, developmental timing, and other critical functions. RNA's diverse roles are often a consequence of the various folds that it adopts. The single-stranded nature of the biopolymer enables it to adopt intramolecular folds with noncanonical pairings to lower its free energy. These folds can be scaffolds to bind proteins or to form frameworks to interact with other RNAs. Not surprisingly, dysregulation of certain noncoding RNAs has been shown to be causative of disease. Given this as the background, it is easy to see why it would be useful to develop methods that target RNA and manipulate its biology in rational and predictable ways. The antisense approach has afforded strategies to target RNAs via Watson-Crick base pairing and has typically focused on targeting partially unstructured regions of RNA. Small molecule strategies to target RNA would be desirable not only because compounds could be lead optimized via medicinal chemistry but also because structured regions within an RNA of interest could be targeted to directly interfere with RNA folds that contribute to disease. Additionally, small molecules have historically been the most successful drug candidates. Until recently, the ability to design small molecules that target non-ribosomal RNAs has been elusive, creating the perception that they are "undruggable". In this Account, approaches to demystify targeting RNA with small molecules are described. Rather than bulk screening for compounds that bind to singular
Method for construction of normalized cDNA libraries

DOEpatents

Soares, Marcelo B.; Efstratiadis, Argiris

1996-01-01

This invention provides a method to normalize a directional cDNA library constructed in a vector that allows propagation in single-stranded circle form comprising: (a) propagating the directional cDNA library in single-stranded circles; (b) generating fragments complementary to the 3' noncoding sequence of the single-stranded circles in the library to produce partial duplexes; (c) purifying the partial duplexes; (d) melting and reassociating the purified partial duplexes to moderate Cot; and (e) purifying the unassociated single-stranded circles, thereby generating a normalized cDNA library.
Method for construction of normalized cDNA libraries

DOEpatents

Soares, M.B.; Efstratiadis, A.

1996-01-09

This invention provides a method to normalize a directional cDNA library constructed in a vector that allows propagation in single-stranded circle form. The method comprises: (a) propagating the directional cDNA library in single-stranded circles; (b) generating fragments complementary to the 3` noncoding sequence of the single-stranded circles in the library to produce partial duplexes; (c) purifying the partial duplexes; (d) melting and reassociating the purified partial duplexes to moderate Cot; and (e) purifying the unassociated single-stranded circles, thereby generating a normalized cDNA library. 4 figs.
Molecular mechanisms of long noncoding RNAs on gastric cancer

PubMed Central

Li, Tianwen; Mo, Xiaoyan; Fu, Liyun; Xiao, Bingxiu; Guo, Junming

2016-01-01

Long noncoding RNAs (lncRNAs) are non-protein coding transcripts longer than 200 nucleotides. Aberrant expression of lncRNAs has been found associated with gastric cancer, one of the most malignant tumors. By complementary base pairing with mRNAs or forming complexes with RNA binding proteins (RBPs), some lncRNAs including GHET1, MALAT1, and TINCR may mediate mRNA stability and splicing. Other lncRNAs, such as BC032469, GAPLINC, and HOTAIR, participate in the competing endogenous RNA (ceRNA) network. Under certain circumstances, ANRIL, GACAT3, H19, MEG3, and TUSC7 exhibit their biological roles by associating with microRNAs (miRNAs). By recruiting histone-modifying complexes, ANRIL, FENDRR, H19, HOTAIR, MALAT1, and PVT1 may inhibit the transcription of target genes in cis or trans. Through these mechanisms, lncRNAs form RNA-dsDNA triplex. CCAT1, GAPLINC, GAS5, H19, MEG3, and TUSC7 play oncogenic or tumor suppressor roles by correlated with tumor suppressor P53 or onco-protein c-Myc, respectively. In conclusion, interaction with DNA, RNA and proteins is involved in lncRNAs’ participation in gastric tumorigenesis and development. PMID:26788991
Targeting noncoding RNAs in disease

PubMed Central

Parsons, Christine; Walker, Lisa; Zhang, Wen Cai; Slack, Frank J.

2017-01-01

Many RNA species have been identified as important players in the development of chronic diseases, including cancer. Over the past decade, numerous studies have highlighted how regulatory RNAs such as microRNAs (miRNAs) and long noncoding RNAs (lncRNAs) play crucial roles in the development of a disease state. It is clear that the aberrant expression of miRNAs promotes tumor initiation and progression, is linked with cardiac dysfunction, allows for the improper physiological response in maintaining glucose and insulin levels, and can prevent the appropriate integration of neuronal networks, resulting in neurodegenerative disorders. Because of this, there has been a major effort to therapeutically target these noncoding RNAs. In just the past 5 years, over 100 antisense oligonucleotide–based therapies have been tested in phase I clinical trials, a quarter of which have reached phase II/III. Most notable are fomivirsen and mipomersen, which have received FDA approval to treat cytomegalovirus retinitis and high blood cholesterol, respectively. The continued improvement of innovative RNA modifications and delivery entities, such as nanoparticles, will aid in the development of future RNA-based therapeutics for a broader range of chronic diseases. Here we summarize the latest promises and challenges of targeting noncoding RNAs in disease. PMID:28248199
Non-coding RNAs: Therapeutic Strategies and Delivery Systems.

PubMed

Ling, Hui

The vast majority of the human genome is transcribed into RNA molecules that do not code for proteins, which could be small ones approximately 20 nucleotide in length, known as microRNAs, or transcripts longer than 200 bp, defined as long noncoding RNAs. The prevalent deregulation of microRNAs in human cancers prompted immediate interest on the therapeutic value of microRNAs as drugs and drug targets. Many features of microRNAs such as well-defined mechanisms, and straightforward oligonucleotide design further make them attractive candidates for therapeutic development. The intensive efforts of exploring microRNA therapeutics are reflected by the large body of preclinical studies using oligonucleotide-based mimicking and blocking, culminated by the recent entry of microRNA therapeutics in clinical trial for several human diseases including cancer. Meanwhile, microRNA therapeutics faces the challenge of effective and safe delivery of nucleic acid therapeutics into the target site. Various chemical modifications of nucleic acids and delivery systems have been developed to increase targeting specificity and efficacy, and reduce the associated side effects including activation of immune response. Recently, long noncoding RNAs become attractive targets for therapeutic intervention because of their association with complex and delicate phenotypes, and their unconventional pharmaceutical activities such as capacity of increasing output of proteins. Here I discuss the general therapeutic strategies targeting noncoding RNAs, review delivery systems developed to maximize noncoding RNA therapeutic efficacy, and offer perspectives on the future development of noncoding RNA targeting agents for colorectal cancer.
Chromosome preference of disease genes and vectorization for the prediction of non-coding disease genes.

PubMed

Peng, Hui; Lan, Chaowang; Liu, Yuansheng; Liu, Tao; Blumenstein, Michael; Li, Jinyan

2017-10-03

Disease-related protein-coding genes have been widely studied, but disease-related non-coding genes remain largely unknown. This work introduces a new vector to represent diseases, and applies the newly vectorized data for a positive-unlabeled learning algorithm to predict and rank disease-related long non-coding RNA (lncRNA) genes. This novel vector representation for diseases consists of two sub-vectors, one is composed of 45 elements, characterizing the information entropies of the disease genes distribution over 45 chromosome substructures. This idea is supported by our observation that some substructures (e.g., the chromosome 6 p-arm) are highly preferred by disease-related protein coding genes, while some (e.g., the 21 p-arm) are not favored at all. The second sub-vector is 30-dimensional, characterizing the distribution of disease gene enriched KEGG pathways in comparison with our manually created pathway groups. The second sub-vector complements with the first one to differentiate between various diseases. Our prediction method outperforms the state-of-the-art methods on benchmark datasets for prioritizing disease related lncRNA genes. The method also works well when only the sequence information of an lncRNA gene is known, or even when a given disease has no currently recognized long non-coding genes.

Chromosome preference of disease genes and vectorization for the prediction of non-coding disease genes

PubMed Central

Peng, Hui; Lan, Chaowang; Liu, Yuansheng; Liu, Tao; Blumenstein, Michael; Li, Jinyan

2017-01-01

Disease-related protein-coding genes have been widely studied, but disease-related non-coding genes remain largely unknown. This work introduces a new vector to represent diseases, and applies the newly vectorized data for a positive-unlabeled learning algorithm to predict and rank disease-related long non-coding RNA (lncRNA) genes. This novel vector representation for diseases consists of two sub-vectors, one is composed of 45 elements, characterizing the information entropies of the disease genes distribution over 45 chromosome substructures. This idea is supported by our observation that some substructures (e.g., the chromosome 6 p-arm) are highly preferred by disease-related protein coding genes, while some (e.g., the 21 p-arm) are not favored at all. The second sub-vector is 30-dimensional, characterizing the distribution of disease gene enriched KEGG pathways in comparison with our manually created pathway groups. The second sub-vector complements with the first one to differentiate between various diseases. Our prediction method outperforms the state-of-the-art methods on benchmark datasets for prioritizing disease related lncRNA genes. The method also works well when only the sequence information of an lncRNA gene is known, or even when a given disease has no currently recognized long non-coding genes. PMID:29108274
Variations in the non-coding transcriptome as a driver of inter-strain divergence and physiological adaptation in bacteria.

PubMed

Kopf, Matthias; Klähn, Stephan; Scholz, Ingeborg; Hess, Wolfgang R; Voß, Björn

2015-04-22

In all studied organisms, a substantial portion of the transcriptome consists of non-coding RNAs that frequently execute regulatory functions. Here, we have compared the primary transcriptomes of the cyanobacteria Synechocystis sp. PCC 6714 and PCC 6803 under 10 different conditions. These strains share 2854 protein-coding genes and a 16S rRNA identity of 99.4%, indicating their close relatedness. Conserved major transcriptional start sites (TSSs) give rise to non-coding transcripts within the sigB gene, from the 5'UTRs of cmpA and isiA, and 168 loci in antisense orientation. Distinct differences include single nucleotide polymorphisms rendering promoters inactive in one of the strains, e.g., for cmpR and for the asRNA PsbA2R. Based on the genome-wide mapped location, regulation and classification of TSSs, non-coding transcripts were identified as the most dynamic component of the transcriptome. We identified a class of mRNAs that originate by read-through from an sRNA that accumulates as a discrete and abundant transcript while also serving as the 5'UTR. Such an sRNA/mRNA structure, which we name 'actuaton', represents another way for bacteria to remodel their transcriptional network. Our findings support the hypothesis that variations in the non-coding transcriptome constitute a major evolutionary element of inter-strain divergence and capability for physiological adaptation.
Variations in the non-coding transcriptome as a driver of inter-strain divergence and physiological adaptation in bacteria

PubMed Central

Kopf, Matthias; Klähn, Stephan; Scholz, Ingeborg; Hess, Wolfgang R.; Voß, Björn

2015-01-01

In all studied organisms, a substantial portion of the transcriptome consists of non-coding RNAs that frequently execute regulatory functions. Here, we have compared the primary transcriptomes of the cyanobacteria Synechocystis sp. PCC 6714 and PCC 6803 under 10 different conditions. These strains share 2854 protein-coding genes and a 16S rRNA identity of 99.4%, indicating their close relatedness. Conserved major transcriptional start sites (TSSs) give rise to non-coding transcripts within the sigB gene, from the 5′UTRs of cmpA and isiA, and 168 loci in antisense orientation. Distinct differences include single nucleotide polymorphisms rendering promoters inactive in one of the strains, e.g., for cmpR and for the asRNA PsbA2R. Based on the genome-wide mapped location, regulation and classification of TSSs, non-coding transcripts were identified as the most dynamic component of the transcriptome. We identified a class of mRNAs that originate by read-through from an sRNA that accumulates as a discrete and abundant transcript while also serving as the 5′UTR. Such an sRNA/mRNA structure, which we name ‘actuaton’, represents another way for bacteria to remodel their transcriptional network. Our findings support the hypothesis that variations in the non-coding transcriptome constitute a major evolutionary element of inter-strain divergence and capability for physiological adaptation. PMID:25902393
Functional annotation of the vlinc class of non-coding RNAs using systems biology approach.

PubMed

St Laurent, Georges; Vyatkin, Yuri; Antonets, Denis; Ri, Maxim; Qi, Yao; Saik, Olga; Shtokalo, Dmitry; de Hoon, Michiel J L; Kawaji, Hideya; Itoh, Masayoshi; Lassmann, Timo; Arner, Erik; Forrest, Alistair R R; Nicolas, Estelle; McCaffrey, Timothy A; Carninci, Piero; Hayashizaki, Yoshihide; Wahlestedt, Claes; Kapranov, Philipp

2016-04-20

Functionality of the non-coding transcripts encoded by the human genome is the coveted goal of the modern genomics research. While commonly relied on the classical methods of forward genetics, integration of different genomics datasets in a global Systems Biology fashion presents a more productive avenue of achieving this very complex aim. Here we report application of a Systems Biology-based approach to dissect functionality of a newly identified vast class of very long intergenic non-coding (vlinc) RNAs. Using highly quantitative FANTOM5 CAGE dataset, we show that these RNAs could be grouped into 1542 novel human genes based on analysis of insulators that we show here indeed function as genomic barrier elements. We show that vlinc RNAs genes likely function in cisto activate nearby genes. This effect while most pronounced in closely spaced vlinc RNA-gene pairs can be detected over relatively large genomic distances. Furthermore, we identified 101 vlinc RNA genes likely involved in early embryogenesis based on patterns of their expression and regulation. We also found another 109 such genes potentially involved in cellular functions also happening at early stages of development such as proliferation, migration and apoptosis. Overall, we show that Systems Biology-based methods have great promise for functional annotation of non-coding RNAs. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Satellite DNA and Transposable Elements in Seabuckthorn (Hippophae rhamnoides), a Dioecious Plant with Small Y and Large X Chromosomes

PubMed Central

Puterova, Janka; Razumova, Olga; Martinek, Tomas; Alexandrov, Oleg; Divashuk, Mikhail; Kubat, Zdenek; Hobza, Roman; Karlov, Gennady

2017-01-01

Seabuckthorn (Hippophae rhamnoides) is a dioecious shrub commonly used in the pharmaceutical, cosmetic, and environmental industry as a source of oil, minerals and vitamins. In this study, we analyzed the transposable elements and satellites in its genome. We carried out Illumina DNA sequencing and reconstructed the main repetitive DNA sequences. For data analysis, we developed a new bioinformatics approach for advanced satellite DNA analysis and showed that about 25% of the genome consists of satellite DNA and about 24% is formed of transposable elements, dominated by Ty3/Gypsy and Ty1/Copia LTR retrotransposons. FISH mapping revealed X chromosome-accumulated, Y chromosome-specific or both sex chromosomes-accumulated satellites but most satellites were found on autosomes. Transposable elements were located mostly in the subtelomeres of all chromosomes. The 5S rDNA and 45S rDNA were localized on one autosomal locus each. Although we demonstrated the small size of the Y chromosome of the seabuckthorn and accumulated satellite DNA there, we were unable to estimate the age and extent of the Y chromosome degeneration. Analysis of dioecious relatives such as Shepherdia would shed more light on the evolution of these sex chromosomes. PMID:28057732
Short interspersed element (SINE) depletion and long interspersed element (LINE) abundance are not features universally required for imprinting.

PubMed

Cowley, Michael; de Burca, Anna; McCole, Ruth B; Chahal, Mandeep; Saadat, Ghazal; Oakey, Rebecca J; Schulz, Reiner

2011-04-20

Genomic imprinting is a form of gene dosage regulation in which a gene is expressed from only one of the alleles, in a manner dependent on the parent of origin. The mechanisms governing imprinted gene expression have been investigated in detail and have greatly contributed to our understanding of genome regulation in general. Both DNA sequence features, such as CpG islands, and epigenetic features, such as DNA methylation and non-coding RNAs, play important roles in achieving imprinted expression. However, the relative importance of these factors varies depending on the locus in question. Defining the minimal features that are absolutely required for imprinting would help us to understand how imprinting has evolved mechanistically. Imprinted retrogenes are a subset of imprinted loci that are relatively simple in their genomic organisation, being distinct from large imprinting clusters, and have the potential to be used as tools to address this question. Here, we compare the repeat element content of imprinted retrogene loci with non-imprinted controls that have a similar locus organisation. We observe no significant differences that are conserved between mouse and human, suggesting that the paucity of SINEs and relative abundance of LINEs at imprinted loci reported by others is not a sequence feature universally required for imprinting.
Method for construction of normalized cDNA libraries

DOEpatents

Soares, Marcelo B.; Efstratiadis, Argiris

1998-01-01

This invention provides a method to normalize a directional cDNA library constructed in a vector that allows propagation in single-stranded circle form comprising: (a) propagating the directional cDNA library in single-stranded circles; (b) generating fragments complementary to the 3' noncoding sequence of the single-stranded circles in the library to produce partial duplexes; (c) purifying the partial duplexes; (d) melting and reassociating the purified partial duplexes to appropriate Cot; and (e) purifying the unassociated single-stranded circles, thereby generating a normalized cDNA library. This invention also provides normalized cDNA libraries generated by the above-described method and uses of the generated libraries.
Method for construction of normalized cDNA libraries

DOEpatents

Soares, M.B.; Efstratiadis, A.

1998-11-03

This invention provides a method to normalize a directional cDNA library constructed in a vector that allows propagation in single-stranded circle form comprising: (a) propagating the directional cDNA library in single-stranded circles; (b) generating fragments complementary to the 3` noncoding sequence of the single-stranded circles in the library to produce partial duplexes; (c) purifying the partial duplexes; (d) melting and reassociating the purified partial duplexes to appropriate Cot; and (e) purifying the unassociated single-stranded circles, thereby generating a normalized cDNA library. This invention also provides normalized cDNA libraries generated by the above-described method and uses of the generated libraries. 19 figs.
Long interspersed nuclear elements (LINEs) show tissue-specific, mosaic genome and methylation-unrestricted, widespread expression of noncoding RNAs in somatic tissues of the rat

PubMed Central

Singh, Deepak K.; Rath, Pramod C.

2012-01-01

We report strong somatic and germ line expression of LINE RNAs in eight different tissues of rat by using a novel ~2.8 kb genomic PstI-LINE DNA (P1-LINE) isolated from the rat brain. P1-LINE is present in a 93 kb LINE-SINE-cluster in sub-telomeric region of chromosome 12 (12p12) and as multiple truncated copies interspersed in all rat chromosomes. P1-LINEs occur as inverted repeats at multiple genomic loci in tissue-specific and mosaic patterns. P1-LINE RNAs are strongly expressed in brain, liver, lungs, heart, kidney, testes, spleen and thymus into large to small heterogeneous RNAs (~5.0 to 0.2 kb) in tissue-specific and dynamic patterns in individual rats. P1-LINE DNA is strongly methylated at CpG-dinucleotides in most genomic copies in all the tissues and weakly hypomethylated in few copies in some tissues. Small (700–75 nt) P1-LINE RNAs expressed in all tissues may be possible precursors for small regulatory RNAs (PIWI-interacting/piRNAs) bioinformatically derived from P1-LINE. The strong and dynamic expression of LINE RNAs from multiple chromosomal loci and the putative piRNAs in somatic tissues of rat under normal physiological conditions may define functional chromosomal domains marked by LINE RNAs as long noncoding RNAs (lncRNAs) unrestricted by DNA methylation. The tissue-specific, dynamic RNA expression and mosaic genomic distribution of LINEs representing a steady-state genomic flux of retrotransposon RNAs suggest for biological role of LINE RNAs as long ncRNAs and small piRNAs in mammalian tissues independent of their cellular fate for translation, reverse-transcription and retrotransposition. This may provide evolutionary advantages to LINEs and mammalian genomes. PMID:23064113
An Integrated Encyclopedia of DNA Elements in the Human Genome

PubMed Central

2012-01-01

Summary The human genome encodes the blueprint of life, but the function of the vast majority of its nearly three billion bases is unknown. The Encyclopedia of DNA Elements (ENCODE) project has systematically mapped regions of transcription, transcription factor association, chromatin structure, and histone modification. These data enabled us to assign biochemical functions for 80% of the genome, in particular outside of the well-studied protein-coding regions. Many discovered candidate regulatory elements are physically associated with one another and with expressed genes, providing new insights into the mechanisms of gene regulation. The newly identified elements also show a statistical correspondence to sequence variants linked to human disease, and can thereby guide interpretation of this variation. Overall the project provides new insights into the organization and regulation of our genes and genome, and an expansive resource of functional annotations for biomedical research. PMID:22955616
Enhancer connectome in primary human cells identifies target genes of disease-associated DNA elements

PubMed Central

Mumbach, Maxwell R; Satpathy, Ansuman T; Boyle, Evan A; Dai, Chao; Gowen, Benjamin G; Cho, Seung Woo; Nguyen, Michelle L; Rubin, Adam J; Granja, Jeffrey M; Kazane, Katelynn R; Wei, Yuning; Nguyen, Trieu; Greenside, Peyton G; Corces, M Ryan; Tycko, Josh; Simeonov, Dimitre R; Suliman, Nabeela; Li, Rui; Xu, Jin; Flynn, Ryan A; Kundaje, Anshul; Khavari, Paul A; Marson, Alexander; Corn, Jacob E; Quertermous, Thomas; Greenleaf, William J; Chang, Howard Y

2018-01-01

The challenge of linking intergenic mutations to target genes has limited molecular understanding of human diseases. Here we show that H3K27ac HiChIP generates high-resolution contact maps of active enhancers and target genes in rare primary human T cell subtypes and coronary artery smooth muscle cells. Differentiation of naive T cells into T helper 17 cells or regulatory T cells creates subtype-specific enhancer–promoter interactions, specifically at regions of shared DNA accessibility. These data provide a principled means of assigning molecular functions to autoimmune and cardiovascular disease risk variants, linking hundreds of noncoding variants to putative gene targets. Target genes identified with HiChIP are further supported by CRISPR interference and activation at linked enhancers, by the presence of expression quantitative trait loci, and by allele-specific enhancer loops in patient-derived primary cells. The majority of disease-associated enhancers contact genes beyond the nearest gene in the linear genome, leading to a fourfold increase in the number of potential target genes for autoimmune and cardiovascular diseases. PMID:28945252
Recurrence time statistics: versatile tools for genomic DNA sequence analysis.

PubMed

Cao, Yinhe; Tung, Wen-Wen; Gao, J B

2004-01-01

With the completion of the human and a few model organisms' genomes, and the genomes of many other organisms waiting to be sequenced, it has become increasingly important to develop faster computational tools which are capable of easily identifying the structures and extracting features from DNA sequences. One of the more important structures in a DNA sequence is repeat-related. Often they have to be masked before protein coding regions along a DNA sequence are to be identified or redundant expressed sequence tags (ESTs) are to be sequenced. Here we report a novel recurrence time based method for sequence analysis. The method can conveniently study all kinds of periodicity and exhaustively find all repeat-related features from a genomic DNA sequence. An efficient codon index is also derived from the recurrence time statistics, which has the salient features of being largely species-independent and working well on very short sequences. Efficient codon indices are key elements of successful gene finding algorithms, and are particularly useful for determining whether a suspected EST belongs to a coding or non-coding region. We illustrate the power of the method by studying the genomes of E. coli, the yeast S. cervisivae, the nematode worm C. elegans, and the human, Homo sapiens. Computationally, our method is very efficient. It allows us to carry out analysis of genomes on the whole genomic scale by a PC.
AP1 Keeps Chromatin Poised for Action | Center for Cancer Research

Cancer.gov

The human genome harbors gene-encoding DNA, the blueprint for building proteins that regulate cellular function. Embedded across the genome, in non-coding regions, are DNA elements to which regulatory factors bind. The interaction of regulatory factors with DNA at these sites modifies gene expression to modulate cell activity. In cells, DNA exists in a complex with proteins
Spontaneous germline excision of Tol1, a DNA-based transposable element naturally occurring in the medaka fish genome.

PubMed

Watanabe, Kohei; Koga, Hajime; Nakamura, Kodai; Fujita, Akiko; Hattori, Akimasa; Matsuda, Masaru; Koga, Akihiko

2014-04-01

DNA-based transposable elements are ubiquitous constituents of eukaryotic genomes. Vertebrates are, however, exceptional in that most of their DNA-based elements appear to be inactivated. The Tol1 element of the medaka fish, Oryzias latipes, is one of the few elements for which copies containing an undamaged gene have been found. Spontaneous transposition of this element in somatic cells has previously been demonstrated, but there is only indirect evidence for its germline transposition. Here, we show direct evidence of spontaneous excision in the germline. Tyrosinase is the key enzyme in melanin biosynthesis. In an albino laboratory strain of medaka fish, which is homozygous for a mutant tyrosinase gene in which a Tol1 copy is inserted, we identified de novo reversion mutations related to melanin pigmentation. The gamete-based reversion rate was as high as 0.4%. The revertant fish carried the tyrosinase gene from which the Tol1 copy had been excised. We previously reported the germline transposition of Tol2, another DNA-based element that is thought to be a recent invader of the medaka fish genome. Tol1 is an ancient resident of the genome. Our results indicate that even an old element can contribute to genetic variation in the host genome as a natural mutator.
Loss of p53-inducible long non-coding RNA LINC01021 increases chemosensitivity

PubMed Central

Kaller, Markus; Götz, Ursula; Hermeking, Heiko

2017-01-01

We have previously identified the long non-coding RNA LINC01021 as a direct p53 target (Hünten et al. Mol Cell Proteomics. 2015; 14:2609-2629). Here, we show that LINC01021 is up-regulated in colorectal cancer (CRC) cell lines upon various p53-activating treatments. The LINC01021 promoter and the p53 binding site lie within a MER61C LTR, which originated from insertion of endogenous retrovirus 1 (ERV1) sequences. Deletion of this MER61C element by a CRISPR/Cas9 approach, as well as siRNA-mediated knockdown of LINC01021 RNA significantly enhanced the sensitivity of the CRC cell line HCT116 towards the chemotherapeutic drugs doxorubicin and 5-FU, suggesting that LINC01021 is an integral part of the p53-mediated response to DNA damage. Inactivation of LINC01021 and also its ectopic expression did not affect p53 protein expression and transcriptional activity, implying that LINC01021 does not feedback to p53. Furthermore, in CRC patient samples LINC01021 expression positively correlated with a wild-type p53-associated gene expression signature. LINC01021 expression was increased in primary colorectal tumors and displayed a bimodal distribution that was particularly pronounced in the mesenchymal CMS4 consensus molecular subtype of CRCs. CMS4 tumors with low LINC01021 expression were associated with poor patient survival. Our results suggest that the genomic redistribution of ERV1-derived p53 response elements and generation of novel p53-inducible lncRNA-encoding genes was selected for during primate evolution as integral part of the cellular response to various forms of genotoxic stress. PMID:29262524
Sequence and structure-specific elements of HERG mRNA determine channel synthesis and trafficking efficiency

PubMed Central

Sroubek, Jakub; Krishnan, Yamini; McDonald, Thomas V.

2013-01-01

Human ether-á-gogo-related gene (HERG) encodes a potassium channel that is highly susceptible to deleterious mutations resulting in susceptibility to fatal cardiac arrhythmias. Most mutations adversely affect HERG channel assembly and trafficking. Why the channel is so vulnerable to missense mutations is not well understood. Since nothing is known of how mRNA structural elements factor in channel processing, we synthesized a codon-modified HERG cDNA (HERG-CM) where the codons were synonymously changed to reduce GC content, secondary structure, and rare codon usage. HERG-CM produced typical IKr-like currents; however, channel synthesis and processing were markedly different. Translation efficiency was reduced for HERG-CM, as determined by heterologous expression, in vitro translation, and polysomal profiling. Trafficking efficiency to the cell surface was greatly enhanced, as assayed by immunofluorescence, subcellular fractionation, and surface labeling. Chimeras of HERG-NT/CM indicated that trafficking efficiency was largely dependent on 5′ sequences, while translation efficiency involved multiple areas. These results suggest that HERG translation and trafficking rates are independently governed by noncoding information in various regions of the mRNA molecule. Noncoding information embedded within the mRNA may play a role in the pathogenesis of hereditary arrhythmia syndromes and could provide an avenue for targeted therapeutics.—Sroubek, J., Krishnan, Y., McDonald, T V. Sequence- and structure-specific elements of HERG mRNA determine channel synthesis and trafficking efficiency. PMID:23608144
Paramecium tetraurelia chromatin assembly factor-1-like protein PtCAF-1 is involved in RNA-mediated control of DNA elimination.

PubMed

Ignarski, Michael; Singh, Aditi; Swart, Estienne C; Arambasic, Miroslav; Sandoval, Pamela Y; Nowacki, Mariusz

2014-10-29

Genome-wide DNA remodelling in the ciliate Paramecium is ensured by RNA-mediated trans-nuclear crosstalk between the germline and the somatic genomes during sexual development. The rearrangements include elimination of transposable elements, minisatellites and tens of thousands non-coding elements called internally eliminated sequences (IESs). The trans-nuclear genome comparison process employs a distinct class of germline small RNAs (scnRNAs) that are compared against the parental somatic genome to select the germline-specific subset of scnRNAs that subsequently target DNA elimination in the progeny genome. Only a handful of proteins involved in this process have been identified so far and the mechanism of DNA targeting is unknown. Here we describe chromatin assembly factor-1-like protein (PtCAF-1), which we show is required for the survival of sexual progeny and localizes first in the parental and later in the newly developing macronucleus. Gene silencing shows that PtCAF-1 is required for the elimination of transposable elements and a subset of IESs. PTCAF-1 depletion also impairs the selection of germline-specific scnRNAs during development. We identify specific histone modifications appearing during Paramecium development which are strongly reduced in PTCAF-1 depleted cells. Our results demonstrate the importance of PtCAF-1 for the epigenetic trans-nuclear cross-talk mechanism. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
The fission yeast CENP-B protein Abp1 prevents pervasive transcription of repetitive DNA elements.

PubMed

Daulny, Anne; Mejía-Ramírez, Eva; Reina, Oscar; Rosado-Lugo, Jesus; Aguilar-Arnal, Lorena; Auer, Herbert; Zaratiegui, Mikel; Azorin, Fernando

2016-10-01

It is well established that eukaryotic genomes are pervasively transcribed producing cryptic unstable transcripts (CUTs). However, the mechanisms regulating pervasive transcription are not well understood. Here, we report that the fission yeast CENP-B homolog Abp1 plays an important role in preventing pervasive transcription. We show that loss of abp1 results in the accumulation of CUTs, which are targeted for degradation by the exosome pathway. These CUTs originate from different types of genomic features, but the highest increase corresponds to Tf2 retrotransposons and rDNA repeats, where they map along the entire elements. In the absence of abp1, increased RNAPII-Ser5P occupancy is observed throughout the Tf2 coding region and, unexpectedly, RNAPII-Ser5P is enriched at rDNA repeats. Loss of abp1 also results in Tf2 derepression and increased nucleolus size. Altogether these results suggest that Abp1 prevents pervasive RNAPII transcription of repetitive DNA elements (i.e., Tf2 and rDNA repeats) from internal cryptic sites. Copyright © 2016 Elsevier B.V. All rights reserved.
Identification of a DNA Segment Exhibiting Rearrangement Modifying Effects upon Transgenic δ-deleting Elements

PubMed Central

Janowski, Karen M.; Ledbetter, Stephanie; Mayo, Matthew S.; Hockett, Richard D.

1997-01-01

Control of the rearrangement and expression of the T cell receptor α and δ chains is critical for determining T cell type. The process of δ deletion is a candidate mechanism for maintaining separation of the α and δ loci. Mice harboring a transgenic reporter δ deletion construct show α/β T cell lineage–specific use of the transgenic elements. A 48-basepair segment of DNA, termed HPS1A, when deleted from this reporter construct, loses tight lineage-specific rearrangement control of transgenic elements, with abundant rearrangements of transgenic δ-deleting elements now in γ/δ T cells. Furthermore, HPS1A augments recombination frequency of extrachromosomal substrates in an in vitro recombination assay. DNA binding proteins recognizing HPS1A have been identified and are restricted to early B and T cells, during the time of active rearrangement of endogenous TCR and immunoglobulin loci. These data are consistent with δ deletion playing an important role in maintaining separate TCR α and δ loci. PMID:9207011
Non-Coding RNA Analysis Using the Rfam Database.

PubMed

Kalvari, Ioanna; Nawrocki, Eric P; Argasinska, Joanna; Quinones-Olvera, Natalia; Finn, Robert D; Bateman, Alex; Petrov, Anton I

2018-06-01

Rfam is a database of non-coding RNA families in which each family is represented by a multiple sequence alignment, a consensus secondary structure, and a covariance model. Using a combination of manual and literature-based curation and a custom software pipeline, Rfam converts descriptions of RNA families found in the scientific literature into computational models that can be used to annotate RNAs belonging to those families in any DNA or RNA sequence. Valuable research outputs that are often locked up in figures and supplementary information files are encapsulated in Rfam entries and made accessible through the Rfam Web site. The data produced by Rfam have a broad application, from genome annotation to providing training sets for algorithm development. This article gives an overview of how to search and navigate the Rfam Web site, and how to annotate sequences with RNA families. The Rfam database is freely available at http://rfam.org. © 2018 by John Wiley & Sons, Inc. Copyright © 2018 John Wiley & Sons, Inc.

Decoding the non-coding RNAs in Alzheimer's disease.

PubMed

Schonrock, Nicole; Götz, Jürgen

2012-11-01

Non-coding RNAs (ncRNAs) are integral components of biological networks with fundamental roles in regulating gene expression. They can integrate sequence information from the DNA code, epigenetic regulation and functions of multimeric protein complexes to potentially determine the epigenetic status and transcriptional network in any given cell. Humans potentially contain more ncRNAs than any other species, especially in the brain, where they may well play a significant role in human development and cognitive ability. This review discusses their emerging role in Alzheimer's disease (AD), a human pathological condition characterized by the progressive impairment of cognitive functions. We discuss the complexity of the ncRNA world and how this is reflected in the regulation of the amyloid precursor protein and Tau, two proteins with central functions in AD. By understanding this intricate regulatory network, there is hope for a better understanding of disease mechanisms and ultimately developing diagnostic and therapeutic tools.
Satellite DNA and Transposable Elements in Seabuckthorn (Hippophae rhamnoides), a Dioecious Plant with Small Y and Large X Chromosomes.

PubMed

Puterova, Janka; Razumova, Olga; Martinek, Tomas; Alexandrov, Oleg; Divashuk, Mikhail; Kubat, Zdenek; Hobza, Roman; Karlov, Gennady; Kejnovsky, Eduard

2017-01-01

Seabuckthorn (Hippophae rhamnoides) is a dioecious shrub commonly used in the pharmaceutical, cosmetic, and environmental industry as a source of oil, minerals and vitamins. In this study, we analyzed the transposable elements and satellites in its genome. We carried out Illumina DNA sequencing and reconstructed the main repetitive DNA sequences. For data analysis, we developed a new bioinformatics approach for advanced satellite DNA analysis and showed that about 25% of the genome consists of satellite DNA and about 24% is formed of transposable elements, dominated by Ty3/Gypsy and Ty1/Copia LTR retrotransposons. FISH mapping revealed X chromosome-accumulated, Y chromosome-specific or both sex chromosomes-accumulated satellites but most satellites were found on autosomes. Transposable elements were located mostly in the subtelomeres of all chromosomes. The 5S rDNA and 45S rDNA were localized on one autosomal locus each. Although we demonstrated the small size of the Y chromosome of the seabuckthorn and accumulated satellite DNA there, we were unable to estimate the age and extent of the Y chromosome degeneration. Analysis of dioecious relatives such as Shepherdia would shed more light on the evolution of these sex chromosomes. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Integrating non-coding RNAs in JAK-STAT regulatory networks

PubMed Central

Witte, Steven; Muljo, Stefan A

2014-01-01

Being a well-characterized pathway, JAK-STAT signaling serves as a valuable paradigm for studying the architecture of gene regulatory networks. The discovery of untranslated or non-coding RNAs, namely microRNAs and long non-coding RNAs, provides an opportunity to elucidate their roles in such networks. In principle, these regulatory RNAs can act as downstream effectors of the JAK-STAT pathway and/or affect signaling by regulating the expression of JAK-STAT components. Examples of interactions between signaling pathways and non-coding RNAs have already emerged in basic cell biology and human diseases such as cancer, and can potentially guide the identification of novel biomarkers or drug targets for medicine. PMID:24778925
Long Noncoding RNA PANDA Positively Regulates Proliferation of Osteosarcoma Cells.

PubMed

Kotake, Yojiro; Goto, Taiki; Naemura, Madoka; Inoue, Yasutoshi; Okamoto, Haruna; Tahara, Keiichiro

2017-01-01

A long noncoding RNA, p21-associated ncRNA DNA damage-activated (PANDA), associates with nuclear transcription factor Y subunit alpha (NF-YA) and inhibits its binding to promoters of apoptosis-related genes, thereby repressing apoptosis in normal human fibroblasts. Here, we show that PANDA is involved in regulating proliferation in the U2OS human osteosarcoma cell line. U2OS cells were transfected with siRNAs against PANDA 72 h later and they were subjected to reverse transcription-polymerase chain reaction (RT-PCR), quantitative RT-PCR and cell-cycle analysis. PANDA was highly expressed in U2OS cells, and its expression was induced by DNA damage. Silencing PANDA caused arrest at the G 1 phase of the cell cycle, leading to inhibition of cell proliferation. Quantitative RT-PCR showed that silencing PANDA increased mRNA levels of the cyclin-dependent kinase inhibitor p18, which caused G 1 phase arrest. These results suggest that PANDA promotes G 1 -S transition by repressing p18 transcription, and thus promotes U2OS cell proliferation. Copyright© 2017 International Institute of Anticancer Research (Dr. John G. Delinassios), All rights reserved.
Nucleotide sequence determination of guinea-pig casein B mRNA reveals homology with bovine and rat alpha s1 caseins and conservation of the non-coding regions of the mRNA.

PubMed Central

Hall, L; Laird, J E; Craig, R K

1984-01-01

Nucleotide sequence analysis of cloned guinea-pig casein B cDNA sequences has identified two casein B variants related to the bovine and rat alpha s1 caseins. Amino acid homology was largely confined to the known bovine or predicted rat phosphorylation sites and within the 'signal' precursor sequence. Comparison of the deduced nucleotide sequence of the guinea-pig and rat alpha s1 casein mRNA species showed greater sequence conservation in the non-coding than in the coding regions, suggesting a functional and possibly regulatory role for the non-coding regions of casein mRNA. The results provide insight into the evolution of the casein genes, and raise questions as to the role of conserved nucleotide sequences within the non-coding regions of mRNA species. Images Fig. 1. PMID:6548375
Ancestral vinclozolin exposure alters the epigenetic transgenerational inheritance of sperm small noncoding RNAs.

PubMed

Schuster, Andrew; Skinner, Michael K; Yan, Wei

Exposure to the agricultural fungicide vinclozolin during gestation promotes a higher incidence of various diseases in the subsequent unexposed F3 and F4 generations. This phenomenon is termed epigenetic transgenerational inheritance and has been shown to in part involve alterations in DNA methylation, but the role of other epigenetic mechanisms remains unknown. The current study investigated the alterations in small noncoding RNA (sncRNA) in the sperm from F3 generation control and vinclozolin lineage rats. Over 200 differentially expressed sncRNAs were identified and the tRNA-derived sncRNAs, namely 5' halves of mature tRNAs (5' halves), displayed the most dramatic changes. Gene targets of the altered miRNAs and tRNA 5' halves revealed associations between the altered sncRNAs and differentially DNA methylated regions. Dysregulated sncRNAs appear to correlate with mRNA profiles associated with the previously observed vinclozolin-induced disease phenotypes. Data suggest potential connections between sperm-borne RNAs and the vinclozolin-induced epigenetic transgenerational inheritance phenomenon.
Origin and evolution of the long non-coding genes in the X-inactivation center.

PubMed

Romito, Antonio; Rougeulle, Claire

2011-11-01

Random X chromosome inactivation (XCI), the eutherian mechanism of X-linked gene dosage compensation, is controlled by a cis-acting locus termed the X-inactivation center (Xic). One of the striking features that characterize the Xic landscape is the abundance of loci transcribing non-coding RNAs (ncRNAs), including Xist, the master regulator of the inactivation process. Recent comparative genomic analyses have depicted the evolutionary scenario behind the origin of the X-inactivation center, revealing that this locus evolved from a region harboring protein-coding genes. During mammalian radiation, this ancestral protein-coding region was disrupted in the marsupial group, whilst it provided in eutherian lineage the starting material for the non-translated RNAs of the X-inactivation center. The emergence of non-coding genes occurred by a dual mechanism involving loss of protein-coding function of the pre-existing genes and integration of different classes of mobile elements, some of which modeled the structure and sequence of the non-coding genes in a species-specific manner. The rising genes started to produce transcripts that acquired function in regulating the epigenetic status of the X chromosome, as shown for Xist, its antisense Tsix, Jpx, and recently suggested for Ftx. Thus, the appearance of the Xic, which occurred after the divergence between eutherians and marsupials, was the basis for the evolution of random X inactivation as a strategy to achieve dosage compensation. Copyright © 2011. Published by Elsevier Masson SAS.
Prosurvival long noncoding RNA PINCR regulates a subset of p53 targets in human colorectal cancer cells by binding to Matrin 3

PubMed Central

Chaudhary, Ritu; Gryder, Berkley; Woods, Wendy S; Subramanian, Murugan; Jones, Matthew F; Li, Xiao Ling; Jenkins, Lisa M; Shabalina, Svetlana A; Mo, Min; Dasso, Mary; Yang, Yuan; Wakefield, Lalage M; Zhu, Yuelin; Frier, Susan M; Moriarity, Branden S; Prasanth, Kannanganattu V; Perez-Pinera, Pablo; Lal, Ashish

2017-01-01

Thousands of long noncoding RNAs (lncRNAs) have been discovered, yet the function of the vast majority remains unclear. Here, we show that a p53-regulated lncRNA which we named PINCR (p53-induced noncoding RNA), is induced ~100-fold after DNA damage and exerts a prosurvival function in human colorectal cancer cells (CRC) in vitro and tumor growth in vivo. Targeted deletion of PINCR in CRC cells significantly impaired G1 arrest and induced hypersensitivity to chemotherapeutic drugs. PINCR regulates the induction of a subset of p53 targets involved in G1 arrest and apoptosis, including BTG2, RRM2B and GPX1. Using a novel RNA pulldown approach that utilized endogenous S1-tagged PINCR, we show that PINCR associates with the enhancer region of these genes by binding to RNA-binding protein Matrin 3 that, in turn, associates with p53. Our findings uncover a critical prosurvival function of a p53/PINCR/Matrin 3 axis in response to DNA damage in CRC cells. DOI: http://dx.doi.org/10.7554/eLife.23244.001 PMID:28580901
Long Non-coding RNA, PANDA, Contributes to the Stabilization of p53 Tumor Suppressor Protein.

PubMed

Kotake, Yojiro; Kitagawa, Kyoko; Ohhata, Tatsuya; Sakai, Satoshi; Uchida, Chiharu; Niida, Hiroyuki; Naemura, Madoka; Kitagawa, Masatoshi

2016-04-01

P21-associated noncoding RNA DNA damage-activated (PANDA) is induced in response to DNA damage and represses apoptosis by inhibiting the function of nuclear transcription factor Y subunit alpha (NF-YA) transcription factor. Herein, we report that PANDA affects regulation of p53 tumor-suppressor protein. U2OS cells were transfected with PANDA siRNAs. At 72 h post-transfection, cells were subjected to immunoblotting and quantitative reverse transcription-polymerase chain reaction. Depletion of PANDA was associated with decreased levels of p53 protein, but not p53 mRNA. The stability of p53 protein was markedly reduced by PANDA silencing. Degradation of p53 protein by silencing PANDA was prevented by treatment of MG132, a proteasome inhibitor. Moreover, depletion of PANDA prevented accumulation of p53 protein, as a result of DNA damage, induced by the genotoxic agent etoposide. These results suggest that PANDA stabilizes p53 protein in response to DNA damage, and provide new insight into the regulatory mechanisms of p53. Copyright© 2016 International Institute of Anticancer Research (Dr. John G. Delinassios), All rights reserved.
The RNAs of RNA-directed DNA methylation

PubMed Central

Wendte, Jered M.; Pikaard, Craig S.

2016-01-01

Summary RNA-directed chromatin modification that includes cytosine methylation silences transposable elements in both plants and mammals, contributing to genome defense and stability. In Arabidopsis thaliana, most RNA-directed DNA methylation (RdDM) is guided by small RNAs derived from double-stranded precursors synthesized at cytosine-methylated loci by nuclear multisubunit RNA Polymerase IV (Pol IV), in close partnership with the RNA-dependent RNA polymerase, RDR2. These small RNAs help keep transposons transcriptionally inactive. However, if transposons escape silencing, and are transcribed by multisubunit RNA polymerase II (Pol II), their mRNAs can be recognized and degraded, generating small RNAs that can also guide initial DNA methylation, thereby enabling subsequent Pol IV-RDR2 recruitment. In both pathways, the small RNAs find their target sites by interacting with longer noncoding RNAs synthesized by multisubunit RNA Polymerase V (Pol V). Despite a decade of progress, numerous questions remain concerning the initiation, synthesis, processing, size and features of the RNAs that drive RdDM. Here, we review recent insights, questions and controversies concerning RNAs produced by Pols IV and V, and their functions in RdDM. We also provide new data concerning Pol V transcript 5’ and 3’ ends. PMID:27521981
Short-lived non-coding transcripts (SLiTs): Clues to regulatory long non-coding RNA.

PubMed

Tani, Hidenori

2017-03-22

Whole transcriptome analyses have revealed a large number of novel long non-coding RNAs (lncRNAs). Although the importance of lncRNAs has been documented in previous reports, the biological and physiological functions of lncRNAs remain largely unknown. The role of lncRNAs seems an elusive problem. Here, I propose a clue to the identification of regulatory lncRNAs. The key point is RNA half-life. RNAs with a long half-life (t 1/2 > 4 h) contain a significant proportion of ncRNAs, as well as mRNAs involved in housekeeping functions, whereas RNAs with a short half-life (t 1/2 < 4 h) include known regulatory ncRNAs and regulatory mRNAs. This novel class of ncRNAs with a short half-life can be categorized as Short-Lived non-coding Transcripts (SLiTs). I consider that SLiTs are likely to be rich in functionally uncharacterized regulatory RNAs. This review describes recent progress in research into SLiTs.
Conserved Nonexonic Elements: A Novel Class of Marker for Phylogenomics.

PubMed

Edwards, Scott V; Cloutier, Alison; Baker, Allan J

2017-11-01

Noncoding markers have a particular appeal as tools for phylogenomic analysis because, at least in vertebrates, they appear less subject to strong variation in GC content among lineages. Thus far, ultraconserved elements (UCEs) and introns have been the most widely used noncoding markers. Here we analyze and study the evolutionary properties of a new type of noncoding marker, conserved nonexonic elements (CNEEs), which consists of noncoding elements that are estimated to evolve slower than the neutral rate across a set of species. Although they often include UCEs, CNEEs are distinct from UCEs because they are not ultraconserved, and, most importantly, the core region alone is analyzed, rather than both the core and its flanking regions. Using a data set of 16 birds plus an alligator outgroup, and ∼3600-∼3800 loci per marker type, we found that although CNEEs were less variable than bioinformatically derived UCEs or introns and in some cases exhibited a slower approach to branch resolution as determined by phylogenomic subsampling, the quality of CNEE alignments was superior to those of the other markers, with fewer gaps and missing species. Phylogenetic resolution using coalescent approaches was comparable among the three marker types, with most nodes being fully and congruently resolved. Comparison of phylogenetic results across the three marker types indicated that one branch, the sister group to the passerine + falcon clade, was resolved differently and with moderate (>70%) bootstrap support between CNEEs and UCEs or introns. Overall, CNEEs appear to be promising as phylogenomic markers, yielding phylogenetic resolution as high as for UCEs and introns but with fewer gaps, less ambiguity in alignments and with patterns of nucleotide substitution more consistent with the assumptions of commonly used methods of phylogenetic analysis. © The Author(s) 2017. Published by Oxford University Press on behalf of the Systematic Biologists.
Conserved Nonexonic Elements: A Novel Class of Marker for Phylogenomics

PubMed Central

Cloutier, Alison; Baker, Allan J.

2017-01-01

Abstract Noncoding markers have a particular appeal as tools for phylogenomic analysis because, at least in vertebrates, they appear less subject to strong variation in GC content among lineages. Thus far, ultraconserved elements (UCEs) and introns have been the most widely used noncoding markers. Here we analyze and study the evolutionary properties of a new type of noncoding marker, conserved nonexonic elements (CNEEs), which consists of noncoding elements that are estimated to evolve slower than the neutral rate across a set of species. Although they often include UCEs, CNEEs are distinct from UCEs because they are not ultraconserved, and, most importantly, the core region alone is analyzed, rather than both the core and its flanking regions. Using a data set of 16 birds plus an alligator outgroup, and ∼3600–∼3800 loci per marker type, we found that although CNEEs were less variable than bioinformatically derived UCEs or introns and in some cases exhibited a slower approach to branch resolution as determined by phylogenomic subsampling, the quality of CNEE alignments was superior to those of the other markers, with fewer gaps and missing species. Phylogenetic resolution using coalescent approaches was comparable among the three marker types, with most nodes being fully and congruently resolved. Comparison of phylogenetic results across the three marker types indicated that one branch, the sister group to the passerine + falcon clade, was resolved differently and with moderate (>70%) bootstrap support between CNEEs and UCEs or introns. Overall, CNEEs appear to be promising as phylogenomic markers, yielding phylogenetic resolution as high as for UCEs and introns but with fewer gaps, less ambiguity in alignments and with patterns of nucleotide substitution more consistent with the assumptions of commonly used methods of phylogenetic analysis. PMID:28637293
Role of non-coding RNAs in non-aging-related neurological disorders.

PubMed

Vieira, A S; Dogini, D B; Lopes-Cendes, I

2018-06-11

Protein coding sequences represent only 2% of the human genome. Recent advances have demonstrated that a significant portion of the genome is actively transcribed as non-coding RNA molecules. These non-coding RNAs are emerging as key players in the regulation of biological processes, and act as "fine-tuners" of gene expression. Neurological disorders are caused by a wide range of genetic mutations, epigenetic and environmental factors, and the exact pathophysiology of many of these conditions is still unknown. It is currently recognized that dysregulations in the expression of non-coding RNAs are present in many neurological disorders and may be relevant in the mechanisms leading to disease. In addition, circulating non-coding RNAs are emerging as potential biomarkers with great potential impact in clinical practice. In this review, we discuss mainly the role of microRNAs and long non-coding RNAs in several neurological disorders, such as epilepsy, Huntington disease, fragile X-associated ataxia, spinocerebellar ataxias, amyotrophic lateral sclerosis (ALS), and pain. In addition, we give information about the conditions where microRNAs have demonstrated to be potential biomarkers such as in epilepsy, pain, and ALS.
The Landscape of long non-coding RNA classification

PubMed Central

St Laurent, Georges; Wahlestedt, Claes; Kapranov, Philipp

2015-01-01

Advances in the depth and quality of transcriptome sequencing have revealed many new classes of long non-coding RNAs (lncRNAs). lncRNA classification has mushroomed to accommodate these new findings, even though the real dimensions and complexity of the non-coding transcriptome remain unknown. Although evidence of functionality of specific lncRNAs continues to accumulate, conflicting, confusing, and overlapping terminology has fostered ambiguity and lack of clarity in the field in general. The lack of fundamental conceptual un-ambiguous classification framework results in a number of challenges in the annotation and interpretation of non-coding transcriptome data. It also might undermine integration of the new genomic methods and datasets in an effort to unravel function of lncRNA. Here, we review existing lncRNA classifications, nomenclature, and terminology. Then we describe the conceptual guidelines that have emerged for their classification and functional annotation based on expanding and more comprehensive use of large systems biology-based datasets. PMID:25869999
Crosstalk between the Notch signaling pathway and non-coding RNAs in gastrointestinal cancers

PubMed Central

Pan, Yangyang; Mao, Yuyan; Jin, Rong; Jiang, Lei

2018-01-01

The Notch signaling pathway is one of the main signaling pathways that mediates direct contact between cells, and is essential for normal development. It regulates various cellular processes, including cell proliferation, apoptosis, migration, invasion, angiogenesis and metastasis. It additionally serves an important function in tumor progression. Non-coding RNAs mainly include small microRNAs, long non-coding RNAs and circular RNAs. At present, a large body of literature supports the biological significance of non-coding RNAs in tumor progression. It is also becoming increasingly evident that cross-talk exists between Notch signaling and non-coding RNAs. The present review summarizes the current knowledge of Notch-mediated gastrointestinal cancer cell processes, and the effect of the crosstalk between the three major types of non-coding RNAs and the Notch signaling pathway on the fate of gastrointestinal cancer cells. PMID:29285185
Structural architecture of the human long non-coding RNA, steroid receptor RNA activator

PubMed Central

Novikova, Irina V.; Hennelly, Scott P.; Sanbonmatsu, Karissa Y.

2012-01-01

While functional roles of several long non-coding RNAs (lncRNAs) have been determined, the molecular mechanisms are not well understood. Here, we report the first experimentally derived secondary structure of a human lncRNA, the steroid receptor RNA activator (SRA), 0.87 kB in size. The SRA RNA is a non-coding RNA that coactivates several human sex hormone receptors and is strongly associated with breast cancer. Coding isoforms of SRA are also expressed to produce proteins, making the SRA gene a unique bifunctional system. Our experimental findings (SHAPE, in-line, DMS and RNase V1 probing) reveal that this lncRNA has a complex structural organization, consisting of four domains, with a variety of secondary structure elements. We examine the coevolution of the SRA gene at the RNA structure and protein structure levels using comparative sequence analysis across vertebrates. Rapid evolutionary stabilization of RNA structure, combined with frame-disrupting mutations in conserved regions, suggests that evolutionary pressure preserves the RNA structural core rather than its translational product. We perform similar experiments on alternatively spliced SRA isoforms to assess their structural features. PMID:22362738
Identification of 15 candidate structured noncoding RNA motifs in fungi by comparative genomics.

PubMed

Li, Sanshu; Breaker, Ronald R

2017-10-13

With the development of rapid and inexpensive DNA sequencing, the genome sequences of more than 100 fungal species have been made available. This dataset provides an excellent resource for comparative genomics analyses, which can be used to discover genetic elements, including noncoding RNAs (ncRNAs). Bioinformatics tools similar to those used to uncover novel ncRNAs in bacteria, likewise, should be useful for searching fungal genomic sequences, and the relative ease of genetic experiments with some model fungal species could facilitate experimental validation studies. We have adapted a bioinformatics pipeline for discovering bacterial ncRNAs to systematically analyze many fungal genomes. This comparative genomics pipeline integrates information on conserved RNA sequence and structural features with alternative splicing information to reveal fungal RNA motifs that are candidate regulatory domains, or that might have other possible functions. A total of 15 prominent classes of structured ncRNA candidates were identified, including variant HDV self-cleaving ribozyme representatives, atypical snoRNA candidates, and possible structured antisense RNA motifs. Candidate regulatory motifs were also found associated with genes for ribosomal proteins, S-adenosylmethionine decarboxylase (SDC), amidase, and HexA protein involved in Woronin body formation. We experimentally confirm that the variant HDV ribozymes undergo rapid self-cleavage, and we demonstrate that the SDC RNA motif reduces the expression of SAM decarboxylase by translational repression. Furthermore, we provide evidence that several other motifs discovered in this study are likely to be functional ncRNA elements. Systematic screening of fungal genomes using a computational discovery pipeline has revealed the existence of a variety of novel structured ncRNAs. Genome contexts and similarities to known ncRNA motifs provide strong evidence for the biological and biochemical functions of some newly found ncRNA motifs
Mavericks, a novel class of giant transposable elements widespread in eukaryotes and related to DNA viruses.

PubMed

Pritham, Ellen J; Putliwala, Tasneem; Feschotte, Cédric

2007-04-01

We previously identified a group of atypical mobile elements designated Mavericks from the nematodes Caenorhabditis elegans and C. briggsae and the zebrafish Danio rerio. Here we present the results of comprehensive database searches of the genome sequences available, which reveal that Mavericks are widespread in invertebrates and non-mammalian vertebrates but show a patchy distribution in non-animal species, being present in the fungi Glomus intraradices and Phakopsora pachyrhizi and in several single-celled eukaryotes such as the ciliate Tetrahymena thermophila, the stramenopile Phytophthora infestans and the trichomonad Trichomonas vaginalis, but not detectable in plants. This distribution, together with comparative and phylogenetic analyses of Maverick-encoded proteins, is suggestive of an ancient origin of these elements in eukaryotes followed by lineage-specific losses and/or recurrent episodes of horizontal transmission. In addition, we report that Maverick elements have amplified recently to high copy numbers in T. vaginalis where they now occupy as much as 30% of the genome. Sequence analysis confirms that most Mavericks encode a retroviral-like integrase, but lack other open reading frames typically found in retroelements. Nevertheless, the length and conservation of the target site duplication created upon Maverick insertion (5- or 6-bp) is consistent with a role of the integrase-like protein in the integration of a double-stranded DNA transposition intermediate. Mavericks also display long terminal-inverted repeats but do not contain ORFs similar to proteins encoded by DNA transposons. Instead, Mavericks encode a conserved set of 5 to 9 genes (in addition to the integrase) that are predicted to encode proteins with homology to replication and packaging proteins of some bacteriophages and diverse eukaryotic double-stranded DNA viruses, including a DNA polymerase B homolog and putative capsid proteins. Based on these and other structural similarities, we
Noncoded amino acids in protein engineering: Structure-activity relationship studies of hirudin-thrombin interaction.

PubMed

De Filippis, Vincenzo; Acquasaliente, Laura; Pontarollo, Giulia; Peterle, Daniele

2018-01-01

The advent of recombinant DNA technology allowed to site-specifically insert, delete, or mutate almost any amino acid in a given protein, significantly improving our knowledge of protein structure, stability, and function. Nevertheless, a quantitative description of the physical and chemical basis that makes a polypeptide chain to efficiently fold into a stable and functionally active conformation is still elusive. This mainly originates from the fact that nature combined, in a yet unknown manner, different properties (i.e., hydrophobicity, conformational propensity, polarizability, and hydrogen bonding capability) into the 20 standard natural amino acids, thus making difficult, if not impossible, to univocally relate the change in protein stability or function to the alteration of physicochemical properties caused by amino acid exchange(s). In this view, incorporation of noncoded amino acids with tailored side chains, allowing to finely tune the structure at a protein site, would facilitate to dissect the effects of a given mutation in terms of one or a few physicochemical properties, thus much expanding the scope of physical organic chemistry in the study of proteins. In this review, relevant applications from our laboratory will be presented on the use of noncoded amino acids in structure-activity relationships studies of hirudin binding to thrombin. © 2017 International Union of Biochemistry and Molecular Biology, Inc.

The Long Noncoding RNA Transcriptome of Dictyostelium discoideum Development.

PubMed

Rosengarten, Rafael D; Santhanam, Balaji; Kokosar, Janez; Shaulsky, Gad

2017-02-09

Dictyostelium discoideum live in the soil as single cells, engulfing bacteria and growing vegetatively. Upon starvation, tens of thousands of amoebae enter a developmental program that includes aggregation, multicellular differentiation, and sporulation. Major shifts across the protein-coding transcriptome accompany these developmental changes. However, no study has presented a global survey of long noncoding RNAs (ncRNAs) in D. discoideum To characterize the antisense and long intergenic noncoding RNA (lncRNA) transcriptome, we analyzed previously published developmental time course samples using an RNA-sequencing (RNA-seq) library preparation method that selectively depletes ribosomal RNAs (rRNAs). We detected the accumulation of transcripts for 9833 protein-coding messenger RNAs (mRNAs), 621 lncRNAs, and 162 putative antisense RNAs (asRNAs). The noncoding RNAs were interspersed throughout the genome, and were distinct in expression level, length, and nucleotide composition. The noncoding transcriptome displayed a temporal profile similar to the coding transcriptome, with stages of gradual change interspersed with larger leaps. The transcription profiles of some noncoding RNAs were strongly correlated with known differentially expressed coding RNAs, hinting at a functional role for these molecules during development. Examining the mitochondrial transcriptome, we modeled two novel antisense transcripts. We applied yet another ribosomal depletion method to a subset of the samples to better retain transfer RNA (tRNA) transcripts. We observed polymorphisms in tRNA anticodons that suggested a post-transcriptional means by which D. discoideum compensates for codons missing in the genomic complement of tRNAs. We concluded that the prevalence and characteristics of long ncRNAs indicate that these molecules are relevant to the progression of molecular and cellular phenotypes during development. Copyright © 2017 Rosengarten et al.
Uncovering drug-responsive regulatory elements

PubMed Central

Luizon, Marcelo R; Ahituv, Nadav

2015-01-01

Nucleotide changes in gene regulatory elements can have a major effect on interindividual differences in drug response. For example, by reviewing all published pharmacogenomic genome-wide association studies, we show here that 96.4% of the associated single nucleotide polymorphisms reside in noncoding regions. We discuss how sequencing technologies are improving our ability to identify drug response-associated regulatory elements genome-wide and to annotate nucleotide variants within them. We highlight specific examples of how nucleotide changes in these elements can affect drug response and illustrate the techniques used to find them and functionally characterize them. Finally, we also discuss challenges in the field of drug-responsive regulatory elements that need to be considered in order to translate these findings into the clinic. PMID:26555224
Correlation approach to identify coding regions in DNA sequences

NASA Technical Reports Server (NTRS)

Ossadnik, S. M.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Mantegna, R. N.; Peng, C. K.; Simons, M.; Stanley, H. E.

1994-01-01

Recently, it was observed that noncoding regions of DNA sequences possess long-range power-law correlations, whereas coding regions typically display only short-range correlations. We develop an algorithm based on this finding that enables investigators to perform a statistical analysis on long DNA sequences to locate possible coding regions. The algorithm is particularly successful in predicting the location of lengthy coding regions. For example, for the complete genome of yeast chromosome III (315,344 nucleotides), at least 82% of the predictions correspond to putative coding regions; the algorithm correctly identified all coding regions larger than 3000 nucleotides, 92% of coding regions between 2000 and 3000 nucleotides long, and 79% of coding regions between 1000 and 2000 nucleotides. The predictive ability of this new algorithm supports the claim that there is a fundamental difference in the correlation property between coding and noncoding sequences. This algorithm, which is not species-dependent, can be implemented with other techniques for rapidly and accurately locating relatively long coding regions in genomic sequences.
cDNA cloning of the human peroxisomal enoyl-CoA hydratase: 3-Hydroxyacyl-CoA dehydrogenase bifunctional enzyme and localization to chromosome 3q26. 3-3q28: A free left Alu arm is inserted in the 3[prime] noncoding region

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hoefler, G.; Forstner, M.; Hulla, W.

1994-01-01

Enoyl-CoA hydratase:3-hydroxyacyl-CoA dehydrogenase bifunctional enzyme is one of the four enzymes of the peroxisomal, [beta]-oxidation pathway. Here, the authors report the full-length human cDNA sequence and the localization of the corresponding gene on chromosome 3q26.3-3q28. The cDNA sequence spans 3779 nucleotides with an open reading frame of 2169 nucleotides. The tripeptide SKL at the carboxy terminus, known to serve as a peroxisomal targeting signal, is present. DNA sequence comparison of the coding region showed an 80% homology between human and rat bifunctional enzyme cDNA. The 3[prime] noncoding sequence contains 117 nucleotides homologous to an Alu repeat. Based on sequence comparison,more » they propose that these nucleotides are a free left Alu arm with 86% homology to the Alu-J family. RNA analysis shows one band with highest intensity in liver and kidney. This cDNA will allow in-depth studies of molecular defects in patients with defective peroxisomal bifunctional enzyme. Moreover, it will also provide a means for studying the regulation of peroxisomal [beta]-oxidation in humans. 33 refs., 5 figs.« less
An expanding universe of noncoding RNAs between the poles of basic science and clinical investigations.

PubMed

Weil, Patrick P; Hensel, Kai O; Weber, David; Postberg, Jan

2016-03-01

The Keystone Symposium 'MicroRNAs and Noncoding RNAs in Cancer', Keystone, CO, USA, 7-12 June 2015 Since the discovery of RNAi, great efforts have been undertaken to unleash the potential biomedical applicability of small noncoding RNAs, mainly miRNAs, involving their use as biomarkers for personalized diagnostics or their usability as active agents or therapy targets. The research's focus on the noncoding RNA world is now slowly moving from a phase of basic discoveries into a new phase, where every single molecule out of many hundreds of cataloged noncoding RNAs becomes dissected in order to investigate these molecules' biomedical relevance. In addition, RNA classes neglected before, such as long noncoding RNAs or circular RNAs attract more attention. Numerous timely results and hypotheses were presented at the 2015 Keystone Symposium 'MicroRNAs and Noncoding RNAs in Cancer'.
Principles of long noncoding RNA evolution derived from direct comparison of transcriptomes in 17 species.

PubMed

Hezroni, Hadas; Koppstein, David; Schwartz, Matthew G; Avrutin, Alexandra; Bartel, David P; Ulitsky, Igor

2015-05-19

The inability to predict long noncoding RNAs from genomic sequence has impeded the use of comparative genomics for studying their biology. Here, we develop methods that use RNA sequencing (RNA-seq) data to annotate the transcriptomes of 16 vertebrates and the echinoid sea urchin, uncovering thousands of previously unannotated genes, most of which produce long intervening noncoding RNAs (lincRNAs). Although in each species, >70% of lincRNAs cannot be traced to homologs in species that diverged >50 million years ago, thousands of human lincRNAs have homologs with similar expression patterns in other species. These homologs share short, 5'-biased patches of sequence conservation nested in exonic architectures that have been extensively rewired, in part by transposable element exonization. Thus, over a thousand human lincRNAs are likely to have conserved functions in mammals, and hundreds beyond mammals, but those functions require only short patches of specific sequences and can tolerate major changes in gene architecture. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Non-coding RNAs and Berberine: A new mechanism of its anti-diabetic activities.

PubMed

Chang, Wenguang

2017-01-15

Type 2 Diabetes (T2D) is a metabolic disease with high mortality and morbidity. Non-coding RNAs, including small and long non-coding RNAs, are a novel class of functional RNA molecules that regulate multiple biological functions through diverse mechanisms. Studies in the last decade have demonstrated that non-coding RNAs may represent compelling therapeutic targets and play important roles in regulating the course of insulin resistance and T2D. Berberine, a plant-based alkaloid, has shown promise as an anti-hyperglycaemic, anti-hyperlipidaemic agent against T2D. Previous studies have primarily focused on a diverse array of efficacy end points of berberine in the pathogenesis of metabolic syndromes and inflammation or oxidative stress. Currently, an increasing number of studies have revealed the importance of non-coding RNAs as regulators of the anti-diabetic effects of berberine. The regulation of non-coding RNAs has been associated with several therapeutic actions of berberine in T2D progression. Thus, this review summarizes the anti-diabetic mechanisms of berberine by focusing on its role in regulating non-coding RNA, thus demonstrating that berberine exerts global anti-diabetic effects by targeting non-coding RNAs and that these effects involve several miRNAs, lncRNAs and multiple signal pathways, which may enhance the current understanding of the anti-diabetic mechanism actions of berberine and provide new pathological targets for the development of berberine-related drugs. Copyright © 2016 Elsevier B.V. All rights reserved.
Conserved Non-Coding Regulatory Signatures in Arabidopsis Co-Expressed Gene Modules

PubMed Central

Spangler, Jacob B.; Ficklin, Stephen P.; Luo, Feng; Freeling, Michael; Feltus, F. Alex

2012-01-01

Complex traits and other polygenic processes require coordinated gene expression. Co-expression networks model mRNA co-expression: the product of gene regulatory networks. To identify regulatory mechanisms underlying coordinated gene expression in a tissue-enriched context, ten Arabidopsis thaliana co-expression networks were constructed after manually sorting 4,566 RNA profiling datasets into aerial, flower, leaf, root, rosette, seedling, seed, shoot, whole plant, and global (all samples combined) groups. Collectively, the ten networks contained 30% of the measurable genes of Arabidopsis and were circumscribed into 5,491 modules. Modules were scrutinized for cis regulatory mechanisms putatively encoded in conserved non-coding sequences (CNSs) previously identified as remnants of a whole genome duplication event. We determined the non-random association of 1,361 unique CNSs to 1,904 co-expression network gene modules. Furthermore, the CNS elements were placed in the context of known gene regulatory networks (GRNs) by connecting 250 CNS motifs with known GRN cis elements. Our results provide support for a regulatory role of some CNS elements and suggest the functional consequences of CNS activation of co-expression in specific gene sets dispersed throughout the genome. PMID:23024789
Conserved non-coding regulatory signatures in Arabidopsis co-expressed gene modules.

PubMed

Spangler, Jacob B; Ficklin, Stephen P; Luo, Feng; Freeling, Michael; Feltus, F Alex

2012-01-01

Complex traits and other polygenic processes require coordinated gene expression. Co-expression networks model mRNA co-expression: the product of gene regulatory networks. To identify regulatory mechanisms underlying coordinated gene expression in a tissue-enriched context, ten Arabidopsis thaliana co-expression networks were constructed after manually sorting 4,566 RNA profiling datasets into aerial, flower, leaf, root, rosette, seedling, seed, shoot, whole plant, and global (all samples combined) groups. Collectively, the ten networks contained 30% of the measurable genes of Arabidopsis and were circumscribed into 5,491 modules. Modules were scrutinized for cis regulatory mechanisms putatively encoded in conserved non-coding sequences (CNSs) previously identified as remnants of a whole genome duplication event. We determined the non-random association of 1,361 unique CNSs to 1,904 co-expression network gene modules. Furthermore, the CNS elements were placed in the context of known gene regulatory networks (GRNs) by connecting 250 CNS motifs with known GRN cis elements. Our results provide support for a regulatory role of some CNS elements and suggest the functional consequences of CNS activation of co-expression in specific gene sets dispersed throughout the genome.
Ancestral vinclozolin exposure alters the epigenetic transgenerational inheritance of sperm small noncoding RNAs

PubMed Central

Schuster, Andrew; Skinner, Michael K.; Yan, Wei

2016-01-01

Abstract Exposure to the agricultural fungicide vinclozolin during gestation promotes a higher incidence of various diseases in the subsequent unexposed F3 and F4 generations. This phenomenon is termed epigenetic transgenerational inheritance and has been shown to in part involve alterations in DNA methylation, but the role of other epigenetic mechanisms remains unknown. The current study investigated the alterations in small noncoding RNA (sncRNA) in the sperm from F3 generation control and vinclozolin lineage rats. Over 200 differentially expressed sncRNAs were identified and the tRNA-derived sncRNAs, namely 5′ halves of mature tRNAs (5′ halves), displayed the most dramatic changes. Gene targets of the altered miRNAs and tRNA 5′ halves revealed associations between the altered sncRNAs and differentially DNA methylated regions. Dysregulated sncRNAs appear to correlate with mRNA profiles associated with the previously observed vinclozolin-induced disease phenotypes. Data suggest potential connections between sperm-borne RNAs and the vinclozolin-induced epigenetic transgenerational inheritance phenomenon. PMID:27390623
Non-coding RNAs and plant male sterility: current knowledge and future prospects.

PubMed

Mishra, Ankita; Bohra, Abhishek

2018-02-01

Latest outcomes assign functional role to non-coding (nc) RNA molecules in regulatory networks that confer male sterility to plants. Male sterility in plants offers great opportunity for improving crop performance through application of hybrid technology. In this respect, cytoplasmic male sterility (CMS) and sterility induced by photoperiod (PGMS)/temperature (TGMS) have greatly facilitated development of high-yielding hybrids in crops. Participation of non-coding (nc) RNA molecules in plant reproductive development is increasingly becoming evident. Recent breakthroughs in rice definitively associate ncRNAs with PGMS and TGMS. In case of CMS, the exact mechanism through which the mitochondrial ORFs exert influence on the development of male gametophyte remains obscure in several crops. High-throughput sequencing has enabled genome-wide discovery and validation of these regulatory molecules and their target genes, describing their potential roles performed in relation to CMS. Discovery of ncRNA localized in plant mtDNA with its possible implication in CMS induction is intriguing in this respect. Still, conclusive evidences linking ncRNA with CMS phenotypes are currently unavailable, demanding complementing genetic approaches like transgenics to substantiate the preliminary findings. Here, we review the recent literature on the contribution of ncRNAs in conferring male sterility to plants, with an emphasis on microRNAs. Also, we present a perspective on improved understanding about ncRNA-mediated regulatory pathways that control male sterility in plants. A refined understanding of plant male sterility would strengthen crop hybrid industry to deliver hybrids with improved performance.
Corylin increases the sensitivity of hepatocellular carcinoma cells to chemotherapy through long noncoding RNA RAD51-AS1-mediated inhibition of DNA repair.

PubMed

Chen, Chin-Chuan; Chen, Chi-Yuan; Ueng, Shir-Hwa; Hsueh, Chuen; Yeh, Chau-Ting; Ho, Jar-Yi; Chou, Li-Fang; Wang, Tong-Hong

2018-05-10

Corylin, a biologically active agent extracted from Psoralea corylifolia L. (Fabaceae), promotes bone differentiation and inhibits inflammation. Currently, few reports have addressed the biological functions that are regulated by corylin, and to date, no studies have investigated its antitumor activity. In this study, we used cell functional assays to analyze the antitumor activity of corylin in hepatocellular carcinoma (HCC). Furthermore, whole-transcriptome assays were performed to identify the downstream genes that were regulated by corylin, and gain-of-function and loss-of-function experiments were conducted to examine the regulatory roles of the above genes. We found that corylin significantly inhibited the proliferation, migration, and invasion of HCC cells and increased the toxic effects of chemotherapeutic agents against HCC cells. These properties were due to the induction of a long noncoding RNA, RAD51-AS1, which bound to RAD51 mRNA, thereby inhibiting RAD51 protein expression, thus inhibiting the DNA damage repair ability of HCC cells. Animal experiments also showed that a combination treatment with corylin significantly increased the inhibitory effects of the chemotherapeutic agent etoposide (VP16) on tumor growth. These findings indicate that corylin has strong potential as an adjuvant drug in HCC treatment and that corylin can strengthen the therapeutic efficacy of chemotherapy and radiotherapy.
Regulating infidelity: RNA-mediated recruitment of AID to DNA during class switch recombination.

PubMed

DiMenna, Lauren J; Chaudhuri, Jayanta

2016-03-01

The mechanism by which the DNA deaminase activation-induced cytidine deaminase (AID) is specifically recruited to repetitive switch region DNA during class switch recombination is still poorly understood. Work over the past decade has revealed a strong link between transcription and RNA polymerase-associated factors in AID recruitment, yet none of these processes satisfactorily explain how AID specificity is affected. Here, we review a recent finding wherein AID is guided to switch regions not by a protein factor but by an RNA moiety, and especially one associated with a noncoding RNA that has been long thought of as being inert. This work explains the long-standing requirement of splicing of noncoding transcripts during class switching, and has implications in both B cell-mediated immunity as well as the underlying pathological syndromes associated with the recombination reaction. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
A screen for nuclear transcripts identifies two linked noncoding RNAs associated with SC35 splicing domains

PubMed Central

Hutchinson, John N; Ensminger, Alexander W; Clemson, Christine M; Lynch, Christopher R; Lawrence, Jeanne B; Chess, Andrew

2007-01-01

Background Noncoding RNA species play a diverse set of roles in the eukaryotic cell. While much recent attention has focused on smaller RNA species, larger noncoding transcripts are also thought to be highly abundant in mammalian cells. To search for large noncoding RNAs that might control gene expression or mRNA metabolism, we used Affymetrix expression arrays to identify polyadenylated RNA transcripts displaying nuclear enrichment. Results This screen identified no more than three transcripts; XIST, and two unique noncoding nuclear enriched abundant transcripts (NEAT) RNAs strikingly located less than 70 kb apart on human chromosome 11: NEAT1, a noncoding RNA from the locus encoding for TncRNA, and NEAT2 (also known as MALAT-1). While the two NEAT transcripts share no significant homology with each other, each is conserved within the mammalian lineage, suggesting significant function for these noncoding RNAs. NEAT2 is extraordinarily well conserved for a noncoding RNA, more so than even XIST. Bioinformatic analyses of publicly available mouse transcriptome data support our findings from human cells as they confirm that the murine homologs of these noncoding RNAs are also nuclear enriched. RNA FISH analyses suggest that these noncoding RNAs function in mRNA metabolism as they demonstrate an intimate association of these RNA species with SC35 nuclear speckles in both human and mouse cells. These studies show that one of these transcripts, NEAT1 localizes to the periphery of such domains, whereas the neighboring transcript, NEAT2, is part of the long-sought polyadenylated component of nuclear speckles. Conclusion Our genome-wide screens in two mammalian species reveal no more than three abundant large non-coding polyadenylated RNAs in the nucleus; the canonical large noncoding RNA XIST and NEAT1 and NEAT2. The function of these noncoding RNAs in mRNA metabolism is suggested by their high levels of conservation and their intimate association with SC35 splicing
Sost, independent of the non-coding enhancer ECR5, is required for bone mechanoadaptation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Robling, Alexander G.; Kang, Kyung Shin; Bullock, Whitney A.

Here, sclerostin ( Sost) is a negative regulator of bone formation that acts upon the Wnt signaling pathway. Sost is mechanically regulated at both mRNA and protein level such that loading represses and unloading enhances Sost expression, in osteocytes and in circulation. The non-coding evolutionarily conserved enhancer ECR5 has been previously reported as a transcriptional regulatory element required for modulating Sost expression in osteocytes. Here we explored the mechanisms by which ECR5, or several other putative transcriptional enhancers regulate Sost expression, in response to mechanical stimulation. We found that in vivo ulna loading is equally osteoanabolic in wildtype and Sostmore » –/– mice, although Sost is required for proper distribution of load-induced bone formation to regions of high strain. Using Luciferase reporters carrying the ECR5 non-coding enhancer and heterologous or homologous h SOST promoters, we found that ECR5 is mechanosensitive in vitro and that ECR5-driven Luciferase activity decreases in osteoblasts exposed to oscillatory fluid flow. Yet, ECR5–/– mice showed similar magnitude of load-induced bone formation and similar periosteal distribution of bone formation to high-strain regions compared to wildtype mice. Further, we found that in contrast to Sost–/– mice, which are resistant to disuse-induced bone loss, ECR5–/– mice lose bone upon unloading to a degree similar to wildtype control mice. ECR5 deletion did not abrogate positive effects of unloading on Sost, suggesting that additional transcriptional regulators and regulatory elements contribute to load-induced regulation of Sost.« less
Sost, independent of the non-coding enhancer ECR5, is required for bone mechanoadaptation

DOE PAGES

Robling, Alexander G.; Kang, Kyung Shin; Bullock, Whitney A.; ...

2016-09-04

Here, sclerostin ( Sost) is a negative regulator of bone formation that acts upon the Wnt signaling pathway. Sost is mechanically regulated at both mRNA and protein level such that loading represses and unloading enhances Sost expression, in osteocytes and in circulation. The non-coding evolutionarily conserved enhancer ECR5 has been previously reported as a transcriptional regulatory element required for modulating Sost expression in osteocytes. Here we explored the mechanisms by which ECR5, or several other putative transcriptional enhancers regulate Sost expression, in response to mechanical stimulation. We found that in vivo ulna loading is equally osteoanabolic in wildtype and Sostmore » –/– mice, although Sost is required for proper distribution of load-induced bone formation to regions of high strain. Using Luciferase reporters carrying the ECR5 non-coding enhancer and heterologous or homologous h SOST promoters, we found that ECR5 is mechanosensitive in vitro and that ECR5-driven Luciferase activity decreases in osteoblasts exposed to oscillatory fluid flow. Yet, ECR5–/– mice showed similar magnitude of load-induced bone formation and similar periosteal distribution of bone formation to high-strain regions compared to wildtype mice. Further, we found that in contrast to Sost–/– mice, which are resistant to disuse-induced bone loss, ECR5–/– mice lose bone upon unloading to a degree similar to wildtype control mice. ECR5 deletion did not abrogate positive effects of unloading on Sost, suggesting that additional transcriptional regulators and regulatory elements contribute to load-induced regulation of Sost.« less
Dehydration stress extends mRNA 3′ untranslated regions with noncoding RNA functions in Arabidopsis

PubMed Central

Sun, Hai-Xi; Li, Yan; Niu, Qi-Wen; Chua, Nam-Hai

2017-01-01

The 3′ untranslated regions (3′ UTRs) of mRNAs play important roles in the regulation of mRNA localization, translation, and stability. Alternative cleavage and polyadenylation (APA) generates mRNAs with different 3′ UTRs, but the involvement of this process in stress response has not yet been clarified. Here, we report that a subset of stress-related genes exhibits 3′ UTR extensions of their mRNAs during dehydration stress. These extended 3′ UTRs have characteristics of long noncoding RNAs and likely do not interact with miRNAs. Functional studies using T-DNA insertion mutants reveal that they can act as antisense transcripts to repress expression levels of sense genes from the opposite strand or can activate the transcription or lead to read-through transcription of their downstream genes. Further analysis suggests that transcripts with 3′ UTR extensions have weaker poly(A) signals than those without 3′ UTR extensions. Finally, we show that their biogenesis is partially dependent on a trans-acting factor FPA. Taken together, we report that dehydration stress could induce transcript 3′ UTR extensions and elucidate a novel function for these stress-induced 3′ UTR extensions as long noncoding RNAs in the regulation of their neighboring genes. PMID:28522613
Birth, coming of age and death: The intriguing life of long noncoding RNAs.

PubMed

Samudyata; Castelo-Branco, Gonçalo; Bonetti, Alessandro

2018-07-01

Mammalian genomes are pervasively transcribed, with long noncoding RNAs being the most abundant fraction. Recent studies have highlighted the central role played by these transcripts in several physiological and pathological processes. Despite several metabolic features shared between coding and noncoding transcripts, these two classes of RNAs exhibit multiple differences regarding their biogenesis and processing. Here we review such distinctions, focusing on the unique features of specific long noncoding RNAs. Copyright © 2017 Elsevier Ltd. All rights reserved.
Long Noncoding RNA-Associated Transcriptomic Changes in Resiliency or Susceptibility to Depression and Response to Antidepressant Treatment

PubMed Central

Roy, Bhaskar; Wang, Qingzhong; Dwivedi, Yogesh

2018-01-01

Abstract Background Recent emergence of long noncoding RNAs in regulating gene expression and thereby modulating physiological functions in brain has manifested their possible role in psychiatric disorders. In this study, the roles of long noncoding RNAs in susceptibility and resiliency to develop stress-induced depression and their response to antidepressant treatment were examined. Methods Microarray-based transcriptome-wide changes in long noncoding RNAs were determined in hippocampus of male Holtzman rats who showed susceptibility (learned helplessness) or resiliency (nonlearned helplessness) to develop depression. Changes in long noncoding RNA expression were also ascertained after subchronic administration of fluoxetine to learned helplessness rats. Bioinformatic and target prediction analyses (cis- and trans-acting) and qPCR-based assays were performed to decipher the functional role of altered long noncoding RNAs. Results Group-wise comparison showed an overrepresented class of long noncoding RNAs that were uniquely associated with nonlearned helplessness or learned helplessness behavior. Chromosomal mapping within the 5-kbp flank region of the top 20 dysregulated long noncoding RNAs in the learned helplessness group showed several target genes that were regulated through cis- or trans-actions, including Zbtb20 and Zfp385b from zinc finger binding protein family. Genomic context of differentially expressed long noncoding RNAs showed an overall blunted response in the learned helplessness group regardless of the long noncoding RNA classes analyzed. Gene ontology exhibited the functional clustering for anatomical structure development, cellular architecture modulation, protein metabolism, and cellular communications. Fluoxetine treatment reversed learned helplessness-induced changes in many long noncoding RNAs and target genes. Conclusions The involvement of specific classes of long noncoding RNAs with distinctive roles in modulating target gene expression
Changes in the Coding and Non-coding Transcriptome and DNA Methylome that Define the Schwann Cell Repair Phenotype after Nerve Injury.

PubMed

Arthur-Farraj, Peter J; Morgan, Claire C; Adamowicz, Martyna; Gomez-Sanchez, Jose A; Fazal, Shaline V; Beucher, Anthony; Razzaghi, Bonnie; Mirsky, Rhona; Jessen, Kristjan R; Aitman, Timothy J

2017-09-12

Repair Schwann cells play a critical role in orchestrating nerve repair after injury, but the cellular and molecular processes that generate them are poorly understood. Here, we perform a combined whole-genome, coding and non-coding RNA and CpG methylation study following nerve injury. We show that genes involved in the epithelial-mesenchymal transition are enriched in repair cells, and we identify several long non-coding RNAs in Schwann cells. We demonstrate that the AP-1 transcription factor C-JUN regulates the expression of certain micro RNAs in repair Schwann cells, in particular miR-21 and miR-34. Surprisingly, unlike during development, changes in CpG methylation are limited in injury, restricted to specific locations, such as enhancer regions of Schwann cell-specific genes (e.g., Nedd4l), and close to local enrichment of AP-1 motifs. These genetic and epigenomic changes broaden our mechanistic understanding of the formation of repair Schwann cell during peripheral nervous system tissue repair. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

Long non-coding RNAs in cancer metabolism.

PubMed

Xiao, Zhen-Dong; Zhuang, Li; Gan, Boyi

2016-10-01

Altered cellular metabolism is an emerging hallmark of cancer. Accumulating recent evidence links long non-coding RNAs (lncRNAs), a still poorly understood class of non-coding RNAs, to cancer metabolism. Here we review the emerging findings on the functions of lncRNAs in cancer metabolism, with particular emphasis on how lncRNAs regulate glucose and glutamine metabolism in cancer cells, discuss how lncRNAs regulate various aspects of cancer metabolism through their cross-talk with other macromolecules, explore the mechanistic conceptual framework of lncRNAs in reprogramming metabolism in cancers, and highlight the challenges in this field. A more in-depth understanding of lncRNAs in cancer metabolism may enable the development of novel and effective therapeutic strategies targeting cancer metabolism. © 2016 WILEY Periodicals, Inc.
RPS8—a New Informative DNA Marker for Phylogeny of Babesia and Theileria Parasites in China

PubMed Central

Tian, Zhan-Cheng; Liu, Guang-Yuan; Yin, Hong; Luo, Jian-Xun; Guan, Gui-Quan; Luo, Jin; Xie, Jun-Ren; Shen, Hui; Tian, Mei-Yuan; Zheng, Jin-feng; Yuan, Xiao-song; Wang, Fang-fang

2013-01-01

Piroplasmosis is a serious debilitating and sometimes fatal disease. Phylogenetic relationships within piroplasmida are complex and remain unclear. We compared the intron–exon structure and DNA sequences of the RPS8 gene from Babesia and Theileria spp. isolates in China. Similar to 18S rDNA, the 40S ribosomal protein S8 gene, RPS8, including both coding and non-coding regions is a useful and novel genetic marker for defining species boundaries and for inferring phylogenies because it tends to have little intra-specific variation but considerable inter-specific difference. However, more samples are needed to verify the usefulness of the RPS8 (coding and non-coding regions) gene as a marker for the phylogenetic position and detection of most Babesia and Theileria species, particularly for some closely related species. PMID:24244571
Dysregulation of non-coding RNAs in gastric cancer

PubMed Central

Yang, Qing; Zhang, Ren-Wen; Sui, Peng-Cheng; He, Hai-Tao; Ding, Lei

2015-01-01

Gastric cancer (GC) is one of the most common cancers in the world and a significant threat to the health of patients, especially those from China and Japan. The prognosis for patients with late stage GC receiving the standard of care treatment, including surgery, chemotherapy and radiotherapy, remains poor. Developing novel treatment strategies, identifying new molecules for targeted therapy, and devising screening techniques to detect this cancer in its early stages are needed for GC patients. The discovery of non-coding RNAs (ncRNAs), primarily microRNAs (miRNAs) and long non-coding RNAs (lncRNAs), helped to elucidate the mechanisms of tumorigenesis, diagnosis and treatment of GC. Recently, significant research has been conducted on non-coding RNAs and how the regulatory dysfunction of these RNAs impacts the tumorigenesis of GC. In this study, we review papers published in the last five years concerning the dysregulation of non-coding RNAs, especially miRNAs and lncRNAs, in GC. We summarize instances of aberrant expression of the ncRNAs in GC and their effect on survival-related events, including cell cycle regulation, AKT signaling, apoptosis and drug resistance. Additionally, we evaluate how ncRNA dysregulation affects the metastatic process, including the epithelial-mesenchymal transition, stem cells, transcription factor activity, and oncogene and tumor suppressor expression. Lastly, we determine how ncRNAs affect angiogenesis in the microenvironment of GC. We further discuss the use of ncRNAs as potential biomarkers for use in clinical screening, early diagnosis and prognosis of GC. At present, no ideal ncRNAs have been identified as targets for the treatment of GC. PMID:26494954
Improved regulatory element prediction based on tissue-specific local epigenomic signatures

DOE Office of Scientific and Technical Information (OSTI.GOV)

He, Yupeng; Gorkin, David U.; Dickel, Diane E.

Accurate enhancer identification is critical for understanding the spatiotemporal transcriptional regulation during development as well as the functional impact of disease-related noncoding genetic variants. Computational methods have been developed to predict the genomic locations of active enhancers based on histone modifications, but the accuracy and resolution of these methods remain limited. Here, we present an algorithm, regulator y element prediction based on tissue-specific local epigenetic marks (REPTILE), which integrates histone modification and whole-genome cytosine DNA methylation profiles to identify the precise location of enhancers. We tested the ability of REPTILE to identify enhancers previously validated in reporter assays. Compared withmore » existing methods, REPTILE shows consistently superior performance across diverse cell and tissue types, and the enhancer locations are significantly more refined. We show that, by incorporating base-resolution methylation data, REPTILE greatly improves upon current methods for annotation of enhancers across a variety of cell and tissue types.« less
Improved regulatory element prediction based on tissue-specific local epigenomic signatures

PubMed Central

He, Yupeng; Gorkin, David U.; Dickel, Diane E.; Nery, Joseph R.; Castanon, Rosa G.; Lee, Ah Young; Shen, Yin; Visel, Axel; Pennacchio, Len A.; Ren, Bing; Ecker, Joseph R.

2017-01-01

Accurate enhancer identification is critical for understanding the spatiotemporal transcriptional regulation during development as well as the functional impact of disease-related noncoding genetic variants. Computational methods have been developed to predict the genomic locations of active enhancers based on histone modifications, but the accuracy and resolution of these methods remain limited. Here, we present an algorithm, regulatory element prediction based on tissue-specific local epigenetic marks (REPTILE), which integrates histone modification and whole-genome cytosine DNA methylation profiles to identify the precise location of enhancers. We tested the ability of REPTILE to identify enhancers previously validated in reporter assays. Compared with existing methods, REPTILE shows consistently superior performance across diverse cell and tissue types, and the enhancer locations are significantly more refined. We show that, by incorporating base-resolution methylation data, REPTILE greatly improves upon current methods for annotation of enhancers across a variety of cell and tissue types. REPTILE is available at https://github.com/yupenghe/REPTILE/. PMID:28193886
Improved regulatory element prediction based on tissue-specific local epigenomic signatures

DOE PAGES

He, Yupeng; Gorkin, David U.; Dickel, Diane E.; ...

2017-02-13

Accurate enhancer identification is critical for understanding the spatiotemporal transcriptional regulation during development as well as the functional impact of disease-related noncoding genetic variants. Computational methods have been developed to predict the genomic locations of active enhancers based on histone modifications, but the accuracy and resolution of these methods remain limited. Here, we present an algorithm, regulator y element prediction based on tissue-specific local epigenetic marks (REPTILE), which integrates histone modification and whole-genome cytosine DNA methylation profiles to identify the precise location of enhancers. We tested the ability of REPTILE to identify enhancers previously validated in reporter assays. Compared withmore » existing methods, REPTILE shows consistently superior performance across diverse cell and tissue types, and the enhancer locations are significantly more refined. We show that, by incorporating base-resolution methylation data, REPTILE greatly improves upon current methods for annotation of enhancers across a variety of cell and tissue types.« less
RNA Polymerase III promoter screen uncovers a novel noncoding RNA family conserved in Caenorhabditis and other clade V nematodes.

PubMed

Gruber, Andreas R

2014-07-10

RNA Polymerase III is a highly specialized enzyme complex responsible for the transcription of a very distinct set of housekeeping noncoding RNAs including tRNAs, 7SK snRNA, Y RNAs, U6 snRNA, and the RNA components of RNaseP and RNaseMRP. In this work we have utilized the conserved promoter structure of known RNA Polymerase III transcripts consisting of characteristic sequence elements termed proximal sequence elements (PSE) A and B and a TATA-box to uncover a novel RNA Polymerase III-transcribed, noncoding RNA family found to be conserved in Caenorhabditis as well as other clade V nematode species. Homology search in combination with detailed sequence and secondary structure analysis revealed that members of this novel ncRNA family evolve rapidly, and only maintain a potentially functional small stem structure that links the 5' end to the very 3' end of the transcript and a small hairpin structure at the 3' end. This is most likely required for efficient transcription termination. In addition, our study revealed evidence that canonical C/D box snoRNAs are also transcribed from a PSE A-PSE B-TATA-box promoter in Caenorhabditis elegans. Copyright © 2014 Elsevier B.V. All rights reserved.
Long Non-Coding RNAs Regulating Immunity in Insects

PubMed Central

Satyavathi, Valluri; Ghosh, Rupam; Subramanian, Srividya

2017-01-01

Recent advances in modern technology have led to the understanding that not all genetic information is coded into protein and that the genomes of each and every organism including insects produce non-coding RNAs that can control different biological processes. Among RNAs identified in the last decade, long non-coding RNAs (lncRNAs) represent a repertoire of a hidden layer of internal signals that can regulate gene expression in physiological, pathological, and immunological processes. Evidence shows the importance of lncRNAs in the regulation of host–pathogen interactions. In this review, an attempt has been made to view the role of lncRNAs regulating immune responses in insects. PMID:29657286
Ectopic recombination between Ty elements in Saccharomyces cerevisiae is not induced by DNA damage.

PubMed

Parket, A; Kupiec, M

1992-10-01

Mitotic recombination is increased when cells are treated with a variety of physical and chemical agents that cause damage to their DNA. We show here, using Saccharomyces cerevisiae strains that carry marked Ty elements, that recombination between members of this family of retrotransposons is not increased by UV irradiation or by treatment with the radiomimetic drug methyl methanesulfonate. Both ectopic recombination and mutation events were elevated by these agents for non-Ty sequences in the same strain. We discuss possible mechanisms that can prevent the induction of recombination between Ty elements.
Using FAIRE (Formaldehyde-Assisted Isolation of Regulatory Elements) to isolate active regulatory DNA

PubMed Central

Simon, Jeremy M.; Giresi, Paul G.; Davis, Ian J.; Lieb, Jason D.

2013-01-01

Eviction or destabilization of nucleosomes from chromatin is a hallmark of functional regulatory elements of the eukaryotic genome. Historically identified by nuclease hypersensitivity, these regulatory elements are typically bound by transcription factors or other regulatory proteins. FAIRE (Formaldehyde-Assisted Isolation of Regulatory Elements) is an alternative approach to identify these genomic regions and has proven successful in a multitude of eukaryotic cell and tissue types. Cells or dissociated tissues are crosslinked briefly with formaldehyde, lysed, and sonicated. Sheared chromatin is subjected to phenol-chloroform extraction and the isolated DNA, typically encompassing 1–3% of the human genome, is purified. We provide guidelines for quantitative analysis by PCR, microarrays, or next-generation sequencing. Regulatory elements enriched by FAIRE display high concordance with those identified by nuclease hypersensitivity or ChIP, and the entire procedure can be completed in three days. FAIRE exhibits low technical variability, which allows its use in large-scale studies of chromatin from normal or diseased tissues. PMID:22262007
A comparative genomics strategy for targeted discovery of single-nucleotide polymorphisms and conserved-noncoding sequences in orphan crops.

PubMed

Feltus, F A; Singh, H P; Lohithaswa, H C; Schulze, S R; Silva, T D; Paterson, A H

2006-04-01

Completed genome sequences provide templates for the design of genome analysis tools in orphan species lacking sequence information. To demonstrate this principle, we designed 384 PCR primer pairs to conserved exonic regions flanking introns, using Sorghum/Pennisetum expressed sequence tag alignments to the Oryza genome. Conserved-intron scanning primers (CISPs) amplified single-copy loci at 37% to 80% success rates in taxa that sample much of the approximately 50-million years of Poaceae divergence. While the conserved nature of exons fostered cross-taxon amplification, the lesser evolutionary constraints on introns enhanced single-nucleotide polymorphism detection. For example, in eight rice (Oryza sativa) genotypes, polymorphism averaged 12.1 per kb in introns but only 3.6 per kb in exons. Curiously, among 124 CISPs evaluated across Oryza, Sorghum, Pennisetum, Cynodon, Eragrostis, Zea, Triticum, and Hordeum, 23 (18.5%) seemed to be subject to rigid intron size constraints that were independent of per-nucleotide DNA sequence variation. Furthermore, we identified 487 conserved-noncoding sequence motifs in 129 CISP loci. A large CISP set (6,062 primer pairs, amplifying introns from 1,676 genes) designed using an automated pipeline showed generally higher abundance in recombinogenic than in nonrecombinogenic regions of the rice genome, thus providing relatively even distribution along genetic maps. CISPs are an effective means to explore poorly characterized genomes for both DNA polymorphism and noncoding sequence conservation on a genome-wide or candidate gene basis, and also provide anchor points for comparative genomics across a diverse range of species.
Turnover of R1 (Type I) and R2 (Type Ii) Retrotransposable Elements in the Ribosomal DNA of Drosophila Melanogaster

PubMed Central

Jakubczak, J. L.; Zenni, M. K.; Woodruff, R. C.; Eickbush, T. H.

1992-01-01

R1 and R2 are distantly related non-long terminal repeat retrotransposable elements each of which inserts into a specific site in the 28S rRNA genes of most insects. We have analyzed aspects of R1 and R2 abundance and sequence variation in 27 geographical isolates of Drosophila melanogaster. The fraction of 28S rRNA genes containing these elements varied greatly between strains, 17-67% for R1 elements and 2-28% for R2 elements. The total percentage of the rDNA repeats inserted ranged from 32 to 77%. The fraction of the rDNA repeats that contained both of these elements suggested that R1 and R2 exhibit neither an inhibition of nor preference for insertion into a 28S gene already containing the other type of element. Based on the conservation of restriction sites in the elements of all strains, and sequence analysis of individual elements from three strains, nucleotide divergence is very low for R1 and R2 elements within or between strains (<0.6%). This sequence uniformity is the expected result of the forces of concerted evolution (unequal crossovers and gene conversion) which act on the rRNA genes themselves. Evidence for the role of retrotransposition in the turnover of R1 and R2 was obtained by using naturally occurring 5' length polymorphisms of the elements as markers for independent transposition events. The pattern of these different length 5' truncations of R1 and R2 was found to be diverse and unique to most strains analyzed. Because recombination can only, with time, amplify or eliminate those length variants already present, the diversity found in each strain suggests that retrotransposition has played a critical role in maintaining these elements in the rDNA repeats of D. melanogaster. PMID:1317313
Control of seed dormancy in Arabidopsis by a cis-acting noncoding antisense transcript.

PubMed

Fedak, Halina; Palusinska, Malgorzata; Krzyczmonik, Katarzyna; Brzezniak, Lien; Yatusevich, Ruslan; Pietras, Zbigniew; Kaczanowski, Szymon; Swiezewski, Szymon

2016-11-29

Seed dormancy is one of the most crucial process transitions in a plant's life cycle. Its timing is tightly controlled by the expression level of the Delay of Germination 1 gene (DOG1). DOG1 is the major quantitative trait locus for seed dormancy in Arabidopsis and has been shown to control dormancy in many other plant species. This is reflected by the evolutionary conservation of the functional short alternatively polyadenylated form of the DOG1 mRNA. Notably, the 3' region of DOG1, including the last exon that is not included in this transcript isoform, shows a high level of conservation at the DNA level, but the encoded polypeptide is poorly conserved. Here, we demonstrate that this region of DOG1 contains a promoter for the transcription of a noncoding antisense RNA, asDOG1, that is 5' capped, polyadenylated, and relatively stable. This promoter is autonomous and asDOG1 has an expression profile that is different from known DOG1 transcripts. Using several approaches we show that asDOG1 strongly suppresses DOG1 expression during seed maturation in cis, but is unable to do so in trans Therefore, the negative regulation of seed dormancy by asDOG1 in cis results in allele-specific suppression of DOG1 expression and promotes germination. Given the evolutionary conservation of the asDOG1 promoter, we propose that this cis-constrained noncoding RNA-mediated mechanism limiting the duration of seed dormancy functions across the Brassicaceae.
Identification of functional elements and regulatory circuits by Drosophila modENCODE

DOE Office of Scientific and Technical Information (OSTI.GOV)

Roy, Sushmita; Ernst, Jason; Kharchenko, Peter V.

2010-12-22

To gain insight into how genomic information is translated into cellular and developmental programs, the Drosophila model organism Encyclopedia of DNA Elements (modENCODE) project is comprehensively mapping transcripts, histone modifications, chromosomal proteins, transcription factors, replication proteins and intermediates, and nucleosome properties across a developmental time course and in multiple cell lines. We have generated more than 700 data sets and discovered protein-coding, noncoding, RNA regulatory, replication, and chromatin elements, more than tripling the annotated portion of the Drosophila genome. Correlated activity patterns of these elements reveal a functional regulatory network, which predicts putative new functions for genes, reveals stage- andmore » tissue-specific regulators, and enables gene-expression prediction. Our results provide a foundation for directed experimental and computational studies in Drosophila and related species and also a model for systematic data integration toward comprehensive genomic and functional annotation. Several years after the complete genetic sequencing of many species, it is still unclear how to translate genomic information into a functional map of cellular and developmental programs. The Encyclopedia of DNA Elements (ENCODE) (1) and model organism ENCODE (modENCODE) (2) projects use diverse genomic assays to comprehensively annotate the Homo sapiens (human), Drosophila melanogaster (fruit fly), and Caenorhabditis elegans (worm) genomes, through systematic generation and computational integration of functional genomic data sets. Previous genomic studies in flies have made seminal contributions to our understanding of basic biological mechanisms and genome functions, facilitated by genetic, experimental, computational, and manual annotation of the euchromatic and heterochromatic genome (3), small genome size, short life cycle, and a deep knowledge of development, gene function, and chromosome biology. The
Characterization of human glucocorticoid receptor complexes formed with DNA fragments containing or lacking glucocorticoid response elements

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tully, D.B.; Cidlowski, J.A.

1989-03-07

Sucrose density gradient shift assays were used to study the interactions of human glucocorticoid receptors (GR) with small DNA fragments either containing or lacking glucocorticoid response element (GRE) DNA consensus sequences. When crude cytoplasmic extracts containing ({sup 3}H)triamcinolone acetonide (({sup 3}H)TA) labeled GR were incubated with unlabeled DNA under conditions of DNA excess, a GRE-containing DNA fragment obtained from the 5' long terminal repeat of mouse mammary tumor virus (MMTV LTR) formed a stable 12-16S complex with activated, but not nonactivated, ({sup 3}H)TA receptor. By contrast, if the cytosols were treated with calf thymus DNA-cellulose to deplete non-GR-DNA-binding proteins priormore » to heat activation, a smaller 7-10S complex was formed with the MMTV LTR DNA fragment. Activated ({sup 3}H)TA receptor from DNA-cellulose pretreated cytosols also interacted with two similarly sized fragments from pBR322 DNA. Stability of the complexes formed between GR and these three DNA fragments was strongly affected by even moderate alterations in either the salt concentration or the pH of the gradient buffer. Under all conditions tested, the complex formed with the MMTV LTR DNA fragment was more stable than the complexes formed with either of the pBR322 DNA fragments. Together these observations indicate that the formation of stable complexes between activated GR and isolated DNA fragments requires the presence of GRE consensus sequences in the DNA.« less
Intricate and Cell Type-Specific Populations of Endogenous Circular DNA (eccDNA) in Caenorhabditis elegans and Homo sapiens.

PubMed

Shoura, Massa J; Gabdank, Idan; Hansen, Loren; Merker, Jason; Gotlib, Jason; Levene, Stephen D; Fire, Andrew Z

2017-10-05

Investigations aimed at defining the 3D configuration of eukaryotic chromosomes have consistently encountered an endogenous population of chromosome-derived circular genomic DNA, referred to as extrachromosomal circular DNA (eccDNA). While the production, distribution, and activities of eccDNAs remain understudied, eccDNA formation from specific regions of the linear genome has profound consequences on the regulatory and coding capabilities for these regions. Here, we define eccDNA distributions in Caenorhabditis elegans and in three human cell types, utilizing a set of DNA topology-dependent approaches for enrichment and characterization. The use of parallel biophysical, enzymatic, and informatic approaches provides a comprehensive profiling of eccDNA robust to isolation and analysis methodology. Results in human and nematode systems provide quantitative analysis of the eccDNA loci at both unique and repetitive regions. Our studies converge on and support a consistent picture, in which endogenous genomic DNA circles are present in normal physiological states, and in which the circles come from both coding and noncoding genomic regions. Prominent among the coding regions generating DNA circles are several genes known to produce a diversity of protein isoforms, with mucin proteins and titin as specific examples. Copyright © 2017 Shoura et al.
A Looking-Glass of Non-Coding RNAs in Oral Cancer

PubMed Central

Irimie, Alexandra Iulia; Braicu, Cornelia; Sonea, Laura; Zimta, Alina Andreea; Diudea, Diana; Buduru, Smaranda; Berindan-Neagoe, Ioana

2017-01-01

Oral cancer is a multifactorial pathology and is characterized by the lack of efficient treatment and accurate diagnostic tools. This is mainly due the late diagnosis; therefore, reliable biomarkers for the timely detection of the disease and patient stratification are required. Non-coding RNAs (ncRNAs) are key elements in the physiological and pathological processes of various cancers, which is also reflected in oral cancer development and progression. A better understanding of their role could give a more thorough perspective on the future treatment options for this cancer type. This review offers a glimpse into the ncRNA involvement in oral cancer, which can help the medical community tap into the world of ncRNAs and lay the ground for more powerful diagnostic, prognostic and treatment tools for oral cancer that will ultimately help build a brighter future for these patients. PMID:29206174
Dnmt2 mediates intergenerational transmission of paternally acquired metabolic disorders through sperm small non-coding RNAs.

PubMed

Zhang, Yunfang; Zhang, Xudong; Shi, Junchao; Tuorto, Francesca; Li, Xin; Liu, Yusheng; Liebers, Reinhard; Zhang, Liwen; Qu, Yongcun; Qian, Jingjing; Pahima, Maya; Liu, Ying; Yan, Menghong; Cao, Zhonghong; Lei, Xiaohua; Cao, Yujing; Peng, Hongying; Liu, Shichao; Wang, Yue; Zheng, Huili; Woolsey, Rebekah; Quilici, David; Zhai, Qiwei; Li, Lei; Zhou, Tong; Yan, Wei; Lyko, Frank; Zhang, Ying; Zhou, Qi; Duan, Enkui; Chen, Qi

2018-05-01

The discovery of RNAs (for example, messenger RNAs, non-coding RNAs) in sperm has opened the possibility that sperm may function by delivering additional paternal information aside from solely providing the DNA 1 . Increasing evidence now suggests that sperm small non-coding RNAs (sncRNAs) can mediate intergenerational transmission of paternally acquired phenotypes, including mental stress 2,3 and metabolic disorders 4-6 . How sperm sncRNAs encode paternal information remains unclear, but the mechanism may involve RNA modifications. Here we show that deletion of a mouse tRNA methyltransferase, DNMT2, abolished sperm sncRNA-mediated transmission of high-fat-diet-induced metabolic disorders to offspring. Dnmt2 deletion prevented the elevation of RNA modifications (m 5 C, m 2 G) in sperm 30-40 nt RNA fractions that are induced by a high-fat diet. Also, Dnmt2 deletion altered the sperm small RNA expression profile, including levels of tRNA-derived small RNAs and rRNA-derived small RNAs, which might be essential in composing a sperm RNA 'coding signature' that is needed for paternal epigenetic memory. Finally, we show that Dnmt2-mediated m 5 C contributes to the secondary structure and biological properties of sncRNAs, implicating sperm RNA modifications as an additional layer of paternal hereditary information.
Non-coding variants contribute to the clinical heterogeneity of TTR amyloidosis.

PubMed

Iorio, Andrea; De Lillo, Antonella; De Angelis, Flavio; Di Girolamo, Marco; Luigetti, Marco; Sabatelli, Mario; Pradotto, Luca; Mauro, Alessandro; Mazzeo, Anna; Stancanelli, Claudia; Perfetto, Federico; Frusconi, Sabrina; My, Filomena; Manfellotto, Dario; Fuciarelli, Maria; Polimanti, Renato

2017-09-01

Coding mutations in TTR gene cause a rare hereditary form of systemic amyloidosis, which has a complex genotype-phenotype correlation. We investigated the role of non-coding variants in regulating TTR gene expression and consequently amyloidosis symptoms. We evaluated the genotype-phenotype correlation considering the clinical information of 129 Italian patients with TTR amyloidosis. Then, we conducted a re-sequencing of TTR gene to investigate how non-coding variants affect TTR expression and, consequently, phenotypic presentation in carriers of amyloidogenic mutations. Polygenic scores for genetically determined TTR expression were constructed using data from our re-sequencing analysis and the GTEx (Genotype-Tissue Expression) project. We confirmed a strong phenotypic heterogeneity across coding mutations causing TTR amyloidosis. Considering the effects of non-coding variants on TTR expression, we identified three patient clusters with specific expression patterns associated with certain phenotypic presentations, including late onset, autonomic neurological involvement, and gastrointestinal symptoms. This study provides novel data regarding the role of non-coding variation and the gene expression profiles in patients affected by TTR amyloidosis, also putting forth an approach that could be used to investigate the mechanisms at the basis of the genotype-phenotype correlation of the disease.
BiRen: predicting enhancers with a deep-learning-based model using the DNA sequence alone.

PubMed

Yang, Bite; Liu, Feng; Ren, Chao; Ouyang, Zhangyi; Xie, Ziwei; Bo, Xiaochen; Shu, Wenjie

2017-07-01

Enhancer elements are noncoding stretches of DNA that play key roles in controlling gene expression programmes. Despite major efforts to develop accurate enhancer prediction methods, identifying enhancer sequences continues to be a challenge in the annotation of mammalian genomes. One of the major issues is the lack of large, sufficiently comprehensive and experimentally validated enhancers for humans or other species. Thus, the development of computational methods based on limited experimentally validated enhancers and deciphering the transcriptional regulatory code encoded in the enhancer sequences is urgent. We present a deep-learning-based hybrid architecture, BiRen, which predicts enhancers using the DNA sequence alone. Our results demonstrate that BiRen can learn common enhancer patterns directly from the DNA sequence and exhibits superior accuracy, robustness and generalizability in enhancer prediction relative to other state-of-the-art enhancer predictors based on sequence characteristics. Our BiRen will enable researchers to acquire a deeper understanding of the regulatory code of enhancer sequences. Our BiRen method can be freely accessed at https://github.com/wenjiegroup/BiRen . shuwj@bmi.ac.cn or boxc@bmi.ac.cn. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

Flow cytometry sorting of nuclei enables the first global characterization of Paramecium germline DNA and transposable elements.

PubMed

Guérin, Frédéric; Arnaiz, Olivier; Boggetto, Nicole; Denby Wilkes, Cyril; Meyer, Eric; Sperling, Linda; Duharcourt, Sandra

2017-04-26

DNA elimination is developmentally programmed in a wide variety of eukaryotes, including unicellular ciliates, and leads to the generation of distinct germline and somatic genomes. The ciliate Paramecium tetraurelia harbors two types of nuclei with different functions and genome structures. The transcriptionally inactive micronucleus contains the complete germline genome, while the somatic macronucleus contains a reduced genome streamlined for gene expression. During development of the somatic macronucleus, the germline genome undergoes massive and reproducible DNA elimination events. Availability of both the somatic and germline genomes is essential to examine the genome changes that occur during programmed DNA elimination and ultimately decipher the mechanisms underlying the specific removal of germline-limited sequences. We developed a novel experimental approach that uses flow cell imaging and flow cytometry to sort subpopulations of nuclei to high purity. We sorted vegetative micronuclei and macronuclei during development of P. tetraurelia. We validated the method by flow cell imaging and by high throughput DNA sequencing. Our work establishes the proof of principle that developing somatic macronuclei can be sorted from a complex biological sample to high purity based on their size, shape and DNA content. This method enabled us to sequence, for the first time, the germline DNA from pure micronuclei and to identify novel transposable elements. Sequencing the germline DNA confirms that the Pgm domesticated transposase is required for the excision of all ~45,000 Internal Eliminated Sequences. Comparison of the germline DNA and unrearranged DNA obtained from PGM-silenced cells reveals that the latter does not provide a faithful representation of the germline genome. We developed a flow cytometry-based method to purify P. tetraurelia nuclei to high purity and provided quality control with flow cell imaging and high throughput DNA sequencing. We identified 61
Structure and characterization of a cDNA clone for phenylalanine ammonia-lyase from cut-injured roots of sweet potato

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tanaka, Yoshiyuki; Matsuoka, Makoto; Yamanoto, Naoki

A cDNA clone for phenylalanine ammonia-lyase (PAL) induced in wounded sweet potato (Ipomoea batatas Lam.) root was obtained by immunoscreening a cDNA library. The protein produced in Escherichia coli cells containing the plasmid pPAL02 was indistinguishable from sweet potato PAL as judged by Ouchterlony double diffusion assays. The M{sub r} of its subunit was 77,000. The cells converted ({sup 14}C)-L-phenylalanine into ({sup 14}C)-t-cinnamic acid and PAL activity was detected in the homogenate of the cells. The activity was dependent on the presence of the pPAL02 plasmid DNA. The nucleotide sequence of the cDNA contained a 2,121-base pair (bp) open-reading framemore » capable of coding for a polypeptide with 707 amino acids (M{sub r} 77,137), a 22-bp 5{prime}-noncoding region and a 207-bp 3{prime}-noncoding region. The results suggest that the insert DNA fully encoded the amino acid sequence for sweet potato PAL that is induced by wounding. Comparison of the deduced amino acid sequence with that of a PAL cDNA fragment from Phaseolus vulgaris revealed 78.9% homology. The sequence from amino acid residues 258 to 494 was highly conserved, showing 90.7% homology.« less
Heavy Chronic Intermittent Ethanol Exposure Alters Small Noncoding RNAs in Mouse Sperm and Epididymosomes.

PubMed

Rompala, Gregory R; Mounier, Anais; Wolfe, Cody M; Lin, Qishan; Lefterov, Iliya; Homanics, Gregg E

2018-01-01

While the risks of maternal alcohol abuse during pregnancy are well-established, several preclinical studies suggest that chronic preconception alcohol consumption by either parent may also have significance consequences for offspring health and development. Notably, since isogenic male mice used in these studies are not involved in gestation or rearing of offspring, the cross-generational effects of paternal alcohol exposure suggest a germline-based epigenetic mechanism. Many recent studies have demonstrated that the effects of paternal environmental exposures such as stress or malnutrition can be transmitted to the next generation via alterations to small noncoding RNAs in sperm. Therefore, we used high throughput sequencing to examine the effect of preconception ethanol on small noncoding RNAs in sperm. We found that chronic intermittent ethanol exposure altered several small noncoding RNAs from three of the major small RNA classes in sperm, tRNA-derived small RNA (tDR), mitochondrial small RNA, and microRNA. Six of the ethanol-responsive small noncoding RNAs were evaluated with RT-qPCR on a separate cohort of mice and five of the six were confirmed to be altered by chronic ethanol exposure, supporting the validity of the sequencing results. In addition to altered sperm RNA abundance, chronic ethanol exposure affected post-transcriptional modifications to sperm small noncoding RNAs, increasing two nucleoside modifications previously identified in mitochondrial tRNA. Furthermore, we found that chronic ethanol reduced epididymal expression of a tRNA methyltransferase, Nsun2 , known to directly regulate tDR biogenesis. Finally, ethanol-responsive sperm tDR are similarly altered in extracellular vesicles of the epididymis (i.e., epididymosomes), supporting the hypothesis that alterations to sperm tDR emerge in the epididymis and that epididymosomes are the primary source of small noncoding RNAs in sperm. These results add chronic ethanol to the growing list of
Heavy Chronic Intermittent Ethanol Exposure Alters Small Noncoding RNAs in Mouse Sperm and Epididymosomes

PubMed Central

Rompala, Gregory R.; Mounier, Anais; Wolfe, Cody M.; Lin, Qishan; Lefterov, Iliya; Homanics, Gregg E.

2018-01-01

While the risks of maternal alcohol abuse during pregnancy are well-established, several preclinical studies suggest that chronic preconception alcohol consumption by either parent may also have significance consequences for offspring health and development. Notably, since isogenic male mice used in these studies are not involved in gestation or rearing of offspring, the cross-generational effects of paternal alcohol exposure suggest a germline-based epigenetic mechanism. Many recent studies have demonstrated that the effects of paternal environmental exposures such as stress or malnutrition can be transmitted to the next generation via alterations to small noncoding RNAs in sperm. Therefore, we used high throughput sequencing to examine the effect of preconception ethanol on small noncoding RNAs in sperm. We found that chronic intermittent ethanol exposure altered several small noncoding RNAs from three of the major small RNA classes in sperm, tRNA-derived small RNA (tDR), mitochondrial small RNA, and microRNA. Six of the ethanol-responsive small noncoding RNAs were evaluated with RT-qPCR on a separate cohort of mice and five of the six were confirmed to be altered by chronic ethanol exposure, supporting the validity of the sequencing results. In addition to altered sperm RNA abundance, chronic ethanol exposure affected post-transcriptional modifications to sperm small noncoding RNAs, increasing two nucleoside modifications previously identified in mitochondrial tRNA. Furthermore, we found that chronic ethanol reduced epididymal expression of a tRNA methyltransferase, Nsun2, known to directly regulate tDR biogenesis. Finally, ethanol-responsive sperm tDR are similarly altered in extracellular vesicles of the epididymis (i.e., epididymosomes), supporting the hypothesis that alterations to sperm tDR emerge in the epididymis and that epididymosomes are the primary source of small noncoding RNAs in sperm. These results add chronic ethanol to the growing list of
Paraspeckles: nuclear bodies built on long noncoding RNA

PubMed Central

Bond, Charles S.

2009-01-01

Paraspeckles are ribonucleoprotein bodies found in the interchromatin space of mammalian cell nuclei. These structures play a role in regulating the expression of certain genes in differentiated cells by nuclear retention of RNA. The core paraspeckle proteins (PSF/SFPQ, P54NRB/NONO, and PSPC1 [paraspeckle protein 1]) are members of the DBHS (Drosophila melanogaster behavior, human splicing) family. These proteins, together with the long nonprotein-coding RNA NEAT1 (MEN-ϵ/β), associate to form paraspeckles and maintain their integrity. Given the large numbers of long noncoding transcripts currently being discovered through whole transcriptome analysis, paraspeckles may be a paradigm for a class of subnuclear bodies formed around long noncoding RNA. PMID:19720872
Capturing Snapshots of APE1 Processing DNA Damage

PubMed Central

Freudenthal, Bret D.; Beard, William A.; Cuneo, Matthew J.; Dyrkheeva, Nadezhda S.; Wilson, Samuel H.

2015-01-01

DNA apurinic-apyrimidinic (AP) sites are prevalent non-coding threats to genomic stability and are processed by AP endonuclease 1 (APE1). APE1 incises the AP-site phosphodiester backbone, generating a DNA repair intermediate that is potentially cytotoxic. The molecular events of the incision reaction remain elusive due in part to limited structural information. We report multiple high-resolution human APE1:DNA structures that divulge novel features of the APE1 reaction, including the metal binding site, nucleophile, and arginine clamps that mediate product release. We also report APE1:DNA structures with a T:G mismatch 5′ to the AP-site, representing a clustered lesion occurring in methylated CpG dinucleotides. These reveal that APE1 molds the T:G mismatch into a unique Watson-Crick like geometry that distorts the active site reducing incision. These snapshots provide mechanistic clarity for APE1, while affording a rational framework to manipulate biological responses to DNA damage. PMID:26458045
Endogenous sex hormone exposure and repetitive element DNA methylation in healthy postmenopausal women.

PubMed

Boyne, Devon J; Friedenreich, Christine M; McIntyre, John B; Stanczyk, Frank Z; Courneya, Kerry S; King, Will D

2017-12-01

Epigenetic mechanisms may help to explain the complex and heterogeneous relation between sex hormones and cancer. Few studies have investigated the effects of sex hormones on epigenetic markers related to cancer risk such as levels of methylation within repetitive DNA elements. Our objective was to describe the association between endogenous sex hormone exposure and levels of LINE-1 and Alu methylation in healthy postmenopausal women. We nested a cross-sectional study within the Alberta Physical Activity and Breast Cancer Prevention Trial (2003-2006). Study participants consisted of healthy postmenopausal women who had never been diagnosed with cancer (n = 289). Sex hormone exposures included serum concentrations of estradiol, estrone, testosterone, androstenedione, and sex hormone-binding globulin. We estimated the participants' lifetime number of menstrual cycles (LNMC) as a proxy for cumulative exposure to ovarian sex hormones. Buffy coat samples were assessed for DNA methylation. Linear regression was used to model the associations of interest and to control for confounding. Both estradiol and estrone had a significant positive dose-response association with LINE-1 methylation. LNMC was associated with both LINE-1 and Alu methylation. Specifically, LNMC had a non-linear "U-shaped" association with LINE-1 methylation regardless of folate intake and a negative linear association with Alu methylation, but only amongst low folate consumers. Androgen exposure was not associated with either outcome. Current and cumulative estrogen exposure was associated with repetitive element DNA methylation in a group of healthy postmenopausal women. LINE-1 and Alu methylation may be epigenetic mechanisms through which estrogen exposure impacts cancer risk.
Foldback intercoil DNA and the mechanism of DNA transposition.

PubMed

Kim, Byung-Dong

2014-09-01

Foldback intercoil (FBI) DNA is formed by the folding back at one point of a non-helical parallel track of double-stranded DNA at as sharp as 180° and the intertwining of two double helixes within each other's major groove to form an intercoil with a diameter of 2.2 nm. FBI DNA has been suggested to mediate intra-molecular homologous recombination of a deletion and inversion. Inter-molecular homologous recombination, known as site-specific insertion, on the other hand, is mediated by the direct perpendicular approach of the FBI DNA tip, as the attP site, onto the target DNA, as the attB site. Transposition of DNA transposons involves the pairing of terminal inverted repeats and 5-7-bp tandem target duplication. FBI DNA configuration effectively explains simple as well as replicative transposition, along with the involvement of an enhancer element. The majority of diverse retrotransposable elements that employ a target site duplication mechanism is also suggested to follow the FBI DNA-mediated perpendicular insertion of the paired intercoil ends by non-homologous end-joining, together with gap filling. A genome-wide perspective of transposable elements in light of FBI DNA is discussed.
cncRNAs: Bi-functional RNAs with protein coding and non-coding functions

PubMed Central

Kumari, Pooja; Sampath, Karuna

2015-01-01

For many decades, the major function of mRNA was thought to be to provide protein-coding information embedded in the genome. The advent of high-throughput sequencing has led to the discovery of pervasive transcription of eukaryotic genomes and opened the world of RNA-mediated gene regulation. Many regulatory RNAs have been found to be incapable of protein coding and are hence termed as non-coding RNAs (ncRNAs). However, studies in recent years have shown that several previously annotated non-coding RNAs have the potential to encode proteins, and conversely, some coding RNAs have regulatory functions independent of the protein they encode. Such bi-functional RNAs, with both protein coding and non-coding functions, which we term as ‘cncRNAs’, have emerged as new players in cellular systems. Here, we describe the functions of some cncRNAs identified from bacteria to humans. Because the functions of many RNAs across genomes remains unclear, we propose that RNAs be classified as coding, non-coding or both only after careful analysis of their functions. PMID:26498036
Standing your Ground to Exoribonucleases: Function of Flavivirus Long Non-coding RNAs

PubMed Central

Charley, Phillida A.; Wilusz, Jeffrey

2015-01-01

Members of the Flaviviridae (e.g. Dengue virus, West Nile virus, and Hepatitis C virus) contain a positive-sense RNA genome that encodes a large polyprotein. It is now also clear most if not all of these viruses also produce an abundant subgenomic long non-coding RNA. These non-coding RNAs, which are called subgenomicflavivirus RNAs (sfRNAs) or Xrn1-resistant RNAs (xrRNAs), are stable decay intermediates generated from the viral genomic RNA through the stalling of the cellular exoribonuclease Xrn1 at highly structured regions. Several functions of these flavivirus long non-coding RNAs have been revealed in recent years. The generation of these sfRNAs/xrRNAs from viral transcripts results in the repression of Xrn1 and the dysregulation of cellular mRNA stability. The abundant sfRNAs also serve directly as a decoy for important cellular protein regulators of the interferon and RNA interference antiviral pathways. Thus the generation of long non-coding RNAs from flaviviruses, hepaciviruses and pestiviruses likely disrupts aspects of innate immunity and may directly contribute to viral replication, cytopathology and pathogenesis. PMID:26368052
Linear and Nonlinear Statistical Characterization of DNA

NASA Astrophysics Data System (ADS)

Norio Oiwa, Nestor; Goldman, Carla; Glazier, James

2002-03-01

We find spatial order in the distribution of protein-coding (including RNAs) and control segments of GenBank genomic sequences, irrespective of ATCG content. This is achieved by correlations, histograms, fractal dimensions and singularity spectra. Estimates of these quantities in complete nuclear genome indicate that coding sequences are long-range correlated and their disposition are self-similar (multifractal) for eukaryotes. These characteristics are absent in prokaryotes, where there are few noncoding sequences, suggesting the `junk' DNA play a relevant role to the genome structure and function. Concerning the genetic message of ATCG sequences, we build a random walk (Levy flight), using DNA symmetry arguments, where we associate A, T, C and G as left, right, down and up steps, respectively. Nonlinear analysis of mitochondrial DNA walks reveal multifractal pattern based on palindromic sequences, which fold in hairpins and loops.
Recovery and separation of rare earth elements using columns loaded with DNA-filter hybrid.

PubMed

Takahashi, Yoshio; Kondo, Kazuhiro; Miyaji, Asami; Umeo, Miyuki; Honma, Tetsuo; Asaoka, Satoshi

2012-01-01

Given that the supply of several rare earth elements (REEs) is sometimes limited, recycling REEs used in various advanced materials, such as Nd magnets, is important for realizing efficient use of REE resources. In the present work, the feasibility of using DNA for REE recovery and separation was examined, along with the identification of the binding site of REEs in DNA. In particular, a DNA-cellulose filter paper hybrid was prepared so that DNA-based materials can be used for the separation of REEs using columns loaded with DNA. N,N'-Disuccinimidyl was used as a cross-linker reagent for the fixation of DNA onto a fibrous cellulose filter. The results showed that (i) the DNA-filter hybrid has a sufficiently high affinity to adsorb REEs; (ii) the adsorption capacity was 0.182 mg/g for Nd; and (iii) the affinity of REEs for DNA was stronger for REEs with larger atomic numbers. The difference of the affinity among REEs in the third result was compared with the adsorption patterns of REEs discussed in the literature. The comparison suggests that phosphate in the DNA-filter paper hybrid was responsible for REE adsorption onto the hybrid. The results were supported by the Nd, Dy, and Lu L(III)-edge EXAFS; the REE-P shell was identified for the second neighboring atom, showing the importance of the phosphate site as REE binding sites. The difference in the affinity among REEs suggest that group separation of REEs (such as La, Ce, (Pr and Nd), (Ho, Dy, and Er), (Tb and Gd), (Sm, Eu), Tm, Yb, and Lu) is possible, although complete isolation of each REE from a solution containing all REEs may be difficult. For practical applications, Nd and Fe(III) were successfully separated from a synthetic solution of Nd magnet waste using columns loaded with the DNA-filter hybrid.
Comparative analysis of human protein-coding and noncoding RNAs between brain and 10 mixed cell lines by RNA-Seq.

PubMed

Chen, Geng; Yin, Kangping; Shi, Leming; Fang, Yuanzhang; Qi, Ya; Li, Peng; Luo, Jian; He, Bing; Liu, Mingyao; Shi, Tieliu

2011-01-01

In their expression process, different genes can generate diverse functional products, including various protein-coding or noncoding RNAs. Here, we investigated the protein-coding capacities and the expression levels of their isoforms for human known genes, the conservation and disease association of long noncoding RNAs (ncRNAs) with two transcriptome sequencing datasets from human brain tissues and 10 mixed cell lines. Comparative analysis revealed that about two-thirds of the genes expressed between brain and cell lines are the same, but less than one-third of their isoforms are identical. Besides those genes specially expressed in brain and cell lines, about 66% of genes expressed in common encoded different isoforms. Moreover, most genes dominantly expressed one isoform and some genes only generated protein-coding (or noncoding) RNAs in one sample but not in another. We found 282 human genes could encode both protein-coding and noncoding RNAs through alternative splicing in the two samples. We also identified more than 1,000 long ncRNAs, and most of those long ncRNAs contain conserved elements across either 46 vertebrates or 33 placental mammals or 10 primates. Further analysis showed that some long ncRNAs differentially expressed in human breast cancer or lung cancer, several of those differentially expressed long ncRNAs were validated by RT-PCR. In addition, those validated differentially expressed long ncRNAs were found significantly correlated with certain breast cancer or lung cancer related genes, indicating the important biological relevance between long ncRNAs and human cancers. Our findings reveal that the differences of gene expression profile between samples mainly result from the expressed gene isoforms, and highlight the importance of studying genes at the isoform level for completely illustrating the intricate transcriptome.
Progressive changes in non-coding RNA profile in leucocytes with age

PubMed Central

Muñoz-Culla, Maider; Irizar, Haritz; Gorostidi, Ana; Alberro, Ainhoa; Osorio-Querejeta, Iñaki; Ruiz-Martínez, Javier; Olascoaga, Javier; de Munain, Adolfo López; Otaegui, David

2017-01-01

It has been observed that immune cell deterioration occurs in the elderly, as well as a chronic low-grade inflammation called inflammaging. These cellular changes must be driven by numerous changes in gene expression and in fact, both protein-coding and non-coding RNA expression alterations have been observed in peripheral blood mononuclear cells from elder people. In the present work we have studied the expression of small non-coding RNA (microRNA and small nucleolar RNA -snoRNA-) from healthy individuals from 24 to 79 years old. We have observed that the expression of 69 non-coding RNAs (56 microRNAs and 13 snoRNAs) changes progressively with chronological age. According to our results, the age range from 47 to 54 is critical given that it is the period when the expression trend (increasing or decreasing) of age-related small non-coding RNAs is more pronounced. Furthermore, age-related miRNAs regulate genes that are involved in immune, cell cycle and cancer-related processes, which had already been associated to human aging. Therefore, human aging could be studied as a result of progressive molecular changes, and different age ranges should be analysed to cover the whole aging process. PMID:28448962
Genomic Editing of Non-Coding RNA Genes with CRISPR/Cas9 Ushers in a Potential Novel Approach to Study and Treat Schizophrenia

PubMed Central

Zhuo, Chuanjun; Hou, Weihong; Hu, Lirong; Lin, Chongguang; Chen, Ce; Lin, Xiaodong

2017-01-01

Schizophrenia is a genetically related mental illness, in which the majority of genetic alterations occur in the non-coding regions of the human genome. In the past decade, a growing number of regulatory non-coding RNAs (ncRNAs) including microRNAs (miRNAs) and long non-coding RNAs (lncRNAs) have been identified to be strongly associated with schizophrenia. However, the studies of these ncRNAs in the pathophysiology of schizophrenia and the reverting of their genetic defects in restoration of the normal phenotype have been hampered by insufficient technology to manipulate these ncRNA genes effectively as well as a lack of appropriate animal models. Most recently, a revolutionary gene editing technology known as Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)/CRISPR-associated nuclease 9 (Cas9; CRISPR/Cas9) has been developed that enable researchers to overcome these challenges. In this review article, we mainly focus on the schizophrenia-related ncRNAs and the use of CRISPR/Cas9-mediated editing on the non-coding regions of the genomic DNA in proving causal relationship between the genetic defects and the pathophysiology of schizophrenia. We subsequently discuss the potential of translating this advanced technology into a clinical therapy for schizophrenia, although the CRISPR/Cas9 technology is currently still in its infancy and immature to put into use in the treatment of diseases. Furthermore, we suggest strategies to accelerate the pace from the bench to the bedside. This review describes the application of the powerful and feasible CRISPR/Cas9 technology to manipulate schizophrenia-associated ncRNA genes. This technology could help researchers tackle this complex health problem and perhaps other genetically related mental disorders due to the overlapping genetic alterations of schizophrenia with other mental illnesses. PMID:28217082
Interplay between cardiac transcription factors and non-coding RNAs in predisposing to atrial fibrillation.

PubMed

Mikhailov, Alexander T; Torrado, Mario

2018-05-12

There is growing evidence that putative gene regulatory networks including cardio-enriched transcription factors, such as PITX2, TBX5, ZFHX3, and SHOX2, and their effector/target genes along with downstream non-coding RNAs can play a potentially important role in the process of adaptive and maladaptive atrial rhythm remodeling. In turn, expression of atrial fibrillation-associated transcription factors is under the control of upstream regulatory non-coding RNAs. This review broadly explores gene regulatory mechanisms associated with susceptibility to atrial fibrillation-with key examples from both animal models and patients-within the context of both cardiac transcription factors and non-coding RNAs. These two systems appear to have multiple levels of cross-regulation and act coordinately to achieve effective control of atrial rhythm effector gene expression. Perturbations of a dynamic expression balance between transcription factors and corresponding non-coding RNAs can provoke the development or promote the progression of atrial fibrillation. We also outline deficiencies in current models and discuss ongoing studies to clarify remaining mechanistic questions. An understanding of the function of transcription factors and non-coding RNAs in gene regulatory networks associated with atrial fibrillation risk will enable the development of innovative therapeutic strategies.
Hyperosmotic stress memory in Arabidopsis is mediated by distinct epigenetically labile sites in the genome and is restricted in the male germline by DNA glycosylase activity

PubMed Central

Wibowo, Anjar; Becker, Claude; Marconi, Gianpiero; Durr, Julius; Price, Jonathan; Hagmann, Jorg; Papareddy, Ranjith; Putra, Hadi; Kageyama, Jorge; Becker, Jorg; Weigel, Detlef; Gutierrez-Marcos, Jose

2016-01-01

Inducible epigenetic changes in eukaryotes are believed to enable rapid adaptation to environmental fluctuations. We have found distinct regions of the Arabidopsis genome that are susceptible to DNA (de)methylation in response to hyperosmotic stress. The stress-induced epigenetic changes are associated with conditionally heritable adaptive phenotypic stress responses. However, these stress responses are primarily transmitted to the next generation through the female lineage due to widespread DNA glycosylase activity in the male germline, and extensively reset in the absence of stress. Using the CNI1/ATL31 locus as an example, we demonstrate that epigenetically targeted sequences function as distantly-acting control elements of antisense long non-coding RNAs, which in turn regulate targeted gene expression in response to stress. Collectively, our findings reveal that plants use a highly dynamic maternal ‘short-term stress memory’ with which to respond to adverse external conditions. This transient memory relies on the DNA methylation machinery and associated transcriptional changes to extend the phenotypic plasticity accessible to the immediate offspring. DOI: http://dx.doi.org/10.7554/eLife.13546.001 PMID:27242129
Islet Long Noncoding RNAs: A Playbook for Discovery and Characterization.

PubMed

Singer, Ruth A; Sussel, Lori

2018-06-24

Diabetes is a complex group of metabolic disorders that can be accompanied by several comorbidities, including increased risk of early death. Decades of diabetes research have elucidated many genetic drivers of normal islet function and dysfunction; however, a lack of suitable treatment options suggests our knowledge about the disease remains incomplete. The establishment of long noncoding RNAs (lncRNAs), once dismissed as "junk" DNA, as essential gene regulators in many biological processes has redefined the central role for RNA in cells. Studies showing that misregulation of lncRNAs can lead to disease have contributed to the emergence of lncRNAs as attractive candidates for drug targeting. These findings underscore the need to reexamine islet biology in the context of a regulatory role for RNA. This review will 1 ) highlight what is known about lncRNAs in the context of diabetes, 2 ) summarize the strategies used in lncRNA discovery pipelines, and 3 ) discuss future directions and the potential impact of studying the role of lncRNAs in diabetes. © 2018 by the American Diabetes Association.
Open chromatin reveals the functional maize genome

USDA-ARS?s Scientific Manuscript database

Every cellular process mediated through nuclear DNA must contend with chromatin. As results from ENCODE show, open chromatin assays can efficiently integrate across diverse regulatory elements, revealing functional non-coding genome. In this study, we use a MNase hypersensitivity assay to discover o...
Nuclear Proximity of Mtr4 with RNA exosome restricts DNA mutational asymmetry

PubMed Central

Lim, Junghyun; Giri, Pankaj Kumar; Kazadi, David; Laffleur, Brice; Zhang, Wanwei; Grinstein, Veronika; Pefanis, Evangelos; Brown, Lewis M.; Ladewig, Erik; Martin, Ophélie; Chen, Yuling; Rabadan, Raul; Boyer, François; Rothschild, Gerson; Cogné, Michel; Pinaud, Eric; Deng, Haiteng; Basu, Uttiya

2017-01-01

SUMMARY The distribution of sense and antisense strand DNA mutations on transcribed duplex DNA contributes to the development of immune and neural systems along with the progression of cancer. Because developmentally matured B cells undergo biologically programmed strand-specific DNA mutagenesis at focal DNA/RNA hybrid structures, they make a convenient system to investigate strand-specific mutagenesis mechanisms. We demonstrate that the sense and antisense strand DNA mutagenesis at the immunoglobulin heavy chain locus and some other regions of the B cell genome depends upon localized RNA processing protein complex formation in the nucleus. Both the physical proximity and coupled activities of RNA helicase Mtr4 (and Senataxin) with the noncoding RNA processing function of RNA exosome determine the strand specific distribution of DNA mutations. Our study suggests that strand-specific DNA mutagenesis-associated mechanisms will play major roles in other undiscovered aspects of organismic development. PMID:28431250

Differential expression and emerging functions of non-coding RNAs in cold adaptation.

PubMed

Frigault, Jacques J; Morin, Mathieu D; Morin, Pier Jr

2017-01-01

Several species undergo substantial physiological and biochemical changes to confront the harsh conditions associated with winter. Small mammalian hibernators and cold-hardy insects are examples of natural models of cold adaptation that have been amply explored. While the molecular picture associated with cold adaptation has started to become clearer in recent years, notably through the use of high-throughput experimental approaches, the underlying cold-associated functions attributed to several non-coding RNAs, including microRNAs (miRNAs) and long non-coding RNAs (lncRNAs), remain to be better characterized. Nevertheless, key pioneering work has provided clues on the likely relevance of these molecules in cold adaptation. With an emphasis on mammalian hibernation and insect cold hardiness, this work first reviews various molecular changes documented so far in these processes. The cascades leading to miRNA and lncRNA production as well as the mechanisms of action of these non-coding RNAs are subsequently described. Finally, we present examples of differentially expressed non-coding RNAs in models of cold adaptation and elaborate on the potential significance of this modulation with respect to low-temperature adaptation.
Scaling in nature: From DNA through heartbeats to weather

NASA Astrophysics Data System (ADS)

Havlin, S.; Buldyrev, S. V.; Bunde, A.; Goldberger, A. L.; Ivanov, P. Ch.; Peng, C.-K.; Stanley, H. E.

1999-12-01

The purpose of this talk is to describe some recent progress in applying scaling concepts to various systems in nature. We review several systems characterized by scaling laws such as DNA sequences, heartbeat rates and weather variations. We discuss the finding that the exponent α quantifying the scaling in DNA in smaller for coding than for noncoding sequences. We also discuss the application of fractal scaling analysis to the dynamics of heartbeat regulation, and report the recent finding that the scaling exponent α is smaller during sleep periods compared to wake periods. We also discuss the recent findings that suggest a universal scaling exponent characterizing the weather fluctuations.
Scaling in nature: from DNA through heartbeats to weather

NASA Technical Reports Server (NTRS)

Havlin, S.; Buldyrev, S. V.; Bunde, A.; Goldberger, A. L.; Peng, C. K.; Stanley, H. E.

1999-01-01

The purpose of this report is to describe some recent progress in applying scaling concepts to various systems in nature. We review several systems characterized by scaling laws such as DNA sequences, heartbeat rates and weather variations. We discuss the finding that the exponent alpha quantifying the scaling in DNA in smaller for coding than for noncoding sequences. We also discuss the application of fractal scaling analysis to the dynamics of heartbeat regulation, and report the recent finding that the scaling exponent alpha is smaller during sleep periods compared to wake periods. We also discuss the recent findings that suggest a universal scaling exponent characterizing the weather fluctuations.
A Comparative Genomics Strategy for Targeted Discovery of Single-Nucleotide Polymorphisms and Conserved-Noncoding Sequences in Orphan Crops1[W

PubMed Central

Feltus, F.A.; Singh, H.P.; Lohithaswa, H.C.; Schulze, S.R.; Silva, T.D.; Paterson, A.H.

2006-01-01

Completed genome sequences provide templates for the design of genome analysis tools in orphan species lacking sequence information. To demonstrate this principle, we designed 384 PCR primer pairs to conserved exonic regions flanking introns, using Sorghum/Pennisetum expressed sequence tag alignments to the Oryza genome. Conserved-intron scanning primers (CISPs) amplified single-copy loci at 37% to 80% success rates in taxa that sample much of the approximately 50-million years of Poaceae divergence. While the conserved nature of exons fostered cross-taxon amplification, the lesser evolutionary constraints on introns enhanced single-nucleotide polymorphism detection. For example, in eight rice (Oryza sativa) genotypes, polymorphism averaged 12.1 per kb in introns but only 3.6 per kb in exons. Curiously, among 124 CISPs evaluated across Oryza, Sorghum, Pennisetum, Cynodon, Eragrostis, Zea, Triticum, and Hordeum, 23 (18.5%) seemed to be subject to rigid intron size constraints that were independent of per-nucleotide DNA sequence variation. Furthermore, we identified 487 conserved-noncoding sequence motifs in 129 CISP loci. A large CISP set (6,062 primer pairs, amplifying introns from 1,676 genes) designed using an automated pipeline showed generally higher abundance in recombinogenic than in nonrecombinogenic regions of the rice genome, thus providing relatively even distribution along genetic maps. CISPs are an effective means to explore poorly characterized genomes for both DNA polymorphism and noncoding sequence conservation on a genome-wide or candidate gene basis, and also provide anchor points for comparative genomics across a diverse range of species. PMID:16607031
Identification of coding and non-coding mutational hotspots in cancer genomes.

PubMed

Piraino, Scott W; Furney, Simon J

2017-01-05

The identification of mutations that play a causal role in tumour development, so called "driver" mutations, is of critical importance for understanding how cancers form and how they might be treated. Several large cancer sequencing projects have identified genes that are recurrently mutated in cancer patients, suggesting a role in tumourigenesis. While the landscape of coding drivers has been extensively studied and many of the most prominent driver genes are well characterised, comparatively less is known about the role of mutations in the non-coding regions of the genome in cancer development. The continuing fall in genome sequencing costs has resulted in a concomitant increase in the number of cancer whole genome sequences being produced, facilitating systematic interrogation of both the coding and non-coding regions of cancer genomes. To examine the mutational landscapes of tumour genomes we have developed a novel method to identify mutational hotspots in tumour genomes using both mutational data and information on evolutionary conservation. We have applied our methodology to over 1300 whole cancer genomes and show that it identifies prominent coding and non-coding regions that are known or highly suspected to play a role in cancer. Importantly, we applied our method to the entire genome, rather than relying on predefined annotations (e.g. promoter regions) and we highlight recurrently mutated regions that may have resulted from increased exposure to mutational processes rather than selection, some of which have been identified previously as targets of selection. Finally, we implicate several pan-cancer and cancer-specific candidate non-coding regions, which could be involved in tumourigenesis. We have developed a framework to identify mutational hotspots in cancer genomes, which is applicable to the entire genome. This framework identifies known and novel coding and non-coding mutional hotspots and can be used to differentiate candidate driver regions from
A biological inspired fuzzy adaptive window median filter (FAWMF) for enhancing DNA signal processing.

PubMed

Ahmad, Muneer; Jung, Low Tan; Bhuiyan, Al-Amin

2017-10-01

Digital signal processing techniques commonly employ fixed length window filters to process the signal contents. DNA signals differ in characteristics from common digital signals since they carry nucleotides as contents. The nucleotides own genetic code context and fuzzy behaviors due to their special structure and order in DNA strand. Employing conventional fixed length window filters for DNA signal processing produce spectral leakage and hence results in signal noise. A biological context aware adaptive window filter is required to process the DNA signals. This paper introduces a biological inspired fuzzy adaptive window median filter (FAWMF) which computes the fuzzy membership strength of nucleotides in each slide of window and filters nucleotides based on median filtering with a combination of s-shaped and z-shaped filters. Since coding regions cause 3-base periodicity by an unbalanced nucleotides' distribution producing a relatively high bias for nucleotides' usage, such fundamental characteristic of nucleotides has been exploited in FAWMF to suppress the signal noise. Along with adaptive response of FAWMF, a strong correlation between median nucleotides and the Π shaped filter was observed which produced enhanced discrimination between coding and non-coding regions contrary to fixed length conventional window filters. The proposed FAWMF attains a significant enhancement in coding regions identification i.e. 40% to 125% as compared to other conventional window filters tested over more than 250 benchmarked and randomly taken DNA datasets of different organisms. This study proves that conventional fixed length window filters applied to DNA signals do not achieve significant results since the nucleotides carry genetic code context. The proposed FAWMF algorithm is adaptive and outperforms significantly to process DNA signal contents. The algorithm applied to variety of DNA datasets produced noteworthy discrimination between coding and non-coding regions contrary
Secondary structure of the 3'-noncoding region of flavivirus genomes: comparative analysis of base pairing probabilities.

PubMed

Rauscher, S; Flamm, C; Mandl, C W; Heinz, F X; Stadler, P F

1997-07-01

The prediction of the complete matrix of base pairing probabilities was applied to the 3' noncoding region (NCR) of flavivirus genomes. This approach identifies not only well-defined secondary structure elements, but also regions of high structural flexibility. Flaviviruses, many of which are important human pathogens, have a common genomic organization, but exhibit a significant degree of RNA sequence diversity in the functionally important 3'-NCR. We demonstrate the presence of secondary structures shared by all flaviviruses, as well as structural features that are characteristic for groups of viruses within the genus reflecting the established classification scheme. The significance of most of the predicted structures is corroborated by compensatory mutations. The availability of infectious clones for several flaviviruses will allow the assessment of these structural elements in processes of the viral life cycle, such as replication and assembly.
Population genetics and molecular evolution of DNA sequences in transposable elements. I. A simulation framework.

PubMed

Kijima, T E; Innan, Hideki

2013-11-01

A population genetic simulation framework is developed to understand the behavior and molecular evolution of DNA sequences of transposable elements. Our model incorporates random transposition and excision of transposable element (TE) copies, two modes of selection against TEs, and degeneration of transpositional activity by point mutations. We first investigated the relationships between the behavior of the copy number of TEs and these parameters. Our results show that when selection is weak, the genome can maintain a relatively large number of TEs, but most of them are less active. In contrast, with strong selection, the genome can maintain only a limited number of TEs but the proportion of active copies is large. In such a case, there could be substantial fluctuations of the copy number over generations. We also explored how DNA sequences of TEs evolve through the simulations. In general, active copies form clusters around the original sequence, while less active copies have long branches specific to themselves, exhibiting a star-shaped phylogeny. It is demonstrated that the phylogeny of TE sequences could be informative to understand the dynamics of TE evolution.
Highly conserved non-coding elements on either side of SOX9 associated with Pierre Robin sequence.

PubMed

Benko, Sabina; Fantes, Judy A; Amiel, Jeanne; Kleinjan, Dirk-Jan; Thomas, Sophie; Ramsay, Jacqueline; Jamshidi, Negar; Essafi, Abdelkader; Heaney, Simon; Gordon, Christopher T; McBride, David; Golzio, Christelle; Fisher, Malcolm; Perry, Paul; Abadie, Véronique; Ayuso, Carmen; Holder-Espinasse, Muriel; Kilpatrick, Nicky; Lees, Melissa M; Picard, Arnaud; Temple, I Karen; Thomas, Paul; Vazquez, Marie-Paule; Vekemans, Michel; Roest Crollius, Hugues; Hastie, Nicholas D; Munnich, Arnold; Etchevers, Heather C; Pelet, Anna; Farlie, Peter G; Fitzpatrick, David R; Lyonnet, Stanislas

2009-03-01

Pierre Robin sequence (PRS) is an important subgroup of cleft palate. We report several lines of evidence for the existence of a 17q24 locus underlying PRS, including linkage analysis results, a clustering of translocation breakpoints 1.06-1.23 Mb upstream of SOX9, and microdeletions both approximately 1.5 Mb centromeric and approximately 1.5 Mb telomeric of SOX9. We have also identified a heterozygous point mutation in an evolutionarily conserved region of DNA with in vitro and in vivo features of a developmental enhancer. This enhancer is centromeric to the breakpoint cluster and maps within one of the microdeletion regions. The mutation abrogates the in vitro enhancer function and alters binding of the transcription factor MSX1 as compared to the wild-type sequence. In the developing mouse mandible, the 3-Mb region bounded by the microdeletions shows a regionally specific chromatin decompaction in cells expressing Sox9. Some cases of PRS may thus result from developmental misexpression of SOX9 due to disruption of very-long-range cis-regulatory elements.
Evolutionary analysis reveals regulatory and functional landscape of coding and non-coding RNA editing.

PubMed

Zhang, Rui; Deng, Patricia; Jacobson, Dionna; Li, Jin Billy

2017-02-01

Adenosine-to-inosine RNA editing diversifies the transcriptome and promotes functional diversity, particularly in the brain. A plethora of editing sites has been recently identified; however, how they are selected and regulated and which are functionally important are largely unknown. Here we show the cis-regulation and stepwise selection of RNA editing during Drosophila evolution and pinpoint a large number of functional editing sites. We found that the establishment of editing and variation in editing levels across Drosophila species are largely explained and predicted by cis-regulatory elements. Furthermore, editing events that arose early in the species tree tend to be more highly edited in clusters and enriched in slowly-evolved neuronal genes, thus suggesting that the main role of RNA editing is for fine-tuning neurological functions. While nonsynonymous editing events have been long recognized as playing a functional role, in addition to nonsynonymous editing sites, a large fraction of 3'UTR editing sites is evolutionarily constrained, highly edited, and thus likely functional. We find that these 3'UTR editing events can alter mRNA stability and affect miRNA binding and thus highlight the functional roles of noncoding RNA editing. Our work, through evolutionary analyses of RNA editing in Drosophila, uncovers novel insights of RNA editing regulation as well as its functions in both coding and non-coding regions.
Evolutionary analysis reveals regulatory and functional landscape of coding and non-coding RNA editing

PubMed Central

Jacobson, Dionna

2017-01-01

Adenosine-to-inosine RNA editing diversifies the transcriptome and promotes functional diversity, particularly in the brain. A plethora of editing sites has been recently identified; however, how they are selected and regulated and which are functionally important are largely unknown. Here we show the cis-regulation and stepwise selection of RNA editing during Drosophila evolution and pinpoint a large number of functional editing sites. We found that the establishment of editing and variation in editing levels across Drosophila species are largely explained and predicted by cis-regulatory elements. Furthermore, editing events that arose early in the species tree tend to be more highly edited in clusters and enriched in slowly-evolved neuronal genes, thus suggesting that the main role of RNA editing is for fine-tuning neurological functions. While nonsynonymous editing events have been long recognized as playing a functional role, in addition to nonsynonymous editing sites, a large fraction of 3’UTR editing sites is evolutionarily constrained, highly edited, and thus likely functional. We find that these 3’UTR editing events can alter mRNA stability and affect miRNA binding and thus highlight the functional roles of noncoding RNA editing. Our work, through evolutionary analyses of RNA editing in Drosophila, uncovers novel insights of RNA editing regulation as well as its functions in both coding and non-coding regions. PMID:28166241
MicroRNAs in large herpesvirus DNA genomes: recent advances.

PubMed

Sorel, Océane; Dewals, Benjamin G

2016-08-01

MicroRNAs (miRNAs) are small non-coding RNAs (ncRNAs) that regulate gene expression. They alter mRNA translation through base-pair complementarity, leading to regulation of genes during both physiological and pathological processes. Viruses have evolved mechanisms to take advantage of the host cells to multiply and/or persist over the lifetime of the host. Herpesviridae are a large family of double-stranded DNA viruses that are associated with a number of important diseases, including lymphoproliferative diseases. Herpesviruses establish lifelong latent infections through modulation of the interface between the virus and its host. A number of reports have identified miRNAs in a very large number of human and animal herpesviruses suggesting that these short non-coding transcripts could play essential roles in herpesvirus biology. This review will specifically focus on the recent advances on the functions of herpesvirus miRNAs in infection and pathogenesis.
The Hippo pathway in hepatocellular carcinoma: Non-coding RNAs in action.

PubMed

Shi, Xuan; Zhu, Hai-Rong; Liu, Tao-Tao; Shen, Xi-Zhong; Zhu, Ji-Min

2017-08-01

Hepatocellular carcinoma (HCC) is the sixth most common cancer and the third leading cause of cancer-related death worldwide. However, current strategies curing HCC are far from satisfaction. The Hippo pathway is an evolutionarily conserved tumor suppressive pathway that plays crucial roles in organ size control and tissue homeostasis. Its dysregulation is commonly observed in various types of cancer including HCC. Recently, the prominent role of non-coding RNAs in the Hippo pathway during normal development and neoplastic progression is also emerging in liver. Thus, further investigation into the regulatory network between non-coding RNAs and the Hippo pathway and their connections with HCC may provide new therapeutic avenues towards developing an effective preventative or perhaps curative treatment for HCC. Herein we summarize the role of non-coding RNAs in the Hippo pathway, with an emphasis on their contribution to carcinogenesis, diagnosis, treatment and prognosis of HCC. Copyright © 2017 Elsevier B.V. All rights reserved.
RNA expression in a cartilaginous fish cell line reveals ancient 3′ noncoding regions highly conserved in vertebrates

PubMed Central

Forest, David; Nishikawa, Ryuhei; Kobayashi, Hiroshi; Parton, Angela; Bayne, Christopher J.; Barnes, David W.

2007-01-01

We have established a cartilaginous fish cell line [Squalus acanthias embryo cell line (SAE)], a mesenchymal stem cell line derived from the embryo of an elasmobranch, the spiny dogfish shark S. acanthias. Elasmobranchs (sharks and rays) first appeared >400 million years ago, and existing species provide useful models for comparative vertebrate cell biology, physiology, and genomics. Comparative vertebrate genomics among evolutionarily distant organisms can provide sequence conservation information that facilitates identification of critical coding and noncoding regions. Although these genomic analyses are informative, experimental verification of functions of genomic sequences depends heavily on cell culture approaches. Using ESTs defining mRNAs derived from the SAE cell line, we identified lengthy and highly conserved gene-specific nucleotide sequences in the noncoding 3′ UTRs of eight genes involved in the regulation of cell growth and proliferation. Conserved noncoding 3′ mRNA regions detected by using the shark nucleotide sequences as a starting point were found in a range of other vertebrate orders, including bony fish, birds, amphibians, and mammals. Nucleotide identity of shark and human in these regions was remarkably well conserved. Our results indicate that highly conserved gene sequences dating from the appearance of jawed vertebrates and representing potential cis-regulatory elements can be identified through the use of cartilaginous fish as a baseline. Because the expression of genes in the SAE cell line was prerequisite for their identification, this cartilaginous fish culture system also provides a physiologically valid tool to test functional hypotheses on the role of these ancient conserved sequences in comparative cell biology. PMID:17227856
Capturing snapshots of APE1 processing DNA damage

DOE PAGES

Freudenthal, Bret D.; Beard, William A.; Cuneo, Matthew J.; ...

2015-10-12

DNA apurinic-apyrimidinic (AP) sites are prevalent noncoding threats to genomic stability and are processed by AP endonuclease 1 (APE1). APE1 incises the AP-site phosphodiester backbone, generating a DNA-repair intermediate that is potentially cytotoxic. The molecular events of the incision reaction remain elusive, owing in part to limited structural information. Here we report multiple high-resolution human APE1-DNA structures that divulge new features of the APE1 reaction, including the metal-binding site, the nucleophile and the arginine clamps that mediate product release. We also report APE1-DNA structures with a T-G mismatch 5' to the AP site, representing a clustered lesion occurring in methylatedmore » CpG dinucleotides. Moreover, these structures reveal that APE1 molds the T-G mismatch into a unique Watson-Crick-like geometry that distorts the active site, thus reducing incision. Finally, these snapshots provide mechanistic clarity for APE1 while affording a rational framework to manipulate biological responses to DNA damage.« less
Spontaneous and engineered deletions in the 3' noncoding region of tick-borne encephalitis virus: construction of highly attenuated mutants of a flavivirus.

PubMed

Mandl, C W; Holzmann, H; Meixner, T; Rauscher, S; Stadler, P F; Allison, S L; Heinz, F X

1998-03-01

The flavivirus genome is a positive-strand RNA molecule containing a single long open reading frame flanked by noncoding regions (NCR) that mediate crucial processes of the viral life cycle. The 3' NCR of tick-borne encephalitis (TBE) virus can be divided into a variable region that is highly heterogeneous in length among strains of TBE virus and in certain cases includes an internal poly(A) tract and a 3'-terminal conserved core element that is believed to fold as a whole into a well-defined secondary structure. We have now investigated the genetic stability of the TBE virus 3' NCR and its influence on viral growth properties and virulence. We observed spontaneous deletions in the variable region during growth of TBE virus in cell culture and in mice. These deletions varied in size and location but always included the internal poly(A) element of the TBE virus 3' NCR and never extended into the conserved 3'-terminal core element. Subsequently, we constructed specific deletion mutants by using infectious cDNA clones with the entire variable region and increasing segments of the core element removed. A virus mutant lacking the entire variable region was indistinguishable from wild-type virus with respect to cell culture growth properties and virulence in the mouse model. In contrast, even small extensions of the deletion into the core element led to significant biological effects. Deletions extending to nucleotides 10826, 10847, and 10870 caused distinct attenuation in mice without measurable reduction of cell culture growth properties, which, however, were significantly restricted when the deletion was extended to nucleotide 10919. An even larger deletion (to nucleotide 10994) abolished viral viability. In spite of their high degree of attenuation, these mutants efficiently induced protective immune responses even at low inoculation doses. Thus, 3'-NCR deletions represent a useful technique for achieving stable attenuation of flaviviruses that can be included in the
Regulation of Immunoglobulin Class-Switch Recombination: Choreography of Noncoding Transcription, Targeted DNA Deamination, and Long-Range DNA Repair

PubMed Central

Matthews, Allysia J.; Zheng, Simin; DiMenna, Lauren J.; Chaudhuri, Jayanta

2014-01-01

Upon encountering antigens, mature IgM-positive B lymphocytes undergo class-switch recombination (CSR) wherein exons encoding the default Cμ constant coding gene segment of the immunoglobulin (Ig) heavy-chain (Igh) locus are excised and replaced with a new constant gene segment (referred to as “Ch genes”, e.g., Cγ, Cε, or Cα). The B cell thereby changes from expressing IgM to one producing IgG, IgE, or IgA, with each antibody isotype having a different effector function during an immune reaction. CSR is a DNA deletional-recombination reaction that proceeds through the generation of DNA double-strand breaks (DSBs) in repetitive switch (S) sequences preceding each Ch gene and is completed by end-joining between donor Sμ and acceptor S regions. CSR is a multistep reaction requiring transcription through S regions, the DNA cytidine deaminase AID, and the participation of several general DNA repair pathways including base excision repair, mismatch repair, and classical nonhomologous end-joining. In this review, we discuss our current understanding of how transcription through S regions generates substrates for AID-mediated deamination and how AID participates not only in the initiation of CSR but also in the conversion of deaminated residues into DSBs. Additionally, we review the multiple processes that regulate AID expression and facilitate its recruitment specifically to the Ig loci, and how deregulation of AID specificity leads to oncogenic translocations. Finally, we summarize recent data on the potential role of AID in the maintenance of the pluripotent stem cell state during epigenetic reprogramming. PMID:24507154
Non-coding RNA generated following lariat-debranching mediates targeting of AID to DNA

PubMed Central

Zheng, Simin; Vuong, Bao Q.; Vaidyanathan, Bharat; Lin, Jia-Yu; Huang, Feng-Ting; Chaudhuri, Jayanta

2015-01-01

SUMMARY Transcription through immunoglobulin switch (S) regions is essential for class switch recombination (CSR) but no molecular function of the transcripts has been described. Likewise, recruitment of activation-induced cytidine deaminase (AID) to S regions is critical for CSR; however, the underlying mechanism has not been fully elucidated. Here, we demonstrate that intronic switch RNA acts in trans to target AID to S region DNA. AID binds directly to switch RNA through G-quadruplexes formed by the RNA molecules. Disruption of this interaction by mutation of a key residue in the putative RNA-binding domain of AID impairs recruitment of AID to S region DNA, thereby abolishing CSR. Additionally, inhibition of RNA lariat processing leads to loss of AID localization to S regions and compromises CSR; both defects can be rescued by exogenous expression of switch transcripts in a sequence-specific manner. These studies uncover an RNA-mediated mechanism of targeting AID to DNA. PMID:25957684
DIANA-LncBase v2: indexing microRNA targets on non-coding transcripts

PubMed Central

Paraskevopoulou, Maria D.; Vlachos, Ioannis S.; Karagkouni, Dimitra; Georgakilas, Georgios; Kanellos, Ilias; Vergoulis, Thanasis; Zagganas, Konstantinos; Tsanakas, Panayiotis; Floros, Evangelos; Dalamagas, Theodore; Hatzigeorgiou, Artemis G.

2016-01-01

microRNAs (miRNAs) are short non-coding RNAs (ncRNAs) that act as post-transcriptional regulators of coding gene expression. Long non-coding RNAs (lncRNAs) have been recently reported to interact with miRNAs. The sponge-like function of lncRNAs introduces an extra layer of complexity in the miRNA interactome. DIANA-LncBase v1 provided a database of experimentally supported and in silico predicted miRNA Recognition Elements (MREs) on lncRNAs. The second version of LncBase (www.microrna.gr/LncBase) presents an extensive collection of miRNA:lncRNA interactions. The significantly enhanced database includes more than 70 000 low and high-throughput, (in)direct miRNA:lncRNA experimentally supported interactions, derived from manually curated publications and the analysis of 153 AGO CLIP-Seq libraries. The new experimental module presents a 14-fold increase compared to the previous release. LncBase v2 hosts in silico predicted miRNA targets on lncRNAs, identified with the DIANA-microT algorithm. The relevant module provides millions of predicted miRNA binding sites, accompanied with detailed metadata and MRE conservation metrics. LncBase v2 caters information regarding cell type specific miRNA:lncRNA regulation and enables users to easily identify interactions in 66 different cell types, spanning 36 tissues for human and mouse. Database entries are also supported by accurate lncRNA expression information, derived from the analysis of more than 6 billion RNA-Seq reads. PMID:26612864
Genomic identification of regulatory elements by evolutionary sequence comparison and functional analysis.

PubMed

Loots, Gabriela G

2008-01-01

Despite remarkable recent advances in genomics that have enabled us to identify most of the genes in the human genome, comparable efforts to define transcriptional cis-regulatory elements that control gene expression are lagging behind. The difficulty of this task stems from two equally important problems: our knowledge of how regulatory elements are encoded in genomes remains elementary, and there is a vast genomic search space for regulatory elements, since most of mammalian genomes are noncoding. Comparative genomic approaches are having a remarkable impact on the study of transcriptional regulation in eukaryotes and currently represent the most efficient and reliable methods of predicting noncoding sequences likely to control the patterns of gene expression. By subjecting eukaryotic genomic sequences to computational comparisons and subsequent experimentation, we are inching our way toward a more comprehensive catalog of common regulatory motifs that lie behind fundamental biological processes. We are still far from comprehending how the transcriptional regulatory code is encrypted in the human genome and providing an initial global view of regulatory gene networks, but collectively, the continued development of comparative and experimental approaches will rapidly expand our knowledge of the transcriptional regulome.

Virulence Phenotypes of Legionella pneumophila Associated with Noncoding RNA lpr0035

PubMed Central

Jayakumar, Deepak; Early, Julie V.

2012-01-01

The Philadelphia-1 strain of Legionella pneumophila, the causative organism of Legionnaires' disease, contains a recently discovered noncoding RNA, lpr0035. lpr0035 straddles the 5′ chromosomal junction of a 45-kbp mobile genetic element, pLP45, which can exist as an episome or integrated in the bacterial chromosome. A 121-bp deletion was introduced in strain JR32, a Philadelphia-1 derivative. The deletion inactivated lpr0035, removed the 49-bp direct repeat at the 5′ junction of pLP45, and locked pLP45 in the chromosome. Intracellular multiplication of the deletion mutant was decreased by nearly 3 orders of magnitude in Acanthamoeba castellanii amoebae and nearly 2 orders of magnitude in J774 mouse macrophages. Entry of the deletion mutant into amoebae and macrophages was decreased by >70%. The level of entry in both hosts was restored to that in strain JR32 by plasmid copies of two open reading frames immediately downstream of the 5′ junction and plasmid lpr0035 driven by its endogenous promoter. When induced from a tac promoter, plasmid lpr0035 completely reversed the intracellular multiplication defect in macrophages but was without effect in amoebae. These data are the first evidence of a role for noncoding RNA lpr0035, which has homologs in six other Legionella genomes, in entry of L. pneumophila into amoebae and macrophages and in host-specific intracellular multiplication. The data also demonstrate that deletion of a direct-repeat sequence restricts the mobility of pLP45 and is a means of studying the role of pLP45 mobility in Legionella virulence phenotypes. PMID:22966048
ChIPBase: a database for decoding the transcriptional regulation of long non-coding RNA and microRNA genes from ChIP-Seq data.

PubMed

Yang, Jian-Hua; Li, Jun-Hao; Jiang, Shan; Zhou, Hui; Qu, Liang-Hu

2013-01-01

Long non-coding RNAs (lncRNAs) and microRNAs (miRNAs) represent two classes of important non-coding RNAs in eukaryotes. Although these non-coding RNAs have been implicated in organismal development and in various human diseases, surprisingly little is known about their transcriptional regulation. Recent advances in chromatin immunoprecipitation with next-generation DNA sequencing (ChIP-Seq) have provided methods of detecting transcription factor binding sites (TFBSs) with unprecedented sensitivity. In this study, we describe ChIPBase (http://deepbase.sysu.edu.cn/chipbase/), a novel database that we have developed to facilitate the comprehensive annotation and discovery of transcription factor binding maps and transcriptional regulatory relationships of lncRNAs and miRNAs from ChIP-Seq data. The current release of ChIPBase includes high-throughput sequencing data that were generated by 543 ChIP-Seq experiments in diverse tissues and cell lines from six organisms. By analysing millions of TFBSs, we identified tens of thousands of TF-lncRNA and TF-miRNA regulatory relationships. Furthermore, two web-based servers were developed to annotate and discover transcriptional regulatory relationships of lncRNAs and miRNAs from ChIP-Seq data. In addition, we developed two genome browsers, deepView and genomeView, to provide integrated views of multidimensional data. Moreover, our web implementation supports diverse query types and the exploration of TFs, lncRNAs, miRNAs, gene ontologies and pathways.
LncRNA-DANCR: A valuable cancer related long non-coding RNA for human cancers.

PubMed

Thin, Khaing Zar; Liu, Xuefang; Feng, Xiaobo; Raveendran, Sudheesh; Tu, Jian Cheng

2018-06-01

Long noncoding RNAs (lncRNA) are a type of noncoding RNA that comprise of longer than 200 nucleotides sequences. They can regulate chromosome structure, gene expression and play an essential role in the pathophysiology of human diseases, especially in tumorigenesis and progression. Nowadays, they are being targeted as potential biomarkers for various cancer types. And many research studies have proven that lncRNAs might bring a new era to cancer diagnosis and support treatment management. The purpose of this review was to inspect the molecular mechanism and clinical significance of long non-coding RNA- differentiation antagonizing nonprotein coding RNA(DANCR) in various types of human cancers. In this review, we summarize and figure out recent research studies concerning the expression and biological mechanisms of lncRNA-DANCR in tumour development. The related studies were obtained through a systematic search of PubMed, Embase and Cochrane Library. Long non-coding RNAs-DANCR is a valuable cancer-related lncRNA that its dysregulated expression was found in a variety of malignancies, including hepatocellular carcinoma, breast cancer, glioma, colorectal cancer, gastric cancer, and lung cancer. The aberrant expressions of DANCR have been shown to contribute to proliferation, migration and invasion of cancer cells. Long non-coding RNAs-DANCR likely serves as a useful disease biomarker or therapeutic cancer target. Copyright © 2018 Elsevier GmbH. All rights reserved.
In vitro selection of DNA elements highly responsive to the human T-cell lymphotropic virus type I transcriptional activator, Tax.

PubMed

Paca-Uccaralertkun, S; Zhao, L J; Adya, N; Cross, J V; Cullen, B R; Boros, I M; Giam, C Z

1994-01-01

The human T-cell lymphotropic virus type I (HTLV-I) transactivator, Tax, the ubiquitous transcriptional factor cyclic AMP (cAMP) response element-binding protein (CREB protein), and the 21-bp repeats in the HTLV-I transcriptional enhancer form a ternary nucleoprotein complex (L. J. Zhao and C. Z. Giam, Proc. Natl. Acad. Sci. USA 89:7070-7074, 1992). Using an antibody directed against the COOH-terminal region of Tax along with purified Tax and CREB proteins, we selected DNA elements bound specifically by the Tax-CREB complex in vitro. Two distinct but related groups of sequences containing the cAMP response element (CRE) flanked by long runs of G and C residues in the 5' and 3' regions, respectively, were preferentially recognized by Tax-CREB. In contrast, CREB alone binds only to CRE motifs (GNTGACG[T/C]) without neighboring G- or C-rich sequences. The Tax-CREB-selected sequences bear a striking resemblance to the 5' or 3' two-thirds of the HTLV-I 21-bp repeats and are highly inducible by Tax. Gel electrophoretic mobility shift assays, DNA transfection, and DNase I footprinting analyses indicated that the G- and C-rich sequences flanking the CRE motif are crucial for Tax-CREB-DNA ternary complex assembly and Tax transactivation but are not in direct contact with the Tax-CREB complex. These data show that Tax recruits CREB to form a multiprotein complex that specifically recognizes the viral 21-bp repeats. The expanded DNA binding specificity of Tax-CREB and the obligatory role the ternary Tax-CREB-DNA complex plays in transactivation reveal a novel mechanism for regulating the transcriptional activity of leucine zipper proteins like CREB.
Transcription Factor Binding Profiles Reveal Cyclic Expression of Human Protein-coding Genes and Non-coding RNAs

PubMed Central

Cheng, Chao; Ung, Matthew; Grant, Gavin D.; Whitfield, Michael L.

2013-01-01

Cell cycle is a complex and highly supervised process that must proceed with regulatory precision to achieve successful cellular division. Despite the wide application, microarray time course experiments have several limitations in identifying cell cycle genes. We thus propose a computational model to predict human cell cycle genes based on transcription factor (TF) binding and regulatory motif information in their promoters. We utilize ENCODE ChIP-seq data and motif information as predictors to discriminate cell cycle against non-cell cycle genes. Our results show that both the trans- TF features and the cis- motif features are predictive of cell cycle genes, and a combination of the two types of features can further improve prediction accuracy. We apply our model to a complete list of GENCODE promoters to predict novel cell cycle driving promoters for both protein-coding genes and non-coding RNAs such as lincRNAs. We find that a similar percentage of lincRNAs are cell cycle regulated as protein-coding genes, suggesting the importance of non-coding RNAs in cell cycle division. The model we propose here provides not only a practical tool for identifying novel cell cycle genes with high accuracy, but also new insights on cell cycle regulation by TFs and cis-regulatory elements. PMID:23874175
Identification of Dlk1-Dio3 imprinted gene cluster noncoding RNAs as novel candidate biomarkers for liver tumor promotion.

PubMed

Lempiäinen, Harri; Couttet, Philippe; Bolognani, Federico; Müller, Arne; Dubost, Valérie; Luisier, Raphaëlle; Del Rio Espinola, Alberto; Vitry, Veronique; Unterberger, Elif B; Thomson, John P; Treindl, Fridolin; Metzger, Ute; Wrzodek, Clemens; Hahne, Florian; Zollinger, Tulipan; Brasa, Sarah; Kalteis, Magdalena; Marcellin, Magali; Giudicelli, Fanny; Braeuning, Albert; Morawiec, Laurent; Zamurovic, Natasa; Längle, Ulrich; Scheer, Nico; Schübeler, Dirk; Goodman, Jay; Chibout, Salah-Dine; Marlowe, Jennifer; Theil, Diethilde; Heard, David J; Grenet, Olivier; Zell, Andreas; Templin, Markus F; Meehan, Richard R; Wolf, Roland C; Elcombe, Clifford R; Schwarz, Michael; Moulin, Pierre; Terranova, Rémi; Moggs, Jonathan G

2013-02-01

The molecular events during nongenotoxic carcinogenesis and their temporal order are poorly understood but thought to include long-lasting perturbations of gene expression. Here, we have investigated the temporal sequence of molecular and pathological perturbations at early stages of phenobarbital (PB) mediated liver tumor promotion in vivo. Molecular profiling (mRNA, microRNA [miRNA], DNA methylation, and proteins) of mouse liver during 13 weeks of PB treatment revealed progressive increases in hepatic expression of long noncoding RNAs and miRNAs originating from the Dlk1-Dio3 imprinted gene cluster, a locus that has recently been associated with stem cell pluripotency in mice and various neoplasms in humans. PB induction of the Dlk1-Dio3 cluster noncoding RNA (ncRNA) Meg3 was localized to glutamine synthetase-positive hypertrophic perivenous hepatocytes, suggesting a role for β-catenin signaling in the dysregulation of Dlk1-Dio3 ncRNAs. The carcinogenic relevance of Dlk1-Dio3 locus ncRNA induction was further supported by in vivo genetic dependence on constitutive androstane receptor and β-catenin pathways. Our data identify Dlk1-Dio3 ncRNAs as novel candidate early biomarkers for mouse liver tumor promotion and provide new opportunities for assessing the carcinogenic potential of novel compounds.
The role of epigenetics and long noncoding RNA MIAT in neuroendocrine prostate cancer.

PubMed

Crea, Francesco; Venalainen, Erik; Ci, Xinpei; Cheng, Hongwei; Pikor, Larissa; Parolia, Abhijit; Xue, Hui; Nur Saidy, Nur Ridzwan; Lin, Dong; Lam, Wan; Collins, Colin; Wang, Yuzhuo

2016-05-01

Neuroendocrine prostate cancer (NEPC) is the most lethal prostatic neoplasm. NEPC is thought to originate from the transdifferentiation of AR-positive adenocarcinoma cells. We have previously shown that an epigenetic/noncoding interactome (ENI) orchestrates cancer cells' plasticity, thereby allowing the emergence of metastatic, drug-resistant neoplasms. The primary objective of this manuscript is to discuss evidence indicating that some components of the ENI (Polycomb genes, miRNAs) play a key role in NEPC initiation and progression. Long noncoding RNAs represent vast and largely unexplored component of the ENI. Their role in NEPC has not been investigated. We show preliminary evidence indicating that a lncRNA (MIAT) is selectively upregulated in NEPCs and might interact with Polycomb genes. Our results indicate that long noncoding RNAs can be exploited as new biomarkers and therapeutic targets for NEPC.
Non-coding RNAs: new biomarkers and therapeutic targets for esophageal cancer

PubMed Central

Ren, Zhipeng; Zhang, Guoliang

2017-01-01

Esophageal cancer is one of the most common gastrointestinal malignant diseases and there is still no effective treatment. The incidence of esophageal cancer in the world is relatively high and on the increase year by year. Thus, the elaboration on the carcinogenesis of esophageal cancer and the identification of new biomarkers and therapeutic targets is quite beneficial to optimizing the current therapeutic regimen for treating such deadly disease. More and more evidence has shown that non-coding RNAs play an important role in the development and progression of multiple human cancers, including esophageal cancer. microRNAs (miRNAs) and long non-coding RNAs (lncRNAs) are two functional kinds of non-coding RNAs that have been well investigated. They exert tumor suppressive or promoting effect by specifically regulating the expression of certain downstream target genes, which is tumor specific. It is also proved that miRNAs and lncRNAs level in tissue and plasma from esophageal cancer patients are closely correlated with the survival and disease progression, which could be used as a prognostic factor and therapeutic target for esophageal cancer. PMID:28388588
Non-coding RNAs: new biomarkers and therapeutic targets for esophageal cancer.

PubMed

Hou, Xiaobin; Wen, Jiaxin; Ren, Zhipeng; Zhang, Guoliang

2017-06-27

Esophageal cancer is one of the most common gastrointestinal malignant diseases and there is still no effective treatment. The incidence of esophageal cancer in the world is relatively high and on the increase year by year. Thus, the elaboration on the carcinogenesis of esophageal cancer and the identification of new biomarkers and therapeutic targets is quite beneficial to optimizing the current therapeutic regimen for treating such deadly disease. More and more evidence has shown that non-coding RNAs play an important role in the development and progression of multiple human cancers, including esophageal cancer. microRNAs (miRNAs) and long non-coding RNAs (lncRNAs) are two functional kinds of non-coding RNAs that have been well investigated. They exert tumor suppressive or promoting effect by specifically regulating the expression of certain downstream target genes, which is tumor specific. It is also proved that miRNAs and lncRNAs level in tissue and plasma from esophageal cancer patients are closely correlated with the survival and disease progression, which could be used as a prognostic factor and therapeutic target for esophageal cancer.
Long Noncoding RNAs: Past, Present, and Future

PubMed Central

Kung, Johnny T. Y.; Colognori, David; Lee, Jeannie T.

2013-01-01

Long noncoding RNAs (lncRNAs) have gained widespread attention in recent years as a potentially new and crucial layer of biological regulation. lncRNAs of all kinds have been implicated in a range of developmental processes and diseases, but knowledge of the mechanisms by which they act is still surprisingly limited, and claims that almost the entirety of the mammalian genome is transcribed into functional noncoding transcripts remain controversial. At the same time, a small number of well-studied lncRNAs have given us important clues about the biology of these molecules, and a few key functional and mechanistic themes have begun to emerge, although the robustness of these models and classification schemes remains to be seen. Here, we review the current state of knowledge of the lncRNA field, discussing what is known about the genomic contexts, biological functions, and mechanisms of action of lncRNAs. We also reflect on how the recent interest in lncRNAs is deeply rooted in biology’s longstanding concern with the evolution and function of genomes. PMID:23463798
Cytoplasmic long noncoding RNAs are frequently bound to and degraded at ribosomes in human cells

PubMed Central

Carlevaro-Fita, Joana; Rahim, Anisa; Guigó, Roderic; Vardy, Leah A.; Johnson, Rory

2016-01-01

Recent footprinting studies have made the surprising observation that long noncoding RNAs (lncRNAs) physically interact with ribosomes. However, these findings remain controversial, and the overall proportion of cytoplasmic lncRNAs involved is unknown. Here we make a global, absolute estimate of the cytoplasmic and ribosome-associated population of stringently filtered lncRNAs in a human cell line using polysome profiling coupled to spike-in normalized microarray analysis. Fifty-four percent of expressed lncRNAs are detected in the cytoplasm. The majority of these (70%) have >50% of their cytoplasmic copies associated with polysomal fractions. These interactions are lost upon disruption of ribosomes by puromycin. Polysomal lncRNAs are distinguished by a number of 5′ mRNA-like features, including capping and 5′UTR length. On the other hand, nonpolysomal “free cytoplasmic” lncRNAs have more conserved promoters and a wider range of expression across cell types. Exons of polysomal lncRNAs are depleted of endogenous retroviral insertions, suggesting a role for repetitive elements in lncRNA localization. Finally, we show that blocking of ribosomal elongation results in stabilization of many associated lncRNAs. Together these findings suggest that the ribosome is the default destination for the majority of cytoplasmic long noncoding RNAs and may play a role in their degradation. PMID:27090285
DNA Methylation-a Potential Source of Mitochondria DNA Base Mismatch in the Development of Diabetic Retinopathy.

PubMed

Mishra, Manish; Kowluru, Renu A

2018-04-21

In the development of diabetic retinopathy, retinal mitochondria are dysfunctional, and mitochondrial DNA (mtDNA) is damaged with increased base mismatches and hypermethylated cytosines. DNA methylation is also a potential source of mutation, and in diabetes, the noncoding region, the displacement loop (D-loop), experiences more methylation and base mismatches than other regions of the mtDNA. Our aim was to investigate a possible crosstalk between mtDNA methylation and base mismatches in the development of diabetic retinopathy. The effect of inhibition of Dnmts (by 5-aza-2'-deoxycytidine or Dnmt1-siRNA) on glucose-induced mtDNA base mismatches was investigated in human retinal endothelial cells by surveyor endonuclease digestion and validated by Sanger sequencing. The role of deamination factors on increased base mismatches was determined in the cells genetically modulated for mitochondrial superoxide dismutase (Sod2) or cytidine-deaminase (APOBEC3A). The results were confirmed in an in vivo model using retinal microvasculature from diabetic mice overexpressing Sod2. Inhibition of DNA methylation, or regulation of cytosine deamination, significantly inhibited an increase in base mismatches at the D-loop and prevented mitochondrial dysfunction. Overexpression of Sod2 in mice also prevented diabetes-induced D-loop hypermethylation and increase in base mismatches. The crosstalk between DNA methylation and base mismatches continued even after termination of hyperglycemia, suggesting its role in the metabolic memory phenomenon associated with the progression of diabetic retinopathy. Inhibition of DNA methylation limits the availability of methylated cytosine for deamination, suggesting a crosstalk between DNA methylation and base mismatches. Thus, regulation of DNA methylation, or its deamination, should impede the development of diabetic retinopathy by preventing formation of base mismatches and mitochondrial dysfunction.
Identification of Novel Long Non-coding and Circular RNAs in Human Papillomavirus-Mediated Cervical Cancer

PubMed Central

Wang, Hongbo; Zhao, Yingchao; Chen, Mingyue; Cui, Jie

2017-01-01

Cervical cancer is the third most common cancer worldwide and the fourth leading cause of cancer-associated mortality in women. Accumulating evidence indicates that long non-coding RNAs (lncRNAs) and circular RNAs (circRNAs) may play key roles in the carcinogenesis of different cancers; however, little is known about the mechanisms of lncRNAs and circRNAs in the progression and metastasis of cervical cancer. In this study, we explored the expression profiles of lncRNAs, circRNAs, miRNAs, and mRNAs in HPV16 (human papillomavirus genotype 16) mediated cervical squamous cell carcinoma and matched adjacent non-tumor (ATN) tissues from three patients with high-throughput RNA sequencing (RNA-seq). In total, we identified 19 lncRNAs, 99 circRNAs, 28 miRNAs, and 304 mRNAs that were commonly differentially expressed (DE) in different patients. Among the non-coding RNAs, 3 lncRNAs and 44 circRNAs are novel to our knowledge. Functional enrichment analysis showed that DE lncRNAs, miRNAs, and mRNAs were enriched in pathways crucial to cancer as well as other gene ontology (GO) terms. Furthermore, the co-expression network and function prediction suggested that all 19 DE lncRNAs could play different roles in the carcinogenesis and development of cervical cancer. The competing endogenous RNA (ceRNA) network based on DE coding and non-coding RNAs showed that each miRNA targeted a number of lncRNAs and circRNAs. The link between part of the miRNAs in the network and cervical cancer has been validated in previous studies, and these miRNAs targeted the majority of the novel non-coding RNAs, thus suggesting that these novel non-coding RNAs may be involved in cervical cancer. Taken together, our study shows that DE non-coding RNAs could be further developed as diagnostic and therapeutic biomarkers of cervical cancer. The complex ceRNA network also lays the foundation for future research of the roles of coding and non-coding RNAs in cervical cancer. PMID:28970820
Non-coding cancer driver candidates identified with a sample- and position-specific model of the somatic mutation rate

PubMed Central

Juul, Malene; Bertl, Johanna; Guo, Qianyun; Nielsen, Morten Muhlig; Świtnicki, Michał; Hornshøj, Henrik; Madsen, Tobias; Hobolth, Asger; Pedersen, Jakob Skou

2017-01-01

Non-coding mutations may drive cancer development. Statistical detection of non-coding driver regions is challenged by a varying mutation rate and uncertainty of functional impact. Here, we develop a statistically founded non-coding driver-detection method, ncdDetect, which includes sample-specific mutational signatures, long-range mutation rate variation, and position-specific impact measures. Using ncdDetect, we screened non-coding regulatory regions of protein-coding genes across a pan-cancer set of whole-genomes (n = 505), which top-ranked known drivers and identified new candidates. For individual candidates, presence of non-coding mutations associates with altered expression or decreased patient survival across an independent pan-cancer sample set (n = 5454). This includes an antigen-presenting gene (CD1A), where 5’UTR mutations correlate significantly with decreased survival in melanoma. Additionally, mutations in a base-excision-repair gene (SMUG1) correlate with a C-to-T mutational-signature. Overall, we find that a rich model of mutational heterogeneity facilitates non-coding driver identification and integrative analysis points to candidates of potential clinical relevance. DOI: http://dx.doi.org/10.7554/eLife.21778.001 PMID:28362259
Genome-wide identification and functional prediction of nitrogen-responsive intergenic and intronic long non-coding RNAs in maize (Zea mays L.).

PubMed

Lv, Yuanda; Liang, Zhikai; Ge, Min; Qi, Weicong; Zhang, Tifu; Lin, Feng; Peng, Zhaohua; Zhao, Han

2016-05-11

Nitrogen (N) is an essential and often limiting nutrient to plant growth and development. Previous studies have shown that the mRNA expressions of numerous genes are regulated by nitrogen supplies; however, little is known about the expressed non-coding elements, for example long non-coding RNAs (lncRNAs) that control the response of maize (Zea mays L.) to nitrogen. LncRNAs are a class of non-coding RNAs larger than 200 bp, which have emerged as key regulators in gene expression. In this study, we surveyed the intergenic/intronic lncRNAs in maize B73 leaves at the V7 stage under conditions of N-deficiency and N-sufficiency using ribosomal RNA depletion and ultra-deep total RNA sequencing approaches. By integration with mRNA expression profiles and physiological evaluations, 7245 lncRNAs and 637 nitrogen-responsive lncRNAs were identified that exhibited unique expression patterns. Co-expression network analysis showed that the nitrogen-responsive lncRNAs were enriched mainly in one of the three co-expressed modules. The genes in the enriched module are mainly involved in NADH dehydrogenase activity, oxidative phosphorylation and the nitrogen compounds metabolic process. We identified a large number of lncRNAs in maize and illustrated their potential regulatory roles in response to N stress. The results lay the foundation for further in-depth understanding of the molecular mechanisms of lncRNAs' role in response to nitrogen stresses.
A pseudogene long-noncoding-RNA network regulates PTEN transcription and translation in human cells.

PubMed

Johnsson, Per; Ackley, Amanda; Vidarsdottir, Linda; Lui, Weng-Onn; Corcoran, Martin; Grandér, Dan; Morris, Kevin V

2013-04-01

PTEN is a tumor-suppressor gene that has been shown to be under the regulatory control of a PTEN pseudogene expressed noncoding RNA, PTENpg1. Here, we characterize a previously unidentified PTENpg1-encoded antisense RNA (asRNA), which regulates PTEN transcription and PTEN mRNA stability. We find two PTENpg1 asRNA isoforms, α and β. The α isoform functions in trans, localizes to the PTEN promoter and epigenetically modulates PTEN transcription by the recruitment of DNA methyltransferase 3a and Enhancer of Zeste. In contrast, the β isoform interacts with PTENpg1 through an RNA-RNA pairing interaction, which affects PTEN protein output through changes of PTENpg1 stability and microRNA sponge activity. Disruption of this asRNA-regulated network induces cell-cycle arrest and sensitizes cells to doxorubicin, which suggests a biological function for the respective PTENpg1 expressed asRNAs.
Arabidopsis intragenomic conserved noncoding sequence

PubMed Central

Thomas, Brian C.; Rapaka, Lakshmi; Lyons, Eric; Pedersen, Brent; Freeling, Michael

2007-01-01

After the most recent tetraploidy in the Arabidopsis lineage, most gene pairs lost one, but not both, of their duplicates. We manually inspected the 3,179 retained gene pairs and their surrounding gene space still present in the genome using a custom-made viewer application. The display of these pairs allowed us to define intragenic conserved noncoding sequences (CNSs), identify exon annotation errors, and discover potentially new genes. Using a strict algorithm to sort high-scoring pair sequences from the bl2seq data, we created a database of 14,944 intragenomic Arabidopsis CNSs. The mean CNS length is 31 bp, ranging from 15 to 285 bp. There are ≈1.7 CNSs associated with a typical gene, and Arabidopsis CNSs are found in all areas around exons, most frequently in the 5′ upstream region. Gene ontology classifications related to transcription, regulation, or “response to …” external or endogenous stimuli, especially hormones, tend to be significantly overrepresented among genes containing a large number of CNSs, whereas protein localization, transport, and metabolism are common among genes with no CNSs. There is a 1.5% overlap between these CNSs and the 218,982 putative RNAs in the Arabidopsis Small RNA Project database, allowing for two mismatches. These CNSs provide a unique set of noncoding sequences enriched for function. CNS function is implied by evolutionary conservation and independently supported because CNS-richness predicts regulatory gene ontology categories. PMID:17301222
Spread of X-chromosome inactivation into autosomal sequences: role for DNA elements, chromatin features and chromosomal domains

PubMed Central

Cotton, Allison M.; Chen, Chih-Yu; Lam, Lucia L.; Wasserman, Wyeth W.; Kobor, Michael S.; Brown, Carolyn J.

2014-01-01

X-chromosome inactivation results in dosage equivalence between the X chromosome in males and females; however, over 15% of human X-linked genes escape silencing and these genes are enriched on the evolutionarily younger short arm of the X chromosome. The spread of inactivation onto translocated autosomal material allows the study of inactivation without the confounding evolutionary history of the X chromosome. The heterogeneity and reduced extent of silencing on autosomes are evidence for the importance of DNA elements underlying the spread of silencing. We have assessed DNA methylation in six unbalanced X-autosome translocations using the Illumina Infinium HumanMethylation450 array. Two to 42% of translocated autosomal genes showed this mark of silencing, with the highest degree of inactivation observed for trisomic autosomal regions. Generally, the extent of silencing was greatest close to the translocation breakpoint; however, silencing was detected well over 100 kb into the autosomal DNA. Alu elements were found to be enriched at autosomal genes that escaped from inactivation while L1s were enriched at subject genes. In cells without the translocation, there was enrichment of heterochromatic features such as EZH2 and H3K27me3 for those genes that become silenced when translocated, suggesting that underlying chromatin structure predisposes genes towards silencing. Additionally, the analysis of topological domains indicated physical clustering of autosomal genes of common inactivation status. Overall, our analysis indicated a complex interaction between DNA sequence, chromatin features and the three-dimensional structure of the chromosome. PMID:24158853
Genomic fossils reveal adaptation of non-autonomous pararetroviruses driven by concerted evolution of noncoding regulatory sequences.

PubMed

Chen, Sunlu; Zheng, Huizhen; Kishima, Yuji

2017-06-01

The interplay of different virus species in a host cell after infection can affect the adaptation of each virus. Endogenous viral elements, such as endogenous pararetroviruses (PRVs), have arisen from vertical inheritance of viral sequences integrated into host germline genomes. As viral genomic fossils, these sequences can thus serve as valuable paleogenomic data to study the long-term evolutionary dynamics of virus-virus interactions, but they have rarely been applied for this purpose. All extant PRVs have been considered autonomous species in their parasitic life cycle in host cells. Here, we provide evidence for multiple non-autonomous PRV species with structural defects in viral activity that have frequently infected ancient grass hosts and adapted through interplay between viruses. Our paleogenomic analyses using endogenous PRVs in grass genomes revealed that these non-autonomous PRV species have participated in interplay with autonomous PRVs in a possible commensal partnership, or, alternatively, with one another in a possible mutualistic partnership. These partnerships, which have been established by the sharing of noncoding regulatory sequences (NRSs) in intergenic regions between two partner viruses, have been further maintained and altered by the sequence homogenization of NRSs between partners. Strikingly, we found that frequent region-specific recombination, rather than mutation selection, is the main causative mechanism of NRS homogenization. Our results, obtained from ancient DNA records of viruses, suggest that adaptation of PRVs has occurred by concerted evolution of NRSs between different virus species in the same host. Our findings further imply that evaluation of within-host NRS interactions within and between populations of viral pathogens may be important.
Noncoding RNAs: New Players in Pulmonary Medicine and Sarcoidosis.

PubMed

Salamo, Oriana; Mortaz, Esmaeil; Mirsaeidi, Mehdi

2018-02-01

Noncoding RNAs (ncRNAs) are coded by 98% of human genomic DNA. They are grouped into two major classes according to length: small ncRNAs and long ncRNAs. They regulate genome organization, stability, and physiological processes that maintain cellular homeostasis. Recently, great interest has emerged in ncRNAs because of their significant roles in the development of inflammatory diseases, including sarcoidosis. Some have been introduced as novel markers for disease activity, such as increased levels of microRNA-34a in peripheral blood mononuclear cells of patients with sarcoidosis, re-emphasizing the inflammatory component in sarcoidosis. They are also important factors in the outcome of sarcoidosis. Dysregulation of microRNA-let7f leads to overexpression of profibrotic factors and could be related to the pathogenesis of pulmonary fibrosis in patients with sarcoidosis, owing to their stimulatory effect on collagen expression and deposition. However, many unanswered questions remain about the association of ncRNAs and sarcoidosis. By understanding the functions of ncRNAs in T-helper cell type 1 and T-helper cell type 17, we may uncover the mechanism of action of those cells in sarcoidosis. Further translational research is needed to define the RNA gene fingerprint of different sarcoidosis stages.

Distribution of Unlinked Transpositions of a Ds Element from a T-DNA Locus on Tomato Chromosome 4

PubMed Central

Briza, J.; Carroll, B. J.; Klimyuk, V. I.; Thomas, C. M.; Jones, D. A.; Jones, JDG.

1995-01-01

In maize, receptor sites for unlinked transpositions of Activator (Ac) elements are not distributed randomly. To test whether the same is true in tomato, the receptor sites for a Dissociation (Ds) element derived from Ac, were mapped for 26 transpositions unlinked to a donor T-DNA locus on chromosome 4. Four independent transposed Dss mapped to sites on chromosome 4 genetically unlinked to the donor T-DNA, consistent with a preference for transposition to unlinked sites on the same chromosome as opposed to sites on other chromosomes. There was little preference among the nondonor chromosomes, except perhaps for chromosome 2, which carried seven transposed Dss, but these could not be proven to be independent. However, these data, when combined with those from other studies in tomato examining the distribution of transposed Acs or Dss among nondonor chromosomes, suggest there may be absolute preferences for transposition irrespective of the chromosomal location of the donor site. If true, transposition to nondonor chromosomes in tomato would differ from that in maize, where the preference seems to be determined by the spatial arrangement of chromosomes in the interphase nucleus. The tomato lines carrying Ds elements at known locations are available for targeted transposon tagging experiments. PMID:8536985
DIANA-LncBase v2: indexing microRNA targets on non-coding transcripts.

PubMed

Paraskevopoulou, Maria D; Vlachos, Ioannis S; Karagkouni, Dimitra; Georgakilas, Georgios; Kanellos, Ilias; Vergoulis, Thanasis; Zagganas, Konstantinos; Tsanakas, Panayiotis; Floros, Evangelos; Dalamagas, Theodore; Hatzigeorgiou, Artemis G

2016-01-04

microRNAs (miRNAs) are short non-coding RNAs (ncRNAs) that act as post-transcriptional regulators of coding gene expression. Long non-coding RNAs (lncRNAs) have been recently reported to interact with miRNAs. The sponge-like function of lncRNAs introduces an extra layer of complexity in the miRNA interactome. DIANA-LncBase v1 provided a database of experimentally supported and in silico predicted miRNA Recognition Elements (MREs) on lncRNAs. The second version of LncBase (www.microrna.gr/LncBase) presents an extensive collection of miRNA:lncRNA interactions. The significantly enhanced database includes more than 70 000 low and high-throughput, (in)direct miRNA:lncRNA experimentally supported interactions, derived from manually curated publications and the analysis of 153 AGO CLIP-Seq libraries. The new experimental module presents a 14-fold increase compared to the previous release. LncBase v2 hosts in silico predicted miRNA targets on lncRNAs, identified with the DIANA-microT algorithm. The relevant module provides millions of predicted miRNA binding sites, accompanied with detailed metadata and MRE conservation metrics. LncBase v2 caters information regarding cell type specific miRNA:lncRNA regulation and enables users to easily identify interactions in 66 different cell types, spanning 36 tissues for human and mouse. Database entries are also supported by accurate lncRNA expression information, derived from the analysis of more than 6 billion RNA-Seq reads. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
A Somatically Acquired Enhancer of the Androgen Receptor Is a Noncoding Driver in Advanced Prostate Cancer.

PubMed

Takeda, David Y; Spisák, Sándor; Seo, Ji-Heui; Bell, Connor; O'Connor, Edward; Korthauer, Keegan; Ribli, Dezső; Csabai, István; Solymosi, Norbert; Szállási, Zoltán; Stillman, David R; Cejas, Paloma; Qiu, Xintao; Long, Henry W; Tisza, Viktória; Nuzzo, Pier Vitale; Rohanizadegan, Mersedeh; Pomerantz, Mark M; Hahn, William C; Freedman, Matthew L

2018-06-09

Increased androgen receptor (AR) activity drives therapeutic resistance in advanced prostate cancer. The most common resistance mechanism is amplification of this locus presumably targeting the AR gene. Here, we identify and characterize a somatically acquired AR enhancer located 650 kb centromeric to the AR. Systematic perturbation of this enhancer using genome editing decreased proliferation by suppressing AR levels. Insertion of an additional copy of this region sufficed to increase proliferation under low androgen conditions and to decrease sensitivity to enzalutamide. Epigenetic data generated in localized prostate tumors and benign specimens support the notion that this region is a developmental enhancer. Collectively, these observations underscore the importance of epigenomic profiling in primary specimens and the value of deploying genome editing to functionally characterize noncoding elements. More broadly, this work identifies a therapeutic vulnerability for targeting the AR and emphasizes the importance of regulatory elements as highly recurrent oncogenic drivers. Copyright © 2018 Elsevier Inc. All rights reserved.
AP1 Keeps Chromatin Poised for Action | Center for Cancer Research

Cancer.gov

The human genome harbors gene-encoding DNA, the blueprint for building proteins that regulate cellular function. Embedded across the genome, in non-coding regions, are DNA elements to which regulatory factors bind. The interaction of regulatory factors with DNA at these sites modifies gene expression to modulate cell activity. In cells, DNA exists in a complex with proteins called chromatin that compacts the DNA in the nucleus, strongly restricting access to DNA sequences. As a result, regulatory factors only interact with a small subset of their potential binding elements in a given cell to regulate genes. How factors recognize and select sites in chromatin across the genome is not well understood -- but several discoveries in CCR’s Laboratory of Receptor Biology and Gene Expression (LRBGE) have shed light on the mechanisms that direct factors to DNA.
The site-specific ribosomal DNA insertion element R1Bm belongs to a class of non-long-terminal-repeat retrotransposons.

PubMed Central

Xiong, Y; Eickbush, T H

1988-01-01

Two types of insertion elements, R1 and R2 (previously called type I and type II), are known to interrupt the 28S ribosomal genes of several insect species. In the silkmoth, Bombyx mori, each element occupies approximately 10% of the estimated 240 ribosomal DNA units, while at most only a few copies are located outside the ribosomal DNA units. We present here the complete nucleotide sequence of an R1 insertion from B. mori (R1Bm). This 5.1-kilobase element contains two overlapping open reading frames (ORFs) which together occupy 88% of its length. ORF1 is 461 amino acids in length and exhibits characteristics of retroviral gag genes. ORF2 is 1,051 amino acids in length and contains homology to reverse transcriptase-like enzymes. The analysis of 3' and 5' ends of independent isolates from the ribosomal locus supports the suggestion that R1 is still functioning as a transposable element. The precise location of the element within the genome implies that its transposition must occur with remarkable insertion sequence specificity. Comparison of the deduced amino acid sequences from six retrotransposons, R1 and R2 of B. mori, I factor and F element of Drosophila melanogaster, L1 of Mus domesticus, and Ingi of Trypanosoma brucei, reveals a relatively high level of sequence homology in the reverse transcriptase region. Like R1, these elements lack long terminal repeats. We have therefore named this class of related elements the non-long-terminal-repeat (non-LTR) retrotransposons. Images PMID:2447482
Long Noncoding RNAs in the Yeast S. cerevisiae.

PubMed

Niederer, Rachel O; Hass, Evan P; Zappulla, David C

2017-01-01

Long noncoding RNAs have recently been discovered to comprise a sizeable fraction of the RNA World. The scope of their functions, physical organization, and disease relevance remain in the early stages of characterization. Although many thousands of lncRNA transcripts recently have been found to emanate from the expansive DNA between protein-coding genes in animals, there are also hundreds that have been found in simple eukaryotes. Furthermore, lncRNAs have been found in the bacterial and archaeal branches of the tree of life, suggesting they are ubiquitous. In this chapter, we focus primarily on what has been learned so far about lncRNAs from the greatly studied single-celled eukaryote, the yeast Saccharomyces cerevisiae. Most lncRNAs examined in yeast have been implicated in transcriptional regulation of protein-coding genes-often in response to forms of stress-whereas a select few have been ascribed yet other functions. Of those known to be involved in transcriptional regulation of protein-coding genes, the vast majority function in cis. There are also some yeast lncRNAs identified that are not directly involved in regulation of transcription. Examples of these include the telomerase RNA and telomere-encoded transcripts. In addition to its role as a template-encoding telomeric DNA synthesis, telomerase RNA has been shown to function as a flexible scaffold for protein subunits of the RNP holoenzyme. The flexible scaffold model provides a specific mechanistic paradigm that is likely to apply to many other lncRNAs that assemble and orchestrate large RNP complexes, even in humans. Looking to the future, it is clear that considerable fundamental knowledge remains to be obtained about the architecture and functions of lncRNAs. Using genetically tractable unicellular model organisms should facilitate lncRNA characterization. The acquired basic knowledge will ultimately translate to better understanding of the growing list of lncRNAs linked to human maladies.
Long Noncoding RNA in Digestive Tract Cancers: Function, Mechanism, and Potential Biomarker

PubMed Central

Zeng, Shuo; Xiao, Yu-Feng; Tang, Bo; Hu, Chang-Jiang; Xie, Rei; Yang, Shi-Ming

2015-01-01

Digestive tract cancers (DTCs) are a leading cause of cancer-related death worldwide. Current therapeutic tools for advanced stage DTCs have limitations, and patients with early stage DTCs frequently have a missed diagnosis due to shortage of efficient biomarkers. Consequently, it is necessary to develop novel biomarkers for early diagnosis and novel therapeutic targets for treatment of DTCs. In recent years, long noncoding RNAs (lncRNAs), a class of noncoding RNAs with >200 nucleotides, have been shown to be aberrantly expressed in DTCs and to have an important role in DTC development: the expression profiles of lncRNAs strongly correlated with poor survival of patients with DTCs, and lncRNAs acted as oncogenes or tumor suppressor genes in DTC progression. In this review, we summarized the functional lncRNAs and expounded on their regulatory mechanisms in DTCs. Implications for Practice: Digestive tract cancers (DTCs) are a leading cause of cancer-related death worldwide. It is necessary to exploit novel biomarkers for early diagnosis and novel therapeutic targets for treatment of DTCs. Long noncoding RNAs (lncRNAs), a class of noncoding RNAs with approximately 200 nucleotides to 100,000 bases, participate in the progression of a variety of diseases. This review summarizes functional lncRNAs, which were shown to serve as novel biomarkers for diagnosis and prognosis of DTCs and to act as oncogenes or tumor suppressor genes in DTC development. In addition, the potential mechanism of functional lncRNAs in DTCs is highlighted. PMID:26156325
Dengue Non-coding RNA: TRIMmed for Transmission.

PubMed

Göertz, Giel P; Pijlman, Gorben P

2015-08-12

Dengue virus RNA is trimmed by the 5'→3' exoribonuclease XRN1 to produce an abundant, non-coding subgenomic flavivirus RNA (sfRNA) in infected cells. In a recent paper in Science, Manokaran et al. (2015) report that sfRNA binds TRIM25 to evade innate immune sensing of viral RNA by RIG-I. Copyright © 2015 Elsevier Inc. All rights reserved.
Evaluation of non-coding variation in GLUT1 deficiency.

PubMed

Liu, Yu-Chi; Lee, Jia Wei Audrey; Bellows, Susannah T; Damiano, John A; Mullen, Saul A; Berkovic, Samuel F; Bahlo, Melanie; Scheffer, Ingrid E; Hildebrand, Michael S

2016-12-01

Loss-of-function mutations in SLC2A1, encoding glucose transporter-1 (GLUT-1), lead to dysfunction of glucose transport across the blood-brain barrier. Ten percent of cases with hypoglycorrhachia (fasting cerebrospinal fluid [CSF] glucose <2.2mmol/L) do not have mutations. We hypothesized that GLUT1 deficiency could be due to non-coding SLC2A1 variants. We performed whole exome sequencing of one proband with a GLUT1 phenotype and hypoglycorrhachia negative for SLC2A1 sequencing and copy number variants. We studied a further 55 patients with different epilepsies and low CSF glucose who did not have exonic mutations or copy number variants. We sequenced non-coding promoter and intronic regions. We performed mRNA studies for the recurrent intronic variant. The proband had a de novo splice site mutation five base pairs from the intron-exon boundary. Three of 55 patients had deep intronic SLC2A1 variants, including a recurrent variant in two. The recurrent variant produced less SLC2A1 mRNA transcript. Fasting CSF glucose levels show an age-dependent correlation, which makes the definition of hypoglycorrhachia challenging. Low CSF glucose levels may be associated with pathogenic SLC2A1 mutations including deep intronic SLC2A1 variants. Extending genetic screening to non-coding regions will enable diagnosis of more patients with GLUT1 deficiency, allowing implementation of the ketogenic diet to improve outcomes. © 2016 Mac Keith Press.
Identification of antisense long noncoding RNAs that function as SINEUPs in human cells.

PubMed

Schein, Aleks; Zucchelli, Silvia; Kauppinen, Sakari; Gustincich, Stefano; Carninci, Piero

2016-09-20

Mammalian genomes encode numerous natural antisense long noncoding RNAs (lncRNAs) that regulate gene expression. Recently, an antisense lncRNA to mouse Ubiquitin carboxyl-terminal hydrolase L1 (Uchl1) was reported to increase UCHL1 protein synthesis, representing a new functional class of lncRNAs, designated as SINEUPs, for SINE element-containing translation UP-regulators. Here, we show that an antisense lncRNA to the human protein phosphatase 1 regulatory subunit 12A (PPP1R12A), named as R12A-AS1, which overlaps with the 5' UTR and first coding exon of the PPP1R12A mRNA, functions as a SINEUP, increasing PPP1R12A protein translation in human cells. The SINEUP activity depends on the aforementioned sense-antisense interaction and a free right Alu monomer repeat element at the 3' end of R12A-AS1. In addition, we identify another human antisense lncRNA with SINEUP activity. Our results demonstrate for the first time that human natural antisense lncRNAs can up-regulate protein translation, suggesting that endogenous SINEUPs may be widespread and present in many mammalian species.
The Ftx Noncoding Locus Controls X Chromosome Inactivation Independently of Its RNA Products.

PubMed

Furlan, Giulia; Gutierrez Hernandez, Nancy; Huret, Christophe; Galupa, Rafael; van Bemmel, Joke Gerarda; Romito, Antonio; Heard, Edith; Morey, Céline; Rougeulle, Claire

2018-05-03

Accumulation of the Xist long noncoding RNA (lncRNA) on one X chromosome is the trigger for X chromosome inactivation (XCI) in female mammals. Xist expression, which needs to be tightly controlled, involves a cis-acting region, the X-inactivation center (Xic), containing many lncRNA genes that evolved concomitantly to Xist from protein-coding ancestors through pseudogeneization and loss of coding potential. Here, we uncover an essential role for the Xic-linked noncoding gene Ftx in the regulation of Xist expression. We show that Ftx is required in cis to promote Xist transcriptional activation and establishment of XCI. Importantly, we demonstrate that this function depends on Ftx transcription and not on the RNA products. Our findings illustrate the multiplicity of layers operating in the establishment of XCI and highlight the diversity in the modus operandi of the noncoding players. Copyright © 2018 Elsevier Inc. All rights reserved.
In Vitro Selection of a Single-Stranded DNA Molecular Recognition Element against the Pesticide Fipronil and Sensitive Detection in River Water

PubMed Central

Sooter, Letha J.

2017-01-01

Fipronil is a commonly used insecticide that has been shown to have environmental and human health risks. The current standard methods of detection for fipronil and its metabolites, such as GC-MS, are time consuming and labor intensive. In this study, a variant of systematic evolution of ligands by exponential enrichment (SELEX), was utilized to identify the first single-stranded DNA (ssDNA) molecular recognition element (MRE) that binds to fipronil with high affinity (Kd = 48 ± 8 nM). The selected MRE displayed low cross binding activity on various environmentally relevant, structurally unrelated herbicides and pesticides, in addition to broad-spectrum binding activity on major metabolites of fipronil and a structurally similar pesticide in prepared river samples. Additionally, a proof-of-principle fluorescent detection assay was developed by using the selected ssDNA MRE as a signal-reporting element, with a limit of detection of 105 nM in a prepared river water sample. PMID:29283416
Identification and Functional Prediction of Large Intergenic Noncoding RNAs (lincRNAs) in Rainbow Trout (Oncorhynchus mykiss)

USDA-ARS?s Scientific Manuscript database

Long noncoding RNAs (lncRNAs) have been recognized in recent years as key regulators of diverse cellular processes. Genome-wide large-scale projects have uncovered thousands of lncRNAs in many model organisms. Large intergenic noncoding RNAs (lincRNAs) are lncRNAs that are transcribed from intergeni...
Evaluation of fluorescence in situ hybridization techniques to study long non-coding RNA expression in cultured cells

PubMed Central

Soares, Ricardo J; Maglieri, Giulia; Gutschner, Tony; Lund, Anders H; Nielsen, Boye S

2018-01-01

Abstract Deciphering the functions of long non-coding RNAs (lncRNAs) is facilitated by visualization of their subcellular localization using in situ hybridization (ISH) techniques. We evaluated four different ISH methods for detection of MALAT1 and CYTOR in cultured cells: a multiple probe detection approach with or without enzymatic signal amplification, a branched-DNA (bDNA) probe and an LNA-modified probe with enzymatic signal amplification. All four methods adequately stained MALAT1 in the nucleus in all of three cell lines investigated, HeLa, NHDF and T47D, and three of the methods detected the less expressed CYTOR. The sensitivity of the four ISH methods was evaluated by image analysis. In all three cell lines, the two methods involving enzymatic amplification gave the most intense MALAT1 signal, but the signal-to-background ratios were not different. CYTOR was best detected using the bDNA method. All four ISH methods showed significantly reduced MALAT1 signal in knock-out cells, and siRNA-induced knock-down of CYTOR resulted in significantly reduced CYTOR ISH signal, indicating good specificity of the probe designs and detection systems. Our data suggest that the ISH methods allow detection of both abundant and less abundantly expressed lncRNAs, although the latter required the use of the most specific and sensitive probe detection system. PMID:29059327
The effect of inflammation-related lifestyle exposures and interactions with gene variants on long interspersed nuclear element-1 DNA methylation.

PubMed

Gogna, Priyanka; O'Sullivan, Dylan E; King, Will D

2018-06-11

To examine the relationship between inflammation-related lifestyle factors and long interspersed nuclear element-1 (LINE-1) DNA methylation, and test for interaction by gene variants involved in one-carbon metabolism. The study population consisted of 280 individuals undergoing colonoscopy screening. Multivariable linear regression was employed to examine associations of physical activity, BMI and NSAID use with LINE-1 DNA methylation and interactions with MTR and MTHFR gene variants. The highest quartile of physical activity compared with the lowest was associated with higher LINE-1 DNA methylation (p = 0.005). Long-term NSAID use and a normal BMI were associated with increased LINE-1 DNA methylation among individuals with the variant MTR allele (p = 0.02; p = 0.03). This study provides evidence that inflammation-related exposures may influence LINE-1 DNA methylation.
Long Noncoding RNA H19 Inhibits Cell Viability, Migration, and Invasion Via Downregulation of IRS-1 in Thyroid Cancer Cells

PubMed Central

Wang, Peng; Xu, Weimin; Liu, Haixia; Bu, Qingao; Sun, Diwen

2017-01-01

Thyroid cancer is a common endocrine gland malignancy which exhibited rapid increased incidence worldwide in recent decades. This study was aimed to investigate the role of long noncoding RNA H19 in thyroid cancer. Long noncoding RNA H19 was overexpressed or knockdown in thyroid cancer cells SW579 and TPC-1, and the expression of long noncoding RNA H19 was detected by real-time polymerase chain reaction. The cell viability, migration, and invasion were determined by 3-(4, 5-dimethyl-2-thiazolyl)-2, 5-diphenyl-2-H-tetrazolium bromide assay, Transwell assay, and wound healing assay, respectively. Furthermore, cell apoptosis was analyzed by flow cytometry, and expressions of some factors that were related to phosphatidyl inositide 3-kinases/protein kinase B and nuclear factor κB signal pathway were measured by Western blotting. This study revealed that cell viability and migration/invasion of SW579 and TPC-1 were significantly decreased by long noncoding RNA H19 overexpression compared with the control group (P < .05), whereas cell apoptosis was statistically increased (P < .001). Meanwhile, cell viability and migration/invasion were significantly increased after long noncoding RNA H19 knockdown (P < .05). Furthermore, long noncoding RNA H19 negatively regulated the expression of insulin receptor substrate 1 and thus effect on cell proliferation and apoptosis. Insulin receptor substrate 1 regulated the activation of phosphatidyl inositide 3-kinases/AKT and nuclear factor κB signal pathways. In conclusion, long noncoding RNA H19 could suppress cell viability, migration, and invasion via downregulation of insulin receptor substrate 1 in SW579 and TPC-1 cells. These results suggested the important role of long noncoding RNA H19 in thyroid cancer, and long noncoding RNA H19 might be a potential target of thyroid cancer treatment. PMID:29332545
Dynamic interplay and function of multiple noncoding genes governing X chromosome inactivation

PubMed Central

Yue, Minghui; Richard, John Lalith Charles

2015-01-01

There is increasing evidence for the emergence of long noncoding RNAs (IncRNAs) as important components, especially in the regulation of gene expression. In the event of X chromosome inactivation, robust epigenetic marks are established in a long noncoding Xist RNA-dependent manner, giving rise to a distinct epigenetic landscape on the inactive X chromosome (Xi). The X inactivation center (Xic is essential for induction of X chromosome inactivation and harbors two topologically associated domains (TADs) to regulate monoallelic Xist expression: one at the noncoding Xist gene and its upstream region, and the other at the antisense Tsix and its upstream region. The monoallelic expression of Xist is tightly regulated by these two functionally distinct TADs as well as their constituting IncRNAs and proteins. In this review, we summarize recent updates in our knowledge of IncRNAs found at the Xic and discuss their overall mechanisms of action. We also discuss our current understanding of the molecular mechanism behind Xist RNA-mediated induction of the repressive epigenetic landscape at the Xi. PMID:26260844
Generation of Infectious Poliovirus with Altered Genetic Information from Cloned cDNA.

PubMed

Bujaki, Erika

2016-01-01

The effect of specific genetic alterations on virus biology and phenotype can be studied by a great number of available assays. The following method describes the basic protocol to generate infectious poliovirus with altered genetic information from cloned cDNA in cultured cells.The example explained here involves generation of a recombinant poliovirus genome by simply replacing a portion of the 5' noncoding region with a synthetic gene by restriction cloning. The vector containing the full length poliovirus genome and the insert DNA with the known mutation(s) are cleaved for directional cloning, then ligated and transformed into competent bacteria. The recombinant plasmid DNA is then propagated in bacteria and transcribed to RNA in vitro before RNA transfection of cultured cells is performed. Finally, viral particles are recovered from the cell culture.
Modular assembly of chimeric phi29 packaging RNAs that support DNA packaging.

PubMed

Fang, Yun; Shu, Dan; Xiao, Feng; Guo, Peixuan; Qin, Peter Z

2008-08-08

The bacteriophage phi29 DNA packaging motor is a protein/RNA complex that can produce strong force to condense the linear-double-stranded DNA genome into a pre-formed protein capsid. The RNA component, called the packaging RNA (pRNA), utilizes magnesium-dependent inter-molecular base-pairing interactions to form ring-shaped complexes. The pRNA is a class of non-coding RNA, interacting with phi29 motor proteins to enable DNA packaging. Here, we report a two-piece chimeric pRNA construct that is fully competent in interacting with partner pRNA to form ring-shaped complexes, in packaging DNA via the motor, and in assembling infectious phi29 virions in vitro. This is the first example of a fully functional pRNA assembled using two non-covalently interacting fragments. The results support the notion of modular pRNA architecture in the phi29 packaging motor.
Modular assembly of chimeric phi29 packaging RNAs that support DNA packaging

PubMed Central

Fang, Yun; Shu, Dan; Xiao, Feng; Guo, Peixuan; Qin, Peter Z.

2008-01-01

The bacteriophage phi29 DNA packaging motor is a protein/RNA complex that can produce strong force to condense the linear-double stranded DNA genome into a pre-formed protein capsid. The RNA component, called the packaging RNA (pRNA), utilizes magnesium-dependent intermolecular base-pairing interactions to form ring-shaped complexes. The pRNA is a class of non-coding RNA, interacting with phi29 motor proteins to enable DNA packaging. Here, we report a 2-piece chimeric pRNA construct that is fully competent in interacting with partner pRNA to form ring-shaped complexes, in packaging DNA via the motor, and in assembling infectious phi29 virions in vitro. This is the first example of a fully functional pRNA assembled using two non-covalently interacting fragments. The results support the notion of modular pRNA architecture in the phi29 packaging motor. PMID:18514064

Functional Interplay between Small Non-Coding RNAs and RNA Modification in the Brain.

PubMed

Leighton, Laura J; Bredy, Timothy W

2018-06-07

Small non-coding RNAs are essential for transcription, translation and gene regulation in all cell types, but are particularly important in neurons, with known roles in neurodevelopment, neuroplasticity and neurological disease. Many small non-coding RNAs are directly involved in the post-transcriptional modification of other RNA species, while others are themselves substrates for modification, or are functionally modulated by modification of their target RNAs. In this review, we explore the known and potential functions of several distinct classes of small non-coding RNAs in the mammalian brain, focusing on the newly recognised interplay between the epitranscriptome and the activity of small RNAs. We discuss the potential for this relationship to influence the spatial and temporal dynamics of gene activation in the brain, and predict that further research in the field of epitranscriptomics will identify interactions between small RNAs and RNA modifications which are essential for higher order brain functions such as learning and memory.
Molecular Evolution of the Non-Coding Eosinophil Granule Ontogeny Transcript

PubMed Central

Rose, Dominic; Stadler, Peter F.

2011-01-01

Eukaryotic genomes are pervasively transcribed. A large fraction of the transcriptional output consists of long, mRNA-like, non-protein-coding transcripts (mlncRNAs). The evolutionary history of mlncRNAs is still largely uncharted territory. In this contribution, we explore in detail the evolutionary traces of the eosinophil granule ontogeny transcript (EGOT), an experimentally confirmed representative of an abundant class of totally intronic non-coding transcripts (TINs). EGOT is located antisense to an intron of the ITPR1 gene. We computationally identify putative EGOT orthologs in the genomes of 32 different amniotes, including orthologs from primates, rodents, ungulates, carnivores, afrotherians, and xenarthrans, as well as putative candidates from basal amniotes, such as opossum or platypus. We investigate the EGOT gene phylogeny, analyze patterns of sequence conservation, and the evolutionary conservation of the EGOT gene structure. We show that EGO-B, the spliced isoform, may be present throughout the placental mammals, but most likely dates back even further. We demonstrate here for the first time that the whole EGOT locus is highly structured, containing several evolutionary conserved, and thermodynamic stable secondary structures. Our analyses allow us to postulate novel functional roles of a hitherto poorly understood region at the intron of EGO-B which is highly conserved at the sequence level. The region contains a novel ITPR1 exon and also conserved RNA secondary structures together with a conserved TATA-like element, which putatively acts as a promoter of an independent regulatory element. PMID:22303364
Polyomavirus BK non-coding control region rearrangements in health and disease.

PubMed

Sharma, Preety M; Gupta, Gaurav; Vats, Abhay; Shapiro, Ron; Randhawa, Parmjeet S

2007-08-01

BK virus is an increasingly recognized pathogen in transplanted patients. DNA sequencing of this virus shows considerable genomic variability. To understand the clinical significance of rearrangements in the non-coding control region (NCCR) of BK virus (BKV), we report a meta-analysis of 507 sequences, including 40 sequences generated in our own laboratory, for associations between rearrangements and disease, tissue tropism, geographic origin, and viral genotype. NCCR rearrangements were less frequent in (a) asymptomatic BKV viruria compared to patients viral nephropathy (1.7% vs. 22.5%), and (b) viral genotype 1 compared to other genotypes (2.4% vs. 11.2%). Rearrangements were commoner in malignancy (78.6%), and Norwegians (45.7%), and less common in East Indians (0%), and Japanese (4.3%). A surprising number of rearranged sequences were reported from mononuclear cells of healthy subjects, whereas most plasma sequences were archetypal. This difference could not be related to potential recombinase activity in lymphocytes, as consensus recombination signal sequences could not be found in the NCCR region. NCCR rearrangements are neither required nor a sufficient condition to produce clinical disease. BKV nephropathy and hemorrhagic cystitis are not associated with any unique NCCR configuration or nucleotide sequence.
Maternal noncoding transcripts antagonize the targeting of DNA elimination by scanRNAs in Paramecium tetraurelia

PubMed Central

Lepère, Gersende; Bétermier, Mireille; Meyer, Eric; Duharcourt, Sandra

2008-01-01

The germline genome of ciliates is extensively rearranged during the development of a new somatic macronucleus from the germline micronucleus, after sexual events. In Paramecium tetraurelia, single-copy internal eliminated sequences (IESs) are precisely excised from coding sequences and intergenic regions. For a subset of IESs, introduction of the IES sequence into the maternal macronucleus specifically inhibits excision of the homologous IES in the developing zygotic macronucleus, suggesting that epigenetic regulation of excision involves a global comparison of germline and somatic genomes. ScanRNAs (scnRNAs) produced during micronuclear meiosis by a developmentally regulated RNAi pathway have been proposed to mediate this transnuclear cross-talk. In this study, microinjection experiments provide direct evidence that 25-nucleotide (nt) scnRNAs promote IES excision. We further show that noncoding RNAs are produced from the somatic maternal genome, both during vegetative growth and during sexual events. Maternal inhibition of IES excision is abolished when maternal somatic transcripts containing an IES are targeted for degradation by a distinct RNAi pathway involving 23-nt siRNAs. The results strongly support a scnRNA/macronuclear RNA scanning model in which a natural genomic subtraction, occurring during meiosis between deletion-inducing scnRNAs and antagonistic transcripts from the maternal macronucleus, regulates rearrangements of the zygotic genome. PMID:18519642
Small non-coding RNAs in streptomycetes.

PubMed

Heueis, Nona; Vockenhuber, Michael-Paul; Suess, Beatrix

2014-01-01

Streptomycetes are Gram-positive, GC-rich, soil dwelling bacteria, occurring ubiquitary throughout nature. They undergo extensive morphological changes from spores to filamentous mycelia and produce a plethora of secondary metabolites. Owing to their complex life cycle, streptomycetes require efficient regulatory machinery for the control of gene expression. Therefore, they possess a large diversity of regulators. Within this review we summarize the current knowledge about the importance of small non-coding RNA for the control of gene expression in these organisms.
Epigenetic inactivation of the p53-induced long noncoding RNA TP53 target 1 in human cancer

PubMed Central

Diaz-Lagares, Angel; Crujeiras, Ana B.; Lopez-Serra, Paula; Soler, Marta; Setien, Fernando; Goyal, Ashish; Sandoval, Juan; Hashimoto, Yutaka; Martinez-Cardús, Anna; Gomez, Antonio; Heyn, Holger; Moutinho, Catia; Espada, Jesús; Vidal, August; Paúles, Maria; Galán, Maica; Sala, Núria; Akiyama, Yoshimitsu; Martínez-Iniesta, María; Farré, Lourdes; Villanueva, Alberto; Gross, Matthias; Diederichs, Sven; Guil, Sonia; Esteller, Manel

2016-01-01

Long noncoding RNAs (lncRNAs) are important regulators of cellular homeostasis. However, their contribution to the cancer phenotype still needs to be established. Herein, we have identified a p53-induced lncRNA, TP53TG1, that undergoes cancer-specific promoter hypermethylation-associated silencing. In vitro and in vivo assays identify a tumor-suppressor activity for TP53TG1 and a role in the p53 response to DNA damage. Importantly, we show that TP53TG1 binds to the multifaceted DNA/RNA binding protein YBX1 to prevent its nuclear localization and thus the YBX1-mediated activation of oncogenes. TP53TG1 epigenetic inactivation in cancer cells releases the transcriptional repression of YBX1-targeted growth-promoting genes and creates a chemoresistant tumor. TP53TG1 hypermethylation in primary tumors is shown to be associated with poor outcome. The epigenetic loss of TP53TG1 therefore represents an altered event in an lncRNA that is linked to classical tumoral pathways, such as p53 signaling, but is also connected to regulatory networks of the cancer cell. PMID:27821766
Transcriptome interrogation of human myometrium identifies differentially expressed sense-antisense pairs of protein-coding and long non-coding RNA genes in spontaneous labor at term.

PubMed

Romero, Roberto; Tarca, Adi L; Chaemsaithong, Piya; Miranda, Jezid; Chaiworapongsa, Tinnakorn; Jia, Hui; Hassan, Sonia S; Kalita, Cynthia A; Cai, Juan; Yeo, Lami; Lipovich, Leonard

2014-09-01

To identify differentially expressed long non-coding RNA (lncRNA) genes in human myometrium in women with spontaneous labor at term. Myometrium was obtained from women undergoing cesarean deliveries who were not in labor (n = 19) and women in spontaneous labor at term (n = 20). RNA was extracted and profiled using an Illumina® microarray platform. We have used computational approaches to bound the extent of long non-coding RNA representation on this platform, and to identify co-differentially expressed and correlated pairs of long non-coding RNA genes and protein-coding genes sharing the same genomic loci. We identified co-differential expression and correlation at two genomic loci that contain coding-lncRNA gene pairs: SOCS2-AK054607 and LMCD1-NR_024065 in women in spontaneous labor at term. This co-differential expression and correlation was validated by qRT-PCR, an experimental method completely independent of the microarray analysis. Intriguingly, one of the two lncRNA genes differentially expressed in term labor had a key genomic structure element, a splice site, that lacked evolutionary conservation beyond primates. We provide, for the first time, evidence for coordinated differential expression and correlation of cis-encoded antisense lncRNAs and protein-coding genes with known as well as novel roles in pregnancy in the myometrium of women in spontaneous labor at term.
RNA regulatory networks in animals and plants: a long noncoding RNA perspective.

PubMed

Bai, Youhuang; Dai, Xiaozhuan; Harrison, Andrew P; Chen, Ming

2015-03-01

A recent highlight of genomics research has been the discovery of many families of transcripts which have function but do not code for proteins. An important group is long noncoding RNAs (lncRNAs), which are typically longer than 200 nt, and whose members originate from thousands of loci across genomes. We review progress in understanding the biogenesis and regulatory mechanisms of lncRNAs. We describe diverse computational and high throughput technologies for identifying and studying lncRNAs. We discuss the current knowledge of functional elements embedded in lncRNAs as well as insights into the lncRNA-based regulatory network in animals. We also describe genome-wide studies of large amount of lncRNAs in plants, as well as knowledge of selected plant lncRNAs with a focus on biotic/abiotic stress-responsive lncRNAs. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Long non-coding RNAs as molecular players in plant defense against pathogens.

PubMed

Zaynab, Madiha; Fatima, Mahpara; Abbas, Safdar; Umair, Muhammad; Sharif, Yasir; Raza, Muhammad Ammar

2018-05-31

Long non-coding RNAs (lncRNAs) has significant role in of gene expression and silencing pathways for several biological processes in eukaryotes. lncRNAs has been reported as key player in remodeling chromatin and genome architecture, RNA stabilization and transcription regulation, including enhancer-associated activity. Host lncRNAs are reckoned as compulsory elements of plant defense. In response to pathogen attack, plants protect themselves with the help of lncRNAs -dependent immune systems in which lncRNAs regulate pathogen-associated molecular patterns (PAMPs) and other effectors. Role of lncRNAs in plant microbe interaction has been studied extensively but regulations of several lncRNAs still need extensive research. In this study we discussed and provide as overview the topical advancements and findings relevant to pathogen attack and plant defense mediated by lncRNAs. It is hoped that lncRNAs would be exploited as a mainstream player to achieve food security by tackling different plant diseases. Copyright © 2018. Published by Elsevier Ltd.
SHARAKU: an algorithm for aligning and clustering read mapping profiles of deep sequencing in non-coding RNA processing.

PubMed

Tsuchiya, Mariko; Amano, Kojiro; Abe, Masaya; Seki, Misato; Hase, Sumitaka; Sato, Kengo; Sakakibara, Yasubumi

2016-06-15

Deep sequencing of the transcripts of regulatory non-coding RNA generates footprints of post-transcriptional processes. After obtaining sequence reads, the short reads are mapped to a reference genome, and specific mapping patterns can be detected called read mapping profiles, which are distinct from random non-functional degradation patterns. These patterns reflect the maturation processes that lead to the production of shorter RNA sequences. Recent next-generation sequencing studies have revealed not only the typical maturation process of miRNAs but also the various processing mechanisms of small RNAs derived from tRNAs and snoRNAs. We developed an algorithm termed SHARAKU to align two read mapping profiles of next-generation sequencing outputs for non-coding RNAs. In contrast with previous work, SHARAKU incorporates the primary and secondary sequence structures into an alignment of read mapping profiles to allow for the detection of common processing patterns. Using a benchmark simulated dataset, SHARAKU exhibited superior performance to previous methods for correctly clustering the read mapping profiles with respect to 5'-end processing and 3'-end processing from degradation patterns and in detecting similar processing patterns in deriving the shorter RNAs. Further, using experimental data of small RNA sequencing for the common marmoset brain, SHARAKU succeeded in identifying the significant clusters of read mapping profiles for similar processing patterns of small derived RNA families expressed in the brain. The source code of our program SHARAKU is available at http://www.dna.bio.keio.ac.jp/sharaku/, and the simulated dataset used in this work is available at the same link. Accession code: The sequence data from the whole RNA transcripts in the hippocampus of the left brain used in this work is available from the DNA DataBank of Japan (DDBJ) Sequence Read Archive (DRA) under the accession number DRA004502. yasu@bio.keio.ac.jp Supplementary data are available
Long Non-Coding RNA Emergence During Renal Cell Carcinoma Tumorigenesis.

PubMed

Liu, Xiaobing; Hao, Yaxing; Yu, Wei; Yang, Xia; Luo, Xing; Zhao, Jiang; Li, Jia; Hu, Xiaoyan; Li, Longkun

2018-05-22

Renal cell carcinoma (RCC) is the most common kidney cancer diagnosed across the globe and has steadily increased in incidence in recent decades. Techniques for diagnosing or treating RCC are limited, and confined mostly to later stages of the disease. Almost all RCC pathological types are resistant to chemotherapeutics and radiation therapy. To this effect, new markers for diagnosis and target therapy are urgently needed. Advanced genome sequencing technologies have revealed long non-coding RNAs (lncRNAs) as a novel marker, transcribed throughout the human genome. The emergence of lncRNAs is an aberrant expression and is involved in the tumorigenesis of RCC. LncRNAs drive cancer phenotypes through their interaction with other cellular macromolecules including DNA, protein, and RNA. Recent research on lncRNA molecular mechanisms has revealed new markers to functionally annotate these cancers' associated transcripts, making them targets for effective diagnosis and therapeutic intervention in the fight against cancer. In this review, we first highlight the common mechanisms that underlie aberrant lncRNA expression in RCC. We go on to discuss the potential translational application of lncRNA research in the diagnosis, prognosis, and treatment of RCC. © 2018 The Author(s). Published by S. Karger AG, Basel.
Genome-Wide Discovery of Long Non-Coding RNAs in Rainbow Trout.

PubMed

Al-Tobasei, Rafet; Paneru, Bam; Salem, Mohamed

2016-01-01

The ENCODE project revealed that ~70% of the human genome is transcribed. While only 1-2% of the RNAs encode for proteins, the rest are non-coding RNAs. Long non-coding RNAs (lncRNAs) form a diverse class of non-coding RNAs that are longer than 200 nt. Emerging evidence indicates that lncRNAs play critical roles in various cellular processes including regulation of gene expression. LncRNAs show low levels of gene expression and sequence conservation, which make their computational identification in genomes difficult. In this study, more than two billion Illumina sequence reads were mapped to the genome reference using the TopHat and Cufflinks software. Transcripts shorter than 200 nt, with more than 83-100 amino acids ORF, or with significant homologies to the NCBI nr-protein database were removed. In addition, a computational pipeline was used to filter the remaining transcripts based on a protein-coding-score test. Depending on the filtering stringency conditions, between 31,195 and 54,503 lncRNAs were identified, with only 421 matching known lncRNAs in other species. A digital gene expression atlas revealed 2,935 tissue-specific and 3,269 ubiquitously-expressed lncRNAs. This study annotates the lncRNA rainbow trout genome and provides a valuable resource for functional genomics research in salmonids.
Primer retention owing to the absence of RNase H1 is catastrophic for mitochondrial DNA replication.

PubMed

Holmes, J Bradley; Akman, Gokhan; Wood, Stuart R; Sakhuja, Kiran; Cerritelli, Susana M; Moss, Chloe; Bowmaker, Mark R; Jacobs, Howard T; Crouch, Robert J; Holt, Ian J

2015-07-28

Encoding ribonuclease H1 (RNase H1) degrades RNA hybridized to DNA, and its function is essential for mitochondrial DNA maintenance in the developing mouse. Here we define the role of RNase H1 in mitochondrial DNA replication. Analysis of replicating mitochondrial DNA in embryonic fibroblasts lacking RNase H1 reveals retention of three primers in the major noncoding region (NCR) and one at the prominent lagging-strand initiation site termed Ori-L. Primer retention does not lead immediately to depletion, as the persistent RNA is fully incorporated in mitochondrial DNA. However, the retained primers present an obstacle to the mitochondrial DNA polymerase γ in subsequent rounds of replication and lead to the catastrophic generation of a double-strand break at the origin when the resulting gapped molecules are copied. Hence, the essential role of RNase H1 in mitochondrial DNA replication is the removal of primers at the origin of replication.
Colon Cancer-Upregulated Long Non-Coding RNA lincDUSP Regulates Cell Cycle Genes and Potentiates Resistance to Apoptosis.

PubMed

Forrest, Megan E; Saiakhova, Alina; Beard, Lydia; Buchner, David A; Scacheri, Peter C; LaFramboise, Thomas; Markowitz, Sanford; Khalil, Ahmad M

2018-05-09

Long non-coding RNAs (lncRNAs) are frequently dysregulated in many human cancers. We sought to identify candidate oncogenic lncRNAs in human colon tumors by utilizing RNA sequencing data from 22 colon tumors and 22 adjacent normal colon samples from The Cancer Genome Atlas (TCGA). The analysis led to the identification of ~200 differentially expressed lncRNAs. Validation in an independent cohort of normal colon and patient-derived colon cancer cell lines identified a novel lncRNA, lincDUSP, as a potential candidate oncogene. Knockdown of lincDUSP in patient-derived colon tumor cell lines resulted in significantly decreased cell proliferation and clonogenic potential, and increased susceptibility to apoptosis. The knockdown of lincDUSP affects the expression of ~800 genes, and NCI pathway analysis showed enrichment of DNA damage response and cell cycle control pathways. Further, identification of lincDUSP chromatin occupancy sites by ChIRP-Seq demonstrated association with genes involved in the replication-associated DNA damage response and cell cycle control. Consistent with these findings, lincDUSP knockdown in colon tumor cell lines increased both the accumulation of cells in early S-phase and γH2AX foci formation, indicating increased DNA damage response induction. Taken together, these results demonstrate a key role of lincDUSP in the regulation of important pathways in colon cancer.
Long noncoding RNA H19 interacts with polypyrimidine tract-binding protein 1 to reprogram hepatic lipid homeostasis.

PubMed

Liu, Chune; Yang, Zhihong; Wu, Jianguo; Zhang, Li; Lee, Sangmin; Shin, Dong-Ju; Tran, Melanie; Wang, Li

2018-05-01

H19 is an imprinted long noncoding RNA abundantly expressed in embryonic liver and repressed after birth. We show that H19 serves as a lipid sensor by synergizing with the RNA-binding polypyrimidine tract-binding protein 1 (PTBP1) to modulate hepatic metabolic homeostasis. H19 RNA interacts with PTBP1 to facilitate its association with sterol regulatory element-binding protein 1c mRNA and protein, leading to increased stability and nuclear transcriptional activity. H19 and PTBP1 are up-regulated by fatty acids in hepatocytes and in diet-induced fatty liver, which further augments lipid accumulation. Ectopic expression of H19 induces steatosis and pushes the liver into a "pseudo-fed" state in response to fasting by promoting sterol regulatory element-binding protein 1c protein cleavage and nuclear translocation. Deletion of H19 or knockdown of PTBP1 abolishes high-fat and high-sucrose diet-induced steatosis. Our study unveils an H19/PTBP1/sterol regulatory element-binding protein 1 feedforward amplifying signaling pathway to exacerbate the development of fatty liver. (Hepatology 2018;67:1768-1783). © 2017 by the American Association for the Study of Liver Diseases.
Cis-encoded non-coding antisense RNAs in streptococci and other low GC Gram (+) bacterial pathogens

PubMed Central

Cho, Kyu Hong; Kim, Jeong-Ho

2015-01-01

Due to recent advances of bioinformatics and high throughput sequencing technology, discovery of regulatory non-coding RNAs in bacteria has been increased to a great extent. Based on this bandwagon, many studies searching for trans-acting small non-coding RNAs in streptococci have been performed intensively, especially in the important human pathogen, group A and B streptococci. However, studies for cis-encoded non-coding antisense RNAs in streptococci have been scarce. A recent study shows antisense RNAs are involved in virulence gene regulation in group B streptococcus, S. agalactiae. This suggests antisense RNAs could have important roles in the pathogenesis of streptococcal pathogens. In this review, we describe recent discoveries of chromosomal cis-encoded antisense RNAs in streptococcal pathogens and other low GC Gram (+) bacteria to provide a guide for future studies. PMID:25859258
DNA-Damage Response RNA-Binding Proteins (DDRBPs): Perspectives from a New Class of Proteins and Their RNA Targets.

PubMed

Dutertre, Martin; Vagner, Stéphan

2017-10-27

Upon DNA damage, cells trigger an early DNA-damage response (DDR) involving DNA repair and cell cycle checkpoints, and late responses involving gene expression regulation that determine cell fate. Screens for genes involved in the DDR have found many RNA-binding proteins (RBPs), while screens for novel RBPs have identified DDR proteins. An increasing number of RBPs are involved in early and/or late DDR. We propose to call this new class of actors of the DDR, which contain an RNA-binding activity, DNA-damage response RNA-binding proteins (DDRBPs). We then discuss how DDRBPs contribute not only to gene expression regulation in the late DDR but also to early DDR signaling, DNA repair, and chromatin modifications at DNA-damage sites through interactions with both long and short noncoding RNAs. Copyright © 2016 Elsevier Ltd. All rights reserved.
A benchmark study of scoring methods for non-coding mutations.

PubMed

Drubay, Damien; Gautheret, Daniel; Michiels, Stefan

2018-05-15

Detailed knowledge of coding sequences has led to different candidate models for pathogenic variant prioritization. Several deleteriousness scores have been proposed for the non-coding part of the genome, but no large-scale comparison has been realized to date to assess their performance. We compared the leading scoring tools (CADD, FATHMM-MKL, Funseq2 and GWAVA) and some recent competitors (DANN, SNP and SOM scores) for their ability to discriminate assumed pathogenic variants from assumed benign variants (using the ClinVar, COSMIC and 1000 genomes project databases). Using the ClinVar benchmark, CADD was the best tool for detecting the pathogenic variants that are mainly located in protein coding gene regions. Using the COSMIC benchmark, FATHMM-MKL, GWAVA and SOMliver outperformed the other tools for pathogenic variants that are typically located in lincRNAs, pseudogenes and other parts of the non-coding genome. However, all tools had low precision, which could potentially be improved by future non-coding genome feature discoveries. These results may have been influenced by the presence of potential benign variants in the COSMIC database. The development of a gold standard as consistent as ClinVar for these regions will be necessary to confirm our tool ranking. The Snakemake, C++ and R codes are freely available from https://github.com/Oncostat/BenchmarkNCVTools and supported on Linux. damien.drubay@gustaveroussy.fr or stefan.michiels@gustaveroussy.fr. Supplementary data are available at Bioinformatics online.
nRC: non-coding RNA Classifier based on structural features.

PubMed

Fiannaca, Antonino; La Rosa, Massimo; La Paglia, Laura; Rizzo, Riccardo; Urso, Alfonso

2017-01-01

Non-coding RNA (ncRNA) are small non-coding sequences involved in gene expression regulation of many biological processes and diseases. The recent discovery of a large set of different ncRNAs with biologically relevant roles has opened the way to develop methods able to discriminate between the different ncRNA classes. Moreover, the lack of knowledge about the complete mechanisms in regulative processes, together with the development of high-throughput technologies, has required the help of bioinformatics tools in addressing biologists and clinicians with a deeper comprehension of the functional roles of ncRNAs. In this work, we introduce a new ncRNA classification tool, nRC (non-coding RNA Classifier). Our approach is based on features extraction from the ncRNA secondary structure together with a supervised classification algorithm implementing a deep learning architecture based on convolutional neural networks. We tested our approach for the classification of 13 different ncRNA classes. We obtained classification scores, using the most common statistical measures. In particular, we reach an accuracy and sensitivity score of about 74%. The proposed method outperforms other similar classification methods based on secondary structure features and machine learning algorithms, including the RNAcon tool that, to date, is the reference classifier. nRC tool is freely available as a docker image at https://hub.docker.com/r/tblab/nrc/. The source code of nRC tool is also available at https://github.com/IcarPA-TBlab/nrc.
Noncoding RNPs of viral origin.

PubMed

Steitz, Joan; Borah, Sumit; Cazalla, Demian; Fok, Victor; Lytle, Robin; Mitton-Fry, Rachel; Riley, Kasandra; Samji, Tasleem

2011-03-01

Like their host cells, many viruses produce noncoding (nc)RNAs. These show diversity with respect to time of expression during viral infection, length and structure, protein-binding partners and relative abundance compared with their host-cell counterparts. Viruses, with their limited genomic capacity, presumably evolve or acquire ncRNAs only if they selectively enhance the viral life cycle or assist the virus in combating the host's response to infection. Despite much effort, identifying the functions of viral ncRNAs has been extremely challenging. Recent technical advances and enhanced understanding of host-cell ncRNAs promise accelerated insights into the RNA warfare mounted by this fascinating class of RNPs.

Spliced DNA Sequences in the Paramecium Germline: Their Properties and Evolutionary Potential

PubMed Central

Catania, Francesco; McGrath, Casey L.; Doak, Thomas G.; Lynch, Michael

2013-01-01

Despite playing a crucial role in germline-soma differentiation, the evolutionary significance of developmentally regulated genome rearrangements (DRGRs) has received scant attention. An example of DRGR is DNA splicing, a process that removes segments of DNA interrupting genic and/or intergenic sequences. Perhaps, best known for shaping immune-system genes in vertebrates, DNA splicing plays a central role in the life of ciliated protozoa, where thousands of germline DNA segments are eliminated after sexual reproduction to regenerate a functional somatic genome. Here, we identify and chronicle the properties of 5,286 sequences that putatively undergo DNA splicing (i.e., internal eliminated sequences [IESs]) across the genomes of three closely related species of the ciliate Paramecium (P. tetraurelia, P. biaurelia, and P. sexaurelia). The study reveals that these putative IESs share several physical characteristics. Although our results are consistent with excision events being largely conserved between species, episodes of differential IES retention/excision occur, may have a recent origin, and frequently involve coding regions. Our findings indicate interconversion between somatic—often coding—DNA sequences and noncoding IESs, and provide insights into the role of DNA splicing in creating potentially functional genetic innovation. PMID:23737328
Non-coding RNA in cystic fibrosis.

PubMed

Glasgow, Arlene M A; De Santi, Chiara; Greene, Catherine M

2018-05-09

Non-coding RNAs (ncRNAs) are an abundant class of RNAs that include small ncRNAs, long non-coding RNAs (lncRNA) and pseudogenes. The human ncRNA atlas includes thousands of these specialised RNA molecules that are further subcategorised based on their size or function. Two of the more well-known and widely studied ncRNA species are microRNAs (miRNAs) and lncRNAs. These are regulatory RNAs and their altered expression has been implicated in the pathogenesis of a variety of human diseases. Failure to express a functional cystic fibrosis (CF) transmembrane receptor (CFTR) chloride ion channel in epithelial cells underpins CF. Secondary to the CFTR defect, it is known that other pathways can be altered and these may contribute to the pathophysiology of CF lung disease in particular. For example, quantitative alterations in expression of some ncRNAs are associated with CF. In recent years, there has been a series of published studies exploring ncRNA expression and function in CF. The majority have focussed principally on miRNAs, with just a handful of reports to date on lncRNAs. The present study reviews what is currently known about ncRNA expression and function in CF, and discusses the possibility of applying this knowledge to the clinical management of CF in the near future. © 2018 The Author(s). Published by Portland Press Limited on behalf of the Biochemical Society.
Silencing Effect of Hominoid Highly Conserved Noncoding Sequences on Embryonic Brain Development

PubMed Central

Mahmoudi Saber, Morteza

2017-01-01

Abstract Superfamily Hominoidea, which consists of Hominidae (humans and great apes) and Hylobatidae (gibbons), is well-known for sharing human-like characteristics, however, the genomic origins of these shared unique phenotypes have mainly remained elusive. To decipher the underlying genomic basis of Hominoidea-restricted phenotypes, we identified and characterized Hominoidea-restricted highly conserved noncoding sequences (HCNSs) that are a class of potential regulatory elements which may be involved in evolution of lineage-specific phenotypes. We discovered 679 such HCNSs from human, chimpanzee, gorilla, orangutan and gibbon genomes. These HCNSs were demonstrated to be under purifying selection but with lineage-restricted characteristics different from old CNSs. A significant proportion of their ancestral sequences had accelerated rates of nucleotide substitutions, insertions and deletions during the evolution of common ancestor of Hominoidea, suggesting the intervention of positive Darwinian selection for creating those HCNSs. In contrary to enhancer elements and similar to silencer sequences, these Hominoidea-restricted HCNSs are located in close proximity of transcription start sites. Their target genes are enriched in the nervous system, development and transcription, and they tend to be remotely located from the nearest coding gene. Chip-seq signals and gene expression patterns suggest that Hominoidea-restricted HCNSs are likely to be functional regulatory elements by imposing silencing effects on their target genes in a tissue-restricted manner during fetal brain development. These HCNSs, emerged through adaptive evolution and conserved through purifying selection, represent a set of promising targets for future functional studies of the evolution of Hominoidea-restricted phenotypes. PMID:28633494
Extensive Evolutionary Changes in Regulatory Element Activity during Human Origins Are Associated with Altered Gene Expression and Positive Selection

PubMed Central

Fedrigo, Olivier; Babbitt, Courtney C.; Wortham, Matthew; Tewari, Alok K.; London, Darin; Song, Lingyun; Lee, Bum-Kyu; Iyer, Vishwanath R.; Parker, Stephen C. J.; Margulies, Elliott H.; Wray, Gregory A.; Furey, Terrence S.; Crawford, Gregory E.

2012-01-01

Understanding the molecular basis for phenotypic differences between humans and other primates remains an outstanding challenge. Mutations in non-coding regulatory DNA that alter gene expression have been hypothesized as a key driver of these phenotypic differences. This has been supported by differential gene expression analyses in general, but not by the identification of specific regulatory elements responsible for changes in transcription and phenotype. To identify the genetic source of regulatory differences, we mapped DNaseI hypersensitive (DHS) sites, which mark all types of active gene regulatory elements, genome-wide in the same cell type isolated from human, chimpanzee, and macaque. Most DHS sites were conserved among all three species, as expected based on their central role in regulating transcription. However, we found evidence that several hundred DHS sites were gained or lost on the lineages leading to modern human and chimpanzee. Species-specific DHS site gains are enriched near differentially expressed genes, are positively correlated with increased transcription, show evidence of branch-specific positive selection, and overlap with active chromatin marks. Species-specific sequence differences in transcription factor motifs found within these DHS sites are linked with species-specific changes in chromatin accessibility. Together, these indicate that the regulatory elements identified here are genetic contributors to transcriptional and phenotypic differences among primate species. PMID:22761590
The molecular dynamics of long noncoding RNA control of transcription in PTEN and its pseudogene

PubMed Central

Lister, Nicholas; Shevchenko, Galina; Walshe, James L.; Groen, Jessica; Johnsson, Per; Vidarsdóttir, Linda; Grander, Dan; Ataide, Sandro F.; Morris, Kevin V.

2017-01-01

RNA has been found to interact with chromatin and modulate gene transcription. In human cells, little is known about how long noncoding RNAs (lncRNAs) interact with target loci in the context of chromatin. We find here, using the phosphatase and tensin homolog (PTEN) pseudogene as a model system, that antisense lncRNAs interact first with a 5′ UTR-containing promoter-spanning transcript, which is then followed by the recruitment of DNA methyltransferase 3a (DNMT3a), ultimately resulting in the transcriptional and epigenetic control of gene expression. Moreover, we find that the lncRNA and promoter-spanning transcript interaction are based on a combination of structural and sequence components of the antisense lncRNA. These observations suggest, on the basis of this one example, that evolutionary pressures may be placed on RNA structure more so than sequence conservation. Collectively, the observations presented here suggest a much more complex and vibrant RNA regulatory world may be operative in the regulation of gene expression. PMID:28847966
DNA-RNA hybrid formation mediates RNAi-directed heterochromatin formation.

PubMed

Nakama, Mina; Kawakami, Kei; Kajitani, Takuya; Urano, Takeshi; Murakami, Yota

2012-03-01

Certain noncoding RNAs (ncRNAs) implicated in the regulation of chromatin structure associate with chromatin. During the formation of RNAi-directed heterochromatin in fission yeast, ncRNAs transcribed from heterochromatin are thought to recruit the RNAi machinery to chromatin for the formation of heterochromatin; however, the molecular details of this association are not clear. Here, using RNA immunoprecipitation assay, we showed that the heterochromatic ncRNA was associated with chromatin via the formation of a DNA-RNA hybrid and bound to the RNA-induced transcriptional silencing (RITS) complex. The presence of DNA-RNA hybrid in the cell was also confirmed by immunofluorescence analysis using anti-DNA-RNA hybrid antibody. Over-expression and depletion of RNase H in vivo decreased and increased the amount of DNA-RNA hybrid formed, respectively, and both disturbed heterochromatin. Moreover, DNA-RNA hybrid was formed on, and over-expression of RNase H inhibited the formation of, artificial heterochromatin induced by tethering of RITS to mRNA. These results indicate that heterochromatic ncRNAs are retained on chromatin via the formation of DNA-RNA hybrids and provide a platform for the RNAi-directed heterochromatin assembly and suggest that DNA-RNA hybrid formation plays a role in chromatic ncRNA function. © 2012 The Authors. Journal compilation © 2012 by the Molecular Biology Society of Japan/Blackwell Publishing Ltd.
A novel regulatory element (E77) isolated from CHO-K1 genomic DNA enhances stable gene expression in Chinese hamster ovary cells.

PubMed

Kang, Shin-Young; Kim, Yeon-Gu; Kang, Seunghee; Lee, Hong Weon; Lee, Eun Gyo

2016-05-01

Vectors flanked by regulatory DNA elements have been used to generate stable cell lines with high productivity and transgene stability; however, regulatory elements in Chinese hamster ovary (CHO) cells, which are the most widely used mammalian cells in biopharmaceutical production, are still poorly understood. We isolated a novel gene regulatory element from CHO-K1 cells, designated E77, which was found to enhance the stable expression of a transgene. A genomic library was constructed by combining CHO-K1 genomic DNA fragments with a CMV promoter-driven GFP expression vector, and the E77 element was isolated by screening. The incorporation of the E77 regulatory element resulted in the generation of an increased number of clones with high expression, thereby enhancing the expression level of the transgene in the stable transfectant cell pool. Interestingly, the E77 element was found to consist of two distinct fragments derived from different locations in the CHO genome shotgun sequence. High and stable transgene expression was obtained in transfected CHO cells by combining these fragments. Additionally, the function of E77 was found to be dependent on its site of insertion and specific orientation in the vector construct. Our findings demonstrate that stable gene expression mediated by the CMV promoter in CHO cells may be improved by the isolated novel gene regulatory element E77 identified in the present study. © 2016 The Authors. Biotechnology Journal published by WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
An expanding universe of noncoding RNAs.

PubMed

Storz, Gisela

2002-05-17

Noncoding RNAs (ncRNAs) have been found to have roles in a great variety of processes, including transcriptional regulation, chromosome replication, RNA processing and modification, messenger RNA stability and translation, and even protein degradation and translocation. Recent studies indicate that ncRNAs are far more abundant and important than initially imagined. These findings raise several fundamental questions: How many ncRNAs are encoded by a genome? Given the absence of a diagnostic open reading frame, how can these genes be identified? How can all the functions of ncRNAs be elucidated?
DNA sequence analysis of ARS elements from chromosome III of Saccharomyces cerevisiae: identification of a new conserved sequence.

PubMed Central

Palzkill, T G; Oliver, S G; Newlon, C S

1986-01-01

Four fragments of Saccharomyces cerevisiae chromosome III DNA which carry ARS elements have been sequenced. Each fragment contains multiple copies of sequences that have at least 10 out of 11 bases of homology to a previously reported 11 bp core consensus sequence. A survey of these new ARS sequences and previously reported sequences revealed the presence of an additional 11 bp conserved element located on the 3' side of the T-rich strand of the core consensus. Subcloning analysis as well as deletion and transposon insertion mutagenesis of ARS fragments support a role for 3' conserved sequence in promoting ARS activity. PMID:3529036
Roles of long non-coding RNAs in gastric cancer metastasis

PubMed Central

Yang, Zi-Guo; Gao, Ling; Guo, Xiao-Bo; Shi, Yu-Long

2015-01-01

Gastric cancer is the second leading cause of cancer-related deaths. Metastasis, which is an important element of gastric cancer, leads to a high mortality rate and to a poor prognosis. Gastric cancer metastasis has a complex progression that involves multiple biological processes. The comprehensive mechanisms of metastasis remain unclear, though traditional regulation modulates the molecular functions associated with metastasis. Long non-coding RNAs (lncRNAs) have a role in different gene regulatory pathways by epigenetic modification and by transcriptional and post-transcription regulation. lncRNAs participate in various diseases, including Alzheimer’s disease, cardiovascular disease, and cancer. The altered expressions of certain lncRNAs are linked to gastric cancer metastasis and invasion, as with tumor suppressor genes or oncogenes. Studies have partly elucidated the roles of lncRNAs as biomarkers and in therapies, as well as their gene regulatory mechanisms. However, comprehensive knowledge regarding the functional mechanisms of gene regulation in metastatic gastric cancer remains scarce. To provide a theoretical basis for therapeutic intervention in metastatic gastric cancer, we reviewed the functions of lncRNAs and their regulatory roles in gastric cancer metastasis. PMID:25954095
The mitochondrial genome of the gymnosperm Cycas taitungensis contains a novel family of short interspersed elements, Bpu sequences, and abundant RNA editing sites.

PubMed

Chaw, Shu-Miaw; Shih, Arthur Chun-Chieh; Wang, Daryi; Wu, Yu-Wei; Liu, Shu-Mei; Chou, The-Yuan

2008-03-01

The mtDNA of Cycas taitungensis is a circular molecule of 414,903 bp, making it 2- to 6-fold larger than the known mtDNAs of charophytes and bryophytes, but similar to the average of 7 elucidated angiosperm mtDNAs. It is characterized by abundant RNA editing sites (1,084), more than twice the number found in the angiosperm mtDNAs. The A + T content of Cycas mtDNA is 53.1%, the lowest among known land plants. About 5% of the Cycas mtDNA is composed of a novel family of mobile elements, which we designated as "Bpu sequences." They share a consensus sequence of 36 bp with 2 terminal direct repeats (AAGG) and a recognition site for the Bpu 10I restriction endonuclease (CCTGAAGC). Comparison of the Cycas mtDNA with other plant mtDNAs revealed many new insights into the biology and evolution of land plant mtDNAs. For example, the noncoding sequences in mtDNAs have drastically expanded as land plants have evolved, with abrupt increases appearing in the bryophytes, and then in the seed plants. As a result, the genomic organizations of seed plant mtDNAs are much less compact than in other plants. Also, the Cycas mtDNA appears to have been exempted from the frequent gene loss observed in angiosperm mtDNAs. Similar to the angiosperms, the 3 Cycas genes nad1, nad2, and nad5 are disrupted by 5 group II intron squences, which have brought the genes into trans-splicing arrangements. The evolutionary origin and invasion/duplication mechanism of the Bpu sequences in Cycas mtDNA are hypothesized and discussed.
Non-Coding RNAs in Castration-Resistant Prostate Cancer: Regulation of Androgen Receptor Signaling and Cancer Metabolism.

PubMed

Shih, Jing-Wen; Wang, Ling-Yu; Hung, Chiu-Lien; Kung, Hsing-Jien; Hsieh, Chia-Ling

2015-12-04

Hormone-refractory prostate cancer frequently relapses from therapy and inevitably progresses to a bone-metastatic status with no cure. Understanding of the molecular mechanisms conferring resistance to androgen deprivation therapy has the potential to lead to the discovery of novel therapeutic targets for type of prostate cancer with poor prognosis. Progression to castration-resistant prostate cancer (CRPC) is characterized by aberrant androgen receptor (AR) expression and persistent AR signaling activity. Alterations in metabolic activity regulated by oncogenic pathways, such as c-Myc, were found to promote prostate cancer growth during the development of CRPC. Non-coding RNAs represent a diverse family of regulatory transcripts that drive tumorigenesis of prostate cancer and various other cancers by their hyperactivity or diminished function. A number of studies have examined differentially expressed non-coding RNAs in each stage of prostate cancer. Herein, we highlight the emerging impacts of microRNAs and long non-coding RNAs linked to reactivation of the AR signaling axis and reprogramming of the cellular metabolism in prostate cancer. The translational implications of non-coding RNA research for developing new biomarkers and therapeutic strategies for CRPC are also discussed.
Non-Coding RNAs in Castration-Resistant Prostate Cancer: Regulation of Androgen Receptor Signaling and Cancer Metabolism

PubMed Central

Shih, Jing-Wen; Wang, Ling-Yu; Hung, Chiu-Lien; Kung, Hsing-Jien; Hsieh, Chia-Ling

2015-01-01

Hormone-refractory prostate cancer frequently relapses from therapy and inevitably progresses to a bone-metastatic status with no cure. Understanding of the molecular mechanisms conferring resistance to androgen deprivation therapy has the potential to lead to the discovery of novel therapeutic targets for type of prostate cancer with poor prognosis. Progression to castration-resistant prostate cancer (CRPC) is characterized by aberrant androgen receptor (AR) expression and persistent AR signaling activity. Alterations in metabolic activity regulated by oncogenic pathways, such as c-Myc, were found to promote prostate cancer growth during the development of CRPC. Non-coding RNAs represent a diverse family of regulatory transcripts that drive tumorigenesis of prostate cancer and various other cancers by their hyperactivity or diminished function. A number of studies have examined differentially expressed non-coding RNAs in each stage of prostate cancer. Herein, we highlight the emerging impacts of microRNAs and long non-coding RNAs linked to reactivation of the AR signaling axis and reprogramming of the cellular metabolism in prostate cancer. The translational implications of non-coding RNA research for developing new biomarkers and therapeutic strategies for CRPC are also discussed. PMID:26690121
Evolutionary Dynamics of 5S rDNA and Recurrent Association of Transposable Elements in Electric Fish of the Family Gymnotidae (Gymnotiformes): The Case of Gymnotus mamiraua.

PubMed

da Silva, Maelin; Barbosa, Patricia; Artoni, Roberto F; Feldberg, Eliana

2016-01-01

Gymnotidae is a family of electric fish endemic to the Neotropics consisting of 2 genera: Electrophorus and Gymnotus. The genus Gymnotus is widely distributed and is found in all of the major Brazilian river systems. Physical and molecular mapping data for the ribosomal DNA (rDNA) in this genus are still scarce, with its chromosomal location known in only 11 species. As other species of Gymnotus with 2n = 54 chromosomes from the Paraná-Paraguay basin, G. mamiraua was found to have a large number of 5S rDNA sites. Isolation and cloning of the 5S rDNA sequences from G. mamiraua identified a fragment of a transposable element similar to the Tc1/mariner transposon associated with a non-transcribed spacer. Double fluorescence in situ hybridization analysis of this element and the 5S rDNA showed that they were colocalized on several chromosomes, in addition to acting as nonsyntenic markers on others. Our data show the association between these sequences and suggest that the Tc1 retrotransposon may be the agent that drives the spread of these 5S rDNA-like sequences in the G. mamiraua genome. © 2016 S. Karger AG, Basel.
Targeting Promoter-Associated Noncoding RNA In Vivo.

PubMed

Civenni, Gianluca

2017-01-01

There are many classes of noncoding RNAs (ncRNAs), with wide-ranging functionalities (e.g., RNA editing, mediation of mRNA splicing, ribosomal function). MicroRNAs (miRNAs) and long ncRNAs (lncRNAs) are implicated in a wide variety of cellular processes, including the regulation of gene expression. Incorrect expression or mutation of lncRNAs has been reported to be associated with several disease conditions, such a malignant transformation in humans. Importantly, pivotal players in tumorigenesis and cancer progression, such as c-Myc, may be regulated by lncRNA at promoter level. The function of lncRNA can be reduced with antisense oligonucleotides that sequester or degrade mature lncRNAs. In alternative, lncRNA transcription can be blocked by small interference RNA (RNAi), which had acquired, recently, broad interested in clinical applications. In vivo-jetPEI™ is a linear polyethylenimine mediating nucleic acid (DNA, shRNA, siRNA, oligonucelotides) delivery with high efficiency. Different in vivo delivery routes have been validated: intravenous (IV), intraperitoneal (IP), intratumoral, subcutaneous, topical, and intrathecal. High levels of nucleic acid delivery are achieved into a broad range of tissues, such as lung, salivary glands, heart, spleen, liver, and prostate upon systemic administration. In addition, in vivo-jetPEI™ is also an efficient carrier for local gene and siRNA delivery such as intratumoral or topical application on the skin. After systemic injection, siRNA can be detected and the levels can be validated in target tissues by qRT-PCR. Targeting promoter-associated lncRNAs with siRNAs (small interfering RNAs) in vivo is becoming an exciting breakthrough for the treatment of human disease.
Discovering functional DNA elements using population genomic information: a proof of concept using human mtDNA.

PubMed

Schrider, Daniel R; Kern, Andrew D

2014-06-09

Identifying the complete set of functional elements within the human genome would be a windfall for multiple areas of biological research including medicine, molecular biology, and evolution. Complete knowledge of function would aid in the prioritization of loci when searching for the genetic bases of disease or adaptive phenotypes. Because mutations that disrupt function are disfavored by natural selection, purifying selection leaves a detectable signature within functional elements; accordingly, this signal has been exploited for over a decade through the use of genomic comparisons of distantly related species. While this is so, the functional complement of the genome changes extensively across time and between lineages; therefore, evidence of the current action of purifying selection in humans is essential. Because the removal of deleterious mutations by natural selection also reduces within-species genetic diversity within functional loci, dense population genetic data have the potential to reveal genomic elements that are currently functional. Here, we assess the potential of this approach by examining an ultradeep sample of human mitochondrial genomes (n = 16,411). We show that the high density of polymorphism in this data set precisely delineates regions experiencing purifying selection. Furthermore, we show that the number of segregating alleles at a site is strongly correlated with its divergence across species after accounting for known mutational biases in human mitochondrial DNA (ρ = 0.51; P < 2.2 × 10(-16)). These two measures track one another at a remarkably fine scale across many loci-a correlation that is purely the result of natural selection. Our results demonstrate that genetic variation has the potential to reveal with surprising precision which regions in the genome are currently performing important functions and likely to have deleterious fitness effects when mutated. As more complete human genomes are sequenced, similar power to reveal
Long Noncoding RNAs: New Players in the Osteogenic Differentiation of Bone Marrow- and Adipose-Derived Mesenchymal Stem Cells.

PubMed

Yang, Qiaolin; Jia, Lingfei; Li, Xiaobei; Guo, Runzhi; Huang, Yiping; Zheng, Yunfei; Li, Weiran

2018-06-01

Mesenchymal stem cells (MSCs) are an important population of multipotent stem cells that differentiate into multiple lineages and display great potential in bone regeneration and repair. Although the role of protein-coding genes in the osteogenic differentiation of MSCs has been extensively studied, the functions of noncoding RNAs in the osteogenic differentiation of MSCs are unclear. The recent application of next-generation sequencing to MSC transcriptomes has revealed that long noncoding RNAs (lncRNAs) are associated with the osteogenic differentiation of MSCs. LncRNAs are a class of non-coding transcripts of more than 200 nucleotides in length. Noncoding RNAs are thought to play a key role in osteoblast differentiation through various regulatory mechanisms including chromatin modification, transcription factor binding, competent endogenous mechanism, and other post-transcriptional mechanisms. Here, we review the roles of lncRNAs in the osteogenic differentiation of bone marrow- and adipose-derived stem cells and provide a theoretical foundation for future research.
The Most Deeply Conserved Noncoding Sequences in Plants Serve Similar Functions to Those in Vertebrates Despite Large Differences in Evolutionary Rates[W

PubMed Central

Burgess, Diane; Freeling, Michael

2014-01-01

In vertebrates, conserved noncoding elements (CNEs) are functionally constrained sequences that can show striking conservation over >400 million years of evolutionary distance and frequently are located megabases away from target developmental genes. Conserved noncoding sequences (CNSs) in plants are much shorter, and it has been difficult to detect conservation among distantly related genomes. In this article, we show not only that CNS sequences can be detected throughout the eudicot clade of flowering plants, but also that a subset of 37 CNSs can be found in all flowering plants (diverging ∼170 million years ago). These CNSs are functionally similar to vertebrate CNEs, being highly associated with transcription factor and development genes and enriched in transcription factor binding sites. Some of the most highly conserved sequences occur in genes encoding RNA binding proteins, particularly the RNA splicing–associated SR genes. Differences in sequence conservation between plants and animals are likely to reflect differences in the biology of the organisms, with plants being much more able to tolerate genomic deletions and whole-genome duplication events due, in part, to their far greater fecundity compared with vertebrates. PMID:24681619
Noncoding RNPs of Viral Origin

PubMed Central

Steitz, Joan; Borah, Sumit; Cazalla, Demian; Fok, Victor; Lytle, Robin; Mitton-Fry, Rachel; Riley, Kasandra; Samji, Tasleem

2011-01-01

SUMMARY Like their host cells, many viruses produce noncoding (nc)RNAs. These show diversity with respect to time of expression during viral infection, length and structure, protein-binding partners and relative abundance compared with their host-cell counterparts. Viruses, with their limited genomic capacity, presumably evolve or acquire ncRNAs only if they selectively enhance the viral life cycle or assist the virus in combating the host’s response to infection. Despite much effort, identifying the functions of viral ncRNAs has been extremely challenging. Recent technical advances and enhanced understanding of host-cell ncRNAs promise accelerated insights into the RNA warfare mounted by this fascinating class of RNPs. PMID:20719877
Uniformity of nucleosome preservation pattern in Mammalian sperm and its connection to repetitive DNA elements.

PubMed

Samans, Birgit; Yang, Yang; Krebs, Stefan; Sarode, Gaurav Vilas; Blum, Helmut; Reichenbach, Myriam; Wolf, Eckhard; Steger, Klaus; Dansranjavin, Temuujin; Schagdarsurengin, Undraga

2014-07-14

Nucleosome-to-protamine exchange during mammalian spermiogenesis is essential for compaction and protection of paternal DNA. It is interesting that, depending on the species, 1% to 15% of nucleosomes are retained, but the generalizability and biological function of this retention are unknown. Here, we show concordantly in human and bovine that nucleosomes remained in sperm chromatin predominantly within distal intergenic regions and introns and associated with centromere repeats and retrotransposons (LINE1 and SINEs). In contrast, nucleosome depletion concerned particularly exons, 5'-UTR, 3'-UTR, TSS, and TTS and was associated with simple and low-complexity repeats. Overlap of human and bovine genes exhibiting nucleosome preservation in the promoter and gene body revealed a significant enrichment of signal transduction and RNA- and protein-processing factors. Our study demonstrates the genome-wide uniformity of the nucleosome preservation pattern in mammalian sperm and its connection to repetitive DNA elements and suggests a function in preimplantation processes for paternally derived nucleosomes. Copyright © 2014 Elsevier Inc. All rights reserved.

On the roles of repetitive DNA elements in the context of a unified genomic-epigenetic system.

PubMed

von Sternberg, Richard

2002-12-01

Repetitive DNA sequences comprise a substantial portion of most eukaryotic and some prokaryotic chromosomes. Despite nearly forty years of research, the functions of various sequence families as a whole and their monomer units remain largely unknown. The inability to map specific functional roles onto many repetitive DNA elements (REs), coupled with the taxon-specificity of sequence families, have led many to speculate that these genomic components are "selfish" replicators generating genomic "junk." The purpose of this paper is to critically examine the selfishness, evolutionary effects, and functionality of REs. First, a brief overview of the range of ideas pertaining to RE function is presented. Second, the argument is presented that the selfish DNA "hypothesis" is actually a narrative scheme, that it serves to protect neo-Darwinian assumptions from criticism, and that this story is untestable and therefore not a hypothesis. Third, attempts to synthesize the selfish DNA concept with complex systems models of the genome and RE functionality are critiqued. Fourth, the supposed connection between RE-induced mutations and macroevolutionary events are stated to be at variance with empirical evidence and theoretical considerations. Hypotheses that base phylogenetic transitions in repetitive sequence changes thus remain speculative. Fifth and finally, the case is made for viewing REs as integrally functional components of chromosomes, genomes, and cells. It is argued throughout that a new conceptual framework is needed for understanding the roles of repetitive DNA in genomic/epigenetic systems, and that neo-Darwinian "narratives" have been the primary obstacle to elucidating the effects of these enigmatic components of chromosomes.
Genomic Heat Shock Element Sequences Drive Cooperative Human Heat Shock Factor 1 DNA Binding and Selectivity*

PubMed Central

Jaeger, Alex M.; Makley, Leah N.; Gestwicki, Jason E.; Thiele, Dennis J.

2014-01-01

The heat shock transcription factor 1 (HSF1) activates expression of a variety of genes involved in cell survival, including protein chaperones, the protein degradation machinery, anti-apoptotic proteins, and transcription factors. Although HSF1 activation has been linked to amelioration of neurodegenerative disease, cancer cells exhibit a dependence on HSF1 for survival. Indeed, HSF1 drives a program of gene expression in cancer cells that is distinct from that activated in response to proteotoxic stress, and HSF1 DNA binding activity is elevated in cycling cells as compared with arrested cells. Active HSF1 homotrimerizes and binds to a DNA sequence consisting of inverted repeats of the pentameric sequence nGAAn, known as heat shock elements (HSEs). Recent comprehensive ChIP-seq experiments demonstrated that the architecture of HSEs is very diverse in the human genome, with deviations from the consensus sequence in the spacing, orientation, and extent of HSE repeats that could influence HSF1 DNA binding efficacy and the kinetics and magnitude of target gene expression. To understand the mechanisms that dictate binding specificity, HSF1 was purified as either a monomer or trimer and used to evaluate DNA-binding site preferences in vitro using fluorescence polarization and thermal denaturation profiling. These results were compared with quantitative chromatin immunoprecipitation assays in vivo. We demonstrate a role for specific orientations of extended HSE sequences in driving preferential HSF1 DNA binding to target loci in vivo. These studies provide a biochemical basis for understanding differential HSF1 target gene recognition and transcription in neurodegenerative disease and in cancer. PMID:25204655
Strand invasion structures in the inverted repeat of Candida albicans mitochondrial DNA reveal a role for homologous recombination in replication.

PubMed

Gerhold, Joachim M; Aun, Anu; Sedman, Tiina; Jõers, Priit; Sedman, Juhan

2010-09-24

Molecular recombination and transcription are proposed mechanisms to initiate mitochondrial DNA (mtDNA) replication in yeast. We conducted a comprehensive analysis of mtDNA from the yeast Candida albicans. Two-dimensional agarose gel electrophoresis of mtDNA intermediates reveals no bubble structures diagnostic of specific replication origins, but rather supports recombination-driven replication initiation of mtDNA in yeast. Specific species of Y structures together with DNA copy number analyses of a C. albicans mutant strain provide evidence that a region in a mainly noncoding inverted repeat is predominantly involved in replication initiation via homologous recombination. Our further findings show that the C. albicans mtDNA forms a complex branched network that does not contain detectable amounts of circular molecules. We provide topological evidence for recombination-driven mtDNA replication initiation and introduce C. albicans as a suitable model organism to study wild-type mtDNA maintenance in yeast. Copyright © 2010 Elsevier Inc. All rights reserved.
Characterisation of a DNA sequence element that directs Dictyostelium stalk cell-specific gene expression.

PubMed

Ceccarelli, A; Zhukovskaya, N; Kawata, T; Bozzaro, S; Williams, J

2000-12-01

The ecmB gene of Dictyostelium is expressed at culmination both in the prestalk cells that enter the stalk tube and in ancillary stalk cell structures such as the basal disc. Stalk tube-specific expression is regulated by sequence elements within the cap-site proximal part of the promoter, the stalk tube (ST) promoter region. Dd-STATa, a member of the STAT transcription factor family, binds to elements present in the ST promoter-region and represses transcription prior to entry into the stalk tube. We have characterised an activatory DNA sequence element, that lies distal to the repressor elements and that is both necessary and sufficient for expression within the stalk tube. We have mapped this activator to a 28 nucleotide region (the 28-mer) within which we have identified a GA-containing sequence element that is required for efficient gene transcription. The Dd-STATa protein binds to the 28-mer in an in vitro binding assay, and binding is dependent upon the GA-containing sequence. However, the ecmB gene is expressed in a Dd-STATa null mutant, therefore Dd-STATa cannot be responsible for activating the 28-mer in vivo. Instead, we identified a distinct 28-mer binding activity in nuclear extracts from the Dd-STATa null mutant, the activity of this GA binding activity being largely masked in wild type extracts by the high affinity binding of the Dd-STATa protein. We suggest, that in addition to the long range repression exerted by binding to the two known repressor sites, Dd-STATa inhibits transcription by direct competition with this putative activator for binding to the GA sequence.
De Novo ORFs in Drosophila Are Important to Organismal Fitness and Evolved Rapidly from Previously Non-coding Sequences

PubMed Central

Reinhardt, Josephine A.; Wanjiru, Betty M.; Brant, Alicia T.; Saelao, Perot; Begun, David J.; Jones, Corbin D.

2013-01-01

How non-coding DNA gives rise to new protein-coding genes (de novo genes) is not well understood. Recent work has revealed the origins and functions of a few de novo genes, but common principles governing the evolution or biological roles of these genes are unknown. To better define these principles, we performed a parallel analysis of the evolution and function of six putatively protein-coding de novo genes described in Drosophila melanogaster. Reconstruction of the transcriptional history of de novo genes shows that two de novo genes emerged from novel long non-coding RNAs that arose at least 5 MY prior to evolution of an open reading frame. In contrast, four other de novo genes evolved a translated open reading frame and transcription within the same evolutionary interval suggesting that nascent open reading frames (proto-ORFs), while not required, can contribute to the emergence of a new de novo gene. However, none of the genes arose from proto-ORFs that existed long before expression evolved. Sequence and structural evolution of de novo genes was rapid compared to nearby genes and the structural complexity of de novo genes steadily increases over evolutionary time. Despite the fact that these genes are transcribed at a higher level in males than females, and are most strongly expressed in testes, RNAi experiments show that most of these genes are essential in both sexes during metamorphosis. This lethality suggests that protein coding de novo genes in Drosophila quickly become functionally important. PMID:24146629
Ets-1 promoter-associated noncoding RNA regulates the NONO/ERG/Ets-1 axis to drive gastric cancer progression.

PubMed

Li, Dan; Chen, Yajun; Mei, Hong; Jiao, Wanju; Song, Huajie; Ye, Lin; Fang, Erhu; Wang, Xiaojing; Yang, Feng; Huang, Kai; Zheng, Liduan; Tong, Qiangsong

2018-05-18

Emerging studies have indicated the essential functions of long noncoding RNAs (lncRNAs) during cancer progression. However, whether lncRNAs contribute to the upregulation of v-ets erythroblastosis virus E26 oncogene homolog 1 (Ets-1), an established oncogenic protein facilitating tumor invasion and metastasis, in gastric cancer remains elusive. Herein, we identified Ets-1 promoter-associated noncoding RNA (pancEts-1) as a novel lncRNA associated with the gastric cancer progression via mining of publicly available datasets and rapid amplification of cDNA ends. RNA pull-down, RNA immunoprecipitation, in vitro binding, and RNA electrophoretic mobility shift assays indicated the binding of pancEts-1 to non-POU domain containing octamer binding (NONO) protein. Mechanistically, pancEts-1 facilitated the physical interaction between NONO and Ets related gene (ERG), resulting in increased ERG transactivation and transcription of Ets-1 associated with gastric cancer progression. In addition, pancEts-1 facilitated the growth and aggressiveness of gastric cancer cells via interacting with NONO. In gastric cancer tissues, pancEts-1, NONO, and ERG were upregulated and significantly correlated with Ets-1 levels. High levels of pancEts-1, NONO, ERG, or Ets-1 were respectively associated with poor survival of gastric cancer patients, whereas simultaneous expression of all of them (HR = 3.012, P = 0.105) was not an independent prognostic factor for predicting clinical outcome. Overall, these results demonstrate that lncRNA pancEts-1 exhibits oncogenic properties that drive the progression of gastric cancer via regulating the NONO/ERG/Ets-1 axis.
Evidence of function for conserved noncoding sequences in Arabidopsis thaliana.

PubMed

Spangler, Jacob B; Subramaniam, Sabarinath; Freeling, Michael; Feltus, F Alex

2012-01-01

• Whole genome duplication events provide a lineage with a large reservoir of genes that can be molded by evolutionary forces into phenotypes that fit alternative environments. A well-studied whole genome duplication, the α-event, occurred in an ancestor of the model plant Arabidopsis thaliana. Retained segments of the α-event have been defined in recent years in the form of duplicate protein coding sequences (α-pairs) and associated conserved noncoding DNA sequences (CNSs). Our aim was to identify any association between CNSs and α-pair co-functionality at the gene expression level. • Here, we tested for correlation between CNS counts and α-pair co-expression and expression intensity across nine expression datasets: aerial tissue, flowers, leaves, roots, rosettes, seedlings, seeds, shoots and whole plants. • We provide evidence for a putative regulatory role of the CNSs. The association of CNSs with α-pair co-expression and expression intensity varied by gene function, subgene position and the presence of transcription factor binding motifs. A range of possible CNS regulatory mechanisms, including intron-mediated enhancement, messenger RNA fold stability and transcriptional regulation, are discussed. • This study provides a framework to understand how CNS motifs are involved in the maintenance of gene expression after a whole genome duplication event. © 2011 The Authors. New Phytologist © 2011 New Phytologist Trust.
The noncoding human genome and the future of personalised medicine.

PubMed

Cowie, Philip; Hay, Elizabeth A; MacKenzie, Alasdair

2015-01-30

Non-coding cis-regulatory sequences act as the 'eyes' of the genome and their role is to perceive, organise and relay cellular communication information to RNA polymerase II at gene promoters. The evolution of these sequences, that include enhancers, silencers, insulators and promoters, has progressed in multicellular organisms to the extent that cis-regulatory sequences make up as much as 10% of the human genome. Parallel evidence suggests that 75% of polymorphisms associated with heritable disease occur within predicted cis-regulatory sequences that effectively alter the 'perception' of cis-regulatory sequences or render them blind to cell communication cues. Cis-regulatory sequences also act as major functional targets of epigenetic modification thus representing an important conduit through which changes in DNA-methylation affects disease susceptibility. The objectives of the current review are (1) to describe what has been learned about identifying and characterising cis-regulatory sequences since the sequencing of the human genome; (2) to discuss their role in interpreting cell signalling pathways pathways; and (3) outline how this role may be altered by polymorphisms and epigenetic changes. We argue that the importance of the cis-regulatory genome for the interpretation of cellular communication pathways cannot be overstated and understanding its role in health and disease will be critical for the future development of personalised medicine.
Conserved noncoding sequences conserve biological networks and influence genome evolution.

PubMed

Xie, Jianbo; Qian, Kecheng; Si, Jingna; Xiao, Liang; Ci, Dong; Zhang, Deqiang

2018-05-01

Comparative genomics approaches have identified numerous conserved cis-regulatory sequences near genes in plant genomes. Despite the identification of these conserved noncoding sequences (CNSs), our knowledge of their functional importance and selection remains limited. Here, we used a combination of DNA methylome analysis, microarray expression analyses, and functional annotation to study these sequences in the model tree Populus trichocarpa. Methylation in CG contexts and non-CG contexts was lower in CNSs, particularly CNSs in the 5'-upstream regions of genes, compared with other sites in the genome. We observed that CNSs are enriched in genes with transcription and binding functions, and this also associated with syntenic genes and those from whole-genome duplications, suggesting that cis-regulatory sequences play a key role in genome evolution. We detected a significant positive correlation between CNS number and protein interactions, suggesting that CNSs may have roles in the evolution and maintenance of biological networks. The divergence of CNSs indicates that duplication-degeneration-complementation drives the subfunctionalization of a proportion of duplicated genes from whole-genome duplication. Furthermore, population genomics confirmed that most CNSs are under strong purifying selection and only a small subset of CNSs shows evidence of adaptive evolution. These findings provide a foundation for future studies exploring these key genomic features in the maintenance of biological networks, local adaptation, and transcription.
Specific interaction of mutant p53 with regions of matrix attachment region DNA elements (MARs) with a high potential for base-unpairing

PubMed Central

Will, Katrin; Warnecke, Gabriele; Wiesmüller, Lisa; Deppert, Wolfgang

1998-01-01

Mutant, but not wild-type p53 binds with high affinity to a variety of MAR-DNA elements (MARs), suggesting that MAR-binding of mutant p53 relates to the dominant-oncogenic activities proposed for mutant p53. MARs recognized by mutant p53 share AT richness and contain variations of an AATATATTT “DNA-unwinding motif,” which enhances the structural dynamics of chromatin and promotes regional DNA base-unpairing. Mutant p53 specifically interacted with MAR-derived oligonucleotides carrying such unwinding motifs, catalyzing DNA strand separation when this motif was located within a structurally labile sequence environment. Addition of GC-clamps to the respective MAR-oligonucleotides or introducing mutations into the unwinding motif strongly reduced DNA strand separation, but supported the formation of tight complexes between mutant p53 and such oligonucleotides. We conclude that the specific interaction of mutant p53 with regions of MAR-DNA with a high potential for base-unpairing provides the basis for the high-affinity binding of mutant p53 to MAR-DNA. PMID:9811860
The Non-Coding RNA Ontology (NCRO): a comprehensive resource for the unification of non-coding RNA biology.

PubMed

Huang, Jingshan; Eilbeck, Karen; Smith, Barry; Blake, Judith A; Dou, Dejing; Huang, Weili; Natale, Darren A; Ruttenberg, Alan; Huan, Jun; Zimmermann, Michael T; Jiang, Guoqian; Lin, Yu; Wu, Bin; Strachan, Harrison J; He, Yongqun; Zhang, Shaojie; Wang, Xiaowei; Liu, Zixing; Borchert, Glen M; Tan, Ming

2016-01-01

In recent years, sequencing technologies have enabled the identification of a wide range of non-coding RNAs (ncRNAs). Unfortunately, annotation and integration of ncRNA data has lagged behind their identification. Given the large quantity of information being obtained in this area, there emerges an urgent need to integrate what is being discovered by a broad range of relevant communities. To this end, the Non-Coding RNA Ontology (NCRO) is being developed to provide a systematically structured and precisely defined controlled vocabulary for the domain of ncRNAs, thereby facilitating the discovery, curation, analysis, exchange, and reasoning of data about structures of ncRNAs, their molecular and cellular functions, and their impacts upon phenotypes. The goal of NCRO is to serve as a common resource for annotations of diverse research in a way that will significantly enhance integrative and comparative analysis of the myriad resources currently housed in disparate sources. It is our belief that the NCRO ontology can perform an important role in the comprehensive unification of ncRNA biology and, indeed, fill a critical gap in both the Open Biological and Biomedical Ontologies (OBO) Library and the National Center for Biomedical Ontology (NCBO) BioPortal. Our initial focus is on the ontological representation of small regulatory ncRNAs, which we see as the first step in providing a resource for the annotation of data about all forms of ncRNAs. The NCRO ontology is free and open to all users, accessible at: http://purl.obolibrary.org/obo/ncro.owl.
Mitochondrial DNA repairs double-strand breaks in yeast chromosomes.

PubMed

Ricchetti, M; Fairhead, C; Dujon, B

1999-11-04

The endosymbiotic theory for the origin of eukaryotic cells proposes that genetic information can be transferred from mitochondria to the nucleus of a cell, and genes that are probably of mitochondrial origin have been found in nuclear chromosomes. Occasionally, short or rearranged sequences homologous to mitochondrial DNA are seen in the chromosomes of different organisms including yeast, plants and humans. Here we report a mechanism by which fragments of mitochondrial DNA, in single or tandem array, are transferred to yeast chromosomes under natural conditions during the repair of double-strand breaks in haploid mitotic cells. These repair insertions originate from noncontiguous regions of the mitochondrial genome. Our analysis of the Saccharomyces cerevisiae mitochondrial genome indicates that the yeast nuclear genome does indeed contain several short sequences of mitochondrial origin which are similar in size and composition to those that repair double-strand breaks. These sequences are located predominantly in non-coding regions of the chromosomes, frequently in the vicinity of retrotransposon long terminal repeats, and appear as recent integration events. Thus, colonization of the yeast genome by mitochondrial DNA is an ongoing process.
Long Noncoding RNAs as a Key Player in Hepatocellular Carcinoma

PubMed Central

Mehra, Mrigaya; Chauhan, Ranjit

2017-01-01

Hepatocellular carcinoma (HCC) is a major malignancy in the liver and has emerged as one of the main cancers in the world with a high mortality rate. However, the molecular mechanisms of HCC are still poorly understood. Long noncoding RNAs (lncRNAs) have recently come to the forefront as functional non–protein-coding RNAs that are involved in a variety of cellular processes ranging from maintaining the structural integrity of chromosomes to gene expression regulation in a spatiotemporal manner. Many recent studies have reported the involvement of lncRNAs in HCC which has led to a better understanding of the underlying molecular mechanisms operating in HCC. Long noncoding RNAs have been shown to regulate development and progression of HCC, and thus, lncRNAs have both diagnostic and therapeutic potentials. In this review, we present an overview of the lncRNAs involved in different stages of HCC and their potential in clinical applications which have been studied so far. PMID:29147078
Thermodynamics of complex structures formed between single-stranded DNA oligomers and the KH domains of the far upstream element binding protein

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chakraborty, Kaushik; Sinha, Sudipta Kumar; Bandyopadhyay, Sanjoy, E-mail: sanjoy@chem.iitkgp.ernet.in

The noncovalent interaction between protein and DNA is responsible for regulating the genetic activities in living organisms. The most critical issue in this problem is to understand the underlying driving force for the formation and stability of the complex. To address this issue, we have performed atomistic molecular dynamics simulations of two DNA binding K homology (KH) domains (KH3 and KH4) of the far upstream element binding protein (FBP) complexed with two single-stranded DNA (ss-DNA) oligomers in aqueous media. Attempts have been made to calculate the individual components of the net entropy change for the complexation process by adopting suitablemore » statistical mechanical approaches. Our calculations reveal that translational, rotational, and configurational entropy changes of the protein and the DNA components have unfavourable contributions for this protein-DNA association process and such entropy lost is compensated by the entropy gained due to the release of hydration layer water molecules. The free energy change corresponding to the association process has also been calculated using the Free Energy Perturbation (FEP) method. The free energy gain associated with the KH4–DNA complex formation has been found to be noticeably higher than that involving the formation of the KH3–DNA complex.« less
Molecular interplay of pro-inflammatory transcription factors and non-coding RNAs in esophageal squamous cell carcinoma.

PubMed

Sundaram, Gopinath M; Veera Bramhachari, Pallaval

2017-06-01

Esophageal squamous cell carcinoma is the sixth most common cancer in the developing world. The aggressive nature of esophageal squamous cell carcinoma, its tendency for relapse, and the poor survival prospects of patients diagnosed at advanced stages, represent a pressing need for the development of new therapies for this disease. Chronic inflammation is known to have a causal link to cancer pre-disposition. Nuclear factor kappa B and signal transducer and activator of transcription 3 are transcription factors which regulate immunity and inflammation and are emerging as key regulators of tumor initiation, progression, and metastasis. Although these pro-inflammatory factors in esophageal squamous cell carcinoma have been well-characterized with reference to protein-coding targets, their functional interactions with non-coding RNAs have only recently been gaining attention. Non-coding RNAs, especially microRNAs and long non-coding RNAs demonstrate potential as biomarkers and alternative therapeutic targets. In this review, we summarize the recent literature and concepts on non-coding RNAs that are regulated by/regulate nuclear factor kappa B and signal transducer and activator of transcription 3 in esophageal cancer progression. We also discuss how these recent discoveries can pave way for future therapeutic options to treat esophageal squamous cell carcinoma.
A regional approach to plant DNA barcoding provides high species resolution of sedges (Carex and Kobresia, Cyperaceae) in the Canadian Arctic Archipelago.

PubMed

Clerc-Blain, Jessica L E; Starr, Julian R; Bull, Roger D; Saarela, Jeffery M

2010-01-01

Previous research on barcoding sedges (Carex) suggested that basic searches within a global barcoding database would probably not resolve more than 60% of the world's some 2000 species. In this study, we take an alternative approach and explore the performance of plant DNA barcoding in the Carex lineage from an explicitly regional perspective. We characterize the utility of a subset of the proposed protein-coding and noncoding plastid barcoding regions (matK, rpoB, rpoC1, rbcL, atpF-atpH, psbK-psbI) for distinguishing species of Carex and Kobresia in the Canadian Arctic Archipelago, a clearly defined eco-geographical region representing 1% of the Earth's landmass. Our results show that matK resolves the greatest number of species of any single-locus (95%), and when combined in a two-locus barcode, it provides 100% species resolution in all but one combination (matK + atpFH) during unweighted pair-group method with arithmetic mean averages (UPGMA) analyses. Noncoding regions were equally or more variable than matK, but as single markers they resolve substantially fewer taxa than matK alone. When difficulties with sequencing and alignment due to microstructural variation in noncoding regions are also considered, our results support other studies in suggesting that protein-coding regions are more practical as barcoding markers. Plastid DNA barcodes are an effective identification tool for species of Carex and Kobresia in the Canadian Arctic Archipelago, a region where the number of co-existing closely related species is limited. We suggest that if a regional approach to plant DNA barcoding was applied on a global scale, it could provide a solution to the generally poor species resolution seen in previous barcoding studies. © 2009 Blackwell Publishing Ltd.
Probability of coding of a DNA sequence: an algorithm to predict translated reading frames from their thermodynamic characteristics.

PubMed Central

Tramontano, A; Macchiato, M F

1986-01-01

An algorithm to determine the probability that a reading frame codifies for a protein is presented. It is based on the results of our previous studies on the thermodynamic characteristics of a translated reading frame. We also develop a prediction procedure to distinguish between coding and non-coding reading frames. The procedure is based on the characteristics of the putative product of the DNA sequence and not on periodicity characteristics of the sequence, so the prediction is not biased by the presence of overlapping translated reading frames or by the presence of translated reading frames on the complementary DNA strand. PMID:3753761
Dynamic interaction of Y RNAs with chromatin and initiation proteins during human DNA replication

PubMed Central

Zhang, Alice Tianbu; Langley, Alexander R.; Christov, Christo P.; Kheir, Eyemen; Shafee, Thomas; Gardiner, Timothy J.; Krude, Torsten

2011-01-01

Non-coding Y RNAs are required for the initiation of chromosomal DNA replication in mammalian cells. It is unknown how they perform this function or if they associate with a nuclear structure during DNA replication. Here, we investigate the association of Y RNAs with chromatin and their interaction with replication proteins during DNA replication in a human cell-free system. Our results show that fluorescently labelled Y RNAs associate with unreplicated euchromatin in late G1 phase cell nuclei before the initiation of DNA replication. Following initiation, Y RNAs are displaced locally from nascent and replicated DNA present in replication foci. In intact human cells, a substantial fraction of endogenous Y RNAs are associated with G1 phase nuclei, but not with G2 phase nuclei. Y RNAs interact and colocalise with the origin recognition complex (ORC), the pre-replication complex (pre-RC) protein Cdt1, and other proteins implicated in the initiation of DNA replication. These data support a molecular ‘catch and release’ mechanism for Y RNA function during the initiation of chromosomal DNA replication, which is consistent with Y RNAs acting as replication licensing factors. PMID:21610089
Rotifer rDNA-specific R9 retrotransposable elements generate an exceptionally long target site duplication upon insertion.

PubMed

Gladyshev, Eugene A; Arkhipova, Irina R

2009-12-15

Ribosomal DNA genes in many eukaryotes contain insertions of non-LTR retrotransposable elements belonging to the R2 clade. These elements persist in the host genomes by inserting site-specifically into multicopy target sites, thereby avoiding random disruption of single-copy host genes. Here we describe R9 retrotransposons from the R2 clade in the 28S RNA genes of bdelloid rotifers, small freshwater invertebrate animals best known for their long-term asexuality and for their ability to survive repeated cycles of desiccation and rehydration. While the structural organization of R9 elements is highly similar to that of other members of the R2 clade, they are characterized by two distinct features: site-specific insertion into a previously unreported target sequence within the 28S gene, and an unusually long target site duplication of 126 bp. We discuss the implications of these findings in the context of bdelloid genome organization and the mechanisms of target-primed reverse transcription.
Phylogeography and conservation genetics of Hygrophila pogonocalyx (Acanthaceae) based on atpB-rbcL noncoding spacer cpDNA.

PubMed

Huang, Jao-Ching; Wang, Wei-Kuang; Peng, Ching-I; Chiang, Tzen-Yuh

2005-02-01

Genetic variation in the atpB-rbcL intergenic spacer region of chloroplast DNA (cpDNA) was investigated in Hygrophila pogonocalyx Hayata (Acanthaceae), an endangered and endemic species in Taiwan. In this aquatic species, seed dispersal from capsules via elasticity is constrained by gravity and is thereby confined within populations, resulting in limited gene flow between populations. In this study, a total of 849 bp of the cpDNA atpB-rbcL spacer were sequenced from eight populations of H. pogonocalyx. Nucleotide diversity in the cpDNA is low (theta = 0.00343+/-0.00041). The distribution of genetic variation among populations agrees with an "isolation-by-distance" model. Two geographically correlated groups, the western and eastern regions, were identified in a neighbor-joining tree and a minimum-spanning network. Phylogeographical analyses based on the cpDNA network suggest that the present-day differentiation between western and eastern groups of H. pogonocalyx resulted from past fragmentation. The differentiation between eastern and western populations may be ascribed to isolation since the formation of the Central Mountain Range about 5 million years ago, which is consistent with the rate estimates based on a molecular clock of cpDNA.

Construction of Infectious cDNA Clone of a Chrysanthemum stunt viroid Korean Isolate

PubMed Central

Yoon, Ju-Yeon; Cho, In-Sook; Choi, Gug-Seoun; Choi, Seung-Kook

2014-01-01

Chrysanthemum stunt viroid (CSVd), a noncoding infectious RNA molecule, causes seriously economic losses of chrysanthemum for 3 or 4 years after its first infection. Monomeric cDNA clones of CSVd isolate SK1 (CSVd-SK1) were constructed in the plasmids pGEM-T easy vector and pUC19 vector. Linear positive-sense transcripts synthesized in vitro from the full-length monomeric cDNA clones of CSVd-SK1 could infect systemically tomato seedlings and chrysanthemum plants, suggesting that the linear CSVd RNA transcribed from the cDNA clones could be replicated as efficiently as circular CSVd in host species. However, direct inoculation of plasmid cDNA clones containing full-length monomeric cDNA of CSVd-SK1 failed to infect tomato and chrysanthemum and linear negative-sense transcripts from the plasmid DNAs were not infectious in the two plant species. The cDNA sequences of progeny viroid in systemically infected tomato and chrysanthemum showed a few substitutions at a specific nucleotide position, but there were no deletions and insertions in the sequences of the CSVd progeny from tomato and chrysanthemum plants. PMID:25288987
An integrative approach to predicting the functional effects of small indels in non-coding regions of the human genome

PubMed Central

Ferlaino, Michael; Rogers, Mark F.; Shihab, Hashem A.; Mort, Matthew; Cooper, David N.; Gaunt, Tom R.; Campbell, Colin

2018-01-01

Background Small insertions and deletions (indels) have a significant influence in human disease and, in terms of frequency, they are second only to single nucleotide variants as pathogenic mutations. As the majority of mutations associated with complex traits are located outside the exome, it is crucial to investigate the potential pathogenic impact of indels in non-coding regions of the human genome. Results We present FATHMM-indel, an integrative approach to predict the functional effect, pathogenic or neutral, of indels in non-coding regions of the human genome. Our method exploits various genomic annotations in addition to sequence data. When validated on benchmark data, FATHMM-indel significantly outperforms CADD and GAVIN, state of the art models in assessing the pathogenic impact of non-coding variants. FATHMM-indel is available via a web server at indels.biocompute.org.uk. Conclusions FATHMM-indel can accurately predict the functional impact and prioritise small indels throughout the whole non-coding genome. PMID:28985712
An integrative approach to predicting the functional effects of small indels in non-coding regions of the human genome.

PubMed

Ferlaino, Michael; Rogers, Mark F; Shihab, Hashem A; Mort, Matthew; Cooper, David N; Gaunt, Tom R; Campbell, Colin

2017-10-06

Small insertions and deletions (indels) have a significant influence in human disease and, in terms of frequency, they are second only to single nucleotide variants as pathogenic mutations. As the majority of mutations associated with complex traits are located outside the exome, it is crucial to investigate the potential pathogenic impact of indels in non-coding regions of the human genome. We present FATHMM-indel, an integrative approach to predict the functional effect, pathogenic or neutral, of indels in non-coding regions of the human genome. Our method exploits various genomic annotations in addition to sequence data. When validated on benchmark data, FATHMM-indel significantly outperforms CADD and GAVIN, state of the art models in assessing the pathogenic impact of non-coding variants. FATHMM-indel is available via a web server at indels.biocompute.org.uk. FATHMM-indel can accurately predict the functional impact and prioritise small indels throughout the whole non-coding genome.
Variation of DNA Methylome of Zebrafish Cells under Cold Pressure

PubMed Central

Xu, Qiongqiong; Luo, Juntao; Shi, Yingdi; Li, Xiaoxia; Yan, Xiaonan; Zhang, Junfang

2016-01-01

DNA methylation is an essential epigenetic mechanism involved in multiple biological processes. However, the relationship between DNA methylation and cold acclimation remains poorly understood. In this study, Methylated DNA Immunoprecipitation Sequencing (MeDIP-seq) was performed to reveal a genome-wide methylation profile of zebrafish (Danio rerio) embryonic fibroblast cells (ZF4) and its variation under cold pressure. MeDIP-seq assay was conducted with ZF4 cells cultured at appropriate temperature of 28°C and at low temperature of 18°C for 5 (short-term) and 30 (long-term) days, respectively. Our data showed that DNA methylation level of whole genome increased after a short-term cold exposure and decreased after a long-term cold exposure. It is interesting that metabolism of folate pathway is significantly hypomethylated after short-term cold exposure, which is consistent with the increased DNA methylation level. 21% of methylation peaks were significantly altered after cold treatment. About 8% of altered DNA methylation peaks are located in promoter regions, while the majority of them are located in non-coding regions. Methylation of genes involved in multiple cold responsive biological processes were significantly affected, such as anti-oxidant system, apoptosis, development, chromatin modifying and immune system suggesting that those processes are responsive to cold stress through regulation of DNA methylation. Our data indicate the involvement of DNA methylation in cellular response to cold pressure, and put a new insight into the genome-wide epigenetic regulation under cold pressure. PMID:27494266
HDA6 directly interacts with DNA methyltransferase MET1 and maintains transposable element silencing in Arabidopsis.

PubMed

Liu, Xuncheng; Yu, Chun-Wei; Duan, Jun; Luo, Ming; Wang, Koching; Tian, Gang; Cui, Yuhai; Wu, Keqiang

2012-01-01

The molecular mechanism of how the histone deacetylase HDA6 participates in maintaining transposable element (TE) silencing in Arabidopsis (Arabidopsis thaliana) is not yet defined. In this study, we show that a subset of TEs was transcriptionally reactivated and that TE reactivation was associated with elevated histone H3 and H4 acetylation as well as increased H3K4Me3 and H3K4Me2 in hda6 mutants. Decreased DNA methylation of the TEs was also detected in hda6 mutants, suggesting that HDA6 silences the TEs by regulating histone acetylation and methylation as well as the DNA methylation status of the TEs. Similarly, transcripts of some of these TEs were also increased in the methyltransferase1 (met1) mutant, with decreased DNA methylation. Furthermore, H4 acetylation, H3K4Me3, H3K4Me2, and H3K36Me2 were enriched at the coregulated TEs in the met1 and hda6 met1 mutants. Protein-protein interaction analysis indicated that HDA6 physically interacts with MET1 in vitro and in vivo, and further deletion analysis demonstrated that the carboxyl-terminal region of HDA6 and the bromo-adjacent homology domain of MET1 were responsible for the interaction. These results suggested that HDA6 and MET1 interact directly and act together to silence TEs by modulating DNA methylation, histone acetylation, and histone methylation status.
Maternally expressed gene 3, an imprinted noncoding RNA gene, is associated with meningioma pathogenesis and progression.

PubMed

Zhang, Xun; Gejman, Roger; Mahta, Ali; Zhong, Ying; Rice, Kimberley A; Zhou, Yunli; Cheunsuchon, Pornsuk; Louis, David N; Klibanski, Anne

2010-03-15

Meningiomas are common tumors, representing 15% to 25% of all central nervous system tumors. NF2 gene inactivation on chromosome 22 has been shown as an early event in tumorigenesis; however, few factors underlying tumor growth and progression have been identified. The chromosomal abnormalities of 14q32 are often associated with meningioma pathogenesis and progression; therefore, it has been proposed that an as yet unidentified tumor suppressor is present at this locus. Maternally expressed gene 3 (MEG3) is an imprinted gene located at 14q32 which encodes a noncoding RNA with an antiproliferative function. We found that MEG3 mRNA is highly expressed in normal arachnoidal cells. However, MEG3 is not expressed in the majority of human meningiomas or the human meningioma cell lines IOMM-Lee and CH157-MN. There is a strong association between loss of MEG3 expression and tumor grade. Allelic loss at the MEG3 locus is also observed in meningiomas, with increasing prevalence in higher grade tumors. In addition, there is an increase in CpG methylation within the promoter and the imprinting control region of MEG3 gene in meningiomas. Functionally, MEG3 suppresses DNA synthesis in both IOMM-Lee and CH157-MN cells by approximately 60% in bromodeoxyuridine incorporation assays. Colony-forming efficiency assays show that MEG3 inhibits colony formation in CH157-MN cells by approximately 80%. Furthermore, MEG3 stimulates p53-mediated transactivation in these cell lines. Therefore, these data are consistent with the hypothesis that MEG3, which encodes a noncoding RNA, may be a tumor suppressor gene at chromosome 14q32 involved in meningioma progression via a novel mechanism.
MicroRNAs and other non-coding RNAs as targets for anticancer drug development

PubMed Central

Ling, Hui; Fabbri, Muller; Calin, George A.

2015-01-01

With the first cancer-targeted microRNA drug, MRX34, a liposome-based miR-34 mimic, entering phase I clinical trial in patients with advanced hepatocellular carcinoma in April 2013, miRNA therapeutics are attracting special attention from both academia and biotechnology companies. Although to date the most studied non-coding RNAs (ncRNAs) are miRNAs, the importance of long non-coding RNAs (lncRNAs) is increasingly being recognized. Here we summarize the roles of miRNAs and lncRNAs in cancer, with a focus on the recently identified novel mechanisms of action, and discuss the current strategies in designing ncRNA-targeting therapeutics, as well as the associated challenges. PMID:24172333
MicroRNAs and non-coding RNAs in virus-infected cells

PubMed Central

Ouellet, Dominique L.; Provost, Patrick

2010-01-01

Within the past few years, microRNAs (miRNAs) and other non-coding RNAs (ncRNAs) have emerged as elements with critically high importance in post-transcriptional control of cellular and, more recently, viral processes. Endogenously produced by a component of the miRNA-guided RNA silencing machinery known as Dicer, miRNAs are known to control messenger RNA (mRNA) translation through recognition of specific binding sites usually located in their 3′ untranslated region. Recent evidences indicate that the host miRNA pathway may represent an adapted antiviral defense mechanism that can act either by direct miRNA-mediated modulation of viral gene expression or through recognition and inactivation of structured viral RNA species by the protein components of the RNA silencing machinery, such as Dicer. This latter process, however, is a double-edge sword, as it may yield viral miRNAs exerting gene regulatory properties on both host and viral mRNAs. Our knowledge of the interaction between viruses and host RNA silencing machineries, and how this influences the course of infection, is becoming increasingly complex. This review article aims to summarize our current knowledge about viral miRNAs/ncRNAs and their targets, as well as cellular miRNAs that are modulated by viruses upon infection. PMID:20217543
Long Non-coding RNAs in the X-inactivation Center

PubMed Central

Kalantry, Sundeep

2014-01-01

The X-inactivation center is a hotbed of functional long non-coding RNAs in eutherian mammals. These RNAs are thought to help orchestrate the epigenetic transcriptional states of the two X-chromosomes in females as well as of the single X-chromosome in males. To balance X-linked gene expression between the sexes, females undergo transcriptional silencing of most genes on one of the two X-chromosomes in a process termed X-chromosome inactivation. While one X-chromosome is inactivated, the other X-chromosome remains active. Moreover, with a few notable exceptions, the originally established epigenetic transcriptional profiles of the two is maintained as such through many rounds of cell division, essentially for the life of the organism. The stable divergent transcriptional fates of the two X-chromosomes, despite residing in a shared nucleoplasm, make X-inactivation a paradigm of epigenetic transcriptional regulation. Originally proposed in 1961 by Mary Lyon, the X-inactivation hypothesis has been validated through much experimentation over the last fifty years. In the last 25 years, the discovery and functional characterization has firmly established X-linked long non-coding RNAs as key players in choreographing X-chromosome inactivation. PMID:24297756
In vitro selection of single-stranded DNA molecular recognition elements against S. aureus alpha toxin and sensitive detection in human serum.

PubMed

Hong, Ka L; Battistella, Luisa; Salva, Alysia D; Williams, Ryan M; Sooter, Letha J

2015-01-27

Alpha toxin is one of the major virulence factors secreted by Staphylococcus aureus, a bacterium that is responsible for a wide variety of infections in both community and hospital settings. Due to the prevalence of S. aureus related infections and the emergence of methicillin-resistant S. aureus, rapid and accurate diagnosis of S. aureus infections is crucial in benefiting patient health outcomes. In this study, a rigorous Systematic Evolution of Ligands by Exponential Enrichment (SELEX) variant previously developed by our laboratory was utilized to select a single-stranded DNA molecular recognition element (MRE) targeting alpha toxin with high affinity and specificity. At the end of the 12-round selection, the selected MRE had an equilibrium dissociation constant (Kd) of 93.7 ± 7.0 nM. Additionally, a modified sandwich enzyme-linked immunosorbent assay (ELISA) was developed by using the selected ssDNA MRE as the toxin-capturing element and a sensitive detection of 200 nM alpha toxin in undiluted human serum samples was achieved.
NCAD, a database integrating the intrinsic conformational preferences of non-coded amino acids

PubMed Central

Revilla-López, Guillem; Torras, Juan; Curcó, David; Casanovas, Jordi; Calaza, M. Isabel; Zanuy, David; Jiménez, Ana I.; Cativiela, Carlos; Nussinov, Ruth; Grodzinski, Piotr; Alemán, Carlos

2010-01-01

Peptides and proteins find an ever-increasing number of applications in the biomedical and materials engineering fields. The use of non-proteinogenic amino acids endowed with diverse physicochemical and structural features opens the possibility to design proteins and peptides with novel properties and functions. Moreover, non-proteinogenic residues are particularly useful to control the three-dimensional arrangement of peptidic chains, which is a crucial issue for most applications. However, information regarding such amino acids –also called non-coded, non-canonical or non-standard– is usually scattered among publications specialized in quite diverse fields as well as in patents. Making all these data useful to the scientific community requires new tools and a framework for their assembly and coherent organization. We have successfully compiled, organized and built a database (NCAD, Non-Coded Amino acids Database) containing information about the intrinsic conformational preferences of non-proteinogenic residues determined by quantum mechanical calculations, as well as bibliographic information about their synthesis, physical and spectroscopic characterization, conformational propensities established experimentally, and applications. The architecture of the database is presented in this work together with the first family of non-coded residues included, namely, α-tetrasubstituted α-amino acids. Furthermore, the NCAD usefulness is demonstrated through a test-case application example. PMID:20455555
Adipocyte Long-Noncoding RNA Transcriptome Analysis of Obese Mice Identified Lnc-Leptin, Which Regulates Leptin.

PubMed

Lo, Kinyui Alice; Huang, Shiqi; Walet, Arcinas Camille Esther; Zhang, Zhi-Chun; Leow, Melvin Khee-Shing; Liu, Meihui; Sun, Lei

2018-06-01

Obesity induces profound transcriptome changes in adipocytes, and recent evidence suggests that long-noncoding RNAs (lncRNAs) play key roles in this process. We performed a comprehensive transcriptome study by RNA sequencing in adipocytes isolated from interscapular brown, inguinal, and epididymal white adipose tissue in diet-induced obese mice. The analysis revealed a set of obesity-dysregulated lncRNAs, many of which exhibit dynamic changes in the fed versus fasted state, potentially serving as novel molecular markers of adipose energy status. Among the most prominent lncRNAs is Lnc-leptin , which is transcribed from an enhancer region upstream of leptin ( Lep ). Expression of Lnc-leptin is sensitive to insulin and closely correlates to Lep expression across diverse pathophysiological conditions. Functionally, induction of Lnc-leptin is essential for adipogenesis, and its presence is required for the maintenance of Lep expression in vitro and in vivo. Direct interaction was detected between DNA loci of Lnc-leptin and Lep in mature adipocytes, which diminished upon Lnc-leptin knockdown. Our study establishes Lnc-leptin as a new regulator of Lep . © 2018 by the American Diabetes Association.
Evidence that noncoding RNA dutA is a multicopy suppressor of Dictyostelium discoideum STAT protein Dd-STATa.

PubMed

Shimada, Nao; Kawata, Takefumi

2007-06-01

Dd-STATa, a Dictyostelium discoideum homologue of metazoan STAT transcription factors, is necessary for culmination. We created a mutant strain with partial Dd-STATa activity and used it to screen for unlinked suppressor genes. We screened approximately 450,000 clones from a slug-stage cDNA library for their ability to rescue the culmination defect when overexpressed. There were 12 multicopy suppressors of Dd-STATa, of which 4 encoded segments of a known noncoding RNA, dutA. Expression of dutA is specific to the pstA zone, the region where Dd-STATa is activated. In suppressed strains the expression patterns of several putative Dd-STATa target genes become similar to the wild-type strain. In addition, the amount of the tyrosine-phosphorylated form of Dd-STATa is significantly increased in the suppressed strain. These results indicate that partial copies of dutA may act upstream of Dd-STATa to regulate tyrosine phosphorylation by an unknown mechanism.
Evidence that Noncoding RNA dutA Is a Multicopy Suppressor of Dictyostelium discoideum STAT Protein Dd-STATa▿

PubMed Central

Shimada, Nao; Kawata, Takefumi

2007-01-01

Dd-STATa, a Dictyostelium discoideum homologue of metazoan STAT transcription factors, is necessary for culmination. We created a mutant strain with partial Dd-STATa activity and used it to screen for unlinked suppressor genes. We screened approximately 450,000 clones from a slug-stage cDNA library for their ability to rescue the culmination defect when overexpressed. There were 12 multicopy suppressors of Dd-STATa, of which 4 encoded segments of a known noncoding RNA, dutA. Expression of dutA is specific to the pstA zone, the region where Dd-STATa is activated. In suppressed strains the expression patterns of several putative Dd-STATa target genes become similar to the wild-type strain. In addition, the amount of the tyrosine-phosphorylated form of Dd-STATa is significantly increased in the suppressed strain. These results indicate that partial copies of dutA may act upstream of Dd-STATa to regulate tyrosine phosphorylation by an unknown mechanism. PMID:17435008
Transcription factor trapping by RNA in gene regulatory elements.

PubMed

Sigova, Alla A; Abraham, Brian J; Ji, Xiong; Molinie, Benoit; Hannett, Nancy M; Guo, Yang Eric; Jangi, Mohini; Giallourakis, Cosmas C; Sharp, Phillip A; Young, Richard A

2015-11-20

Transcription factors (TFs) bind specific sequences in promoter-proximal and -distal DNA elements to regulate gene transcription. RNA is transcribed from both of these DNA elements, and some DNA binding TFs bind RNA. Hence, RNA transcribed from regulatory elements may contribute to stable TF occupancy at these sites. We show that the ubiquitously expressed TF Yin-Yang 1 (YY1) binds to both gene regulatory elements and their associated RNA species across the entire genome. Reduced transcription of regulatory elements diminishes YY1 occupancy, whereas artificial tethering of RNA enhances YY1 occupancy at these elements. We propose that RNA makes a modest but important contribution to the maintenance of certain TFs at gene regulatory elements and suggest that transcription of regulatory elements produces a positive-feedback loop that contributes to the stability of gene expression programs. Copyright © 2015, American Association for the Advancement of Science.
The regulatory network analysis of long noncoding RNAs in human colorectal cancer.

PubMed

Zhang, Yuwei; Tao, Yang; Li, Yang; Zhao, Jinshun; Zhang, Lina; Zhang, Xiaohong; Dong, Changzheng; Xie, Yangyang; Dai, Xiaoyu; Zhang, Xinjun; Liao, Qi

2018-05-01

Colorectal cancer (CRC) is among one of the most prevalent and lethiferous diseases worldwide. Long noncoding RNAs (lncRNAs) are commonly accepted to function as a key regulatory factor in human cancer, but the potential regulatory mechanisms of CRC-associated lncRNA are largely obscure. Here, we integrated several expression profiles to obtain 55 differentially expressed (DE) lncRNAs. We first detected lncRNA interactions with transcription factors, microRNAs, mRNAs, and RNA-binding proteins to construct a regulatory network and then create functional enrichment analyses for them using bioinformatics approaches. We found the upregulated genes in the regulatory network are enriched in cell cycle and DNA damage response, while the downregulated genes are enriched in cell differentiation, cellular response, and cell signaling. We then employed module-based methods to mine several intriguing modules from the overall network, which helps to classify the functions of genes more specifically. Next, we confirmed the validity of our network by comparisons with a randomized network using computational method. Finally, we attempted to annotate lncRNA functions based on the regulatory network, which indicated its potential application. Our study of the lncRNA regulatory network provided significant clues to unveil lncRNAs potential regulatory mechanisms in CRC and laid a foundation for further experimental investigation.
Regulatory variation: an emerging vantage point for cancer biology.

PubMed

Li, Luolan; Lorzadeh, Alireza; Hirst, Martin

2014-01-01

Transcriptional regulation involves complex and interdependent interactions of noncoding and coding regions of the genome with proteins that interact and modify them. Genetic variation/mutation in coding and noncoding regions of the genome can drive aberrant transcription and disease. In spite of accounting for nearly 98% of the genome comparatively little is known about the contribution of noncoding DNA elements to disease. Genome-wide association studies of complex human diseases including cancer have revealed enrichment for variants in the noncoding genome. A striking finding of recent cancer genome re-sequencing efforts has been the previously underappreciated frequency of mutations in epigenetic modifiers across a wide range of cancer types. Taken together these results point to the importance of dysregulation in transcriptional regulatory control in genesis of cancer. Powered by recent technological advancements in functional genomic profiling, exploration of normal and transformed regulatory networks will provide novel insight into the initiation and progression of cancer and open new windows to future prognostic and diagnostic tools. © 2013 Wiley Periodicals, Inc.
Deep Investigation of Arabidopsis thaliana Junk DNA Reveals a Continuum between Repetitive Elements and Genomic Dark Matter

PubMed Central

Maumus, Florian; Quesneville, Hadi

2014-01-01

Eukaryotic genomes contain highly variable amounts of DNA with no apparent function. This so-called junk DNA is composed of two components: repeated and repeat-derived sequences (together referred to as the repeatome), and non-annotated sequences also known as genomic dark matter. Because of their high duplication rates as compared to other genomic features, transposable elements are predominant contributors to the repeatome and the products of their decay is thought to be a major source of genomic dark matter. Determining the origin and composition of junk DNA is thus important to help understanding genome evolution as well as host biology. In this study, we have used a combination of tools enabling to show that the repeatome from the small and reducing A. thaliana genome is significantly larger than previously thought. Furthermore, we present the concepts and results from a series of innovative approaches suggesting that a significant amount of the A. thaliana dark matter is of repetitive origin. As a tentative standard for the community, we propose a deep compendium annotation of the A. thaliana repeatome that may help addressing farther genome evolution as well as transcriptional and epigenetic regulation in this model plant. PMID:24709859
Understanding the Role of Non-Coding RNAs in Bladder Cancer: From Dark Matter to Valuable Therapeutic Targets

PubMed Central

Pop-Bica, Cecilia; Gulei, Diana; Cojocneanu-Petric, Roxana; Braicu, Cornelia; Petrut, Bogdan; Berindan-Neagoe, Ioana

2017-01-01

The mortality and morbidity that characterize bladder cancer compel this malignancy into the category of hot topics in terms of biomolecular research. Therefore, a better knowledge of the specific molecular mechanisms that underlie the development and progression of bladder cancer is demanded. Tumor heterogeneity among patients with similar diagnosis, as well as intratumor heterogeneity, generates difficulties in terms of targeted therapy. Furthermore, late diagnosis represents an ongoing issue, significantly reducing the response to therapy and, inevitably, the overall survival. The role of non-coding RNAs in bladder cancer emerged in the last decade, revealing that microRNAs (miRNAs) may act as tumor suppressor genes, respectively oncogenes, but also as biomarkers for early diagnosis. Regarding other types of non-coding RNAs, especially long non-coding RNAs (lncRNAs) which are extensively reviewed in this article, their exact roles in tumorigenesis are—for the time being—not as evident as in the case of miRNAs, but, still, clearly suggested. Therefore, this review covers the non-coding RNA expression profile of bladder cancer patients and their validated target genes in bladder cancer cell lines, with repercussions on processes such as proliferation, invasiveness, apoptosis, cell cycle arrest, and other molecular pathways which are specific for the malignant transformation of cells. PMID:28703782
Understanding the Role of Non-Coding RNAs in Bladder Cancer: From Dark Matter to Valuable Therapeutic Targets.

PubMed

Pop-Bica, Cecilia; Gulei, Diana; Cojocneanu-Petric, Roxana; Braicu, Cornelia; Petrut, Bogdan; Berindan-Neagoe, Ioana

2017-07-13

The mortality and morbidity that characterize bladder cancer compel this malignancy into the category of hot topics in terms of biomolecular research. Therefore, a better knowledge of the specific molecular mechanisms that underlie the development and progression of bladder cancer is demanded. Tumor heterogeneity among patients with similar diagnosis, as well as intratumor heterogeneity, generates difficulties in terms of targeted therapy. Furthermore, late diagnosis represents an ongoing issue, significantly reducing the response to therapy and, inevitably, the overall survival. The role of non-coding RNAs in bladder cancer emerged in the last decade, revealing that microRNAs (miRNAs) may act as tumor suppressor genes, respectively oncogenes, but also as biomarkers for early diagnosis. Regarding other types of non-coding RNAs, especially long non-coding RNAs (lncRNAs) which are extensively reviewed in this article, their exact roles in tumorigenesis are-for the time being-not as evident as in the case of miRNAs, but, still, clearly suggested. Therefore, this review covers the non-coding RNA expression profile of bladder cancer patients and their validated target genes in bladder cancer cell lines, with repercussions on processes such as proliferation, invasiveness, apoptosis, cell cycle arrest, and other molecular pathways which are specific for the malignant transformation of cells.

Predicted stem-loop structures and variation in nucleotide sequence of 3' noncoding regions among animal calicivirus genomes.

PubMed

Seal, B S; Neill, J D; Ridpath, J F

1994-07-01

Caliciviruses are nonenveloped with a polyadenylated genome of approximately 7.6 kb and a single capsid protein. The "RNA Fold" computer program was used to analyze 3'-terminal noncoding sequences of five feline calicivirus (FCV), rabbit hemorrhagic disease virus (RHDV), and two San Miguel sea lion virus (SMSV) isolates. The FCV 3'-terminal sequences are 40-46 nucleotides in length and 72-91% similar. The FCV sequences were predicted to contain two possible duplex structures and one stem-loop structure with free energies of -2.1 to -18.2 kcal/mole. The RHDV genomic 3'-terminal RNA sequences are 54 nucleotides in length and share 49% sequence similarity to homologous regions of the FCV genome. The RHDV sequence was predicted to form two duplex structures in the 3'-terminal noncoding region with a single stem-loop structure, resembling that of FCV. In contrast, the SMSV 1 and 4 genomic 3'-terminal noncoding sequences were 185 and 182 nucleotides in length, respectively. Ten possible duplex structures were predicted with an average structural free energy of -35 kcal/mole. Sequence similarity between the two SMSV isolates was 75%. Furthermore, extensive cloverleaflike structures are predicted in the 3' noncoding region of the SMSV genome, in contrast to the predicted single stem-loop structures of FCV or RHDV.
Sequence variations in RepMP2/3 and RepMP4 elements reveal intragenomic homologous DNA recombination events in Mycoplasma pneumoniae.

PubMed

Spuesens, Emiel B M; Oduber, Minoushka; Hoogenboezem, Theo; Sluijter, Marcel; Hartwig, Nico G; van Rossum, Annemarie M C; Vink, Cornelis

2009-07-01

The gene encoding major adhesin protein P1 of Mycoplasma pneumoniae, MPN141, contains two DNA sequence stretches, designated RepMP2/3 and RepMP4, which display variation among strains. This variation allows strains to be differentiated into two major P1 genotypes (1 and 2) and several variants. Interestingly, multiple versions of the RepMP2/3 and RepMP4 elements exist at other sites within the bacterial genome. Because these versions are closely related in sequence, but not identical, it has been hypothesized that they have the capacity to recombine with their counterparts within MPN141, and thereby serve as a source of sequence variation of the P1 protein. In order to determine the variation within the RepMP2/3 and RepMP4 elements, both within the bacterial genome and among strains, we analysed the DNA sequences of all RepMP2/3 and RepMP4 elements within the genomes of 23 M. pneumoniae strains. Our data demonstrate that: (i) recombination is likely to have occurred between two RepMP2/3 elements in four of the strains, and (ii) all previously described P1 genotypes can be explained by inter-RepMP recombination events. Moreover, the difference between the two major P1 genotypes was reflected in all RepMP elements, such that subtype 1 and 2 strains can be differentiated on the basis of sequence variation in each RepMP element. This implies that subtype 1 and subtype 2 strains represent evolutionarily diverged strain lineages. Finally, a classification scheme is proposed in which the P1 genotype of M. pneumoniae isolates can be described in a sequence-based, universal fashion.
Cross-species amplification of mitochondrial DNA sequence-tagged-site markers in conifers: the nature of polymorphism and variation within and among species in Picea.

PubMed

Jaramillo-Correa, J P; Bousquet, J; Beaulieu, J; Isabel, N; Perron, M; Bouillé, M

2003-05-01

Primers previously developed to amplify specific non-coding regions of the mitochondrial genome in Angiosperms, and new primers for additional non-coding mtDNA regions, were tested for their ability to direct DNA amplification in 12 conifer taxa and to detect sequence-tagged-site (STS) polymorphisms within and among eight species in Picea. Out of 12 primer pairs, nine were successful at amplifying mtDNA in most of the taxa surveyed. In conifers, indels and substitutions were observed for several loci, allowing them to distinguish between families, genera and, in some cases, between species within genera. In Picea, interspecific polymorphism was detected for four loci, while intraspecific variation was observed for three of the mtDNA regions studied. One of these (SSU rRNA V1 region) exhibited indel polymorphisms, and the two others ( nad1 intron b/c and nad5 intron1) revealed restriction differences after digestion with Sau3AI (PCR-RFLP). A fourth locus, the nad4L- orf25 intergenic region, showed a multibanding pattern for most of the spruce species, suggesting a possible gene duplication. Maternal inheritance, expected for mtDNA in conifers, was observed for all polymorphic markers except the intergenic region nad4L- orf25. Pooling of the variation observed with the remaining three markers resulted in two to six different mtDNA haplotypes within the different species of Picea. Evidence for intra-genomic recombination was observed in at least two taxa. Thus, these mitotypes are likely to be more informative than single-locus haplotypes. They should be particularly useful for the study of biogeography and the dynamics of hybrid zones.
Insertion of a self-splicing intron into the mtDNA of atriploblastic animal

DOE Office of Scientific and Technical Information (OSTI.GOV)

Valles, Y.; Halanych, K.; Boore, J.L.

2006-04-14

Nephtys longosetosa is a carnivorous polychaete worm that lives in the intertidal and subtidal zones with worldwide distribution (pleijel&rouse2001). Its mitochondrial genome has the characteristics typical of most metazoans: 37 genes; circular molecule; almost no intergenic sequence; and no significant gene rearrangements when compared to other annelid mtDNAs (booremoritz19981995). Ubiquitous features as small intergenic regions and lack of introns suggested that metazoan mtDNAs are under strong selective pressures to reduce their genome size allowing for faster replication requirements (booremoritz19981995Lynch2005). Yet, in 1996 two type I introns were found in the mtDNA of the basal metazoan Metridium senile (FigureX). Breaking amore » long-standing rule (absence of introns in metazoan mtDNA), this finding was later supported by the further presence of group I introns in other cnidarians. Interestingly, only the class Anthozoa within cnidarians seems to harbor such introns. Although several hundreds of triploblastic metazoan mtDNAs have been sequenced, this study is the first evidence of mitochondrial introns in triploblastic metazoans. The cox1 gene of N. longosetosa has an intron of almost 2 kbs in length. This finding represents as well the first instance of a group II intron (anthozoans harbor group I introns) in all metazoan lineages. Opposite trends are observed within plants, fungi and protist mtDNAs, where introns (both group I and II) and other non-coding sequences are widespread. Plant, fungal and protist mtDNA structure and organization differ enormously from that of metazoan mtDNA. Both, plant and fungal mtDNA are dynamic molecules that undergo high rates of recombination, contain long intergenic spacer regions and harbor both group I and group II introns. However, as metazoans they have a conserved gene content. Protists, on the other hand have a striking variation of gene content and introns that account for the genome size variation. In
CRISPR/Cas9-mediated noncoding RNA editing in human cancers.

PubMed

Yang, Jie; Meng, Xiaodan; Pan, Jinchang; Jiang, Nan; Zhou, Chengwei; Wu, Zhenhua; Gong, Zhaohui

2018-01-02

Cancer is characterized by multiple genetic and epigenetic alterations, including a higher prevalence of mutations of oncogenes and/or tumor suppressors. Mounting evidences have shown that noncoding RNAs (ncRNAs) are involved in the epigenetic regulation of cancer genes and their associated pathways. The clustered regularly interspaced short palindromic repeats (CRISPR)-associated nuclease 9 (CRISPR/Cas9) system, a revolutionary genome-editing technology, has shed light on ncRNA-based cancer therapy. Here, we briefly introduce the classifications and mechanisms of CRISPR/Cas9 system. Importantly, we mainly focused on the applications of CRISPR/Cas9 system as a molecular tool for ncRNA (microRNA, long noncoding RNA and circular RNA, etc.) editing in human cancers, and the novel techniques that are based on CRISPR/Cas9 system. Additionally, the off-target effects and the corresponding solutions as well as the challenges toward CRISPR/Cas9 were also evaluated and discussed. Long- and short-ncRNAs have been employed as targets in precision oncology, and CRISPR/Cas9-mediated ncRNA editing may provide an excellent way to cure cancer.
DNA transposon-based gene vehicles - scenes from an evolutionary drive

PubMed Central

2013-01-01

DNA transposons are primitive genetic elements which have colonized living organisms from plants to bacteria and mammals. Through evolution such parasitic elements have shaped their host genomes by replicating and relocating between chromosomal loci in processes catalyzed by the transposase proteins encoded by the elements themselves. DNA transposable elements are constantly adapting to life in the genome, and self-suppressive regulation as well as defensive host mechanisms may assist in buffering ‘cut-and-paste’ DNA mobilization until accumulating mutations will eventually restrict events of transposition. With the reconstructed Sleeping Beauty DNA transposon as a powerful engine, a growing list of transposable elements with activity in human cells have moved into biomedical experimentation and preclinical therapy as versatile vehicles for delivery and genomic insertion of transgenes. In this review, we aim to link the mechanisms that drive transposon evolution with the realities and potential challenges we are facing when adapting DNA transposons for gene transfer. We argue that DNA transposon-derived vectors may carry inherent, and potentially limiting, traits of their mother elements. By understanding in detail the evolutionary journey of transposons, from host colonization to element multiplication and inactivation, we may better exploit the potential of distinct transposable elements. Hence, parallel efforts to investigate and develop distinct, but potent, transposon-based vector systems will benefit the broad applications of gene transfer. Insight and clever optimization have shaped new DNA transposon vectors, which recently debuted in the first DNA transposon-based clinical trial. Learning from an evolutionary drive may help us create gene vehicles that are safer, more efficient, and less prone for suppression and inactivation. PMID:24320156
TATA Binding Protein Discriminates between Different Lesions on DNA, Resulting in a Transcription Decrease

PubMed Central

Coin, Frédéric; Frit, Philippe; Viollet, Benoit; Salles, Bernard; Egly, Jean-Marc

1998-01-01

DNA damage recognition by basal transcription factors follows different mechanisms. Using transcription-competition, nitrocellulose filter binding, and DNase I footprinting assays, we show that, although the general transcription factor TFIIH is able to target any kind of lesion which can be repaired by the nucleotide excision repair pathway, TATA binding protein (TBP)-TFIID is more selective in damage recognition. Only genotoxic agents which are able to induce kinked DNA structures similar to the one for the TATA box in its TBP complex are recognized. Indeed, DNase I footprinting patterns reveal that TBP protects equally 4 nucleotides upstream and 6 nucleotides downstream from the A-T (at position −29 of the noncoding strand) of the adenovirus major late promoter and from the G-G of a cisplatin-induced 1,2-d(GpG) cross-link. Together, our results may partially explain differences in transcription inhibition rates following DNA damage. PMID:9632775
Living Organisms Author Their Read-Write Genomes in Evolution.

PubMed

Shapiro, James A

2017-12-06

Evolutionary variations generating phenotypic adaptations and novel taxa resulted from complex cellular activities altering genome content and expression: (i) Symbiogenetic cell mergers producing the mitochondrion-bearing ancestor of eukaryotes and chloroplast-bearing ancestors of photosynthetic eukaryotes; (ii) interspecific hybridizations and genome doublings generating new species and adaptive radiations of higher plants and animals; and, (iii) interspecific horizontal DNA transfer encoding virtually all of the cellular functions between organisms and their viruses in all domains of life. Consequently, assuming that evolutionary processes occur in isolated genomes of individual species has become an unrealistic abstraction. Adaptive variations also involved natural genetic engineering of mobile DNA elements to rewire regulatory networks. In the most highly evolved organisms, biological complexity scales with "non-coding" DNA content more closely than with protein-coding capacity. Coincidentally, we have learned how so-called "non-coding" RNAs that are rich in repetitive mobile DNA sequences are key regulators of complex phenotypes. Both biotic and abiotic ecological challenges serve as triggers for episodes of elevated genome change. The intersections of cell activities, biosphere interactions, horizontal DNA transfers, and non-random Read-Write genome modifications by natural genetic engineering provide a rich molecular and biological foundation for understanding how ecological disruptions can stimulate productive, often abrupt, evolutionary transformations.
Deletion of the Noncoding GNAS Antisense Transcript Causes Pseudohypoparathyroidism Type Ib and Biparental Defects of GNAS Methylation in cis

PubMed Central

Chillambhi, Smitha; Turan, Serap; Hwang, Daw-Yang; Chen, Hung-Chun; Jüppner, Harald; Bastepe, Murat

2010-01-01

Context: GNAS encodes the α-subunit of the stimulatory G protein as well as additional imprinted transcripts including the maternally expressed NESP55 and the paternally expressed XLαs, antisense, and A/B transcripts. Most patients with pseudohypoparathyroidism type Ib (PHP-Ib) exhibit imprinting defects affecting the maternal GNAS allele, which are thought to reduce/abolish Gsα expression in renal proximal tubules and thereby cause resistance to PTH. Objective: Our objective was to define the genetic defect in a previously unreported family with autosomal dominant PHP-Ib. Design and Setting: Analyses of serum and urine chemistries and of genomic DNA and lymphoblastoid-derived RNA were conducted at a tertiary hospital and research laboratory. Patients: Affected individuals presented with muscle weakness and/or paresthesia and showed hypocalcemia, hyperphosphatemia, and elevated serum PTH. Obligate carriers were healthy and revealed no obvious abnormality in mineral ion homeostasis. Results: A novel 4.2-kb microdeletion was discovered in the affected individuals and the obligate carriers, ablating two noncoding GNAS antisense exons while preserving the NESP55 exon. On maternal transmission, the deletion causes loss of all maternal GNAS imprints, partial gain of NESP55 methylation, and PTH resistance. Paternal transmission of the mutation leads to epigenetic alterations in cis, including a partial loss of NESP55 methylation and a partial gain of A/B methylation. Conclusions: The identified deletion points to a unique cis-acting element located telomeric of the NESP55 exon that is critical for imprinting both GNAS alleles. These findings provide novel insights into the molecular mechanisms underlying PHP and GNAS imprinting. PMID:20444925
Dynamics of water around the complex structures formed between the KH domains of far upstream element binding protein and single-stranded DNA molecules

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chakraborty, Kaushik; Bandyopadhyay, Sanjoy, E-mail: sanjoy@chem.iitkgp.ernet.in

2015-07-28

Single-stranded DNA (ss-DNA) binding proteins specifically bind to the single-stranded regions of the DNA and protect it from premature annealing, thereby stabilizing the DNA structure. We have carried out atomistic molecular dynamics simulations of the aqueous solutions of two DNA binding K homology (KH) domains (KH3 and KH4) of the far upstream element binding protein complexed with two short ss-DNA segments. Attempts have been made to explore the influence of the formation of such complex structures on the microscopic dynamics and hydrogen bond properties of the interfacial water molecules. It is found that the water molecules involved in bridging themore » ss-DNA segments and the protein domains form a highly constrained thin layer with extremely retarded mobility. These water molecules play important roles in freezing the conformational oscillations of the ss-DNA oligomers and thereby forming rigid complex structures. Further, it is demonstrated that the effect of complexation on the slow long-time relaxations of hydrogen bonds at the interface is correlated with hindered motions of the surrounding water molecules. Importantly, it is observed that the highly restricted motions of the water molecules bridging the protein and the DNA components in the complexed forms originate from more frequent hydrogen bond reformations.« less
Non-coding landscapes of colorectal cancer

PubMed Central

Ragusa, Marco; Barbagallo, Cristina; Statello, Luisa; Condorelli, Angelo Giuseppe; Battaglia, Rosalia; Tamburello, Lucia; Barbagallo, Davide; Di Pietro, Cinzia; Purrello, Michele

2015-01-01

For two decades Vogelstein’s model has been the paradigm for describing the sequence of molecular changes within protein-coding genes that would lead to overt colorectal cancer (CRC). This model is now too simplistic in the light of recent studies, which have shown that our genome is pervasively transcribed in RNAs other than mRNAs, denominated non-coding RNAs (ncRNAs). The discovery that mutations in genes encoding these RNAs [i.e., microRNAs (miRNAs), long non-coding RNAs, and circular RNAs] are causally involved in cancer phenotypes has profoundly modified our vision of tumour molecular genetics and pathobiology. By exploiting a wide range of different mechanisms, ncRNAs control fundamental cellular processes, such as proliferation, differentiation, migration, angiogenesis and apoptosis: these data have also confirmed their role as oncogenes or tumor suppressors in cancer development and progression. The existence of a sophisticated RNA-based regulatory system, which dictates the correct functioning of protein-coding networks, has relevant biological and biomedical consequences. Different miRNAs involved in neoplastic and degenerative diseases exhibit potential predictive and prognostic properties. Furthermore, the key roles of ncRNAs make them very attractive targets for innovative therapeutic approaches. Several recent reports have shown that ncRNAs can be secreted by cells into the extracellular environment (i.e., blood and other body fluids): this suggests the existence of extracellular signalling mechanisms, which may be exploited by cells in physiology and pathology. In this review, we will summarize the most relevant issues on the involvement of cellular and extracellular ncRNAs in disease. We will then specifically describe their involvement in CRC pathobiology and their translational applications to CRC diagnosis, prognosis and therapy. PMID:26556998
Braveheart, a long non-coding RNA required for cardiovascular lineage commitment

PubMed Central

Klattenhoff, Carla; Scheuermann, Johanna C.; Surface, Lauren E.; Bradley, Robert K.; Fields, Paul A.; Steinhauser, Matthew L.; Ding, Huiming; Butty, Vincent L.; Torrey, Lillian; Haas, Simon; Abo, Ryan; Tabebordbar, Mohammadsharif; Lee, Richard T.; Burge, Christopher B.; Boyer, Laurie A.

2013-01-01

Summary Long noncoding RNAs (lncRNAs) are often expressed in a development-specific manner, yet little is known about their roles in lineage commitment. Here, we identified Braveheart (Bvht), a heart-associated lncRNA in mouse. Using multiple embryonic stem cell (ESC) differentiation strategies, we show that Bvht is required for progression of nascent mesoderm towards a cardiac fate. We find that Bvht is necessary for activation of a core cardiovascular gene network and functions upstream of MesP1 (mesoderm posterior 1), a master regulator of a common multipotent cardiovascular progenitor. We also show that Bvht interacts with SUZ12, a component of Polycomb Repressive Complex 2 (PRC2), during cardiomyocyte differentiation suggesting that Bvht mediates epigenetic regulation of cardiac commitment. Finally, we demonstrate a role for Bvht in maintaining cardiac fate in neonatal cardiomyocytes. Together, our work provides evidence for a long noncoding RNA with critical roles in the establishment of the cardiovascular lineage during mammalian development. PMID:23352431
Comprehensive discovery of noncoding RNAs in acute myeloid leukemia cell transcriptomes.

PubMed

Zhang, Jin; Griffith, Malachi; Miller, Christopher A; Griffith, Obi L; Spencer, David H; Walker, Jason R; Magrini, Vincent; McGrath, Sean D; Ly, Amy; Helton, Nichole M; Trissal, Maria; Link, Daniel C; Dang, Ha X; Larson, David E; Kulkarni, Shashikant; Cordes, Matthew G; Fronick, Catrina C; Fulton, Robert S; Klco, Jeffery M; Mardis, Elaine R; Ley, Timothy J; Wilson, Richard K; Maher, Christopher A

2017-11-01

To detect diverse and novel RNA species comprehensively, we compared deep small RNA and RNA sequencing (RNA-seq) methods applied to a primary acute myeloid leukemia (AML) sample. We were able to discover previously unannotated small RNAs using deep sequencing of a library method using broader insert size selection. We analyzed the long noncoding RNA (lncRNA) landscape in AML by comparing deep sequencing from multiple RNA-seq library construction methods for the sample that we studied and then integrating RNA-seq data from 179 AML cases. This identified lncRNAs that are completely novel, differentially expressed, and associated with specific AML subtypes. Our study revealed the complexity of the noncoding RNA transcriptome through a combined strategy of strand-specific small RNA and total RNA-seq. This dataset will serve as an invaluable resource for future RNA-based analyses. Copyright © 2017 ISEH – Society for Hematology and Stem Cells. Published by Elsevier Inc. All rights reserved.
Identification and role of regulatory non-coding RNAs in Listeria monocytogenes.

PubMed

Izar, Benjamin; Mraheil, Mobarak Abu; Hain, Torsten

2011-01-01

Bacterial regulatory non-coding RNAs control numerous mRNA targets that direct a plethora of biological processes, such as the adaption to environmental changes, growth and virulence. Recently developed high-throughput techniques, such as genomic tiling arrays and RNA-Seq have allowed investigating prokaryotic cis- and trans-acting regulatory RNAs, including sRNAs, asRNAs, untranslated regions (UTR) and riboswitches. As a result, we obtained a more comprehensive view on the complexity and plasticity of the prokaryotic genome biology. Listeria monocytogenes was utilized as a model system for intracellular pathogenic bacteria in several studies, which revealed the presence of about 180 regulatory RNAs in the listerial genome. A regulatory role of non-coding RNAs in survival, virulence and adaptation mechanisms of L. monocytogenes was confirmed in subsequent experiments, thus, providing insight into a multifaceted modulatory function of RNA/mRNA interference. In this review, we discuss the identification of regulatory RNAs by high-throughput techniques and in their functional role in L. monocytogenes.
Long non-coding RNAs in hepatocellular carcinoma: Potential roles and clinical implications

PubMed Central

Niu, Zhao-Shan; Niu, Xiao-Jun; Wang, Wen-Hong

2017-01-01

Long non-coding RNAs (lncRNAs) are a subgroup of non-coding RNA transcripts greater than 200 nucleotides in length with little or no protein-coding potential. Emerging evidence indicates that lncRNAs may play important regulatory roles in the pathogenesis and progression of human cancers, including hepatocellular carcinoma (HCC). Certain lncRNAs may be used as diagnostic or prognostic markers for HCC, a serious malignancy with increasing morbidity and high mortality rates worldwide. Therefore, elucidating the functional roles of lncRNAs in tumors can contribute to a better understanding of the molecular mechanisms of HCC and may help in developing novel therapeutic targets. In this review, we summarize the recent progress regarding the functional roles of lncRNAs in HCC and explore their clinical implications as diagnostic or prognostic biomarkers and molecular therapeutic targets for HCC. PMID:28932078
Long noncoding RNA FTX inhibits hepatocellular carcinoma proliferation and metastasis by binding MCM2 and miR-374a.

PubMed

Liu, F; Yuan, J-H; Huang, J-F; Yang, F; Wang, T-T; Ma, J-Z; Zhang, L; Zhou, C-C; Wang, F; Yu, J; Zhou, W-P; Sun, S-H

2016-10-13

It has long been known that males are more susceptible than females to hepatocellular carcinoma (HCC), but the reason remains elusive. In this study, we investigated the expression and function of the long noncoding RNA FTX (lnc-FTX), an X-inactive-specific transcript (XIST) regulator transcribed from the X chromosome inactivation center, in both HCC and HCC gender disparity. lnc-FTX is expressed at higher levels in female livers than in male livers and is significantly downregulated in HCC tissues compared with normal liver tissues. Patients with higher lnc-FTX expression exhibited longer survival, suggesting that lnc-FTX is a useful prognostic factor for HCC patients. lnc-FTX inhibits HCC cell growth and metastasis both in vitro and in vivo. Mechanistically, lnc-FTX represses Wnt/β-catenin signaling activity by competitively sponging miR-374a and inhibits HCC cell epithelial-mesenchymal transition and invasion. In addition, lnc-FTX binds to the DNA replication licensing factor MCM2, thereby impeding DNA replication and inhibiting proliferation in HCC cells. In conclusion, these findings suggest that lnc-FTX may act as a tumor suppressor in HCC through physically binding miR-374a and MCM2. It may also be one of the reasons for HCC gender disparity and may potentially contribute to HCC treatment.
The human haptoglobin gene promoter: interleukin-6-responsive elements interact with a DNA-binding protein induced by interleukin-6.

PubMed Central

Oliviero, S; Cortese, R

1989-01-01

Transcription of the human haptoglobin (Hp) gene is induced by interleukin-6 (IL-6) in the human hepatoma cell line Hep3B. Cis-acting elements responsible for this response are localized within the first 186 bp of the 5'-flanking region. Site-specific mutants of the Hp promoter fused to the chloramphenicol acetyl transferase (CAT) gene were analysed by transient transfection into uninduced and IL-6-treated Hep3B cells. We identified three regions, A, B and C, defined by mutation, which are important for the IL-6 response. Band shift experiments using nuclear extracts from untreated or IL-6-treated cells revealed the presence of IL-6-inducible DNA binding activities when DNA fragments containing the A or the C sequences were used. Competition experiments showed that both sequences bind to the same nuclear factors. Polymers of oligonucleotides containing either the A or the C regions confer IL-6 responsiveness to a truncated SV40 promoter. The B region forms several complexes with specific DNA-binding proteins different from those which bind to the A and C region. The B region complexes are identical in nuclear extracts from IL-6-treated and untreated cells. While important for IL-6 induction in the context of the haptoglobin promoter, the B site does not confer IL-6 inducibility to the SV40 promoter. Our results indicate that the IL-6 response of the haptoglobin promoter is dependent on the presence of multiple, partly redundant, cis-acting elements. Images PMID:2787245
Allele-specific DNA methylation and its interplay with repressive histone marks at promoter-mutant TERT genes

PubMed Central

Stern, Josh Lewis; Paucek, Richard D.; Huang, Franklin W.; Ghandi, Mahmoud; Nwumeh, Ronald; Costello, James C.; Cech, Thomas R.

2017-01-01

SUMMARY A mutation in the promoter of the Telomerase Reverse Transcriptase (TERT) gene is the most frequent noncoding mutation in cancer. The mutation drives unusual monoallelic expression of TERT, allowing immortalization. Here we find that DNA methylation of the TERT CpG Island (CGI) is also allele-specific in multiple cancers. The expressed allele is hypomethylated, which is opposite to cancers without TERT promoter mutations. The continued presence of Polycomb repressive complex 2 (PRC2) on the inactive allele suggests that histone marks of repressed chromatin may be causally linked to high DNA methylation. Consistent with this hypothesis, TERT promoter DNA containing 5-methyl-CpG has much increased affinity for PRC2 in vitro. Thus, CpG methylation and histone marks appear to collaborate to maintain the two TERT alleles in different epigenetic states in TERT promoter-mutant cancers. Finally, in several cancers DNA methylation levels at the TERT CGI correlate with altered patient survival. PMID:29281820
Targeting Non-Coding RNAs in Plants with the CRISPR-Cas Technology is a Challenge yet Worth Accepting.

PubMed

Basak, Jolly; Nithin, Chandran

2015-01-01

Non-coding RNAs (ncRNAs) have emerged as versatile master regulator of biological functions in recent years. MicroRNAs (miRNAs) are small endogenous ncRNAs of 18-24 nucleotides in length that originates from long self-complementary precursors. Besides their direct involvement in developmental processes, plant miRNAs play key roles in gene regulatory networks and varied biological processes. Alternatively, long ncRNAs (lncRNAs) are a large and diverse class of transcribed ncRNAs whose length exceed that of 200 nucleotides. Plant lncRNAs are transcribed by different RNA polymerases, showing diverse structural features. Plant lncRNAs also are important regulators of gene expression in diverse biological processes. There has been a breakthrough in the technology of genome editing, the CRISPR-Cas9 (clustered regulatory interspaced short palindromic repeats/CRISPR-associated protein 9) technology, in the last decade. CRISPR loci are transcribed into ncRNA and eventually form a functional complex with Cas9 and further guide the complex to cleave complementary invading DNA. The CRISPR-Cas technology has been successfully applied in model plants such as Arabidopsis and tobacco and important crops like wheat, maize, and rice. However, all these studies are focused on protein coding genes. Information about targeting non-coding genes is scarce. Hitherto, the CRISPR-Cas technology has been exclusively used in vertebrate systems to engineer miRNA/lncRNAs, but it is still relatively unexplored in plants. While briefing miRNAs, lncRNAs and applications of the CRISPR-Cas technology in human and animals, this review essentially elaborates several strategies to overcome the challenges of applying the CRISPR-Cas technology in editing ncRNAs in plants and the future perspective of this field.
Maternally Expressed Gene 3, an imprinted non-coding RNA gene, is associated with meningioma pathogenesis and progression

PubMed Central

Zhang, Xun; Gejman, Roger; Mahta, Ali; Zhong, Ying; Rice, Kimberley A.; Zhou, Yunli; Cheunsuchon, Pornsuk; Louis, David N.; Klibanski, Anne

2010-01-01

Meningiomas are common tumors, representing 15-25% of all central nervous system tumors. NF2 gene inactivation on chromosome 22 has been shown as an early event in tumorigenesis; however, few factors underlying tumor growth and progression have been identified. Chromosomal abnormalities of 14q32 are often associated with meningioma pathogenesis and progression; therefore it has been proposed that an as yet unidentified tumor suppressor is present at this locus. MEG3 is an imprinted gene located at 14q32 that encodes a non-coding RNA with an anti-proliferative function. We found that MEG3 mRNA is highly expressed in normal arachnoidal cells. However, MEG3 is not expressed in the majority of human meningiomas or the human meningioma cell lines IOMM-Lee and CH157-MN. There is a strong association between loss of MEG3 expression and tumor grade. Allelic loss at the MEG3 locus is also observed in meningiomas, with increasing prevalence in higher grade tumors. In addition, there is an increase in CpG methylation within the promoter and the imprinting control region of MEG3 gene in meningiomas. Functionally, MEG3 suppresses DNA synthesis in both IOMM-Lee and CH157-MN cells by approximately 60% in BrdU incorporation assays. Colony-forming efficiency assays show that MEG3 inhibits colony formation in CH157-MN cells by approximately 80%. Furthermore, MEG3 stimulates p53-mediated transactivation in these cell lines. Therefore, these data are consistent with the hypothesis that MEG3, which encodes a non-coding RNA, may be a tumor suppressor gene at chromosome 14q32 involved in meningioma progression via a novel mechanism. PMID:20179190

The Long Non-Coding RNA Transcriptome Landscape in CHO Cells Under Batch and Fed-Batch Conditions.

PubMed

Vito, Davide; Smales, C Mark

2018-05-21

The role of non-coding RNAs in determining growth, productivity and recombinant product quality attributes in Chinese hamster ovary (CHO) cells has received much attention in recent years, exemplified by studies into microRNAs in particular. However, other classes of non-coding RNAs have received less attention. One such class are the non-coding RNAs known collectively as long non-coding RNAs (lncRNAs). We have undertaken the first landscape analysis of the lncRNA transcriptome in CHO using a mouse based microarray that also provided for the surveillance of the coding transcriptome. We report on those lncRNAs present in a model host CHO cell line under batch and fed-batch conditions on two different days and relate the expression of different lncRNAs to each other. We demonstrate that the mouse microarray was suitable for the detection and analysis of thousands of CHO lncRNAs and validated a number of these by qRT-PCR. We then further analysed the data to identify those lncRNAs whose expression changed the most between growth and stationary phases of culture or between batch and fed-batch culture to identify potential lncRNA targets for further functional studies with regard to their role in controlling growth of CHO cells. We discuss the implications for the publication of this rich dataset and how this may be used by the community. This article is protected by copyright. All rights reserved.
Refined mapping of autoimmune disease associated genetic variants with gene expression suggests an important role for non-coding RNAs.

PubMed

Ricaño-Ponce, Isis; Zhernakova, Daria V; Deelen, Patrick; Luo, Oscar; Li, Xingwang; Isaacs, Aaron; Karjalainen, Juha; Di Tommaso, Jennifer; Borek, Zuzanna Agnieszka; Zorro, Maria M; Gutierrez-Achury, Javier; Uitterlinden, Andre G; Hofman, Albert; van Meurs, Joyce; Netea, Mihai G; Jonkers, Iris H; Withoff, Sebo; van Duijn, Cornelia M; Li, Yang; Ruan, Yijun; Franke, Lude; Wijmenga, Cisca; Kumar, Vinod

2016-04-01

Genome-wide association and fine-mapping studies in 14 autoimmune diseases (AID) have implicated more than 250 loci in one or more of these diseases. As more than 90% of AID-associated SNPs are intergenic or intronic, pinpointing the causal genes is challenging. We performed a systematic analysis to link 460 SNPs that are associated with 14 AID to causal genes using transcriptomic data from 629 blood samples. We were able to link 71 (39%) of the AID-SNPs to two or more nearby genes, providing evidence that for part of the AID loci multiple causal genes exist. While 54 of the AID loci are shared by one or more AID, 17% of them do not share candidate causal genes. In addition to finding novel genes such as ULK3, we also implicate novel disease mechanisms and pathways like autophagy in celiac disease pathogenesis. Furthermore, 42 of the AID SNPs specifically affected the expression of 53 non-coding RNA genes. To further understand how the non-coding genome contributes to AID, the SNPs were linked to functional regulatory elements, which suggest a model where AID genes are regulated by network of chromatin looping/non-coding RNAs interactions. The looping model also explains how a causal candidate gene is not necessarily the gene closest to the AID SNP, which was the case in nearly 50% of cases. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.
Structural insights into the stabilization of MALAT1 noncoding RNA by a bipartite triple helix

PubMed Central

Brown, Jessica A.; Bulkley, David; Wang, Jimin; Valenstein, Max L.; Yario, Therese A.; Steitz, Thomas A.; Steitz, Joan A.

2014-01-01

Metastasis-associated lung adenocarcinoma transcript 1 (MALAT1) is a highly-abundant nuclear long noncoding RNA that promotes malignancy. A 3′-stem-loop structure is predicted to confer stability by engaging a downstream A-rich tract in a triple helix, similar to the expression and nuclear retention element (ENE) from the KSHV polyadenylated nuclear RNA. The 3.1-Å resolution crystal structure of the human MALAT1 ENE and A-rich tract reveals a bipartite triple helix containing stacks of five and four U•A-U triples separated by a C+•G-C triplet and C-G doublet, extended by two A-minor interactions. In vivo decay assays indicate that this blunt-ended triple helix, with the 3′ nucleotide in a U•A-U triple, inhibits rapid nuclear RNA decay. Interruption of the triple helix by the C-G doublet induces a “helical reset” that explains why triple-helical stacks longer than six do not occur in nature. PMID:24952594
Advances in esophageal cancer: A new perspective on pathogenesis associated with long non-coding RNAs.

PubMed

Huang, Xiaomei; Zhou, Xi; Hu, Qing; Sun, Binyu; Deng, Mingming; Qi, Xiaolong; Lü, Muhan

2018-01-28

Esophageal cancer is a malignant digestive tract cancer with high mortality. Although studies have found that esophageal cancer is involved in a complex and important gene regulation network, the pathogenesis remains unclear. The recently described long non-coding RNAs (lncRNAs) are one effective part of the gene regulation network. However, in past decades, lncRNAs were thought to be "transcript noise" or "pseudogenes" and were thus ignored. Early studies indicated that lncRNAs play pivotal roles during evolution. However, in recent years, increasing research has revealed that many lncRNAs are associated with tumorigenesis. In particular, lncRNAs may act as important elements for epigenetic regulation, transcription, post-transcriptional regulation and post-translational modification of proteins. Additionally, they may be novel biomarkers for tumors and therapeutic targets in cancer. Here, we summarize the functions of lncRNAs in esophageal cancer, with an emphasis on lncRNA-mediated regulatory mechanisms that affect the biological characteristics of esophageal cancer. Copyright © 2017 Elsevier B.V. All rights reserved.
Long Non-Coding RNA CASC2 Improves Diabetic Nephropathy by Inhibiting JNK Pathway.

PubMed

Yang, Huihui; Kan, Quan E; Su, Yong; Man, Hua

2018-06-11

It's known that long non-coding RNA CASC2 overexpression inhibit the JNK pathway in some disease models, while JNK pathway activation exacerbates diabetic nephropathy. Therefore we speculate that long non-coding RNA CASC2 can improve diabetic nephropathy by inhibiting JNK pathway. Thus, our study was carried out to investigate the involvement of CASC2 in diabetic nephropathy. We found that serum level of CASC2 was significantly lower in diabetic nephropathy patients than in normal people, and serum level of CASC2 showed no significant correlations with age, gender, alcohol consumption and smoking habits, but was correlated with course of disease. ROC curve analysis showed that serum level of CASC2 could be used to accurately predict diabetic nephropathy. Diabetes mellitus has many complications. This study also included a series of complications of diabetes, such as diabetic retinopathy, diabetic ketoacidosis, diabetic foot infections and diabetic cardiopathy, while serum level of CASC2 was specifically reduced in diabetic nephropathy. CASC2 expression level decreased, while JNK1 phosphorylation level increased in mouse podocyte cells treated with high glucose. CASC2 overexpression inhibited apoptosis of podocyte cells and reduced phosphorylation level of JNK1. We conclude that long non-coding RNA CASC2 may improve diabetic nephropathy by inhibiting JNK pathway. © Georg Thieme Verlag KG Stuttgart · New York.
Functional importance of cardiac enhancer-associated noncoding RNAs in heart development and disease

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ounzain, Samir; Pezzuto, Iole; Micheletti, Rudi

We report here that the key information processing units within gene regulatory networks are enhancers. Enhancer activity is associated with the production of tissue-specific noncoding RNAs, yet the existence of such transcripts during cardiac development has not been established. Using an integrated genomic approach, we demonstrate that fetal cardiac enhancers generate long noncoding RNAs (lncRNAs) during cardiac differentiation and morphogenesis. Enhancer expression correlates with the emergence of active enhancer chromatin states, the initiation of RNA polymerase II at enhancer loci and expression of target genes. Orthologous human sequences are also transcribed in fetal human hearts and cardiac progenitor cells. Throughmore » a systematic bioinformatic analysis, we identified and characterized, for the first time, a catalog of lncRNAs that are expressed during embryonic stem cell differentiation into cardiomyocytes and associated with active cardiac enhancer sequences. RNA-sequencing demonstrates that many of these transcripts are polyadenylated, multi-exonic long noncoding RNAs. Moreover, knockdown of two enhancer-associated lncRNAs resulted in the specific downregulation of their predicted target genes. Interestingly, the reactivation of the fetal gene program, a hallmark of the stress response in the adult heart, is accompanied by increased expression of fetal cardiac enhancer transcripts. Altogether, these findings demonstrate that the activity of cardiac enhancers and expression of their target genes are associated with the production of enhancer-derived lncRNAs.« less
Functional importance of cardiac enhancer-associated noncoding RNAs in heart development and disease

DOE PAGES

Ounzain, Samir; Pezzuto, Iole; Micheletti, Rudi; ...

2014-08-19

We report here that the key information processing units within gene regulatory networks are enhancers. Enhancer activity is associated with the production of tissue-specific noncoding RNAs, yet the existence of such transcripts during cardiac development has not been established. Using an integrated genomic approach, we demonstrate that fetal cardiac enhancers generate long noncoding RNAs (lncRNAs) during cardiac differentiation and morphogenesis. Enhancer expression correlates with the emergence of active enhancer chromatin states, the initiation of RNA polymerase II at enhancer loci and expression of target genes. Orthologous human sequences are also transcribed in fetal human hearts and cardiac progenitor cells. Throughmore » a systematic bioinformatic analysis, we identified and characterized, for the first time, a catalog of lncRNAs that are expressed during embryonic stem cell differentiation into cardiomyocytes and associated with active cardiac enhancer sequences. RNA-sequencing demonstrates that many of these transcripts are polyadenylated, multi-exonic long noncoding RNAs. Moreover, knockdown of two enhancer-associated lncRNAs resulted in the specific downregulation of their predicted target genes. Interestingly, the reactivation of the fetal gene program, a hallmark of the stress response in the adult heart, is accompanied by increased expression of fetal cardiac enhancer transcripts. Altogether, these findings demonstrate that the activity of cardiac enhancers and expression of their target genes are associated with the production of enhancer-derived lncRNAs.« less
GATA: A graphic alignment tool for comparative sequenceanalysis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nix, David A.; Eisen, Michael B.

2005-01-01

Several problems exist with current methods used to align DNA sequences for comparative sequence analysis. Most dynamic programming algorithms assume that conserved sequence elements are collinear. This assumption appears valid when comparing orthologous protein coding sequences. Functional constraints on proteins provide strong selective pressure against sequence inversions, and minimize sequence duplications and feature shuffling. For non-coding sequences this collinearity assumption is often invalid. For example, enhancers contain clusters of transcription factor binding sites that change in number, orientation, and spacing during evolution yet the enhancer retains its activity. Dotplot analysis is often used to estimate non-coding sequence relatedness. Yet dotmore » plots do not actually align sequences and thus cannot account well for base insertions or deletions. Moreover, they lack an adequate statistical framework for comparing sequence relatedness and are limited to pairwise comparisons. Lastly, dot plots and dynamic programming text outputs fail to provide an intuitive means for visualizing DNA alignments.« less
Cloning and molecular characterization of scorpion Buthus martensi venom hyaluronidases: a novel full-length and diversiform noncoding isoforms.

PubMed

Xia, Xichao; Liu, Rongzhi; Li, Yi; Xue, Shipeng; Liu, Qingchun; Jiang, Xiao; Zhang, Wenjuan; Ding, Ke

2014-09-01

Hyaluronidase is a common component of scorpion venom and has been considered as "spreading factor" that promotes a fast penetration of the venom in the anaphylactic reaction. In the current study, a novel full-length of hyaluronidase BmHYI and three noncoding isoforms of BmHYII, BmHYIII and BmHYIV were cloned by using a combined strategy based on peptide sequencing and Rapid Amplification of cDNA Ends (RACE). BmHYI has 410 amino acid residues containing the catalytic, positional and five potential N-glycosylation sites. The deduced protein sequence of BmHYI shares significant identity with venom hyaluronidases from bees and snakes. The phylogenetic analysis showed early divergence and independent evolution of BmHYI from other hyaluronidases. An extraordinarily high level of sequence similarity was detected among four sequences. But, BmHYII, BmHYIII and BmHYIV were short of stop-codon in the open reading frame and poly(A) signal in the 3' end. Copyright © 2014 Elsevier B.V. All rights reserved.
Peptides Used in the Delivery of Small Noncoding RNA

PubMed Central

2015-01-01

RNA interference (RNAi) is an endogenous process in which small noncoding RNAs, including small interfering RNAs (siRNAs) and microRNAs (miRNAs), post-transcriptionally regulate gene expressions. In general, siRNA and miRNA/miRNA mimics are similar in nature and activity except their origin and specificity. Although both siRNAs and miRNAs have been extensively studied as novel therapeutics for a wide range of diseases, the large molecular weight, anionic surface charges, instability in blood circulation, and intracellular trafficking to the RISC after cellular uptake have hindered the translation of these RNAs from bench to clinic. As a result, a great variety of delivery systems have been investigated for safe and effective delivery of small noncoding RNAs. Among these systems, peptides, especially cationic peptides, have emerged as a promising type of carrier due to their inherent ability to condense negatively charged RNAs, ease of synthesis, controllable size, and tunable structure. In this review, we will focus on three major types of cationic peptides, including poly(l-lysine) (PLL), protamine, and cell penetrating peptides (CPP), as well as peptide targeting ligands that have been extensively used in RNA delivery. The delivery strategies, applications, and limitations of these cationic peptides in siRNA/miRNA delivery will be discussed. PMID:25157701
Present Scenario of Long Non-Coding RNAs in Plants

PubMed Central

Bhatia, Garima; Goyal, Neetu; Sharma, Shailesh; Upadhyay, Santosh Kumar; Singh, Kashmir

2017-01-01

Small non-coding RNAs have been extensively studied in plants over the last decade. In contrast, genome-wide identification of plant long non-coding RNAs (lncRNAs) has recently gained momentum. LncRNAs are now being recognized as important players in gene regulation, and their potent regulatory roles are being studied comprehensively in eukaryotes. LncRNAs were first reported in humans in 1992. Since then, research in animals, particularly in humans, has rapidly progressed, and a vast amount of data has been generated, collected, and organized using computational approaches. Additionally, numerous studies have been conducted to understand the roles of these long RNA species in several diseases. However, the status of lncRNA investigation in plants lags behind that in animals (especially humans). Efforts are being made in this direction using computational tools and high-throughput sequencing technologies, such as the lncRNA microarray technique, RNA-sequencing (RNA-seq), RNA capture sequencing, (RNA CaptureSeq), etc. Given the current scenario, significant amounts of data have been produced regarding plant lncRNAs, and this amount is likely to increase in the subsequent years. In this review we have documented brief information about lncRNAs and their status of research in plants, along with the plant-specific resources/databases for information retrieval on lncRNAs. PMID:29657289
Evolution of coding and non-coding genes in HOX clusters of a marsupial.

PubMed

Yu, Hongshi; Lindsay, James; Feng, Zhi-Ping; Frankenberg, Stephen; Hu, Yanqiu; Carone, Dawn; Shaw, Geoff; Pask, Andrew J; O'Neill, Rachel; Papenfuss, Anthony T; Renfree, Marilyn B

2012-06-18

The HOX gene clusters are thought to be highly conserved amongst mammals and other vertebrates, but the long non-coding RNAs have only been studied in detail in human and mouse. The sequencing of the kangaroo genome provides an opportunity to use comparative analyses to compare the HOX clusters of a mammal with a distinct body plan to those of other mammals. Here we report a comparative analysis of HOX gene clusters between an Australian marsupial of the kangaroo family and the eutherians. There was a strikingly high level of conservation of HOX gene sequence and structure and non-protein coding genes including the microRNAs miR-196a, miR-196b, miR-10a and miR-10b and the long non-coding RNAs HOTAIR, HOTAIRM1 and HOXA11AS that play critical roles in regulating gene expression and controlling development. By microRNA deep sequencing and comparative genomic analyses, two conserved microRNAs (miR-10a and miR-10b) were identified and one new candidate microRNA with typical hairpin precursor structure that is expressed in both fibroblasts and testes was found. The prediction of microRNA target analysis showed that several known microRNA targets, such as miR-10, miR-414 and miR-464, were found in the tammar HOX clusters. In addition, several novel and putative miRNAs were identified that originated from elsewhere in the tammar genome and that target the tammar HOXB and HOXD clusters. This study confirms that the emergence of known long non-coding RNAs in the HOX clusters clearly predate the marsupial-eutherian divergence 160 Ma ago. It also identified a new potentially functional microRNA as well as conserved miRNAs. These non-coding RNAs may participate in the regulation of HOX genes to influence the body plan of this marsupial.
Evolution of coding and non-coding genes in HOX clusters of a marsupial

PubMed Central

2012-01-01

Background The HOX gene clusters are thought to be highly conserved amongst mammals and other vertebrates, but the long non-coding RNAs have only been studied in detail in human and mouse. The sequencing of the kangaroo genome provides an opportunity to use comparative analyses to compare the HOX clusters of a mammal with a distinct body plan to those of other mammals. Results Here we report a comparative analysis of HOX gene clusters between an Australian marsupial of the kangaroo family and the eutherians. There was a strikingly high level of conservation of HOX gene sequence and structure and non-protein coding genes including the microRNAs miR-196a, miR-196b, miR-10a and miR-10b and the long non-coding RNAs HOTAIR, HOTAIRM1 and HOXA11AS that play critical roles in regulating gene expression and controlling development. By microRNA deep sequencing and comparative genomic analyses, two conserved microRNAs (miR-10a and miR-10b) were identified and one new candidate microRNA with typical hairpin precursor structure that is expressed in both fibroblasts and testes was found. The prediction of microRNA target analysis showed that several known microRNA targets, such as miR-10, miR-414 and miR-464, were found in the tammar HOX clusters. In addition, several novel and putative miRNAs were identified that originated from elsewhere in the tammar genome and that target the tammar HOXB and HOXD clusters. Conclusions This study confirms that the emergence of known long non-coding RNAs in the HOX clusters clearly predate the marsupial-eutherian divergence 160 Ma ago. It also identified a new potentially functional microRNA as well as conserved miRNAs. These non-coding RNAs may participate in the regulation of HOX genes to influence the body plan of this marsupial. PMID:22708672
A subset of conserved mammalian long non-coding RNAs are fossils of ancestral protein-coding genes.

PubMed

Hezroni, Hadas; Ben-Tov Perry, Rotem; Meir, Zohar; Housman, Gali; Lubelsky, Yoav; Ulitsky, Igor

2017-08-30

Only a small portion of human long non-coding RNAs (lncRNAs) appear to be conserved outside of mammals, but the events underlying the birth of new lncRNAs in mammals remain largely unknown. One potential source is remnants of protein-coding genes that transitioned into lncRNAs. We systematically compare lncRNA and protein-coding loci across vertebrates, and estimate that up to 5% of conserved mammalian lncRNAs are derived from lost protein-coding genes. These lncRNAs have specific characteristics, such as broader expression domains, that set them apart from other lncRNAs. Fourteen lncRNAs have sequence similarity with the loci of the contemporary homologs of the lost protein-coding genes. We propose that selection acting on enhancer sequences is mostly responsible for retention of these regions. As an example of an RNA element from a protein-coding ancestor that was retained in the lncRNA, we describe in detail a short translated ORF in the JPX lncRNA that was derived from an upstream ORF in a protein-coding gene and retains some of its functionality. We estimate that ~ 55 annotated conserved human lncRNAs are derived from parts of ancestral protein-coding genes, and loss of coding potential is thus a non-negligible source of new lncRNAs. Some lncRNAs inherited regulatory elements influencing transcription and translation from their protein-coding ancestors and those elements can influence the expression breadth and functionality of these lncRNAs.
Non-coding, mRNA-like RNAs database Y2K.

PubMed

Erdmann, V A; Szymanski, M; Hochberg, A; Groot, N; Barciszewski, J

2000-01-01

In last few years much data has accumulated on various non-translatable RNA transcripts that are synthesised in different cells. They are lacking in protein coding capacity and it seems that they work mainly or exclusively at the RNA level. All known non-coding RNA transcripts are collected in the database: http://www. man.poznan.pl/5SData/ncRNA/index.html
Identification of Developmentally Regulated PCP-Responsive Non-Coding RNA, prt6, in the Rat Thalamus

PubMed Central

Umino, Asami; Nishikawa, Toru

2014-01-01

Schizophrenia and similar psychoses induced by NMDA-type glutamate receptor antagonists, such as phencyclidine (PCP) and ketamine, usually develop after adolescence. Moreover, adult-type behavioral disturbance following NMDA receptor antagonist application in rodents is observed after a critical period at around 3 postnatal weeks. These observations suggest that the schizophrenic symptoms caused by and psychotomimetic effects of NMDA antagonists require the maturation of certain brain neuron circuits and molecular networks, which differentially respond to NMDA receptor antagonists across adolescence and the critical period. From this viewpoint, we have identified a novel developmentally regulated phencyclidine-responsive transcript from the rat thalamus, designated as prt6, as a candidate molecule involved in the above schizophrenia-related systems using a DNA microarray technique. The transcript is a non-coding RNA that includes sequences of at least two microRNAs, miR132 and miR212, and is expressed strongly in the brain and testis, with trace or non-detectable levels in the spleen, heart, liver, kidney, lung and skeletal muscle, as revealed by Northern blot analysis. The systemic administration of PCP (7.5 mg/kg, subcutaneously (s.c.)) significantly elevated the expression of prt6 mRNA in the thalamus at postnatal days (PD) 32 and 50, but not at PD 8, 13, 20, or 24 as compared to saline-treated controls. At PD 50, another NMDA receptor antagonist, dizocilpine (0.5 mg/kg, s.c.), and a schizophrenomimetic dopamine agonist, methamphetamine (4.8 mg/kg, s.c.), mimicked a significant increase in the levels of thalamic prt6 mRNAs, while a D2 dopmamine receptor antagonist, haloperidol, partly inhibited the increasing influence of PCP on thalamic prt6 expression without its own effects. These data indicate that prt6 may be involved in the pathophysiology of the onset of drug-induced schizophrenia-like symptoms and schizophrenia through the possible dysregulation of
The twilight zone of cis element alignments.

PubMed

Sebastian, Alvaro; Contreras-Moreira, Bruno

2013-02-01

Sequence alignment of proteins and nucleic acids is a routine task in bioinformatics. Although the comparison of complete peptides, genes or genomes can be undertaken with a great variety of tools, the alignment of short DNA sequences and motifs entails pitfalls that have not been fully addressed yet. Here we confront the structural superposition of transcription factors with the sequence alignment of their recognized cis elements. Our goals are (i) to test TFcompare (http://floresta.eead.csic.es/tfcompare), a structural alignment method for protein-DNA complexes; (ii) to benchmark the pairwise alignment of regulatory elements; (iii) to define the confidence limits and the twilight zone of such alignments and (iv) to evaluate the relevance of these thresholds with elements obtained experimentally. We find that the structure of cis elements and protein-DNA interfaces is significantly more conserved than their sequence and measures how this correlates with alignment errors when only sequence information is considered. Our results confirm that DNA motifs in the form of matrices produce better alignments than individual sequences. Finally, we report that empirical and theoretically derived twilight thresholds are useful for estimating the natural plasticity of regulatory sequences, and hence for filtering out unreliable alignments.
The ATRX cDNA is prone to bacterial IS10 element insertions that alter its structure.

PubMed

Valle-García, David; Griffiths, Lyra M; Dyer, Michael A; Bernstein, Emily; Recillas-Targa, Félix

2014-01-01

The SWI/SNF-like chromatin-remodeling protein ATRX has emerged as a key factor in the regulation of α-globin gene expression, incorporation of histone variants into the chromatin template and, more recently, as a frequently mutated gene across a wide spectrum of cancers. Therefore, the availability of a functional ATRX cDNA for expression studies is a valuable tool for the scientific community. We have identified two independent transposon insertions of a bacterial IS10 element into exon 8 of ATRX isoform 2 coding sequence in two different plasmids derived from a single source. We demonstrate that these insertion events are common and there is an insertion hotspot within the ATRX cDNA. Such IS10 insertions produce a truncated form of ATRX, which significantly compromises its nuclear localization. In turn, we describe ways to prevent IS10 insertion during propagation and cloning of ATRX-containing vectors, including optimal growth conditions, bacterial strains, and suggested sequencing strategies. Finally, we have generated an insertion-free plasmid that is available to the community for expression studies of ATRX.
Promoter of lncRNA Gene PVT1 Is a Tumor-Suppressor DNA Boundary Element. | Office of Cancer Genomics

Cancer.gov

Noncoding mutations in cancer genomes are frequent but challenging to interpret. PVT1 encodes an oncogenic lncRNA, but recurrent translocations and deletions in human cancers suggest alternative mechanisms. Here, we show that the PVT1 promoter has a tumor-suppressor function that is independent of PVT1 lncRNA. CRISPR interference of PVT1 promoter enhances breast cancer cell competition and growth in vivo.
The high mobility group protein 1 enhances binding of the estrogen receptor DNA binding domain to the estrogen response element.

PubMed

Romine, L E; Wood, J R; Lamia, L A; Prendergast, P; Edwards, D P; Nardulli, A M

1998-05-01

We have examined the ability of the high-mobility group protein 1 (HMG1) to alter binding of the estrogen receptor DNA-binding domain (DBD) to the estrogen response element (ERE). HMG1 dramatically enhanced binding of purified, bacterially expressed DBD to the consensus vitellogenin A2 ERE in a dose-dependent manner. The ability of HMG1 to stabilize the DBD-ERE complex resulted in part from a decrease in the dissociation rate of the DBD from the ERE. Antibody supershift experiments demonstrated that HMG1 was also capable of forming a ternary complex with the ERE-bound DBD in the presence of HMG1-specific antibody. HMG1 did not substantially affect DBD-ERE contacts as assessed by methylation interference assays, nor did it alter the ability of the DBD to induce distortion in ERE-containing DNA fragments. Because HMG1 dramatically enhanced estrogen receptor DBD binding to the ERE, and the DBD is the most highly conserved region among the nuclear receptor superfamily members, HMG1 may function to enhance binding of other nuclear receptors to their respective response elements and act in concert with coactivator proteins to regulate expression of hormone-responsive genes.

Identification of a Conserved Non-Protein-Coding Genomic Element that Plays an Essential Role in Alphabaculovirus Pathogenesis

PubMed Central

Kikhno, Irina

2014-01-01

Highly homologous sequences 154–157 bp in length grouped under the name of “conserved non-protein-coding element” (CNE) were revealed in all of the sequenced genomes of baculoviruses belonging to the genus Alphabaculovirus. A CNE alignment led to the detection of a set of highly conserved nucleotide clusters that occupy strictly conserved positions in the CNE sequence. The significant length of the CNE and conservation of both its length and cluster architecture were identified as a combination of characteristics that make this CNE different from known viral non-coding functional sequences. The essential role of the CNE in the Alphabaculovirus life cycle was demonstrated through the use of a CNE-knockout Autographa californica multiple nucleopolyhedrovirus (AcMNPV) bacmid. It was shown that the essential function of the CNE was not mediated by the presumed expression activities of the protein- and non-protein-coding genes that overlap the AcMNPV CNE. On the basis of the presented data, the AcMNPV CNE was categorized as a complex-structured, polyfunctional genomic element involved in an essential DNA transaction that is associated with an undefined function of the baculovirus genome. PMID:24740153
Disease-Causing 7.4 kb Cis-Regulatory Deletion Disrupting Conserved Non-Coding Sequences and Their Interaction with the FOXL2 Promotor: Implications for Mutation Screening

PubMed Central

Dostie, Josée; Lemire, Edmond; Bouchard, Philippe; Field, Michael; Jones, Kristie; Lorenz, Birgit; Menten, Björn; Buysse, Karen; Pattyn, Filip; Friedli, Marc; Ucla, Catherine; Rossier, Colette; Wyss, Carine; Speleman, Frank; De Paepe, Anne; Dekker, Job; Antonarakis, Stylianos E.; De Baere, Elfride

2009-01-01

To date, the contribution of disrupted potentially cis-regulatory conserved non-coding sequences (CNCs) to human disease is most likely underestimated, as no systematic screens for putative deleterious variations in CNCs have been conducted. As a model for monogenic disease we studied the involvement of genetic changes of CNCs in the cis-regulatory domain of FOXL2 in blepharophimosis syndrome (BPES). Fifty-seven molecularly unsolved BPES patients underwent high-resolution copy number screening and targeted sequencing of CNCs. Apart from three larger distant deletions, a de novo deletion as small as 7.4 kb was found at 283 kb 5′ to FOXL2. The deletion appeared to be triggered by an H-DNA-induced double-stranded break (DSB). In addition, it disrupts a novel long non-coding RNA (ncRNA) PISRT1 and 8 CNCs. The regulatory potential of the deleted CNCs was substantiated by in vitro luciferase assays. Interestingly, Chromosome Conformation Capture (3C) of a 625 kb region surrounding FOXL2 in expressing cellular systems revealed physical interactions of three upstream fragments and the FOXL2 core promoter. Importantly, one of these contains the 7.4 kb deleted fragment. Overall, this study revealed the smallest distant deletion causing monogenic disease and impacts upon the concept of mutation screening in human disease and developmental disorders in particular. PMID:19543368
HOTAIR: An Oncogenic Long Non-Coding RNA in Human Cancer.

PubMed

Tang, Qing; Hann, Swei Sunny

2018-05-24

Long non-coding RNAs (LncRNAs) represent a novel class of noncoding RNAs that are longer than 200 nucleotides without protein-coding potential and function as novel master regulators in various human diseases, including cancer. Accumulating evidence shows that lncRNAs are dysregulated and implicated in various aspects of cellular homeostasis, such as proliferation, apoptosis, mobility, invasion, metastasis, chromatin remodeling, gene transcription, and post-transcriptional processing. However, the mechanisms by which lncRNAs regulate various biological functions in human diseases have yet to be determined. HOX antisense intergenic RNA (HOTAIR) is a recently discovered lncRNA and plays a critical role in various areas of cancer, such as proliferation, survival, migration, drug resistance, and genomic stability. In this review, we briefly introduce the concept, identification, and biological functions of HOTAIR. We then describe the involvement of HOTAIR that has been associated with tumorigenesis, growth, invasion, cancer stem cell differentiation, metastasis, and drug resistance in cancer. We also discuss emerging insights into the role of HOTAIR as potential biomarkers and therapeutic targets for novel treatment paradigms in cancer. © 2018 The Author(s). Published by S. Karger AG, Basel.
Expression of long non-coding RNA-HOTAIR in oral squamous cell carcinoma Tca8113 cells and its associated biological behavior

PubMed Central

Liu, Huawei; Li, Zhiyong; Wang, Chao; Feng, Lin; Huang, Haitao; Liu, Changkui; Li, Fengxia

2016-01-01

As a long noncoding RNA, HOX transcript antisense intergenic RNA (HOTAIR) is highly expressed in many types of tumors. However, its expression and function in oral squamous cell carcinoma (OSCC) cells and tissues remains largely unknown. We herein studied the biological functions of HOTAIR in OSCC Tca8113 cells. Real-time quantitative PCR showed that HOTAIR, p21 and p53 mRNA expressions in doxorubicin (DOX)-treated or γ-ray-irradiated Tca8113 cells were up-regulated. Knockdown of p53 expression inhibited DOX-induced HOTAIR up-regulation, suggesting that DNA damage-induced HOTAIR expression may be associated with p53. Transfection and CCK-8 assays showed that compared with the control group, overexpression of HOTAIR promoted the proliferation of Tca8113 cells, while interfering with its expression played an opposite role. Flow cytometry exhibited that HOTAIR overexpression decreased the rate of DOX-induced apoptosis. When HOTAIR expression was inhibited by siRNA, the proportions of cells in G2/M and S phases increased and decreased respectively. Meanwhile, the rate of DOX-induced apoptosis rose. DNA damage-induced HOTAIR expression facilitated the proliferation of Tca8113 cells and decreased their apoptosis. However, whether the up-regulation depends on p53 still needs in-depth studies. PMID:27904675
Non-coding RNA networks in cancer.

PubMed

Anastasiadou, Eleni; Jacob, Leni S; Slack, Frank J

2018-01-01

Thousands of unique non-coding RNA (ncRNA) sequences exist within cells. Work from the past decade has altered our perception of ncRNAs from 'junk' transcriptional products to functional regulatory molecules that mediate cellular processes including chromatin remodelling, transcription, post-transcriptional modifications and signal transduction. The networks in which ncRNAs engage can influence numerous molecular targets to drive specific cell biological responses and fates. Consequently, ncRNAs act as key regulators of physiological programmes in developmental and disease contexts. Particularly relevant in cancer, ncRNAs have been identified as oncogenic drivers and tumour suppressors in every major cancer type. Thus, a deeper understanding of the complex networks of interactions that ncRNAs coordinate would provide a unique opportunity to design better therapeutic interventions.
Regulated Formation of lncRNA-DNA Hybrids Enables Faster Transcriptional Induction and Environmental Adaptation.

PubMed

Cloutier, Sara C; Wang, Siwen; Ma, Wai Kit; Al Husini, Nadra; Dhoondia, Zuzer; Ansari, Athar; Pascuzzi, Pete E; Tran, Elizabeth J

2016-02-04

Long non-coding (lnc)RNAs, once thought to merely represent noise from imprecise transcription initiation, have now emerged as major regulatory entities in all eukaryotes. In contrast to the rapidly expanding identification of individual lncRNAs, mechanistic characterization has lagged behind. Here we provide evidence that the GAL lncRNAs in the budding yeast S. cerevisiae promote transcriptional induction in trans by formation of lncRNA-DNA hybrids or R-loops. The evolutionarily conserved RNA helicase Dbp2 regulates formation of these R-loops as genomic deletion or nuclear depletion results in accumulation of these structures across the GAL cluster gene promoters and coding regions. Enhanced transcriptional induction is manifested by lncRNA-dependent displacement of the Cyc8 co-repressor and subsequent gene looping, suggesting that these lncRNAs promote induction by altering chromatin architecture. Moreover, the GAL lncRNAs confer a competitive fitness advantage to yeast cells because expression of these non-coding molecules correlates with faster adaptation in response to an environmental switch. Copyright © 2016 Elsevier Inc. All rights reserved.
DNA Fingerprinting of Lactobacillus crispatus Strain CTV-05 by Repetitive Element Sequence-Based PCR Analysis in a Pilot Study of Vaginal Colonization

PubMed Central

Antonio, May A. D.; Hillier, Sharon L.

2003-01-01

Lactobacillus crispatus is one of the predominant hydrogen peroxide (H2O2)-producing species found in the vagina and is under development as a probiotic for the treatment of bacterial vaginosis. In this study, we assessed whether DNA fingerprinting by repetitive element sequence-based PCR (rep-PCR) can be used to distinguish the capsule strain of L. crispatus (CTV-05) from other endogenous strains as well as other species of vaginal lactobacilli. Vaginal and rectal lactobacilli were identified to the species level by using whole-chromosome probe DNA hybridization. The DNAs from L. crispatus, L. jensenii, L. gasseri, and an as-yet-unnamed H2O2-negative Lactobacillus species designated 1086V were subjected to rep-PCR. The results of gel electrophoresis and ethidium bromide staining of the DNA fingerprints obtained were compared. L. crispatus CTV-05 had a unique DNA fingerprint compared to all other lactobacilli. DNA fingerprints for 27 production lots of L. crispatus sampled from 1994 through 2001 were identical to that of the original strain isolated in 1993, suggesting strain stability. In a pilot study of nine women, this DNA fingerprinting method distinguished CTV-05 from other endogenous vaginal lactobacilli prior to and after vaginal capsule use. rep-PCR DNA fingerprinting is useful for strain typing and for evaluating longitudinal loss or acquisition of vaginal lactobacilli used as probiotics. PMID:12734221
Paternal leakage of mitochondrial DNA in experimental crosses of populations of the potato cyst nematode Globodera pallida.

PubMed

Hoolahan, Angelique H; Blok, Vivian C; Gibson, Tracey; Dowton, Mark

2011-12-01

Animal mtDNA is typically assumed to be maternally inherited. Paternal mtDNA has been shown to be excluded from entering the egg or eliminated post-fertilization in several animals. However, in the contact zones of hybridizing species and populations, the reproductive barriers between hybridizing organisms may not be as efficient at preventing paternal mtDNA inheritance, resulting in paternal leakage. We assessed paternal mtDNA leakage in experimental crosses of populations of a cyst-forming nematode, Globodera pallida. A UK population, Lindley, was crossed with two South American populations, P5A and P4A. Hybridization of these populations was supported by evidence of nuclear DNA from both the maternal and paternal populations in the progeny. To assess paternal mtDNA leakage, a ~3.4 kb non-coding mtDNA region was analyzed in the parental populations and in the progeny. Paternal mtDNA was evident in the progeny of both crosses involving populations P5A and P4A. Further, paternal mtDNA replaced the maternal mtDNA in 22 and 40 % of the hybrid cysts from these crosses, respectively. These results indicate that under appropriate conditions, paternal leakage occurs in the mtDNA of parasitic nematodes, and supports the hypothesis that hybrid zones facilitate paternal leakage. Thus, assumptions of strictly maternal mtDNA inheritance may be frequently violated, particularly when divergent populations interbreed.
Solving Mendelian Mysteries: The Non-coding Genome May Hold the Key.

PubMed

Valente, Enza Maria; Bhatia, Kailash P

2018-02-22

Despite revolutionary advances in sequencing approaches, many mendelian disorders have remained unexplained. In this issue of Cell, Aneichyk et al. combine genomic and cell-type-specific transcriptomic data to causally link a non-coding mutation in the ubiquitous TAF1 gene to X-linked dystonia-parkinsonism. Copyright © 2018 Elsevier Inc. All rights reserved.
Non-coding, mRNA-like RNAs database Y2K

PubMed Central

Erdmann, Volker A.; Szymanski, Maciej; Hochberg, Abraham; Groot, Nathan de; Barciszewski, Jan

2000-01-01

In last few years much data has accumulated on various non-translatable RNA transcripts that are synthesised in different cells. They are lacking in protein coding capacity and it seems that they work mainly or exclusively at the RNA level. All known non-coding RNA transcripts are collected in the database: http://www.man.poznan.pl/5SData/ncRNA/index.html PMID:10592224
Noncoding RNA Shows Context-Dependent Function | Center for Cancer Research

Cancer.gov

In addition to well-studied protein coding sequences, it is known that the genomes of higher organisms produce numerous noncoding RNAs (ncRNAs). Important roles for some ncRNAs in cell function have been demonstrated, though usually on a case-by-case basis, leading some scientists to argue that the majority of ncRNA production is just “noise” that results from the imperfect
Cloning and characterization of transferrin cDNA and rapid detection of transferrin gene polymorphism in rainbow trout (Oncorhynchus mykiss).

PubMed

Tange, N; Jong-Young, L; Mikawa, N; Hirono, I; Aoki, T

1997-12-01

A cDNA clone of rainbow trout (Oncorhynchus mykiss) transferrin was obtained from a liver cDNA library. The 2537-bp cDNA sequence contained an open reading frame encoding 691 amino acids and the 5' and 3' noncoding regions. The amino acid sequences at the iron-binding sites and the two N-linked glycosylation sites, and the cysteine residues were consistent with known, conserved vertebrate transferrin cDNA sequences. Single N-linked glycosylation sites existed on the N- and C-lobe. The deduced amino acid sequence of the rainbow trout transferrin cDNA had 92.9% identities with transferrin of coho salmon (Oncorhynchus kisutch); 85%, Atlantic salmon (Salmo salar); 67.3%, medaka (Oryzias latipes); 61.3% Atlantic cod (Gadus morhua); and 59.7%, Japanese flounder (Paralichthys olivaceus). The long and accurate polymerase chain reaction (LA-PCR) was used to amplify approximately 6.5 kb of the transferrin gene from rainbow trout genomic DNA. Restriction fragment length polymorphisms (RFLPs) of the LA-PCR products revealed three digestion patterns in 22 samples.
Genomic Identification and Analysis of Shared Cis-regulator Elements in a Developmentally Critical homeobox Cluster

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chris Amemiya

2003-04-01

The goals of this project were to isolate, characterize, and sequence the Dlx3/Dlx7 bigene cluster from twelve different species of mammals. The Dlx3 and Dlx7 genes are known to encode homeobox transcription factors involved in patterning of structures in the vertebrate jaw as well as vertebrate limbs. Genomic sequences from the respective taxa will subsequently be compared in order to identify conserved non-coding sequences that are potential cis-regulatory elements. Based on the comparisons they will fashion transgenic mouse experiments to functionally test the strength of the potential cis-regulatory elements. A goal of the project is to attempt to identify thosemore » elements that may function in coordinately regulating both Dlx3 and Dlx7 functions.« less
Decoding the function of nuclear long non-coding RNAs.

PubMed

Chen, Ling-Ling; Carmichael, Gordon G

2010-06-01

Long non-coding RNAs (lncRNAs) are mRNA-like, non-protein-coding RNAs that are pervasively transcribed throughout eukaryotic genomes. Rather than silently accumulating in the nucleus, many of these are now known or suspected to play important roles in nuclear architecture or in the regulation of gene expression. In this review, we highlight some recent progress in how lncRNAs regulate these important nuclear processes at the molecular level. Copyright 2010 Elsevier Ltd. All rights reserved.
Non-canonical ribosomal DNA segments in the human genome, and nucleoli functioning.

PubMed

Kupriyanova, Natalia S; Netchvolodov, Kirill K; Sadova, Anastasia A; Cherepanova, Marina D; Ryskov, Alexei P

2015-11-10

Ribosomal DNA (rDNA) in the human genome is represented by tandem repeats of 43 kb nucleotide sequences that form nucleoli organizers (NORs) on each of five pairs of acrocentric chromosomes. RDNA-similar segments of different lengths are also present on (NOR)(-) chromosomes. Many of these segments contain nucleotide substitutions, supplementary microsatellite clusters, and extended deletions. Recently, it was shown that, in addition to ribosome biogenesis, nucleoli exhibit additional functions, such as cell-cycle regulation and response to stresses. In particular, several stress-inducible loci located in the ribosomal intergenic spacer (rIGS) produce stimuli-specific noncoding nucleolus RNAs. By mapping the 5'/3' ends of the rIGS segments scattered throughout (NOR)(-) chromosomes, we discovered that the bonds in the rIGS that were most often susceptible to disruption in the rIGS were adjacent to, or overlapped with stimuli-specific inducible loci. This suggests the interconnection of the two phenomena - nucleoli functioning and the scattering of rDNA-like sequences on (NOR)(-) chromosomes. Copyright © 2015 Elsevier B.V. All rights reserved.
Dampening DNA binding: a common mechanism of transcriptional repression for both ncRNAs and protein domains.

PubMed

Goodrich, James A; Kugel, Jennifer F

2010-01-01

With eukaryotic non-coding RNAs (ncRNAs) now established as critical regulators of cellular transcription, the true diversity with which they can elicit biological effects is beginning to be appreciated. Two ncRNAs, mouse B2 RNA and human Alu RNA, have been found to repress mRNA transcription in response to heat shock. They do so by binding directly to RNA polymerase II, assembling into complexes on promoter DNA, and disrupting contacts between the polymerase and the DNA. Such a mechanism of repression had not previously been observed for a eukaryotic ncRNA; however, there are examples of eukaryotic protein domains that repress transcription by blocking essential protein-DNA interactions. Comparing the mechanism of transcriptional repression utilized by these protein domains to that used by B2 and Alu RNAs raises intriguing questions regarding transcriptional control, and how B2 and Alu RNAs might themselves be regulated.
Differential DNA methylation at conserved non-genic elements and evidence for transgenerational inheritance following developmental exposure to mono(2-ethylhexyl) phthalate and 5-azacytidine in zebrafish.

PubMed

Kamstra, Jorke H; Sales, Liana Bastos; Aleström, Peter; Legler, Juliette

2017-01-01

Exposure to environmental stressors during development may lead to latent and transgenerational adverse health effects. To understand the role of DNA methylation in these effects, we used zebrafish as a vertebrate model to investigate heritable changes in DNA methylation following chemical-induced stress during early development. We exposed zebrafish embryos to non-embryotoxic concentrations of the biologically active phthalate metabolite mono(2-ethylhexyl) phthalate (MEHP, 30 µM) and the DNA methyltransferase 1 inhibitor 5-azacytidine (5AC, 10 µM). Direct, latent and transgenerational effects on DNA methylation were assessed using global, genome-wide and locus-specific DNA methylation analyses. Following direct exposure in zebrafish embryos from 0 to 6 days post-fertilization, genome-wide analysis revealed a multitude of differentially methylated regions, strongly enriched at conserved non-genic elements for both compounds. Pathways involved in adipogenesis were enriched with the putative obesogenic compound MEHP. Exposure to 5AC resulted in enrichment of pathways involved in embryonic development and transgenerational effects on larval body length. Locus-specific methylation analysis of 10 differentially methylated sites revealed six of these loci differentially methylated in sperm sampled from adult zebrafish exposed during development to 5AC, and in first and second generation larvae. With MEHP, consistent changes were found at 2 specific loci in first and second generation larvae. Our results suggest a functional role for DNA methylation on cis-regulatory conserved elements following developmental exposure to compounds. Effects on these regions are potentially transferred to subsequent generations.
Quantification of non-coding RNA target localization diversity and its application in cancers.

PubMed

Cheng, Lixin; Leung, Kwong-Sak

2018-04-01

Subcellular localization is pivotal for RNAs and proteins to implement biological functions. The localization diversity of protein interactions has been studied as a crucial feature of proteins, considering that the protein-protein interactions take place in various subcellular locations. Nevertheless, the localization diversity of non-coding RNA (ncRNA) target proteins has not been systematically studied, especially its characteristics in cancers. In this study, we provide a new algorithm, non-coding RNA target localization coefficient (ncTALENT), to quantify the target localization diversity of ncRNAs based on the ncRNA-protein interaction and protein subcellular localization data. ncTALENT can be used to calculate the target localization coefficient of ncRNAs and measure how diversely their targets are distributed among the subcellular locations in various scenarios. We focus our study on long non-coding RNAs (lncRNAs), and our observations reveal that the target localization diversity is a primary characteristic of lncRNAs in different biotypes. Moreover, we found that lncRNAs in multiple cancers, differentially expressed cancer lncRNAs, and lncRNAs with multiple cancer target proteins are prone to have high target localization diversity. Furthermore, the analysis of gastric cancer helps us to obtain a better understanding that the target localization diversity of lncRNAs is an important feature closely related to clinical prognosis. Overall, we systematically studied the target localization diversity of the lncRNAs and uncovered its association with cancer.
Transcriptome interrogation of human myometrium identifies differentially expressed sense-antisense pairs of protein-coding and long non-coding RNA genes in spontaneous labor at term

PubMed Central

Romero, Roberto; Tarca, Adi; Chaemsaithong, Piya; Miranda, Jezid; Chaiworapongsa, Tinnakorn; Jia, Hui; Hassan, Sonia S.; Kalita, Cynthia A.; Cai, Juan; Yeo, Lami; Lipovich, Leonard

2014-01-01

Objective The mechanisms responsible for normal and abnormal parturition are poorly understood. Myometrial activation leading to regular uterine contractions is a key component of labor. Dysfunctional labor (arrest of dilatation and/or descent) is a leading indication for cesarean delivery. Compelling evidence suggests that most of these disorders are functional in nature, and not the result of cephalopelvic disproportion. The methodology and the datasets afforded by the post-genomic era provide novel opportunities to understand and target gene functions in these disorders. In 2012, the ENCODE Consortium elucidated the extraordinary abundance and functional complexity of long non-coding RNA genes in the human genome. The purpose of the study was to identify differentially expressed long non-coding RNA genes in human myometrium in women in spontaneous labor at term. Materials and Methods Myometrium was obtained from women undergoing cesarean deliveries who were not in labor (n=19) and women in spontaneous labor at term (n=20). RNA was extracted and profiled using an Illumina® microarray platform. The analysis of the protein coding genes from this study has been previously reported. Here, we have used computational approaches to bound the extent of long non-coding RNA representation on this platform, and to identify co-differentially expressed and correlated pairs of long non-coding RNA genes and protein-coding genes sharing the same genomic loci. Results Upon considering more than 18,498 distinct lncRNA genes compiled nonredundantly from public experimental data sources, and interrogating 2,634 that matched Illumina microarray probes, we identified co-differential expression and correlation at two genomic loci that contain coding-lncRNA gene pairs: SOCS2-AK054607 and LMCD1-NR_024065 in women in spontaneous labor at term. This co-differential expression and correlation was validated by qRT-PCR, an independent experimental method. Intriguingly, one of the two lnc
CRISPR Display: A modular method for locus-specific targeting of long noncoding RNAs and synthetic RNA devices in vivo

PubMed Central

Shechner, David M.; Hacisüleyman, Ezgi; Younger, Scott T.; Rinn, John L.

2016-01-01

Noncoding RNAs (ncRNAs) comprise an important class of regulatory molecules that mediate a vast array of biological processes. This broad functional capacity has also facilitated the design of artificial ncRNAs with novel functions. To further investigate and harness these capabilities, we developed CRISPR-Display (“CRISP-Disp”), a targeted localization method that uses Sp. Cas9 to deploy large RNA cargos to DNA loci. We demonstrate that exogenous RNA domains can be functionally appended onto the CRISPR scaffold at multiple insertion points, allowing the construction of Cas9 complexes with protein-binding cassettes, artificial aptamers, pools of random sequences, and RNAs up to 4.8 kilobases in length, including natural lncRNAs. Unlike most existing CRISPR methods, CRISP-Disp allows simultaneous multiplexing of distinct functions at multiple targets, limited only by the number of available functional RNA motifs. We anticipate that this technology will provide a powerful method with which to ectopically localize functional RNAs and ribonucleoprotein (RNP) complexes at specified genomic loci. PMID:26030444

Some links on this page may take you to non-federal websites. Their policies may differ from this site.