Kanhayuwa, Lakkhana; Coutts, Robert H. A.
2016-01-01
Novel families of short interspersed nuclear element (SINE) sequences in the human pathogenic fungus Aspergillus fumigatus, clinical isolate Af293, were identified and categorised into tRNA-related and 5S rRNA-related SINEs. Eight predicted tRNA-related SINE families originating from different tRNAs, and nominated as AfuSINE2 sequences, contained target site duplications of short direct repeat sequences (4–14 bp) flanking the elements, an extended tRNA-unrelated region and typical features of RNA polymerase III promoter sequences. The elements ranged in size from 140–493 bp and were present in low copy number in the genome and five out of eight were actively transcribed. One putative tRNAArg-derived sequence, AfuSINE2-1a possessed a unique feature of repeated trinucleotide ACT residues at its 3’-terminus. This element was similar in sequence to the I-4_AO element found in A. oryzae and an I-1_AF long nuclear interspersed element-like sequence identified in A. fumigatus Af293. Families of 5S rRNA-related SINE sequences, nominated as AfuSINE3, were also identified and their 5'-5S rRNA-related regions show 50–65% and 60–75% similarity to respectively A. fumigatus 5S rRNAs and SINE3-1_AO found in A. oryzae. A. fumigatus Af293 contains five copies of AfuSINE3 sequences ranging in size from 259–343 bp and two out of five AfuSINE3 sequences were actively transcribed. Investigations on AfuSINE distribution in the fungal genome revealed that the elements are enriched in pericentromeric and subtelomeric regions and inserted within gene-rich regions. We also demonstrated that some, but not all, AfuSINE sequences are targeted by host RNA silencing mechanisms. Finally, we demonstrated that infection of the fungus with mycoviruses had no apparent effects on SINE activity. PMID:27736869
Kanhayuwa, Lakkhana; Coutts, Robert H A
2016-01-01
Novel families of short interspersed nuclear element (SINE) sequences in the human pathogenic fungus Aspergillus fumigatus, clinical isolate Af293, were identified and categorised into tRNA-related and 5S rRNA-related SINEs. Eight predicted tRNA-related SINE families originating from different tRNAs, and nominated as AfuSINE2 sequences, contained target site duplications of short direct repeat sequences (4-14 bp) flanking the elements, an extended tRNA-unrelated region and typical features of RNA polymerase III promoter sequences. The elements ranged in size from 140-493 bp and were present in low copy number in the genome and five out of eight were actively transcribed. One putative tRNAArg-derived sequence, AfuSINE2-1a possessed a unique feature of repeated trinucleotide ACT residues at its 3'-terminus. This element was similar in sequence to the I-4_AO element found in A. oryzae and an I-1_AF long nuclear interspersed element-like sequence identified in A. fumigatus Af293. Families of 5S rRNA-related SINE sequences, nominated as AfuSINE3, were also identified and their 5'-5S rRNA-related regions show 50-65% and 60-75% similarity to respectively A. fumigatus 5S rRNAs and SINE3-1_AO found in A. oryzae. A. fumigatus Af293 contains five copies of AfuSINE3 sequences ranging in size from 259-343 bp and two out of five AfuSINE3 sequences were actively transcribed. Investigations on AfuSINE distribution in the fungal genome revealed that the elements are enriched in pericentromeric and subtelomeric regions and inserted within gene-rich regions. We also demonstrated that some, but not all, AfuSINE sequences are targeted by host RNA silencing mechanisms. Finally, we demonstrated that infection of the fungus with mycoviruses had no apparent effects on SINE activity.
Basu, Abhijit; Jain, Niyati; Tolbert, Blanton S.; Komar, Anton A.
2017-01-01
Abstract RNA–protein interactions with physiological outcomes usually rely on conserved sequences within the RNA element. By contrast, activity of the diverse gamma-interferon-activated inhibitor of translation (GAIT)-elements relies on the conserved RNA folding motifs rather than the conserved sequence motifs. These elements drive the translational silencing of a group of chemokine (CC/CXC) and chemokine receptor (CCR) mRNAs, thereby helping to resolve physiological inflammation. Despite sequence dissimilarity, these RNA elements adopt common secondary structures (as revealed by 2D-1H NMR spectroscopy), providing a basis for their interaction with the RNA-binding GAIT complex. However, many of these elements (e.g. those derived from CCL22, CXCL13, CCR4 and ceruloplasmin (Cp) mRNAs) have substantially different affinities for GAIT complex binding. Toeprinting analysis shows that different positions within the overall conserved GAIT element structure contribute to differential affinities of the GAIT protein complex towards the elements. Thus, heterogeneity of GAIT elements may provide hierarchical fine-tuning of the resolution of inflammation. PMID:29069516
Insights into Structural and Mechanistic Features of Viral IRES Elements
Martinez-Salas, Encarnacion; Francisco-Velilla, Rosario; Fernandez-Chamorro, Javier; Embarek, Azman M.
2018-01-01
Internal ribosome entry site (IRES) elements are cis-acting RNA regions that promote internal initiation of protein synthesis using cap-independent mechanisms. However, distinct types of IRES elements present in the genome of various RNA viruses perform the same function despite lacking conservation of sequence and secondary RNA structure. Likewise, IRES elements differ in host factor requirement to recruit the ribosomal subunits. In spite of this diversity, evolutionarily conserved motifs in each family of RNA viruses preserve sequences impacting on RNA structure and RNA–protein interactions important for IRES activity. Indeed, IRES elements adopting remarkable different structural organizations contain RNA structural motifs that play an essential role in recruiting ribosomes, initiation factors and/or RNA-binding proteins using different mechanisms. Therefore, given that a universal IRES motif remains elusive, it is critical to understand how diverse structural motifs deliver functions relevant for IRES activity. This will be useful for understanding the molecular mechanisms beyond cap-independent translation, as well as the evolutionary history of these regulatory elements. Moreover, it could improve the accuracy to predict IRES-like motifs hidden in genome sequences. This review summarizes recent advances on the diversity and biological relevance of RNA structural motifs for viral IRES elements. PMID:29354113
White, Eleanor; Kamieniarz-Gdula, Kinga; Dye, Michael J.; Proudfoot, Nick J.
2013-01-01
RNA Polymerase II (Pol II) termination is dependent on RNA processing signals as well as specific terminator elements located downstream of the poly(A) site. One of the two major terminator classes described so far is the Co-Transcriptional Cleavage (CoTC) element. We show that homopolymer A/T tracts within the human β-globin CoTC-mediated terminator element play a critical role in Pol II termination. These short A/T tracts, dispersed within seemingly random sequences, are strong terminator elements, and bioinformatics analysis confirms the presence of such sequences in 70% of the putative terminator regions (PTRs) genome-wide. PMID:23258704
Searching for nuclear export elements in hepatitis D virus RNA.
Freitas, Natália; Cunha, Celso
2013-08-12
To search for the presence of cis elements in hepatitis D virus (HDV) genomic and antigenomic RNA capable of promoting nuclear export. We made use of a well characterized chloramphenicol acetyl-transferase reporter system based on plasmid pDM138. Twenty cDNA fragments corresponding to different HDV genomic and antigenomic RNA sequences were inserted in plasmid pDM138, and used in transfection experiments in Huh7 cells. The relative amounts of HDV RNA in nuclear and cytoplasmic fractions were then determined by real-time polymerase chain reaction and Northern blotting. The secondary structure of the RNA sequences that displayed nuclear export ability was further predicted using a web interface. Finally, the sensitivity to leptomycin B was assessed in order to investigate possible cellular pathways involved in HDV RNA nuclear export. Analysis of genomic RNA sequences did not allow identifying an unequivocal nuclear export element. However, two regions were found to promote the export of reporter mRNAs with efficiency higher than the negative controls albeit lower than the positive control. These regions correspond to nucleotides 266-489 and 584-920, respectively. In addition, when analyzing antigenomic RNA sequences a nuclear export element was found in positions 214-417. Export mediated by the nuclear export element of HDV antigenomic RNA is sensitive to leptomycin B suggesting a possible role of CRM1 in this transport pathway. A cis-acting nuclear export element is present in nucleotides 214-417 of HDV antigenomic RNA.
Organization and transient expression of the gene for human U11 snRNA
Clemens, Suter-Crazzolara; Walter, Keller
1991-01-01
The nucleotide sequence of U11 small nuclear RNA, a minor U RNA from HeLa cells, was determined. Computer analysis of the sequence (135 residues) predicts two strong hairpin loops which are separated by seventeen nucleotides containing an Sm binding site (AAUUUUUUGG). A synthetic gene was constructed in which the coding region of U11 RNA is under the control of a T7 promoter. This vector can be used to produce U11 RNA in vitro. Southern hybridization and PCR analysis of HeLa genomic DNA suggest that U11 RNA is encoded by a single copy gene, and that at least three genomic regions could be U11 RNA pseudogenes. A HeLa genomic copy of a U11 gene was isolated by inverted PCR. This gene contains the U11 RNA coding sequence and several sequence elements unique for the U RNA genes. These include a Distal Sequence Element (DSE, ATTTGCATA) present between positions −215 and −223 relative to the start of transcription; a Proximal Sequence Element (PSE, TTCACCTTTACCAAAAATG) located between positions −43 and −63 ; and a 3′box (GTTAGGCGAAATATTA) between positions +150 and +166. Transfection of HeLa cells with this gene revealed that it is functioning in vivo and can produce U11 RNA. PMID:1820214
Influence of gag and RRE Sequences on HIV-1 RNA Packaging Signal Structure and Function.
Kharytonchyk, Siarhei; Brown, Joshua D; Stilger, Krista; Yasin, Saif; Iyer, Aishwarya S; Collins, John; Summers, Michael F; Telesnitsky, Alice
2018-07-06
The packaging signal (Ψ) and Rev-responsive element (RRE) enable unspliced HIV-1 RNAs' export from the nucleus and packaging into virions. For some retroviruses, engrafting Ψ onto a heterologous RNA is sufficient to direct encapsidation. In contrast, HIV-1 RNA packaging requires 5' leader Ψ elements plus poorly defined additional features. We previously defined minimal 5' leader sequences competitive with intact Ψ for HIV-1 packaging, and here examined the potential roles of additional downstream elements. The findings confirmed that together, HIV-1 5' leader Ψ sequences plus a nuclear export element are sufficient to specify packaging. However, RNAs trafficked using a heterologous export element did not compete well with RNAs using HIV-1's RRE. Furthermore, some RNA additions to well-packaged minimal vectors rendered them packaging-defective. These defects were rescued by extending gag sequences in their native context. To understand these packaging defects' causes, in vitro dimerization properties of RNAs containing minimal packaging elements were compared to RNAs with sequence extensions that were or were not compatible with packaging. In vitro dimerization was found to correlate with packaging phenotypes, suggesting that HIV-1 evolved to prevent 5' leader residues' base pairing with downstream residues and misfolding of the packaging signal. Our findings explain why gag sequences have been implicated in packaging and show that RRE's packaging contributions appear more specific than nuclear export alone. Paired with recent work showing that sequences upstream of Ψ can dictate RNA folds, the current work explains how genetic context of minimal packaging elements contributes to HIV-1 RNA fate determination. Copyright © 2018 Elsevier Ltd. All rights reserved.
Kim, K H; Hemenway, C
1997-05-26
The putative subgenomic RNA (sgRNA) promoter regions upstream of the potato virus X (PVX) triple block and coat protein (CP) genes contain sequences common to other potexviruses. The importance of these sequences to PVX sgRNA accumulation was determined by inoculation of Nicotiana tabacum NT1 cell suspension protoplasts with transcripts derived from wild-type and modified PVX cDNA clones. Analyses of RNA accumulation by S1 nuclease digestion and primer extension indicated that a conserved octanucleotide sequence element and the spacing between this element and the start-site for sgRNA synthesis are critical for accumulation of the two major sgRNA species. The impact of mutations on CP sgRNA levels was also reflected in the accumulation of CP. In contrast, genomic minus- and plus-strand RNA accumulation were not significantly affected by mutations in these regions. Studies involving inoculation of tobacco plants with the modified transcripts suggested that the conserved octanucleotide element functions in sgRNA accumulation and some other aspect of the infection process.
Williams, Kelly P.
2003-01-01
A partial screen for genetic elements integrated into completely sequenced bacterial genomes shows more significant bias in specificity for the tmRNA gene (ssrA) than for any type of tRNA gene. Horizontal gene transfer, a major avenue of bacterial evolution, was assessed by focusing on elements using this single attachment locus. Diverse elements use ssrA; among enterobacteria alone, at least four different integrase subfamilies have independently evolved specificity for ssrA, and almost every strain analyzed presents a unique set of integrated elements. Even elements using essentially the same integrase can be very diverse, as is a group with an ssrA-specific integrase of the P4 subfamily. This same integrase appears to promote damage routinely at attachment sites, which may be adaptive. Elements in arrays can recombine; one such event mediated by invertible DNA segments within neighboring elements likely explains the monophasic nature of Salmonella enterica serovar Typhi. One of a limited set of conserved sequences occurs at the attachment site of each enterobacterial element, apparently serving as a transcriptional terminator for ssrA. Elements were usually found integrated into tRNA-like sequence at the 3′ end of ssrA, at subsites corresponding to those used in tRNA genes; an exception was found at the non-tRNA-like 3′ end produced by ssrA gene permutation in cyanobacteria, suggesting that, during the evolution of new site specificity by integrases, tropism toward a conserved 3′ end of an RNA gene may be as strong as toward a tRNA-like sequence. The proximity of ssrA and smpB, which act in concert, was also surveyed. PMID:12533482
The nonamer UUAUUUAUU is the key AU-rich sequence motif that mediates mRNA degradation.
Zubiaga, A M; Belasco, J G; Greenberg, M E
1995-01-01
Labile mRNAs that encode cytokine and immediate-early gene products often contain AU-rich sequences within their 3' untranslated region (UTR). These AU-rich sequences appear to be key determinants of the short half-lives of these mRNAs, although the sequence features of these elements and the mechanism by which they target mRNAs for rapid decay have not been fully defined. We have examined the features of AU-rich elements (AREs) that are crucial for their function as determinants of mRNA instability in mammalian cells by testing the ability of various mutant c-fos AREs and synthetic AREs to direct rapid mRNA deadenylation and decay when inserted within the 3' UTR of the normally stable beta-globin mRNA. Evidence is presented that the pentamer AUUUA, which previously was suggested to be the minimal determinant of instability present in mammalian AREs, cannot direct rapid mRNA deadenylation and decay. Instead, the nonomer UUAUUUAUU is the elemental AU-rich sequence motif that destabilizes mRNA. Removal of one uridine residue from either end of the nonamer (UUAUUUAU or UAUUUAUU) results in a decrease of potency of the element, while removal of a uridine residue from both ends of the nonamer (UAUUUAU) eliminates detectable destabilizing activity. The inclusion of an additional uridine residue at both ends of the nonamer (UUUAUUUAUUU) does not further increase the efficacy of the element. Taken together, these findings suggest that the nonamer UUAUUUAUU is the minimal AU-rich motif that effectively destabilizes mRNA. Additional ARE potency is achieved by combining multiple copies of this nonamer in a single mRNA 3' UTR. Furthermore, analysis of poly(A) shortening rates for ARE-containing mRNAs reveals that the UUAUUUAUU sequence also accelerates mRNA deadenylation and suggests that the UUAUUUAUU motif targets mRNA for rapid deadenylation as an early step in the mRNA decay process. PMID:7891716
An Adaptive Defect Weighted Sampling Algorithm to Design Pseudoknotted RNA Secondary Structures
Zandi, Kasra; Butler, Gregory; Kharma, Nawwaf
2016-01-01
Computational design of RNA sequences that fold into targeted secondary structures has many applications in biomedicine, nanotechnology and synthetic biology. An RNA molecule is made of different types of secondary structure elements and an important RNA element named pseudoknot plays a key role in stabilizing the functional form of the molecule. However, due to the computational complexities associated with characterizing pseudoknotted RNA structures, most of the existing RNA sequence designer algorithms generally ignore this important structural element and therefore limit their applications. In this paper we present a new algorithm to design RNA sequences for pseudoknotted secondary structures. We use NUPACK as the folding algorithm to compute the equilibrium characteristics of the pseudoknotted RNAs, and describe a new adaptive defect weighted sampling algorithm named Enzymer to design low ensemble defect RNA sequences for targeted secondary structures including pseudoknots. We used a biological data set of 201 pseudoknotted structures from the Pseudobase library to benchmark the performance of our algorithm. We compared the quality characteristics of the RNA sequences we designed by Enzymer with the results obtained from the state of the art MODENA and antaRNA. Our results show our method succeeds more frequently than MODENA and antaRNA do, and generates sequences that have lower ensemble defect, lower probability defect and higher thermostability. Finally by using Enzymer and by constraining the design to a naturally occurring and highly conserved Hammerhead motif, we designed 8 sequences for a pseudoknotted cis-acting Hammerhead ribozyme. Enzymer is available for download at https://bitbucket.org/casraz/enzymer. PMID:27499762
Fauzi, Hamid; Agyeman, Akwasi; Hines, Jennifer V.
2008-01-01
Many bacteria utilize riboswitch transcription regulation to monitor and appropriately respond to cellular levels of important metabolites or effector molecules. The T box transcription antitermination riboswitch responds to cognate uncharged tRNA by specifically stabilizing an antiterminator element in the 5′-untranslated mRNA leader region and precluding formation of a thermodynamically more stable terminator element. Stabilization occurs when the tRNA acceptor end base pairs with the first four nucleotides in the seven nucleotide bulge of the highly conserved antiterminator element. The significance of the conservation of the antiterminator bulge nucleotides that do not base pair with the tRNA is unknown, but they are required for optimal function. In vitro selection was used to determine if the isolated antiterminator bulge context alone dictates the mode in which the tRNA acceptor end binds the bulge nucleotides. No sequence conservation beyond complementarity was observed and the location was not constrained to the first four bases of the bulge. The results indicate that formation of a structure that recognizes the tRNA acceptor end in isolation is not the determinant driving force for the high phylogenetic sequence conservation observed within the antiterminator bulge. Additional factors or T box leader features more likely influenced the phylogenetic sequence conservation. PMID:19152843
Christensen, Shawn M; Ye, Junqiang; Eickbush, Thomas H
2006-11-21
Non-LTR retrotransposons insert into eukaryotic genomes by target-primed reverse transcription (TPRT), a process in which cleaved DNA targets are used to prime reverse transcription of the element's RNA transcript. Many of the steps in the integration pathway of these elements can be characterized in vitro for the R2 element because of the rigid sequence specificity of R2 for both its DNA target and its RNA template. R2 retrotransposition involves identical subunits of the R2 protein bound to different DNA sequences upstream and downstream of the insertion site. The key determinant regulating which DNA-binding conformation the protein adopts was found to be a 320-nt RNA sequence from near the 5' end of the R2 element. In the absence of this 5' RNA the R2 protein binds DNA sequences upstream of the insertion site, cleaves the first DNA strand, and conducts TPRT when RNA containing the 3' untranslated region of the R2 transcript is present. In the presence of the 320-nt 5' RNA, the R2 protein binds DNA sequences downstream of the insertion site. Cleavage of the second DNA strand by the downstream subunit does not appear to occur until after the 5' RNA is removed from this subunit. We postulate that the removal of the 5' RNA normally occurs during reverse transcription, and thus provides a critical temporal link to first- and second-strand DNA cleavage in the R2 retrotransposition reaction.
Salem, Nida’ M.; Miller, W. Allen; Rowhani, Adib; Golino, Deborah A.; Moyne, Anne-Laure; Falk, Bryce W.
2015-01-01
We determined the complete nucleotide sequence of the Rose spring dwarf-associated virus (RSDaV) genomic RNA (GenBank accession no. EU024678) and compared its predicted RNA structural characteristics affecting gene expression. A cDNA library was derived from RSDaV double-stranded RNAs (dsRNAs) purified from infected tissue. Nucleotide sequence analysis of the cloned cDNAs, plus for clones generated by 5′- and 3′-RACE showed the RSDaV genomic RNA to be 5,808 nucleotides. The genomic RNA contains five major open reading frames (ORFs), and three small ORFs in the 3′-terminal 800 nucleotides, typical for viruses of genus Luteovirus in the family Luteoviridae. Northern blot hybridization analysis revealed the genomic RNA and two prominent subgenomic RNAs of approximately 3 kb and 1 kb. Putative 5′ ends of the sgRNAs were predicted by identification of conserved sequences and secondary structures which resembled the Barley yellow dwarf virus (BYDV) genomic RNA 5′ end and subgenomic RNA promoter sequences. Secondary structures of the BYDV-like ribosomal frameshift elements and cap-independent translation elements, including long-distance base pairing spanning four kb were identified. These contain similarities but also informative differences with the BYDV structures, including a strikingly different structure predicted for the 3′ cap-independent translation element. These analyses of the RSDaV genomic RNA show more complexity for the RNA structural elements for members of the Luteoviridae. PMID:18329064
Salem, Nida' M; Miller, W Allen; Rowhani, Adib; Golino, Deborah A; Moyne, Anne-Laure; Falk, Bryce W
2008-06-05
We determined the complete nucleotide sequence of the Rose spring dwarf-associated virus (RSDaV) genomic RNA (GenBank accession no. EU024678) and compared its predicted RNA structural characteristics affecting gene expression. A cDNA library was derived from RSDaV double-stranded RNAs (dsRNAs) purified from infected tissue. Nucleotide sequence analysis of the cloned cDNAs, plus for clones generated by 5'- and 3'-RACE showed the RSDaV genomic RNA to be 5808 nucleotides. The genomic RNA contains five major open reading frames (ORFs), and three small ORFs in the 3'-terminal 800 nucleotides, typical for viruses of genus Luteovirus in the family Luteoviridae. Northern blot hybridization analysis revealed the genomic RNA and two prominent subgenomic RNAs of approximately 3 kb and 1 kb. Putative 5' ends of the sgRNAs were predicted by identification of conserved sequences and secondary structures which resembled the Barley yellow dwarf virus (BYDV) genomic RNA 5' end and subgenomic RNA promoter sequences. Secondary structures of the BYDV-like ribosomal frameshift elements and cap-independent translation elements, including long-distance base pairing spanning four kb were identified. These contain similarities but also informative differences with the BYDV structures, including a strikingly different structure predicted for the 3' cap-independent translation element. These analyses of the RSDaV genomic RNA show more complexity for the RNA structural elements for members of the Luteoviridae.
Whisson, Stephen C; Avrova, Anna O; Lavrova, Olga; Pritchard, Leighton
2005-04-01
The first known families of tRNA-related short interspersed elements (SINEs) in the oomycetes were identified by exploiting the genomic DNA sequence resources for the potato late blight pathogen, Phytophthora infestans. Fifteen families of tRNA-related SINEs, as well as predicted tRNAs, and other possible RNA polymerase III-transcribed sequences were identified. The size of individual elements ranges from 101 to 392 bp, representing sequences present from low (1) to highly abundant (over 2000) copy number in the P. infestans genome, based on quantitative PCR analysis. Putative short direct repeat sequences (6-14 bp) flanking the elements were also identified for eight of the SINEs. Predicted SINEs were named in a series prefixed infSINE (for infestans-SINE). Two SINEs were apparently present as multimers of tRNA-related units; four copies of a related unit for infSINEr, and two unrelated units for infSINEz. Two SINEs, infSINEh and infSINEi, were typically located within 400 bp of each other. These were also the only two elements identified as being actively transcribed in the mycelial stage of P. infestans by RT-PCR. It is possible that infSINEh and infSINEi represent active retrotransposons in P. infestans. Based on the quantitative PCR estimates of copy number for all of the elements identified, tRNA-related SINEs were estimated to comprise 0.3% of the 250 Mb P. infestans genome. InfSINE-related sequences were found to occur in species throughout the genus Phytophthora. However, seven elements were shown to be exclusive to P. infestans.
Cartault, François; Munier, Patrick; Benko, Edgar; Desguerre, Isabelle; Hanein, Sylvain; Boddaert, Nathalie; Bandiera, Simonetta; Vellayoudom, Jeanine; Krejbich-Trotot, Pascale; Bintner, Marc; Hoarau, Jean-Jacques; Girard, Muriel; Génin, Emmanuelle; de Lonlay, Pascale; Fourmaintraux, Alain; Naville, Magali; Rodriguez, Diana; Feingold, Josué; Renouil, Michel; Munnich, Arnold; Westhof, Eric; Fähling, Michael; Lyonnet, Stanislas; Henrion-Caude, Alexandra
2012-01-01
The human genome is densely populated with transposons and transposon-like repetitive elements. Although the impact of these transposons and elements on human genome evolution is recognized, the significance of subtle variations in their sequence remains mostly unexplored. Here we report homozygosity mapping of an infantile neurodegenerative disease locus in a genetic isolate. Complete DNA sequencing of the 400-kb linkage locus revealed a point mutation in a primate-specific retrotransposon that was transcribed as part of a unique noncoding RNA, which was expressed in the brain. In vitro knockdown of this RNA increased neuronal apoptosis, consistent with the inappropriate dosage of this RNA in vivo and with the phenotype. Moreover, structural analysis of the sequence revealed a small RNA-like hairpin that was consistent with the putative gain of a functional site when mutated. We show here that a mutation in a unique transposable element-containing RNA is associated with lethal encephalopathy, and we suggest that RNAs that harbor evolutionarily recent repetitive elements may play important roles in human brain development. PMID:22411793
Park, Mi-Ri; Kwon, Sun-Jung; Choi, Hong-Soo; Hemenway, Cynthia L; Kim, Kook-Hyung
2008-08-15
The repeated ACCA or AC-rich sequence and structural (SL1) elements in the 5' non-translated region (NTR) of the Potato virus X (PVX) RNA play vital roles in the PVX life cycle by controlling translation, RNA replication, movement, and assembly. It has already been shown that the repeated ACCA or AC-rich sequence affect both gRNA and sgRNA accumulation, while not affecting minus-strand RNA accumulation, and are also required for host protein binding. The functional significance of the repeated ACCA sequence elements in the 5' NTR region was investigated by analyzing the effects of deletion and site-directed mutations on PVX replication in Nicotiana benthamiana plants and NT1 protoplasts. Substitution (ACCA into AAAA or UUUU) mutations introduced in the first (nt 10-13) element in the 5' NTR of the PVX RNA significantly affected viral replication, while mutations introduced in the second (nt 17-20) and third (nt 20-23) elements did not. The fourth (nt 29-32) ACCA element weakly affected virus replication, whereas mutations in the fifth (nt 38-41) significantly reduced virus replication due to the structure disruption of SL1 by AAAA and/or UUUU substitutions. Further characterization of the first ACCA element indicated that duplication of ACCA at nt 10-13 (nt 10-17, ACCAACCA) caused severe symptom development as compared to that of wild type, while deletion of the single element (nt 10-13), DeltaACCA) or tripling of this element caused reduced symptom development. Single- and double-nucleotide substitutions introduced into the first ACCA element revealed the importance of CC located at nt positions 11 and 12. Altogether, these results indicate that the first ACCA element is important for PVX replication.
Identification of a Recently Active Mammalian SINE Derived from Ribosomal RNA
Longo, Mark S.; Brown, Judy D.; Zhang, Chu; O’Neill, Michael J.; O’Neill, Rachel J.
2015-01-01
Complex eukaryotic genomes are riddled with repeated sequences whose derivation does not coincide with phylogenetic history and thus is often unknown. Among such sequences, the capacity for transcriptional activity coupled with the adaptive use of reverse transcription can lead to a diverse group of genomic elements across taxa, otherwise known as selfish elements or mobile elements. Short interspersed nuclear elements (SINEs) are nonautonomous mobile elements found in eukaryotic genomes, typically derived from cellular RNAs such as tRNAs, 7SL or 5S rRNA. Here, we identify and characterize a previously unknown SINE derived from the 3′-end of the large ribosomal subunit (LSU or 28S rDNA) and transcribed via RNA polymerase III. This new element, SINE28, is represented in low-copy numbers in the human reference genome assembly, wherein we have identified 27 discrete loci. Phylogenetic analysis indicates these elements have been transpositionally active within primate lineages as recently as 6 MYA while modern humans still carry transcriptionally active copies. Moreover, we have identified SINE28s in all currently available assembled mammalian genome sequences. Phylogenetic comparisons indicate that these elements are frequently rederived from the highly conserved LSU rRNA sequences in a lineage-specific manner. We propose that this element has not been previously recognized as a SINE given its high identity to the canonical LSU, and that SINE28 likely represents one of possibly many unidentified, active transposable elements within mammalian genomes. PMID:25637222
Sadofsky, M; Connelly, S; Manley, J L; Alwine, J C
1985-01-01
Our previous studies of the 3'-end processing of simian virus 40 late mRNAs indicated the existence of an essential element (or elements) downstream of the AAUAAA signal. We report here the use of transient expression analysis to study a functional element which we located within the sequence AGGUUUUUU, beginning 59 nucleotides downstream of the recognized signal AAUAAA. Deletion of this element resulted in (i) at least a 75% drop in 3'-end processing at the normal site and (ii) appearance of readthrough transcripts with alternate 3' ends. Some flexibility in the downstream position of this element relative to the AAUAAA was noted by deletion analysis. Using computer sequence comparison, we located homologous regions within downstream sequences of other genes, suggesting a generalized sequence element. In addition, specific complementarity is noted between the downstream element and U4 RNA. The possibility that this complementarity could participate in 3'-end site selection is discussed. Images PMID:3016512
Sequence variation between 462 human individuals fine-tunes functional sites of RNA processing
NASA Astrophysics Data System (ADS)
Ferreira, Pedro G.; Oti, Martin; Barann, Matthias; Wieland, Thomas; Ezquina, Suzana; Friedländer, Marc R.; Rivas, Manuel A.; Esteve-Codina, Anna; Estivill, Xavier; Guigó, Roderic; Dermitzakis, Emmanouil; Antonarakis, Stylianos; Meitinger, Thomas; Strom, Tim M.; Palotie, Aarno; François Deleuze, Jean; Sudbrak, Ralf; Lerach, Hans; Gut, Ivo; Syvänen, Ann-Christine; Gyllensten, Ulf; Schreiber, Stefan; Rosenstiel, Philip; Brunner, Han; Veltman, Joris; Hoen, Peter A. C. T.; Jan van Ommen, Gert; Carracedo, Angel; Brazma, Alvis; Flicek, Paul; Cambon-Thomsen, Anne; Mangion, Jonathan; Bentley, David; Hamosh, Ada; Rosenstiel, Philip; Strom, Tim M.; Lappalainen, Tuuli; Guigó, Roderic; Sammeth, Michael
2016-09-01
Recent advances in the cost-efficiency of sequencing technologies enabled the combined DNA- and RNA-sequencing of human individuals at the population-scale, making genome-wide investigations of the inter-individual genetic impact on gene expression viable. Employing mRNA-sequencing data from the Geuvadis Project and genome sequencing data from the 1000 Genomes Project we show that the computational analysis of DNA sequences around splice sites and poly-A signals is able to explain several observations in the phenotype data. In contrast to widespread assessments of statistically significant associations between DNA polymorphisms and quantitative traits, we developed a computational tool to pinpoint the molecular mechanisms by which genetic markers drive variation in RNA-processing, cataloguing and classifying alleles that change the affinity of core RNA elements to their recognizing factors. The in silico models we employ further suggest RNA editing can moonlight as a splicing-modulator, albeit less frequently than genomic sequence diversity. Beyond existing annotations, we demonstrate that the ultra-high resolution of RNA-Seq combined from 462 individuals also provides evidence for thousands of bona fide novel elements of RNA processing—alternative splice sites, introns, and cleavage sites—which are often rare and lowly expressed but in other characteristics similar to their annotated counterparts.
Continuous in vitro evolution of bacteriophage RNA polymerase promoters
NASA Technical Reports Server (NTRS)
Breaker, R. R.; Banerji, A.; Joyce, G. F.
1994-01-01
Rapid in vitro evolution of bacteriophage T7, T3, and SP6 RNA polymerase promoters was achieved by a method that allows continuous enrichment of DNAs that contain functional promoter elements. This method exploits the ability of a special class of nucleic acid molecules to replicate continuously in the presence of both a reverse transcriptase and a DNA-dependent RNA polymerase. Replication involves the synthesis of both RNA and cDNA intermediates. The cDNA strand contains an embedded promoter sequence, which becomes converted to a functional double-stranded promoter element, leading to the production of RNA transcripts. Synthetic cDNAs, including those that contain randomized promoter sequences, can be used to initiate the amplification cycle. However, only those cDNAs that contain functional promoter sequences are able to produce RNA transcripts. Furthermore, each RNA transcript encodes the RNA polymerase promoter sequence that was responsible for initiation of its own transcription. Thus, the population of amplifying molecules quickly becomes enriched for those templates that encode functional promoters. Optimal promoter sequences for phage T7, T3, and SP6 RNA polymerase were identified after a 2-h amplification reaction, initiated in each case with a pool of synthetic cDNAs encoding greater than 10(10) promoter sequence variants.
Lim, Chun Shen; Brown, Chris M
2017-01-01
Structured RNA elements may control virus replication, transcription and translation, and their distinct features are being exploited by novel antiviral strategies. Viral RNA elements continue to be discovered using combinations of experimental and computational analyses. However, the wealth of sequence data, notably from deep viral RNA sequencing, viromes, and metagenomes, necessitates computational approaches being used as an essential discovery tool. In this review, we describe practical approaches being used to discover functional RNA elements in viral genomes. In addition to success stories in new and emerging viruses, these approaches have revealed some surprising new features of well-studied viruses e.g., human immunodeficiency virus, hepatitis C virus, influenza, and dengue viruses. Some notable discoveries were facilitated by new comparative analyses of diverse viral genome alignments. Importantly, comparative approaches for finding RNA elements embedded in coding and non-coding regions differ. With the exponential growth of computer power we have progressed from stem-loop prediction on single sequences to cutting edge 3D prediction, and from command line to user friendly web interfaces. Despite these advances, many powerful, user friendly prediction tools and resources are underutilized by the virology community.
Lim, Chun Shen; Brown, Chris M.
2018-01-01
Structured RNA elements may control virus replication, transcription and translation, and their distinct features are being exploited by novel antiviral strategies. Viral RNA elements continue to be discovered using combinations of experimental and computational analyses. However, the wealth of sequence data, notably from deep viral RNA sequencing, viromes, and metagenomes, necessitates computational approaches being used as an essential discovery tool. In this review, we describe practical approaches being used to discover functional RNA elements in viral genomes. In addition to success stories in new and emerging viruses, these approaches have revealed some surprising new features of well-studied viruses e.g., human immunodeficiency virus, hepatitis C virus, influenza, and dengue viruses. Some notable discoveries were facilitated by new comparative analyses of diverse viral genome alignments. Importantly, comparative approaches for finding RNA elements embedded in coding and non-coding regions differ. With the exponential growth of computer power we have progressed from stem-loop prediction on single sequences to cutting edge 3D prediction, and from command line to user friendly web interfaces. Despite these advances, many powerful, user friendly prediction tools and resources are underutilized by the virology community. PMID:29354101
Robinett, C C; O'Connor, A; Dunaway, M
1997-01-01
We have identified a novel activity for the region of the intergenic spacer of the Xenopus laevis rRNA genes that contains the 35- and 100-bp repeats. We devised a new assay for this region by constructing DNA plasmids containing a tandem repeat of rRNA reporter genes that were separated by the 35- and 100-bp repeat region and a rRNA gene enhancer. When the 35- and 100-bp repeat region is present in its normal position and orientation at the 3' end of the rRNA reporter genes, the enhancer activates the adjacent downstream promoter but not the upstream rRNA promoter on the same plasmid. Because this element can restrict the range of an enhancer's activity in the context of tandem genes, we have named it the repeat organizer (RO). The ability to restrict enhancer action is a feature of insulator elements, but unlike previously described insulator elements the RO does not block enhancer action in a simple enhancer-blocking assay. Instead, the activity of the RO requires that it be in its normal position and orientation with respect to the other sequence elements of the rRNA genes. The enhancer-binding transcription factor xUBF also binds to the repetitive sequences of the RO in vitro, but these sequences do not activate transcription in vivo. We propose that the RO is a specialized insulator element that organizes the tandem array of rRNA genes into single-gene expression units by promoting activation of a promoter by its proximal enhancers. PMID:9111359
Definition of RNA Polymerase II CoTC Terminator Elements in the Human Genome
Nojima, Takayuki; Dienstbier, Martin; Murphy, Shona; Proudfoot, Nicholas J.; Dye, Michael J.
2013-01-01
Summary Mammalian RNA polymerase II (Pol II) transcription termination is an essential step in protein-coding gene expression that is mediated by pre-mRNA processing activities and DNA-encoded terminator elements. Although much is known about the role of pre-mRNA processing in termination, our understanding of the characteristics and generality of terminator elements is limited. Whereas promoter databases list up to 40,000 known and potential Pol II promoter sequences, fewer than ten Pol II terminator sequences have been described. Using our knowledge of the human β-globin terminator mechanism, we have developed a selection strategy for mapping mammalian Pol II terminator elements. We report the identification of 78 cotranscriptional cleavage (CoTC)-type terminator elements at endogenous gene loci. The results of this analysis pave the way for the full understanding of Pol II termination pathways and their roles in gene expression. PMID:23562152
How Messenger RNA and Nascent Chain Sequences Regulate Translation Elongation.
Choi, Junhong; Grosely, Rosslyn; Prabhakar, Arjun; Lapointe, Christopher P; Wang, Jinfan; Puglisi, Joseph D
2018-06-20
Translation elongation is a highly coordinated, multistep, multifactor process that ensures accurate and efficient addition of amino acids to a growing nascent-peptide chain encoded in the sequence of translated messenger RNA (mRNA). Although translation elongation is heavily regulated by external factors, there is clear evidence that mRNA and nascent-peptide sequences control elongation dynamics, determining both the sequence and structure of synthesized proteins. Advances in methods have driven experiments that revealed the basic mechanisms of elongation as well as the mechanisms of regulation by mRNA and nascent-peptide sequences. In this review, we highlight how mRNA and nascent-peptide elements manipulate the translation machinery to alter the dynamics and pathway of elongation.
Transcription factor trapping by RNA in gene regulatory elements.
Sigova, Alla A; Abraham, Brian J; Ji, Xiong; Molinie, Benoit; Hannett, Nancy M; Guo, Yang Eric; Jangi, Mohini; Giallourakis, Cosmas C; Sharp, Phillip A; Young, Richard A
2015-11-20
Transcription factors (TFs) bind specific sequences in promoter-proximal and -distal DNA elements to regulate gene transcription. RNA is transcribed from both of these DNA elements, and some DNA binding TFs bind RNA. Hence, RNA transcribed from regulatory elements may contribute to stable TF occupancy at these sites. We show that the ubiquitously expressed TF Yin-Yang 1 (YY1) binds to both gene regulatory elements and their associated RNA species across the entire genome. Reduced transcription of regulatory elements diminishes YY1 occupancy, whereas artificial tethering of RNA enhances YY1 occupancy at these elements. We propose that RNA makes a modest but important contribution to the maintenance of certain TFs at gene regulatory elements and suggest that transcription of regulatory elements produces a positive-feedback loop that contributes to the stability of gene expression programs. Copyright © 2015, American Association for the Advancement of Science.
Probing Xist RNA Structure in Cells Using Targeted Structure-Seq
Rutenberg-Schoenberg, Michael; Simon, Matthew D.
2015-01-01
The long non-coding RNA (lncRNA) Xist is a master regulator of X-chromosome inactivation in mammalian cells. Models for how Xist and other lncRNAs function depend on thermodynamically stable secondary and higher-order structures that RNAs can form in the context of a cell. Probing accessible RNA bases can provide data to build models of RNA conformation that provide insight into RNA function, molecular evolution, and modularity. To study the structure of Xist in cells, we built upon recent advances in RNA secondary structure mapping and modeling to develop Targeted Structure-Seq, which combines chemical probing of RNA structure in cells with target-specific massively parallel sequencing. By enriching for signals from the RNA of interest, Targeted Structure-Seq achieves high coverage of the target RNA with relatively few sequencing reads, thus providing a targeted and scalable approach to analyze RNA conformation in cells. We use this approach to probe the full-length Xist lncRNA to develop new models for functional elements within Xist, including the repeat A element in the 5’-end of Xist. This analysis also identified new structural elements in Xist that are evolutionarily conserved, including a new element proximal to the C repeats that is important for Xist function. PMID:26646615
Mathews, D H; Banerjee, A R; Luan, D D; Eickbush, T H; Turner, D H
1997-01-01
RNA transcripts corresponding to the 250-nt 3' untranslated region of the R2 non-LTR retrotransposable element are recognized by the R2 reverse transcriptase and are sufficient to serve as templates in the target DNA-primed reverse transcription (TPRT) reaction. The R2 protein encoded by the Bombyx mori R2 can recognize this region from both the B. mori and Drosophila melanogaster R2 elements even though these regions show little nucleotide sequence identity. A model for the RNA secondary structure of the 3' untranslated region of the D. melanogaster R2 retrotransposon was developed by sequence comparison of 10 species aided by free energy minimization. Chemical modification experiments are consistent with this prediction. A secondary structure model for the 3' untranslated region of R2 RNA from the R2 element from B. mori was obtained by a combination of chemical modification data and free energy minimization. These two secondary structure models, found independently, share several common sites. This study shows the utility of combining free energy minimization, sequence comparison, and chemical modification to model an RNA secondary structure. PMID:8990394
Human Fip1 is a subunit of CPSF that binds to U-rich RNA elements and stimulates poly(A) polymerase.
Kaufmann, Isabelle; Martin, Georges; Friedlein, Arno; Langen, Hanno; Keller, Walter
2004-02-11
In mammals, polyadenylation of mRNA precursors (pre-mRNAs) by poly(A) polymerase (PAP) depends on cleavage and polyadenylation specificity factor (CPSF). CPSF is a multisubunit complex that binds to the canonical AAUAAA hexamer and to U-rich upstream sequence elements on the pre-mRNA, thereby stimulating the otherwise weakly active and nonspecific polymerase to elongate efficiently RNAs containing a poly(A) signal. Based on sequence similarity to the Saccharomyces cerevisiae polyadenylation factor Fip1p, we have identified human Fip1 (hFip1) and found that the protein is an integral subunit of CPSF. hFip1 interacts with PAP and has an arginine-rich RNA-binding motif that preferentially binds to U-rich sequence elements on the pre-mRNA. Recombinant hFip1 is sufficient to stimulate the in vitro polyadenylation activity of PAP in a U-rich element-dependent manner. hFip1, CPSF160 and PAP form a ternary complex in vitro, suggesting that hFip1 and CPSF160 act together in poly(A) site recognition and in cooperative recruitment of PAP to the RNA. These results show that hFip1 significantly contributes to CPSF-mediated stimulation of PAP activity.
Human Fip1 is a subunit of CPSF that binds to U-rich RNA elements and stimulates poly(A) polymerase
Kaufmann, Isabelle; Martin, Georges; Friedlein, Arno; Langen, Hanno; Keller, Walter
2004-01-01
In mammals, polyadenylation of mRNA precursors (pre-mRNAs) by poly(A) polymerase (PAP) depends on cleavage and polyadenylation specificity factor (CPSF). CPSF is a multisubunit complex that binds to the canonical AAUAAA hexamer and to U-rich upstream sequence elements on the pre-mRNA, thereby stimulating the otherwise weakly active and nonspecific polymerase to elongate efficiently RNAs containing a poly(A) signal. Based on sequence similarity to the Saccharomyces cerevisiae polyadenylation factor Fip1p, we have identified human Fip1 (hFip1) and found that the protein is an integral subunit of CPSF. hFip1 interacts with PAP and has an arginine-rich RNA-binding motif that preferentially binds to U-rich sequence elements on the pre-mRNA. Recombinant hFip1 is sufficient to stimulate the in vitro polyadenylation activity of PAP in a U-rich element-dependent manner. hFip1, CPSF160 and PAP form a ternary complex in vitro, suggesting that hFip1 and CPSF160 act together in poly(A) site recognition and in cooperative recruitment of PAP to the RNA. These results show that hFip1 significantly contributes to CPSF-mediated stimulation of PAP activity. PMID:14749727
Transterm: a database to aid the analysis of regulatory sequences in mRNAs
Jacobs, Grant H.; Chen, Augustine; Stevens, Stewart G.; Stockwell, Peter A.; Black, Michael A.; Tate, Warren P.; Brown, Chris M.
2009-01-01
Messenger RNAs, in addition to coding for proteins, may contain regulatory elements that affect how the protein is translated. These include protein and microRNA-binding sites. Transterm (http://mRNA.otago.ac.nz/Transterm.html) is a database of regions and elements that affect translation with two major unique components. The first is integrated results of analysis of general features that affect translation (initiation, elongation, termination) for species or strains in Genbank, processed through a standard pipeline. The second is curated descriptions of experimentally determined regulatory elements that function as translational control elements in mRNAs. Transterm focuses on protein binding sites, particularly those in 3′-untranslated regions (3′-UTR). For this release the interface has been extensively updated based on user feedback. The data is now accessible by strain rather than species, for example there are 10 Escherichia coli strains (genomes) analysed separately. In addition to providing a repository of data, the database also provides tools for users to query their own mRNA sequences. Users can search sequences for Transterm or user defined regulatory elements, including protein or miRNA targets. Transterm also provides a central core of links to related resources for complementary analyses. PMID:18984623
Mandl, C W; Holzmann, H; Kunz, C; Heinz, F X
1993-05-01
The complete nucleotide sequence of the positive-stranded RNA genome of the tick-borne flavivirus Powassan (10,839 nucleotides) was elucidated and the amino acid sequence of all viral proteins was derived. Based on this sequence as well as serological data, Powassan virus represents the most divergent member of the tick-borne serocomplex within the genus flaviviruses, family Flaviviridae. The primary nucleotide sequence and potential RNA secondary structures of the Powassan virus genome as well as the protein sequences and the reactivities of the virion with a panel of monoclonal antibodies were compared to other tick-borne and mosquito-borne flaviviruses. These analyses corroborated significant differences between tick-borne and mosquito-borne flaviviruses, but also emphasized structural elements that are conserved among both vector groups. The comparisons among tick-borne flaviviruses revealed conserved sequence elements that might represent important determinants of the tick-borne flavivirus phenotype.
Liu, Zhong-Yu; Li, Xiao-Feng; Jiang, Tao; Deng, Yong-Qiang; Zhao, Hui; Wang, Hong-Jiang; Ye, Qing; Zhu, Shun-Ya; Qiu, Yang; Zhou, Xi; Qin, E-De; Qin, Cheng-Feng
2013-06-01
cis-Acting elements in the viral genome RNA (vRNA) are essential for the translation, replication, and/or encapsidation of RNA viruses. In this study, a novel conserved cis-acting element was identified in the capsid-coding region of mosquito-borne flavivirus. The downstream of 5' cyclization sequence (5'CS) pseudoknot (DCS-PK) element has a three-stem pseudoknot structure, as demonstrated by structure prediction and biochemical analysis. Using dengue virus as a model, we show that DCS-PK enhances vRNA replication and that its function depends on its secondary structure and specific primary sequence. Mutagenesis revealed that the highly conserved stem 1 and loop 2, which are involved in potential loop-helix interactions, are crucial for DCS-PK function. A predicted loop 1-stem 3 base triple interaction is important for the structural stability and function of DCS-PK. Moreover, the function of DCS-PK depends on its position relative to the 5'CS, and the presence of DCS-PK facilitates the formation of 5'-3' RNA complexes. Taken together, our results reveal that the cis-acting element DCS-PK enhances vRNA replication by regulating genome cyclization, and DCS-PK might interplay with other cis-acting elements to form a functional vRNA cyclization domain, thus playing critical roles during the flavivirus life cycle and evolution.
Liu, Zhong-Yu; Li, Xiao-Feng; Jiang, Tao; Deng, Yong-Qiang; Zhao, Hui; Wang, Hong-Jiang; Ye, Qing; Zhu, Shun-Ya; Qiu, Yang; Zhou, Xi; Qin, E-De
2013-01-01
cis-Acting elements in the viral genome RNA (vRNA) are essential for the translation, replication, and/or encapsidation of RNA viruses. In this study, a novel conserved cis-acting element was identified in the capsid-coding region of mosquito-borne flavivirus. The downstream of 5′ cyclization sequence (5′CS) pseudoknot (DCS-PK) element has a three-stem pseudoknot structure, as demonstrated by structure prediction and biochemical analysis. Using dengue virus as a model, we show that DCS-PK enhances vRNA replication and that its function depends on its secondary structure and specific primary sequence. Mutagenesis revealed that the highly conserved stem 1 and loop 2, which are involved in potential loop-helix interactions, are crucial for DCS-PK function. A predicted loop 1-stem 3 base triple interaction is important for the structural stability and function of DCS-PK. Moreover, the function of DCS-PK depends on its position relative to the 5′CS, and the presence of DCS-PK facilitates the formation of 5′-3′ RNA complexes. Taken together, our results reveal that the cis-acting element DCS-PK enhances vRNA replication by regulating genome cyclization, and DCS-PK might interplay with other cis-acting elements to form a functional vRNA cyclization domain, thus playing critical roles during the flavivirus life cycle and evolution. PMID:23576500
2018-01-01
FAM230C, a long intergenic non-coding RNA (lincRNA) gene in human chromosome 13 (chr13) is a member of lincRNA genes termed family with sequence similarity 230. An analysis using bioinformatics search tools and alignment programs was undertaken to determine properties of FAM230C and its related genes. Results reveal that the DNA translocation element, the Translocation Breakpoint Type A (TBTA) sequence, which consists of satellite DNA, Alu elements, and AT-rich sequences is embedded in the FAM230C gene. Eight lincRNA genes related to FAM230C also carry the TBTA sequences. These genes were formed from a large segment of the 3’ half of the FAM230C sequence duplicated in chr22, and are specifically in regions of low copy repeats (LCR22)s, in or close to the 22q.11.2 region. 22q11.2 is a chromosomal segment that undergoes a high rate of DNA translocation and is prone to genetic deletions. FAM230C-related genes present in other chromosomes do not carry the TBTA motif and were formed from the 5’ half region of the FAM230C sequence. These findings identify a high specificity in lincRNA gene formation by gene sequence duplication in different chromosomes. PMID:29668722
Zebrafish U6 small nuclear RNA gene promoters contain a SPH element in an unusual location.
Halbig, Kari M; Lekven, Arne C; Kunkel, Gary R
2008-09-15
Promoters for vertebrate small nuclear RNA (snRNA) genes contain a relatively simple array of transcriptional control elements, divided into proximal and distal regions. Most of these genes are transcribed by RNA polymerase II (e.g., U1, U2), whereas the U6 gene is transcribed by RNA polymerase III. Previously identified vertebrate U6 snRNA gene promoters consist of a proximal sequence element (PSE) and TATA element in the proximal region, plus a distal region with octamer (OCT) and SphI postoctamer homology (SPH) elements. We have found that zebrafish U6 snRNA promoters contain the SPH element in a novel proximal position immediately upstream of the TATA element. The zebrafish SPH element is recognized by SPH-binding factor/selenocysteine tRNA gene transcription activating factor/zinc finger protein 143 (SBF/Staf/ZNF143) in vitro. Furthermore, a zebrafish U6 promoter with a defective SPH element is inefficiently transcribed when injected into embryos.
Circular RNA expression in basal cell carcinoma.
Sand, Michael; Bechara, Falk G; Sand, Daniel; Gambichler, Thilo; Hahn, Stephan A; Bromba, Michael; Stockfleth, Eggert; Hessam, Schapoor
2016-05-01
Circular RNAs (circRNAs), are nonprotein coding RNAs consisting of a circular loop with multiple miRNA, binding sites called miRNA response elements (MREs), functioning as miRNA sponges. This study was performed to identify differentially expressed circRNAs and their MREs in basal cell carcinoma (BCC). Microarray circRNA expression profiles were acquired from BCC and control followed by qRT-PCR validation. Bioinformatical target prediction revealed multiple MREs. Sequence analysis was performed concerning MRE interaction potential with the BCC miRNome. We identified 23 upregulated and 48 downregulated circRNAs with 354 miRNA response elements capable of sequestering miRNA target sequences of the BCC miRNome. The present study describes a variety of circRNAs that are potentially involved in the molecular pathogenesis of BCC.
Gillespie, J J; Johnston, J S; Cannone, J J; Gutell, R R
2006-01-01
As an accompanying manuscript to the release of the honey bee genome, we report the entire sequence of the nuclear (18S, 5.8S, 28S and 5S) and mitochondrial (12S and 16S) ribosomal RNA (rRNA)-encoding gene sequences (rDNA) and related internally and externally transcribed spacer regions of Apis mellifera (Insecta: Hymenoptera: Apocrita). Additionally, we predict secondary structures for the mature rRNA molecules based on comparative sequence analyses with other arthropod taxa and reference to recently published crystal structures of the ribosome. In general, the structures of honey bee rRNAs are in agreement with previously predicted rRNA models from other arthropods in core regions of the rRNA, with little additional expansion in non-conserved regions. Our multiple sequence alignments are made available on several public databases and provide a preliminary establishment of a global structural model of all rRNAs from the insects. Additionally, we provide conserved stretches of sequences flanking the rDNA cistrons that comprise the externally transcribed spacer regions (ETS) and part of the intergenic spacer region (IGS), including several repetitive motifs. Finally, we report the occurrence of retrotransposition in the nuclear large subunit rDNA, as R2 elements are present in the usual insertion points found in other arthropods. Interestingly, functional R1 elements usually present in the genomes of insects were not detected in the honey bee rRNA genes. The reverse transcriptase products of the R2 elements are deduced from their putative open reading frames and structurally aligned with those from another hymenopteran insect, the jewel wasp Nasonia (Pteromalidae). Stretches of conserved amino acids shared between Apis and Nasonia are illustrated and serve as potential sites for primer design, as target amplicons within these R2 elements may serve as novel phylogenetic markers for Hymenoptera. Given the impending completion of the sequencing of the Nasonia genome, we expect our report eventually to shed light on the evolution of the hymenopteran genome within higher insects, particularly regarding the relative maintenance of conserved rDNA genes, related variable spacer regions and retrotransposable elements. PMID:17069639
RNA motif search with data-driven element ordering.
Rampášek, Ladislav; Jimenez, Randi M; Lupták, Andrej; Vinař, Tomáš; Brejová, Broňa
2016-05-18
In this paper, we study the problem of RNA motif search in long genomic sequences. This approach uses a combination of sequence and structure constraints to uncover new distant homologs of known functional RNAs. The problem is NP-hard and is traditionally solved by backtracking algorithms. We have designed a new algorithm for RNA motif search and implemented a new motif search tool RNArobo. The tool enhances the RNAbob descriptor language, allowing insertions in helices, which enables better characterization of ribozymes and aptamers. A typical RNA motif consists of multiple elements and the running time of the algorithm is highly dependent on their ordering. By approaching the element ordering problem in a principled way, we demonstrate more than 100-fold speedup of the search for complex motifs compared to previously published tools. We have developed a new method for RNA motif search that allows for a significant speedup of the search of complex motifs that include pseudoknots. Such speed improvements are crucial at a time when the rate of DNA sequencing outpaces growth in computing. RNArobo is available at http://compbio.fmph.uniba.sk/rnarobo .
CORE-SINEs: eukaryotic short interspersed retroposing elements with common sequence motifs.
Gilbert, N; Labuda, D
1999-03-16
A 65-bp "core" sequence is dispersed in hundreds of thousands copies in the human genome. This sequence was found to constitute the central segment of a group of short interspersed elements (SINEs), referred to as mammalian-wide interspersed repeats, that proliferated before the radiation of placental mammals. Here, we propose that the core identifies an ancient tRNA-like SINE element, which survived in different lineages such as mammals, reptiles, birds, and fish, as well as mollusks, presumably for >550 million years. This element gave rise to a number of sequence families (CORE-SINEs), including mammalian-wide interspersed repeats, whose distinct 3' ends are shared with different families of long interspersed elements (LINEs). The evolutionary success of the generic CORE-SINE element can be related to the recruitment of the internal promoter from highly transcribed host RNA as well as to its capacity to adapt to changing retropositional opportunities by sequence exchange with actively amplifying LINEs. It reinforces the notion that the very existence of SINEs depends on the cohabitation with both LINEs and the host genome.
CORE-SINEs: Eukaryotic short interspersed retroposing elements with common sequence motifs
Gilbert, Nicolas; Labuda, Damian
1999-01-01
A 65-bp “core” sequence is dispersed in hundreds of thousands copies in the human genome. This sequence was found to constitute the central segment of a group of short interspersed elements (SINEs), referred to as mammalian-wide interspersed repeats, that proliferated before the radiation of placental mammals. Here, we propose that the core identifies an ancient tRNA-like SINE element, which survived in different lineages such as mammals, reptiles, birds, and fish, as well as mollusks, presumably for >550 million years. This element gave rise to a number of sequence families (CORE-SINEs), including mammalian-wide interspersed repeats, whose distinct 3′ ends are shared with different families of long interspersed elements (LINEs). The evolutionary success of the generic CORE-SINE element can be related to the recruitment of the internal promoter from highly transcribed host RNA as well as to its capacity to adapt to changing retropositional opportunities by sequence exchange with actively amplifying LINEs. It reinforces the notion that the very existence of SINEs depends on the cohabitation with both LINEs and the host genome. PMID:10077603
Definition of RNA polymerase II CoTC terminator elements in the human genome.
Nojima, Takayuki; Dienstbier, Martin; Murphy, Shona; Proudfoot, Nicholas J; Dye, Michael J
2013-04-25
Mammalian RNA polymerase II (Pol II) transcription termination is an essential step in protein-coding gene expression that is mediated by pre-mRNA processing activities and DNA-encoded terminator elements. Although much is known about the role of pre-mRNA processing in termination, our understanding of the characteristics and generality of terminator elements is limited. Whereas promoter databases list up to 40,000 known and potential Pol II promoter sequences, fewer than ten Pol II terminator sequences have been described. Using our knowledge of the human β-globin terminator mechanism, we have developed a selection strategy for mapping mammalian Pol II terminator elements. We report the identification of 78 cotranscriptional cleavage (CoTC)-type terminator elements at endogenous gene loci. The results of this analysis pave the way for the full understanding of Pol II termination pathways and their roles in gene expression. Copyright © 2013 The Authors. Published by Elsevier Inc. All rights reserved.
Yeh, Po-Yuan; Wu, Hung-Yi
2014-07-30
It has been demonstrated that, in addition to genomic RNA, sgmRNA is able to serve as a template for the synthesis of the negative-strand [(-)-strand] complement. However, the cis-acting elements on the positive-strand [(+)-strand] sgmRNA required for (-)-strand sgmRNA synthesis have not yet been systematically identified. In this study, we employed real-time quantitative reverse transcription polymerase chain reaction to analyze the cis-acting elements on bovine coronavirus (BCoV) sgmRNA 7 required for the synthesis of its (-)-strand counterpart by deletion mutagenesis. The major findings are as follows. (1) Deletion of the 5'-terminal leader sequence on sgmRNA 7 decreased the synthesis of the (-)-strand sgmRNA complement. (2) Deletions of the 3' untranslated region (UTR) bulged stem-loop showed no effect on (-)-strand sgmRNA synthesis; however, deletion of the 3' UTR pseudoknot decreased the yield of (-)-strand sgmRNA. (3) Nucleotides positioned from -15 to -34 of the sgmRNA 7 3'-terminal region are required for efficient (-)-strand sgmRNA synthesis. (4) Nucleotide species at the 3'-most position (-1) of sgmRNA 7 is correlated to the efficiency of (-)-strand sgmRNA synthesis. These results together suggest, in principle, that the 5'- and 3'-terminal sequences on sgmRNA 7 harbor cis-acting elements are critical for efficient (-)-strand sgmRNA synthesis in BCoV.
2009-01-01
Background Tardigrades represent an animal phylum with extraordinary resistance to environmental stress. Results To gain insights into their stress-specific adaptation potential, major clusters of related and similar proteins are identified, as well as specific functional clusters delineated comparing all tardigrades and individual species (Milnesium tardigradum, Hypsibius dujardini, Echiniscus testudo, Tulinus stephaniae, Richtersius coronifer) and functional elements in tardigrade mRNAs are analysed. We find that 39.3% of the total sequences clustered in 58 clusters of more than 20 proteins. Among these are ten tardigrade specific as well as a number of stress-specific protein clusters. Tardigrade-specific functional adaptations include strong protein, DNA- and redox protection, maintenance and protein recycling. Specific regulatory elements regulate tardigrade mRNA stability such as lox P DICE elements whereas 14 other RNA elements of higher eukaryotes are not found. Further features of tardigrade specific adaption are rapidly identified by sequence and/or pattern search on the web-tool tardigrade analyzer http://waterbear.bioapps.biozentrum.uni-wuerzburg.de. The work-bench offers nucleotide pattern analysis for promotor and regulatory element detection (tardigrade specific; nrdb) as well as rapid COG search for function assignments including species-specific repositories of all analysed data. Conclusion Different protein clusters and regulatory elements implicated in tardigrade stress adaptations are analysed including unpublished tardigrade sequences. PMID:19821996
Förster, Frank; Liang, Chunguang; Shkumatov, Alexander; Beisser, Daniela; Engelmann, Julia C; Schnölzer, Martina; Frohme, Marcus; Müller, Tobias; Schill, Ralph O; Dandekar, Thomas
2009-10-12
Tardigrades represent an animal phylum with extraordinary resistance to environmental stress. To gain insights into their stress-specific adaptation potential, major clusters of related and similar proteins are identified, as well as specific functional clusters delineated comparing all tardigrades and individual species (Milnesium tardigradum, Hypsibius dujardini, Echiniscus testudo, Tulinus stephaniae, Richtersius coronifer) and functional elements in tardigrade mRNAs are analysed. We find that 39.3% of the total sequences clustered in 58 clusters of more than 20 proteins. Among these are ten tardigrade specific as well as a number of stress-specific protein clusters. Tardigrade-specific functional adaptations include strong protein, DNA- and redox protection, maintenance and protein recycling. Specific regulatory elements regulate tardigrade mRNA stability such as lox P DICE elements whereas 14 other RNA elements of higher eukaryotes are not found. Further features of tardigrade specific adaption are rapidly identified by sequence and/or pattern search on the web-tool tardigrade analyzer http://waterbear.bioapps.biozentrum.uni-wuerzburg.de. The work-bench offers nucleotide pattern analysis for promotor and regulatory element detection (tardigrade specific; nrdb) as well as rapid COG search for function assignments including species-specific repositories of all analysed data. Different protein clusters and regulatory elements implicated in tardigrade stress adaptations are analysed including unpublished tardigrade sequences.
Algama, Manjula; Tasker, Edward; Williams, Caitlin; Parslow, Adam C; Bryson-Richardson, Robert J; Keith, Jonathan M
2017-03-27
Computational identification of non-coding RNAs (ncRNAs) is a challenging problem. We describe a genome-wide analysis using Bayesian segmentation to identify intronic elements highly conserved between three evolutionarily distant vertebrate species: human, mouse and zebrafish. We investigate the extent to which these elements include ncRNAs (or conserved domains of ncRNAs) and regulatory sequences. We identified 655 deeply conserved intronic sequences in a genome-wide analysis. We also performed a pathway-focussed analysis on genes involved in muscle development, detecting 27 intronic elements, of which 22 were not detected in the genome-wide analysis. At least 87% of the genome-wide and 70% of the pathway-focussed elements have existing annotations indicative of conserved RNA secondary structure. The expression of 26 of the pathway-focused elements was examined using RT-PCR, providing confirmation that they include expressed ncRNAs. Consistent with previous studies, these elements are significantly over-represented in the introns of transcription factors. This study demonstrates a novel, highly effective, Bayesian approach to identifying conserved non-coding sequences. Our results complement previous findings that these sequences are enriched in transcription factors. However, in contrast to previous studies which suggest the majority of conserved sequences are regulatory factor binding sites, the majority of conserved sequences identified using our approach contain evidence of conserved RNA secondary structures, and our laboratory results suggest most are expressed. Functional roles at DNA and RNA levels are not mutually exclusive, and many of our elements possess evidence of both. Moreover, ncRNAs play roles in transcriptional and post-transcriptional regulation, and this may contribute to the over-representation of these elements in introns of transcription factors. We attribute the higher sensitivity of the pathway-focussed analysis compared to the genome-wide analysis to improved alignment quality, suggesting that enhanced genomic alignments may reveal many more conserved intronic sequences.
Kanamori, Hiroshi; Yuhashi, Kazuhito; Ohnishi, Shin; Koike, Kazuhiko; Kodama, Tatsuhiko
2010-05-01
The hepatitis C virus NS5B RNA-dependent RNA polymerase (RdRp) is a key enzyme involved in viral replication. Interaction between NS5B RdRp and the viral RNA sequence is likely to be an important step in viral RNA replication. The C-terminal half of the NS5B-coding sequence, which contains the important cis-acting replication element, has been identified as an NS5B-binding sequence. In the present study, we confirm the specific binding of NS5B to one of the RNA stem-loop structures in the region, 5BSL3.2. In addition, we show that NS5B binds to the complementary strand of 5BSL3.2 (5BSL3.2N). The bulge structure of 5BSL3.2N was shown to be indispensable for tight binding to NS5B. In vitro RdRp activity was inhibited by 5BSL3.2N, indicating the importance of the RNA element in the polymerization by RdRp. These results suggest the involvement of the RNA stem-loop structure of the negative strand in the replication process.
Marciniak, R A; Garcia-Blanco, M A; Sharp, P A
1990-01-01
Human immunodeficiency virus type 1 RNAs contain a sequence, trans-activation-response (TAR) element, which is required for tat protein-mediated trans-activation of viral gene expression. We have identified a nuclear protein from extracts of HeLa cells that binds to the TAR element RNA in a sequence-specific manner. The binding of this 68-kDa polypeptide was detected by UV cross-linking proteins to TAR element RNA transcribed in vitro. Competition experiments were performed by using a partially purified preparation of the protein to quantify the relative binding affinities of TAR element RNA mutants. The binding affinity of the TAR mutants paralleled the reported ability of those mutants to support tat trans-activation in vivo. We propose that this cellular protein moderates TAR activity in vivo. Images PMID:2333305
Pickard, Mark R.; Williams, Gwyn T.
2016-01-01
Growth arrest-specific 5 (GAS5) lncRNA promotes apoptosis, and its expression is down-regulated in breast cancer. GAS5 lncRNA is a decoy of glucocorticoid/related receptors; a stem-loop sequence constitutes the GAS5 hormone response element mimic (HREM), which is essential for the regulation of breast cancer cell apoptosis. This preclinical study aimed to determine if the GAS5 HREM sequence alone promotes the apoptosis of breast cancer cells. Nucleofection of hormone-sensitive and –insensitive breast cancer cell lines with a GAS5 HREM DNA oligonucleotide increased both basal and ultraviolet-C-induced apoptosis, and decreased culture viability and clonogenic growth, similar to GAS5 lncRNA. The HREM oligonucleotide demonstrated similar sequence specificity to the native HREM for its functional activity and had no effect on endogenous GAS5 lncRNA levels. Certain chemically modified HREM oligonucleotides, notably DNA and RNA phosphorothioates, retained pro-apoptotic. activity. Crucially the HREM oligonucleotide could overcome apoptosis resistance secondary to deficient endogenous GAS5 lncRNA levels. Thus, the GAS5 lncRNA HREM sequence alone is sufficient to induce apoptosis in breast cancer cells, including triple-negative breast cancer cells. These findings further suggest that emerging knowledge of structure/function relationships in the field of lncRNA biology can be exploited for the development of entirely novel, oligonucleotide mimic-based, cancer therapies. PMID:26862727
Computer-Aided Design of RNA Origami Structures.
Sparvath, Steffen L; Geary, Cody W; Andersen, Ebbe S
2017-01-01
RNA nanostructures can be used as scaffolds to organize, combine, and control molecular functionalities, with great potential for applications in nanomedicine and synthetic biology. The single-stranded RNA origami method allows RNA nanostructures to be folded as they are transcribed by the RNA polymerase. RNA origami structures provide a stable framework that can be decorated with functional RNA elements such as riboswitches, ribozymes, interaction sites, and aptamers for binding small molecules or protein targets. The rich library of RNA structural and functional elements combined with the possibility to attach proteins through aptamer-based binding creates virtually limitless possibilities for constructing advanced RNA-based nanodevices.In this chapter we provide a detailed protocol for the single-stranded RNA origami design method using a simple 2-helix tall structure as an example. The first step involves 3D modeling of a double-crossover between two RNA double helices, followed by decoration with tertiary motifs. The second step deals with the construction of a 2D blueprint describing the secondary structure and sequence constraints that serves as the input for computer programs. In the third step, computer programs are used to design RNA sequences that are compatible with the structure, and the resulting outputs are evaluated and converted into DNA sequences to order.
The VP35 and VP40 proteins of filoviruses. Homology between Marburg and Ebola viruses.
Bukreyev, A A; Volchkov, V E; Blinov, V M; Netesov, S V
1993-05-03
The fragments of genomic RNA sequences of Marburg (MBG) and Ebola (EBO) viruses are reported. These fragments were found to encode the VP35 and VP40 proteins. The canonic sequences were revealed before and after each open reading frame. It is suggested that these sequences are mRNA extremities and at the same time the regulatory elements for mRNA transcription. Homology between the MBG and EBO proteins was discovered.
Gioio, Anthony E.
2017-01-01
Abstract Tyrosine hydroxylase (TH) is the enzyme that catalyzes the rate-limiting step in the biosynthesis of the catecholamine neurotransmitters. In a previous communication, evidence was provided that TH mRNA is trafficked to the axon, where it is locally translated. In addition, a 50-bp sequence element in the 3′untranslated region (3’UTR) of TH mRNA was identified that directs TH mRNA to distal axons (i.e., zip-code). In the present study, the hypothesis was tested that local translation of TH plays an important role in the biosynthesis of the catecholamine neurotransmitters in the axon and/or presynaptic nerve terminal. Toward this end, a targeted deletion of the axonal transport sequence element was developed, using the lentiviral delivery of the CRISPR/Cas9 system, and two guide RNA (gRNA) sequences flanking the 50-bp cis-acting regulatory element in rat superior cervical ganglion (SCG) neurons. Deletion of the axonal transport element reduced TH mRNA levels in the distal axons and reduced the axonal protein levels of TH and TH activity as measured by phosphorylation of SER40 in SCG neurons. Moreover, deletion of the zip-code diminished the axonal levels of dopamine (DA) and norepinephrine (NE). Conversely, the local translation of exogenous TH mRNA in the distal axon enhanced TH levels and activity, and elevated axonal NE levels. Taken together, these results provide direct evidence to support the hypothesis that TH mRNA trafficking and local synthesis of TH play an important role in the synthesis of catecholamines in the axon and presynaptic terminal. PMID:28630892
Aschrafi, Armaz; Gioio, Anthony E; Dong, Lijin; Kaplan, Barry B
2017-01-01
Tyrosine hydroxylase (TH) is the enzyme that catalyzes the rate-limiting step in the biosynthesis of the catecholamine neurotransmitters. In a previous communication, evidence was provided that TH mRNA is trafficked to the axon, where it is locally translated. In addition, a 50-bp sequence element in the 3'untranslated region (3'UTR) of TH mRNA was identified that directs TH mRNA to distal axons (i.e., zip-code). In the present study, the hypothesis was tested that local translation of TH plays an important role in the biosynthesis of the catecholamine neurotransmitters in the axon and/or presynaptic nerve terminal. Toward this end, a targeted deletion of the axonal transport sequence element was developed, using the lentiviral delivery of the CRISPR/Cas9 system, and two guide RNA (gRNA) sequences flanking the 50-bp cis- acting regulatory element in rat superior cervical ganglion (SCG) neurons. Deletion of the axonal transport element reduced TH mRNA levels in the distal axons and reduced the axonal protein levels of TH and TH activity as measured by phosphorylation of SER40 in SCG neurons. Moreover, deletion of the zip-code diminished the axonal levels of dopamine (DA) and norepinephrine (NE). Conversely, the local translation of exogenous TH mRNA in the distal axon enhanced TH levels and activity, and elevated axonal NE levels. Taken together, these results provide direct evidence to support the hypothesis that TH mRNA trafficking and local synthesis of TH play an important role in the synthesis of catecholamines in the axon and presynaptic terminal.
Li, Yang Eric; Xiao, Mu; Shi, Binbin; Yang, Yu-Cheng T; Wang, Dong; Wang, Fei; Marcia, Marco; Lu, Zhi John
2017-09-08
Crosslinking immunoprecipitation sequencing (CLIP-seq) technologies have enabled researchers to characterize transcriptome-wide binding sites of RNA-binding protein (RBP) with high resolution. We apply a soft-clustering method, RBPgroup, to various CLIP-seq datasets to group together RBPs that specifically bind the same RNA sites. Such combinatorial clustering of RBPs helps interpret CLIP-seq data and suggests functional RNA regulatory elements. Furthermore, we validate two RBP-RBP interactions in cell lines. Our approach links proteins and RNA motifs known to possess similar biochemical and cellular properties and can, when used in conjunction with additional experimental data, identify high-confidence RBP groups and their associated RNA regulatory elements.
Genome Cyclization as Strategy for Flavivirus RNA Replication
Villordo, Sergio M.; Gamarnik, Andrea V.
2017-01-01
Long-range and local RNA-RNA contacts in viral RNA genomes result in tertiary structures that modulate the function of enhancers, promoters, and silencers during translation, RNA replication, and encapsidation. In the case of flaviviruses, the presence of inverted complementary sequences at the 5′ and 3′ ends of the genome mediate long-range RNA interactions and RNA cyclization. The circular conformation of flavivirus genomes was demonstrated to be essential for RNA amplification. New ideas about the mechanisms by which circular genomes participate in flavivirus replication have emerged in the last few years. Here, we will describe the latest information about cis-acting elements involved in flavivirus genome cyclization, RNA promoter elements required for viral polymerase recognition, and how these elements together coordinate viral RNA synthesis. PMID:18703097
Torrent, C; Gabus, C; Darlix, J L
1994-02-01
Retroviral genomes consist of two identical RNA molecules associated at their 5' ends by the dimer linkage structure located in the packaging element (Psi or E) necessary for RNA dimerization in vitro and packaging in vivo. In murine leukemia virus (MLV)-derived vectors designed for gene transfer, the Psi + sequence of 600 nucleotides directs the packaging of recombinant RNAs into MLV virions produced by helper cells. By using in vitro RNA dimerization as a screening system, a sequence of rat VL30 RNA located next to the 5' end of the Harvey mouse sarcoma virus genome and as small as 67 nucleotides was found to form stable dimeric RNA. In addition, a purine-rich sequence located at the 5' end of this VL30 RNA seems to be critical for RNA dimerization. When this VL30 element was extended by 107 nucleotides at its 3' end and inserted into an MLV-derived vector lacking MLV Psi +, it directed the efficient encapsidation of recombinant RNAs into MLV virions. Because this VL30 packaging signal is smaller and more efficient in packaging recombinant RNAs than the MLV Psi + and does not contain gag or glyco-gag coding sequences, its use in MLV-derived vectors should render even more unlikely recombinations which could generate replication-competent viruses. Therefore, utilization of the rat VL30 packaging sequence should improve the biological safety of MLV vectors for human gene transfer.
Wilson, G M; Vasa, M Z; Deeley, R G
1998-05-01
The mRNA encoding the human low density lipoprotein (LDL) receptor is transiently stabilized after phorbol ester treatment of HepG2 cells and has been shown to associate with components of the cytoskeleton in this cell line (G. M. Wilson, E. A. Roberts, and R. G. Deeley, J. Lipid Res. 1997. 38: 437-446). Using an episomal expression system, fragments of the 3' untranslated region (3'UTR) of LDL receptor mRNA were transcribed in fusion with the coding region of beta-globin mRNA in HepG2 cells. Analyses of the decay kinetics of these beta-globin-LDL receptor fusion mRNA deletion mutants showed that sequences in the proximal 3'UTR of LDL receptor mRNA including several AU-rich elements (AREs) were sufficient to confer short constitutive mRNA half-life in the heterologous system. Stabilization of LDL receptor mRNA in the presence of PMA required sequences in the distal 3'UTR, at or near three Alu-like repetitive elements. Furthermore, the 3'UTR of LDL receptor mRNA conferred cytoskeletal association on the otherwise unassociated beta-globin mRNA, by a mechanism involving at least two distinct RNA elements. Comparisons of decay kinetics and subcellular localization of endogenous LDL receptor mRNA and beta-globin-LDL receptor mRNA fusions in HepG2 cells have demonstrated that several cis-acting elements in the receptor 3'UTR contribute to post-transcriptional regulation of receptor expression, and provide further support for involvement of the cytoskeleton in the regulation of LDL receptor mRNA turnover.
Sheikh, Faruk G; Mukhopadhyay, Sudit S; Gupta, Prabhakar
2002-02-01
The PstI family of elements are short, highly repetitive DNA sequences interspersed throughout the genome of the Bovidae. We have cloned and sequenced some members of the PstI family from cattle, goat, and buffalo. These elements are approximately 500 bp, have a copy number of 2 x 10(5) - 4 x 10(5), and comprise about 4% of the haploid genome. Studies of nucleotide sequence homology indicate that the buffalo and goat PstI repeats (type II) are similar types of short interspersed nucleotide element (SINE) sequences, but the cattle PstI repeat (type I) is considerably more divergent. Additionally, the goat PstI sequence showed significant sequence homology with bovine serine tRNA, and is therefore likely derived from serine tRNA. Interestingly, Southern hybridization suggests that both types of SINEs (I and II) are present in all the species of Bovidae. Dendrogram analysis indicates that cattle PstI SINE is similar to bovine Alu-like SINEs. Goat and buffalo SINEs formed a separate cluster, suggesting that these two types of SINEs evolved separately in the genome of the Bovidae.
Gruber, Andreas R
2014-07-10
RNA Polymerase III is a highly specialized enzyme complex responsible for the transcription of a very distinct set of housekeeping noncoding RNAs including tRNAs, 7SK snRNA, Y RNAs, U6 snRNA, and the RNA components of RNaseP and RNaseMRP. In this work we have utilized the conserved promoter structure of known RNA Polymerase III transcripts consisting of characteristic sequence elements termed proximal sequence elements (PSE) A and B and a TATA-box to uncover a novel RNA Polymerase III-transcribed, noncoding RNA family found to be conserved in Caenorhabditis as well as other clade V nematode species. Homology search in combination with detailed sequence and secondary structure analysis revealed that members of this novel ncRNA family evolve rapidly, and only maintain a potentially functional small stem structure that links the 5' end to the very 3' end of the transcript and a small hairpin structure at the 3' end. This is most likely required for efficient transcription termination. In addition, our study revealed evidence that canonical C/D box snoRNAs are also transcribed from a PSE A-PSE B-TATA-box promoter in Caenorhabditis elegans. Copyright © 2014 Elsevier B.V. All rights reserved.
USDA-ARS?s Scientific Manuscript database
Butyrate is a nutritional element with strong epigenetic regulatory activity as an inhibitor of histone deacetylases (HDACs). Based on the analysis of differentially expressed genes induced by butyrate in the bovine epithelial cell using deep RNA-sequencing technology (RNA-seq), a set of unique gen...
Chillón, Isabel; Pyle, Anna M.
2016-01-01
LincRNA-p21 is a long intergenic non-coding RNA (lincRNA) involved in the p53-mediated stress response. We sequenced the human lincRNA-p21 (hLincRNA-p21) and found that it has a single exon that includes inverted repeat Alu elements (IRAlus). Sense and antisense Alu elements fold independently of one another into a secondary structure that is conserved in lincRNA-p21 among primates. Moreover, the structures formed by IRAlus are involved in the localization of hLincRNA-p21 in the nucleus, where hLincRNA-p21 colocalizes with paraspeckles. Our results underscore the importance of IRAlus structures for the function of hLincRNA-p21 during the stress response. PMID:27378782
Darlix, J L; Gabus, C; Nugeyre, M T; Clavel, F; Barré-Sinoussi, F
1990-12-05
The retroviral genome consists of two identical RNA molecules joined at their 5' ends by the Dimer Linkage Structure (DLS). To study the mechanism of dimerization and the DLS of HIV-1 RNA, large amounts of bona fide HIV-1 RNA and of mutants have been synthesized in vitro. We report that HIV-1 RNA forms dimeric molecules and that viral nucleocapsid (NC) protein NCp15 greatly activates dimerization. Deletion mutagenesis in the RNA 5' 1333 nucleotides indicated that a small domain of 100 nucleotides, located between positions 311 to 415 from the 5' end, is necessary and sufficient to promote HIV-1 RNA dimerization. This dimerization domain encompasses an encapsidation element located between the 5' splice donor site and initiator AUG of gag and shows little sequence variations in different strains of HIV-1. Furthermore, cross-linking analysis of the interactions between NC and HIV-1 RNA (311 to 415) locates a major contact site in the encapsidation element of HIV-1 RNA. The genomic RNA dimer is tightly associated with nucleocapsid protein molecules in avian and murine retroviruses, and this ribonucleoprotein structure is believed to be the template for reverse transcription. Genomic RNA-protein interactions have been analyzed in human immunodeficiency virus (HIV) virions and results showed that NC protein molecules are tightly bound to the genomic RNA dimer. Since retroviral RNA dimerization and packaging appear to be under the control of the same cis element, the encapsidation sequences, and trans-acting factor, the NC protein, they are probably related events in the course of virion assembly.
NASA Technical Reports Server (NTRS)
Breault, D. T.; Lichtler, A. C.; Rowe, D. W.
1997-01-01
Collagen reporter gene constructs have be used to identify cell-specific sequences needed for transcriptional activation. The elements required for endogenous levels of COL1A1 expression, however, have not been elucidated. The human COL1A1 minigene is expressed at high levels and likely harbors sequence elements required for endogenous levels of activity. Using stably transfected osteoblastic Py1a cells, we studied a series of constructs (pOBColCAT) designed to characterize further the elements required for high level of expression. pOBColCAT, which contains the COL1A1 first intron, was expressed at 50-100-fold higher levels than ColCAT 3.6, which lacks the first intron. This difference is best explained by improved mRNA processing rather than a transcriptional effect. Furthermore, variation in activity observed with the intron deletion constructs is best explained by altered mRNA splicing. Two major regions of the human COL1A1 minigene, the 3'-flanking sequences and the minigene body, were introduced into pOBColCAT to assess both transcriptional enhancing activity and the effect on mRNA stability. Analysis of the minigene body, which includes the first five exons and introns fused with the terminal six introns and exons, revealed an orientation-independent 5-fold increase in CAT activity. In contrast the 3'-flanking sequences gave rise to a modest 61% increase in CAT activity. Neither region increased the mRNA half-life of the parent construct, suggesting that CAT-specific mRNA instability elements may serve as dominant negative regulators of stability. This study suggests that other sites within the body of the COL1A1 minigene are important for high expression, e.g. during periods of rapid extracellular matrix production.
Jaffrey, S R; Haile, D J; Klausner, R D; Harford, J B
1993-09-25
To assess the influence of RNA sequence/structure on the interaction RNAs with the iron-responsive element binding protein (IRE-BP), twenty eight altered RNAs were tested as competitors for an RNA corresponding to the ferritin H chain IRE. All changes in the loop of the predicted IRE hairpin and in the unpaired cytosine residue characteristically found in IRE stems significantly decreased the apparent affinity of the RNA for the IRE-BP. Similarly, alteration in the spacing and/or orientation of the loop and the unpaired cytosine of the stem by either increasing or decreasing the number of base pairs separating them significantly reduced efficacy as a competitor. It is inferred that the IRE-BP forms multiple contacts with its cognate RNA, and that these contacts, acting in concert, provide the basis for the high affinity of this interaction.
Genome-wide mapping of infection-induced SINE RNAs reveals a role in selective mRNA export.
Karijolich, John; Zhao, Yang; Alla, Ravi; Glaunsinger, Britt
2017-06-02
Short interspersed nuclear elements (SINEs) are retrotransposons evolutionarily derived from endogenous RNA Polymerase III RNAs. Though SINE elements have undergone exaptation into gene regulatory elements, how transcribed SINE RNA impacts transcriptional and post-transcriptional regulation is largely unknown. This is partly due to a lack of information regarding which of the loci have transcriptional potential. Here, we present an approach (short interspersed nuclear element sequencing, SINE-seq), which selectively profiles RNA Polymerase III-derived SINE RNA, thereby identifying transcriptionally active SINE loci. Applying SINE-seq to monitor murine B2 SINE expression during a gammaherpesvirus infection revealed transcription from 28 270 SINE loci, with ∼50% of active SINE elements residing within annotated RNA Polymerase II loci. Furthermore, B2 RNA can form intermolecular RNA-RNA interactions with complementary mRNAs, leading to nuclear retention of the targeted mRNA via a mechanism involving p54nrb. These findings illuminate a pathway for the selective regulation of mRNA export during stress via retrotransposon activation. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Genome-wide mapping of infection-induced SINE RNAs reveals a role in selective mRNA export
Zhao, Yang; Alla, Ravi
2017-01-01
Abstract Short interspersed nuclear elements (SINEs) are retrotransposons evolutionarily derived from endogenous RNA Polymerase III RNAs. Though SINE elements have undergone exaptation into gene regulatory elements, how transcribed SINE RNA impacts transcriptional and post-transcriptional regulation is largely unknown. This is partly due to a lack of information regarding which of the loci have transcriptional potential. Here, we present an approach (short interspersed nuclear element sequencing, SINE-seq), which selectively profiles RNA Polymerase III-derived SINE RNA, thereby identifying transcriptionally active SINE loci. Applying SINE-seq to monitor murine B2 SINE expression during a gammaherpesvirus infection revealed transcription from 28 270 SINE loci, with ∼50% of active SINE elements residing within annotated RNA Polymerase II loci. Furthermore, B2 RNA can form intermolecular RNA–RNA interactions with complementary mRNAs, leading to nuclear retention of the targeted mRNA via a mechanism involving p54nrb. These findings illuminate a pathway for the selective regulation of mRNA export during stress via retrotransposon activation. PMID:28334904
Torrent, C; Gabus, C; Darlix, J L
1994-01-01
Retroviral genomes consist of two identical RNA molecules associated at their 5' ends by the dimer linkage structure located in the packaging element (Psi or E) necessary for RNA dimerization in vitro and packaging in vivo. In murine leukemia virus (MLV)-derived vectors designed for gene transfer, the Psi + sequence of 600 nucleotides directs the packaging of recombinant RNAs into MLV virions produced by helper cells. By using in vitro RNA dimerization as a screening system, a sequence of rat VL30 RNA located next to the 5' end of the Harvey mouse sarcoma virus genome and as small as 67 nucleotides was found to form stable dimeric RNA. In addition, a purine-rich sequence located at the 5' end of this VL30 RNA seems to be critical for RNA dimerization. When this VL30 element was extended by 107 nucleotides at its 3' end and inserted into an MLV-derived vector lacking MLV Psi +, it directed the efficient encapsidation of recombinant RNAs into MLV virions. Because this VL30 packaging signal is smaller and more efficient in packaging recombinant RNAs than the MLV Psi + and does not contain gag or glyco-gag coding sequences, its use in MLV-derived vectors should render even more unlikely recombinations which could generate replication-competent viruses. Therefore, utilization of the rat VL30 packaging sequence should improve the biological safety of MLV vectors for human gene transfer. Images PMID:8289369
Interaction of influenza virus polymerase with viral RNA in the 'corkscrew' conformation.
Flick, R; Hobom, G
1999-10-01
The influenza virus RNA (vRNA) promoter structure is known to consist of the 5'- and 3'-terminal sequences of the RNA, within very narrow boundaries of 16 and 15 nucleotides, respectively. A complete set of single nucleotide substitutions led to the previously proposed model of a binary hooked or 'corkscrew' conformation for the vRNA promoter when it interacts with the viral polymerase. This functional structure is confirmed here with a complete set of complementary double substitutions, of both the regular A:U and G:C type and also the G:U type of base-pair exchanges. The proposed structure consists of a six base-pair RNA rod in the distal element in conjunction with two stem-loop structures of two short-range base-pairs (positions 2-9; 3-8). These support an exposed tetranucleotide loop within each branch of the proximal element, in an overall oblique organization due to a central unpaired A residue at position 10 in the 5' sequence. Long-range base-pairing between the entire 5' and 3' branches, as required for an unmodified 'panhandle' model, has been excluded for the proximal element, while it is known to represent the mode of interaction within the distal element. A large number of short-range base-pair exchanges in the proximal element constitute promoter-up mutations, which show activities several times above that of the wild-type in reporter gene assays. The unique overall conformation and rather few invariant nucleotides appear to be the core elements in vRNA recognition by polymerase and also in viral ribonucleoprotein packaging, to allow discrimination against the background of other RNA molecules in the cell.
ExportAid: database of RNA elements regulating nuclear RNA export in mammals.
Giulietti, Matteo; Milantoni, Sara Armida; Armeni, Tatiana; Principato, Giovanni; Piva, Francesco
2015-01-15
Regulation of nuclear mRNA export or retention is carried out by RNA elements but the mechanism is not yet well understood. To understand the mRNA export process, it is important to collect all the involved RNA elements and their trans-acting factors. By hand-curated literature screening we collected, in ExportAid database, experimentally assessed data about RNA elements regulating nuclear export or retention of endogenous, heterologous or artificial RNAs in mammalian cells. This database could help to understand the RNA export language and to study the possible export efficiency alterations owing to mutations or polymorphisms. Currently, ExportAid stores 235 and 96 RNA elements, respectively, increasing and decreasing export efficiency, and 98 neutral assessed sequences. Freely accessible without registration at http://www.introni.it/ExportAid/ExportAid.html. Database and web interface are implemented in Perl, MySQL, Apache and JavaScript with all major browsers supported. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
The 5′ RNA Terminus of Spleen Necrosis Virus Stimulates Translation of Nonviral mRNA
Roberts, Tiffiney M.; Boris-Lawrie, Kathleen
2000-01-01
The RU5 region at the 5′ RNA terminus of spleen necrosis virus (SNV) has been shown to facilitate expression of human immunodeficiency virus type 1 (HIV) unspliced RNA independently of the Rev-responsive element (RRE) and Rev. The SNV sequences act as a distinct posttranscriptional control element to stimulate gag RNA nuclear export and association with polyribosomes. Here we sought to determine whether RU5 functions to neutralize the cis-acting inhibitory sequences (INSs) in HIV RNA that confer RRE/Rev dependence or functions as an independent stimulatory sequence. Experiments with HIV gag reporter plasmids that contain inactivated INS-1 indicated that neutralization of INSs does not account for RU5 function. Results with luciferase reporter gene (luc) plasmids further indicated that RU5 stimulates expression of a nonretroviral RNA that lacks INSs. Northern blot and RT-PCR analyses indicated that RU5 does not increase the steady-state levels or nuclear export of the luc transcript but rather that the U5 region facilitates efficient polyribosomal association of the mRNA. RU5 does not function as an internal ribosome entry site in bicistronic reporter plasmids, and it requires the 5′-proximal position for efficient function. Our results indicate that RU5 contains stimulatory sequences that function in a 5′-proximal position to enhance initiation of translation of a nonretroviral reporter gene RNA. We speculate that RU5 evolved to overcome the translation-inhibitory effect of the highly structured encapsidation signal and other replication motifs in the 5′ untranslated region of the retroviral RNA. PMID:10933721
Probing the Structures of Viral RNA Regulatory Elements with SHAPE and Related Methodologies
Rausch, Jason W.; Sztuba-Solinska, Joanna; Le Grice, Stuart F. J.
2018-01-01
Viral RNAs were selected by evolution to possess maximum functionality in a minimal sequence. Depending on the classification of the virus and the type of RNA in question, viral RNAs must alternately be replicated, spliced, transcribed, transported from the nucleus into the cytoplasm, translated and/or packaged into nascent virions, and in most cases, provide the sequence and structural determinants to facilitate these processes. One consequence of this compact multifunctionality is that viral RNA structures can be exquisitely complex, often involving intermolecular interactions with RNA or protein, intramolecular interactions between sequence segments separated by several thousands of nucleotides, or specialized motifs such as pseudoknots or kissing loops. The fluidity of viral RNA structure can also present a challenge when attempting to characterize it, as genomic RNAs especially are likely to sample numerous conformations at various stages of the virus life cycle. Here we review advances in chemoenzymatic structure probing that have made it possible to address such challenges with respect to cis-acting elements, full-length viral genomes and long non-coding RNAs that play a major role in regulating viral gene expression. PMID:29375504
Gherghe, Cristina; Lombo, Tania; Leonard, Christopher W.; Datta, Siddhartha A. K.; Bess, Julian W.; Gorelick, Robert J.; Rein, Alan; Weeks, Kevin M.
2010-01-01
All retroviral genomic RNAs contain a cis-acting packaging signal by which dimeric genomes are selectively packaged into nascent virions. However, it is not understood how Gag (the viral structural protein) interacts with these signals to package the genome with high selectivity. We probed the structure of murine leukemia virus RNA inside virus particles using SHAPE, a high-throughput RNA structure analysis technology. These experiments showed that NC (the nucleic acid binding domain derived from Gag) binds within the virus to the sequence UCUG-UR-UCUG. Recombinant Gag and NC proteins bound to this same RNA sequence in dimeric RNA in vitro; in all cases, interactions were strongest with the first U and final G in each UCUG element. The RNA structural context is critical: High-affinity binding requires base-paired regions flanking this motif, and two UCUG-UR-UCUG motifs are specifically exposed in the viral RNA dimer. Mutating the guanosine residues in these two motifs—only four nucleotides per genomic RNA—reduced packaging 100-fold, comparable to the level of nonspecific packaging. These results thus explain the selective packaging of dimeric RNA. This paradigm has implications for RNA recognition in general, illustrating how local context and RNA structure can create information-rich recognition signals from simple single-stranded sequence elements in large RNAs. PMID:20974908
Detection of non-coding RNA in bacteria and archaea using the DETR'PROK Galaxy pipeline.
Toffano-Nioche, Claire; Luo, Yufei; Kuchly, Claire; Wallon, Claire; Steinbach, Delphine; Zytnicki, Matthias; Jacq, Annick; Gautheret, Daniel
2013-09-01
RNA-seq experiments are now routinely used for the large scale sequencing of transcripts. In bacteria or archaea, such deep sequencing experiments typically produce 10-50 million fragments that cover most of the genome, including intergenic regions. In this context, the precise delineation of the non-coding elements is challenging. Non-coding elements include untranslated regions (UTRs) of mRNAs, independent small RNA genes (sRNAs) and transcripts produced from the antisense strand of genes (asRNA). Here we present a computational pipeline (DETR'PROK: detection of ncRNAs in prokaryotes) based on the Galaxy framework that takes as input a mapping of deep sequencing reads and performs successive steps of clustering, comparison with existing annotation and identification of transcribed non-coding fragments classified into putative 5' UTRs, sRNAs and asRNAs. We provide a step-by-step description of the protocol using real-life example data sets from Vibrio splendidus and Escherichia coli. Copyright © 2013 The Authors. Published by Elsevier Inc. All rights reserved.
Highly multiplexed subcellular RNA sequencing in situ
Lee, Je Hyuk; Daugharthy, Evan R.; Scheiman, Jonathan; Kalhor, Reza; Ferrante, Thomas C.; Yang, Joyce L.; Terry, Richard; Jeanty, Sauveur S. F.; Li, Chao; Amamoto, Ryoji; Peters, Derek T.; Turczyk, Brian M.; Marblestone, Adam H.; Inverso, Samuel A.; Bernard, Amy; Mali, Prashant; Rios, Xavier; Aach, John; Church, George M.
2014-01-01
Understanding the spatial organization of gene expression with single nucleotide resolution requires localizing the sequences of expressed RNA transcripts within a cell in situ. Here we describe fluorescent in situ RNA sequencing (FISSEQ), in which stably cross-linked cDNA amplicons are sequenced within a biological sample. Using 30-base reads from 8,742 genes in situ, we examined RNA expression and localization in human primary fibroblasts using a simulated wound healing assay. FISSEQ is compatible with tissue sections and whole mount embryos, and reduces the limitations of optical resolution and noisy signals on single molecule detection. Our platform enables massively parallel detection of genetic elements, including gene transcripts and molecular barcodes, and can be used to investigate cellular phenotype, gene regulation, and environment in situ. PMID:24578530
Das, G; Henning, D; Wright, D; Reddy, R
1988-01-01
Whereas the genes coding for trimethyl guanosine-capped snRNAs are transcribed by RNA polymerase II, the U6 RNA genes are transcribed by RNA polymerase III. In this study, we have analyzed the cis-regulatory elements involved in the transcription of a mouse U6 snRNA gene in vitro and in frog oocytes. Transcriptional analysis of mutant U6 gene constructs showed that, unlike most known cases of polymerase III transcription, intragenic sequences except the initiation nucleotide are dispensable for efficient and accurate transcription of U6 gene in vitro. Transcription of 5' deletion mutants in vitro and in frog oocytes showed that the upstream region, within 79 bp from the initiation nucleotide, contains elements necessary for U6 gene transcription. Transcription studies were carried out in frog oocytes with U6 genes containing 5' distal sequence; these studies revealed that the distal element acts as an orientation-dependent enhancer when present upstream to the gene, while it is orientation-independent but distance-dependent enhancer when placed down-stream to the U6 gene. Analysis of 3' deletion mutants showed that the transcription termination of U6 RNA is dependent on a T cluster present on the 3' end of the gene, thus providing further support to other lines of evidence that U6 genes are transcribed by RNA polymerase III. These observations suggest the involvement of a composite of components of RNA polymerase II and III transcription machineries in the transcription of U6 genes by RNA polymerase III. Images PMID:3366121
Unit-length line-1 transcripts in human teratocarcinoma cells.
Skowronski, J; Fanning, T G; Singer, M F
1988-01-01
We have characterized the approximately 6.5-kilobase cytoplasmic poly(A)+ Line-1 (L1) RNA present in a human teratocarcinoma cell line, NTera2D1, by primer extension and by analysis of cloned cDNAs. The bulk of the RNA begins (5' end) at the residue previously identified as the 5' terminus of the longest known primate genomic L1 elements, presumed to represent "unit" length. Several of the cDNA clones are close to 6 kilobase pairs, that is, close to full length. The partial sequences of 18 cDNA clones and full sequence of one (5,975 base pairs) indicate that many different genomic L1 elements contribute transcripts to the 6.5-kilobase cytoplasmic poly(A)+ RNA in NTera2D1 cells because no 2 of the 19 cDNAs analyzed had identical sequences. The transcribed elements appear to represent a subset of the total genomic L1s, a subset that has a characteristic consensus sequence in the 3' noncoding region and a high degree of sequence conservation throughout. Two open reading frames (ORFs) of 1,122 (ORF1) and 3,852 (ORF2) bases, flanked by about 800 and 200 bases of sequence at the 5' and 3' ends, respectively, can be identified in the cDNAs. Both ORFs are in the same frame, and they are separated by 33 bases bracketed by two conserved in-frame stop codons. ORF 2 is interrupted by at least one randomly positioned stop codon in the majority of the cDNAs. The data support proposals suggesting that the human L1 family includes one or more functional genes as well as an extraordinarily large number of pseudogenes whose ORFs are broken by stop codons. The cDNA structures suggest that both genes and pseudogenes are transcribed. At least one of the cDNAs (cD11), which was sequenced in its entirety, could, in principle, represent an mRNA for production of the ORF1 polypeptide. The similarity of mammalian L1s to several recently described invertebrate movable elements defines a new widely distributed class of elements which we term class II retrotransposons. Images PMID:2454389
Georgiev, O; Birnstiel, M L
1985-01-01
Analysis of cDNA sequences obtained from the small nuclear RNA U7 has previously suggested specific contacts, by base pairing, between the conserved stem-loop structure and CAAGAAAGA sequence of the histone pre-mRNA and the 5'-terminal sequence of the U7 RNA during RNA processing. In order to test some aspects of the model we have created a series of linker scan, deletion and insertion mutants of the 3' terminus of a sea urchin H3 histone gene and have injected mutant DNAs or in vitro synthesized precursors into frog oocyte nuclei for interpretation. We find that, in addition to the stem-loop structure of the mRNA, the CAAGAAAGA spacer transcript within the histone pre-mRNA is required absolutely for RNA processing, as predicted from our model. Spacer sequences immediately downstream of the CAAGAAAGA motif are not complementary to U7 RNA. Nevertheless, they are necessary for obtaining a maximal rate of RNA processing, as is the ACCA sequence coding for the 3' terminus of the mature mRNA. An increase of distance between the mRNA palindrome and the CAAGAAAGA by as little as six nucleotides abolishes all processing. It may, therefore, be useful to regard both these sequence motifs as part of one and the same RNA processing signal with narrowly defined topologies. Interestingly, U7 RNA-dependent 3' processing of histone pre-mRNA can occur in RNA injection experiments only when the in vitro synthesized pre-mRNA contains sequence extensions well beyond the region of sequence complementarities to the U7 RNA. In addition to directing 3' processing the terminal mRNA sequences may have a role in histone mRNA stabilization in the cytoplasmic compartment. Images Fig. 3. Fig. 4. Fig. 5. Fig. 6. Fig. 7. PMID:2410259
Properties of a U1 RNA enhancer-like sequence.
Ciliberto, G; Palla, F; Tebb, G; Mattaj, I W; Philipson, L
1987-01-01
The properties of a X.laevis U1B snRNA gene enhancer have been studied by microinjection in Xenopus oocytes. The enhancer-like sequence, defined as a short DNA stretch that is able to activate transcription in an orientation independent manner, is interchangeable between different U snRNA genes. The enhancer sequence alone does not, however, efficiently activate transcription from an SV40 pol II promoter but regains its activity when combined with the U-gene specific proximal sequence element. DNase I protection experiments show that the X.laevis U1B enhancer can interact specifically with a nuclear factor present in mammalian cells. Images PMID:3031597
Increased complexity of circRNA expression during species evolution.
Dong, Rui; Ma, Xu-Kai; Chen, Ling-Ling; Yang, Li
2017-08-03
Circular RNAs (circRNAs) are broadly identified from precursor mRNA (pre-mRNA) back-splicing across various species. Recent studies have suggested a cell-/tissue- specific manner of circRNA expression. However, the distinct expression pattern of circRNAs among species and its underlying mechanism still remain to be explored. Here, we systematically compared circRNA expression from human and mouse, and found that only a small portion of human circRNAs could be determined in parallel mouse samples. The conserved circRNA expression between human and mouse is correlated with the existence of orientation-opposite complementary sequences in introns that flank back-spliced exons in both species, but not the circRNA sequences themselves. Quantification of RNA pairing capacity of orientation-opposite complementary sequences across circRNA-flanking introns by Complementary Sequence Index (CSI) identifies that among all types of complementary sequences, SINEs, especially Alu elements in human, contribute the most for circRNA formation and that their diverse distribution across species leads to the increased complexity of circRNA expression during species evolution. Together, our integrated and comparative reference catalog of circRNAs in different species reveals a species-specific pattern of circRNA expression and suggests a previously under-appreciated impact of fast-evolved SINEs on the regulation of (circRNA) gene expression.
Cis-acting RNA elements in the Hepatitis C virus RNA genome
Sagan, Selena M.; Chahal, Jasmin; Sarnow, Peter
2017-01-01
Hepatitis C virus (HCV) infection is a rapidly increasing global health problem with an estimated 170 million people infected worldwide. HCV is a hepatotropic, positive-sense RNA virus of the family Flaviviridae. As a positive-sense RNA virus, the HCV genome itself must serve as a template for translation, replication and packaging. The viral RNA must therefore be a dynamic structure that is able to readily accommodate structural changes to expose different regions of the genome to viral and cellular proteins to carry out the HCV life cycle. The ∼9600 nucleotide viral genome contains a single long open reading frame flanked by 5′ and 3′ non-coding regions that contain cis-acting RNA elements important for viral translation, replication and stability. Additional cis-acting RNA elements have also been identified in the coding sequences as well as in the 3′ end of the negative-strand replicative intermediate. Herein, we provide an overview of the importance of these cis-acting RNA elements in the HCV life cycle. PMID:25576644
de Borba, Luana; Villordo, Sergio M; Iglesias, Nestor G; Filomatori, Claudia V; Gebhard, Leopoldo G; Gamarnik, Andrea V
2015-03-01
The dengue virus genome is a dynamic molecule that adopts different conformations in the infected cell. Here, using RNA folding predictions, chemical probing analysis, RNA binding assays, and functional studies, we identified new cis-acting elements present in the capsid coding sequence that facilitate cyclization of the viral RNA by hybridization with a sequence involved in a local dumbbell structure at the viral 3' untranslated region (UTR). The identified interaction differentially enhances viral replication in mosquito and mammalian cells. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
An RNA motif advances transcription by preventing Rho-dependent termination
Sevostyanova, Anastasia; Groisman, Eduardo A.
2015-01-01
The transcription termination factor Rho associates with most nascent bacterial RNAs as they emerge from RNA polymerase. However, pharmacological inhibition of Rho derepresses only a small fraction of these transcripts. What, then, determines the specificity of Rho-dependent transcription termination? We now report the identification of a Rho-antagonizing RNA element (RARE) that hinders Rho-dependent transcription termination. We establish that RARE traps Rho in an inactive complex but does not prevent Rho binding to its recruitment sites. Although translating ribosomes normally block Rho access to an mRNA, inefficient translation of an open reading frame in the leader region of the Salmonella mgtCBR operon actually enables transcription of its associated coding region by favoring an RNA conformation that sequesters RARE. The discovery of an RNA element that inactivates Rho signifies that the specificity of nucleic-acid binding proteins is defined not only by the sequences that recruit these proteins but also by sequences that antagonize their activity. PMID:26630006
Deep sequencing of cardiac microRNA-mRNA interactomes in clinical and experimental cardiomyopathy
Matkovich, Scot J.; Dorn, Gerald W.
2018-01-01
Summary MicroRNAs are a family of short (~21 nucleotide) noncoding RNAs that serve key roles in cellular growth and differentiation and the response of the heart to stress stimuli. As the sequence-specific recognition element of RNA-induced silencing complexes (RISCs), microRNAs bind mRNAs and prevent their translation via mechanisms that may include transcript degradation and/or prevention of ribosome binding. Short microRNA sequences and the ability of microRNAs to bind to mRNA sites having only partial/imperfect sequence complementarity complicates purely computational analyses of microRNA-mRNA interactomes. Furthermore, computational microRNA target prediction programs typically ignore biological context, and therefore the principal determinants of microRNA-mRNA binding: the presence and quantity of each. To address these deficiencies we describe an empirical method, developed via studies of stressed and failing hearts, to determine disease-induced changes in microRNAs, mRNAs, and the mRNAs targeted to the RISC, without cross-linking mRNAs to RISC proteins. Deep sequencing methods are used to determine RNA abundances, delivering unbiased, quantitative RNA data limited only by their annotation in the genome of interest. We describe the laboratory bench steps required to perform these experiments, experimental design strategies to achieve an appropriate number of sequencing reads per biological replicate, and computer-based processing tools and procedures to convert large raw sequencing data files into gene expression measures useful for differential expression analyses. PMID:25836573
Deep sequencing of cardiac microRNA-mRNA interactomes in clinical and experimental cardiomyopathy.
Matkovich, Scot J; Dorn, Gerald W
2015-01-01
MicroRNAs are a family of short (~21 nucleotide) noncoding RNAs that serve key roles in cellular growth and differentiation and the response of the heart to stress stimuli. As the sequence-specific recognition element of RNA-induced silencing complexes (RISCs), microRNAs bind mRNAs and prevent their translation via mechanisms that may include transcript degradation and/or prevention of ribosome binding. Short microRNA sequences and the ability of microRNAs to bind to mRNA sites having only partial/imperfect sequence complementarity complicate purely computational analyses of microRNA-mRNA interactomes. Furthermore, computational microRNA target prediction programs typically ignore biological context, and therefore the principal determinants of microRNA-mRNA binding: the presence and quantity of each. To address these deficiencies we describe an empirical method, developed via studies of stressed and failing hearts, to determine disease-induced changes in microRNAs, mRNAs, and the mRNAs targeted to the RISC, without cross-linking mRNAs to RISC proteins. Deep sequencing methods are used to determine RNA abundances, delivering unbiased, quantitative RNA data limited only by their annotation in the genome of interest. We describe the laboratory bench steps required to perform these experiments, experimental design strategies to achieve an appropriate number of sequencing reads per biological replicate, and computer-based processing tools and procedures to convert large raw sequencing data files into gene expression measures useful for differential expression analyses.
Yoo, Soonmoon; Kim, Hak Hee; Kim, Paul; Donnelly, Christopher J.; Kalinski, Ashley L.; Vuppalanchi, Deepika; Park, Michael; Lee, Seung Joon; Merianda, Tanuja T.; Perrone-Bizzozero, Nora I.; Twiss, Jeffery L.
2013-01-01
Localized translation of axonal mRNAs contributes to developmental and regenerative axon growth. Although untranslated regions (UTRs) of many different axonal mRNAs appear to drive their localization, there has been no consensus RNA structure responsible for this localization. We recently showed that limited expression of ZBP1 protein restricts axonal localization of both β-actin and GAP-43 mRNAs. β-actin 3′UTR has a defined element for interaction with ZBP1, but GAP-43 mRNA shows no homology to this RNA sequence. Here, we show that an AU-rich element (ARE) in GAP-43’s 3′UTR is necessary and sufficient for its axonal localization. Axonal GAP-43 mRNA levels increase after in vivo injury, and GAP-43 mRNA shows an increased half-life in regenerating axons. GAP-43 mRNA interacts with both HuD and ZBP1, and HuD and ZBP1 coimmunoprecipitate in an RNA-dependent fashion. Reporter mRNA with the GAP-43 ARE competes with endogenous β-actin mRNA for axonal localization and decreases axon length and branching similar to the β-actin 3′UTR competing with endogenous GAP-43 mRNA. Conversely, overexpressing GAP-43 coding sequence with it’s 3′UTR ARE increases axonal elongation and this effect is lost when just the ARE is deleted from GAP-43’s 3′UTR. PMID:23586486
The Structure and Function of the Rous Sarcoma virus RNA Stability Element
Withers, Johanna B.; Beemon, Karen L.
2013-01-01
For simple retroviruses, such as the Rous sarcoma virus (RSV), post-transcriptional control elements regulate viral RNA splicing, export, stability, and packaging into virions. These RNA sequences interact with cellular host proteins to regulate and facilitate productive viral infections. One such element, known as the RSV stability element (RSE), is required for maintaining stability of the full-length unspliced RNA. This viral RNA serves as the mRNA for the Gag and Pol proteins and also as the genome packaged in progeny virions. When the RSE is deleted from the viral RNA, the unspliced RNA becomes unstable and is degraded in a Upf1-dependent manner. Current evidence suggests that the RSE inhibits recognition of the viral gag termination codon by the nonsense-mediated mRNA decay (NMD) pathway. We believe that the RSE acts as an insulator to NMD, thereby preventing at least one of the required functional steps that target an mRNA for degradation. Here, we discuss the history of the RSE and the current model of how the RSE is interacting with cellular NMD factors. PMID:21769913
Kelly, Shannan; Yamamoto, Hideki
2008-01-01
Purpose We previously reported the differential expression and translation of mRNA and protein in dark- and light-adapted octopus retinas, which may result from cytoplasmic polyadenylation element (CPE)–dependent mRNA masking and unmasking. Here we investigate the presence of CPEs in α-tubulin and S-crystallin mRNA and report the identification of cytoplasmic polyadenylation element binding protein (CPEB) in light- and dark-adapted octopus retinas. Methods 3’-RACE and sequencing were used to isolate and analyze the 3’-UTRs of α-tubulin and S-crystallin mRNA. Total retinal protein isolated from light- and dark-adapted octopus retinas was subjected to western blot analysis followed by CPEB antibody detection, PEP-171 inhibition of CPEB, and dephosphorylation of CPEB. Results The following CPE-like sequence was detected in the 3’-UTR of isolated long S-crystallin mRNA variants: UUUAACA. No CPE or CPE-like sequences were detected in the 3’-UTRs of α-tubulin mRNA or of the short S-crystallin mRNA variants. Western blot analysis detected CPEB as two putative bands migrating between 60-80 kDa, while a third band migrated below 30 kDa in dark- and light-adapted retinas. Conclusions The detection of CPEB and the identification of the putative CPE-like sequences in the S-crystallin 3’-UTR suggest that CPEB may be involved in the activation of masked S-crystallin mRNA, but not in the regulation of α-tubulin mRNA, resulting in increased S-crystallin protein synthesis in dark-adapted octopus retinas. PMID:18682811
Means, A L; Farnham, P J
1990-02-01
We have identified a sequence element that specifies the position of transcription initiation for the dihydrofolate reductase gene. Unlike the functionally analogous TATA box that directs RNA polymerase II to initiate transcription 30 nucleotides downstream, the positioning element of the dihydrofolate reductase promoter is located directly at the site of transcription initiation. By using DNase I footprint analysis, we have shown that a protein binds to this initiator element. Transcription initiated at the dihydrofolate reductase initiator element when 28 nucleotides were inserted between it and all other upstream sequences, or when it was placed on either side of the DNA helix, suggesting that there is no strict spatial requirement between the initiator and an upstream element. Although neither a single Sp1-binding site nor a single initiator element was sufficient for transcriptional activity, the combination of one Sp1-binding site and the dihydrofolate reductase initiator element cloned into a plasmid vector resulted in transcription starting at the initiator element. We have also shown that the simian virus 40 late major initiation site has striking sequence homology to the dihydrofolate reductase initiation site and that the same, or a similar, protein binds to both sites. Examination of the sequences at other RNA polymerase II initiation sites suggests that we have identified an element that is important in the transcription of other housekeeping genes. We have thus named the protein that binds to the initiator element HIP1 (Housekeeping Initiator Protein 1).
Cell cycle-dependent transcription factors control the expression of yeast telomerase RNA.
Dionne, Isabelle; Larose, Stéphanie; Dandjinou, Alain T; Abou Elela, Sherif; Wellinger, Raymund J
2013-07-01
Telomerase is a specialized ribonucleoprotein that adds repeated DNA sequences to the ends of eukaryotic chromosomes to preserve genome integrity. Some secondary structure features of the telomerase RNA are very well conserved, and it serves as a central scaffold for the binding of associated proteins. The Saccharomyces cerevisiae telomerase RNA, TLC1, is found in very low copy number in the cell and is the limiting component of the known telomerase holoenzyme constituents. The reasons for this low abundance are unclear, but given that the RNA is very stable, transcriptional control mechanisms must be extremely important. Here we define the sequences forming the TLC1 promoter and identify the elements required for its low expression level, including enhancer and repressor elements. Within an enhancer element, we found consensus sites for Mbp1/Swi4 association, and chromatin immunoprecipitation (ChIP) assays confirmed the binding of Mbp1 and Swi4 to these sites of the TLC1 promoter. Furthermore, the enhancer element conferred cell cycle-dependent regulation to a reporter gene, and mutations in the Mbp1/Swi4 binding sites affected the levels of telomerase RNA and telomere length. Finally, ChIP experiments using a TLC1 RNA-binding protein as target showed cell cycle-dependent transcription of the TLC1 gene. These results indicate that the budding yeast TLC1 RNA is transcribed in a cell cycle-dependent fashion late in G1 and may be part of the S phase-regulated group of genes involved in DNA replication.
Bousios, Alexandros; Diez, Concepcion M; Takuno, Shohei; Bystry, Vojtech; Darzentas, Nikos; Gaut, Brandon S
2016-02-01
Transposable elements (TEs) proliferate within the genome of their host, which responds by silencing them epigenetically. Much is known about the mechanisms of silencing in plants, particularly the role of siRNAs in guiding DNA methylation. In contrast, little is known about siRNA targeting patterns along the length of TEs, yet this information may provide crucial insights into the dynamics between hosts and TEs. By focusing on 6456 carefully annotated, full-length Sirevirus LTR retrotransposons in maize, we show that their silencing associates with underlying characteristics of the TE sequence and also uncover three features of the host-TE interaction. First, siRNA mapping varies among families and among elements, but particularly along the length of elements. Within the cis-regulatory portion of the LTRs, a complex palindrome-rich region acts as a hotspot of both siRNA matching and sequence evolution. These patterns are consistent across leaf, tassel, and immature ear libraries, but particularly emphasized for floral tissues and 21- to 22-nt siRNAs. Second, this region has the ability to form hairpins, making it a potential template for the production of miRNA-like, hairpin-derived small RNAs. Third, Sireviruses are targeted by siRNAs as a decreasing function of their age, but the oldest elements remain highly targeted, partially by siRNAs that cross-map to the youngest elements. We show that the targeting of older Sireviruses reflects their conserved palindromes. Altogether, we hypothesize that the palindromes aid the silencing of active elements and influence transposition potential, siRNA targeting levels, and ultimately the fate of an element within the genome. © 2016 Bousios et al.; Published by Cold Spring Harbor Laboratory Press.
Lan, Susan; Kamel, Wael; Punga, Tanel; Akusjärvi, Göran
2017-02-28
The adenovirus L4-22K protein both activates and suppresses transcription from the adenovirus major late promoter (MLP) by binding to DNA elements located downstream of the MLP transcriptional start site: the so-called DE element (positive) and the R1 region (negative). Here we show that L4-22K preferentially binds to the RNA form of the R1 region, both to the double-stranded RNA and the single-stranded RNA of the same polarity as the nascent MLP transcript. Further, L4-22K binds to a 5΄-CAAA-3΄ motif in the single-stranded RNA, which is identical to the sequence motif characterized for L4-22K DNA binding. L4-22K binding to single-stranded RNA results in an enhancement of U1 snRNA recruitment to the major late first leader 5΄ splice site. This increase in U1 snRNA binding results in a suppression of MLP transcription and a concurrent stimulation of major late first intron splicing. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Repetitive element transcripts are elevated in the brain of C9orf72 ALS/FTLD patients.
Prudencio, Mercedes; Gonzales, Patrick K; Cook, Casey N; Gendron, Tania F; Daughrity, Lillian M; Song, Yuping; Ebbert, Mark T W; van Blitterswijk, Marka; Zhang, Yong-Jie; Jansen-West, Karen; Baker, Matthew C; DeTure, Michael; Rademakers, Rosa; Boylan, Kevin B; Dickson, Dennis W; Petrucelli, Leonard; Link, Christopher D
2017-09-01
Significant transcriptome alterations are detected in the brain of patients with amyotrophic lateral sclerosis (ALS), including carriers of the C9orf72 repeat expansion and C9orf72-negative sporadic cases. Recently, the expression of repetitive element transcripts has been associated with toxicity and, while increased repetitive element expression has been observed in several neurodegenerative diseases, little is known about their contribution to ALS. To assess whether aberrant expression of repetitive element sequences are observed in ALS, we analysed RNA sequencing data from C9orf72-positive and sporadic ALS cases, as well as healthy controls. Transcripts from multiple classes and subclasses of repetitive elements (LINEs, endogenous retroviruses, DNA transposons, simple repeats, etc.) were significantly increased in the frontal cortex of C9orf72 ALS patients. A large collection of patient samples, representing both C9orf72 positive and negative ALS, ALS/FTLD, and FTLD cases, was used to validate the levels of several repetitive element transcripts. These analyses confirmed that repetitive element expression was significantly increased in C9orf72-positive compared to C9orf72-negative or control cases. While previous studies suggest an important link between TDP-43 and repetitive element biology, our data indicate that TDP-43 pathology alone is insufficient to account for the observed changes in repetitive elements in ALS/FTLD. Instead, we found that repetitive element expression positively correlated with RNA polymerase II activity in postmortem brain, and pharmacologic modulation of RNA polymerase II activity altered repetitive element expression in vitro. We conclude that increased RNA polymerase II activity in ALS/FTLD may lead to increased repetitive element transcript expression, a novel pathological feature of ALS/FTLD. © The Author 2017. Published by Oxford University Press.
A purified transcription factor (TIF-IB) binds to essential sequences of the mouse rDNA promoter.
Clos, J; Buttgereit, D; Grummt, I
1986-01-01
A transcription factor that is specific for mouse rDNA has been partially purified from Ehrlich ascites cells. This factor [designated transcription initiation factor (TIF)-IB] is required for accurate in vitro synthesis of mouse rRNA in addition to RNA polymerase I and another regulatory factor, TIF-IA. TIF-IB activity is present in extracts both from growing and nongrowing cells in comparable amounts. Prebinding competition experiments with wild-type and mutant templates suggest that TIF-IB interacts with the core control element of the rDNA promoter, which is located immediately upstream of the initiation site. The specific binding of TIF-IB to the RNA polymerase I promoter is demonstrated by exonuclease III protection experiments. The 3' border of the sequences protected by TIF-IB is shown to be on the coding strand at position -21 and on the noncoding strand at position -7. The results suggest that direct binding of TIF-IB to sequences in the core promoter element is the mechanism by which this factor imparts promoter selectivity to RNA polymerase I. Images PMID:3456157
Pavelitz, Thomas; Bailey, Arnold D.; Elco, Christopher P.; Weiner, Alan M.
2008-01-01
In mammals, small multigene families generate spliceosomal U snRNAs that are nearly as abundant as rRNA. Using the tandemly repeated human U2 genes as a model, we show by footprinting with DNase I and permanganate that nearly all sequences between the enhancer-like distal sequence element and the initiation site are protected during interphase whereas the upstream half of the U2 snRNA coding region is exposed. We also show by chromatin immunoprecipitation that the SNAPc complex, which binds the TATA-like proximal sequence element, is removed at metaphase but remains bound under conditions that induce locus-specific metaphase fragility of the U2 genes, such as loss of CSB, BRCA1, or BRCA2 function, treatment with actinomycin D, or overexpression of the tetrameric p53 C terminus. We propose that the U2 snRNA promoter establishes a persistently open state to facilitate rapid reinitiation and perhaps also to bypass TFIIH-dependent promoter melting; this open state would then be disassembled to allow metaphase chromatin condensation. PMID:18378697
Regulation of cytoplasmic mRNA decay
Schoenberg, Daniel R.; Maquat, Lynne E.
2012-01-01
Discoveries made over the past 20 years highlight the importance of mRNA decay as a means to modulate gene expression and thereby protein production. Up until recently, studies focused largely on identifying cis-acting sequences that serve as mRNA stability or instability elements, the proteins that bind these elements, how the process of translation influences mRNA decay, and the ribonucleases that catalyze decay. Now, current studies have begun to elucidate how the decay process is regulated. This review examines our current understanding of how mammalian-cell mRNA decay is controlled by different signaling pathways and lays out a framework for future research. PMID:22392217
Group II intron inhibits conjugative relaxase expression in bacteria by mRNA targeting
Piazza, Carol Lyn; Smith, Dorie
2018-01-01
Group II introns are mobile ribozymes that are rare in bacterial genomes, often cohabiting with various mobile elements, and seldom interrupting housekeeping genes. What accounts for this distribution has not been well understood. Here, we demonstrate that Ll.LtrB, the group II intron residing in a relaxase gene on a conjugative plasmid from Lactococcus lactis, inhibits its host gene expression and restrains the naturally cohabiting mobile element from conjugative horizontal transfer. We show that reduction in gene expression is mainly at the mRNA level, and results from the interaction between exon-binding sequences (EBSs) in the intron and intron-binding sequences (IBSs) in the mRNA. The spliced intron targets the relaxase mRNA and reopens ligated exons, causing major mRNA loss. Taken together, this study provides an explanation for the distribution and paucity of group II introns in bacteria, and suggests a potential force for those introns to evolve into spliceosomal introns. PMID:29905149
Group II intron inhibits conjugative relaxase expression in bacteria by mRNA targeting.
Qu, Guosheng; Piazza, Carol Lyn; Smith, Dorie; Belfort, Marlene
2018-06-15
Group II introns are mobile ribozymes that are rare in bacterial genomes, often cohabiting with various mobile elements, and seldom interrupting housekeeping genes. What accounts for this distribution has not been well understood. Here, we demonstrate that Ll.LtrB, the group II intron residing in a relaxase gene on a conjugative plasmid from Lactococcus lactis , inhibits its host gene expression and restrains the naturally cohabiting mobile element from conjugative horizontal transfer. We show that reduction in gene expression is mainly at the mRNA level, and results from the interaction between exon-binding sequences (EBSs) in the intron and intron-binding sequences (IBSs) in the mRNA. The spliced intron targets the relaxase mRNA and reopens ligated exons, causing major mRNA loss. Taken together, this study provides an explanation for the distribution and paucity of group II introns in bacteria, and suggests a potential force for those introns to evolve into spliceosomal introns. © 2018, Qu et al.
Complete Genomic Structure of the Bloom-forming Toxic Cyanobacterium Microcystis aeruginosa NIES-843
Kaneko, Takakazu; Nakajima, Nobuyoshi; Okamoto, Shinobu; Suzuki, Iwane; Tanabe, Yuuhiko; Tamaoki, Masanori; Nakamura, Yasukazu; Kasai, Fumie; Watanabe, Akiko; Kawashima, Kumiko; Kishida, Yoshie; Ono, Akiko; Shimizu, Yoshimi; Takahashi, Chika; Minami, Chiharu; Fujishiro, Tsunakazu; Kohara, Mitsuyo; Katoh, Midori; Nakazaki, Naomi; Nakayama, Shinobu; Yamada, Manabu; Tabata, Satoshi; Watanabe, Makoto M.
2007-01-01
Abstract The nucleotide sequence of the complete genome of a cyanobacterium, Microcystis aeruginosa NIES-843, was determined. The genome of M. aeruginosa is a single, circular chromosome of 5 842 795 base pairs (bp) in length, with an average GC content of 42.3%. The chromosome comprises 6312 putative protein-encoding genes, two sets of rRNA genes, 42 tRNA genes representing 41 tRNA species, and genes for tmRNA, the B subunit of RNase P, SRP RNA, and 6Sa RNA. Forty-five percent of the putative protein-encoding sequences showed sequence similarity to genes of known function, 32% were similar to hypothetical genes, and the remaining 23% had no apparent similarity to reported genes. A total of 688 kb of the genome, equivalent to 11.8% of the entire genome, were composed of both insertion sequences and miniature inverted-repeat transposable elements. This is indicative of a plasticity of the M. aeruginosa genome, through a mechanism that involves homologous recombination mediated by repetitive DNA elements. In addition to known gene clusters related to the synthesis of microcystin and cyanopeptolin, novel gene clusters that may be involved in the synthesis and modification of toxic small polypeptides were identified. Compared with other cyanobacteria, a relatively small number of genes for two component systems and a large number of genes for restriction-modification systems were notable characteristics of the M. aeruginosa genome. PMID:18192279
Sroubek, Jakub; Krishnan, Yamini; McDonald, Thomas V.
2013-01-01
Human ether-á-gogo-related gene (HERG) encodes a potassium channel that is highly susceptible to deleterious mutations resulting in susceptibility to fatal cardiac arrhythmias. Most mutations adversely affect HERG channel assembly and trafficking. Why the channel is so vulnerable to missense mutations is not well understood. Since nothing is known of how mRNA structural elements factor in channel processing, we synthesized a codon-modified HERG cDNA (HERG-CM) where the codons were synonymously changed to reduce GC content, secondary structure, and rare codon usage. HERG-CM produced typical IKr-like currents; however, channel synthesis and processing were markedly different. Translation efficiency was reduced for HERG-CM, as determined by heterologous expression, in vitro translation, and polysomal profiling. Trafficking efficiency to the cell surface was greatly enhanced, as assayed by immunofluorescence, subcellular fractionation, and surface labeling. Chimeras of HERG-NT/CM indicated that trafficking efficiency was largely dependent on 5′ sequences, while translation efficiency involved multiple areas. These results suggest that HERG translation and trafficking rates are independently governed by noncoding information in various regions of the mRNA molecule. Noncoding information embedded within the mRNA may play a role in the pathogenesis of hereditary arrhythmia syndromes and could provide an avenue for targeted therapeutics.—Sroubek, J., Krishnan, Y., McDonald, T V. Sequence- and structure-specific elements of HERG mRNA determine channel synthesis and trafficking efficiency. PMID:23608144
piRNA pathway targets active LINE1 elements to establish the repressive H3K9me3 mark in germ cells
Pezic, Dubravka; Manakov, Sergei A.; Sachidanandam, Ravi; Aravin, Alexei A.
2014-01-01
Transposable elements (TEs) occupy a large fraction of metazoan genomes and pose a constant threat to genomic integrity. This threat is particularly critical in germ cells, as changes in the genome that are induced by TEs will be transmitted to the next generation. Small noncoding piwi-interacting RNAs (piRNAs) recognize and silence a diverse set of TEs in germ cells. In mice, piRNA-guided transposon repression correlates with establishment of CpG DNA methylation on their sequences, yet the mechanism and the spectrum of genomic targets of piRNA silencing are unknown. Here we show that in addition to DNA methylation, the piRNA pathway is required to maintain a high level of the repressive H3K9me3 histone modification on long interspersed nuclear elements (LINEs) in germ cells. piRNA-dependent chromatin repression targets exclusively full-length elements of actively transposing LINE families, demonstrating the remarkable ability of the piRNA pathway to recognize active elements among the large number of genomic transposon fragments. PMID:24939875
In and out of the rRNA genes: characterization of Pokey elements in the sequenced Daphnia genome
2013-01-01
Background Only a few transposable elements are known to exhibit site-specific insertion patterns, including the well-studied R-element retrotransposons that insert into specific sites within the multigene rDNA. The only known rDNA-specific DNA transposon, Pokey (superfamily: piggyBac) is found in the freshwater microcrustacean, Daphnia pulex. Here, we present a genome-wide analysis of Pokey based on the recently completed whole genome sequencing project for D. pulex. Results Phylogenetic analysis of Pokey elements recovered from the genome sequence revealed the presence of four lineages corresponding to two divergent autonomous families and two related lineages of non-autonomous miniature inverted repeat transposable elements (MITEs). The MITEs are also found at the same 28S rRNA gene insertion site as the Pokey elements, and appear to have arisen as deletion derivatives of autonomous elements. Several copies of the full-length Pokey elements may be capable of producing an active transposase. Surprisingly, both families of Pokey possess a series of 200 bp repeats upstream of the transposase that is derived from the rDNA intergenic spacer (IGS). The IGS sequences within the Pokey elements appear to be evolving in concert with the rDNA units. Finally, analysis of the insertion sites of Pokey elements outside of rDNA showed a target preference for sites similar to the specific sequence that is targeted within rDNA. Conclusions Based on the target site preference of Pokey elements and the concerted evolution of a segment of the element with the rDNA unit, we propose an evolutionary path by which the ancestors of Pokey elements have invaded the rDNA niche. We discuss how specificity for the rDNA unit may have evolved and how this specificity has played a role in the long-term survival of these elements in the subgenus Daphnia. PMID:24059783
Ahmed, Ikhlak; Sarazin, Alexis; Bowler, Chris; Colot, Vincent; Quesneville, Hadi
2011-09-01
Transposable elements (TEs) and their relics play major roles in genome evolution. However, mobilization of TEs is usually deleterious and strongly repressed. In plants and mammals, this repression is typically associated with DNA methylation, but the relationship between this epigenetic mark and TE sequences has not been investigated systematically. Here, we present an improved annotation of TE sequences and use it to analyze genome-wide DNA methylation maps obtained at single-nucleotide resolution in Arabidopsis. We show that although the majority of TE sequences are methylated, ∼26% are not. Moreover, a significant fraction of TE sequences densely methylated at CG, CHG and CHH sites (where H = A, T or C) have no or few matching small interfering RNA (siRNAs) and are therefore unlikely to be targeted by the RNA-directed DNA methylation (RdDM) machinery. We provide evidence that these TE sequences acquire DNA methylation through spreading from adjacent siRNA-targeted regions. Further, we show that although both methylated and unmethylated TE sequences located in euchromatin tend to be more abundant closer to genes, this trend is least pronounced for methylated, siRNA-targeted TE sequences located 5' to genes. Based on these and other findings, we propose that spreading of DNA methylation through promoter regions explains at least in part the negative impact of siRNA-targeted TE sequences on neighboring gene expression.
Ou-Yang, Fangqian; Luo, Qing-Jun; Zhang, Yue; Richardson, Casey R.; Jiang, Yingwen; Rock, Christopher D.
2013-01-01
microRNAs (miRNAs) are a class of small RNAs (sRNAs) of ~21 nucleotides (nt) in length processed from foldback hairpins by DICER-LIKE1 (DCL1) or DCL4. They regulate the expression of target mRNAs by base pairing through RNA-Induced Silencing Complex (RISC). In the RISC, ARGONAUTE1 (AGO1) is the key protein that cleaves miRNA targets at position ten of a miRNA:target duplex. The authenticity of many annotated rice miRNA hairpins is under debate because of their homology to repeat sequences. Some of them, like miR1884b, have been removed from the current release of miRBase based on incomplete information. In this study, we investigated the association of transposable element (TE)-derived miRNAs with typical miRNA pathways (DCL1/4- and AGO1-dependent) using publicly available deep sequencing datasets. Seven miRNA hairpins with 13 unique sRNAs were specifically enriched in AGO1 immunoprecipitation samples and relatively reduced in DCL1/4 knockdown genotypes. Interestingly, these species are ~21-nt long, instead of 24-nt as annotated in miRBase and the literature. Their expression profiles meet current criteria for functional annotation of miRNAs. In addition, diagnostic cleavage tags were found in degradome datasets for predicted target mRNAs. Most of these miRNA hairpins share significant homology with miniature inverted-repeat transposable elements (MITEs), one type of abundant DNA transposons in rice. Finally, the root-specific production of a 24 nt miRNA-like sRNA was confirmed by RNA blot for a novel EST that maps to the 3'-UTR of a candidate pseudogene showing extensive sequence homology to miR1884b hairpin. Our data are consistent with the hypothesis that TEs can serve as a driving force for the evolution of some MIRNAs, where co-opting of DICER-LIKE1/4 processing and integration into AGO1 could exapt transcribed TE-associated hairpins into typical miRNA pathways. PMID:23420033
DOE Office of Scientific and Technical Information (OSTI.GOV)
Daniels, C.J.
1993-06-01
We have established that a 100 bp DNA fragment from the Haloferax volcanii tRNALys gene directs transcription in vivo. This element served as the starting point for a detailed analysis of the requirements for in vivo transcription. Among several gene tentatively identified as reporter elements, we selected a eukaryotic intron-containing tRNAPro gene for when it is driven by the H. volcanii tRNALys promoter fragment, produces a single small transcript. Transcript analysis, by Sl mapping and primer extension, showed that this RNA initiated at the expected tRNALys BoxB sequence and terminated in the tRNAPro RNA Pol III termination element present onmore » the DNA fragment. In initial studies we determined that the 3 inches proximal region of this tRNALys promoter element was sufficient for transcription initiation in vivo. This 40 bp region contains only the BoxA and BoxB regions and short purine rich regions 5 inches to the BoxA and BoxB sequence. Using the tRNAPro gene as the reporter and this minimal promoter, we performed a comprehensive analysis of the BoxA region. Each position of the BoxA region was converted to an four possible nucleotides and the transcription of 36 mutants was quantitated. Among the sites analyzed, only five of the positions showed high levels of discrimination; the preferred BoxA element was 5 inches-TT({sub T}/A)({sup A}/T) ANNNN-3 inches. Mutational analysis demonstrated that a transition from T-rich to A-rich sequences in the BoxA element is essential and that there is some flexibility in the location of the ``TA`` sequence. Additionally the TA sequence appears to determine the location of the transcription start site. The BoxA element defined in this study is similar to those observed for Sulfolobus and the methanogen promoters, and supports the hypothesis that a similar core promoter element is used by all archaeal RNA polymerases.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pelletier, J.; Kaplan, G.; Racaniello, V.R.
1988-03-01
Poliovirus polysomal RNA is naturally uncapped, and as such, its translation must bypass any 5' cap-dependent ribosome recognition event. To elucidate the manner by which poliovirus mRNA is translated, the authors determined the translational efficiencies of a series of deletion mutants within the 5' noncoding region of the mRNA. They found striking differences in translatability among the altered mRNAs when assayed in mock-infected and poliovirus-infected HeLa cell extracts. The results identify a functional cis-acting element within the 5' noncoding region of the poliovirus mRNA which enables it to translate in a cap-independent fashion. The major determinant of this element mapsmore » between nucleotides 320 and 631 of the 5' end of the poliovirus mRNA. They also show that this region (320 to 631), when fused to a heterologous mRNA, can function in cis to render the mRNA cap independent in translation.« less
Hüttenhofer, Alexander; Kiefmann, Martin; Meier-Ewert, Sebastian; O’Brien, John; Lehrach, Hans; Bachellerie, Jean-Pierre; Brosius, Jürgen
2001-01-01
In mouse brain cDNA libraries generated from small RNA molecules we have identified a total of 201 different expressed RNA sequences potentially encoding novel small non-messenger RNA species (snmRNAs). Based on sequence and structural motifs, 113 of these RNAs can be assigned to the C/D box or H/ACA box subclass of small nucleolar RNAs (snoRNAs), known as guide RNAs for rRNA. While 30 RNAs represent mouse homologues of previously identified human C/D or H/ACA snoRNAs, 83 correspond to entirely novel snoRNAs. Among these, for the first time, we identified four C/D box snoRNAs and four H/ACA box snoRNAs predicted to direct modifications within U2, U4 or U6 small nuclear RNAs (snRNAs). Furthermore, 25 snoRNAs from either class lacked antisense elements for rRNAs or snRNAs. Therefore, additional snoRNA targets have to be considered. Surprisingly, six C/D box snoRNAs and one H/ACA box snoRNA were expressed exclusively in brain. Of the 88 RNAs not belonging to either snoRNA subclass, at least 26 are probably derived from truncated heterogeneous nuclear RNAs (hnRNAs) or mRNAs. Short interspersed repetitive elements (SINEs) are located on five RNA sequences and may represent rare examples of transcribed SINEs. The remaining RNA species could not as yet be assigned either to any snmRNA class or to a part of a larger hnRNA/mRNA. It is likely that at least some of the latter will represent novel, unclassified snmRNAs. PMID:11387227
Frolov, I; McBride, M S; Rice, C M
1998-01-01
Pestiviruses, such as bovine viral diarrhea virus (BVDV), share many similarities with hepatitis C virus (HCV) yet are more amenable to virologic and genetic analysis. For both BVDV and HCV, translation is initiated via an internal ribosome entry site (IRES). Besides IRES function, the viral 5' nontranslated regions (NTRs) may also contain cis-acting RNA elements important for viral replication. A series of chimeric RNAs were used to examine the function of the BVDV 5' NTR. Our results show that: (1) the HCV and the encephalomyocarditis virus (EMCV) IRES element can functionally replace that of BVDV; (2) two 5' terminal hairpins in BVDV genomic RNA are important for efficient replication; (3) replacement of the entire BVDV 5' NTR with those of HCV or EMCV leads to severely impaired replication; (4) such replacement chimeras are unstable and efficiently replicating pseudorevertants arise; (5) pseudorevertant mutations involve deletion of 5' sequences and/or acquisition of novel 5' sequences such that the 5' terminal 3-4 bases of BVDV genome RNA are restored. Besides providing new insight into functional elements in the BVDV 5' NTR, these chimeras may prove useful as pestivirus vaccines and for screening and evaluation of anti-HCV IRES antivirals. PMID:9814762
Distinct families of cis-acting RNA replication elements epsilon from hepatitis B viruses
Chen, Augustine; Brown, Chris
2012-01-01
The hepadnavirus encapsidation signal, epsilon (ε), is an RNA structure located at the 5′ end of the viral pregenomic RNA. It is essential for viral replication and functions in polymerase protein binding and priming. This structure could also have potential regulatory roles in controlling the expression of viral replicative proteins. In addition to its structure, the primary sequence of this RNA element has crucial functional roles in the viral lifecycle. Although the ε elements in hepadnaviruses share common critical functions, there are some significant differences in mammalian and avian hepadnaviruses, which include both sequence and structural variations. Here we present several covariance models for ε elements from the Hepadnaviridae. The model building included experimentally determined data from previous studies using chemical probing and NMR analysis. These models have sufficient similarity to comprise a clan. The clan has in common a highly conserved overall structure consisting of a lower-stem, bulge, upper-stem and apical-loop. The models differ in functionally critical regions—notably the two types of avian ε elements have a tetra-loop (UGUU) including a non-canonical UU base pair, while the hepatitis B virus (HBV) epsilon has a tri-loop (UGU). The avian epsilon elements have a less stable dynamic structure in the upper stem. Comparisons between these models and all other Rfam models, and searches of genomes, showed these structures are specific to the Hepadnaviridae. Two family models and the clan are available from the Rfam database. PMID:22418844
Jakubczak, J. L.; Zenni, M. K.; Woodruff, R. C.; Eickbush, T. H.
1992-01-01
R1 and R2 are distantly related non-long terminal repeat retrotransposable elements each of which inserts into a specific site in the 28S rRNA genes of most insects. We have analyzed aspects of R1 and R2 abundance and sequence variation in 27 geographical isolates of Drosophila melanogaster. The fraction of 28S rRNA genes containing these elements varied greatly between strains, 17-67% for R1 elements and 2-28% for R2 elements. The total percentage of the rDNA repeats inserted ranged from 32 to 77%. The fraction of the rDNA repeats that contained both of these elements suggested that R1 and R2 exhibit neither an inhibition of nor preference for insertion into a 28S gene already containing the other type of element. Based on the conservation of restriction sites in the elements of all strains, and sequence analysis of individual elements from three strains, nucleotide divergence is very low for R1 and R2 elements within or between strains (<0.6%). This sequence uniformity is the expected result of the forces of concerted evolution (unequal crossovers and gene conversion) which act on the rRNA genes themselves. Evidence for the role of retrotransposition in the turnover of R1 and R2 was obtained by using naturally occurring 5' length polymorphisms of the elements as markers for independent transposition events. The pattern of these different length 5' truncations of R1 and R2 was found to be diverse and unique to most strains analyzed. Because recombination can only, with time, amplify or eliminate those length variants already present, the diversity found in each strain suggests that retrotransposition has played a critical role in maintaining these elements in the rDNA repeats of D. melanogaster. PMID:1317313
Arimbasseri, Aneeshkumar G.; Maraia, Richard J.
2015-01-01
SUMMARY Understanding the mechanism of transcription termination by a eukaryotic RNA polymerase (RNAP) has been limited by lack of a characterizable intermediate that reflects transition from an elongation complex to a true termination event. While other multisubunit RNAPs require multipartite cis-signals and/or ancillary factors to mediate pausing and release of the nascent transcript from the clutches of these enzymes, RNAP III does so with precision and efficiency on a simple oligo(dT) tract, independent of other cis-elements or trans-factors. We report a RNAP III pre-termination complex that reveals termination mechanisms controlled by sequence-specific elements in the non-template strand. Furthermore, the TFIIF-like, RNAP III subunit, C37 is required for this function of the non-template strand signal. The results reveal the RNAP III terminator as an information-rich control element. While the template strand promotes destabilization via a weak oligo(rU:dA) hybrid, the non-template strand provides distinct sequence-specific destabilizing information through interactions with the C37 subunit. PMID:25959395
Isolation and characterization of the gene coding for Escherichia coli arginyl-tRNA synthetase.
Eriani, G; Dirheimer, G; Gangloff, J
1989-01-01
The gene coding for Escherichia coli arginyl-tRNA synthetase (argS) was isolated as a fragment of 2.4 kb after analysis and subcloning of recombinant plasmids from the Clarke and Carbon library. The clone bearing the gene overproduces arginyl-tRNA synthetase by a factor 100. This means that the enzyme represents more than 20% of the cellular total protein content. Sequencing revealed that the fragment contains a unique open reading frame of 1734 bp flanked at its 5' and 3' ends respectively by 247 bp and 397 bp. The length of the corresponding protein (577 aa) is well consistent with earlier Mr determination (about 70 kd). Primer extension analysis of the ArgRS mRNA by reverse transcriptase, located its 5' end respectively at 8 and 30 nucleotides downstream of a TATA and a TTGAC like element (CTGAC) and 60 nucleotides upstream of the unusual translation initiation codon GUG; nuclease S1 analysis located the 3'-end at 48 bp downstream of the translation termination codon. argS has a codon usage pattern typical for highly expressed E. coli genes. With the exception of the presence of a HVGH sequence similar to the HIGH consensus element, ArgRS has no relevant sequence homologies with other aminoacyl-tRNA synthetases. Images PMID:2668891
L1-mediated retrotransposition of murine B1 and B2 SINEs recapitulated in cultured cells.
Dewannieux, Marie; Heidmann, Thierry
2005-06-03
SINEs are short interspersed nucleotide elements with transpositional activity, present at a high copy number (up to a million) in mammalian genomes. They are 80-400 bp long, non-coding sequences which derive either from the 7SL RNA (e.g. human Alus, murine B1s) or tRNA (e.g. murine B2s) polymerase III-driven genes. We have previously demonstrated that Alus very efficiently divert the enzymatic machinery of the autonomous L1 LINE (long interspersed nucleotide element) retrotransposons to transpose at a high rate. Here we show, using an ex vivo assay for transposition, that both B1 and B2 SINEs can be mobilized by murine LINEs, with the hallmarks of a bona fide retrotransposition process, including target site duplications of varying lengths and integrations into A-rich sequences. Despite different phylogenetic origins, transposition of the tRNA-derived B2 sequences is as efficient as that of the human Alus, whereas that of B1s is 20-100-fold lower despite a similar high copy number of these elements in the mouse genome. We provide evidence, via an appropriate nucleotide substitution within the B1 sequence in a domain essential for its intracellular targeting, that the current B1 SINEs are not optimal for transposition, a feature most probably selected for the host sake in the course of evolution.
Functions of the 3′ and 5′ genome RNA regions of members of the genus Flavivirus
Brinton, Margo A.; Basu, Mausumi
2015-01-01
The positive sense genomes of members of the genus Flavivirus in the family Flaviviridae are ~11 kb nts in length and have a 5′ type I cap but no 3′ poly A. The 5′ and 3′ terminal regions contain short conserved sequences that are proposed to be repeated remnants of an ancient sequence. However, the functions of most of these conserved sequences have not yet been determined. The terminal regions of the genome also contain multiple conserved RNA structures. Functional data for many of these structures has been obtained. Three sets of complementary 3′ and 5′ terminal region sequences, some of which are located in conserved RNA structures, interact to form a panhandle structure that is required for initiation of minus strand RNA synthesis with the 5′ terminal structure functioning as the promoter. How the switch from the terminal RNA structure base pairing to the long distance RNA-RNA interaction is triggered and regulated is not well understood but evidence suggests involvement of a cell protein binding to three sites on the 3′ terminal RNA structures and a cis-acting metastable 3′ RNA element in the 3′ terminal structure. Cell proteins may also be involved in facilitating exponential replication of nascent genomic RNA within replication vesicles at later times of infection cycle. Other conserved RNA structures and/or sequences in the 5′ and 3′ terminal regions have been proposed to regulate genome translation. Additional functions of the 5′ and 3′ terminal sequences have also been reported. PMID:25683510
Identification of 15 candidate structured noncoding RNA motifs in fungi by comparative genomics.
Li, Sanshu; Breaker, Ronald R
2017-10-13
With the development of rapid and inexpensive DNA sequencing, the genome sequences of more than 100 fungal species have been made available. This dataset provides an excellent resource for comparative genomics analyses, which can be used to discover genetic elements, including noncoding RNAs (ncRNAs). Bioinformatics tools similar to those used to uncover novel ncRNAs in bacteria, likewise, should be useful for searching fungal genomic sequences, and the relative ease of genetic experiments with some model fungal species could facilitate experimental validation studies. We have adapted a bioinformatics pipeline for discovering bacterial ncRNAs to systematically analyze many fungal genomes. This comparative genomics pipeline integrates information on conserved RNA sequence and structural features with alternative splicing information to reveal fungal RNA motifs that are candidate regulatory domains, or that might have other possible functions. A total of 15 prominent classes of structured ncRNA candidates were identified, including variant HDV self-cleaving ribozyme representatives, atypical snoRNA candidates, and possible structured antisense RNA motifs. Candidate regulatory motifs were also found associated with genes for ribosomal proteins, S-adenosylmethionine decarboxylase (SDC), amidase, and HexA protein involved in Woronin body formation. We experimentally confirm that the variant HDV ribozymes undergo rapid self-cleavage, and we demonstrate that the SDC RNA motif reduces the expression of SAM decarboxylase by translational repression. Furthermore, we provide evidence that several other motifs discovered in this study are likely to be functional ncRNA elements. Systematic screening of fungal genomes using a computational discovery pipeline has revealed the existence of a variety of novel structured ncRNAs. Genome contexts and similarities to known ncRNA motifs provide strong evidence for the biological and biochemical functions of some newly found ncRNA motifs. Although initial examinations of several motifs provide evidence for their likely functions, other motifs will require more in-depth analysis to reveal their functions.
Monarez, Roberto R.; Macdonald, Clinton C.; Dass, Brinda
2006-01-01
CstF-64 (cleavage stimulation factor-64), a major regulatory protein of polyadenylation, is absent during male meiosis. Therefore a paralogous variant, τCstF-64 is expressed in male germ cells to maintain normal spermatogenesis. Based on sequence differences between τCstF-64 and CstF-64, and on the high incidence of alternative polyadenylation in testes, we hypothesized that the RBDs (RNA-binding domains) of τCstF-64 and CstF-64 have different affinities for RNA elements. We quantified Kd values of CstF-64 and τCstF-64 RBDs for various ribopolymers using an RNA cross-linking assay. The two RBDs had similar affinities for poly(G)18, poly(A)18 or poly(C)18, with affinity for poly(C)18 being the lowest. However, CstF-64 had a higher affinity for poly(U)18 than τCstF-64, whereas it had a lower affinity for poly(GU)9. Changing Pro-41 to a serine residue in the CstF-64 RBD did not affect its affinity for poly(U)18, but changes in amino acids downstream of the C-terminal α-helical region decreased affinity towards poly(U)18. Thus we show that the two CstF-64 paralogues differ in their affinities for specific RNA sequences, and that the region C-terminal to the RBD is important in RNA sequence recognition. This supports the hypothesis that τCstF-64 promotes germ-cell-specific patterns of polyadenylation by binding to different downstream sequence elements. PMID:17029590
Eickbush, D. G.; Eickbush, T. H.
1995-01-01
R1 and R2 are non-long-terminal repeat retrotransposable elements that insert into specific sequences of insect 28S ribosomal RNA genes. These elements have been extensively described in Drosophila melanogaster. To determine whether these elements have been horizontally or vertically transmitted, we characterized R1 and R2 elements from the seven other members of the melanogaster species subgroup by genomic blotting and nucleotide sequencing. Each species was found to have homogeneous families of R1 and R2 elements with the exception of erecta and orena, which have no R2 elements. The DNA sequences of multiple R1 and R2 copies from each species indicated nucleotide divergence within each species averaged only 0.48% for R1 and 0.35% for R2, well below the level of divergence among the species. Most copies of R1 and R2 (40 of 47) sequenced from the seven species were potentially functional, as indicated by the absence of premature termination codons or translational frameshifts that would destroy the open reading frame of the element. The sequence relationships of both the R1 and R2 elements from the various members of the melanogaster subgroup closely followed that of the species phylogeny, suggesting that R1 and R2 have been stably maintained by vertical transmission since the origin of this species subgroup 17-20 million years ago. The remarkable stability of R1 and R2, compared to what has been suggested for transposable elements that insert at multiple locations in these same species, may be due to their unique specificity for sites in the rRNA gene locus. Under low copy number conditions, when it is essential for any mobile element to transpose, the insertion specificities of R1 and R2 ensure uniform developmentally regulated target sites that can be occupied with little or no detrimental effect on the host. PMID:7713424
RNA editing of non-coding RNA and its role in gene regulation.
Daniel, Chammiran; Lagergren, Jens; Öhman, Marie
2015-10-01
It has for a long time been known that repetitive elements, particularly Alu sequences in human, are edited by the adenosine deaminases acting on RNA, ADAR, family. The functional interpretation of these events has been even more difficult than that of editing events in coding sequences, but today there is an emerging understanding of their downstream effects. A surprisingly large fraction of the human transcriptome contains inverted Alu repeats, often forming long double stranded structures in RNA transcripts, typically occurring in introns and UTRs of protein coding genes. Alu repeats are also common in other primates, and similar inverted repeats can frequently be found in non-primates, although the latter are less prone to duplex formation. In human, as many as 700,000 Alu elements have been identified as substrates for RNA editing, of which many are edited at several sites. In fact, recent advancements in transcriptome sequencing techniques and bioinformatics have revealed that the human editome comprises at least a hundred million adenosine to inosine (A-to-I) editing sites in Alu sequences. Although substantial additional efforts are required in order to map the editome, already present knowledge provides an excellent starting point for studying cis-regulation of editing. In this review, we will focus on editing of long stem loop structures in the human transcriptome and how it can effect gene expression. Copyright © 2015 Elsevier B.V. and Société Française de Biochimie et Biologie Moléculaire (SFBBM). All rights reserved.
Yang, Qin; Gilmartin, Gregory M.; Doublié, Sylvie
2010-01-01
Human Cleavage Factor Im (CFIm) is an essential component of the pre-mRNA 3′ processing complex that functions in the regulation of poly(A) site selection through the recognition of UGUA sequences upstream of the poly(A) site. Although the highly conserved 25 kDa subunit (CFIm25) of the CFIm complex possesses a characteristic α/β/α Nudix fold, CFIm25 has no detectable hydrolase activity. Here we report the crystal structures of the human CFIm25 homodimer in complex with UGUAAA and UUGUAU RNA sequences. CFIm25 is the first Nudix protein to be reported to bind RNA in a sequence-specific manner. The UGUA sequence contributes to binding specificity through an intramolecular G:A Watson–Crick/sugar-edge base interaction, an unusual pairing previously found to be involved in the binding specificity of the SAM-III riboswitch. The structures, together with mutational data, suggest a novel mechanism for the simultaneous sequence-specific recognition of two UGUA elements within the pre-mRNA. Furthermore, the mutually exclusive binding of RNA and the signaling molecule Ap4A (diadenosine tetraphosphate) by CFIm25 suggests a potential role for small molecules in the regulation of mRNA 3′ processing. PMID:20479262
Yang, Qin; Gilmartin, Gregory M; Doublié, Sylvie
2010-06-01
Human Cleavage Factor Im (CFI(m)) is an essential component of the pre-mRNA 3' processing complex that functions in the regulation of poly(A) site selection through the recognition of UGUA sequences upstream of the poly(A) site. Although the highly conserved 25 kDa subunit (CFI(m)25) of the CFI(m) complex possesses a characteristic alpha/beta/alpha Nudix fold, CFI(m)25 has no detectable hydrolase activity. Here we report the crystal structures of the human CFI(m)25 homodimer in complex with UGUAAA and UUGUAU RNA sequences. CFI(m)25 is the first Nudix protein to be reported to bind RNA in a sequence-specific manner. The UGUA sequence contributes to binding specificity through an intramolecular G:A Watson-Crick/sugar-edge base interaction, an unusual pairing previously found to be involved in the binding specificity of the SAM-III riboswitch. The structures, together with mutational data, suggest a novel mechanism for the simultaneous sequence-specific recognition of two UGUA elements within the pre-mRNA. Furthermore, the mutually exclusive binding of RNA and the signaling molecule Ap(4)A (diadenosine tetraphosphate) by CFI(m)25 suggests a potential role for small molecules in the regulation of mRNA 3' processing.
Splicing predictions reliably classify different types of alternative splicing
Busch, Anke; Hertel, Klemens J.
2015-01-01
Alternative splicing is a key player in the creation of complex mammalian transcriptomes and its misregulation is associated with many human diseases. Multiple mRNA isoforms are generated from most human genes, a process mediated by the interplay of various RNA signature elements and trans-acting factors that guide spliceosomal assembly and intron removal. Here, we introduce a splicing predictor that evaluates hundreds of RNA features simultaneously to successfully differentiate between exons that are constitutively spliced, exons that undergo alternative 5′ or 3′ splice-site selection, and alternative cassette-type exons. Surprisingly, the splicing predictor did not feature strong discriminatory contributions from binding sites for known splicing regulators. Rather, the ability of an exon to be involved in one or multiple types of alternative splicing is dictated by its immediate sequence context, mainly driven by the identity of the exon's splice sites, the conservation around them, and its exon/intron architecture. Thus, the splicing behavior of human exons can be reliably predicted based on basic RNA sequence elements. PMID:25805853
Yang, Q; Radebaugh, C A; Kubaska, W; Geiss, G K; Paule, M R
1995-11-11
The intergenic spacer (IGS) of Acanthamoeba castellanii rRNA genes contains repeated elements which are weak enhancers for transcription by RNA polymerase I. A protein, EBF, was identified and partially purified which binds to the enhancers and to several other sequences within the IGS, but not to other DNA fragments, including the rRNA core promoter. No consensus binding sequence could be discerned in these fragments and bound factor is in rapid equilibrium with unbound. EBF has functional characteristics similar to vertebrate upstream binding factors (UBF). Not only does it bind to the enhancer and other IGS elements, but it also stimulates binding of TIF-IB, the fundamental transcription initiation factor, to the core promoter and stimulates transcription from the promoter. Attempts to identify polypeptides with epitopes similar to rat or Xenopus laevis UBF suggest that structurally the protein from A.castellanii is not closely related to vertebrate UBF.
Yang, Q; Radebaugh, C A; Kubaska, W; Geiss, G K; Paule, M R
1995-01-01
The intergenic spacer (IGS) of Acanthamoeba castellanii rRNA genes contains repeated elements which are weak enhancers for transcription by RNA polymerase I. A protein, EBF, was identified and partially purified which binds to the enhancers and to several other sequences within the IGS, but not to other DNA fragments, including the rRNA core promoter. No consensus binding sequence could be discerned in these fragments and bound factor is in rapid equilibrium with unbound. EBF has functional characteristics similar to vertebrate upstream binding factors (UBF). Not only does it bind to the enhancer and other IGS elements, but it also stimulates binding of TIF-IB, the fundamental transcription initiation factor, to the core promoter and stimulates transcription from the promoter. Attempts to identify polypeptides with epitopes similar to rat or Xenopus laevis UBF suggest that structurally the protein from A.castellanii is not closely related to vertebrate UBF. Images PMID:7501455
Clark, A M; Jacobsen, K R; Bostwick, D E; Dannenhoffer, J M; Skaggs, M I; Thompson, G A
1997-07-01
Sieve elements in the phloem of most angiosperms contain proteinaceous filaments and aggregates called P-protein. In the genus Cucurbita, these filaments are composed of two major proteins: PP1, the phloem filament protein, and PP2, the phloem lactin. The gene encoding the phloem filament protein in pumpkin (Cucurbita maxima Duch.) has been isolated and characterized. Nucleotide sequence analysis of the reconstructed gene gPP1 revealed a continuous 2430 bp protein coding sequence, with no introns, encoding an 809 amino acid polypeptide. The deduced polypeptide had characteristics of PP1 and contained a 15 amino acid sequence determined by N-terminal peptide sequence analysis of PP1. The sequence of PP1 was highly repetitive with four 200 amino acid sequence domains containing structural motifs in common with cysteine proteinase inhibitors. Expression of the PP1 gene was detected in roots, hypocotyls, cotyledons, stems, and leaves of pumpkin plants. PP1 and its mRNA accumulated in pumpkin hypocotyls during the period of rapid hypocotyl elongation after which mRNA levels declined, while protein levels remained elevated. PP1 was immunolocalized in slime plugs and P-protein bodies in sieve elements of the phloem. Occasionally, PP1 was detected in companion cells. PP1 mRNA was localized by in situ hybridization in companion cells at early stages of vascular differentiation. The developmental accumulation and localization of PP1 and its mRNA paralleled the phloem lactin, further suggesting an interaction between these phloem-specific proteins.
RAG-3D: A search tool for RNA 3D substructures
Zahran, Mai; Sevim Bayrak, Cigdem; Elmetwaly, Shereef; ...
2015-08-24
In this study, to address many challenges in RNA structure/function prediction, the characterization of RNA's modular architectural units is required. Using the RNA-As-Graphs (RAG) database, we have previously explored the existence of secondary structure (2D) submotifs within larger RNA structures. Here we present RAG-3D—a dataset of RNA tertiary (3D) structures and substructures plus a web-based search tool—designed to exploit graph representations of RNAs for the goal of searching for similar 3D structural fragments. The objects in RAG-3D consist of 3D structures translated into 3D graphs, cataloged based on the connectivity between their secondary structure elements. Each graph is additionally describedmore » in terms of its subgraph building blocks. The RAG-3D search tool then compares a query RNA 3D structure to those in the database to obtain structurally similar structures and substructures. This comparison reveals conserved 3D RNA features and thus may suggest functional connections. Though RNA search programs based on similarity in sequence, 2D, and/or 3D structural elements are available, our graph-based search tool may be advantageous for illuminating similarities that are not obvious; using motifs rather than sequence space also reduces search times considerably. Ultimately, such substructuring could be useful for RNA 3D structure prediction, structure/function inference and inverse folding.« less
RAG-3D: a search tool for RNA 3D substructures
Zahran, Mai; Sevim Bayrak, Cigdem; Elmetwaly, Shereef; Schlick, Tamar
2015-01-01
To address many challenges in RNA structure/function prediction, the characterization of RNA's modular architectural units is required. Using the RNA-As-Graphs (RAG) database, we have previously explored the existence of secondary structure (2D) submotifs within larger RNA structures. Here we present RAG-3D—a dataset of RNA tertiary (3D) structures and substructures plus a web-based search tool—designed to exploit graph representations of RNAs for the goal of searching for similar 3D structural fragments. The objects in RAG-3D consist of 3D structures translated into 3D graphs, cataloged based on the connectivity between their secondary structure elements. Each graph is additionally described in terms of its subgraph building blocks. The RAG-3D search tool then compares a query RNA 3D structure to those in the database to obtain structurally similar structures and substructures. This comparison reveals conserved 3D RNA features and thus may suggest functional connections. Though RNA search programs based on similarity in sequence, 2D, and/or 3D structural elements are available, our graph-based search tool may be advantageous for illuminating similarities that are not obvious; using motifs rather than sequence space also reduces search times considerably. Ultimately, such substructuring could be useful for RNA 3D structure prediction, structure/function inference and inverse folding. PMID:26304547
RAG-3D: A search tool for RNA 3D substructures
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zahran, Mai; Sevim Bayrak, Cigdem; Elmetwaly, Shereef
In this study, to address many challenges in RNA structure/function prediction, the characterization of RNA's modular architectural units is required. Using the RNA-As-Graphs (RAG) database, we have previously explored the existence of secondary structure (2D) submotifs within larger RNA structures. Here we present RAG-3D—a dataset of RNA tertiary (3D) structures and substructures plus a web-based search tool—designed to exploit graph representations of RNAs for the goal of searching for similar 3D structural fragments. The objects in RAG-3D consist of 3D structures translated into 3D graphs, cataloged based on the connectivity between their secondary structure elements. Each graph is additionally describedmore » in terms of its subgraph building blocks. The RAG-3D search tool then compares a query RNA 3D structure to those in the database to obtain structurally similar structures and substructures. This comparison reveals conserved 3D RNA features and thus may suggest functional connections. Though RNA search programs based on similarity in sequence, 2D, and/or 3D structural elements are available, our graph-based search tool may be advantageous for illuminating similarities that are not obvious; using motifs rather than sequence space also reduces search times considerably. Ultimately, such substructuring could be useful for RNA 3D structure prediction, structure/function inference and inverse folding.« less
Pasion, S G; Hines, J C; Ou, X; Mahmood, R; Ray, D S
1996-01-01
Gene expression in trypanosomatids appears to be regulated largely at the posttranscriptional level and involves maturation of mRNA precursors by trans splicing of a 39-nucleotide miniexon sequence to the 5' end of the mRNA and cleavage and polyadenylation at the 3' end of the mRNA. To initiate the identification of sequences involved in the periodic expression of DNA replication genes in trypanosomatids, we have mapped splice acceptor sites in the 5' flanking region of the TOP2 gene, which encodes the kinetoplast DNA topoisomerase, and have carried out deletion analysis of this region on a plasmid-encoded TOP2 gene. Block deletions within the 5' untranslated region (UTR) identified two regions (-608 to -388 and -387 to -186) responsible for periodic accumulation of the mRNA. Deletion of one or the other of these sequences had no effect on periodic expression of the mRNA, while deletion of both regions resulted in constitutive expression of the mRNA throughout the cell cycle. Subcloning of these sequences into the 5' UTR of a construct lacking both regions of the TOP2 5' UTR has shown that an octamer consensus sequence present in the 5' UTR of the TOP2, RPA1, and DHFR-TS mRNAs is required for normal cycling of the TOP2 mRNA. Mutation of the consensus octamer sequence in the TOP2 5' UTR in a plasmid construct containing only a single consensus octamer and that shows normal cycling of the plasmid-encoded TOP2 mRNA resulted in substantial reduction of the cycling of the mRNA level. These results imply a negative regulation of TOP2 mRNA during the cell cycle by a mechanism involving redundant elements containing one or more copies of a conserved octamer sequence within the 5' UTR of TOP2 mRNA. PMID:8943327
Pasion, S G; Hines, J C; Ou, X; Mahmood, R; Ray, D S
1996-12-01
Gene expression in trypanosomatids appears to be regulated largely at the posttranscriptional level and involves maturation of mRNA precursors by trans splicing of a 39-nucleotide miniexon sequence to the 5' end of the mRNA and cleavage and polyadenylation at the 3' end of the mRNA. To initiate the identification of sequences involved in the periodic expression of DNA replication genes in trypanosomatids, we have mapped splice acceptor sites in the 5' flanking region of the TOP2 gene, which encodes the kinetoplast DNA topoisomerase, and have carried out deletion analysis of this region on a plasmid-encoded TOP2 gene. Block deletions within the 5' untranslated region (UTR) identified two regions (-608 to -388 and -387 to -186) responsible for periodic accumulation of the mRNA. Deletion of one or the other of these sequences had no effect on periodic expression of the mRNA, while deletion of both regions resulted in constitutive expression of the mRNA throughout the cell cycle. Subcloning of these sequences into the 5' UTR of a construct lacking both regions of the TOP2 5' UTR has shown that an octamer consensus sequence present in the 5' UTR of the TOP2, RPA1, and DHFR-TS mRNAs is required for normal cycling of the TOP2 mRNA. Mutation of the consensus octamer sequence in the TOP2 5' UTR in a plasmid construct containing only a single consensus octamer and that shows normal cycling of the plasmid-encoded TOP2 mRNA resulted in substantial reduction of the cycling of the mRNA level. These results imply a negative regulation of TOP2 mRNA during the cell cycle by a mechanism involving redundant elements containing one or more copies of a conserved octamer sequence within the 5' UTR of TOP2 mRNA.
DDM1 represses noncoding RNA expression and RNA-directed DNA methylation in heterochromatin.
Tan, Feng; Lu, Yue; Jiang, Wei; Zhao, Yu; Wu, Tian; Zhang, Ruoyu; Zhou, Dao-Xiu
2018-05-24
Cytosine methylation of DNA, which occurs at CG, CHG, and CHH (H=A, C, or T) sequences in plants, is a hallmark for epigenetic repression of repetitive sequences. The chromatin remodeling factor DECREASE IN DNA METHYLATION1 (DDM1) is essential for DNA methylation, especially at CG and CHG sequences. However, its potential role in RNA-directed DNA methylation (RdDM) and in chromatin function is not completely understood in rice (Oryza sativa). In this work, we used high-throughput approaches to study the function of rice DDM1 (OsDDM1) in RdDM and the expression of non-coding RNA (ncRNA). We show that loss of function of OsDDM1 results in ectopic CHH methylation of transposable elements and repeats. The ectopic CHH methylation was dependent on rice DOMAINS REARRANGED METHYLTRANSFERASE2 (OsDRM2), a DNA methyltransferase involved in RdDM. Mutations in OsDDM1 lead to decreases of histone H3K9me2 and increases in the levels of heterochromatic small RNA (sRNA) and long noncoding RNA (lncRNA). In particular, OsDDM1 was found to be essential to repress transcription of the two repetitive sequences, Centromeric Retrotransposons of Rice1 (CRR1) and the dominant centromeric CentO repeats. These results suggest that OsDDM1 antagonizes RdDM at heterochromatin and represses tissue-specific expression of ncRNA from repetitive sequences in the rice genome. {copyright, serif} 2018 American Society of Plant Biologists. All rights reserved.
ElemeNT: a computational tool for detecting core promoter elements.
Sloutskin, Anna; Danino, Yehuda M; Orenstein, Yaron; Zehavi, Yonathan; Doniger, Tirza; Shamir, Ron; Juven-Gershon, Tamar
2015-01-01
Core promoter elements play a pivotal role in the transcriptional output, yet they are often detected manually within sequences of interest. Here, we present 2 contributions to the detection and curation of core promoter elements within given sequences. First, the Elements Navigation Tool (ElemeNT) is a user-friendly web-based, interactive tool for prediction and display of putative core promoter elements and their biologically-relevant combinations. Second, the CORE database summarizes ElemeNT-predicted core promoter elements near CAGE and RNA-seq-defined Drosophila melanogaster transcription start sites (TSSs). ElemeNT's predictions are based on biologically-functional core promoter elements, and can be used to infer core promoter compositions. ElemeNT does not assume prior knowledge of the actual TSS position, and can therefore assist in annotation of any given sequence. These resources, freely accessible at http://lifefaculty.biu.ac.il/gershon-tamar/index.php/resources, facilitate the identification of core promoter elements as active contributors to gene expression.
Potential Links between Hepadnavirus and Bornavirus Sequences in the Host Genome and Cancer.
Honda, Tomoyuki
2017-01-01
Various viruses leave their sequences in the host genomes during infection. Such events occur mainly in retrovirus infection but also sometimes in DNA and non-retroviral RNA virus infections. If viral sequences are integrated into the genomes of germ line cells, the sequences can become inherited as endogenous viral elements (EVEs). The integration events of viral sequences may have oncogenic potential. Because proviral integrations of some retroviruses and/or reactivation of endogenous retroviruses are closely linked to cancers, viral insertions related to non-retroviral viruses also possibly contribute to cancer development. This article focuses on genomic viral sequences derived from two non-retroviral viruses, whose endogenization is already reported, and discusses their possible contributions to cancer. Viral insertions of hepatitis B virus play roles in the development of hepatocellular carcinoma. Endogenous bornavirus-like elements, the only non-retroviral RNA virus-related EVEs found in the human genome, may also be involved in cancer formation. In addition, the possible contribution of the interactions between viruses and retrotransposons, which seem to be a major driving force for generating EVEs related to non-retroviral RNA viruses, to cancers will be discussed. Future studies regarding the possible links described here may open a new avenue for the development of novel therapeutics for tumor virus-related cancers and/or provide novel insights into EVE functions.
Schnare, Murray N.; Collings, James C.; Spencer, David F.; Gray, Michael W.
2000-01-01
In Crithidia fasciculata, the ribosomal RNA (rRNA) gene repeats range in size from ∼11 to 12 kb. This length heterogeneity is localized to a region of the intergenic spacer (IGS) that contains tandemly repeated copies of a 19mer sequence. The IGS also contains four copies of an ∼55 nt repeat that has an internal inverted repeat and is also present in the IGS of Leishmania species. We have mapped the C.fasciculata transcription initiation site as well as two other reverse transcriptase stop sites that may be analogous to the A0 and A′ pre-rRNA processing sites within the 5′ external transcribed spacer (ETS) of other eukaryotes. Features that could influence processing at these sites include two stretches of conserved primary sequence and three secondary structure elements present in the 5′ ETS. We also characterized the C.fasciculata U3 snoRNA, which has the potential for base-pairing with pre-rRNA sequences. Finally, we demonstrate that biosynthesis of large subunit rRNA in both C.fasciculata and Trypanosoma brucei involves 3′-terminal addition of three A residues that are not present in the corresponding DNA sequences. PMID:10982863
Schnapp, A; Clos, J; Hädelt, W; Schreck, R; Cvekl, A; Grummt, I
1990-03-25
The murine ribosomal gene promoter contains two cis-acting control elements which operate in concert to promote efficient and accurate transcription initiation by RNA polymerase I. The start site proximal core element which is indispensable for promoter recognition by RNA polymerase I (pol I) encompasses sequences from position -39 to -1. An upstream control element (UCE) which is located between nucleotides -142 and -112 stimulates the efficiency of transcription initiation both in vivo and in vitro. Here we report the isolation and functional characterization of a specific rDNA binding protein, the transcription initiation factor TIF-IB, which specifically interacts with the core region of the mouse ribosomal RNA gene promoter. Highly purified TIF-IB complements transcriptional activity in the presence of two other essential initiation factors TIF-IA and TIF-IC. We demonstrate that the binding efficiency of purified TIF-IB to the core promoter is strongly enhanced by the presence in cis of the UCE. This positive effect of upstream sequences on TIF-IB binding is observed throughout the purification procedure suggesting that the synergistic action of the two distant promoter elements is not mediated by a protein different from TIF-IB. Increasing the distance between both control elements still facilitates stable factor binding but eliminates transcriptional activation. The results demonstrate that TIF-IB binding to the rDNA promoter is an essential early step in the assembly of a functional transcription initiation complex. The subsequent interaction of TIF-IB with other auxiliary transcription initiation factors, however, requires the correct spacing between the UCE and the core promoter element.
Delimiting regulatory sequences of the Drosophila melanogaster Ddc gene.
Hirsh, J; Morgan, B A; Scholnick, S B
1986-01-01
We delimited sequences necessary for in vivo expression of the Drosophila melanogaster dopa decarboxylase gene Ddc. The expression of in vitro-altered genes was assayed following germ line integration via P-element vectors. Sequences between -209 and -24 were necessary for normally regulated expression, although genes lacking these sequences could be expressed at 10 to 50% of wild-type levels at specific developmental times. These genes showed components of normal developmental expression, which suggests that they retain some regulatory elements. All Ddc genes lacking the normal immediate 5'-flanking sequences were grossly deficient in larval central nervous system expression. Thus, this upstream region must contain at least one element necessary for this expression. A mutated Ddc gene without a normal TATA boxlike sequence used the normal RNA start points, indicating that this sequences is not required for start point specificity. Images PMID:3099170
Liu, Xiaochuan; Freitas, Jaime; Zheng, Dinghai; Oliveira, Marta S; Hoque, Mainul; Martins, Torcato; Henriques, Telmo; Tian, Bin; Moreira, Alexandra
2017-12-01
Alternative polyadenylation (APA) is a mechanism that generates multiple mRNA isoforms with different 3'UTRs and/or coding sequences from a single gene. Here, using 3' region extraction and deep sequencing (3'READS), we have systematically mapped cleavage and polyadenylation sites (PASs) in Drosophila melanogaster , expanding the total repertoire of PASs previously identified for the species, especially those located in A-rich genomic sequences. Cis -element analysis revealed distinct sequence motifs around fly PASs when compared to mammalian ones, including the greater enrichment of upstream UAUA elements and the less prominent presence of downstream UGUG elements. We found that over 75% of mRNA genes in Drosophila melanogaster undergo APA. The head tissue tends to use distal PASs when compared to the body, leading to preferential expression of APA isoforms with long 3'UTRs as well as with distal terminal exons. The distance between the APA sites and intron location of PAS are important parameters for APA difference between body and head, suggesting distinct PAS selection contexts. APA analysis of the RpII215 C4 mutant strain, which harbors a mutant RNA polymerase II (RNAPII) with a slower elongation rate, revealed that a 50% decrease in transcriptional elongation rate leads to a mild trend of more usage of proximal, weaker PASs, both in 3'UTRs and in introns, consistent with the "first come, first served" model of APA regulation. However, this trend was not observed in the head, suggesting a different regulatory context in neuronal cells. Together, our data expand the PAS collection for Drosophila melanogaster and reveal a tissue-specific effect of APA regulation by RNAPII elongation rate. © 2017 Liu et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Osman, Toba A M; Olsthoorn, René C L; Livieratos, Ioannis C
2014-09-22
Pepino mosaic virus (PepMV) is a mechanically-transmitted positive-strand RNA potexvirus, with a 6410 nt long single-stranded (ss) RNA genome flanked by a 5'-methylguanosine cap and a 3' poly-A tail. Computer-assisted folding of the 64 nt long PepMV 3'-untranslated region (UTR) resulted in the prediction of three stem-loop structures (hp1, hp2, and hp3 in the 3'-5' direction). The importance of these structures and/or sequences for promotion of negative-strand RNA synthesis and binding to the RNA dependent RNA polymerase (RdRp) was tested in vitro using a specific RdRp assay. Hp1, which is highly variable among different PepMV isolates, appeared dispensable for negative-strand synthesis. Hp2, which is characterized by a large U-rich loop, tolerated base-pair changes in its stem as long as they maintained the stem integrity but was very sensitive to changes in the U-rich loop. Hp3, which harbours the conserved potexvirus ACUUAA hexamer motif, was essential for template activity. Template-RNA polymerase binding competition experiments showed that the ACUUAA sequence represents a high-affinity RdRp binding element. Copyright © 2014 Elsevier B.V. All rights reserved.
Retrotransposons as regulators of gene expression
Elbarbary, Reyad A.; Lucas, Bronwyn A.; Maquat, Lynne E.
2016-01-01
Transposable elements (TEs) are both a boon and a bane to eukaryotic organisms, depending on where they integrate into the genome and how their sequences function once integrated. We focus on two types of TEs: long interspersed elements (LINEs) and short interspersed elements (SINEs). LINEs and SINEs are retrotransposons; that is, they transpose via an RNA intermediate. We discuss how LINEs and SINEs have expanded in eukaryotic genomes and contribute to genome evolution. An emerging body of evidence indicates that LINEs and SINEs function to regulate gene expression by affecting chromatin structure, gene transcription, pre-mRNA processing, or aspects of mRNA metabolism. We also describe how adenosine-to-inosine editing influences SINE function and how ongoing retrotransposition is countered by the body’s defense mechanisms. PMID:26912865
RNA-Mediated Gene Duplication and Retroposons: Retrogenes, LINEs, SINEs, and Sequence Specificity
2013-01-01
A substantial number of “retrogenes” that are derived from the mRNA of various intron-containing genes have been reported. A class of mammalian retroposons, long interspersed element-1 (LINE1, L1), has been shown to be involved in the reverse transcription of retrogenes (or processed pseudogenes) and non-autonomous short interspersed elements (SINEs). The 3′-end sequences of various SINEs originated from a corresponding LINE. As the 3′-untranslated regions of several LINEs are essential for retroposition, these LINEs presumably require “stringent” recognition of the 3′-end sequence of the RNA template. However, the 3′-ends of mammalian L1s do not exhibit any similarity to SINEs, except for the presence of 3′-poly(A) repeats. Since the 3′-poly(A) repeats of L1 and Alu SINE are critical for their retroposition, L1 probably recognizes the poly(A) repeats, thereby mobilizing not only Alu SINE but also cytosolic mRNA. Many flowering plants only harbor L1-clade LINEs and a significant number of SINEs with poly(A) repeats, but no homology to the LINEs. Moreover, processed pseudogenes have also been found in flowering plants. I propose that the ancestral L1-clade LINE in the common ancestor of green plants may have recognized a specific RNA template, with stringent recognition then becoming relaxed during the course of plant evolution. PMID:23984183
CLIP-related methodologies and their application to retrovirology.
Bieniasz, Paul D; Kutluay, Sebla B
2018-05-02
Virtually every step of HIV-1 replication and numerous cellular antiviral defense mechanisms are regulated by the binding of a viral or cellular RNA-binding protein (RBP) to distinct sequence or structural elements on HIV-1 RNAs. Until recently, these protein-RNA interactions were studied largely by in vitro binding assays complemented with genetics approaches. However, these methods are highly limited in the identification of the relevant targets of RBPs in physiologically relevant settings. Development of crosslinking-immunoprecipitation sequencing (CLIP) methodology has revolutionized the analysis of protein-nucleic acid complexes. CLIP combines immunoprecipitation of covalently crosslinked protein-RNA complexes with high-throughput sequencing, providing a global account of RNA sequences bound by a RBP of interest in cells (or virions) at near-nucleotide resolution. Numerous variants of the CLIP protocol have recently been developed, some with major improvements over the original. Herein, we briefly review these methodologies and give examples of how CLIP has been successfully applied to retrovirology research.
Samsa, Marcelo M.; Mondotte, Juan A.; Caramelo, Julio J.
2012-01-01
Little is known about the mechanism of flavivirus genome encapsidation. Here, functional elements of the dengue virus (DENV) capsid (C) protein were investigated. Study of the N-terminal region of DENV C has been limited by the presence of overlapping cis-acting RNA elements within the protein-coding region. To dissociate these two functions, we used a recombinant DENV RNA with a duplication of essential RNA structures outside the C coding sequence. By the use of this system, the highly conserved amino acids FNML, which are encoded in the RNA cyclization sequence 5′CS, were found to be dispensable for C function. In contrast, deletion of the N-terminal 18 amino acids of C impaired DENV particle formation. Two clusters of basic residues (R5-K6-K7-R9 and K17-R18-R20-R22) were identified as important. A systematic mutational analysis indicated that a high density of positive charges, rather than particular residues at specific positions, was necessary. Furthermore, a differential requirement of N-terminal sequences of C for viral particle assembly was observed in mosquito and human cells. While no viral particles were observed in human cells with a virus lacking the first 18 residues of C, DENV propagation was detected in mosquito cells, although to a level about 50-fold less than that observed for a wild-type (WT) virus. We conclude that basic residues at the N terminus of C are necessary for efficient particle formation in mosquito cells but that they are crucial for propagation in human cells. This is the first report demonstrating that the N terminus of C plays a role in DENV particle formation. In addition, our results suggest that this function of C is differentially modulated in different host cells. PMID:22072762
Christel, Stephan; Fridlund, Jimmy; Buetti-Dinh, Antoine; Buck, Moritz; Watkin, Elizabeth L; Dopson, Mark
2016-04-01
Acidithiobacillus ferrivorans is an acidophile implicated in low-temperature biomining for the recovery of metals from sulfide minerals. Acidithiobacillus ferrivorans obtains its energy from the oxidation of inorganic sulfur compounds, and genes encoding several alternative pathways have been identified. Next-generation sequencing of At. ferrivorans RNA transcripts identified the genes coding for metabolic and electron transport proteins for energy conservation from tetrathionate as electron donor. RNA transcripts suggested that tetrathionate was hydrolyzed by the tetH1 gene product to form thiosulfate, elemental sulfur and sulfate. Despite two of the genes being truncated, RNA transcripts for the SoxXYZAB complex had higher levels than for thiosulfate quinone oxidoreductase (doxDAgenes). However, a lack of heme-binding sites in soxX suggested that DoxDA was responsible for thiosulfate metabolism. Higher RNA transcript counts also suggested that elemental sulfur was metabolized by heterodisulfide reductase (hdrgenes) rather than sulfur oxygenase reductase (sor). The sulfite produced as a product of heterodisulfide reductase was suggested to be oxidized by a pathway involving the sat gene product or abiotically react with elemental sulfur to form thiosulfate. Finally, several electron transport complexes were involved in energy conservation. This study has elucidated the previously unknown At. ferrivorans tetrathionate metabolic pathway that is important in biomining. © FEMS 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Functional RNA elements in the dengue virus genome.
Gebhard, Leopoldo G; Filomatori, Claudia V; Gamarnik, Andrea V
2011-09-01
Dengue virus (DENV) genome amplification is a process that involves the viral RNA, cellular and viral proteins, and a complex architecture of cellular membranes. The viral RNA is not a passive template during this process; it plays an active role providing RNA signals that act as promoters, enhancers and/or silencers of the replication process. RNA elements that modulate RNA replication were found at the 5' and 3' UTRs and within the viral coding sequence. The promoter for DENV RNA synthesis is a large stem loop structure located at the 5' end of the genome. This structure specifically interacts with the viral polymerase NS5 and promotes RNA synthesis at the 3' end of a circularized genome. The circular conformation of the viral genome is mediated by long range RNA-RNA interactions that span thousands of nucleotides. Recent studies have provided new information about the requirement of alternative, mutually exclusive, structures in the viral RNA, highlighting the idea that the viral genome is flexible and exists in different conformations. In this article, we describe elements in the promoter SLA and other RNA signals involved in NS5 polymerase binding and activity, and provide new ideas of how dynamic secondary and tertiary structures of the viral RNA participate in the viral life cycle.
Optimizing sgRNA structure to improve CRISPR-Cas9 knockout efficiency.
Dang, Ying; Jia, Gengxiang; Choi, Jennie; Ma, Hongming; Anaya, Edgar; Ye, Chunting; Shankar, Premlata; Wu, Haoquan
2015-12-15
Single-guide RNA (sgRNA) is one of the two key components of the clustered regularly interspaced short palindromic repeats (CRISPR)-Cas9 genome-editing system. The current commonly used sgRNA structure has a shortened duplex compared with the native bacterial CRISPR RNA (crRNA)-transactivating crRNA (tracrRNA) duplex and contains a continuous sequence of thymines, which is the pause signal for RNA polymerase III and thus could potentially reduce transcription efficiency. Here, we systematically investigate the effect of these two elements on knockout efficiency and showed that modifying the sgRNA structure by extending the duplex length and mutating the fourth thymine of the continuous sequence of thymines to cytosine or guanine significantly, and sometimes dramatically, improves knockout efficiency in cells. In addition, the optimized sgRNA structure also significantly increases the efficiency of more challenging genome-editing procedures, such as gene deletion, which is important for inducing a loss of function in non-coding genes. By a systematic investigation of sgRNA structure we find that extending the duplex by approximately 5 bp combined with mutating the continuous sequence of thymines at position 4 to cytosine or guanine significantly increases gene knockout efficiency in CRISPR-Cas9-based genome editing experiments.
T box riboswitches in Actinobacteria: Translational regulation via novel tRNA interactions
Sherwood, Anna V.; Grundy, Frank J.; Henkin, Tina M.
2015-01-01
The T box riboswitch regulates many amino acid-related genes in Gram-positive bacteria. T box riboswitch-mediated gene regulation was shown previously to occur at the level of transcription attenuation via structural rearrangements in the 5′ untranslated (leader) region of the mRNA in response to binding of a specific uncharged tRNA. In this study, a novel group of isoleucyl-tRNA synthetase gene (ileS) T box leader sequences found in organisms of the phylum Actinobacteria was investigated. The Stem I domains of these RNAs lack several highly conserved elements that are essential for interaction with the tRNA ligand in other T box RNAs. Many of these RNAs were predicted to regulate gene expression at the level of translation initiation through tRNA-dependent stabilization of a helix that sequesters a sequence complementary to the Shine–Dalgarno (SD) sequence, thus freeing the SD sequence for ribosome binding and translation initiation. We demonstrated specific binding to the cognate tRNAIle and tRNAIle-dependent structural rearrangements consistent with regulation at the level of translation initiation, providing the first biochemical demonstration, to our knowledge, of translational regulation in a T box riboswitch. PMID:25583497
Krisinger, J; Jeung, E B; Simmen, R C; Leung, P C
1995-01-01
The expression of Calbindin-D9k (CaBP-9k) in the pig uterus and placenta was measured by Northern blot analysis and reverse transcription polymerase chain reaction (PCR), respectively. Progesterone (P4) administration to ovariectomized pigs decreased CaBP-9k mRNA levels. Expression of endometrial CaBP-9k mRNA was high on pregnancy Days 10-12 and below the detection limit on Days 15 and 18. On Day 60, expression could be detected at low levels. In myometrium and placenta, CaBP-9k mRNA expression was not detectable by Northern analysis using total RNA. Reverse-transcribed RNA from both tissues demonstrated the presence of CaBP-9k transcripts by means of PCR. The partial CaBP-9k gene was amplified by PCR and cloned to determine the sequence of intron A. In contrast to the rat CaBP-9k gene, the pig gene does not contain a functional estrogen response element (ERE) within this region. A similar ERE-like sequence located at the identical location was examined by gel retardation analysis and failed to bind the estradiol receptor. A similar disruption of this ERE-like sequence has been described in the human CaBP-9k gene, which is not expressed at any level in placenta, myometrium, or endometrium. It is concluded that the pig CaBP-9k gene is regulated in these reproductive tissues in a manner distinct from that in rat and human tissues. The regulation is probably due to a regulatory region outside of intron A, which in the rat gene contains the key cis element for uterine expression of the CaBP-9k gene.
cis-Acting elements important for retroviral RNA packaging specificity.
Beasley, Benjamin E; Hu, Wei-Shau
2002-05-01
Spleen necrosis virus (SNV) proteins can package RNA from distantly related murine leukemia virus (MLV), whereas MLV proteins cannot package SNV RNA efficiently. We used this nonreciprocal recognition to investigate regions of packaging signals that influence viral RNA encapsidation specificity. Although the MLV and SNV packaging signals (Psi and E, respectively) do not contain significant sequence homology, they both contain a pair of hairpins. This hairpin pair was previously proposed to be the core element in MLV Psi. In the present study, MLV-based vectors were generated to contain chimeric SNV/MLV packaging signals in which the hairpins were replaced with the heterologous counterpart. The interactions between these chimeras and MLV or SNV proteins were examined by virus replication and RNA analyses. SNV proteins recognized all of the chimeras, indicating that these chimeras were functional. We found that replacing the hairpin pair did not drastically alter the ability of MLV proteins to package these chimeras. These results indicate that, despite the important role of the hairpin pair in RNA packaging, it is not the major motif responsible for the ability of MLV proteins to discriminate between the MLV and SNV packaging signals. To determine the role of sequences flanking the hairpins in RNA packaging specificity, vectors with swapped flanking regions were generated and evaluated. SNV proteins packaged all of these chimeras efficiently. In contrast, MLV proteins strongly favored chimeras with the MLV 5'-flanking regions. These data indicated that MLV Gag recognizes multiple elements in the viral packaging signal, including the hairpin structure and flanking regions.
Brewer-Jensen, Paul; Wilson, Carrie B.; Abernethy, John; Mollison, Lonna; Card, Samantha
2016-01-01
Although RNA polymerase II (Pol II) productively transcribes very long genes in vivo, transcription through extragenic sequences often terminates in the promoter-proximal region and the nascent RNA is degraded. Mechanisms that induce early termination and RNA degradation are not well understood in multicellular organisms. Here, we present evidence that the suppressor of sable [su(s)] regulatory pathway of Drosophila melanogaster plays a role in this process. We previously showed that Su(s) promotes exosome-mediated degradation of transcripts from endogenous repeated elements at an Hsp70 locus (Hsp70-αβ elements). In this report, we identify Wdr82 as a component of this process and show that it works with Su(s) to inhibit Pol II elongation through Hsp70-αβ elements. Furthermore, we show that the unstable transcripts produced during this process are polyadenylated at heterogeneous sites that lack canonical polyadenylation signals. We define two distinct regions that mediate this regulation. These results indicate that the Su(s) pathway promotes RNA degradation and transcription termination through a novel mechanism. PMID:26577379
Belak, Zachery R; Ovsenek, Nicholas; Eskiw, Christopher H
2018-05-23
Yin-Yang 1 (YY1) is a highly conserved transcription factor possessing RNA-binding activity. A putative YY1 homologue was previously identified in the developmental model organism Strongylocentrotus purpuratus (the purple sea urchin) by genomic sequencing. We identified a high degree of sequence similarity with YY1 homologues of vertebrate origin which shared 100% protein sequence identity over the DNA- and RNA-binding zinc-finger region with high similarity in the N-terminal transcriptional activation domain. SpYY1 demonstrated identical DNA- and RNA-binding characteristics between Xenopus laevis and S. purpuratus indicating that it maintains similar functional and biochemical properties across widely divergent deuterostome species. SpYY1 binds to the consensus YY1 DNA element, and also to U-rich RNA sequences. Although we detected SpYY1 RNA-binding activity in ova lysates and observed cytoplasmic localization, SpYY1 was not associated with maternal mRNA in ova. SpYY1 expressed in Xenopus oocytes was excluded from the nucleus and associated with maternally expressed cytoplasmic mRNA molecules. These data demonstrate the existence of an YY1 homologue in S. purpuratus with similar structural and biochemical features to those of the well-studied vertebrate YY1; however, the data reveal major differences in the biological role of YY1 in the regulation of maternally expressed mRNA in the two species.
cis elements and trans-acting factors involved in dimer formation of murine leukemia virus RNA.
Prats, A C; Roy, C; Wang, P A; Erard, M; Housset, V; Gabus, C; Paoletti, C; Darlix, J L
1990-02-01
The genetic material of all retroviruses examined so far consists of two identical RNA molecules joined at their 5' ends by the dimer linkage structure (DLS). Since the precise location of the DLS as well as the mechanism and role(s) of RNA dimerization remain unclear, we analyzed the dimerization process of Moloney murine leukemia virus (MoMuLV) genomic RNA. For this purpose we derived an in vitro model for RNA dimerization. By using this model, murine leukemia virus RNA was shown to form dimeric molecules. Deletion mutagenesis in the 620-nucleotide leader of MoMuLV RNA showed that the dimer promoting sequences are located within the encapsidation element Psi between positions 215 and 420. Furthermore, hybridization assays in which DNA oligomers were used to probe monomer and dimer forms of MoMuLV RNA indicated that the DLS probably maps between positions 280 and 330 from the RNA 5' end. Also, retroviral nucleocapsid protein was shown to catalyze dimerization of MoMuLV RNA and to be tightly bound to genomic dimer RNA in virions. These results suggest that MoMuLV RNA dimerization and encapsidation are probably controlled by the same cis element, Psi, and trans-acting factor, nucleocapsid protein, and thus might be linked during virion formation.
cis elements and trans-acting factors involved in dimer formation of murine leukemia virus RNA.
Prats, A C; Roy, C; Wang, P A; Erard, M; Housset, V; Gabus, C; Paoletti, C; Darlix, J L
1990-01-01
The genetic material of all retroviruses examined so far consists of two identical RNA molecules joined at their 5' ends by the dimer linkage structure (DLS). Since the precise location of the DLS as well as the mechanism and role(s) of RNA dimerization remain unclear, we analyzed the dimerization process of Moloney murine leukemia virus (MoMuLV) genomic RNA. For this purpose we derived an in vitro model for RNA dimerization. By using this model, murine leukemia virus RNA was shown to form dimeric molecules. Deletion mutagenesis in the 620-nucleotide leader of MoMuLV RNA showed that the dimer promoting sequences are located within the encapsidation element Psi between positions 215 and 420. Furthermore, hybridization assays in which DNA oligomers were used to probe monomer and dimer forms of MoMuLV RNA indicated that the DLS probably maps between positions 280 and 330 from the RNA 5' end. Also, retroviral nucleocapsid protein was shown to catalyze dimerization of MoMuLV RNA and to be tightly bound to genomic dimer RNA in virions. These results suggest that MoMuLV RNA dimerization and encapsidation are probably controlled by the same cis element, Psi, and trans-acting factor, nucleocapsid protein, and thus might be linked during virion formation. Images PMID:2153242
A model for genesis of transcription systems.
Burton, Zachary F; Opron, Kristopher; Wei, Guowei; Geiger, James H
2016-01-01
Repeating sequences generated from RNA gene fusions/ligations dominate ancient life, indicating central importance of building structural complexity in evolving biological systems. A simple and coherent story of life on earth is told from tracking repeating motifs that generate α/β proteins, 2-double-Ψ-β-barrel (DPBB) type RNA polymerases (RNAPs), general transcription factors (GTFs), and promoters. A general rule that emerges is that biological complexity that arises through generation of repeats is often bounded by solubility and closure (i.e., to form a pseudo-dimer or a barrel). Because the first DNA genomes were replicated by DNA template-dependent RNA synthesis followed by RNA template-dependent DNA synthesis via reverse transcriptase, the first DNA replication origins were initially 2-DPBB type RNAP promoters. A simplifying model for evolution of promoters/replication origins via repetition of core promoter elements is proposed. The model can explain why Pribnow boxes in bacterial transcription (i.e., (-12)TATAATG(-6)) so closely resemble TATA boxes (i.e., (-31)TATAAAAG(-24)) in archaeal/eukaryotic transcription. The evolution of anchor DNA sequences in bacterial (i.e., (-35)TTGACA(-30)) and archaeal (BRE(up); BRE for TFB recognition element) promoters is potentially explained. The evolution of BRE(down) elements of archaeal promoters is potentially explained.
Characterization of short interspersed elements (SINEs) in a red alga, Porphyra yezoensis.
Zhang, Wenbo; Lin, Xiaofei; Peddigari, Suresh; Takechi, Katsuaki; Takano, Hiroyoshi; Takio, Susumu
2007-02-01
Short interspersed element (SINE)-like sequences referred to as PySN1 and PySN2 were identified in a red alga, Porphyra yezoensis. Both elements contained an internal promoter with motifs (A box and B box) recognized by RNA polymerase III, and target site duplications at both ends. Genomic Southern blot analysis revealed that both elements were widely and abundantly distributed on the genome. 3' and 5' RACE suggested that PySN1 was expressed as a chimera transcript with flanking SINE-unrelated sequences and possessed the poly-A tail at the same position near the 3' end of PySN1.
A promoter recognition mechanism common to yeast mitochondrial and phage t7 RNA polymerases.
Nayak, Dhananjaya; Guo, Qing; Sousa, Rui
2009-05-15
Yeast mitochondrial (YMt) and phage T7 RNA polymerases (RNAPs) are two divergent representatives of a large family of single subunit RNAPs that are also found in the mitochondria and chloroplasts of higher eukaryotes, mammalian nuclei, and many other bacteriophage. YMt and phage T7 promoters differ greatly in sequence and length, and the YMt RNAP uses an accessory factor for initiation, whereas T7 RNAP does not. We obtain evidence here that, despite these apparent differences, both the YMt and T7 RNAPs utilize a similar promoter recognition loop to bind their respective promoters. Mutations in this element in YMt RNAP specifically disrupt mitochondrial promoter utilization, and experiments with site-specifically tethered chemical nucleases indicate that this element binds the mitochondrial promoter almost identically to how the promoter recognition loop from the phage RNAP binds its promoter. Sequence comparisons reveal that the other members of the single subunit RNAP family display loops of variable sequence and size at a position corresponding to the YMt and T7 RNAP promoter recognition loops. We speculate that these elements may be involved in promoter recognition in most or all of these enzymes and that this element's structure allows it to accommodate significant sequence and length variation to provide a mechanism for rapid evolution of new promoter specificities in this RNAP family.
The identification and functional annotation of RNA structures conserved in vertebrates
Seemann, Stefan E.; Mirza, Aashiq H.; Hansen, Claus; Bang-Berthelsen, Claus H.; Garde, Christian; Christensen-Dalsgaard, Mikkel; Torarinsson, Elfar; Yao, Zizhen; Workman, Christopher T.; Pociot, Flemming; Nielsen, Henrik; Tommerup, Niels; Ruzzo, Walter L.; Gorodkin, Jan
2017-01-01
Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization, and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for conserved RNA structures (CRSs), leveraging structure-based, rather than sequence-based, alignments. After careful correction for sequence identity and GC content, we predict ∼516,000 human genomic regions containing CRSs. We find that a substantial fraction of human–mouse CRS regions (1) colocalize consistently with binding sites of the same RNA binding proteins (RBPs) or (2) are transcribed in corresponding tissues. Additionally, a CaptureSeq experiment revealed expression of many of our CRS regions in human fetal brain, including 662 novel ones. For selected human and mouse candidate pairs, qRT-PCR and in vitro RNA structure probing supported both shared expression and shared structure despite low abundance and low sequence identity. About 30,000 CRS regions are located near coding or long noncoding RNA genes or within enhancers. Structured (CRS overlapping) enhancer RNAs and extended 3′ ends have significantly increased expression levels over their nonstructured counterparts. Our findings of transcribed uncharacterized regulatory regions that contain CRSs support their RNA-mediated functionality. PMID:28487280
Lerat, Emmanuelle; Fablet, Marie; Modolo, Laurent; Lopez-Maestre, Hélène
2017-01-01
Abstract Over recent decades, substantial efforts have been made to understand the interactions between host genomes and transposable elements (TEs). The impact of TEs on the regulation of host genes is well known, with TEs acting as platforms of regulatory sequences. Nevertheless, due to their repetitive nature it is considerably hard to integrate TE analysis into genome-wide studies. Here, we developed a specific tool for the analysis of TE expression: TEtools. This tool takes into account the TE sequence diversity of the genome, it can be applied to unannotated or unassembled genomes and is freely available under the GPL3 (https://github.com/l-modolo/TEtools). TEtools performs the mapping of RNA-seq data obtained from classical mRNAs or small RNAs onto a list of TE sequences and performs differential expression analyses with statistical relevance. Using this tool, we analyzed TE expression from five Drosophila wild-type strains. Our data show for the first time that the activity of TEs is strictly linked to the activity of the genes implicated in the piwi-interacting RNA biogenesis and therefore fits an arms race scenario between TE sequences and host control genes. PMID:28204592
Colored petri net modeling of small interfering RNA-mediated messenger RNA degradation.
Nickaeen, Niloofar; Moein, Shiva; Heidary, Zarifeh; Ghaisari, Jafar
2016-01-01
Mathematical modeling of biological systems is an attractive way for studying complex biological systems and their behaviors. Petri Nets, due to their ability to model systems with various levels of qualitative information, have been wildly used in modeling biological systems in which enough qualitative data may not be at disposal. These nets have been used to answer questions regarding the dynamics of different cell behaviors including the translation process. In one stage of the translation process, the RNA sequence may be degraded. In the process of degradation of RNA sequence, small-noncoding RNA molecules known as small interfering RNA (siRNA) match the target RNA sequence. As a result of this matching, the target RNA sequence is destroyed. In this context, the process of matching and destruction is modeled using Colored Petri Nets (CPNs). The model is constructed using CPNs which allow tokens to have a value or type on them. Thus, CPN is a suitable tool to model string structures in which each element of the string has a different type. Using CPNs, long RNA, and siRNA strings are modeled with a finite set of colors. The model is simulated via CPN Tools. A CPN model of the matching between RNA and siRNA strings is constructed in CPN Tools environment. In previous studies, a network of stoichiometric equations was modeled. However, in this particular study, we modeled the mechanism behind the silencing process. Modeling this kind of mechanisms provides us with a tool to examine the effects of different factors such as mutation or drugs on the process.
Ryner, L C; Takagaki, Y; Manley, J L
1989-01-01
To investigate the role of sequences lying downstream of the conserved AAUAAA hexanucleotide in pre-mRNA cleavage and polyadenylation, deletions or substitutions were constructed in polyadenylation signals from simian virus 40 and adenovirus, and their effects were assayed in both crude and fractionated HeLa cell nuclear extracts. As expected, these sequences influenced the efficiency of both cleavage and polyadenylation as well as the accuracy of the cleavage reaction. Sequences near or upstream of the actual site of poly(A) addition appeared to specify a unique cleavage site, since their deletion resulted, in some cases, in heterogeneous cleavage. Furthermore, the sequences that allowed the simian virus 40 late pre-RNA to be cleaved preferentially by partially purified cleavage activity were also those at the cleavage site itself. Interestingly, sequences downstream of the cleavage site interacted with factors not directly involved in catalyzing cleavage and polyadenylation, since the effects of deletions were substantially diminished when partially purified components were used in assays. In addition, these sequences contained elements that could affect 3'-end formation both positively and negatively. Images PMID:2566911
Sokol, Martin; Jessen, Karen Margrethe; Pedersen, Finn Skou
2016-01-01
Several studies have shown that human endogenous retroviruses and endogenous retrovirus-like repeats (here collectively HERVs) impose direct regulation on human genes through enhancer and promoter motifs present in their long terminal repeats (LTRs). Although chimeric transcription in which novel gene isoforms containing retroviral and human sequence are transcribed from viral promoters are commonly associated with disease, regulation by HERVs is beneficial in other settings; for example, in human testis chimeric isoforms of TP63 induced by an ERV9 LTR protect the male germ line upon DNA damage by inducing apoptosis, whereas in the human globin locus the γ- and β-globin switch during normal hematopoiesis is mediated by complex interactions of an ERV9 LTR and surrounding human sequence. The advent of deep sequencing or next-generation sequencing (NGS) has revolutionized the way researchers solve important scientific questions and develop novel hypotheses in relation to human genome regulation. We recently applied next-generation paired-end RNA-sequencing (RNA-seq) together with chromatin immunoprecipitation with sequencing (ChIP-seq) to examine ERV9 chimeric transcription in human reference cell lines from Encyclopedia of DNA Elements (ENCODE). This led to the discovery of advanced regulation mechanisms by ERV9s and other HERVs across numerous human loci including transcription of large gene-unannotated genomic regions, as well as cooperative regulation by multiple HERVs and non-LTR repeats such as Alu elements. In this article, well-established examples of human gene regulation by HERVs are reviewed followed by a description of paired-end RNA-seq, and its application in identifying chimeric transcription genome-widely. Based on integrative analyses of RNA-seq and ChIP-seq, data we then present novel examples of regulation by ERV9s of tumor suppressor genes CADM2 and SEMA3A, as well as transcription of an unannotated region. Taken together, this article highlights the high suitability of contemporary sequencing methods in future analyses of human biology in relation to evolutionary acquired retroviruses in the human genome. © 2016 APMIS. Published by John Wiley & Sons Ltd.
Structural Analysis of Single-Point Mutations Given an RNA Sequence: A Case Study with RNAMute
NASA Astrophysics Data System (ADS)
Churkin, Alexander; Barash, Danny
2006-12-01
We introduce here for the first time the RNAMute package, a pattern-recognition-based utility to perform mutational analysis and detect vulnerable spots within an RNA sequence that affect structure. Mutations in these spots may lead to a structural change that directly relates to a change in functionality. Previously, the concept was tried on RNA genetic control elements called "riboswitches" and other known RNA switches, without an organized utility that analyzes all single-point mutations and can be further expanded. The RNAMute package allows a comprehensive categorization, given an RNA sequence that has functional relevance, by exploring the patterns of all single-point mutants. For illustration, we apply the RNAMute package on an RNA transcript for which individual point mutations were shown experimentally to inactivate spectinomycin resistance in Escherichia coli. Functional analysis of mutations on this case study was performed experimentally by creating a library of point mutations using PCR and screening to locate those mutations. With the availability of RNAMute, preanalysis can be performed computationally before conducting an experiment.
Widespread promoter-mediated coordination of transcription and mRNA degradation
2012-01-01
Background Previous work showed that mRNA degradation is coordinated with transcription in yeast, and in several genes the control of mRNA degradation was linked to promoter elements through two different mechanisms. Here we show at the genomic scale that the coordination of transcription and mRNA degradation is promoter-dependent in yeast and is also observed in humans. Results We first demonstrate that swapping upstream cis-regulatory sequences between two yeast species affects both transcription and mRNA degradation and suggest that while some cis-regulatory elements control either transcription or degradation, multiple other elements enhance both processes. Second, we show that adjacent yeast genes that share a promoter (through divergent orientation) have increased similarity in their patterns of mRNA degradation, providing independent evidence for the promoter-mediated coupling of transcription to mRNA degradation. Finally, analysis of the differences in mRNA degradation rates between mammalian cell types or mammalian species suggests a similar coordination between transcription and mRNA degradation in humans. Conclusions Our results extend previous studies and suggest a pervasive promoter-mediated coordination between transcription and mRNA degradation in yeast. The diverse genes and regulatory elements associated with this coordination suggest that it is generated by a global mechanism of gene regulation and modulated by gene-specific mechanisms. The observation of a similar coupling in mammals raises the possibility that coupling of transcription and mRNA degradation may reflect an evolutionarily conserved phenomenon in gene regulation. PMID:23237624
Barrera-Figueroa, Blanca E; Gao, Lei; Wu, Zhigang; Zhou, Xuefeng; Zhu, Jianhua; Jin, Hailing; Liu, Renyi; Zhu, Jian-Kang
2012-08-03
MicroRNAs (miRNAs) are small RNA molecules that play important regulatory roles in plant development and stress responses. Identification of stress-regulated miRNAs is crucial for understanding how plants respond to environmental stimuli. Abiotic stresses are one of the major factors that limit crop growth and yield. Whereas abiotic stress-regulated miRNAs have been identified in vegetative tissues in several plants, they are not well studied in reproductive tissues such as inflorescences. We used Illumina deep sequencing technology to sequence four small RNA libraries that were constructed from the inflorescences of rice plants that were grown under control condition and drought, cold, or salt stress. We identified 227 miRNAs that belong to 127 families, including 70 miRNAs that are not present in the miRBase. We validated 62 miRNAs (including 10 novel miRNAs) using published small RNA expression data in DCL1, DCL3, and RDR2 RNAi lines and confirmed 210 targets from 86 miRNAs using published degradome data. By comparing the expression levels of miRNAs, we identified 18, 15, and 10 miRNAs that were regulated by drought, cold and salt stress conditions, respectively. In addition, we identified 80 candidate miRNAs that originated from transposable elements or repeats, especially miniature inverted-repeat elements (MITEs). We discovered novel miRNAs and stress-regulated miRNAs that may play critical roles in stress response in rice inflorescences. Transposable elements or repeats, especially MITEs, are rich sources for miRNA origination.
Miras, Manuel; Sempere, Raquel N.; Kraft, Jelena J.; Miller, W. Allen; Aranda, Miguel A.; Truniger, Veronica
2015-01-01
Summary Many plant viruses depend on functional RNA elements, called 3′-UTR cap-independent translation enhancers (3′-CITEs), for translation of their RNAs. In this manuscript we provide direct proof for the existing hypothesis that 3′-CITEs are modular and transferable by recombination in nature, and that this is associated with an advantage for the created virus. By characterizing a newly identified Melon necrotic spot virus (MNSV; Tombusviridae) isolate, which is able to overcome eukaryotic translation initiation factor 4E (eIF4E)-mediated resistance, we found that it contains a 55 nucleotide insertion in its 3′-UTR. We provide strong evidence that this insertion was acquired by interfamilial recombination with the 3′-UTR of an Asiatic Cucurbit aphid-borne yellows virus (CABYV; Luteoviridae). By constructing chimeric viruses, we showed that this recombined sequence is responsible for resistance breaking. Analysis of the translational efficiency of reporter constructs showed that this sequence functions as a novel 3′-CITE in both resistant and susceptible plants, being essential for translation control in resistant plants. In conclusion, we showed that a recombination event between two clearly identified viruses from different families led to the transfer of exactly the sequence corresponding to a functional RNA element, giving rise to a new isolate with the capacity to infect an otherwise non-susceptible host. PMID:24372390
Miras, Manuel; Sempere, Raquel N; Kraft, Jelena J; Miller, W Allen; Aranda, Miguel A; Truniger, Veronica
2014-04-01
Many plant viruses depend on functional RNA elements, called 3'-UTR cap-independent translation enhancers (3'-CITEs), for translation of their RNAs. In this manuscript we provide direct proof for the existing hypothesis that 3'-CITEs are modular and transferable by recombination in nature, and that this is associated with an advantage for the created virus. By characterizing a newly identified Melon necrotic spot virus (MNSV; Tombusviridae) isolate, which is able to overcome eukaryotic translation initiation factor 4E (eIF4E)-mediated resistance, we found that it contains a 55 nucleotide insertion in its 3'-UTR. We provide strong evidence that this insertion was acquired by interfamilial recombination with the 3'-UTR of an Asiatic Cucurbit aphid-borne yellows virus (CABYV; Luteoviridae). By constructing chimeric viruses, we showed that this recombined sequence is responsible for resistance breaking. Analysis of the translational efficiency of reporter constructs showed that this sequence functions as a novel 3'-CITE in both resistant and susceptible plants, being essential for translation control in resistant plants. In conclusion, we showed that a recombination event between two clearly identified viruses from different families led to the transfer of exactly the sequence corresponding to a functional RNA element, giving rise to a new isolate with the capacity to infect an otherwise nonsusceptible host. © 2013 The Authors. New Phytologist © 2013 New Phytologist Trust.
Retrotransposons as regulators of gene expression.
Elbarbary, Reyad A; Lucas, Bronwyn A; Maquat, Lynne E
2016-02-12
Transposable elements (TEs) are both a boon and a bane to eukaryotic organisms, depending on where they integrate into the genome and how their sequences function once integrated. We focus on two types of TEs: long interspersed elements (LINEs) and short interspersed elements (SINEs). LINEs and SINEs are retrotransposons; that is, they transpose via an RNA intermediate. We discuss how LINEs and SINEs have expanded in eukaryotic genomes and contribute to genome evolution. An emerging body of evidence indicates that LINEs and SINEs function to regulate gene expression by affecting chromatin structure, gene transcription, pre-mRNA processing, or aspects of mRNA metabolism. We also describe how adenosine-to-inosine editing influences SINE function and how ongoing retrotransposition is countered by the body's defense mechanisms. Copyright © 2016, American Association for the Advancement of Science.
Biswas, Ambarish; Brown, Chris M
2014-06-08
Gene expression in vertebrate cells may be controlled post-transcriptionally through regulatory elements in mRNAs. These are usually located in the untranslated regions (UTRs) of mRNA sequences, particularly the 3'UTRs. Scan for Motifs (SFM) simplifies the process of identifying a wide range of regulatory elements on alignments of vertebrate 3'UTRs. SFM includes identification of both RNA Binding Protein (RBP) sites and targets of miRNAs. In addition to searching pre-computed alignments, the tool provides users the flexibility to search their own sequences or alignments. The regulatory elements may be filtered by expected value cutoffs and are cross-referenced back to their respective sources and literature. The output is an interactive graphical representation, highlighting potential regulatory elements and overlaps between them. The output also provides simple statistics and links to related resources for complementary analyses. The overall process is intuitive and fast. As SFM is a free web-application, the user does not need to install any software or databases. Visualisation of the binding sites of different classes of effectors that bind to 3'UTRs will facilitate the study of regulatory elements in 3' UTRs.
Linking maternal and somatic 5S rRNA types with different sequence-specific non-LTR retrotransposons
Pagano, Johanna F.B.; Ensink, Wim A.; van Olst, Marina; van Leeuwen, Selina; Nehrdich, Ulrike; Zhu, Kongju; Spaink, Herman P.; Girard, Geneviève; Rauwerda, Han; Jonker, Martijs J.; Dekker, Rob J.
2017-01-01
5S rRNA is a ribosomal core component, transcribed from many gene copies organized in genomic repeats. Some eukaryotic species have two 5S rRNA types defined by their predominant expression in oogenesis or adult tissue. Our next-generation sequencing study on zebrafish egg, embryo, and adult tissue identified maternal-type 5S rRNA that is exclusively accumulated during oogenesis, replaced throughout the embryogenesis by a somatic-type, and thus virtually absent in adult somatic tissue. The maternal-type 5S rDNA contains several thousands of gene copies on chromosome 4 in tandem repeats with small intergenic regions, whereas the somatic-type is present in only 12 gene copies on chromosome 18 with large intergenic regions. The nine-nucleotide variation between the two 5S rRNA types likely affects TFIII binding and riboprotein L5 binding, probably leading to storage of maternal-type rRNA. Remarkably, these sequence differences are located exactly at the sequence-specific target site for genome integration by the 5S rRNA-specific Mutsu retrotransposon family. Thus, we could define maternal- and somatic-type MutsuDr subfamilies. Furthermore, we identified four additional maternal-type and two new somatic-type MutsuDr subfamilies, each with their own target sequence. This target-site specificity, frequently intact maternal-type retrotransposon elements, plus specific presence of Mutsu retrotransposon RNA and piRNA in egg and adult tissue, suggest an involvement of retrotransposons in achieving the differential copy number of the two types of 5S rDNA loci. PMID:28003516
Recognition of Double Stranded RNA by Guanidine-Modified Peptide Nucleic Acids (GPNA)
Gupta, Pankaj; Muse, Oluwatoyosi; Rozners, Eriks
2011-01-01
Double helical RNA has become an attractive target for molecular recognition because many non-coding RNAs play important roles in control of gene expression. Recently, we discovered that short peptide nucleic acids (PNA) bind strongly and sequence selectively to a homopurine tract of double helical RNA via triple helix formation. Herein we tested if the molecular recognition of RNA can be enhanced by α-guanidine modification of PNA. Our study was motivated by the discovery of Ly and co-workers that the guanidine modification greatly enhances the cellular delivery of PNA. Isothermal titration calorimetry showed that the guanidine-modified PNA (GPNA) had reduced affinity and sequence selectivity for triple helical recognition of RNA. The data suggested that in contrast to unmodified PNA, which formed a 1:1 PNA-RNA triple helix, GPNA preferred a 2:1 GPNA-RNA triplex-invasion complex. Nevertheless, promising results were obtained for recognition of biologically relevant double helical RNA. Consistent with enhanced strand invasion ability, GPNA derived from D-arginine recognized the transactivation response element (TAR) of HIV-1 with high affinity and sequence selectivity, presumably via Watson-Crick duplex formation. On the other hand, strong and sequence selective triple helices were formed by unmodified and nucelobase-modified PNAs and the purine rich strand of bacterial A-site. These results suggest that appropriate chemical modifications of PNA may enhance molecular recognition of complex non-coding RNAs. PMID:22146072
Xu, Weijia; Ozer, Stuart; Gutell, Robin R
2009-01-01
With an increasingly large amount of sequences properly aligned, comparative sequence analysis can accurately identify not only common structures formed by standard base pairing but also new types of structural elements and constraints. However, traditional methods are too computationally expensive to perform well on large scale alignment and less effective with the sequences from diversified phylogenetic classifications. We propose a new approach that utilizes coevolutional rates among pairs of nucleotide positions using phylogenetic and evolutionary relationships of the organisms of aligned sequences. With a novel data schema to manage relevant information within a relational database, our method, implemented with a Microsoft SQL Server 2005, showed 90% sensitivity in identifying base pair interactions among 16S ribosomal RNA sequences from Bacteria, at a scale 40 times bigger and 50% better sensitivity than a previous study. The results also indicated covariation signals for a few sets of cross-strand base stacking pairs in secondary structure helices, and other subtle constraints in the RNA structure.
Xu, Weijia; Ozer, Stuart; Gutell, Robin R.
2010-01-01
With an increasingly large amount of sequences properly aligned, comparative sequence analysis can accurately identify not only common structures formed by standard base pairing but also new types of structural elements and constraints. However, traditional methods are too computationally expensive to perform well on large scale alignment and less effective with the sequences from diversified phylogenetic classifications. We propose a new approach that utilizes coevolutional rates among pairs of nucleotide positions using phylogenetic and evolutionary relationships of the organisms of aligned sequences. With a novel data schema to manage relevant information within a relational database, our method, implemented with a Microsoft SQL Server 2005, showed 90% sensitivity in identifying base pair interactions among 16S ribosomal RNA sequences from Bacteria, at a scale 40 times bigger and 50% better sensitivity than a previous study. The results also indicated covariation signals for a few sets of cross-strand base stacking pairs in secondary structure helices, and other subtle constraints in the RNA structure. PMID:20502534
Visootsat, Akasit; Payungporn, Sunchai; T-Thienprasert, Nattanan P
2015-12-01
Hepatitis B virus (HBV) infection is a primary cause of hepatocellular carcinoma and liver cirrhosis worldwide. To develop novel antiviral drugs, a better understanding of HBV gene expression regulation is vital. One important aspect is to understand how HBV hijacks the cellular machinery to export unspliced RNA from the nucleus. The HBV post-transcriptional regulatory element (HBV PRE) has been proposed to be the HBV RNA nuclear export element. However, the function remains controversial, and the core element is unclear. This study, therefore, aimed to identify functional regulatory elements within the HBV PRE and investigate their functions. Using bioinformatics programs based on sequence conservation and conserved RNA secondary structures, three regulatory elements were predicted, namely PRE 1151-1410, PRE 1520-1620 and PRE 1650-1684. PRE 1151-1410 significantly increased intronless and unspliced luciferase activity in both HepG2 and COS-7 cells. Likewise, PRE 1151-1410 significantly elevated intronless and unspliced HBV surface transcripts in liver cancer cells. Moreover, motif analysis predicted that PRE 1151-1410 contains several regulatory motifs. This study reported the roles of PRE 1151-1410 in intronless transcript nuclear export and the splicing mechanism. Additionally, these results provide knowledge in the field of HBV RNA regulation. Moreover, PRE 1151-1410 may be used to enhance the expression of other mRNAs in intronless reporter plasmids.
Context-dependent control of alternative splicing by RNA-binding proteins
Fu, Xiang-Dong; Ares, Manuel
2015-01-01
Sequence-specific RNA-binding proteins (RBPs) bind to pre-mRNA to control alternative splicing, but it is not yet possible to read the ‘splicing code’ that dictates splicing regulation on the basis of genome sequence. Each alternative splicing event is controlled by multiple RBPs, the combined action of which creates a distribution of alternatively spliced products in a given cell type. As each cell type expresses a distinct array of RBPs, the interpretation of regulatory information on a given RNA target is exceedingly dependent on the cell type. RBPs also control each other’s functions at many levels, including by mutual modulation of their binding activities on specific regulatory RNA elements. In this Review, we describe some of the emerging rules that govern the highly context-dependent and combinatorial nature of alternative splicing regulation. PMID:25112293
Stable CoT-1 repeat RNA is abundant and associated with euchromatic interphase chromosomes
Hall, Lisa L.; Carone, Dawn M.; Gomez, Alvin; Kolpa, Heather J.; Byron, Meg; Mehta, Nitish; Fackelmayer, Frank O.; Lawrence, Jeanne B.
2014-01-01
SUMMARY Recent studies recognize a vast diversity of non-coding RNAs with largely unknown functions, but few have examined interspersed repeat sequences, which constitute almost half our genome. RNA hybridization in situ using CoT-1 (highly repeated) DNA probes detects surprisingly abundant euchromatin-associated RNA comprised predominantly of repeat sequences (“CoT-1 RNA”), including LINE-1. CoT-1-hybridizing RNA strictly localizes to the interphase chromosome territory in cis, and remains stably associated with the chromosome territory following prolonged transcriptional inhibition. The CoT-1 RNA territory resists mechanical disruption and fractionates with the non-chromatin scaffold, but can be experimentally released. Loss of repeat-rich, stable nuclear RNAs from euchromatin corresponds to aberrant chromatin distribution and condensation. CoT-1 RNA has several properties similar to XIST chromosomal RNA, but is excluded from chromatin condensed by XIST. These findings impact two “black boxes” of genome science: the poorly understood diversity of non-coding RNA and the unexplained abundance of repetitive elements. PMID:24581492
Secondary structural entropy in RNA switch (Riboswitch) identification.
Manzourolajdad, Amirhossein; Arnold, Jonathan
2015-04-28
RNA regulatory elements play a significant role in gene regulation. Riboswitches, a widespread group of regulatory RNAs, are vital components of many bacterial genomes. These regulatory elements generally function by forming a ligand-induced alternative fold that controls access to ribosome binding sites or other regulatory sites in RNA. Riboswitch-mediated mechanisms are ubiquitous across bacterial genomes. A typical class of riboswitch has its own unique structural and biological complexity, making de novo riboswitch identification a formidable task. Traditionally, riboswitches have been identified through comparative genomics based on sequence and structural homology. The limitations of structural-homology-based approaches, coupled with the assumption that there is a great diversity of undiscovered riboswitches, suggests the need for alternative methods for riboswitch identification, possibly based on features intrinsic to their structure. As of yet, no such reliable method has been proposed. We used structural entropy of riboswitch sequences as a measure of their secondary structural dynamics. Entropy values of a diverse set of riboswitches were compared to that of their mutants, their dinucleotide shuffles, and their reverse complement sequences under different stochastic context-free grammar folding models. Significance of our results was evaluated by comparison to other approaches, such as the base-pairing entropy and energy landscapes dynamics. Classifiers based on structural entropy optimized via sequence and structural features were devised as riboswitch identifiers and tested on Bacillus subtilis, Escherichia coli, and Synechococcus elongatus as an exploration of structural entropy based approaches. The unusually long untranslated region of the cotH in Bacillus subtilis, as well as upstream regions of certain genes, such as the sucC genes were associated with significant structural entropy values in genome-wide examinations. Various tests show that there is in fact a relationship between higher structural entropy and the potential for the RNA sequence to have alternative structures, within the limitations of our methodology. This relationship, though modest, is consistent across various tests. Understanding the behavior of structural entropy as a fairly new feature for RNA conformational dynamics, however, may require extensive exploratory investigation both across RNA sequences and folding models.
mRNA deep sequencing reveals 75 new genes and a complex transcriptional landscape in Mimivirus.
Legendre, Matthieu; Audic, Stéphane; Poirot, Olivier; Hingamp, Pascal; Seltzer, Virginie; Byrne, Deborah; Lartigue, Audrey; Lescot, Magali; Bernadac, Alain; Poulain, Julie; Abergel, Chantal; Claverie, Jean-Michel
2010-05-01
Mimivirus, a virus infecting Acanthamoeba, is the prototype of the Mimiviridae, the latest addition to the nucleocytoplasmic large DNA viruses. The Mimivirus genome encodes close to 1000 proteins, many of them never before encountered in a virus, such as four amino-acyl tRNA synthetases. To explore the physiology of this exceptional virus and identify the genes involved in the building of its characteristic intracytoplasmic "virion factory," we coupled electron microscopy observations with the massively parallel pyrosequencing of the polyadenylated RNA fractions of Acanthamoeba castellanii cells at various time post-infection. We generated 633,346 reads, of which 322,904 correspond to Mimivirus transcripts. This first application of deep mRNA sequencing (454 Life Sciences [Roche] FLX) to a large DNA virus allowed the precise delineation of the 5' and 3' extremities of Mimivirus mRNAs and revealed 75 new transcripts including several noncoding RNAs. Mimivirus genes are expressed across a wide dynamic range, in a finely regulated manner broadly described by three main temporal classes: early, intermediate, and late. This RNA-seq study confirmed the AAAATTGA sequence as an early promoter element, as well as the presence of palindromes at most of the polyadenylation sites. It also revealed a new promoter element correlating with late gene expression, which is also prominent in Sputnik, the recently described Mimivirus "virophage." These results-validated genome-wide by the hybridization of total RNA extracted from infected Acanthamoeba cells on a tiling array (Agilent)--will constitute the foundation on which to build subsequent functional studies of the Mimivirus/Acanthamoeba system.
Hogan, Daniel J; Riordan, Daniel P; Gerber, André P; Herschlag, Daniel; Brown, Patrick O
2008-10-28
RNA-binding proteins (RBPs) have roles in the regulation of many post-transcriptional steps in gene expression, but relatively few RBPs have been systematically studied. We searched for the RNA targets of 40 proteins in the yeast Saccharomyces cerevisiae: a selective sample of the approximately 600 annotated and predicted RBPs, as well as several proteins not annotated as RBPs. At least 33 of these 40 proteins, including three of the four proteins that were not previously known or predicted to be RBPs, were reproducibly associated with specific sets of a few to several hundred RNAs. Remarkably, many of the RBPs we studied bound mRNAs whose protein products share identifiable functional or cytotopic features. We identified specific sequences or predicted structures significantly enriched in target mRNAs of 16 RBPs. These potential RNA-recognition elements were diverse in sequence, structure, and location: some were found predominantly in 3'-untranslated regions, others in 5'-untranslated regions, some in coding sequences, and many in two or more of these features. Although this study only examined a small fraction of the universe of yeast RBPs, 70% of the mRNA transcriptome had significant associations with at least one of these RBPs, and on average, each distinct yeast mRNA interacted with three of the RBPs, suggesting the potential for a rich, multidimensional network of regulation. These results strongly suggest that combinatorial binding of RBPs to specific recognition elements in mRNAs is a pervasive mechanism for multi-dimensional regulation of their post-transcriptional fate.
Bieth, E; Gabus, C; Darlix, J L
1990-01-11
The genetic material of all retroviruses examined so far is an RNA dimer where two identical RNA subunits are joined at their 5' ends by a structure named dimer linkage structure (DLS). Since the precise location and structure of the DLS as well as the mechanism and role(s) of RNA dimerization remain unclear, we analysed the dimerization process of Rous sarcoma virus (RSV) RNA. For this purpose we set up an in vitro model for RSV RNA dimerization. Using this model RSV RNA was shown to form dimeric molecules and this dimerization process was greatly activated by nucleocapsid protein (NCp12) of RSV. Furthermore, RSV RNA dimerization was performed in the presence of complementary 5'32P-DNA oligomers in order to probe the monomer and dimer forms of RSV RNA. Data indicated that the DLS of RSV RNA probably maps between positions 544-564 from the 5' end. In an attempt to define sequences needed for the dimerization of RSV RNA, deletion mutageneses were generated in the 5' 600 nt. The results showed that the dimer promoting sequences probably are located within positions 208-270 and 400-600 from the 5' end and hence possibly encompassing the cis-acting elements needed for the specific encapsidation of RSV genomic RNA. Also it is reported that synthesis of the polyprotein precursor Pr76gag is inhibited upon dimerization of RSV RNA. These results suggest that dimerization and encapsidation of genome length RSV RNA might be linked in the course of virion formation since they appear to be under the control of the same cis elements, E and DLS, and the trans-acting factor nucleocapsid protein NCp12.
Bieth, E; Gabus, C; Darlix, J L
1990-01-01
The genetic material of all retroviruses examined so far is an RNA dimer where two identical RNA subunits are joined at their 5' ends by a structure named dimer linkage structure (DLS). Since the precise location and structure of the DLS as well as the mechanism and role(s) of RNA dimerization remain unclear, we analysed the dimerization process of Rous sarcoma virus (RSV) RNA. For this purpose we set up an in vitro model for RSV RNA dimerization. Using this model RSV RNA was shown to form dimeric molecules and this dimerization process was greatly activated by nucleocapsid protein (NCp12) of RSV. Furthermore, RSV RNA dimerization was performed in the presence of complementary 5'32P-DNA oligomers in order to probe the monomer and dimer forms of RSV RNA. Data indicated that the DLS of RSV RNA probably maps between positions 544-564 from the 5' end. In an attempt to define sequences needed for the dimerization of RSV RNA, deletion mutageneses were generated in the 5' 600 nt. The results showed that the dimer promoting sequences probably are located within positions 208-270 and 400-600 from the 5' end and hence possibly encompassing the cis-acting elements needed for the specific encapsidation of RSV genomic RNA. Also it is reported that synthesis of the polyprotein precursor Pr76gag is inhibited upon dimerization of RSV RNA. These results suggest that dimerization and encapsidation of genome length RSV RNA might be linked in the course of virion formation since they appear to be under the control of the same cis elements, E and DLS, and the trans-acting factor nucleocapsid protein NCp12. Images PMID:2155394
Sequence-specific inhibition of Dicer measured with a force-based microarray for RNA ligands.
Limmer, Katja; Aschenbrenner, Daniela; Gaub, Hermann E
2013-04-01
Malfunction of protein translation causes many severe diseases, and suitable correction strategies may become the basis of effective therapies. One major regulatory element of protein translation is the nuclease Dicer that cuts double-stranded RNA independently of the sequence into pieces of 19-22 base pairs starting the RNA interference pathway and activating miRNAs. Inhibiting Dicer is not desirable owing to its multifunctional influence on the cell's gene regulation. Blocking specific RNA sequences by small-molecule binding, however, is a promising approach to affect the cell's condition in a controlled manner. A label-free assay for the screening of site-specific interference of small molecules with Dicer activity is thus needed. We used the Molecular Force Assay (MFA), recently developed in our lab, to measure the activity of Dicer. As a model system, we used an RNA sequence that forms an aptamer-binding site for paromomycin, a 615-dalton aminoglycoside. We show that Dicer activity is modulated as a function of concentration and incubation time: the addition of paromomycin leads to a decrease of Dicer activity according to the amount of ligand. The measured dissociation constant of paromomycin to its aptamer was found to agree well with literature values. The parallel format of the MFA allows a large-scale search and analysis for ligands for any RNA sequence.
iPARTS2: an improved tool for pairwise alignment of RNA tertiary structures, version 2.
Yang, Chung-Han; Shih, Cheng-Ting; Chen, Kun-Tze; Lee, Po-Han; Tsai, Ping-Han; Lin, Jian-Cheng; Yen, Ching-Yu; Lin, Tiao-Yin; Lu, Chin Lung
2016-07-08
Since its first release in 2010, iPARTS has become a valuable tool for globally or locally aligning two RNA 3D structures. It was implemented by a structural alphabet (SA)-based approach, which uses an SA of 23 letters to reduce RNA 3D structures into 1D sequences of SA letters and applies traditional sequence alignment to these SA-encoded sequences for determining their global or local similarity. In this version, we have re-implemented iPARTS into a new web server iPARTS2 by constructing a totally new SA, which consists of 92 elements with each carrying both information of base and backbone geometry for a representative nucleotide. This SA is significantly different from the one used in iPARTS, because the latter consists of only 23 elements with each carrying only the backbone geometry information of a representative nucleotide. Our experimental results have shown that iPARTS2 outperforms its previous version iPARTS and also achieves better accuracy than other popular tools, such as SARA, SETTER and RASS, in RNA alignment quality and function prediction. iPARTS2 takes as input two RNA 3D structures in the PDB format and outputs their global or local alignments with graphical display. iPARTS2 is now available online at http://genome.cs.nthu.edu.tw/iPARTS2/. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Lee, Sooncheol; Kang, Changwon
2011-05-06
The RNA oligo(U) sequence, along with an immediately preceding RNA hairpin structure, is an essential cis-acting element for bacterial class I intrinsic termination. This sequence not only causes a pause in transcription during the beginning of the termination process but also facilitates transcript release at the end of the process. In this study, the oligo(U) sequence of the bacteriophage T7 intrinsic terminator Tφ, rather than the hairpin structure, induced pauses of phage T7 RNA polymerase not only at the termination site, triggering a termination process, but also 3 bp upstream, exerting an antitermination effect. The upstream pause presumably allowed RNA to form a thermodynamically more stable secondary structure rather than a terminator hairpin and to persist because the 5'-half of the terminator hairpin-forming sequence could be sequestered by a farther upstream sequence via sequence-specific hybridization, prohibiting formation of the terminator hairpin and termination. The putative antiterminator RNA structure lacked several base pairs essential for termination when probed using RNases A, T1, and V1. When the antiterminator was destabilized by incorporation of IMP into nascent RNA at G residue positions, antitermination was abolished. Furthermore, antitermination strength increased with more stable antiterminator secondary structures and longer pauses. Thus, the oligo(U)-mediated pause prior to the termination site can exert a cis-acting antitermination activity on intrinsic terminator Tφ, and the termination efficiency depends primarily on the termination-interfering pause that precedes the termination-facilitating pause at the termination site.
Massive programmed translational jumping in mitochondria
Lang, B. Franz; Jakubkova, Michaela; Hegedusova, Eva; Daoud, Rachid; Forget, Lise; Brejova, Brona; Vinar, Tomas; Kosa, Peter; Fricova, Dominika; Nebohacova, Martina; Griac, Peter; Tomaska, Lubomir; Burger, Gertraud; Nosek, Jozef
2014-01-01
Programmed translational bypassing is a process whereby ribosomes “ignore” a substantial interval of mRNA sequence. Although discovered 25 y ago, the only experimentally confirmed example of this puzzling phenomenon is expression of the bacteriophage T4 gene 60. Bypassing requires translational blockage at a “takeoff codon” immediately upstream of a stop codon followed by a hairpin, which causes peptidyl-tRNA dissociation and reassociation with a matching “landing triplet” 50 nt downstream, where translation resumes. Here, we report 81 translational bypassing elements (byps) in mitochondria of the yeast Magnusiomyces capitatus and demonstrate in three cases, by transcript analysis and proteomics, that byps are retained in mitochondrial mRNAs but not translated. Although mitochondrial byps resemble the bypass sequence in the T4 gene 60, they utilize unused codons instead of stops for translational blockage and have relaxed matching rules for takeoff/landing sites. We detected byp-like sequences also in mtDNAs of several Saccharomycetales, indicating that byps are mobile genetic elements. These byp-like sequences lack bypassing activity and are tolerated when inserted in-frame in variable protein regions. We hypothesize that byp-like elements have the potential to contribute to evolutionary diversification of proteins by adding new domains that allow exploration of new structures and functions. PMID:24711422
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kiss, Daniel L.; Hou, Dezhi; Gross, Robert H.
Highlights: Black-Right-Pointing-Pointer Successful use of a novel RNA-specific bioinformatic tool, RNA SCOPE. Black-Right-Pointing-Pointer Identified novel 3 Prime UTR cis-acting element that destabilizes a reporter mRNA. Black-Right-Pointing-Pointer Show exosome subunits are required for cis-acting element-mediated mRNA instability. Black-Right-Pointing-Pointer Define precise sequence requirements of novel cis-acting element. Black-Right-Pointing-Pointer Show that microarray-defined exosome subunit-regulated mRNAs have novel element. -- Abstract: Eukaryotic RNA turnover is regulated in part by the exosome, a nuclear and cytoplasmic complex of ribonucleases (RNases) and RNA-binding proteins. The major RNase of the complex is thought to be Dis3, a multi-functional 3 Prime -5 Prime exoribonuclease and endoribonuclease. Although itmore » is known that Dis3 and core exosome subunits are recruited to transcriptionally active genes and to messenger RNA (mRNA) substrates, this recruitment is thought to occur indirectly. We sought to discover cis-acting elements that recruit Dis3 or other exosome subunits. Using a bioinformatic tool called RNA SCOPE to screen the 3 Prime untranslated regions of up-regulated transcripts from our published Dis3 depletion-derived transcriptomic data set, we identified several motifs as candidate instability elements. Secondary screening using a luciferase reporter system revealed that one cassette-harboring four elements-destabilized the reporter transcript. RNAi-based depletion of Dis3, Rrp6, Rrp4, Rrp40, or Rrp46 diminished the efficacy of cassette-mediated destabilization. Truncation analysis of the cassette showed that two exosome subunit-sensitive elements (ESSEs) destabilized the reporter. Point-directed mutagenesis of ESSE abrogated the destabilization effect. An examination of the transcriptomic data from exosome subunit depletion-based microarrays revealed that mRNAs with ESSEs are found in every up-regulated mRNA data set but are underrepresented or missing from the down-regulated data sets. Taken together, our findings imply a potentially novel mechanism of mRNA turnover that involves direct Dis3 and other exosome subunit recruitment to and/or regulation on mRNA substrates.« less
Fonfara, Ines; Le Rhun, Anaïs; Chylinski, Krzysztof; Makarova, Kira S.; Lécrivain, Anne-Laure; Bzdrenga, Janek; Koonin, Eugene V.; Charpentier, Emmanuelle
2014-01-01
The CRISPR-Cas-derived RNA-guided Cas9 endonuclease is the key element of an emerging promising technology for genome engineering in a broad range of cells and organisms. The DNA-targeting mechanism of the type II CRISPR-Cas system involves maturation of tracrRNA:crRNA duplex (dual-RNA), which directs Cas9 to cleave invading DNA in a sequence-specific manner, dependent on the presence of a Protospacer Adjacent Motif (PAM) on the target. We show that evolution of dual-RNA and Cas9 in bacteria produced remarkable sequence diversity. We selected eight representatives of phylogenetically defined type II CRISPR-Cas groups to analyze possible coevolution of Cas9 and dual-RNA. We demonstrate that these two components are interchangeable only between closely related type II systems when the PAM sequence is adjusted to the investigated Cas9 protein. Comparison of the taxonomy of bacterial species that harbor type II CRISPR-Cas systems with the Cas9 phylogeny corroborates horizontal transfer of the CRISPR-Cas loci. The reported collection of dual-RNA:Cas9 with associated PAMs expands the possibilities for multiplex genome editing and could provide means to improve the specificity of the RNA-programmable Cas9 tool. PMID:24270795
Global Organization of a Positive-strand RNA Virus Genome
Wu, Baodong; Grigull, Jörg; Ore, Moriam O.; Morin, Sylvie; White, K. Andrew
2013-01-01
The genomes of plus-strand RNA viruses contain many regulatory sequences and structures that direct different viral processes. The traditional view of these RNA elements are as local structures present in non-coding regions. However, this view is changing due to the discovery of regulatory elements in coding regions and functional long-range intra-genomic base pairing interactions. The ∼4.8 kb long RNA genome of the tombusvirus tomato bushy stunt virus (TBSV) contains these types of structural features, including six different functional long-distance interactions. We hypothesized that to achieve these multiple interactions this viral genome must utilize a large-scale organizational strategy and, accordingly, we sought to assess the global conformation of the entire TBSV genome. Atomic force micrographs of the genome indicated a mostly condensed structure composed of interconnected protrusions extending from a central hub. This configuration was consistent with the genomic secondary structure model generated using high-throughput selective 2′-hydroxyl acylation analysed by primer extension (i.e. SHAPE), which predicted different sized RNA domains originating from a central region. Known RNA elements were identified in both domain and inter-domain regions, and novel structural features were predicted and functionally confirmed. Interestingly, only two of the six long-range interactions known to form were present in the structural model. However, for those interactions that did not form, complementary partner sequences were positioned relatively close to each other in the structure, suggesting that the secondary structure level of viral genome structure could provide a basic scaffold for the formation of different long-range interactions. The higher-order structural model for the TBSV RNA genome provides a snapshot of the complex framework that allows multiple functional components to operate in concert within a confined context. PMID:23717202
Identifying mRNA sequence elements for target recognition by human Argonaute proteins
Li, Jingjing; Kim, TaeHyung; Nutiu, Razvan; Ray, Debashish; Hughes, Timothy R.; Zhang, Zhaolei
2014-01-01
It is commonly known that mammalian microRNAs (miRNAs) guide the RNA-induced silencing complex (RISC) to target mRNAs through the seed-pairing rule. However, recent experiments that coimmunoprecipitate the Argonaute proteins (AGOs), the central catalytic component of RISC, have consistently revealed extensive AGO-associated mRNAs that lack seed complementarity with miRNAs. We herein test the hypothesis that AGO has its own binding preference within target mRNAs, independent of guide miRNAs. By systematically analyzing the data from in vivo cross-linking experiments with human AGOs, we have identified a structurally accessible and evolutionarily conserved region (∼10 nucleotides in length) that alone can accurately predict AGO–mRNA associations, independent of the presence of miRNA binding sites. Within this region, we further identified an enriched motif that was replicable on independent AGO-immunoprecipitation data sets. We used RNAcompete to enumerate the RNA-binding preference of human AGO2 to all possible 7-mer RNA sequences and validated the AGO motif in vitro. These findings reveal a novel function of AGOs as sequence-specific RNA-binding proteins, which may aid miRNAs in recognizing their targets with high specificity. PMID:24663241
Miras, Manuel; Rodríguez-Hernández, Ana M; Romero-López, Cristina; Berzal-Herranz, Alfredo; Colchero, Jaime; Aranda, Miguel A; Truniger, Verónica
2018-01-01
In eukaryotes, the formation of a 5'-cap and 3'-poly(A) dependent protein-protein bridge is required for translation of its mRNAs. In contrast, several plant virus RNA genomes lack both of these mRNA features, but instead have a 3'-CITE (for cap-independent translation enhancer), a RNA element present in their 3'-untranslated region that recruits translation initiation factors and is able to control its cap-independent translation. For several 3'-CITEs, direct RNA-RNA long-distance interactions based on sequence complementarity between the 5'- and 3'-ends are required for efficient translation, as they bring the translation initiation factors bound to the 3'-CITE to the 5'-end. For the carmovirus melon necrotic spot virus (MNSV), a 3'-CITE has been identified, and the presence of its 5'-end in cis has been shown to be required for its activity. Here, we analyze the secondary structure of the 5'-end of the MNSV RNA genome and identify two highly conserved nucleotide sequence stretches that are complementary to the apical loop of its 3'-CITE. In in vivo cap-independent translation assays with mutant constructs, by disrupting and restoring sequence complementarity, we show that the interaction between the 3'-CITE and at least one complementary sequence in the 5'-end is essential for virus RNA translation, although efficient virus translation and multiplication requires both connections. The complementary sequence stretches are invariant in all MNSV isolates, suggesting that the dual 5'-3' RNA:RNA interactions are required for optimal MNSV cap-independent translation and multiplication.
Locati, Mauro D; Pagano, Johanna F B; Ensink, Wim A; van Olst, Marina; van Leeuwen, Selina; Nehrdich, Ulrike; Zhu, Kongju; Spaink, Herman P; Girard, Geneviève; Rauwerda, Han; Jonker, Martijs J; Dekker, Rob J; Breit, Timo M
2017-04-01
5S rRNA is a ribosomal core component, transcribed from many gene copies organized in genomic repeats. Some eukaryotic species have two 5S rRNA types defined by their predominant expression in oogenesis or adult tissue. Our next-generation sequencing study on zebrafish egg, embryo, and adult tissue identified maternal-type 5S rRNA that is exclusively accumulated during oogenesis, replaced throughout the embryogenesis by a somatic-type, and thus virtually absent in adult somatic tissue. The maternal-type 5S rDNA contains several thousands of gene copies on chromosome 4 in tandem repeats with small intergenic regions, whereas the somatic-type is present in only 12 gene copies on chromosome 18 with large intergenic regions. The nine-nucleotide variation between the two 5S rRNA types likely affects TFIII binding and riboprotein L5 binding, probably leading to storage of maternal-type rRNA. Remarkably, these sequence differences are located exactly at the sequence-specific target site for genome integration by the 5S rRNA-specific Mutsu retrotransposon family. Thus, we could define maternal- and somatic-type MutsuDr subfamilies. Furthermore, we identified four additional maternal-type and two new somatic-type MutsuDr subfamilies, each with their own target sequence. This target-site specificity, frequently intact maternal-type retrotransposon elements, plus specific presence of Mutsu retrotransposon RNA and piRNA in egg and adult tissue, suggest an involvement of retrotransposons in achieving the differential copy number of the two types of 5S rDNA loci. © 2017 Locati et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Soifer, Harris S; Zaragoza, Adriana; Peyvan, Maany; Behlke, Mark A; Rossi, John J
2005-01-01
Long interspersed nuclear elements (LINE-1 or L1) comprise 17% of the human genome, although only 80-100 L1s are considered retrotransposition-competent (RC-L1). Despite their small number, RC-L1s are still potential hazards to genome integrity through insertional mutagenesis, unequal recombination and chromosome rearrangements. In this study, we provide several lines of evidence that the LINE-1 retrotransposon is susceptible to RNA interference (RNAi). First, double-stranded RNA (dsRNA) generated in vitro from an L1 template is converted into functional short interfering RNA (siRNA) by DICER, the RNase III enzyme that initiates RNAi in human cells. Second, pooled siRNA from in vitro cleavage of L1 dsRNA, as well as synthetic L1 siRNA, targeting the 5'-UTR leads to sequence-specific mRNA degradation of an L1 fusion transcript. Finally, both synthetic and pooled siRNA suppressed retrotransposition from a highly active RC-L1 clone in cell culture assay. Our report is the first to demonstrate that a human transposable element is subjected to RNAi.
McGrew, L L; Richter, J D
1990-11-01
The expression of certain maternal mRNAs during oocyte maturation is regulated by cytoplasmic polyadenylation. To understand this process, we have focused on a maternal mRNA from Xenopus termed G10. This mRNA is stored in the cytoplasm of stage 6 oocytes until maturation when the process of poly(A) elongation stimulates its translation. Deletion analysis of the 3' untranslated region of G10 RNA has revealed that two sequence elements, UUUUUUAU and AAUAAA were both necessary and sufficient for polyadenylation and polysomal recruitment. In this communication, we have defined the U-rich region that is optimal for polyadenylation as UUUUUUAUAAAG, henceforth referred to as the cytoplasmic polyadenylation element (CPE). We have also identified unique sequence requirements in the 3' terminus of the RNA that can modulate polyadenylation even in the presence of wild-type cis elements. A time course of cytoplasmic polyadenylation in vivo shows that it is an early event of maturation and that it requires protein synthesis within the first 15 min of exposure to progesterone. MPF and cyclin can both induce polyadenylation but, at least with respect to MPF, cannot obviate the requirement for protein synthesis. To identify factors that may be responsible for maturation-specific polyadenylation, we employed extracts from oocytes and unfertilized eggs, the latter of which correctly polyadenylates exogenously added RNA. UV crosslinking demonstrated that an 82 kd protein binds to the U-rich CPE in egg, but not oocyte, extracts. The data suggest that progesterone, either in addition to or through MPF/cyclin, induces the synthesis of a factor during very early maturation that stimulates polyadenylation.(ABSTRACT TRUNCATED AT 250 WORDS)
Rodríguez-Martín, Carlos; Cidre, Florencia; Fernández-Teijeiro, Ana; Gómez-Mariano, Gema; de la Vega, Leticia; Ramos, Patricia; Zaballos, Ángel; Monzón, Sara; Alonso, Javier
2016-05-01
Retinoblastoma (RB, MIM 180200) is the paradigm of hereditary cancer. Individuals harboring a constitutional mutation in one allele of the RB1 gene have a high predisposition to develop RB. Here, we present the first case of familial RB caused by a de novo insertion of a full-length long interspersed element-1 (LINE-1) into intron 14 of the RB1 gene that caused a highly heterogeneous splicing pattern of RB1 mRNA. LINE-1 insertion was inferred by mRNA studies and full-length sequenced by massive parallel sequencing. Some of the aberrant mRNAs were produced by noncanonical acceptor splice sites, a new finding that up to date has not been described to occur upon LINE-1 retrotransposition. Our results clearly show that RNA-based strategies have the potential to detect disease-causing transposon insertions. It also confirms that the incorporation of new genetic approaches, such as massive parallel sequencing, contributes to characterize at the sequence level these unique and exceptional genetic alterations.
Nguyen, Thong T; Suryamohan, Kushal; Kuriakose, Boney; Janakiraman, Vasantharajan; Reichelt, Mike; Chaudhuri, Subhra; Guillory, Joseph; Divakaran, Neethu; Rabins, P E; Goel, Ridhi; Deka, Bhabesh; Sarkar, Suman; Ekka, Preety; Tsai, Yu-Chih; Vargas, Derek; Santhosh, Sam; Mohan, Sangeetha; Chin, Chen-Shan; Korlach, Jonas; Thomas, George; Babu, Azariah; Seshagiri, Somasekar
2018-06-12
We sequenced the Hyposidra talaca NPV (HytaNPV) double stranded circular DNA genome using PacBio single molecule sequencing technology. We found that the HytaNPV genome is 139,089 bp long with a GC content of 39.6%. It encodes 141 open reading frames (ORFs) including the 37 baculovirus core genes, 25 genes conserved among lepidopteran baculoviruses, 72 genes known in baculovirus, and 7 genes unique to the HytaNPV genome. It is a group II alphabaculovirus that codes for the F protein and lacks the gp64 gene found in group I alphabaculovirus viruses. Using RNA-seq, we confirmed the expression of the ORFs identified in the HytaNPV genome. Phylogenetic analysis showed HytaNPV to be closest to BusuNPV, SujuNPV and EcobNPV that infect other tea pests, Buzura suppressaria, Sucra jujuba, and Ectropis oblique, respectively. We identified repeat elements and a conserved non-coding baculovirus element in the genome. Analysis of the putative promoter sequences identified motif consistent with the temporal expression of the genes observed in the RNA-seq data.
Moustafa, Ibrahim M.; Shen, Hujun; Morton, Brandon; Colina, Coray M.; Cameron, Craig E.
2011-01-01
The viral RNA-dependent RNA polymerase (RdRp) is essential for multiplication of all RNA viruses. The sequence diversity of an RNA virus population contributes to its ability to infect the host. This diversity emanates from errors made by the RdRp during RNA synthesis. The physical basis for RdRp fidelity is unclear but is linked to conformational changes occurring during the nucleotide-addition cycle. To understand RdRp dynamics that might influence RdRp function, we have analyzed all-atom molecular dynamics (MD) simulations on the nanosecond timescale of four RdRps from the picornavirus family that exhibit 30–74% sequence identity. Principal component analysis showed that the major motions observed during the simulations derived from conserved structural motifs and regions of known function. Dynamics of residues participating in the same biochemical property, for example RNA binding, nucleotide binding or catalysis, were correlated even when spatially distant on the RdRp structure. The conserved and correlated dynamics of functional, structural elements suggest co-evolution of dynamics with structure and function of the RdRp. Crystal structures of all picornavirus RdRps exhibit a template-nascent RNA duplex channel too small to fully accommodate duplex RNA. Simulations revealed opening and closing motions of the RNA and NTP channels, which might be relevant to NTP entry, PPi exit and translocation. A role for nanosecond timescale dynamics in RdRp fidelity is supported by altered dynamics of the high-fidelity G64S derivative of PV RdRp relative to wild-type enzyme. PMID:21575642
Herrero, Noemi
2017-04-01
A new double-stranded RNA (dsRNA) mycovirus has been identified in the isolate NB IFR-19 of the entomopathogenic fungus Isaria javanica. Isaria javanica chrysovirus-1 (IjCV-1) constitutes a new member of the Chrysoviridae family, and its genome is made up of four dsRNA elements designated dsRNA1, 2, 3 and 4 from largest to smallest. dsRNA1 and dsRNA2 encode an RNA-dependent RNA polymerase (RdRp) and a coat protein (CP), respectively. dsRNA3 and 4 encode hypothetical proteins of unknown function. IjCV-1 constitutes the first report of a chrysovirus infecting the entomopathogenic fungus Isaria javanica.
RNA secondary structure prediction using soft computing.
Ray, Shubhra Sankar; Pal, Sankar K
2013-01-01
Prediction of RNA structure is invaluable in creating new drugs and understanding genetic diseases. Several deterministic algorithms and soft computing-based techniques have been developed for more than a decade to determine the structure from a known RNA sequence. Soft computing gained importance with the need to get approximate solutions for RNA sequences by considering the issues related with kinetic effects, cotranscriptional folding, and estimation of certain energy parameters. A brief description of some of the soft computing-based techniques, developed for RNA secondary structure prediction, is presented along with their relevance. The basic concepts of RNA and its different structural elements like helix, bulge, hairpin loop, internal loop, and multiloop are described. These are followed by different methodologies, employing genetic algorithms, artificial neural networks, and fuzzy logic. The role of various metaheuristics, like simulated annealing, particle swarm optimization, ant colony optimization, and tabu search is also discussed. A relative comparison among different techniques, in predicting 12 known RNA secondary structures, is presented, as an example. Future challenging issues are then mentioned.
Small gene family encoding an eggshell (chorion) protein of the human parasite Schistosoma mansoni
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bobek, L.A.; Rekosh, D.M.; Lo Verde, P.T.
1988-08-01
The authors isolated six independent genomic clones encoding schistosome chorion or eggshell proteins from a Schistosoma mansoni genomic library. A linkage map of five of the clones spanning 35 kilobase pairs (kbp) of the S. mansoni genome was constructed. The region contained two eggshell protein genes closely linked, separated by 7.5 kbp of intergenic DNA. The two genes of the cluster were arranged in the same orientation, that is, they were transcribed from the same strand. The sixth clone probably represents a third copy of the eggshell gene that is not contained within the 35-kbp region. The 5- end ofmore » the mRNA transcribed from these genes was defined by primer extension directly off the RNA. The ATCAT cap site sequence was homologous to a silkmoth chorion PuTCATT cap site sequence, where Pu indicates any purine. DNA sequence analysis showed that there were no introns in these genes. The DNA sequences of the three genes were very homologous to each other and to a cDNA clone, pSMf61-46, differing only in three or four nucleotices. A multiple TATA box was located at positions -23 to -31, and a CAAAT sequence was located at -52 upstream of the eggshell transcription unit. Comparison of sequences in regions further upstream with silkmoth and Drosophila sequences revealed very short elements that were shared. One such element, TCACGT, recently shown to be an essential cis-regulatory element for silkmoth chorion gene promoter function, was found at a similar position in all three organisms.« less
Zhang, Li-Feng; Li, Wan-Feng; Han, Su-Ying; Yang, Wen-Hua; Qi, Li-Wang
2013-10-15
A full-length cDNA and genomic sequences of a translationally controlled tumor protein (TCTP) gene were isolated from Japanese larch (Larix leptolepis) and designated LaTCTP. The length of the cDNA was 1, 043 bp and contained a 504 bp open reading frame that encodes a predicted protein of 167 amino acids, characterized by two signature sequences of the TCTP protein family. Analysis of the LaTCTP gene structure indicated four introns and five exons, and it is the largest of all currently known TCTP genes in plants. The 5'-flanking promoter region of LaTCTP was cloned using an improved TAIL-PCR technique. In this region we identified many important potential cis-acting elements, such as a Box-W1 (fungal elicitor responsive element), a CAT-box (cis-acting regulatory element related to meristem expression), a CGTCA-motif (cis-acting regulatory element involved in MeJA-responsiveness), a GT1-motif (light responsive element), a Skn-1-motif (cis-acting regulatory element required for endosperm expression) and a TGA-element (auxin-responsive element), suggesting that expression of LaTCTP is highly regulated. Expression analysis demonstrated ubiquitous localization of LaTCTP mRNA in the roots, stems and needles, high mRNA levels in the embryonal-suspensor mass (ESM), browning embryogenic cultures and mature somatic embryos, and low levels of mRNA at day five during somatic embryogenesis. We suggest that LaTCTP might participate in the regulation of somatic embryo development. These results provide a theoretical basis for understanding the molecular regulatory mechanism of LaTCTP and lay the foundation for artificial regulation of somatic embryogenesis. © 2013.
Yoo, Soonmoon; Kim, Hak H; Kim, Paul; Donnelly, Christopher J; Kalinski, Ashley L; Vuppalanchi, Deepika; Park, Michael; Lee, Seung J; Merianda, Tanuja T; Perrone-Bizzozero, Nora I; Twiss, Jeffery L
2013-09-01
Localized translation of axonal mRNAs contributes to developmental and regenerative axon growth. Although untranslated regions (UTRs) of many different axonal mRNAs appear to drive their localization, there has been no consensus RNA structure responsible for this localization. We recently showed that limited expression of ZBP1 protein restricts axonal localization of both β-actin and GAP-43 mRNAs. β-actin 3'UTR has a defined element for interaction with ZBP1, but GAP-43 mRNA shows no homology to this RNA sequence. Here, we show that an AU-rich regulatory element (ARE) in GAP-43's 3'UTR is necessary and sufficient for its axonal localization. Axonal GAP-43 mRNA levels increase after in vivo injury, and GAP-43 mRNA shows an increased half-life in regenerating axons. GAP-43 mRNA interacts with both HuD and ZBP1, and HuD and ZBP1 co-immunoprecipitate in an RNA-dependent fashion. Reporter mRNA with the GAP-43 ARE competes with endogenous β-actin mRNA for axonal localization and decreases axon length and branching similar to the β-actin 3'UTR competing with endogenous GAP-43 mRNA. Conversely, over-expressing GAP-43 coding sequence with its 3'UTR ARE increases axonal elongation and this effect is lost when just the ARE is deleted from GAP-43's 3'UTR. We have recently found that over-expression of GAP-43 using an axonally targeted construct with the 3'UTRs of GAP-43 promoted elongating growth of axons, while restricting the mRNA to the cell body with the 3'UTR of γ-actin had minimal effect on axon length. In this study, we show that the ARE in GAP-43's 3'UTR is responsible for localization of GAP-43 mRNA into axons and is sufficient for GAP-43 protein's role in elongating axonal growth. © 2013 International Society for Neurochemistry.
Hauler, Aron; Jonietz, Christian; Stoll, Birgit; Stoll, Katrin; Braun, Hans-Peter; Binder, Stefan
2013-05-01
The 5' ends of many mitochondrial transcripts are generated post-transcriptionally. Recently, we identified three RNA PROCESSING FACTORs required for 5' end maturation of different mitochondrial mRNAs in Arabidopsis thaliana. All of these factors are pentatricopeptide repeat proteins (PPRPs), highly similar to RESTORERs OF FERTILTY (RF), that rescue male fertility in cytoplasmic male-sterile lines from different species. Therefore, we suggested a general role of these RF-like PPRPs in mitochondrial 5' processing. We now identified RNA PROCESSING FACTOR 5, a PPRP not classified as an RF-like protein, required for the efficient 5' maturation of the nad6 and atp9 mRNAs as well as 26S rRNA. The precursor molecules of these RNAs share conserved sequence elements, approximately ranging from positions -50 to +9 relative to mature 5' mRNA termini, suggesting these sequences to be at least part of the cis elements required for processing. The knockout of RPF5 has only a moderate influence on 5' processing of atp9 mRNA, whereas the generation of the mature nad6 mRNA and 26S rRNA is almost completely abolished in the mutant. The latter leads to a 50% decrease of total 26S rRNA species, resulting in an imbalance between the large rRNA and 18S rRNA. Despite these severe changes in RNA levels and in the proportion between the 26S and 18S rRNAs, mitochondrial protein levels appear to be unaltered in the mutant, whereas seed germination capacity is markedly reduced. © 2013 The Authors The Plant Journal © 2013 John Wiley & Sons Ltd.
Nafissi, Maryam; Chau, Jeannette; Xu, Jimin
2012-01-01
Synthesis of the Fis nucleoid protein rapidly increases in response to nutrient upshifts, and Fis is one of the most abundant DNA binding proteins in Escherichia coli under nutrient-rich growth conditions. Previous work has shown that control of Fis synthesis occurs at transcription initiation of the dusB-fis operon. We show here that while translation of the dihydrouridine synthase gene dusB is low, unusual mechanisms operate to enable robust translation of fis. At least two RNA sequence elements located within the dusB coding region are responsible for high fis translation. The most important is an AU element centered 35 nucleotides (nt) upstream of the fis AUG, which may function as a binding site for ribosomal protein S1. In addition, a 44-nt segment located upstream of the AU element and predicted to form a stem-loop secondary structure plays a prominent role in enhancing fis translation. On the other hand, mutations close to the AUG, including over a potential Shine-Dalgarno sequence, have little effect on Fis protein levels. The AU element and stem-loop regions are phylogenetically conserved within dusB-fis operons of representative enteric bacteria. PMID:22389479
The identification and functional annotation of RNA structures conserved in vertebrates.
Seemann, Stefan E; Mirza, Aashiq H; Hansen, Claus; Bang-Berthelsen, Claus H; Garde, Christian; Christensen-Dalsgaard, Mikkel; Torarinsson, Elfar; Yao, Zizhen; Workman, Christopher T; Pociot, Flemming; Nielsen, Henrik; Tommerup, Niels; Ruzzo, Walter L; Gorodkin, Jan
2017-08-01
Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization, and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for conserved RNA structures (CRSs), leveraging structure-based, rather than sequence-based, alignments. After careful correction for sequence identity and GC content, we predict ∼516,000 human genomic regions containing CRSs. We find that a substantial fraction of human-mouse CRS regions (1) colocalize consistently with binding sites of the same RNA binding proteins (RBPs) or (2) are transcribed in corresponding tissues. Additionally, a CaptureSeq experiment revealed expression of many of our CRS regions in human fetal brain, including 662 novel ones. For selected human and mouse candidate pairs, qRT-PCR and in vitro RNA structure probing supported both shared expression and shared structure despite low abundance and low sequence identity. About 30,000 CRS regions are located near coding or long noncoding RNA genes or within enhancers. Structured (CRS overlapping) enhancer RNAs and extended 3' ends have significantly increased expression levels over their nonstructured counterparts. Our findings of transcribed uncharacterized regulatory regions that contain CRSs support their RNA-mediated functionality. © 2017 Seemann et al.; Published by Cold Spring Harbor Laboratory Press.
Fisher, R P; Topper, J N; Clayton, D A
1987-07-17
Selective transcription of human mitochondrial DNA requires a transcription factor (mtTF) in addition to an essentially nonselective RNA polymerase. Partially purified mtTF is able to sequester promoter-containing DNA in preinitiation complexes in the absence of mitochondrial RNA polymerase, suggesting a DNA-binding mechanism for factor activity. Functional domains, required for positive transcriptional regulation by mtTF, are identified within both major promoters of human mtDNA through transcription of mutant promoter templates in a reconstituted in vitro system. These domains are essentially coextensive with DNA sequences protected from nuclease digestion by mtTF-binding. Comparison of the sequences of the two mtTF-responsive elements reveals significant homology only when one sequence is inverted; the binding sites are in opposite orientations with respect to the predominant direction of transcription. Thus mtTF may function bidirectionally, requiring additional protein-DNA interactions to dictate transcriptional polarity. The mtTF-responsive elements are arrayed as direct repeats, separated by approximately 80 bp within the displacement-loop region of human mitochondrial DNA; this arrangement may reflect duplication of an ancestral bidirectional promoter, giving rise to separate, unidirectional promoters for each strand.
Graveley, Brenton R.
2008-01-01
Summary Drosophila Dscam encodes 38,016 distinct axon guidance receptors through the mutually exclusive alternative splicing of 95 variable exons. Importantly, known mechanisms that ensure the mutually exclusive splicing of pairs of exons cannot explain this phenomenon in Dscam. I have identified two classes of conserved elements in the Dscam exon 6 cluster, which contains 48 alternative exons—the docking site, located in the intron downstream of constitutive exon 5, and the selector sequences, which are located upstream of each exon 6 variant. Strikingly, each selector sequence is complementary to a portion of the docking site, and this pairing juxtaposes one, and only one, alternative exon to the upstream constitutive exon. The mutually exclusive nature of the docking site:selector sequence interactions suggests that the formation of these competing RNA structures is a central component of the mechanism guaranteeing that only one exon 6 variant is included in each Dscam mRNA. PMID:16213213
Pea chloroplast tRNA(Lys) (UUU) gene: transcription and analysis of an intron-containing gene.
Boyer, S K; Mullet, J E
1988-07-01
The pea chloroplast trnK gene which encodes tRNA(Lys) (UUU) was sequenced. TrnK is located 210 bp upstream from the promoter of psbA and immediately downstream from the 3'-end of rbcL. The gene is transcribed from the same DNA strand as psbA and rbcL. A 2447 bp intron with class II features is located in the trnK anticodon loop. The intron contains a 506 amino acid open reading frame which could encode an RNA maturase. The primary transcript of trnK is 2.9 kb long; its 5'-end was identified as a site of transcription initiation by in vitro transcription experiments. The 5'-terminus is adjacent to DNA sequences previously identified as transcription promoter elements. The most abundant trnK transcript is 2.5 kb long with termini corresponding to the 5' and 3' ends of the trnK exons. Intron specific RNAs were not detected. This suggests that RNA processing which produces tRNA(Lys) leads to rapid degradation of intron sequences.
Park, Eonyoung; Maquat, Lynne E.
2013-01-01
Staufen1 (STAU1)-mediated mRNA decay (SMD) is an mRNA degradation process in mammalian cells that is mediated by the binding of STAU1 to a STAU1-binding site (SBS) within the 3'-untranslated region (3'UTR) of target mRNAs. During SMD, STAU1, a double-stranded (ds) RNA-binding protein, recognizes dsRNA structures formed either by intramolecular base-pairing of 3'UTR sequences or by intermolecular base-pairing of 3'UTR sequences with a long noncoding RNA (lncRNA) via partially complementary Alu elements. Recently, STAU2, a paralog of STAU1, has also been reported to mediate SMD. Both STAU1 and STAU2 interact directly with the ATP-dependent RNA helicase UPF1, a key SMD factor, enhancing its helicase activity to promote effective SMD. Moreover, STAU1 and STAU2 form homodimeric and heterodimeric interactions via domain-swapping. Since both SMD and the mechanistically related nonsense-mediated mRNA decay (NMD) employ UPF1, SMD and NMD are competitive pathways. Competition contributes to cellular differentiation processes, such as myogenesis and adipogenesis, placing SMD at the heart of various physiologically important mechanisms. PMID:23681777
The impact of CRISPR repeat sequence on structures of a Cas6 protein-RNA complex
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wang, Ruiying; Zheng, Han; Preamplume, Gan
The repeat-associated mysterious proteins (RAMPs) comprise the most abundant family of proteins involved in prokaryotic immunity against invading genetic elements conferred by the clustered regularly interspaced short palindromic repeat (CRISPR) system. Cas6 is one of the first characterized RAMP proteins and is a key enzyme required for CRISPR RNA maturation. Despite a strong structural homology with other RAMP proteins that bind hairpin RNA, Cas6 distinctly recognizes single-stranded RNA. Previous structural and biochemical studies show that Cas6 captures the 5' end while cleaving the 3' end of the CRISPR RNA. Here, we describe three structures and complementary biochemical analysis of amore » noncatalytic Cas6 homolog from Pyrococcus horikoshii bound to CRISPR repeat RNA of different sequences. Our study confirms the specificity of the Cas6 protein for single-stranded RNA and further reveals the importance of the bases at Positions 5-7 in Cas6-RNA interactions. Substitutions of these bases result in structural changes in the protein-RNA complex including its oligomerization state.« less
Rousseau, Beth A; Hou, Zhonggang; Gramelspacher, Max J; Zhang, Yan
2018-03-01
The microbial CRISPR systems enable adaptive defense against mobile elements and also provide formidable tools for genome engineering. The Cas9 proteins are type II CRISPR-associated, RNA-guided DNA endonucleases that identify double-stranded DNA targets by sequence complementarity and protospacer adjacent motif (PAM) recognition. Here we report that the type II-C CRISPR-Cas9 from Neisseria meningitidis (Nme) is capable of programmable, RNA-guided, site-specific cleavage and recognition of single-stranded RNA targets and that this ribonuclease activity is independent of the PAM sequence. We define the mechanistic feature and specificity constraint for RNA cleavage by NmeCas9 and also show that nuclease null dNmeCas9 binds to RNA target complementary to CRISPR RNA. Finally, we demonstrate that NmeCas9-catalyzed RNA cleavage can be blocked by three families of type II-C anti-CRISPR proteins. These results fundamentally expand the targeting capacities of CRISPR-Cas9 and highlight the potential utility of NmeCas9 as a single platform to target both RNA and DNA. Copyright © 2018 Elsevier Inc. All rights reserved.
Compilation of small ribosomal subunit RNA structures.
Neefs, J M; Van de Peer, Y; De Rijk, P; Chapelle, S; De Wachter, R
1993-01-01
The database on small ribosomal subunit RNA structure contained 1804 nucleotide sequences on April 23, 1993. This number comprises 365 eukaryotic, 65 archaeal, 1260 bacterial, 30 plastidial, and 84 mitochondrial sequences. These are stored in the form of an alignment in order to facilitate the use of the database as input for comparative studies on higher-order structure and for reconstruction of phylogenetic trees. The elements of the postulated secondary structure for each molecule are indicated by special symbols. The database is available on-line directly from the authors by ftp and can also be obtained from the EMBL nucleotide sequence library by electronic mail, ftp, and on CD ROM disk. PMID:8332525
Analytical study of avian reticuloendotheliosis virus dimeric RNA generated in vivo and in vitro.
Darlix, J L; Gabus, C; Allain, B
1992-12-01
The retroviral genome consists of two identical RNA molecules associated at their 5' ends by a stable structure called the dimer linkage structure. The dimer linkage structure, while maintaining the dimer state of the retroviral genome, might also be involved in packaging and reverse transcription, as well as recombination during proviral DNA synthesis. To study the dimer structure of the retroviral genome and the mechanism of dimerization, we analyzed features of the dimeric genome of reticuloendotheliosis virus (REV) type A and identified elements required for its dimerization. Here we report that the REV dimeric genome extracted from virions and infected cells, as well as that synthesized in vitro, is more resistant to heat denaturation than avian sarcoma and leukemia virus, murine leukemia virus, or human immunodeficiency virus type 1 dimeric RNA. The minimal domain required to form a stable REV RNA dimer in vitro was found to map between positions 268 and 452 (KpnI and SalI sites), thus corresponding to the E encapsidation sequence (J. E. Embretson and H. M. Temin, J. Virol. 61:2675-2683, 1987). In addition, both the 5' and 3' halves of E are necessary in cis for RNA dimerization and the extent of RNA dimerization is influenced by viral sequences flanking E. Rapid and efficient dimerization of REV RNA containing gag sequences in addition to the E sequences and annealing of replication primer tRNA(Pro) to the primer-binding site necessitate the nucleocapsid protein.
Analytical study of avian reticuloendotheliosis virus dimeric RNA generated in vivo and in vitro.
Darlix, J L; Gabus, C; Allain, B
1992-01-01
The retroviral genome consists of two identical RNA molecules associated at their 5' ends by a stable structure called the dimer linkage structure. The dimer linkage structure, while maintaining the dimer state of the retroviral genome, might also be involved in packaging and reverse transcription, as well as recombination during proviral DNA synthesis. To study the dimer structure of the retroviral genome and the mechanism of dimerization, we analyzed features of the dimeric genome of reticuloendotheliosis virus (REV) type A and identified elements required for its dimerization. Here we report that the REV dimeric genome extracted from virions and infected cells, as well as that synthesized in vitro, is more resistant to heat denaturation than avian sarcoma and leukemia virus, murine leukemia virus, or human immunodeficiency virus type 1 dimeric RNA. The minimal domain required to form a stable REV RNA dimer in vitro was found to map between positions 268 and 452 (KpnI and SalI sites), thus corresponding to the E encapsidation sequence (J. E. Embretson and H. M. Temin, J. Virol. 61:2675-2683, 1987). In addition, both the 5' and 3' halves of E are necessary in cis for RNA dimerization and the extent of RNA dimerization is influenced by viral sequences flanking E. Rapid and efficient dimerization of REV RNA containing gag sequences in addition to the E sequences and annealing of replication primer tRNA(Pro) to the primer-binding site necessitate the nucleocapsid protein. Images PMID:1331519
Defective control of pre–messenger RNA splicing in human disease
Shkreta, Lulzim
2016-01-01
Examples of associations between human disease and defects in pre–messenger RNA splicing/alternative splicing are accumulating. Although many alterations are caused by mutations in splicing signals or regulatory sequence elements, recent studies have noted the disruptive impact of mutated generic spliceosome components and splicing regulatory proteins. This review highlights recent progress in our understanding of how the altered splicing function of RNA-binding proteins contributes to myelodysplastic syndromes, cancer, and neuropathologies. PMID:26728853
Characterization of an endogenous retrovirus class in elephants and their relatives
Greenwood, Alex D; Englbrecht, Claudia C; MacPhee, Ross DE
2004-01-01
Background Endogenous retrovirus-like elements (ERV-Ls, primed with tRNA leucine) are a diverse group of reiterated sequences related to foamy viruses and widely distributed among mammals. As shown in previous investigations, in many primates and rodents this class of elements has remained transpositionally active, as reflected by increased copy number and high sequence diversity within and among taxa. Results Here we examine whether proviral-like sequences may be suitable molecular probes for investigating the phylogeny of groups known to have high element diversity. As a test we characterized ERV-Ls occurring in a sample of extant members of superorder Uranotheria (Asian and African elephants, manatees, and hyraxes). The ERV-L complement in this group is even more diverse than previously suspected, and there is sequence evidence for active expansion, particularly in elephantids. Many of the elements characterized have protein coding potential suggestive of activity. Conclusions In general, the evidence supports the hypothesis that the complement had a single origin within basal Uranotheria. PMID:15476555
Chishima, Takafumi; Iwakiri, Junichi
2018-01-01
It has been recently suggested that transposable elements (TEs) are re-used as functional elements of long non-coding RNAs (lncRNAs). This is supported by some examples such as the human endogenous retrovirus subfamily H (HERVH) elements contained within lncRNAs and expressed specifically in human embryonic stem cells (hESCs), as required to maintain hESC identity. There are at least two unanswered questions about all lncRNAs. How many TEs are re-used within lncRNAs? Are there any other TEs that affect tissue specificity of lncRNA expression? To answer these questions, we comprehensively identify TEs that are significantly related to tissue-specific expression levels of lncRNAs. We downloaded lncRNA expression data corresponding to normal human tissue from the Expression Atlas and transformed the data into tissue specificity estimates. Then, Fisher’s exact tests were performed to verify whether the presence or absence of TE-derived sequences influences the tissue specificity of lncRNA expression. Many TE–tissue pairs associated with tissue-specific expression of lncRNAs were detected, indicating that multiple TE families can be re-used as functional domains or regulatory sequences of lncRNAs. In particular, we found that the antisense promoter region of L1PA2, a LINE-1 subfamily, appears to act as a promoter for lncRNAs with placenta-specific expression. PMID:29315213
Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures
Stark, Alexander; Lin, Michael F.; Kheradpour, Pouya; Pedersen, Jakob S.; Parts, Leopold; Carlson, Joseph W.; Crosby, Madeline A.; Rasmussen, Matthew D.; Roy, Sushmita; Deoras, Ameya N.; Ruby, J. Graham; Brennecke, Julius; Hodges, Emily; Hinrichs, Angie S.; Caspi, Anat; Paten, Benedict; Park, Seung-Won; Han, Mira V.; Maeder, Morgan L.; Polansky, Benjamin J.; Robson, Bryanne E.; Aerts, Stein; van Helden, Jacques; Hassan, Bassem; Gilbert, Donald G.; Eastman, Deborah A.; Rice, Michael; Weir, Michael; Hahn, Matthew W.; Park, Yongkyu; Dewey, Colin N.; Pachter, Lior; Kent, W. James; Haussler, David; Lai, Eric C.; Bartel, David P.; Hannon, Gregory J.; Kaufman, Thomas C.; Eisen, Michael B.; Clark, Andrew G.; Smith, Douglas; Celniker, Susan E.; Gelbart, William M.; Kellis, Manolis
2008-01-01
Sequencing of multiple related species followed by comparative genomics analysis constitutes a powerful approach for the systematic understanding of any genome. Here, we use the genomes of 12 Drosophila species for the de novo discovery of functional elements in the fly. Each type of functional element shows characteristic patterns of change, or ‘evolutionary signatures’, dictated by its precise selective constraints. Such signatures enable recognition of new protein-coding genes and exons, spurious and incorrect gene annotations, and numerous unusual gene structures, including abundant stop-codon readthrough. Similarly, we predict non-protein-coding RNA genes and structures, and new microRNA (miRNA) genes. We provide evidence of miRNA processing and functionality from both hairpin arms and both DNA strands. We identify several classes of pre- and post-transcriptional regulatory motifs, and predict individual motif instances with high confidence. We also study how discovery power scales with the divergence and number of species compared, and we provide general guidelines for comparative studies. PMID:17994088
Romero-López, Cristina; Barroso-delJesus, Alicia; Berzal-Herranz, Alfredo
2017-02-24
The RNA genome of the hepatitis C virus (HCV) establishes a network of long-distance RNA-RNA interactions that direct the progression of the infective cycle. This work shows that the dimerization of the viral genome, which is initiated at the dimer linkage sequence (DLS) within the 3'UTR, is promoted by the CRE region, while the IRES is a negative regulatory partner. Using differential 2'-acylation probing (SHAPE-dif) and molecular interference (HMX) technologies, the CRE activity was found to mainly lie in the critical 5BSL3.2 domain, while the IRES-mediated effect is dependent upon conserved residues within the essential structural elements JIIIabc, JIIIef and PK2. These findings support the idea that, along with the DLS motif, the IRES and CRE are needed to control HCV genome dimerization. They also provide evidences of a novel function for these elements as chaperone-like partners that fine-tune the architecture of distant RNA domains within the HCV genome.
Romero-López, Cristina; Barroso-delJesus, Alicia; Berzal-Herranz, Alfredo
2017-01-01
The RNA genome of the hepatitis C virus (HCV) establishes a network of long-distance RNA-RNA interactions that direct the progression of the infective cycle. This work shows that the dimerization of the viral genome, which is initiated at the dimer linkage sequence (DLS) within the 3′UTR, is promoted by the CRE region, while the IRES is a negative regulatory partner. Using differential 2′-acylation probing (SHAPE-dif) and molecular interference (HMX) technologies, the CRE activity was found to mainly lie in the critical 5BSL3.2 domain, while the IRES-mediated effect is dependent upon conserved residues within the essential structural elements JIIIabc, JIIIef and PK2. These findings support the idea that, along with the DLS motif, the IRES and CRE are needed to control HCV genome dimerization. They also provide evidences of a novel function for these elements as chaperone-like partners that fine-tune the architecture of distant RNA domains within the HCV genome. PMID:28233845
Short intronic repeat sequences facilitate circular RNA production
Liang, Dongming
2014-01-01
Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery “backsplices” and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3′ end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. PMID:25281217
mRNA deep sequencing reveals 75 new genes and a complex transcriptional landscape in Mimivirus
Legendre, Matthieu; Audic, Stéphane; Poirot, Olivier; Hingamp, Pascal; Seltzer, Virginie; Byrne, Deborah; Lartigue, Audrey; Lescot, Magali; Bernadac, Alain; Poulain, Julie; Abergel, Chantal; Claverie, Jean-Michel
2010-01-01
Mimivirus, a virus infecting Acanthamoeba, is the prototype of the Mimiviridae, the latest addition to the nucleocytoplasmic large DNA viruses. The Mimivirus genome encodes close to 1000 proteins, many of them never before encountered in a virus, such as four amino-acyl tRNA synthetases. To explore the physiology of this exceptional virus and identify the genes involved in the building of its characteristic intracytoplasmic “virion factory,” we coupled electron microscopy observations with the massively parallel pyrosequencing of the polyadenylated RNA fractions of Acanthamoeba castellanii cells at various time post-infection. We generated 633,346 reads, of which 322,904 correspond to Mimivirus transcripts. This first application of deep mRNA sequencing (454 Life Sciences [Roche] FLX) to a large DNA virus allowed the precise delineation of the 5′ and 3′ extremities of Mimivirus mRNAs and revealed 75 new transcripts including several noncoding RNAs. Mimivirus genes are expressed across a wide dynamic range, in a finely regulated manner broadly described by three main temporal classes: early, intermediate, and late. This RNA-seq study confirmed the AAAATTGA sequence as an early promoter element, as well as the presence of palindromes at most of the polyadenylation sites. It also revealed a new promoter element correlating with late gene expression, which is also prominent in Sputnik, the recently described Mimivirus “virophage.” These results—validated genome-wide by the hybridization of total RNA extracted from infected Acanthamoeba cells on a tiling array (Agilent)—will constitute the foundation on which to build subsequent functional studies of the Mimivirus/Acanthamoeba system. PMID:20360389
Åsman, Anna K M; Vetukuri, Ramesh R; Jahan, Sultana N; Fogelqvist, Johan; Corcoran, Pádraic; Avrova, Anna O; Whisson, Stephen C; Dixelius, Christina
2014-12-10
The oomycete Phytophthora infestans possesses active RNA silencing pathways, which presumably enable this plant pathogen to control the large numbers of transposable elements present in its 240 Mb genome. Small RNAs (sRNAs), central molecules in RNA silencing, are known to also play key roles in this organism, notably in regulation of critical effector genes needed for infection of its potato host. To identify additional classes of sRNAs in oomycetes, we mapped deep sequencing reads to transfer RNAs (tRNAs) thereby revealing the presence of 19-40 nt tRNA-derived RNA fragments (tRFs). Northern blot analysis identified abundant tRFs corresponding to half tRNA molecules. Some tRFs accumulated differentially during infection, as seen by examining sRNAs sequenced from P. infestans-potato interaction libraries. The putative connection between tRF biogenesis and the canonical RNA silencing pathways was investigated by employing hairpin RNA-mediated RNAi to silence the genes encoding P. infestans Argonaute (PiAgo) and Dicer (PiDcl) endoribonucleases. By sRNA sequencing we show that tRF accumulation is PiDcl1-independent, while Northern hybridizations detected reduced levels of specific tRNA-derived species in the PiAgo1 knockdown line. Our findings extend the sRNA diversity in oomycetes to include fragments derived from non-protein-coding RNA transcripts and identify tRFs with elevated levels during infection of potato by P. infestans.
Bartels, Hanni; Luban, Jeremy
2014-09-12
All retroviruses synthesize essential proteins via alternatively spliced mRNAs. Retrovirus genera, though, exploit different mechanisms to coordinate the synthesis of proteins from alternatively spliced mRNAs. The best studied of these retroviral, post-transcriptional effectors are the trans-acting Rev protein of lentiviruses and the cis-acting constitutive transport element (CTE) of the betaretrovirus Mason-Pfizer monkey virus (MPMV). How members of the gammaretrovirus genus translate protein from unspliced RNA has not been elucidated. The mechanism by which two gammaretroviruses, XMRV and MLV, synthesize the Gag polyprotein (Pr65Gag) from full-length, unspliced mRNA was investigated here. The yield of Pr65Gag from a gag-only expression plasmid was found to be at least 30-fold less than that from an otherwise isogenic gag-pol expression plasmid. A frameshift mutation disrupting the pol open reading frame within the gag-pol expression plasmid did not decrease Pr65Gag production and 398 silent nucleotide changes engineered into gag rendered Pr65Gag synthesis pol-independent. These results are consistent with pol-encoded RNA acting in cis to promote Pr65Gag translation. Two independently-acting pol fragments were identified by screening 17 pol deletion mutations. To determine the mechanism by which pol promoted Pr65Gag synthesis, gag RNA in total and cytoplasmic fractions was quantitated by northern blot and by RT-PCR. The pol sequences caused, maximally, three-fold increase in total or cytoplasmic gag mRNA. Instead, pol sequences increased gag mRNA association with polyribosomes ~100-fold, a magnitude sufficient to explain the increase in Pr65Gag translation efficiency. The MPMV CTE, an NXF1-binding element, substituted for pol in promoting Pr65Gag synthesis. A pol RNA stem-loop resembling the CTE promoted Pr65Gag synthesis. Over-expression of NXF1 and NXT, host factors that bind to the MPMV CTE, synergized with pol to promote gammaretroviral gag RNA loading onto polysomes and to increase Pr65Gag synthesis. Conversely, Gag polyprotein synthesis was decreased by NXF1 knockdown. Finally, overexpression of SRp20, a shuttling protein that binds to NXF1 and promotes NXF1 binding to RNA, also increased gag RNA loading onto polysomes and increased Pr65Gag synthesis. These experiments demonstrate that gammaretroviral pol sequences act in cis to recruit NXF1 and SRp20 to promote polysome loading of gag RNA and, thereby license the synthesis of Pr65Gag from unspliced mRNA.
Control of transcriptional pausing by biased thermal fluctuations on repetitive genomic sequences
Imashimizu, Masahiko; Afek, Ariel; Takahashi, Hiroki; Lubkowska, Lucyna; Lukatsky, David B.
2016-01-01
In the process of transcription elongation, RNA polymerase (RNAP) pauses at highly nonrandom positions across genomic DNA, broadly regulating transcription; however, molecular mechanisms responsible for the recognition of such pausing positions remain poorly understood. Here, using a combination of statistical mechanical modeling and high-throughput sequencing and biochemical data, we evaluate the effect of thermal fluctuations on the regulation of RNAP pausing. We demonstrate that diffusive backtracking of RNAP, which is biased by repetitive DNA sequence elements, causes transcriptional pausing. This effect stems from the increased microscopic heterogeneity of an elongation complex, and thus is entropy-dominated. This report shows a linkage between repetitive sequence elements encoded in the genome and regulation of RNAP pausing driven by thermal fluctuations. PMID:27830653
RNA 3D Modules in Genome-Wide Predictions of RNA 2D Structure
Theis, Corinna; Zirbel, Craig L.; zu Siederdissen, Christian Höner; Anthon, Christian; Hofacker, Ivo L.; Nielsen, Henrik; Gorodkin, Jan
2015-01-01
Recent experimental and computational progress has revealed a large potential for RNA structure in the genome. This has been driven by computational strategies that exploit multiple genomes of related organisms to identify common sequences and secondary structures. However, these computational approaches have two main challenges: they are computationally expensive and they have a relatively high false discovery rate (FDR). Simultaneously, RNA 3D structure analysis has revealed modules composed of non-canonical base pairs which occur in non-homologous positions, apparently by independent evolution. These modules can, for example, occur inside structural elements which in RNA 2D predictions appear as internal loops. Hence one question is if the use of such RNA 3D information can improve the prediction accuracy of RNA secondary structure at a genome-wide level. Here, we use RNAz in combination with 3D module prediction tools and apply them on a 13-way vertebrate sequence-based alignment. We find that RNA 3D modules predicted by metaRNAmodules and JAR3D are significantly enriched in the screened windows compared to their shuffled counterparts. The initially estimated FDR of 47.0% is lowered to below 25% when certain 3D module predictions are present in the window of the 2D prediction. We discuss the implications and prospects for further development of computational strategies for detection of RNA 2D structure in genomic sequence. PMID:26509713
Smyth, Redmond P; Smith, Maureen R; Jousset, Anne-Caroline; Despons, Laurence; Laumond, Géraldine; Decoville, Thomas; Cattenoz, Pierre; Moog, Christiane; Jossinet, Fabrice; Mougel, Marylène; Paillart, Jean-Christophe; von Kleist, Max; Marquet, Roland
2018-05-18
Non-coding RNA regulatory elements are important for viral replication, making them promising targets for therapeutic intervention. However, regulatory RNA is challenging to detect and characterise using classical structure-function assays. Here, we present in cell Mutational Interference Mapping Experiment (in cell MIME) as a way to define RNA regulatory landscapes at single nucleotide resolution under native conditions. In cell MIME is based on (i) random mutation of an RNA target, (ii) expression of mutated RNA in cells, (iii) physical separation of RNA into functional and non-functional populations, and (iv) high-throughput sequencing to identify mutations affecting function. We used in cell MIME to define RNA elements within the 5' region of the HIV-1 genomic RNA (gRNA) that are important for viral replication in cells. We identified three distinct RNA motifs controlling intracellular gRNA production, and two distinct motifs required for gRNA packaging into virions. Our analysis reveals the 73AAUAAA78 polyadenylation motif within the 5' PolyA domain as a dual regulator of gRNA production and gRNA packaging, and demonstrates that a functional polyadenylation signal is required for viral packaging even though it negatively affects gRNA production.
Smith, Maureen R; Jousset, Anne-Caroline; Despons, Laurence; Laumond, Géraldine; Decoville, Thomas; Cattenoz, Pierre; Moog, Christiane; Jossinet, Fabrice; Mougel, Marylène; Paillart, Jean-Christophe
2018-01-01
Abstract Non-coding RNA regulatory elements are important for viral replication, making them promising targets for therapeutic intervention. However, regulatory RNA is challenging to detect and characterise using classical structure-function assays. Here, we present in cell Mutational Interference Mapping Experiment (in cell MIME) as a way to define RNA regulatory landscapes at single nucleotide resolution under native conditions. In cell MIME is based on (i) random mutation of an RNA target, (ii) expression of mutated RNA in cells, (iii) physical separation of RNA into functional and non-functional populations, and (iv) high-throughput sequencing to identify mutations affecting function. We used in cell MIME to define RNA elements within the 5′ region of the HIV-1 genomic RNA (gRNA) that are important for viral replication in cells. We identified three distinct RNA motifs controlling intracellular gRNA production, and two distinct motifs required for gRNA packaging into virions. Our analysis reveals the 73AAUAAA78 polyadenylation motif within the 5′ PolyA domain as a dual regulator of gRNA production and gRNA packaging, and demonstrates that a functional polyadenylation signal is required for viral packaging even though it negatively affects gRNA production. PMID:29514260
Hamilton, P T; Reeve, J N
1985-01-01
DNA fragments cloned from the methanogenic archaebacterium Methanobrevibacter smithii which complement mutations in the purE and proC genes of E. coli have been sequenced. Sequence analyses, transposon mutagenesis and expression in E. coli minicells indicate that purE and proC complementations result from the synthesis of M. smithii polypeptides with molecular weights of 36,697 and 27,836 respectively. The encoding genes appear to be located in operons. The M. smithii genome contains 69% A/T basepairs (bp) which is reflected in unusual codon usages and intergenic regions containing approximately 85% A/T bp. An insertion element, designated ISM1, was found within the cloned M. smithii DNA located adjacent to the proC complementing region. ISM1 is 1381 bp in length, has 29 bp terminal inverted repeat sequences and contains one major ORF encoded in 87% of the ISM1 sequence. ISM1 is mobile, present in approximately 10 copies per genome and integration duplicates 8 bp at the site of insertion. The duplicated sequences show homology with sequences within the 29 bp terminal repeat sequence of ISM1. Comparison of our data with sequences from halophilic archaebacteria suggests that 5'GAANTTTCA and 5'TTTTAATATAAA may be consensus promoter sequences for archaebacteria. These sequences closely resemble the consensus sequences which precede Drosophila heat-shock genes (Pelham 1982; Davidson et al. 1983). Methanogens appear to employ the eubacterial system of mRNA: 16SrRNA hybridization to ensure initiation of translation; the consensus ribosome binding sequence is 5'AGGTGA.
Herrero, Noemi
2016-12-01
Purpureocillium lilacinum is a ubiquitous saprophytic fungus commonly isolated from soils and widely known as a biological control agent against phytopathogenic nematodes and pest insects. Mycoviruses infect a wide number of fungal species, but the study of viruses infecting entomopathogenic fungi is still quite recent. In this study, a total of 86 P. lilacinum isolates collected from soil in natural and cultivated habitats throughout the Czech Republic were analyzed; 22 % of the isolates harbored double-stranded RNA (dsRNA) elements with viral characteristics. These results suggest that mycoviruses are common in P. lilacinum. One of the most common dsRNA elements detected in the survey was completely sequenced and corresponded to the 2,864-bp genome of a previously undescribed mycovirus, designated Purpureocillium lilacinum nonsegmented virus 1 (PlNV-1). Phylogenetic analysis of the RNA-dependent RNA polymerase of PlNV-1 indicated that this virus might belong to a new taxon related to the family Partitiviridae.
Stacking interactions in PUF-RNA complexes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yiling Koh, Yvonne; Wang, Yeming; Qiu, Chen
2012-07-02
Stacking interactions between amino acids and bases are common in RNA-protein interactions. Many proteins that regulate mRNAs interact with single-stranded RNA elements in the 3' UTR (3'-untranslated region) of their targets. PUF proteins are exemplary. Here we focus on complexes formed between a Caenorhabditis elegans PUF protein, FBF, and its cognate RNAs. Stacking interactions are particularly prominent and involve every RNA base in the recognition element. To assess the contribution of stacking interactions to formation of the RNA-protein complex, we combine in vivo selection experiments with site-directed mutagenesis, biochemistry, and structural analysis. Our results reveal that the identities of stackingmore » amino acids in FBF affect both the affinity and specificity of the RNA-protein interaction. Substitutions in amino acid side chains can restrict or broaden RNA specificity. We conclude that the identities of stacking residues are important in achieving the natural specificities of PUF proteins. Similarly, in PUF proteins engineered to bind new RNA sequences, the identity of stacking residues may contribute to 'target' versus 'off-target' interactions, and thus be an important consideration in the design of proteins with new specificities.« less
TEcandidates: Prediction of genomic origin of expressed Transposable Elements using RNA-seq data.
Valdebenito-Maturana, Braulio; Riadi, Gonzalo
2018-06-01
In recent years, Transposable Elements (TEs) have been related to gene regulation. However, estimating the origin of expression of TEs through RNA-seq is complicated by multimapping reads coming from their repetitive sequences. Current approaches that address multimapping reads are focused in expression quantification and not in finding the origin of expression. Addressing the genomic origin of expressed TEs could further aid in understanding the role that TEs might have in the cell. We have developed a new pipeline called TEcandidates, based on de novo transcriptome assembly to assess the instances of TEs being expressed, along with their location, to include in downstream DE analysis. TEcandidates takes as input the RNA-seq data, the genome sequence and the TE annotation file, and returns a list of coordinates of candidate TEs being expressed, the TEs that have been removed, and the genome sequence with removed TEs as masked. This masked genome is suited to include TEs in downstream expression analysis, as the ambiguity of reads coming from TEs is significantly reduced in the mapping step of the analysis. The script which runs the pipeline can be downloaded at http://www.mobilomics.org/tecandidates/downloads or http://github.com/TEcandidates/TEcandidates. griadi@utalca.cl. Supplementary data are available at Bioinformatics online.
Maximizing mutagenesis with solubilized CRISPR-Cas9 ribonucleoprotein complexes.
Burger, Alexa; Lindsay, Helen; Felker, Anastasia; Hess, Christopher; Anders, Carolin; Chiavacci, Elena; Zaugg, Jonas; Weber, Lukas M; Catena, Raul; Jinek, Martin; Robinson, Mark D; Mosimann, Christian
2016-06-01
CRISPR-Cas9 enables efficient sequence-specific mutagenesis for creating somatic or germline mutants of model organisms. Key constraints in vivo remain the expression and delivery of active Cas9-sgRNA ribonucleoprotein complexes (RNPs) with minimal toxicity, variable mutagenesis efficiencies depending on targeting sequence, and high mutation mosaicism. Here, we apply in vitro assembled, fluorescent Cas9-sgRNA RNPs in solubilizing salt solution to achieve maximal mutagenesis efficiency in zebrafish embryos. MiSeq-based sequence analysis of targeted loci in individual embryos using CrispRVariants, a customized software tool for mutagenesis quantification and visualization, reveals efficient bi-allelic mutagenesis that reaches saturation at several tested gene loci. Such virtually complete mutagenesis exposes loss-of-function phenotypes for candidate genes in somatic mutant embryos for subsequent generation of stable germline mutants. We further show that targeting of non-coding elements in gene regulatory regions using saturating mutagenesis uncovers functional control elements in transgenic reporters and endogenous genes in injected embryos. Our results establish that optimally solubilized, in vitro assembled fluorescent Cas9-sgRNA RNPs provide a reproducible reagent for direct and scalable loss-of-function studies and applications beyond zebrafish experiments that require maximal DNA cutting efficiency in vivo. © 2016. Published by The Company of Biologists Ltd.
Translational co-regulation of a ligand and inhibitor by a conserved RNA element
Zaucker, Andreas; Nagorska, Agnieszka; Kumari, Pooja; Hecker, Nikolai; Wang, Yin; Huang, Sizhou; Cooper, Ledean; Sivashanmugam, Lavanya; VijayKumar, Shruthi; Brosens, Jan; Gorodkin, Jan
2018-01-01
Abstract In many organisms, transcriptional and post-transcriptional regulation of components of pathways or processes has been reported. However, to date, there are few reports of translational co-regulation of multiple components of a developmental signaling pathway. Here, we show that an RNA element which we previously identified as a dorsal localization element (DLE) in the 3′UTR of zebrafish nodal-related1/squint (ndr1/sqt) ligand mRNA, is shared by the related ligand nodal-related2/cyclops (ndr2/cyc) and the nodal inhibitors, lefty1 (lft1) and lefty2 mRNAs. We investigated the activity of the DLEs through functional assays in live zebrafish embryos. The lft1 DLE localizes fluorescently labeled RNA similarly to the ndr1/sqt DLE. Similar to the ndr1/sqt 3′UTR, the lft1 and lft2 3′UTRs are bound by the RNA-binding protein (RBP) and translational repressor, Y-box binding protein 1 (Ybx1), whereas deletions in the DLE abolish binding to Ybx1. Analysis of zebrafish ybx1 mutants shows that Ybx1 represses lefty1 translation in embryos. CRISPR/Cas9-mediated inactivation of human YBX1 also results in human NODAL translational de-repression, suggesting broader conservation of the DLE RNA element/Ybx1 RBP module in regulation of Nodal signaling. Our findings demonstrate translational co-regulation of components of a signaling pathway by an RNA element conserved in both sequence and structure and an RBP, revealing a ‘translational regulon’. PMID:29059375
Gao, Feng; Simon, Anne E.
2016-01-01
Programmed -1 ribosomal frameshifting (-1 PRF) is used by many positive-strand RNA viruses for translation of required products. Despite extensive studies, it remains unresolved how cis-elements just downstream of the recoding site promote a precise level of frameshifting. The Umbravirus Pea enation mosaic virus RNA2 expresses its RNA polymerase by -1 PRF of the 5′-proximal ORF (p33). Three hairpins located in the vicinity of the recoding site are phylogenetically conserved among Umbraviruses. The central Recoding Stimulatory Element (RSE), located downstream of the p33 termination codon, is a large hairpin with two asymmetric internal loops. Mutational analyses revealed that sequences throughout the RSE and the RSE lower stem (LS) structure are important for frameshifting. SHAPE probing of mutants indicated the presence of higher order structure, and sequences in the LS may also adapt an alternative conformation. Long-distance pairing between the RSE and a 3′ terminal hairpin was less critical when the LS structure was stabilized. A basal level of frameshifting occurring in the absence of the RSE increases to 72% of wild-type when a hairpin upstream of the slippery site is also deleted. These results suggest that suppression of frameshifting may be needed in the absence of an active RSE conformation. PMID:26578603
Kawano, Mitsuoki; Oshima, Taku; Kasai, Hiroaki; Mori, Hirotada
2002-07-01
Genome sequence analyses of Escherichia coli K-12 revealed four copies of long repetitive elements. These sequences are designated as long direct repeat (LDR) sequences. Three of the repeats (LDR-A, -B, -C), each approximately 500 bp in length, are located as tandem repeats at 27.4 min on the genetic map. Another copy (LDR-D), 450 bp in length and nearly identical to LDR-A, -B and -C, is located at 79.7 min, a position that is directly opposite the position of LDR-A, -B and -C. In this study, we demonstrate that LDR-D encodes a 35-amino-acid peptide, LdrD, the overexpression of which causes rapid cell killing and nucleoid condensation of the host cell. Northern blot and primer extension analysis showed constitutive transcription of a stable mRNA (approximately 370 nucleotides) encoding LdrD and an unstable cis-encoded antisense RNA (approximately 60 nucleotides), which functions as a trans-acting regulator of ldrD translation. We propose that LDR encodes a toxin-antitoxin module. LDR-homologous sequences are not pre-sent on any known plasmids but are conserved in Salmonella and other enterobacterial species.
Reference genome sequence of the model plant Setaria
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bennetzen, Jeffrey L; Schmutz, Jeremy; Wang, Hao
We generated a high-quality reference genome sequence for foxtail millet (Setaria italica). The ~400-Mb assembly covers ~80% of the genome and >95% of the gene space. The assembly was anchored to a 992-locus genetic map and was annotated by comparison with >1.3 million expressed sequence tag reads. We produced more than 580 million RNA-Seq reads to facilitate expression analyses. We also sequenced Setaria viridis, the ancestral wild relative of S. italica, and identified regions of differential single-nucleotide polymorphism density, distribution of transposable elements, small RNA content, chromosomal rearrangement and segregation distortion. The genus Setaria includes natural and cultivated species thatmore » demonstrate a wide capacity for adaptation. The genetic basis of this adaptation was investigated by comparing five sequenced grass genomes. We also used the diploid Setaria genome to evaluate the ongoing genome assembly of a related polyploid, switchgrass (Panicum virgatum).« less
Reference genome sequence of the model plant Setaria.
Bennetzen, Jeffrey L; Schmutz, Jeremy; Wang, Hao; Percifield, Ryan; Hawkins, Jennifer; Pontaroli, Ana C; Estep, Matt; Feng, Liang; Vaughn, Justin N; Grimwood, Jane; Jenkins, Jerry; Barry, Kerrie; Lindquist, Erika; Hellsten, Uffe; Deshpande, Shweta; Wang, Xuewen; Wu, Xiaomei; Mitros, Therese; Triplett, Jimmy; Yang, Xiaohan; Ye, Chu-Yu; Mauro-Herrera, Margarita; Wang, Lin; Li, Pinghua; Sharma, Manoj; Sharma, Rita; Ronald, Pamela C; Panaud, Olivier; Kellogg, Elizabeth A; Brutnell, Thomas P; Doust, Andrew N; Tuskan, Gerald A; Rokhsar, Daniel; Devos, Katrien M
2012-05-13
We generated a high-quality reference genome sequence for foxtail millet (Setaria italica). The ∼400-Mb assembly covers ∼80% of the genome and >95% of the gene space. The assembly was anchored to a 992-locus genetic map and was annotated by comparison with >1.3 million expressed sequence tag reads. We produced more than 580 million RNA-Seq reads to facilitate expression analyses. We also sequenced Setaria viridis, the ancestral wild relative of S. italica, and identified regions of differential single-nucleotide polymorphism density, distribution of transposable elements, small RNA content, chromosomal rearrangement and segregation distortion. The genus Setaria includes natural and cultivated species that demonstrate a wide capacity for adaptation. The genetic basis of this adaptation was investigated by comparing five sequenced grass genomes. We also used the diploid Setaria genome to evaluate the ongoing genome assembly of a related polyploid, switchgrass (Panicum virgatum).
Hall-Pogar, Tyra; Liang, Songchun; Hague, Lisa K.; Lutz, Carol S.
2007-01-01
Two cyclooxygenase (COX) enzymes, COX-1 and COX-2, are present in human cells. While COX-1 is constitutively expressed, COX-2 is inducible and up-regulated in response to many signals. Since increased transcriptional activity accounts for only part of COX-2 up-regulation, we chose to explore other RNA processing mechanisms in the regulation of this gene. Previously, we showed that COX-2 is regulated by alternative polyadenylation, and that the COX-2 proximal polyadenylation signal contains auxiliary upstream sequence elements (USEs) that are very important in efficient polyadenylation. To explore trans-acting protein factors interacting with these cis-acting RNA elements, we performed pull-down assays with HeLa nuclear extract and biotinylated RNA oligonucleotides representing COX-2 USEs. We identified PSF, p54nrb, PTB, and U1A as proteins specifically bound to the COX-2 USEs. We further explored their participation in polyadenylation using MS2 phage coat protein-MS2 RNA binding site tethering assays, and found that tethering any of these four proteins to the COX-2 USE mutant RNA can compensate for these cis-acting elements. Finally, we suggest that these proteins (p54nrb, PTB, PSF, and U1A) may interact as a complex since immunoprecipitations of the transfected MS2 fusion proteins coprecipitate the other proteins. PMID:17507659
Seif, Elias; Niu, Meijuan; Kleiman, Lawrence
2013-01-01
The 5′ untranslated region (5′ UTR) of HIV-1 genomic RNA (gRNA) includes structural elements that regulate reverse transcription, transcription, translation, tRNALys3 annealing to the gRNA, and gRNA dimerization and packaging into viruses. It has been reported that gRNA dimerization and packaging are regulated by changes in the conformation of the 5′-UTR RNA. In this study, we show that annealing of tRNALys3 or a DNA oligomer complementary to sequences within the primer binding site (PBS) loop of the 5′ UTR enhances its dimerization in vitro. Structural analysis of the 5′-UTR RNA using selective 2′-hydroxyl acylation analyzed by primer extension (SHAPE) shows that the annealing promotes a conformational change of the 5′ UTR that has been previously reported to favor gRNA dimerization and packaging into virus. The model predicted by SHAPE analysis is supported by antisense experiments designed to test which annealed sequences will promote or inhibit gRNA dimerization. Based on reports showing that the gRNA dimerization favors its incorporation into viruses, we tested the ability of a mutant gRNA unable to anneal to tRNALys3 to be incorporated into virions. We found a ∼60% decrease in mutant gRNA packaging compared with wild-type gRNA. Together, these data further support a model for viral assembly in which the initial annealing of tRNALys3 to gRNA is cytoplasmic, which in turn aids in the promotion of gRNA dimerization and its incorporation into virions. PMID:23960173
Wang, Pengfei; Wang, Yingfang; Duan, Guangcai; Xue, Zerun; Wang, Linlin; Guo, Xiangjiao; Yang, Haiyan; Xi, Yuanlin
2015-04-01
This study was aimed to explore the features of clustered regularly interspaced short palindromic repeats (CRISPR) structures in Shigella by using bioinformatics. We used bioinformatics methods, including BLAST, alignment and RNA structure prediction, to analyze the CRISPR structures of Shigella genomes. The results showed that the CRISPRs existed in the four groups of Shigella, and the flanking sequences of upstream CRISPRs could be classified into the same group with those of the downstream. We also found some relatively conserved palindromic motifs in the leader sequences. Repeat sequences had the same group with corresponding flanking sequences, and could be classified into two different types by their RNA secondary structures, which contain "stem" and "ring". Some spacers were found to homologize with part sequences of plasmids or phages. The study indicated that there were correlations between repeat sequences and flanking sequences, and the repeats might act as a kind of recognition mechanism to mediate the interaction between foreign genetic elements and Cas proteins.
High-throughput detection of RNA processing in bacteria.
Gill, Erin E; Chan, Luisa S; Winsor, Geoffrey L; Dobson, Neil; Lo, Raymond; Ho Sui, Shannan J; Dhillon, Bhavjinder K; Taylor, Patrick K; Shrestha, Raunak; Spencer, Cory; Hancock, Robert E W; Unrau, Peter J; Brinkman, Fiona S L
2018-03-27
Understanding the RNA processing of an organism's transcriptome is an essential but challenging step in understanding its biology. Here we investigate with unprecedented detail the transcriptome of Pseudomonas aeruginosa PAO1, a medically important and innately multi-drug resistant bacterium. We systematically mapped RNA cleavage and dephosphorylation sites that result in 5'-monophosphate terminated RNA (pRNA) using monophosphate RNA-Seq (pRNA-Seq). Transcriptional start sites (TSS) were also mapped using differential RNA-Seq (dRNA-Seq) and both datasets were compared to conventional RNA-Seq performed in a variety of growth conditions. The pRNA-Seq library revealed known tRNA, rRNA and transfer-messenger RNA (tmRNA) processing sites, together with previously uncharacterized RNA cleavage events that were found disproportionately near the 5' ends of transcripts associated with basic bacterial functions such as oxidative phosphorylation and purine metabolism. The majority (97%) of the processed mRNAs were cleaved at precise codon positions within defined sequence motifs indicative of distinct endonucleolytic activities. The most abundant of these motifs corresponded closely to an E. coli RNase E site previously established in vitro. Using the dRNA-Seq library, we performed an operon analysis and predicted 3159 potential TSS. A correlation analysis uncovered 105 antiparallel pairs of TSS that were separated by 18 bp from each other and were centered on single palindromic TAT(A/T)ATA motifs (likely - 10 promoter elements), suggesting that, consistent with previous in vitro experimentation, these sites can initiate transcription bi-directionally and may thus provide a novel form of transcriptional regulation. TSS and RNA-Seq analysis allowed us to confirm expression of small non-coding RNAs (ncRNAs), many of which are differentially expressed in swarming and biofilm formation conditions. This study uses pRNA-Seq, a method that provides a genome-wide survey of RNA processing, to study the bacterium Pseudomonas aeruginosa and discover extensive transcript processing not previously appreciated. We have also gained novel insight into RNA maturation and turnover as well as a potential novel form of transcription regulation. NOTE: All sequence data has been submitted to the NCBI sequence read archive. Accession numbers are as follows: [NCBI sequence read archive: SRX156386, SRX157659, SRX157660, SRX157661, SRX157683 and SRX158075]. The sequence data is viewable using Jbrowse on www.pseudomonas.com .
A subset of conserved mammalian long non-coding RNAs are fossils of ancestral protein-coding genes.
Hezroni, Hadas; Ben-Tov Perry, Rotem; Meir, Zohar; Housman, Gali; Lubelsky, Yoav; Ulitsky, Igor
2017-08-30
Only a small portion of human long non-coding RNAs (lncRNAs) appear to be conserved outside of mammals, but the events underlying the birth of new lncRNAs in mammals remain largely unknown. One potential source is remnants of protein-coding genes that transitioned into lncRNAs. We systematically compare lncRNA and protein-coding loci across vertebrates, and estimate that up to 5% of conserved mammalian lncRNAs are derived from lost protein-coding genes. These lncRNAs have specific characteristics, such as broader expression domains, that set them apart from other lncRNAs. Fourteen lncRNAs have sequence similarity with the loci of the contemporary homologs of the lost protein-coding genes. We propose that selection acting on enhancer sequences is mostly responsible for retention of these regions. As an example of an RNA element from a protein-coding ancestor that was retained in the lncRNA, we describe in detail a short translated ORF in the JPX lncRNA that was derived from an upstream ORF in a protein-coding gene and retains some of its functionality. We estimate that ~ 55 annotated conserved human lncRNAs are derived from parts of ancestral protein-coding genes, and loss of coding potential is thus a non-negligible source of new lncRNAs. Some lncRNAs inherited regulatory elements influencing transcription and translation from their protein-coding ancestors and those elements can influence the expression breadth and functionality of these lncRNAs.
Identification of a novel box C/D snoRNA from mouse nucleolar cDNA library.
Zhou, Hui; Zhao, Jin; Yu, Chuan-He; Luo, Qing-Jun; Chen, Yue-Qin; Xiao, Yu; Qu, Liang-Hu
2004-02-18
By construction and screen of mouse nucleolar cDNA library, a novel mammalian small nucleolar RNAs (snoRNA) was identified. The novel snoRNA, 70 nt in length, displays structural features typical of C/D box snoRNA family. The snoRNA possesses an 11-nt-long rRNA antisense element and is predicted to guide the 2'-O-methylation of mouse 28S rRNA at G4043, a site unknown so far to be modified in vertebrates. The comparison of functional element of snoRNA guides among eukaryotes reveals that the novel snoRNA is a mammalian counterpart of yeast snR38 despite highly divergent sequence between them. Mouse and human snR38 and other cognates in distant vertebrates were positively detected with slight length variability. As expected, the rRNA ribose-methylation site predicted by mouse snR38 was precisely mapped by specific-primer extension assay. Furthermore, our analyses show that mouse and human snR38 gene have multiple variants and are nested in the introns of different host genes with unknown function. Thus, snR38 is a phylogenetically conserved methylation guide but exhibits different genomic organization in eukaryotes.
A new approach for annotation of transposable elements using small RNA mapping
El Baidouri, Moaine; Kim, Kyung Do; Abernathy, Brian; Arikit, Siwaret; Maumus, Florian; Panaud, Olivier; Meyers, Blake C.; Jackson, Scott A.
2015-01-01
Transposable elements (TEs) are mobile genomic DNA sequences found in most organisms. They so densely populate the genomes of many eukaryotic species that they are often the major constituents. With the rapid generation of many plant genome sequencing projects over the past few decades, there is an urgent need for improved TE annotation as a prerequisite for genome-wide studies. Analogous to the use of RNA-seq for gene annotation, we propose a new method for de novo TE annotation that uses as a guide 24 nt-siRNAs that are a part of TE silencing pathways. We use this new approach, called TASR (for Transposon Annotation using Small RNAs), for de novo annotation of TEs in Arabidopsis, rice and soybean and demonstrate that this strategy can be successfully applied for de novo TE annotation in plants. Executable PERL is available for download from: http://tasr-pipeline.sourceforge.net/ PMID:25813049
Conserved Non-Coding Sequences are Associated with Rates of mRNA Decay in Arabidopsis.
Spangler, Jacob B; Feltus, Frank Alex
2013-01-01
Steady-state mRNA levels are tightly regulated through a combination of transcriptional and post-transcriptional control mechanisms. The discovery of cis-acting DNA elements that encode these control mechanisms is of high importance. We have investigated the influence of conserved non-coding sequences (CNSs), DNA patterns retained after an ancient whole genome duplication event, on the breadth of gene expression and the rates of mRNA decay in Arabidopsis thaliana. The absence of CNSs near α duplicate genes was associated with a decrease in breadth of gene expression and slower mRNA decay rates while the presence CNSs near α duplicates was associated with an increase in breadth of gene expression and faster mRNA decay rates. The observed difference in mRNA decay rate was fastest in genes with CNSs in both non-transcribed and transcribed regions, albeit through an unknown mechanism. This study supports the notion that some Arabidopsis CNSs regulate the steady-state mRNA levels through post-transcriptional control mechanisms and that CNSs also play a role in controlling the breadth of gene expression.
Conserved Non-Coding Sequences are Associated with Rates of mRNA Decay in Arabidopsis
Spangler, Jacob B.; Feltus, Frank Alex
2013-01-01
Steady-state mRNA levels are tightly regulated through a combination of transcriptional and post-transcriptional control mechanisms. The discovery of cis-acting DNA elements that encode these control mechanisms is of high importance. We have investigated the influence of conserved non-coding sequences (CNSs), DNA patterns retained after an ancient whole genome duplication event, on the breadth of gene expression and the rates of mRNA decay in Arabidopsis thaliana. The absence of CNSs near α duplicate genes was associated with a decrease in breadth of gene expression and slower mRNA decay rates while the presence CNSs near α duplicates was associated with an increase in breadth of gene expression and faster mRNA decay rates. The observed difference in mRNA decay rate was fastest in genes with CNSs in both non-transcribed and transcribed regions, albeit through an unknown mechanism. This study supports the notion that some Arabidopsis CNSs regulate the steady-state mRNA levels through post-transcriptional control mechanisms and that CNSs also play a role in controlling the breadth of gene expression. PMID:23675377
Down-Regulation of Gene Expression by RNA-Induced Gene Silencing
NASA Astrophysics Data System (ADS)
Travella, Silvia; Keller, Beat
Down-regulation of endogenous genes via post-transcriptional gene silencing (PTGS) is a key to the characterization of gene function in plants. Many RNA-based silencing mechanisms such as post-transcriptional gene silencing, co-suppression, quelling, and RNA interference (RNAi) have been discovered among species of different kingdoms (plants, fungi, and animals). One of the most interesting discoveries was RNAi, a sequence-specific gene-silencing mechanism initiated by the introduction of double-stranded RNA (dsRNA), homologous in sequence to the silenced gene, which triggers degradation of mRNA. Infection of plants with modified viruses can also induce RNA silencing and is referred to as virus-induced gene silencing (VIGS). In contrast to insertional mutagenesis, these emerging new reverse genetic approaches represent a powerful tool for exploring gene function and for manipulating gene expression experimentally in cereal species such as barley and wheat. We examined how RNAi and VIGS have been used to assess gene function in barley and wheat, including molecular mechanisms involved in the process and available methodological elements, such as vectors, inoculation procedures, and analysis of silenced phenotypes.
Yukawa, Yasushi; Akama, Kazuhito; Noguchi, Kanta; Komiya, Masaaki; Sugiura, Masahiro
2013-01-10
Nuclear tRNA genes are transcribed by RNA polymerase III. The A- and B-boxes located within the transcribed regions are essential promoter elements for nuclear tRNA gene transcription. The Arabidopsis genome contains ten annotated genes encoding identical tRNA(Lys)(UUU) molecules, which are scattered on the five chromosomes. In this study, we prepared ten tDNA constructs including each of the tRNA(Lys)(UUU) coding sequences with their individual 5' and 3' flanking sequences, and assayed tRNA expression using an in vitro RNA polymerase III-dependent transcription system. Transcription levels differed significantly among the ten genes and two of the tRNA genes were transcribed at a very low level, despite possessing A- and B-boxes identical to those of the other tRNA genes. To examine whether the in vitro results were reproducible in vivo, the 5' flanking sequence of an amber suppressor tRNA gene was then replaced with those of the ten tRNA(Lys) genes. An in vivo experiment based on an amber suppressor tRNA that mediates suppression of a premature amber codon in a β-glucuronidase (GUS) reporter gene in plant tissues generated nearly identical results to those obtained in vitro. Analysis of mutated versions of the amber suppressor tRNA gene, which contained base substitutions around the transcription start site (TSS), showed that the context around the transcription start sites is a crucial determinant for transcription of plant tRNA(Lys)(UUU) both in vitro and in vivo. The above transcription regulation by context around TSS differed between tRNA genes and other Pol III-dependent genes. Copyright © 2012 Elsevier B.V. All rights reserved.
Alu elements shape the primate transcriptome by cis-regulation of RNA editing
2014-01-01
Background RNA editing by adenosine to inosine deamination is a widespread phenomenon, particularly frequent in the human transcriptome, largely due to the presence of inverted Alu repeats and their ability to form double-stranded structures – a requisite for ADAR editing. While several hundred thousand editing sites have been identified within these primate-specific repeats, the function of Alu-editing has yet to be elucidated. Results We show that inverted Alu repeats, expressed in the primate brain, can induce site-selective editing in cis on sites located several hundred nucleotides from the Alu elements. Furthermore, a computational analysis, based on available RNA-seq data, finds that site-selective editing occurs significantly closer to edited Alu elements than expected. These targets are poorly edited upon deletion of the editing inducers, as well as in homologous transcripts from organisms lacking Alus. Sequences surrounding sites near edited Alus in UTRs, have been subjected to a lesser extent of evolutionary selection than those far from edited Alus, indicating that their editing generally depends on cis-acting Alus. Interestingly, we find an enrichment of primate-specific editing within encoded sequence or the UTRs of zinc finger-containing transcription factors. Conclusions We propose a model whereby primate-specific editing is induced by adjacent Alu elements that function as recruitment elements for the ADAR editing enzymes. The enrichment of site-selective editing with potentially functional consequences on the expression of transcription factors indicates that editing contributes more profoundly to the transcriptomic regulation and repertoire in primates than previously thought. PMID:24485196
Salton, S R; Fischberg, D J; Dong, K W
1991-05-01
Nerve growth factor (NGF) plays a critical role in the development and survival of neurons in the peripheral nervous system. Following treatment with NGF but not epidermal growth factor, rat pheochromocytoma (PC12) cells undergo neural differentiation. We have cloned a nervous system-specific mRNA, NGF33.1, that is rapidly and relatively selectively induced by treatment of PC12 cells with NGF and basic fibroblast growth factor in comparison with epidermal growth factor. Analysis of the nucleic acid and predicted amino acid sequences of the NGF33.1 cDNA clone suggested that this clone corresponded to the NGF-inducible mRNA called VGF (A. Levi, J. D. Eldridge, and B. M. Paterson, Science 229:393-395, 1985; R. Possenti, J. D. Eldridge, B. M. Paterson, A. Grasso, and A. Levi, EMBO J. 8:2217-2223, 1989). We have used the NGF33.1 cDNA clone to isolate and characterize the VGF gene, and in this paper we report the complete sequence of the VGF gene, including 853 bases of 5' flank revealed TATAA and CCAAT elements, several GC boxes, and a consensus cyclic AMP response element-binding protein binding site. The VGF promoter contains sequences homologous to other NGF-inducible, neuronal promoters. We further show that VGF mRNA is induced in PC12 cells to a greater extent by depolarization and by phorbol-12-myristate-13-acetate treatment than by 8-bromo-cyclic AMP treatment. By Northern (RNA) and RNase protection analysis, VGF mRNA is detectable in embryonic and postnatal central and peripheral nervous tissues but not in a number of nonneural tissues. In the cascade of events which ultimately leads to the neural differentiation of NGF-treated PC12 cells, the VGF gene encodes the most rapidly and selectively regulated, nervous-system specific mRNA yet identified.
Kapusta, Aurélie; Zhuo, Xiaoyu; Ramsay, LeeAnn; Bourque, Guillaume; Yandell, Mark; Feschotte, Cédric
2013-01-01
Advances in vertebrate genomics have uncovered thousands of loci encoding long noncoding RNAs (lncRNAs). While progress has been made in elucidating the regulatory functions of lncRNAs, little is known about their origins and evolution. Here we explore the contribution of transposable elements (TEs) to the makeup and regulation of lncRNAs in human, mouse, and zebrafish. Surprisingly, TEs occur in more than two thirds of mature lncRNA transcripts and account for a substantial portion of total lncRNA sequence (∼30% in human), whereas they seldom occur in protein-coding transcripts. While TEs contribute less to lncRNA exons than expected, several TE families are strongly enriched in lncRNAs. There is also substantial interspecific variation in the coverage and types of TEs embedded in lncRNAs, partially reflecting differences in the TE landscapes of the genomes surveyed. In human, TE sequences in lncRNAs evolve under greater evolutionary constraint than their non–TE sequences, than their intronic TEs, or than random DNA. Consistent with functional constraint, we found that TEs contribute signals essential for the biogenesis of many lncRNAs, including ∼30,000 unique sites for transcription initiation, splicing, or polyadenylation in human. In addition, we identified ∼35,000 TEs marked as open chromatin located within 10 kb upstream of lncRNA genes. The density of these marks in one cell type correlate with elevated expression of the downstream lncRNA in the same cell type, suggesting that these TEs contribute to cis-regulation. These global trends are recapitulated in several lncRNAs with established functions. Finally a subset of TEs embedded in lncRNAs are subject to RNA editing and predicted to form secondary structures likely important for function. In conclusion, TEs are nearly ubiquitous in lncRNAs and have played an important role in the lineage-specific diversification of vertebrate lncRNA repertoires. PMID:23637635
U6 small nuclear RNA is transcribed by RNA polymerase III.
Kunkel, G R; Maser, R L; Calvet, J P; Pederson, T
1986-01-01
A DNA fragment homologous to U6 small nuclear RNA was isolated from a human genomic library and sequenced. The immediate 5'-flanking region of the U6 DNA clone had significant homology with a potential mouse U6 gene, including a "TATA box" at a position 26-29 nucleotides upstream from the transcription start site. Although this sequence element is characteristic of RNA polymerase II promoters, the U6 gene also contained a polymerase III "box A" intragenic control region and a typical run of five thymines at the 3' terminus (noncoding strand). The human U6 DNA clone was accurately transcribed in a HeLa cell S100 extract lacking polymerase II activity. U6 RNA transcription in the S100 extract was resistant to alpha-amanitin at 1 microgram/ml but was completely inhibited at 200 micrograms/ml. A comparison of fingerprints of the in vitro transcript and of U6 RNA synthesized in vivo revealed sequence congruence. U6 RNA synthesis in isolated HeLa cell nuclei also displayed low sensitivity to alpha-amanitin, in contrast to U1 and U2 RNA transcription, which was inhibited greater than 90% at 1 microgram/ml. In addition, U6 RNA synthesized in isolated nuclei was efficiently immunoprecipitated by an antibody against the La antigen, a protein known to bind most other RNA polymerase III transcripts. These results establish that, in contrast to the polymerase II-directed transcription of mammalian genes for U1-U5 small nuclear RNAs, human U6 RNA is transcribed by RNA polymerase III. Images PMID:3464970
Park, Eonyoung; Maquat, Lynne E
2013-01-01
Staufen1 (STAU1)-mediated mRNA decay (SMD) is an mRNA degradation process in mammalian cells that is mediated by the binding of STAU1 to a STAU1-binding site (SBS) within the 3'-untranslated region (3'-UTR) of target mRNAs. During SMD, STAU1, a double-stranded (ds) RNA-binding protein, recognizes dsRNA structures formed either by intramolecular base pairing of 3'-UTR sequences or by intermolecular base pairing of 3'-UTR sequences with a long-noncoding RNA (lncRNA) via partially complementary Alu elements. Recently, STAU2, a paralog of STAU1, has also been reported to mediate SMD. Both STAU1 and STAU2 interact directly with the ATP-dependent RNA helicase UPF1, a key SMD factor, enhancing its helicase activity to promote effective SMD. Moreover, STAU1 and STAU2 form homodimeric and heterodimeric interactions via domain-swapping. Because both SMD and the mechanistically related nonsense-mediated mRNA decay (NMD) employ UPF1; SMD and NMD are competitive pathways. Competition contributes to cellular differentiation processes, such as myogenesis and adipogenesis, placing SMD at the heart of various physiologically important mechanisms. Copyright © 2013 John Wiley & Sons, Ltd.
McKinnon, R D; Danielson, P; Brow, M A; Bloom, F E; Sutcliffe, J G
1987-01-01
We examined the level of expression of small RNA transcripts hybridizing to a rodent repetitive DNA element, the identifier (ID) sequence, in a variety of cell types in vivo and in cultured mammalian cells. A 160-nucleotide (160n) cytoplasmic poly(A)+ RNA (BC1) appeared in late embryonic and early postnatal rat brain development, was enriched in the cerebral cortex, and appeared to be restricted to neural tissue and the anterior pituitary gland. A 110n RNA (BC2) was specifically enriched in brain, especially the postnatal cortex, but was detectable at low levels in peripheral tissues. A third, related 75n poly(A)- RNA (T3) was found in rat brain and at lower levels in peripheral tissues but was very abundant in the testes. The BC RNAs were found in a variety of rat cell lines, and their level of expression was dependent upon cell culture conditions. A rat ID probe detected BC-like RNAs in mouse brain but not liver and detected a 200n RNA in monkey brain but not liver at lower hybridization stringencies. These RNAs were expressed by mouse and primate cell lines. Thus, tissue-specific expression of small ID-sequence-related transcripts is conserved among mammals, but the tight regulation found in vivo is lost by cells in culture. Images PMID:2439903
Fu, X Y; Colgan, J D; Manley, J L
1988-01-01
We have determined the effects of a number of mutations in the small-t antigen mRNA intron on the alternative splicing pattern of the simian virus 40 early transcript. Expansion of the distance separating the small-t pre-mRNA lariat branch point and the shared large T-small t 3' splice site from 18 to 29 nucleotides (nt) resulted in a relative enhancement of small-t splicing in vivo. This finding, coupled with the observation that large-T pre-RNA splicing in vitro was not affected by this expansion, suggests that small-t splicing is specifically constrained by a short branch point-3' splice site distance. Similarly, the distance separating the 5' splice site and branch point (48 nt) was found to be at or near a minimum for small-t splicing, because deletions in this region as small as 2 nt dramatically reduced the ratio of small-t to large-T mRNA that accumulated in transfected cells. Finally, a specific sequence within the small-t intron, encompassing the upstream branch sites used in large-T splicing, was found to be an important element in the cell-specific pattern of early alternative splicing. Substitutions within this region reduced the ratio of small-t to large-T mRNA produced in HeLa cells but had only minor effects in human 293 cells. Images PMID:2851720
Kishor, Aparna; Tandukar, Bishal; Ly, Yann V.; Toth, Eric A.; Suarez, Yvelisse; Brewer, Gary
2013-01-01
The AU-rich elements (AREs) encoded within many mRNA 3′ untranslated regions (3′UTRs) are targets for factors that control transcript longevity and translational efficiency. Hsp70, best known as a protein chaperone with well-defined peptide-refolding properties, is known to interact with ARE-like RNA substrates in vitro. Here, we show that cofactor-free preparations of Hsp70 form direct, high-affinity complexes with ARE substrates based on specific recognition of U-rich sequences by both the ATP- and peptide-binding domains. Suppressing Hsp70 in HeLa cells destabilized an ARE reporter mRNA, indicating a novel ARE-directed mRNA-stabilizing role for this protein. Hsp70 also bound and stabilized endogenous ARE-containing mRNAs encoding vascular endothelial growth factor (VEGF) and Cox-2, which involved a mechanism that was unaffected by an inhibitor of its protein chaperone function. Hsp70 recognition and stabilization of VEGF mRNA was mediated by an ARE-like sequence in the proximal 3′UTR. Finally, stabilization of VEGF mRNA coincided with the accumulation of Hsp70 protein in HL60 promyelocytic leukemia cells recovering from acute thermal stress. We propose that the binding and stabilization of selected ARE-containing mRNAs may contribute to the cytoprotective effects of Hsp70 following cellular stress but may also provide a novel mechanism linking constitutively elevated Hsp70 expression to the development of aggressive neoplastic phenotypes. PMID:23109422
Nuclear Retention Elements of U3 Small Nucleolar RNA
Speckmann, Wayne; Narayanan, Aarthi; Terns, Rebecca; Terns, Michael P.
1999-01-01
The processing and methylation of precursor rRNA is mediated by the box C/D small nucleolar RNAs (snoRNAs). These snoRNAs differ from most cellular RNAs in that they are not exported to the cytoplasm. Instead, these RNAs are actively retained in the nucleus where they assemble with proteins into mature small nucleolar ribonucleoprotein particles and are targeted to their intranuclear site of action, the nucleolus. In this study, we have identified the cis-acting sequences responsible for the nuclear retention of U3 box C/D snoRNA by analyzing the nucleocytoplasmic distributions of an extensive panel of U3 RNA variants after injection of the RNAs into Xenopus oocyte nuclei. Our data indicate the importance of two conserved sequence motifs in retaining U3 RNA in the nucleus. The first motif is comprised of the conserved box C′ and box D sequences that characterize the box C/D family. The second motif contains conserved box sequences B and C. Either motif is sufficient for nuclear retention, but disruption of both motifs leads to mislocalization of the RNAs to the cytoplasm. Variant RNAs that are not retained also lack 5′ cap hypermethylation and fail to associate with fibrillarin. Furthermore, our results indicate that nuclear retention of U3 RNA does not simply reflect its nucleolar localization. A fragment of U3 containing the box B/C motif is not localized to nucleoli but retained in coiled bodies. Thus, nuclear retention and nucleolar localization are distinct processes with differing sequence requirements. PMID:10567566
Short intronic repeat sequences facilitate circular RNA production.
Liang, Dongming; Wilusz, Jeremy E
2014-10-15
Recent deep sequencing studies have revealed thousands of circular noncoding RNAs generated from protein-coding genes. These RNAs are produced when the precursor messenger RNA (pre-mRNA) splicing machinery "backsplices" and covalently joins, for example, the two ends of a single exon. However, the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. Using extensive mutagenesis of expression plasmids, we show that miniature introns containing the splice sites along with short (∼ 30- to 40-nucleotide) inverted repeats, such as Alu elements, are sufficient to allow the intervening exons to circularize in cells. The intronic repeats must base-pair to one another, thereby bringing the splice sites into close proximity to each other. More than simple thermodynamics is clearly at play, however, as not all repeats support circularization, and increasing the stability of the hairpin between the repeats can sometimes inhibit circular RNA biogenesis. The intronic repeats and exonic sequences must collaborate with one another, and a functional 3' end processing signal is required, suggesting that circularization may occur post-transcriptionally. These results suggest detailed and generalizable models that explain how the splicing machinery determines whether to produce a circular noncoding RNA or a linear mRNA. © 2014 Liang and Wilusz; Published by Cold Spring Harbor Laboratory Press.
RNA-Seq Analysis to Measure the Expression of SINE Retroelements.
Román, Ángel Carlos; Morales-Hernández, Antonio; Fernández-Salguero, Pedro M
2016-01-01
The intrinsic features of retroelements, like their repetitive nature and disseminated presence in their host genomes, demand the use of advanced methodologies for their bioinformatic and functional study. The short length of SINE (short interspersed elements) retrotransposons makes such analyses even more complex. Next-generation sequencing (NGS) technologies are currently one of the most widely used tools to characterize the whole repertoire of gene expression in a specific tissue. In this chapter, we will review the molecular and computational methods needed to perform NGS analyses on SINE elements. We will also describe new methods of potential interest for researchers studying repetitive elements. We intend to outline the general ideas behind the computational analyses of NGS data obtained from SINE elements, and to stimulate other scientists to expand our current knowledge on SINE biology using RNA-seq and other NGS tools.
Tissue- and Time-Specific Expression of Otherwise Identical tRNA Genes
Adir, Idan; Dahan, Orna; Broday, Limor; Pilpel, Yitzhak; Rechavi, Oded
2016-01-01
Codon usage bias affects protein translation because tRNAs that recognize synonymous codons differ in their abundance. Although the current dogma states that tRNA expression is exclusively regulated by intrinsic control elements (A- and B-box sequences), we revealed, using a reporter that monitors the levels of individual tRNA genes in Caenorhabditis elegans, that eight tryptophan tRNA genes, 100% identical in sequence, are expressed in different tissues and change their expression dynamically. Furthermore, the expression levels of the sup-7 tRNA gene at day 6 were found to predict the animal’s lifespan. We discovered that the expression of tRNAs that reside within introns of protein-coding genes is affected by the host gene’s promoter. Pairing between specific Pol II genes and the tRNAs that are contained in their introns is most likely adaptive, since a genome-wide analysis revealed that the presence of specific intronic tRNAs within specific orthologous genes is conserved across Caenorhabditis species. PMID:27560950
Staufen1 senses overall transcript secondary structure to regulate translation
Ricci, Emiliano P; Kucukural, Alper; Cenik, Can; Mercier, Blandine C; Singh, Guramrit; Heyer, Erin E; Ashar-Patel, Ami; Peng, Lingtao; Moore, Melissa J
2015-01-01
Human Staufen1 (Stau1) is a double-stranded RNA (dsRNA)-binding protein implicated in multiple post-transcriptional gene-regulatory processes. Here we combined RNA immunoprecipitation in tandem (RIPiT) with RNase footprinting, formaldehyde cross-linking, sonication-mediated RNA fragmentation and deep sequencing to map Staufen1-binding sites transcriptome wide. We find that Stau1 binds complex secondary structures containing multiple short helices, many of which are formed by inverted Alu elements in annotated 3′ untranslated regions (UTRs) or in ‘strongly distal’ 3′ UTRs. Stau1 also interacts with actively translating ribosomes and with mRNA coding sequences (CDSs) and 3′ UTRs in proportion to their GC content and propensity to form internal secondary structure. On mRNAs with high CDS GC content, higher Stau1 levels lead to greater ribosome densities, thus suggesting a general role for Stau1 in modulating translation elongation through structured CDS regions. Our results also indicate that Stau1 regulates translation of transcription-regulatory proteins. PMID:24336223
Angeloni, Debora; ter Elst, Arja; Wei, Ming Hui; van der Veen, Anneke Y; Braga, Eleonora A; Klimov, Eugene A; Timmer, Tineke; Korobeinikova, Luba; Lerman, Michael I; Buys, Charles H C M
2006-07-01
Homozygous deletions or loss of heterozygosity (LOH) at human chromosome band 3p12 are consistent features of lung and other malignancies, suggesting the presence of a tumor suppressor gene(s) (TSG) at this location. Only one gene has been cloned thus far from the overlapping region deleted in lung and breast cancer cell lines U2020, NCI H2198, and HCC38. It is DUTT1 (Deleted in U Twenty Twenty), also known as ROBO1, FLJ21882, and SAX3, according to HUGO. DUTT1, the human ortholog of the fly gene ROBO, has homology with NCAM proteins. Extensive analyses of DUTT1 in lung cancer have not revealed any mutations, suggesting that another gene(s) at this location could be of importance in lung cancer initiation and progression. Here, we report the discovery of a new, small, homozygous deletion in the small cell lung cancer (SCLC) cell line GLC20, nested in the overlapping, critical region. The deletion was delineated using several polymorphic markers and three overlapping P1 phage clones. Fiber-FISH experiments revealed the deletion was approximately 130 kb. Comparative genomic sequence analysis uncovered short sequence elements highly conserved among mammalian genomes and the chicken genome. The discovery of two EST clusters within the deleted region led to the isolation of two noncoding RNA (ncRNA) genes. These were subsequently found differentially expressed in various tumors when compared to their normal tissues. The ncRNA and other highly conserved sequence elements in the deleted region may represent miRNA targets of importance in cancer initiation or progression. Published 2006 Wiley-Liss, Inc.
Munroe, Stephen H.; Morales, Christopher H.; Duyck, Tessa H.; Waters, Paul D.
2015-01-01
The α-thyroid hormone receptor gene (TRα) codes for two functionally distinct proteins: TRα1, the α-thyroid hormone receptor; and TRα2, a non-hormone-binding variant. The final exon of TRα2 mRNA overlaps the 3’ end of Rev-erbα mRNA, which encodes another nuclear receptor on the opposite strand of DNA. To understand the evolution of this antisense overlap, we sequenced these genes and mRNAs in the platypus Orthorhynchus anatinus. Despite its strong homology with other mammals, the platypus TRα/Rev-erbα locus lacks elements essential for expression of TRα2. Comparative analysis suggests that alternative splicing of TRα2 mRNA expression evolved in a stepwise fashion before the divergence of eutherian and marsupial mammals. A short G-rich element (G30) located downstream of the alternative 3’splice site of TRα2 mRNA and antisense to the 3’UTR of Rev-erbα plays an important role in regulating TRα2 splicing. G30 is tightly conserved in eutherian mammals, but is absent in marsupials and monotremes. Systematic deletions and substitutions within G30 have dramatically different effects on TRα2 splicing, leading to either its inhibition or its enhancement. Mutations that disrupt one or more clusters of G residues enhance splicing two- to three-fold. These results suggest the G30 sequence can adopt a highly structured conformation, possibly a G-quadruplex, and that it is part of a complex splicing regulatory element which exerts both positive and negative effects on TRα2 expression. Since mutations that strongly enhance splicing in vivo have no effect on splicing in vitro, it is likely that the regulatory role of G30 is mediated through linkage of transcription and splicing. PMID:26368571
Untranslated regions of diverse plant viral RNAs vary greatly in translation enhancement efficiency
2012-01-01
Background Whole plants or plant cell cultures can serve as low cost bioreactors to produce massive amounts of a specific protein for pharmacological or industrial use. To maximize protein expression, translation of mRNA must be optimized. Many plant viral RNAs harbor extremely efficient translation enhancers. However, few of these different translation elements have been compared side-by-side. Thus, it is unclear which are the most efficient translation enhancers. Here, we compare the effects of untranslated regions (UTRs) containing translation elements from six plant viruses on translation in wheat germ extract and in monocotyledenous and dicotyledenous plant cells. Results The highest expressing uncapped mRNAs contained viral UTRs harboring Barley yellow dwarf virus (BYDV)-like cap-independent translation elements (BTEs). The BYDV BTE conferred the most efficient translation of a luciferase reporter in wheat germ extract and oat protoplasts, while uncapped mRNA containing the BTE from Tobacco necrosis virus-D translated most efficiently in tobacco cells. Capped mRNA containing the Tobacco mosaic virus omega sequence was the most efficient mRNA in tobacco cells. UTRs from Satellite tobacco necrosis virus, Tomato bushy stunt virus, and Crucifer-infecting tobamovirus (crTMV) did not stimulate translation efficiently. mRNA with the crTMV 5′ UTR was unstable in tobacco protoplasts. Conclusions BTEs confer the highest levels of translation of uncapped mRNAs in vitro and in vivo, while the capped omega sequence is most efficient in tobacco cells. These results provide a basis for understanding mechanisms of translation enhancement, and for maximizing protein synthesis in cell-free systems, transgenic plants, or in viral expression vectors. PMID:22559081
A map of human microRNA variation uncovers unexpectedly high levels of variability
2012-01-01
Background MicroRNAs (miRNAs) are key components of the gene regulatory network in many species. During the past few years, these regulatory elements have been shown to be involved in an increasing number and range of diseases. Consequently, the compilation of a comprehensive map of natural variability in a healthy population seems an obvious requirement for future research on miRNA-related pathologies. Methods Data on 14 populations from the 1000 Genomes Project were analyzed, along with new data extracted from 60 exomes of healthy individuals from a population from southern Spain, sequenced in the context of the Medical Genome Project, to derive an accurate map of miRNA variability. Results Despite the common belief that miRNAs are highly conserved elements, analysis of the sequences of the 1,152 individuals indicated that the observed level of variability is double what was expected. A total of 527 variants were found. Among these, 45 variants affected the recognition region of the corresponding miRNA and were found in 43 different miRNAs, 26 of which are known to be involved in 57 diseases. Different parts of the mature structure of the miRNA were affected to different degrees by variants, which suggests the existence of a selective pressure related to the relative functional impact of the change. Moreover, 41 variants showed a significant deviation from the Hardy-Weinberg equilibrium, which supports the existence of a selective process against some alleles. The average number of variants per individual in miRNAs was 28. Conclusions Despite an expectation that miRNAs would be highly conserved genomic elements, our study reports a level of variability comparable to that observed for coding genes. PMID:22906193
Yao, Peng; Potdar, Alka A.; Arif, Abul; Ray, Partho Sarothi; Mukhopadhyay, Rupak; Willard, Belinda; Xu, Yichi; Yan, Jun; Saidel, Gerald M.; Fox, Paul L.
2012-01-01
SUMMARY Post-transcriptional regulatory mechanisms superimpose “fine-tuning” control upon “on-off” switches characteristic of gene transcription. We have exploited computational modeling with experimental validation to resolve an anomalous relationship between mRNA expression and protein synthesis. Differential GAIT (Gamma-interferon Activated Inhibitor of Translation) complex activation repressed VEGF-A synthesis to a low, constant rate despite high, variable VEGFA mRNA expression. Dynamic model simulations indicated the presence of an unidentified, inhibitory GAIT element-interacting factor. We discovered a truncated form of glutamyl-prolyl tRNA synthetase (EPRS), the GAIT constituent that binds the 3’-UTR GAIT element in target transcripts. The truncated protein, EPRSN1, prevents binding of functional GAIT complex. EPRSN1 mRNA is generated by a remarkable polyadenylation-directed conversion of a Tyr codon in the EPRS coding sequence to a stop codon (PAY*). By low-level protection of GAIT element-bearing transcripts, EPRSN1 imposes a robust “translational trickle” of target protein expression. Genome-wide analysis shows PAY* generates multiple truncated transcripts thereby contributing to transcriptome expansion. PMID:22386318
Snedden, Donald D; Bertke, Michelle M; Vernon, Dominic; Huber, Paul W
2013-07-01
The 3' untranslated region of mRNA encoding PHAX, a phosphoprotein required for nuclear export of U-type snRNAs, contains cis-acting sequence motifs E2 and VM1 that are required for localization of RNAs to the vegetal hemisphere of Xenopus oocytes. However, we have found that PHAX mRNA is transported to the opposite, animal, hemisphere. A set of proteins that cross-link to the localization elements of vegetally localized RNAs are also cross-linked to PHAX and An1 mRNAs, demonstrating that the composition of RNP complexes that form on these localization elements is highly conserved irrespective of the final destination of the RNA. The ability of RNAs to bind this core group of proteins is correlated with localization activity. Staufen1, which binds to Vg1 and VegT mRNAs, is not associated with RNAs localized to the animal hemisphere and may determine, at least in part, the direction of RNA movement in Xenopus oocytes.
Discrimination between Closely Related Cellular Metabolites by the SAM-I Riboswitch
DOE Office of Scientific and Technical Information (OSTI.GOV)
Montange, R.; Mondragon, E; van Tyne, D
2010-01-01
The SAM-I riboswitch is a cis-acting element of genetic control found in bacterial mRNAs that specifically binds S-adenosylmethionine (SAM). We previously determined the 2.9-{angstrom} X-ray crystal structure of the effector-binding domain of this RNA element, revealing details of RNA-ligand recognition. To improve this structure, variations were made to the RNA sequence to alter lattice contacts, resulting in a 0.5-{angstrom} improvement in crystallographic resolution and allowing for a more accurate refinement of the crystallographic model. The basis for SAM specificity was addressed by a structural analysis of the RNA complexed to S-adenosylhomocysteine (SAH) and sinefungin and by measuring the affinity ofmore » SAM and SAH for a series of mutants using isothermal titration calorimetry. These data illustrate the importance of two universally conserved base pairs in the RNA that form electrostatic interactions with the positively charged sulfonium group of SAM, thereby providing a basis for discrimination between SAM and SAH.« less
Long non-coding RNA produced by RNA polymerase V determines boundaries of heterochromatin
Böhmdorfer, Gudrun; Sethuraman, Shriya; Rowley, M Jordan; Krzyszton, Michal; Rothi, M Hafiz; Bouzit, Lilia; Wierzbicki, Andrzej T
2016-01-01
RNA-mediated transcriptional gene silencing is a conserved process where small RNAs target transposons and other sequences for repression by establishing chromatin modifications. A central element of this process are long non-coding RNAs (lncRNA), which in Arabidopsis thaliana are produced by a specialized RNA polymerase known as Pol V. Here we show that non-coding transcription by Pol V is controlled by preexisting chromatin modifications located within the transcribed regions. Most Pol V transcripts are associated with AGO4 but are not sliced by AGO4. Pol V-dependent DNA methylation is established on both strands of DNA and is tightly restricted to Pol V-transcribed regions. This indicates that chromatin modifications are established in close proximity to Pol V. Finally, Pol V transcription is preferentially enriched on edges of silenced transposable elements, where Pol V transcribes into TEs. We propose that Pol V may play an important role in the determination of heterochromatin boundaries. DOI: http://dx.doi.org/10.7554/eLife.19092.001 PMID:27779094
Salehi, Abdolreza; Rivera, Rocío Melissa
2018-01-01
RNA editing increases the diversity of the transcriptome and proteome. Adenosine-to-inosine (A-to-I) editing is the predominant type of RNA editing in mammals and it is catalyzed by the adenosine deaminases acting on RNA (ADARs) family. Here, we used a largescale computational analysis of transcriptomic data from brain, heart, colon, lung, spleen, kidney, testes, skeletal muscle and liver, from three adult animals in order to identify RNA editing sites in bovine. We developed a computational pipeline and used a rigorous strategy to identify novel editing sites from RNA-Seq data in the absence of corresponding DNA sequence information. Our methods take into account sequencing errors, mapping bias, as well as biological replication to reduce the probability of obtaining a false-positive result. We conducted a detailed characterization of sequence and structural features related to novel candidate sites and found 1,600 novel canonical A-to-I editing sites in the nine bovine tissues analyzed. Results show that these sites 1) occur frequently in clusters and short interspersed nuclear elements (SINE) repeats, 2) have a preference for guanines depletion/enrichment in the flanking 5′/3′ nucleotide, 3) occur less often in coding sequences than other regions of the genome, and 4) have low evolutionary conservation. Further, we found that a positive correlation exists between expression of ADAR family members and tissue-specific RNA editing. Most of the genes with predicted A-to-I editing in each tissue were significantly enriched in biological terms relevant to the function of the corresponding tissue. Lastly, the results highlight the importance of the RNA editome in nervous system regulation. The present study extends the list of RNA editing sites in bovine and provides pipelines that may be used to investigate the editome in other organisms. PMID:29470549
GC-rich coding sequences reduce transposon-like, small RNA-mediated transgene silencing.
Sidorenko, Lyudmila V; Lee, Tzuu-Fen; Woosley, Aaron; Moskal, William A; Bevan, Scott A; Merlo, P Ann Owens; Walsh, Terence A; Wang, Xiujuan; Weaver, Staci; Glancy, Todd P; Wang, PoHao; Yang, Xiaozeng; Sriram, Shreedharan; Meyers, Blake C
2017-11-01
The molecular basis of transgene susceptibility to silencing is poorly characterized in plants; thus, we evaluated several transgene design parameters as means to reduce heritable transgene silencing. Analyses of Arabidopsis plants with transgenes encoding a microalgal polyunsaturated fatty acid (PUFA) synthase revealed that small RNA (sRNA)-mediated silencing, combined with the use of repetitive regulatory elements, led to aggressive transposon-like silencing of canola-biased PUFA synthase transgenes. Diversifying regulatory sequences and using native microalgal coding sequences (CDSs) with higher GC content improved transgene expression and resulted in a remarkable trans-generational stability via reduced accumulation of sRNAs and DNA methylation. Further experiments in maize with transgenes individually expressing three crystal (Cry) proteins from Bacillus thuringiensis (Bt) tested the impact of CDS recoding using different codon bias tables. Transgenes with higher GC content exhibited increased transcript and protein accumulation. These results demonstrate that the sequence composition of transgene CDSs can directly impact silencing, providing design strategies for increasing transgene expression levels and reducing risks of heritable loss of transgene expression.
Hezroni, Hadas; Koppstein, David; Schwartz, Matthew G; Avrutin, Alexandra; Bartel, David P; Ulitsky, Igor
2015-05-19
The inability to predict long noncoding RNAs from genomic sequence has impeded the use of comparative genomics for studying their biology. Here, we develop methods that use RNA sequencing (RNA-seq) data to annotate the transcriptomes of 16 vertebrates and the echinoid sea urchin, uncovering thousands of previously unannotated genes, most of which produce long intervening noncoding RNAs (lincRNAs). Although in each species, >70% of lincRNAs cannot be traced to homologs in species that diverged >50 million years ago, thousands of human lincRNAs have homologs with similar expression patterns in other species. These homologs share short, 5'-biased patches of sequence conservation nested in exonic architectures that have been extensively rewired, in part by transposable element exonization. Thus, over a thousand human lincRNAs are likely to have conserved functions in mammals, and hundreds beyond mammals, but those functions require only short patches of specific sequences and can tolerate major changes in gene architecture. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Yang, Lingna; Wang, Chongyuan; Li, Fudong; Zhang, Jiahai; Nayab, Anam; Wu, Jihui; Shi, Yunyu; Gong, Qingguo
2017-09-29
MEX-3 is a K-homology (KH) domain-containing RNA-binding protein first identified as a translational repressor in Caenorhabditis elegans , and its four orthologs (MEX-3A-D) in human and mouse were subsequently found to have E3 ubiquitin ligase activity mediated by a RING domain and critical for RNA degradation. Current evidence implicates human MEX-3C in many essential biological processes and suggests a strong connection with immune diseases and carcinogenesis. The highly conserved dual KH domains in MEX-3 proteins enable RNA binding and are essential for the recognition of the 3'-UTR and post-transcriptional regulation of MEX-3 target transcripts. However, the molecular mechanisms of translational repression and the consensus RNA sequence recognized by the MEX-3C KH domain are unknown. Here, using X-ray crystallography and isothermal titration calorimetry, we investigated the RNA-binding activity and selectivity of human MEX-3C dual KH domains. Our high-resolution crystal structures of individual KH domains complexed with a noncanonical U-rich and a GA-rich RNA sequence revealed that the KH1/2 domains of human MEX-3C bound MRE10, a 10-mer RNA (5'-CAGAGUUUAG-3') consisting of an eight-nucleotide MEX-3-recognition element (MRE) motif, with high affinity. Of note, we also identified a consensus RNA motif recognized by human MEX-3C. The potential RNA-binding sites in the 3'-UTR of the human leukocyte antigen serotype ( HLA-A2 ) mRNA were mapped with this RNA-binding motif and further confirmed by fluorescence polarization. The binding motif identified here will provide valuable information for future investigations of the functional pathways controlled by human MEX-3C and for predicting potential mRNAs regulated by this enzyme. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
RNA sequencing uncovers antisense RNAs and novel small RNAs in Streptococcus pyogenes.
Le Rhun, Anaïs; Beer, Yan Yan; Reimegård, Johan; Chylinski, Krzysztof; Charpentier, Emmanuelle
2016-01-01
Streptococcus pyogenes is a human pathogen responsible for a wide spectrum of diseases ranging from mild to life-threatening infections. During the infectious process, the temporal and spatial expression of pathogenicity factors is tightly controlled by a complex network of protein and RNA regulators acting in response to various environmental signals. Here, we focus on the class of small RNA regulators (sRNAs) and present the first complete analysis of sRNA sequencing data in S. pyogenes. In the SF370 clinical isolate (M1 serotype), we identified 197 and 428 putative regulatory RNAs by visual inspection and bioinformatics screening of the sequencing data, respectively. Only 35 from the 197 candidates identified by visual screening were assigned a predicted function (T-boxes, ribosomal protein leaders, characterized riboswitches or sRNAs), indicating how little is known about sRNA regulation in S. pyogenes. By comparing our list of predicted sRNAs with previous S. pyogenes sRNA screens using bioinformatics or microarrays, 92 novel sRNAs were revealed, including antisense RNAs that are for the first time shown to be expressed in this pathogen. We experimentally validated the expression of 30 novel sRNAs and antisense RNAs. We show that the expression profile of 9 sRNAs including 2 predicted regulatory elements is affected by the endoribonucleases RNase III and/or RNase Y, highlighting the critical role of these enzymes in sRNA regulation.
Dupont, L; Boizet-Bonhoure, B; Coddeville, M; Auvray, F; Ritzenthaler, P
1995-01-01
Temperate phage mv4 integrates its DNA into the chromosome of Lactobacillus delbrueckii subsp. bulgaricus strains via site-specific recombination. Nucleotide sequencing of a 2.2-kb attP-containing phage fragment revealed the presence of four open reading frames. The larger open reading frame, close to the attP site, encoded a 427-amino-acid polypeptide with similarity in its C-terminal domain to site-specific recombinases of the integrase family. Comparison of the sequences of attP, bacterial attachment site attB, and host-phage junctions attL and attR identified a 17-bp common core sequence, where strand exchange occurs during recombination. Analysis of the attB sequence indicated that the core region overlaps the 3' end of a tRNA(Ser) gene. Phage mv4 DNA integration into the tRNA(Ser) gene preserved an intact tRNA(Ser) gene at the attL site. An integration vector based on the mv4 attP site and int gene was constructed. This vector transforms a heterologous host, L. plantarum, through site-specific integration into the tRNA(Ser) gene of the genome and will be useful for development of an efficient integration system for a number of additional bacterial species in which an identical tRNA gene is present. PMID:7836291
The domain structure and distribution of Alu elements in long noncoding RNAs and mRNAs
Kim, Eugene Z.; Wespiser, Adam R.; Caffrey, Daniel R.
2016-01-01
Approximately 75% of the human genome is transcribed and many of these spliced transcripts contain primate-specific Alu elements, the most abundant mobile element in the human genome. The majority of exonized Alu elements are located in long noncoding RNAs (lncRNAs) and the untranslated regions of mRNA, with some performing molecular functions. To further assess the potential for Alu elements to be repurposed as functional RNA domains, we investigated the distribution and evolution of Alu elements in spliced transcripts. Our analysis revealed that Alu elements are underrepresented in mRNAs and lncRNAs, suggesting that most exonized Alu elements arising in the population are rare or deleterious to RNA function. When mRNAs and lncRNAs retain exonized Alu elements, they have a clear preference for Alu dimers, left monomers, and right monomers. mRNAs often acquire Alu elements when their genes are duplicated within Alu-rich regions. In lncRNAs, reverse-oriented Alu elements are significantly enriched and are not restricted to the 3′ and 5′ ends. Both lncRNAs and mRNAs primarily contain the Alu J and S subfamilies that were amplified relatively early in primate evolution. Alu J subfamilies are typically overrepresented in lncRNAs, whereas the Alu S dimer is overrepresented in mRNAs. The sequences of Alu dimers tend to be constrained in both lncRNAs and mRNAs, whereas the left and right monomers are constrained within particular Alu subfamilies and classes of RNA. Collectively, these findings suggest that Alu-containing RNAs are capable of forming stable structures and that some of these Alu domains might have novel biological functions. PMID:26654912
Burke, W D; Calalang, C C; Eickbush, T H
1987-01-01
Two classes of DNA elements interrupt a fraction of the rRNA repeats of Bombyx mori. We have analyzed by genomic blotting and sequence analysis one class of these elements which we have named R2. These elements occupy approximately 9% of the rDNA units of B. mori and appear to be homologous to the type II rDNA insertions detected in Drosophila melanogaster. Approximately 25 copies of R2 exist within the B. mori genome, of which at least 20 are located at a precise location within otherwise typical rDNA units. Nucleotide sequence analysis has revealed that the 4.2-kilobase-pair R2 element has a single large open reading frame, occupying over 82% of the total length of the element. The central region of this 1,151-amino-acid open reading frame shows homology to the reverse transcriptase enzymes found in retroviruses and certain transposable elements. Amino acid homology of this region is highest to the mobile line 1 elements of mammals, followed by the mitochondrial type II introns of fungi, and the pol gene of retroviruses. Less homology exists with transposable elements of D. melanogaster and Saccharomyces cerevisiae. Two additional regions of sequence homology between L1 and R2 elements were also found outside the reverse transcriptase region. We suggest that the R2 elements are retrotransposons that are site specific in their insertion into the genome. Such mobility would enable these elements to occupy a small fraction of the rDNA units of B. mori despite their continual elimination from the rDNA locus by sequence turnover. Images PMID:2439905
Deas, Tia S; Binduga-Gajewska, Iwona; Tilgner, Mark; Ren, Ping; Stein, David A; Moulton, Hong M; Iversen, Patrick L; Kauffman, Elizabeth B; Kramer, Laura D; Shi, Pei-Yong
2005-04-01
RNA elements within flavivirus genomes are potential targets for antiviral therapy. A panel of phosphorodiamidate morpholino oligomers (PMOs), whose sequences are complementary to RNA elements located in the 5'- and 3'-termini of the West Nile (WN) virus genome, were designed to anneal to important cis-acting elements and potentially to inhibit WN infection. A novel Arg-rich peptide was conjugated to each PMO for efficient cellular delivery. These PMOs exhibited various degrees of antiviral activity upon incubation with a WN virus luciferase-replicon-containing cell line. Among them, PMOs targeting the 5'-terminal 20 nucleotides (5'End) or targeting the 3'-terminal element involved in a potential genome cyclizing interaction (3'CSI) exhibited the greatest potency. When cells infected with an epidemic strain of WN virus were treated with the 5'End or 3'CSI PMO, virus titers were reduced by approximately 5 to 6 logs at a 5 muM concentration without apparent cytotoxicity. The 3'CSI PMO also inhibited mosquito-borne flaviviruses other than WN virus, and the antiviral potency correlated with the conservation of the targeted 3'CSI sequences of specific viruses. Mode-of-action analyses showed that the 5'End and 3'CSI PMOs suppressed viral infection through two distinct mechanisms. The 5'End PMO inhibited viral translation, whereas the 3'CSI PMO did not significantly affect viral translation but suppressed RNA replication. The results suggest that antisense PMO-mediated blocking of cis-acting elements of flavivirus genomes can potentially be developed into an anti-flavivirus therapy. In addition, we report that although a full-length WN virus containing a luciferase reporter (engineered at the 3' untranslated region of the genome) is not stable, an early passage of this reporting virus can be used to screen for inhibitors against any step of the virus life cycle.
Inverse PCR-based method for isolating novel SINEs from genome.
Han, Yawei; Chen, Liping; Guan, Lihong; He, Shunping
2014-04-01
Short interspersed elements (SINEs) are moderately repetitive DNA sequences in eukaryotic genomes. Although eukaryotic genomes contain numerous SINEs copy, it is very difficult and laborious to isolate and identify them by the reported methods. In this study, the inverse PCR was successfully applied to isolate SINEs from Opsariichthys bidens genome in Eastern Asian Cyprinid. A group of SINEs derived from tRNA(Ala) molecular had been identified, which were named Opsar according to Opsariichthys. SINEs characteristics were exhibited in Opsar, which contained a tRNA(Ala)-derived region at the 5' end, a tRNA-unrelated region, and AT-rich region at the 3' end. The tRNA-derived region of Opsar shared 76 % sequence similarity with tRNA(Ala) gene. This result indicated that Opsar could derive from the inactive or pseudogene of tRNA(Ala). The reliability of method was tested by obtaining C-SINE, Ct-SINE, and M-SINEs from Ctenopharyngodon idellus, Megalobrama amblycephala, and Cyprinus carpio genomes. This method is simpler than the previously reported, which successfully omitted many steps, such as preparation of probes, construction of genomic libraries, and hybridization.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Garcia-Zepeda, E.A.; Sarafi, M.N.; Luster, A.D.
1997-05-01
Eotaxin is a CC chemokine that is a specific chemoattractant for eosinophils and is implicated in the pathogenesis of eosinophilic inflammatory diseases, such as asthma. We describe the genomic organization, complete sequence, including 1354 bp 5{prime} of the RNA initiation site, and chromosomal localization of the human eotaxin gene. Fluorescence in situ hybridization analysis localized eotaxin to human chromosome 17, in the region q21.1-q21.2, and the human gene name SCYA11 was assigned. We also present the 5{prime} flanking sequence of the mouse eotaxin gene and have identified several regulatory elements that are conserved between the murine and the human promoters.more » In particular, the presence of elements such as NF-{Kappa}B, interferon-{gamma} response element, and glucocorticoid response element may explain the observed regulation of the eotaxin gene by cytokines and glucocorticoids. 17 refs., 4 figs., 1 tab.« less
2013-01-01
Background Accurate and complete identification of mobile elements is a challenging task in the current era of sequencing, given their large numbers and frequent truncations. Group II intron retroelements, which consist of a ribozyme and an intron-encoded protein (IEP), are usually identified in bacterial genomes through their IEP; however, the RNA component that defines the intron boundaries is often difficult to identify because of a lack of strong sequence conservation corresponding to the RNA structure. Compounding the problem of boundary definition is the fact that a majority of group II intron copies in bacteria are truncated. Results Here we present a pipeline of 11 programs that collect and analyze group II intron sequences from GenBank. The pipeline begins with a BLAST search of GenBank using a set of representative group II IEPs as queries. Subsequent steps download the corresponding genomic sequences and flanks, filter out non-group II introns, assign introns to phylogenetic subclasses, filter out incomplete and/or non-functional introns, and assign IEP sequences and RNA boundaries to the full-length introns. In the final step, the redundancy in the data set is reduced by grouping introns into sets of ≥95% identity, with one example sequence chosen to be the representative. Conclusions These programs should be useful for comprehensive identification of group II introns in sequence databases as data continue to rapidly accumulate. PMID:24359548
Dynamic ASXL1 Exon Skipping and Alternative Circular Splicing in Single Human Cells
Natarajan, Sivaraman; Carter, Robert; Brown, Patrick O.
2016-01-01
Circular RNAs comprise a poorly understood new class of noncoding RNA. In this study, we used a combination of targeted deletion, high-resolution splicing detection, and single-cell sequencing to deeply probe ASXL1 circular splicing. We found that efficient circular splicing required the canonical transcriptional start site and inverted AluSx elements. Sequencing-based interrogation of isoforms after ASXL1 overexpression identified promiscuous linear splicing between all exons, with the two most abundant non-canonical linear products skipping the exons that produced the circular isoforms. Single-cell sequencing revealed a strong preference for either the linear or circular ASXL1 isoforms in each cell, and found the predominant exon skipping product is frequently co-expressed with its reciprocal circular isoform. Finally, absolute quantification of ASXL1 isoforms confirmed our findings and suggests that standard methods overestimate circRNA abundance. Taken together, these data reveal a dynamic new view of circRNA genesis, providing additional framework for studying their roles in cellular biology. PMID:27736885
CRISPR/Cas9 for genome editing: progress, implications and challenges.
Zhang, Feng; Wen, Yan; Guo, Xiong
2014-09-15
Clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated (Cas) protein 9 system provides a robust and multiplexable genome editing tool, enabling researchers to precisely manipulate specific genomic elements, and facilitating the elucidation of target gene function in biology and diseases. CRISPR/Cas9 comprises of a nonspecific Cas9 nuclease and a set of programmable sequence-specific CRISPR RNA (crRNA), which can guide Cas9 to cleave DNA and generate double-strand breaks at target sites. Subsequent cellular DNA repair process leads to desired insertions, deletions or substitutions at target sites. The specificity of CRISPR/Cas9-mediated DNA cleavage requires target sequences matching crRNA and a protospacer adjacent motif locating at downstream of target sequences. Here, we review the molecular mechanism, applications and challenges of CRISPR/Cas9-mediated genome editing and clinical therapeutic potential of CRISPR/Cas9 in future. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bennetzen, Jeffrey L; Yang, Xiaohan; Ye, Chuyu
We generated a high-quality reference genome sequence for foxtail millet (Setaria italica). The {approx}400-Mb assembly covers {approx}80% of the genome and >95% of the gene space. The assembly was anchored to a 992-locus genetic map and was annotated by comparison with >1.3 million expressed sequence tag reads. We produced more than 580 million RNA-Seq reads to facilitate expression analyses. We also sequenced Setaria viridis, the ancestral wild relative of S. italica, and identified regions of differential single-nucleotide polymorphism density, distribution of transposable elements, small RNA content, chromosomal rearrangement and segregation distortion. The genus Setaria includes natural and cultivated species thatmore » demonstrate a wide capacity for adaptation. The genetic basis of this adaptation was investigated by comparing five sequenced grass genomes. We also used the diploid Setaria genome to evaluate the ongoing genome assembly of a related polyploid, switchgrass (Panicum virgatum).« less
An Autonomous BMP2 Regulatory Element in Mesenchymal Cells
Kruithof, Boudewijn P.T.; Fritz, David T.; Liu, Yijun; Garsetti, Diane E.; Frank, David B.; Pregizer, Steven K.; Gaussin, Vinciane; Mortlock, Douglas P.; Rogers, Melissa B.
2014-01-01
BMP2 is a morphogen that controls mesenchymal cell differentiation and behavior. For example, BMP2 concentration controls the differentiation of mesenchymal precursors into myocytes, adipocytes, chondrocytes, and osteoblasts. Sequences within the 3′untranslated region (UTR) of the Bmp2 mRNA mediate a post-transcriptional block of protein synthesis. Interaction of cell and developmental stage-specific trans-regulatory factors with the 3′UTR is a nimble and versatile mechanism for modulating this potent morphogen in different cell types. We show here, that an ultra-conserved sequence in the 3′UTR functions independently of promoter, coding region, and 3′UTR context in primary and immortalized tissue culture cells and in transgenic mice. Our findings indicate that the ultra-conserved sequence is an autonomously functioning post-transcriptional element that may be used to modulate the level of BMP2 and other proteins while retaining tissue specific regulatory elements. PMID:21268088
Atrx promotes heterochromatin formation at retrotransposons
Sadic, Dennis; Schmidt, Katharina; Groh, Sophia; Kondofersky, Ivan; Ellwart, Joachim; Fuchs, Christiane; Theis, Fabian J; Schotta, Gunnar
2015-01-01
More than 50% of mammalian genomes consist of retrotransposon sequences. Silencing of retrotransposons by heterochromatin is essential to ensure genomic stability and transcriptional integrity. Here, we identified a short sequence element in intracisternal A particle (IAP) retrotransposons that is sufficient to trigger heterochromatin formation. We used this sequence in a genome-wide shRNA screen and identified the chromatin remodeler Atrx as a novel regulator of IAP silencing. Atrx binds to IAP elements and is necessary for efficient heterochromatin formation. In addition, Atrx facilitates a robust and largely inaccessible heterochromatin structure as Atrx knockout cells display increased chromatin accessibility at retrotransposons and non-repetitive heterochromatic loci. In summary, we demonstrate a direct role of Atrx in the establishment and robust maintenance of heterochromatin. PMID:26012739
Functional Information Stored in the Conserved Structural RNA Domains of Flavivirus Genomes
Fernández-Sanlés, Alba; Ríos-Marco, Pablo; Romero-López, Cristina; Berzal-Herranz, Alfredo
2017-01-01
The genus Flavivirus comprises a large number of small, positive-sense single-stranded, RNA viruses able to replicate in the cytoplasm of certain arthropod and/or vertebrate host cells. The genus, which has some 70 member species, includes a number of emerging and re-emerging pathogens responsible for outbreaks of human disease around the world, such as the West Nile, dengue, Zika, yellow fever, Japanese encephalitis, St. Louis encephalitis, and tick-borne encephalitis viruses. Like other RNA viruses, flaviviruses have a compact RNA genome that efficiently stores all the information required for the completion of the infectious cycle. The efficiency of this storage system is attributable to supracoding elements, i.e., discrete, structural units with essential functions. This information storage system overlaps and complements the protein coding sequence and is highly conserved across the genus. It therefore offers interesting potential targets for novel therapeutic strategies. This review summarizes our knowledge of the features of flavivirus genome functional RNA domains. It also provides a brief overview of the main achievements reported in the design of antiviral nucleic acid-based drugs targeting functional genomic RNA elements. PMID:28421048
Structural imprints in vivo decode RNA regulatory mechanisms.
Spitale, Robert C; Flynn, Ryan A; Zhang, Qiangfeng Cliff; Crisalli, Pete; Lee, Byron; Jung, Jong-Wha; Kuchelmeister, Hannes Y; Batista, Pedro J; Torre, Eduardo A; Kool, Eric T; Chang, Howard Y
2015-03-26
Visualizing the physical basis for molecular behaviour inside living cells is a great challenge for biology. RNAs are central to biological regulation, and the ability of RNA to adopt specific structures intimately controls every step of the gene expression program. However, our understanding of physiological RNA structures is limited; current in vivo RNA structure profiles include only two of the four nucleotides that make up RNA. Here we present a novel biochemical approach, in vivo click selective 2'-hydroxyl acylation and profiling experiment (icSHAPE), which enables the first global view, to our knowledge, of RNA secondary structures in living cells for all four bases. icSHAPE of the mouse embryonic stem cell transcriptome versus purified RNA folded in vitro shows that the structural dynamics of RNA in the cellular environment distinguish different classes of RNAs and regulatory elements. Structural signatures at translational start sites and ribosome pause sites are conserved from in vitro conditions, suggesting that these RNA elements are programmed by sequence. In contrast, focal structural rearrangements in vivo reveal precise interfaces of RNA with RNA-binding proteins or RNA-modification sites that are consistent with atomic-resolution structural data. Such dynamic structural footprints enable accurate prediction of RNA-protein interactions and N(6)-methyladenosine (m(6)A) modification genome wide. These results open the door for structural genomics of RNA in living cells and reveal key physiological structures controlling gene expression.
Butler, Nathaniel M; Hannapel, David J
2012-12-01
Polypyrimidine tract-binding (PTB) proteins are RNA-binding proteins that target specific RNAs for post-transcriptional processing by binding cytosine/uracil motifs. PTBs have established functions in a range of RNA processes including splicing, translation, stability and long-distance transport. Six PTB-like genes identified in potato have been grouped into two clades based on homology to other known plant PTBs. StPTB1 and StPTB6 are closely related to a PTB protein discovered in pumpkin, designated CmRBP50, and contain four canonical RNA-recognition motifs. CmRBP50 is expressed in phloem tissues and functions as the core protein of a phloem-mobile RNA/protein complex. Sequence from the potato genome database was used to clone the upstream sequence of these two PTB genes and analyzed to identify conserved cis-elements. The promoter of StPTB6 was enriched for regulatory elements for light and sucrose induction and defense. Upstream sequence of both PTB genes was fused to β-glucuronidase and monitored in transgenic potato lines. In whole plants, the StPTB1 promoter was most active in leaf veins and petioles, whereas StPTB6 was most active in leaf mesophyll. Both genes are active in new tubers and tuber sprouts. StPTB6 expression was induced in stems and stolon sections in response to sucrose and in leaves or petioles in response to light, heat, drought and mechanical wounding. These results show that CmRBP50-like genes of potato exhibit distinct expression patterns and respond to both developmental and environmental cues.
Functional noncoding sequences derived from SINEs in the mammalian genome.
Nishihara, Hidenori; Smit, Arian F A; Okada, Norihiro
2006-07-01
Recent comparative analyses of mammalian sequences have revealed that a large number of nonprotein-coding genomic regions are under strong selective constraint. Here, we report that some of these loci have been derived from a newly defined family of ancient SINEs (short interspersed repetitive elements). This is a surprising result, as SINEs and other transposable elements are commonly thought to be genomic parasites. We named the ancient SINE family AmnSINE1, for Amniota SINE1, because we found it to be present in mammals as well as in birds, and some copies predate the mammalian-bird split 310 million years ago (Mya). AmnSINE1 has a chimeric structure of a 5S rRNA and a tRNA-derived SINE, and is related to five tRNA-derived SINE families that we characterized here in the coelacanth, dogfish shark, hagfish, and amphioxus genomes. All of the newly described SINE families have a common central domain that is also shared by zebrafish SINE3, and we collectively name them the DeuSINE (Deuterostomia SINE) superfamily. Notably, of the approximately 1000 still identifiable copies of AmnSINE1 in the human genome, 105 correspond to loci phylogenetically highly conserved among mammalian orthologs. The conservation is strongest over the central domain. Thus, AmnSINE1 appears to be the best example of a transposable element of which a significant fraction of the copies have acquired genomic functionality.
Meyer, C; Pouteau, S; Rouzé, P; Caboche, M
1994-01-01
By Northern blot analysis of nitrate reductase-deficient mutants of Nicotiana plumbaginifolia, we identified a mutant (mutant D65), obtained after gamma-ray irradiation of protoplasts, which contained an insertion sequence in the nitrate reductase (NR) mRNA. This insertion sequence was localized by polymerase chain reaction (PCR) in the first exon of NR and was also shown to be present in the NR gene. The mutant gene contained a 565 bp insertion sequence that exhibits the sequence characteristics of a transposable element, which was thus named dTnp1. The dTnp1 element has 14 bp terminal inverted repeats and is flanked by an 8-bp target site duplication generated upon transposition. These inverted repeats have significant sequence homology with those of other transposable elements. Judging by its size and the absence of a long open reading frame, dTnp1 appears to represent a defective, although mobile, transposable element. The octamer motif TTTAGGCC was found several times in direct orientation near the 5' and 3' ends of dTnp1 together with a perfect palindrome located after the 5' inverted repeat. Southern blot analysis using an internal probe of dTnp1 suggested that this element occurs as a single copy in the genome of N. plumbaginifolia. It is also present in N. tabacum, but absent in tomato or petunia. The dTnp1 element is therefore of potential use for gene tagging in Nicotiana species.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chattopadhyay, Saket; Ely, Abdullah; Bloom, Kristie
2009-11-20
RNA interference (RNAi) may be harnessed to inhibit viral gene expression and this approach is being developed to counter chronic infection with hepatitis B virus (HBV). Compared to synthetic RNAi activators, DNA expression cassettes that generate silencing sequences have advantages of sustained efficacy and ease of propagation in plasmid DNA (pDNA). However, the large size of pDNAs and inclusion of sequences conferring antibiotic resistance and immunostimulation limit delivery efficiency and safety. To develop use of alternative DNA templates that may be applied for therapeutic gene silencing, we assessed the usefulness of PCR-generated linear expression cassettes that produce anti-HBV micro-RNA (miR)more » shuttles. We found that silencing of HBV markers of replication was efficient (>75%) in cell culture and in vivo. miR shuttles were processed to form anti-HBV guide strands and there was no evidence of induction of the interferon response. Modification of terminal sequences to include flanking human adenoviral type-5 inverted terminal repeats was easily achieved and did not compromise silencing efficacy. These linear DNA sequences should have utility in the development of gene silencing applications where modifications of terminal elements with elimination of potentially harmful and non-essential sequences are required.« less
Guardian small RNAs and sex determination.
Katsuma, Susumu; Kawamoto, Munetaka; Kiuchi, Takashi
2014-01-01
The W chromosome of the silkworm Bombyx mori has been known to determine femaleness for more than 80 years. However, the feminizing gene has not been molecularly identified, because the B. mori W chromosome is almost fully occupied by a large number of transposable elements. The W chromosome-derived feminizing factor of B. mori was recently shown to be a female-specific PIWI-interacting RNA (piRNA). piRNAs are small RNAs that potentially repress invading "non-self" elements (e.g., transposons and virus-like elements) by associating with PIWI proteins. Our results revealed that female-specific piRNA precursors, which we named Fem, are transcribed from the sex-determining region of the W chromosome at the early embryonic stage and are processed into a single mature piRNA (Fem piRNA). Fem piRNA forms a complex with Siwi (silkworm Piwi), which cleaves a protein-coding mRNA transcribed from the Z chromosome. RNA interference of this Z-linked gene, which we named Masc, revealed that this gene encodes a protein required for masculinization and dosage compensation. Fem and Masc both participate in the ping-pong cycle of the piRNA amplification loop by associating with the 2 B. mori PIWI proteins Siwi and BmAgo3 (silkworm Ago3), respectively, indicating that the piRNA-mediated interaction between the 2 sex chromosomes is the primary signal for the B. mori sex determination cascade. Fem is a non-transposable repetitive sequence on the W chromosome, whereas Masc is a single-copy protein-coding gene. It is of great interest how the piRNA system recognizes "self "Masc mRNA as "non-self" RNA.
Brady, J; Radonovich, M; Thoren, M; Das, G; Salzman, N P
1984-01-01
We have previously identified an 11-base DNA sequence, 5'-G-G-T-A-C-C-T-A-A-C-C-3' (simian virus 40 [SV40] map position 294 to 304), which is important in the control of SV40 late RNA expression in vitro and in vivo (Brady et al., Cell 31:625-633, 1982). We report here the identification of another domain of the SV40 late promoter. A series of mutants with deletions extending from SV40 map position 0 to 300 was prepared by nuclease BAL 31 treatment. The cloned templates were then analyzed for efficiency and accuracy of late SV40 RNA expression in the Manley in vitro transcription system. Our studies showed that, in addition to the promoter domain near map position 300, there are essential DNA sequences between nucleotide positions 74 and 95 that are required for efficient expression of late SV40 RNA. Included in this SV40 DNA sequence were two of the six GGGCGG SV40 repeat sequences and an 11-nucleotide segment which showed strong homology with the upstream sequences required for the efficient in vitro and in vivo expression of the histone H2A gene. This upstream promoter sequence supported transcription with the same efficiency even when it was moved 72 nucleotides closer to the major late cap site. In vitro promoter competition analysis demonstrated that the upstream promoter sequence, independent of the 294 to 304 promoter element, is capable of binding polymerase-transcription factors required for SV40 late gene transcription. Finally, we show that DNA sequences which control the specificity of RNA initiation at nucleotide 325 lie downstream of map position 294. Images PMID:6321950
Non-contiguous genome sequence of Mycobacterium simiae strain DSM 44165(T.).
Sassi, Mohamed; Robert, Catherine; Raoult, Didier; Drancourt, Michel
2013-01-01
Mycobacterium simiae is a non-tuberculosis mycobacterium causing pulmonary infections in both immunocompetent and imunocompromized patients. We announce the draft genome sequence of M. simiae DSM 44165(T). The 5,782,968-bp long genome with 65.15% GC content (one chromosome, no plasmid) contains 5,727 open reading frames (33% with unknown function and 11 ORFs sizing more than 5000 -bp), three rRNA operons, 52 tRNA, one 66-bp tmRNA matching with tmRNA tags from Mycobacterium avium, Mycobacterium tuberculosis, Mycobacterium bovis, Mycobacterium microti, Mycobacterium marinum, and Mycobacterium africanum and 389 DNA repetitive sequences. Comparing ORFs and size distribution between M. simiae and five other Mycobacterium species M. simiae clustered with M. abscessus and M. smegmatis. A 40-kb prophage was predicted in addition to two prophage-like elements, 7-kb and 18-kb in size, but no mycobacteriophage was seen after the observation of 10(6) M. simiae cells. Fifteen putative CRISPRs were found. Three genes were predicted to encode resistance to aminoglycosides, betalactams and macrolide-lincosamide-streptogramin B. A total of 163 CAZYmes were annotated. M. simiae contains ESX-1 to ESX-5 genes encoding for a type-VII secretion system. Availability of the genome sequence may help depict the unique properties of this environmental, opportunistic pathogen.
Anish, Ramakrishnan; Hossain, Mohammad B.; Jacobson, Raymond H.; Takada, Shinako
2009-01-01
Background More than 80% of mammalian protein-coding genes are driven by TATA-less promoters which often show multiple transcriptional start sites (TSSs). However, little is known about the core promoter DNA sequences or mechanisms of transcriptional initiation for this class of promoters. Methodology/Principal Findings Here we identify a new core promoter element XCPE2 (X core promoter element 2) (consensus sequence: A/C/G-C-C/T-C-G/A-T-T-G/A-C-C/A+1-C/T) that can direct specific transcription from the second TSS of hepatitis B virus X gene mRNA. XCPE2 sequences can also be found in human promoter regions and typically appear to drive one of the start sites within multiple TSS-containing TATA-less promoters. To gain insight into mechanisms of transcriptional initiation from this class of promoters, we examined requirements of several general transcription factors by in vitro transcription experiments using immunodepleted nuclear extracts and purified factors. Our results show that XCPE2-driven transcription uses at least TFIIB, either TFIID or free TBP, RNA polymerase II (RNA pol II) and the MED26-containing mediator complex but not Gcn5. Therefore, XCPE2-driven transcription can be carried out by a mechanism which differs from previously described TAF-dependent mechanisms for initiator (Inr)- or downstream promoter element (DPE)-containing promoters, the TBP- and SAGA (Spt-Ada-Gcn5-acetyltransferase)-dependent mechanism for yeast TATA-containing promoters, or the TFTC (TBP-free-TAF-containing complex)-dependent mechanism for certain Inr-containing TATA-less promoters. EMSA assays using XCPE2 promoter and purified factors further suggest that XCPE2 promoter recognition requires a set of factors different from those for TATA box, Inr, or DPE promoter recognition. Conclusions/Significance We identified a new core promoter element XCPE2 that are found in multiple TSS-containing TATA-less promoters. Mechanisms of promoter recognition and transcriptional initiation for XCPE2-driven promoters appear different from previously shown mechanisms for classical promoters that show single “focused” TSSs. Our studies provide insight into novel mechanisms of RNA Pol II transcription from multiple TSS-containing TATA-less promoters. PMID:19337366
Lee, Tzuu-fen; Gurazada, Sai Guna Ranjan; Zhai, Jixian; Li, Shengben; Simon, Stacey A; Matzke, Marjori A; Chen, Xuemei; Meyers, Blake C
2012-07-01
In plants, heterochromatin is maintained by a small RNA-based gene silencing mechanism known as RNA-directed DNA methylation (RdDM). RdDM requires the non-redundant functions of two plant-specific DNA-dependent RNA polymerases (RNAP), RNAP IV and RNAP V. RNAP IV plays a major role in siRNA biogenesis, while RNAP V may recruit DNA methylation machinery to target endogenous loci for silencing. Although small RNA-generating regions that are dependent on both RNAP IV and RNAP V have been identified previously, the genomic loci targeted by RNAP V for siRNA accumulation and silencing have not been described extensively. To characterize the RNAP V-dependent, heterochromatic siRNA-generating regions in the Arabidopsis genome, we deeply sequenced the small RNA populations of wild-type and RNAP V null mutant (nrpe1) plants. Our results showed that RNAP V-dependent siRNA-generating loci are associated predominately with short repetitive sequences in intergenic regions. Suppression of small RNA production from short repetitive sequences was also prominent in RdDM mutants including dms4, drd1, dms3 and rdm1, reflecting the known association of these RdDM effectors with RNAP V. The genomic regions targeted by RNAP V were small, with an estimated average length of 238 bp. Our results suggest that RNAP V affects siRNA production from genomic loci with features dissimilar to known RNAP IV-dependent loci. RNAP V, along with RNAP IV and DRM1/2, may target and silence a set of small, intergenic transposable elements located in dispersed genomic regions for silencing. Silencing at these loci may be actively reinforced by RdDM.
Martoni, Francesco; Eickbush, Danna G.; Scavariello, Claudia; Luchetti, Andrea; Mantovani, Barbara
2015-01-01
R2 is an extensively investigated non-LTR retrotransposon that specifically inserts into the 28S rRNA gene sequences of a wide range of metazoans, disrupting its functionality. During R2 integration, first strand synthesis can be incomplete so that 5’ end deleted copies are occasionally inserted. While active R2 copies repopulate the locus by retrotransposing, the non-functional truncated elements should frequently be eliminated by molecular drive processes leading to the concerted evolution of the rDNA array(s). Although, multiple R2 lineages have been discovered in the genome of many animals, the rDNA of the stick insect Bacillus rossius exhibits a peculiar situation: it harbors both a canonical, functional R2 element (R2Brfun) as well as a full-length but degenerate element (R2Brdeg). An intensive sequencing survey in the present study reveals that all truncated variants in stick insects are present in multiple copies suggesting they were duplicated by unequal recombination. Sequencing results also demonstrate that all R2Brdeg copies are full-length, i. e. they have no associated 5' end deletions, and functional assays indicate they have lost the active ribozyme necessary for R2 RNA maturation. Although it cannot be completely ruled out, it seems unlikely that the degenerate elements replicate via reverse transcription, exploiting the R2Brfun element enzymatic machinery, but rather via genomic amplification of inserted 28S by unequal recombination. That inactive copies (both R2Brdeg or 5'-truncated elements) are not eliminated in a short term in stick insects contrasts with findings for the Drosophila R2, suggesting a widely different management of rDNA loci and a lower efficiency of the molecular drive while achieving the concerted evolution. PMID:25799008
NMR studies of two spliced leader RNAs using isotope labeling
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lapham, J.; Crothers, D.M.
1994-12-01
Spliced leader RNAs are a class of RNA molecules (<200 nts) involved in the trans splicing of messenger RNA found in trypanosomes, nematodes, and other lower eukaryotes. The spliced leader RNA from the trypanosome Leptomonas Collosoma exists in two alternate structural forms with similar thermal stabilities. The 54 nucleotides on the 5{prime} end of the SL molecule is structurally independent from the 3{prime} half of the RNA, and displays the two structural forms. Furthermore, the favored of the two structures was shown to contain anomalous nuclease sensitivity and thermal stability features, which suggests that there may be tertiary interactions betweenmore » the splice site and other nucleotides in the 5{prime} end. Multidimensional NMR studies are underway to elucidate the structural elements present in the SL RNAs that give rise to their physical properties. Two spliced leader sequences have been studied. The first, the 54 nucleotides on the 5{prime} end of the L. Collosoma sequence, was selected because of earlier studies in our laboratory. The second sequence is the 5{prime} end of the trypanosome Crithidia Fasciculata, which was chosen because of its greater sequence homology to other SL sequences. Given the complexity of the NMR spectra for RNA molecules of this size, we have incorporated {sup 15}N/{sup 13}C-labeled nucleotides into the RNA. One of the techniques we have developed to simplify the spectra of these RNA molecules is isotope labeling of specific regions of the RNA. This has been especially helpful in assigning the secondary structure of molecules that may be able to adopt multiple conformations. Using this technique one can examine a part of the molecule without spectral interference from the unlabeled portion. We hope this approach will promote an avenue for studying the structure of larger RNAs in their native surroundings.« less
Structural imprints in vivo decode RNA regulatory mechanisms
Spitale, Robert C.; Flynn, Ryan A.; Zhang, Qiangfeng Cliff; Crisalli, Pete; Lee, Byron; Jung, Jong-Wha; Kuchelmeister, Hannes Y.; Batista, Pedro J.; Torre, Eduardo A.; Kool, Eric T.; Chang, Howard Y.
2015-01-01
Visualizing the physical basis for molecular behavior inside living cells is a grand challenge in biology. RNAs are central to biological regulation, and RNA’s ability to adopt specific structures intimately controls every step of the gene expression program1. However, our understanding of physiological RNA structures is limited; current in vivo RNA structure profiles view only two of four nucleotides that make up RNA2,3. Here we present a novel biochemical approach, In Vivo Click SHAPE (icSHAPE), that enables the first global view of RNA secondary structures of all four bases in living cells. icSHAPE of mouse embryonic stem cell transcriptome versus purified RNA folded in vitro shows that the structural dynamics of RNA in the cellular environment distinguishes different classes of RNAs and regulatory elements. Structural signatures at translational start sites and ribosome pause sites are conserved from in vitro, suggesting that these RNA elements are programmed by sequence. In contrast, focal structural rearrangements in vivo reveal precise interfaces of RNA with RNA binding proteins or RNA modification sites that are consistent with atomic-resolution structural data. Such dynamic structural footprints enable accurate prediction of RNA-protein interactions and N6-methyladenosine (m6A) modification genome-wide. These results open the door for structural genomics of RNA in living cells and reveal key physiological structures controlling gene expression. PMID:25799993
Gene organization and alternative splicing of human prohormone convertase PC8.
Goodge, K A; Thomas, R J; Martin, T J; Gillespie, M T
1998-01-01
The mammalian Ca2+-dependent serine protease prohormone convertase PC8 is expressed ubiquitously, being transcribed as 3.5, 4.3 and 6.0 kb mRNA isoforms in various tissues. To determine the origin of these various mRNA isoforms we report the characterization of the human PC8 gene, which has been previously localized to chromosome 11q23-24. Consisting of 16 exons, the human PC8 gene spans approx. 27 kb. A comparison of the position of intron-exon junctions of the human PC8 gene with the gene structures of previously reported prohormone convertase genes demonstrated a divergence of the human PC8 from the highly conserved nature of the gene organization of this enzyme family. The nucleotide sequence of the 5'-flanking region of the human PC8 is reported and possesses putative promoter elements characteristic of a GC-rich promoter. Further supporting the potential role of a GC-rich promoter element, multiple transcriptional initiation sites within a 200 bp region were demonstrated. We propose that the various mRNA isoforms of PC8 result from the inclusion of intronic sequences within transcripts. PMID:9820811
Dictyostelium mobile elements: strategies to amplify in a compact genome.
Winckler, T; Dingermann, T; Glöckner, G
2002-12-01
Dictyostelium discoideum is a eukaryotic microorganism that is attractive for the study of fundamental biological phenomena such as cell-cell communication, formation of multicellularity, cell differentiation and morphogenesis. Large-scale sequencing of the D. discoideum genome has provided new insights into evolutionary strategies evolved by transposable elements (TEs) to settle in compact microbial genomes and to maintain active populations over evolutionary time. The high gene density (about 1 gene/2.6 kb) of the D. discoideum genome leaves limited space for selfish molecular invaders to move and amplify without causing deleterious mutations that eradicate their host. Targeting of transfer RNA (tRNA) gene loci appears to be a generally successful strategy for TEs residing in compact genomes to insert away from coding regions. In D. discoideum, tRNA gene-targeted retrotransposition has evolved independently at least three times by both non-long terminal repeat (LTR) retrotransposons and retrovirus-like LTR retrotransposons. Unlike the nonspecifically inserting D. discoideum TEs, which have a strong tendency to insert into preexisting TE copies and form large and complex clusters near the ends of chromosomes, the tRNA gene-targeted retrotransposons have managed to occupy 75% of the tRNA gene loci spread on chromosome 2 and represent 80% of the TEs recognized on the assembled central 6.5-Mb part of chromosome 2. In this review we update the available information about D. discoideum TEs which emerges both from previous work and current large-scale genome sequencing, with special emphasis on the fact that tRNA genes are principal determinants of retrotransposon insertions into the D. discoideum genome.
Cassandra retrotransposons carry independently transcribed 5S RNA
Kalendar, Ruslan; Tanskanen, Jaakko; Chang, Wei; Antonius, Kristiina; Sela, Hanan; Peleg, Ofer; Schulman, Alan H.
2008-01-01
We report a group of TRIMs (terminal-repeat retrotransposons in miniature), which are small nonautonomous retrotransposons. These elements, named Cassandra, universally carry conserved 5S RNA sequences and associated RNA polymerase (pol) III promoters and terminators in their long terminal repeats (LTRs). They were found in all vascular plants investigated. Uniquely for LTR retrotransposons, Cassandra produces noncapped, polyadenylated transcripts from the 5S pol III promoter. Capped, read-through transcripts containing Cassandra sequences can also be detected in RNA and in EST databases. The predicted Cassandra RNA 5S secondary structures resemble those for cellular 5S rRNA, with high information content specifically in the pol III promoter region. Genic integration sites are common for Cassandra, an unusual feature for abundant retrotransposons. The 5S in each LTR produces a tandem 5S arrangement with an inter-5S spacing resembling that of cellular 5S. The distribution of 5S genes is very variable in flowering plants and may be partially explained by Cassandra activity. Cassandra thus appears both to have adapted a ubiquitous cellular gene for ribosomal RNA for use as a promoter and to parasitize an as-yet-unidentified group of retrotransposons for the proteins needed in its lifecycle. PMID:18408163
May, Jared; Johnson, Philip; Saleem, Huma
2017-01-01
ABSTRACT To maximize the coding potential of viral genomes, internal ribosome entry sites (IRES) can be used to bypass the traditional requirement of a 5′ cap and some/all of the associated translation initiation factors. Although viral IRES typically contain higher-order RNA structure, an unstructured sequence of about 84 nucleotides (nt) immediately upstream of the Turnip crinkle virus (TCV) coat protein (CP) open reading frame (ORF) has been found to promote internal expression of the CP from the genomic RNA (gRNA) both in vitro and in vivo. An absence of extensive RNA structure was predicted using RNA folding algorithms and confirmed by selective 2′-hydroxyl acylation analyzed by primer extension (SHAPE) RNA structure probing. Analysis of the IRES region in vitro by use of both the TCV gRNA and reporter constructs did not reveal any sequence-specific elements but rather suggested that an overall lack of structure was an important feature for IRES activity. The CP IRES is A-rich, independent of orientation, and strongly conserved among viruses in the same genus. The IRES was dependent on eIF4G, but not eIF4E, for activity. Low levels of CP accumulated in vivo in the absence of detectable TCV subgenomic RNAs, strongly suggesting that the IRES was active in the gRNA in vivo. Since the TCV CP also serves as the viral silencing suppressor, early translation of the CP from the viral gRNA is likely important for countering host defenses. Cellular mRNA IRES also lack extensive RNA structures or sequence conservation, suggesting that this viral IRES and cellular IRES may have similar strategies for internal translation initiation. IMPORTANCE Cap-independent translation is a common strategy among positive-sense, single-stranded RNA viruses for bypassing the host cell requirement of a 5′ cap structure. Viral IRES, in general, contain extensive secondary structure that is critical for activity. In contrast, we demonstrate that a region of viral RNA devoid of extensive secondary structure has IRES activity and produces low levels of viral coat protein in vitro and in vivo. Our findings may be applicable to cellular mRNA IRES that also have little or no sequences/structures in common. PMID:28179526
The RNA helicase RHAU (DHX36) suppresses expression of the transcription factor PITX1.
Booy, Evan P; Howard, Ryan; Marushchak, Oksana; Ariyo, Emmanuel O; Meier, Markus; Novakowski, Stefanie K; Deo, Soumya R; Dzananovic, Edis; Stetefeld, Jörg; McKenna, Sean A
2014-03-01
RNA Helicase associated with AU-rich element (RHAU) (DHX36) is a DEAH (Aspartic acid, Glumatic Acid, Alanine, Histidine)-box RNA helicase that can bind and unwind G4-quadruplexes in DNA and RNA. To detect novel RNA targets of RHAU, we performed an RNA co-immunoprecipitation screen and identified the PITX1 messenger RNA (mRNA) as specifically and highly enriched. PITX1 is a homeobox transcription factor with roles in both development and cancer. Primary sequence analysis identified three probable quadruplexes within the 3'-untranslated region of the PITX1 mRNA. Each of these sequences, when isolated, forms stable quadruplex structures that interact with RHAU. We provide evidence that these quadruplexes exist in the endogenous mRNA; however, we discovered that RHAU is tethered to the mRNA via an alternative non-quadruplex-forming region. RHAU knockdown by small interfering RNA results in significant increases in PITX1 protein levels with only marginal changes in mRNA, suggesting a role for RHAU in translational regulation. Involvement of components of the microRNA machinery is supported by similar and non-additive increases in PITX1 protein expression on Dicer and combined RHAU/Dicer knockdown. We also demonstrate a requirement of argonaute-2, a key RNA-induced silencing complex component, to mediate RHAU-dependent changes in PITX1 protein levels. These results demonstrate a novel role for RHAU in microRNA-mediated translational regulation at a quadruplex-containing 3'-untranslated region.
Yeakley, J M; Hedjran, F; Morfin, J P; Merillat, N; Rosenfeld, M G; Emeson, R B
1993-01-01
The calcitonin/calcitonin gene-related peptide (CGRP) primary transcript is alternatively spliced in thyroid C cells and neurons, resulting in the tissue-specific production of calcitonin and CGRP mRNAs. Analyses of mutated calcitonin/CGRP transcription units in permanently transfected cell lines have indicated that alternative splicing is regulated by a differential capacity to utilize the calcitonin-specific splice acceptor. The analysis of an extensive series of mutations suggests that tissue-specific regulation of calcitonin mRNA production does not depend on the presence of a single, unique cis-active element but instead appears to be a consequence of suboptimal constitutive splicing signals. While only those mutations that altered constitutive splicing signals affected splice choices, the action of multiple regulatory sequences cannot be formally excluded. Further, we have identified a 13-nucleotide purine-rich element from a constitutive exon that, when placed in exon 4, entirely switches splice site usage in CGRP-producing cells. These data suggest that specific exon recruitment sequences, in combination with other constitutive elements, serve an important function in exon recognition. These results are consistent with the hypothesis that tissue-specific alternative splicing of the calcitonin/CGRP primary transcript is mediated by cell-specific differences in components of the constitutive splicing machinery. Images PMID:8413203
Intronic sequences are required for AINTEGUMENTA-LIKE6 expression in Arabidopsis flowers.
Krizek, Beth A
2015-10-12
The AINTEGUMENTA-LIKE6/PLETHORA3 (AIL6/PLT3) gene of Arabidopsis thaliana is a key regulator of growth and patterning in both shoots and roots. AIL6 encodes an AINTEGUMENTA-LIKE/PLETHORA (AIL/PLT) transcription factor that is expressed in the root stem cell niche, the peripheral region of the shoot apical meristem and young lateral organ primordia. In flowers, AIL6 acts redundantly with AINTEGUMENTA (ANT) to regulate floral organ positioning, growth, identity and patterning. Experiments were undertaken to define the genomic regions required for AIL6 function and expression in flowers. Transgenic plants expressing a copy of the coding region of AIL6 in the context of 7.7 kb of 5' sequence and 919 bp of 3' sequence (AIL6:cAIL6-3') fail to fully complement AIL6 function when assayed in the ant-4 ail6-2 double mutant background. In contrast, a genomic copy of AIL6 with the same amount of 5' and 3' sequence (AIL6:gAIL6-3') can fully complement ant-4 ail6-2. In addition, a genomic copy of AIL6 with 590 bp of 5' sequence and 919 bp of 3' sequence (AIL6m:gAIL6-3') complements ant-4 ail6-2 and contains all regulatory elements needed to confer normal AIL6 expression in inflorescences. Efforts to map cis-regulatory elements reveal that the third intron of AIL6 contains enhancer elements that confer expression in young flowers but in a broader pattern than that of AIL6 mRNA in wild-type flowers. Some AIL6:gAIL6-3' and AIL6m:gAIL6-3' lines confer an over-rescue phenotype in the ant-4 ail6-2 background that is correlated with higher levels of AIL6 mRNA accumulation. The results presented here indicate that AIL6 intronic sequences serve as transcriptional enhancer elements. In addition, the results show that increased expression of AIL6 can partially compensate for loss of ANT function in flowers.
Wang, Cheng; Yu, Jie; Kallen, Caleb B
2008-01-01
The proliferating cell nuclear antigen (PCNA) is an essential component of DNA replication, cell cycle regulation, and epigenetic inheritance. High expression of PCNA is associated with poor prognosis in patients with breast cancer. The 5'-region of the PCNA gene contains two computationally-detected estrogen response element (ERE) sequences, one of which is evolutionarily conserved. Both of these sequences are of undocumented cis-regulatory function. We recently demonstrated that estradiol (E2) enhances PCNA mRNA expression in MCF7 breast cancer cells. MCF7 cells proliferate in response to E2. Here, we demonstrate that E2 rapidly enhanced PCNA mRNA and protein expression in a process that requires ERalpha as well as de novo protein synthesis. One of the two upstream ERE sequences was specifically bound by ERalpha-containing protein complexes, in vitro, in gel shift analysis. Yet, each ERE sequence, when cloned as a single copy, or when engineered as two tandem copies of the ERE-containing sequence, was not capable of activating a luciferase reporter construct in response to E2. In MCF7 cells, neither ERE-containing genomic region demonstrated E2-dependent recruitment of ERalpha by sensitive ChIP-PCR assays. We conclude that E2 enhances PCNA gene expression by an indirect process and that computational detection of EREs, even when evolutionarily conserved and when near E2-responsive genes, requires biochemical validation.
Unexpected expansion of tRNA substrate recognition by the yeast m1G9 methyltransferase Trm10.
Swinehart, William E; Henderson, Jeremy C; Jackman, Jane E
2013-08-01
N-1 Methylation of the nearly invariant purine residue found at position 9 of tRNA is a nucleotide modification found in multiple tRNA species throughout Eukarya and Archaea. First discovered in Saccharomyces cerevisiae, the tRNA methyltransferase Trm10 is a highly conserved protein both necessary and sufficient to catalyze all known instances of m1G9 modification in yeast. Although there are 19 unique tRNA species that contain a G at position 9 in yeast, and whose fully modified sequence is known, only 9 of these tRNA species are modified with m1G9 in wild-type cells. The elements that allow Trm10 to distinguish between structurally similar tRNA species are not known, and sequences that are shared between all substrate or all nonsubstrate tRNAs have not been identified. Here, we demonstrate that the in vitro methylation activity of yeast Trm10 is not sufficient to explain the observed pattern of modification in vivo, as additional tRNA species are substrates for Trm10 m1G9 methyltransferase activity. Similarly, overexpression of Trm10 in yeast yields m1G9 containing tRNA species that are ordinarily unmodified in vivo. Thus, yeast Trm10 has a significantly broader tRNA substrate specificity than is suggested by the observed pattern of modification in wild-type yeast. These results may shed light onto the suggested involvement of Trm10 in other pathways in other organisms, particularly in higher eukaryotes that contain up to three different genes with sequence similarity to the single TRM10 gene in yeast, and where these other enzymes have been implicated in pathways beyond tRNA processing.
Rous Sarcoma Virus RNA Stability Element Inhibits Deadenylation of mRNAs with Long 3′UTRs
Balagopal, Vidya; Beemon, Karen L.
2017-01-01
All retroviruses use their full-length primary transcript as the major mRNA for Group-specific antigen (Gag) capsid proteins. This results in a long 3′ untranslated region (UTR) downstream of the termination codon. In the case of Rous sarcoma virus (RSV), there is a 7 kb 3′UTR downstream of the gag terminator, containing the pol, env, and src genes. mRNAs containing long 3′UTRs, like those with premature termination codons, are frequently recognized by the cellular nonsense-mediated mRNA decay (NMD) machinery and targeted for degradation. To prevent this, RSV has evolved an RNA stability element (RSE) in the RNA immediately downstream of the gag termination codon. This 400-nt RNA sequence stabilizes premature termination codons (PTCs) in gag. It also stabilizes globin mRNAs with long 3′UTRs, when placed downstream of the termination codon. It is not clear how the RSE stabilizes the mRNA and prevents decay. We show here that the presence of RSE inhibits deadenylation severely. In addition, the RSE also impairs decapping (DCP2) and 5′-3′ exonucleolytic (XRN1) function in knockdown experiments in human cells. PMID:28763028
Jones, Christopher P; Saadatmand, Jenan; Kleiman, Lawrence; Musier-Forsyth, Karin
2013-02-01
The primer for initiating reverse transcription in human immunodeficiency virus type 1 (HIV-1) is tRNA(Lys3). Host cell tRNA(Lys) is selectively packaged into HIV-1 through a specific interaction between the major tRNA(Lys)-binding protein, human lysyl-tRNA synthetase (hLysRS), and the viral proteins Gag and GagPol. Annealing of the tRNA primer onto the complementary primer-binding site (PBS) in viral RNA is mediated by the nucleocapsid domain of Gag. The mechanism by which tRNA(Lys3) is targeted to the PBS and released from hLysRS prior to annealing is unknown. Here, we show that hLysRS specifically binds to a tRNA anti-codon-like element (TLE) in the HIV-1 genome, which mimics the anti-codon loop of tRNA(Lys) and is located proximal to the PBS. Mutation of the U-rich sequence within the TLE attenuates binding of hLysRS in vitro and reduces the amount of annealed tRNA(Lys3) in virions. Thus, LysRS binds specifically to the TLE, which is part of a larger LysRS binding domain in the viral RNA that includes elements of the Psi packaging signal. Our results suggest that HIV-1 uses molecular mimicry of the anti-codon of tRNA(Lys) to increase the efficiency of tRNA(Lys3) annealing to viral RNA.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ling, Jiqiang; Peterson, Kaitlyn M.; Simonovic, Ivana
2014-03-12
Aminoacyl-tRNA synthetases (aaRSs) ensure faithful translation of mRNA into protein by coupling an amino acid to a set of tRNAs with conserved anticodon sequences. Here, we show that in mitochondria of Saccharomyces cerevisiae, a single aaRS (MST1) recognizes and aminoacylates two natural tRNAs that contain anticodon loops of different size and sequence. Besides a regular ?? with a threonine (Thr) anticodon, MST1 also recognizes an unusual ??, which contains an enlarged anticodon loop and an anticodon triplet that reassigns the CUN codons from leucine to threonine. Our data show that MST1 recognizes the anticodon loop in both tRNAs, but employsmore » distinct recognition mechanisms. The size but not the sequence of the anticodon loop is critical for ?? recognition, whereas the anticodon sequence is essential for aminoacylation of ??. The crystal structure of MST1 reveals that, while lacking the N-terminal editing domain, the enzyme closely resembles the bacterial threonyl-tRNA synthetase (ThrRS). A detailed structural comparison with Escherichia coli ThrRS, which is unable to aminoacylate ??, reveals differences in the anticodon-binding domain that probably allow recognition of the distinct anticodon loops. Finally, our mutational and modeling analyses identify the structural elements in MST1 (e.g., helix {alpha}11) that define tRNA selectivity. Thus, MTS1 exemplifies that a single aaRS can recognize completely divergent anticodon loops of natural isoacceptor tRNAs and that in doing so it facilitates the reassignment of the genetic code in yeast mitochondria.« less
Canale, Aneth S; Venev, Sergey V; Whitfield, Troy W; Caffrey, Daniel R; Marasco, Wayne A; Schiffer, Celia A; Kowalik, Timothy F; Jensen, Jeffrey D; Finberg, Robert W; Zeldovich, Konstantin B; Wang, Jennifer P; Bolon, Daniel N A
2018-04-13
The fitness effects of synonymous mutations can provide insights into biological and evolutionary mechanisms. We analyzed the experimental fitness effects of all single-nucleotide mutations, including synonymous substitutions, at the beginning of the influenza A virus hemagglutinin (HA) gene. Many synonymous substitutions were deleterious both in bulk competition and for individually isolated clones. Investigating protein and RNA levels of a subset of individually expressed HA variants revealed that multiple biochemical properties contribute to the observed experimental fitness effects. Our results indicate that a structural element in the HA segment viral RNA may influence fitness. Examination of naturally evolved sequences in human hosts indicates a preference for the unfolded state of this structural element compared to that found in swine hosts. Our overall results reveal that synonymous mutations may have greater fitness consequences than indicated by simple models of sequence conservation, and we discuss the implications of this finding for commonly used evolutionary tests and analyses. Copyright © 2018. Published by Elsevier Ltd.
Synthetic biology approach for plant protection using dsRNA.
Niehl, Annette; Soininen, Marjukka; Poranen, Minna M; Heinlein, Manfred
2018-02-26
Pathogens induce severe damages on cultivated plants and represent a serious threat to global food security. Emerging strategies for crop protection involve the external treatment of plants with double-stranded (ds)RNA to trigger RNA interference. However, applying this technology in greenhouses and fields depends on dsRNA quality, stability and efficient large-scale production. Using components of the bacteriophage phi6, we engineered a stable and accurate in vivo dsRNA production system in Pseudomonas syringae bacteria. Unlike other in vitro or in vivo dsRNA production systems that rely on DNA transcription and postsynthetic alignment of single-stranded RNA molecules, the phi6 system is based on the replication of dsRNA by an RNA-dependent RNA polymerase, thus allowing production of high-quality, long dsRNA molecules. The phi6 replication complex was reprogrammed to multiply dsRNA sequences homologous to tobacco mosaic virus (TMV) by replacing the coding regions within two of the three phi6 genome segments with TMV sequences and introduction of these constructs into P. syringae together with the third phi6 segment, which encodes the components of the phi6 replication complex. The stable production of TMV dsRNA was achieved by combining all the three phi6 genome segments and by maintaining the natural dsRNA sizes and sequence elements required for efficient replication and packaging of the segments. The produced TMV-derived dsRNAs inhibited TMV propagation when applied to infected Nicotiana benthamiana plants. The established dsRNA production system enables the broad application of dsRNA molecules as an efficient, highly flexible, nontransgenic and environmentally friendly approach for protecting crops against viruses and other pathogens. © 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Identification, variation and transcription of pneumococcal repeat sequences
2011-01-01
Background Small interspersed repeats are commonly found in many bacterial chromosomes. Two families of repeats (BOX and RUP) have previously been identified in the genome of Streptococcus pneumoniae, a nasopharyngeal commensal and respiratory pathogen of humans. However, little is known about the role they play in pneumococcal genetics. Results Analysis of the genome of S. pneumoniae ATCC 700669 revealed the presence of a third repeat family, which we have named SPRITE. All three repeats are present at a reduced density in the genome of the closely related species S. mitis. However, they are almost entirely absent from all other streptococci, although a set of elements related to the pneumococcal BOX repeat was identified in the zoonotic pathogen S. suis. In conjunction with information regarding their distribution within the pneumococcal chromosome, this suggests that it is unlikely that these repeats are specialised sequences performing a particular role for the host, but rather that they constitute parasitic elements. However, comparing insertion sites between pneumococcal sequences indicates that they appear to transpose at a much lower rate than IS elements. Some large BOX elements in S. pneumoniae were found to encode open reading frames on both strands of the genome, whilst another was found to form a composite RNA structure with two T box riboswitches. In multiple cases, such BOX elements were demonstrated as being expressed using directional RNA-seq and RT-PCR. Conclusions BOX, RUP and SPRITE repeats appear to have proliferated extensively throughout the pneumococcal chromosome during the species' past, but novel insertions are currently occurring at a relatively slow rate. Through their extensive secondary structures, they seem likely to affect the expression of genes with which they are co-transcribed. Software for annotation of these repeats is freely available from ftp://ftp.sanger.ac.uk/pub/pathogens/strep_repeats/. PMID:21333003
RNA sequencing uncovers antisense RNAs and novel small RNAs in Streptococcus pyogenes
Le Rhun, Anaïs; Beer, Yan Yan; Reimegård, Johan; Chylinski, Krzysztof; Charpentier, Emmanuelle
2016-01-01
ABSTRACT Streptococcus pyogenes is a human pathogen responsible for a wide spectrum of diseases ranging from mild to life-threatening infections. During the infectious process, the temporal and spatial expression of pathogenicity factors is tightly controlled by a complex network of protein and RNA regulators acting in response to various environmental signals. Here, we focus on the class of small RNA regulators (sRNAs) and present the first complete analysis of sRNA sequencing data in S. pyogenes. In the SF370 clinical isolate (M1 serotype), we identified 197 and 428 putative regulatory RNAs by visual inspection and bioinformatics screening of the sequencing data, respectively. Only 35 from the 197 candidates identified by visual screening were assigned a predicted function (T-boxes, ribosomal protein leaders, characterized riboswitches or sRNAs), indicating how little is known about sRNA regulation in S. pyogenes. By comparing our list of predicted sRNAs with previous S. pyogenes sRNA screens using bioinformatics or microarrays, 92 novel sRNAs were revealed, including antisense RNAs that are for the first time shown to be expressed in this pathogen. We experimentally validated the expression of 30 novel sRNAs and antisense RNAs. We show that the expression profile of 9 sRNAs including 2 predicted regulatory elements is affected by the endoribonucleases RNase III and/or RNase Y, highlighting the critical role of these enzymes in sRNA regulation. PMID:26580233
Fritz, David T; Jiang, Shan; Xu, Junwang; Rogers, Melissa B
2006-07-01
The bone morphogenetic protein (BMP)2 gene has been genetically linked to osteoporosis and osteoarthritis. We have shown that the 3'-untranslated regions (UTR) of BMP2 genes from mammals to fishes are extraordinarily conserved. This indicates that the BMP2 3'-UTR is under stringent selective pressure. We present evidence that the conserved region is a strong posttranscriptional regulator of BMP2 expression. Polymorphisms in cis-regulatory elements have been proven to influence susceptibility to a growing number of diseases. A common single nucleotide polymorphism (SNP) disrupts a putative posttranscriptional regulatory motif, an AU-rich element, within the BMP2 3'-UTR. The affinity of specific proteins for the rs15705 SNP sequence differs from their affinity for the normal human sequence. More importantly, the in vitro decay rate of RNAs with the SNP is higher than that of RNAs with the normal sequence. Such changes in mRNA:protein interactions may influence the posttranscriptional mechanisms that control BMP2 gene expression. The consequent alterations in BMP2 protein levels may influence the development or physiology of bone or other BMP2-influenced tissues.
Hess, M A; Duncan, R F
1996-01-01
Preferential translation of Drosophila heat shock protein 70 (Hsp70) mRNA requires only the 5'-untranslated region (5'-UTR). The sequence of this region suggests that it has relatively little secondary structure, which may facilitate efficient protein synthesis initiation. To determine whether minimal 5'-UTR secondary structure is required for preferential translation during heat shock, the effect of introducing stem-loops into the Hsp70 mRNA 5'-UTR was measured. Stem-loops of -11 kcal/mol abolished translation during heat shock, but did not reduce translation in non-heat shocked cells. A -22 kcal/mol stem-loop was required to comparably inhibit translation during growth at normal temperatures. To investigate whether specific sequence elements are also required for efficient preferential translation, deletion and mutation analyses were conducted in a truncated Hsp70 5'-UTR containing only the cap-proximal and AUG-proximal segments. Linker-scanner mutations in the cap-proximal segment (+1 to +37) did not impair translation. Re-ordering the segments reduced mRNA translational efficiency by 50%. Deleting the AUG-proximal segment severely inhibited translation. A 5-extension of the full-length leader specifically impaired heat shock translation. These results indicate that heat shock reduces the capacity to unwind 5-UTR secondary structure, allowing only mRNAs with minimal 5'-UTR secondary structure to be efficiently translated. A function for specific sequences is also suggested. PMID:8710519
Loke, Johnny C.; Stahlberg, Eric A.; Strenski, David G.; Haas, Brian J.; Wood, Paul Chris; Li, Qingshun Quinn
2005-01-01
Using a novel program, SignalSleuth, and a database containing authenticated polyadenylation [poly(A)] sites, we analyzed the composition of mRNA poly(A) signals in Arabidopsis (Arabidopsis thaliana), and reevaluated previously described cis-elements within the 3′-untranslated (UTR) regions, including near upstream elements and far upstream elements. As predicted, there are absences of high-consensus signal patterns. The AAUAAA signal topped the near upstream elements patterns and was found within the predicted location to only approximately 10% of 3′-UTRs. More importantly, we identified a new set, named cleavage elements, of poly(A) signals flanking both sides of the cleavage site. These cis-elements were not previously revealed by conventional mutagenesis and are contemplated as a cluster of signals for cleavage site recognition. Moreover, a single-nucleotide profile scan on the 3′-UTR regions unveiled a distinct arrangement of alternate stretches of U and A nucleotides, which led to a prediction of the formation of secondary structures. Using an RNA secondary structure prediction program, mFold, we identified three main types of secondary structures on the sequences analyzed. Surprisingly, these observed secondary structures were all interrupted in previously constructed mutations in these regions. These results will enable us to revise the current model of plant poly(A) signals and to develop tools to predict 3′-ends for gene annotation. PMID:15965016
Cold shock protein YB-1 is involved in hypoxia-dependent gene transcription.
Rauen, Thomas; Frye, Bjoern C; Wang, Jialin; Raffetseder, Ute; Alidousty, Christina; En-Nia, Abdelaziz; Floege, Jürgen; Mertens, Peter R
2016-09-16
Hypoxia-dependent gene regulation is largely orchestrated by hypoxia-inducible factors (HIFs), which associate with defined nucleotide sequences of hypoxia-responsive elements (HREs). Comparison of the regulatory HRE within the 3' enhancer of the human erythropoietin (EPO) gene with known binding motifs for cold shock protein Y-box (YB) protein-1 yielded strong similarities within the Y-box element and 3' adjacent sequences. DNA binding assays confirmed YB-1 binding to both, single- and double-stranded HRE templates. Under hypoxia, we observed nuclear shuttling of YB-1 and co-immunoprecipitation assays demonstrated that YB-1 and HIF-1α physically interact with each other. Cellular YB-1 depletion using siRNA significantly induced hypoxia-dependent EPO production at both, promoter and mRNA level. Vice versa, overexpressed YB-1 significantly reduced EPO-HRE-dependent gene transcription, whereas this effect was minor under normoxia. HIF-1α overexpression induced hypoxia-dependent gene transcription through the same element and accordingly, co-expression with YB-1 reduced HIF-1α-mediated EPO induction under hypoxic conditions. Taken together, we identified YB-1 as a novel binding factor for HREs that participates in fine-tuning of the hypoxia transcriptome. Copyright © 2016 Elsevier Inc. All rights reserved.
Finster, Kai Waldemar; Kjeldsen, Kasper Urup; Kube, Michael; Reinhardt, Richard; Mussmann, Marc; Amann, Rudolf; Schreiber, Lars
2013-04-15
Desulfocapsa sulfexigens SB164P1 (DSM 10523) belongs to the deltaproteobacterial family Desulfobulbaceae and is one of two validly described members of its genus. This strain was selected for genome sequencing, because it is the first marine bacterium reported to thrive on the disproportionation of elemental sulfur, a process with a unresolved enzymatic pathway in which elemental sulfur serves both as electron donor and electron acceptor. Furthermore, in contrast to its phylogenetically closest relatives, which are dissimilatory sulfate-reducers, D. sulfexigens is unable to grow by sulfate reduction and appears metabolically specialized in growing by disproportionating elemental sulfur, sulfite or thiosulfate with CO2 as the sole carbon source. The genome of D. sulfexigens contains the set of genes that is required for nitrogen fixation. In an acetylene assay it could be shown that the strain reduces acetylene to ethylene, which is indicative for N-fixation. The circular chromosome of D. sulfexigens SB164P1 comprises 3,986,761 bp and harbors 3,551 protein-coding genes of which 78% have a predicted function based on auto-annotation. The chromosome furthermore encodes 46 tRNA genes and 3 rRNA operons.
Finster, Kai Waldemar; Kjeldsen, Kasper Urup; Kube, Michael; Reinhardt, Richard; Mussmann, Marc; Amann, Rudolf; Schreiber, Lars
2013-01-01
Desulfocapsa sulfexigens SB164P1 (DSM 10523) belongs to the deltaproteobacterial family Desulfobulbaceae and is one of two validly described members of its genus. This strain was selected for genome sequencing, because it is the first marine bacterium reported to thrive on the disproportionation of elemental sulfur, a process with a unresolved enzymatic pathway in which elemental sulfur serves both as electron donor and electron acceptor. Furthermore, in contrast to its phylogenetically closest relatives, which are dissimilatory sulfate-reducers, D. sulfexigens is unable to grow by sulfate reduction and appears metabolically specialized in growing by disproportionating elemental sulfur, sulfite or thiosulfate with CO2 as the sole carbon source. The genome of D. sulfexigens contains the set of genes that is required for nitrogen fixation. In an acetylene assay it could be shown that the strain reduces acetylene to ethylene, which is indicative for N-fixation. The circular chromosome of D. sulfexigens SB164P1 comprises 3,986,761 bp and harbors 3,551 protein-coding genes of which 78% have a predicted function based on auto-annotation. The chromosome furthermore encodes 46 tRNA genes and 3 rRNA operons. PMID:23961312
RNA:RNA interaction can enhance RNA localization in Drosophila oocytes
Hartswood, Eve; Brodie, Jim; Vendra, Georgia; Davis, Ilan; Finnegan, David J.
2012-01-01
RNA localization is a key mechanism for targeting proteins to particular subcellular domains. Sequences necessary and sufficient for localization have been identified, but little is known about factors that affect its kinetics. Transcripts of gurken and the I factor, a non-LTR retrotransposon, colocalize at the nucleus in the dorso–antero corner of the Drosophila oocyte directed by localization signals, the GLS and ILS. I factor RNA localizes faster than gurken after injection into oocytes, due to a difference in the intrinsic localization ability of the GLS and ILS. The kinetics of localization of RNA containing the ILS are enhanced by the presence of a stem–loop, the A loop. This acts as an RNA:RNA interaction element in vivo and in vitro, and stimulates localization of RNA containing other localization signals. RNA:RNA interaction may be a general mechanism for modulating RNA localization and could allow an mRNA that lacks a localization signal to hitchhike on another RNA that has one. PMID:22345148
Lui, Lauren M; Uzilov, Andrew V; Bernick, David L; Corredor, Andrea; Lowe, Todd M; Dennis, Patrick P
2018-05-16
Archaeal homologs of eukaryotic C/D box small nucleolar RNAs (C/D box sRNAs) guide precise 2'-O-methyl modification of ribosomal and transfer RNAs. Although C/D box sRNA genes constitute one of the largest RNA gene families in archaeal thermophiles, most genomes have incomplete sRNA gene annotation because reliable, fully automated detection methods are not available. We expanded and curated a comprehensive gene set across six species of the crenarchaeal genus Pyrobaculum, particularly rich in C/D box sRNA genes. Using high-throughput small RNA sequencing, specialized computational searches and comparative genomics, we analyzed 526 Pyrobaculum C/D box sRNAs, organizing them into 110 families based on synteny and conservation of guide sequences which determine methylation targets. We examined gene duplications and rearrangements, including one family that has expanded in a pattern similar to retrotransposed repetitive elements in eukaryotes. New training data and inclusion of kink-turn secondary structural features enabled creation of an improved search model. Our analyses provide the most comprehensive, dynamic view of C/D box sRNA evolutionary history within a genus, in terms of modification function, feature plasticity, and gene mobility.
Zhang, Fan; Zhang, Bing; Xiang, Hua; Hu, Songnian
2009-11-01
Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) is a widespread system that provides acquired resistance against phages in bacteria and archaea. Here we aim to genome-widely analyze the CRISPR in extreme halophilic archaea, of which the whole genome sequences are available at present time. We used bioinformatics methods including alignment, conservation analysis, GC content and RNA structure prediction to analyze the CRISPR structures of 7 haloarchaeal genomes. We identified the CRISPR structures in 5 halophilic archaea and revealed a conserved palindromic motif in the flanking regions of these CRISPR structures. In addition, we found that the repeat sequences of large CRISPR structures in halophilic archaea were greatly conserved, and two types of predicted RNA secondary structures derived from the repeat sequences were likely determined by the fourth base of the repeat sequence. Our results support the proposal that the leader sequence may function as recognition site by having palindromic structures in flanking regions, and the stem-loop secondary structure formed by repeat sequences may function in mediating the interaction between foreign genetic elements and CAS-encoded proteins.
Fort, Philippe; Albertini, Aurélie; Van-Hua, Aurélie; Berthomieu, Arnaud; Roche, Stéphane; Delsuc, Frédéric; Pasteur, Nicole; Capy, Pierre; Gaudin, Yves; Weill, Mylène
2012-01-01
Retroelements represent a considerable fraction of many eukaryotic genomes and are considered major drives for adaptive genetic innovations. Recent discoveries showed that despite not normally using DNA intermediates like retroviruses do, Mononegaviruses (i.e., viruses with nonsegmented, negative-sense RNA genomes) can integrate gene fragments into the genomes of their hosts. This was shown for Bornaviridae and Filoviridae, the sequences of which have been found integrated into the germ line cells of many vertebrate hosts. Here, we show that Rhabdoviridae sequences, the major Mononegavirales family, have integrated only into the genomes of arthropod species. We identified 185 integrated rhabdoviral elements (IREs) coding for nucleoproteins, glycoproteins, or RNA-dependent RNA polymerases; they were mostly found in the genomes of the mosquito Aedes aegypti and the blacklegged tick Ixodes scapularis. Phylogenetic analyses showed that most IREs in A. aegypti derived from multiple independent integration events. Since RNA viruses are submitted to much higher substitution rates as compared with their hosts, IREs thus represent fossil traces of the diversity of extinct Rhabdoviruses. Furthermore, analyses of orthologous IREs in A. aegypti field mosquitoes sampled worldwide identified an integrated polymerase IRE fragment that appeared under purifying selection within several million years, which supports a functional role in the host's biology. These results show that A. aegypti was subjected to repeated Rhabdovirus infectious episodes during its evolution history, which led to the accumulation of many integrated sequences. They also suggest that like retroviruses, integrated rhabdoviral sequences may participate actively in the evolution of their hosts.
Functional 5' UTR mRNA structures in eukaryotic translation regulation and how to find them.
Leppek, Kathrin; Das, Rhiju; Barna, Maria
2018-03-01
RNA molecules can fold into intricate shapes that can provide an additional layer of control of gene expression beyond that of their sequence. In this Review, we discuss the current mechanistic understanding of structures in 5' untranslated regions (UTRs) of eukaryotic mRNAs and the emerging methodologies used to explore them. These structures may regulate cap-dependent translation initiation through helicase-mediated remodelling of RNA structures and higher-order RNA interactions, as well as cap-independent translation initiation through internal ribosome entry sites (IRESs), mRNA modifications and other specialized translation pathways. We discuss known 5' UTR RNA structures and how new structure probing technologies coupled with prospective validation, particularly compensatory mutagenesis, are likely to identify classes of structured RNA elements that shape post-transcriptional control of gene expression and the development of multicellular organisms.
Problem-Solving Test: Conditional Gene Targeting Using the Cre/loxP Recombination System
ERIC Educational Resources Information Center
Szeberényi, József
2013-01-01
Terms to be familiar with before you start to solve the test: gene targeting, knock-out mutation, bacteriophage, complementary base-pairing, homologous recombination, deletion, transgenic organisms, promoter, polyadenylation element, transgene, DNA replication, RNA polymerase, Shine-Dalgarno sequence, restriction endonuclease, polymerase chain…
Hay, Elizabeth Anne; Khalaf, Abdulla Razak; Marini, Pietro; Brown, Andrew; Heath, Karyn; Sheppard, Darrin; MacKenzie, Alasdair
2017-08-01
We have successfully used comparative genomics to identify putative regulatory elements within the human genome that contribute to the tissue specific expression of neuropeptides such as galanin and receptors such as CB1. However, a previous inability to rapidly delete these elements from the mouse genome has prevented optimal assessment of their function in-vivo. This has been solved using CAS9/CRISPR genome editing technology which uses a bacterial endonuclease called CAS9 that, in combination with specifically designed guide RNA (gRNA) molecules, cuts specific regions of the mouse genome. However, reports of "off target" effects, whereby the CAS9 endonuclease is able to cut sites other than those targeted, limits the appeal of this technology. We used cytoplasmic microinjection of gRNA and CAS9 mRNA into 1-cell mouse embryos to rapidly generate enhancer knockout mouse lines. The current study describes our analysis of the genomes of these enhancer knockout lines to detect possible off-target effects. Bioinformatic analysis was used to identify the most likely putative off-target sites and to design PCR primers that would amplify these sequences from genomic DNA of founder enhancer deletion mouse lines. Amplified DNA was then sequenced and blasted against the mouse genome sequence to detect off-target effects. Using this approach we were unable to detect any evidence of off-target effects in the genomes of three founder lines using any of the four gRNAs used in the analysis. This study suggests that the problem of off-target effects in transgenic mice have been exaggerated and that CAS9/CRISPR represents a highly effective and accurate method of deleting putative neuropeptide gene enhancer sequences from the mouse genome. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.
Churakov, Gennady; Smit, Arian F A; Brosius, Jürgen; Schmitz, Jürgen
2005-04-01
About half of the mammalian genome is composed of retroposons. Long interspersed elements (LINEs) and short interspersed elements (SINEs) are the most abundant repetitive elements and account for about 21% and 13% of the human genome, respectively. SINEs have been detected in all major mammalian lineages, except for the South American order Xenarthra, also termed Edentata (armadillos, anteaters, and sloths). Investigating this order, we discovered a novel high-copy-number family of tRNA derived SINEs in the nine-banded armadillo Dasypus novemcinctus, a species that successfully crossed the Central American land bridge to North America in the Pliocene. A specific computer algorithm was developed, and we detected and extracted 687 specific SINEs from databases. Termed DAS-SINEs, we further divided them into six distinct subfamilies. We extracted tRNA(Ala)-derived monomers, two types of dimers, and three subfamilies of chimeric fusion products of a tRNA(Ala) domain and an approximately 180-nt sequence of thus far unidentified origin. Comparisons of secondary structures of the DAS-SINEs' tRNA domains suggest selective pressure to maintain a tRNA-like D-arm structure in the respective founder RNAs, as shown by compensatory mutations. By analysis of subfamily-specific genetic variability, comparison of the proportion of direct repeats, and analysis of self-integrations as well as key events of dimerization and deletions or insertions, we were able to delineate the evolutionary history of the DAS-SINE subfamilies.
Saunders, Keith; Lomonossoff, George P
2017-01-01
We have utilized plant-based transient expression to produce tobacco mosaic virus (TMV)-based nano-rods of predetermined lengths. This is achieved by expressing RNAs containing the TMV origin of assembly sequence (OAS) and the sequence of the TMV coat protein either on the same RNA molecule or on two separate constructs. We show that the length of the resulting nano-rods is dependent upon the length of the RNA that possesses the OAS element. By expressing a version of the TMV coat protein that incorporates a metal-binding peptide at its C-terminus in the presence of RNA containing the OAS we have been able to produce nano-rods of predetermined length that are coated with cobalt-platinum. These nano-rods have the properties of defined-length nano-wires that make them ideal for many developing bionanotechnological processes.
Saunders, Keith; Lomonossoff, George P.
2017-01-01
We have utilized plant-based transient expression to produce tobacco mosaic virus (TMV)-based nano-rods of predetermined lengths. This is achieved by expressing RNAs containing the TMV origin of assembly sequence (OAS) and the sequence of the TMV coat protein either on the same RNA molecule or on two separate constructs. We show that the length of the resulting nano-rods is dependent upon the length of the RNA that possesses the OAS element. By expressing a version of the TMV coat protein that incorporates a metal-binding peptide at its C-terminus in the presence of RNA containing the OAS we have been able to produce nano-rods of predetermined length that are coated with cobalt-platinum. These nano-rods have the properties of defined-length nano-wires that make them ideal for many developing bionanotechnological processes. PMID:28878782
Heyduk, E; Baichoo, N; Heyduk, T
2001-11-30
The alpha-subunit of Escherichia coli RNA polymerase plays an important role in the activity of many promoters by providing a direct protein-DNA contact with a specific sequence (UP element) located upstream of the core promoter sequence. To obtain insight into the nature of thermodynamic forces involved in the formation of this protein-DNA contact, the binding of the alpha-subunit of E. coli RNA polymerase to a fluorochrome-labeled DNA fragment containing the rrnB P1 promoter UP element sequence was quantitatively studied using fluorescence polarization. The alpha dimer and DNA formed a 1:1 complex in solution. Complex formation at 25 degrees C was enthalpy-driven, the binding was accompanied by a net release of 1-2 ions, and no significant specific ion effects were observed. The van't Hoff plot of temperature dependence of binding was linear suggesting that the heat capacity change (Deltac(p)) was close to zero. Protein footprinting with hydroxyradicals showed that the protein did not change its conformation upon protein-DNA contact formation. No conformational changes in the DNA molecule were detected by CD spectroscopy upon protein-DNA complex formation. The thermodynamic characteristics of the binding together with the lack of significant conformational changes in the protein and in the DNA suggested that the alpha-subunit formed a rigid body-like contact with the DNA in which a tight complementary recognition interface between alpha-subunit and DNA was not formed.
Salehipour, Pouya; Nematzadeh, Mahsa; Mobasheri, Maryam Beigom; Afsharpad, Mandana; Mansouri, Kamran; Modarressi, Mohammad Hossein
2017-09-01
Testis specific gene antigen 10 (TSGA10) is a cancer testis antigen involved in the process of spermatogenesis. TSGA10 could also play an important role in the inhibition of angiogenesis by preventing nuclear localization of HIF-1α. Although it has been shown that TSGA10 messenger RNA (mRNA) is mainly expressed in testis and some tumors, the transcription pattern and regulatory mechanisms of this gene remain largely unknown. Here, we report that human TSGA10 comprises at least 22 exons and generates four different transcript variants. It was identified that using two distinct promoters and splicing of exons 4 and 7 produced these transcript variants, which have the same coding sequence, but the sequence of 5'untanslated region (5'UTR) is different between them. This is significant because conserved regulatory RNA elements like upstream open reading frame (uORF) and putative internal ribosome entry site (IRES) were found in this region which have different combinations in each transcript variant and it may influence translational efficiency of them in normal or unusual environmental conditions like hypoxia. To indicate the transcription pattern of TSGA10 in breast cancer, expression of identified transcript variants was analyzed in 62 breast cancer samples. We found that TSGA10 tends to express variants with shorter 5'UTR and fewer uORF elements in breast cancer tissues. Our study demonstrates for the first time the expression of different TSGA10 transcript variants in testis and breast cancer tissues and provides a first clue to a role of TSGA10 5'UTR in regulation of translation in unusual environmental conditions like hypoxia. Copyright © 2017. Published by Elsevier B.V.
Characterization of Rous sarcoma virus polyadenylation site use in vitro
DOE Office of Scientific and Technical Information (OSTI.GOV)
Maciolek, Nicole L.; McNally, Mark T.
2008-05-10
Polyadenylation of Rous sarcoma virus (RSV) RNA is inefficient, as approximately 15% of RSV RNAs represent read-through transcripts that use a downstream cellular polyadenylation site (poly(A) site). Read-through transcription has implications for the virus and the host since it is associated with oncogene capture and tumor induction. To explore the basis of inefficient RSV RNA 3'-end formation, we characterized RSV polyadenylation in vitro using HeLa cell nuclear extracts and HEK293 whole cell extracts. RSV polyadenylation substrates composed of the natural 3' end of viral RNA and various lengths of upstream sequence showed little or no polyadenylation, indicating that the RSVmore » poly(A) site is suboptimal. Efficiently used poly(A) sites often have identifiable upstream and downstream elements (USEs and DSEs) in close proximity to the conserved AAUAAA signal. The sequences upstream and downstream of the RSV poly(A) site deviate from those found in efficiently used poly(A) sites, which may explain inefficient RSV polyadenylation. To assess the quality of the RSV USEs and DSEs, the well-characterized SV40 late USEs and/or DSEs were substituted for the RSV elements and vice versa, which showed that the USEs and DSEs from RSV are suboptimal but functional. CstF interacted poorly with the RSV polyadenylation substrate, and the inactivity of the RSV poly(A) site was at least in part due to poor CstF binding since tethering CstF to the RSV substrate activated polyadenylation. Our data are consistent with poor polyadenylation factor binding sites in both the USE and DSE as the basis for inefficient use of the RSV poly(A) site and point to the importance of additional elements within RSV RNA in promoting 3' end formation.« less
Liu, Dong; Zhu, Guoli; Tang, Wenqiao; Yang, Jinquan; Guo, Hongyi
2012-01-01
Short interspersed nucleotide elements (SINEs), a type of retrotransposon, are widely distributed in various genomes with multiple copies arranged in different orientations, and cause changes to genes and genomes during evolutionary history. This can provide the basis for determining genome diversity, genetic variation and molecular phylogeny, etc. SINE DNA is transcribed into RNA by polymerase III from an internal promoter, which is composed of two conserved boxes, box A and box B. Here we present an approach to isolate novel SINEs based on these promoter elements. Box A of a SINE is obtained via PCR with only one primer identical to box B (B-PCR). Box B and its downstream sequence are acquired by PCR with one primer corresponding to box A (A-PCR). The SINE clone produced by A-PCR is selected as a template to label a probe with biotin. The full-length SINEs are isolated from the genomic pool through complex capture using the biotinylated probe bound to magnetic particles. Using this approach, a novel SINE family, Cn-SINE, from the genomes of Coilia nasus, was isolated. The members are 180-360 bp long. Sequence homology suggests that Cn-SINEs evolved from a leucine tRNA gene. This is the first report of a tRNA(Leu)-related SINE obtained without the use of a genomic library or inverse PCR. These results provide new insights into the origin of SINEs.
Functional noncoding sequences derived from SINEs in the mammalian genome
Nishihara, Hidenori; Smit, Arian F.A.; Okada, Norihiro
2006-01-01
Recent comparative analyses of mammalian sequences have revealed that a large number of nonprotein-coding genomic regions are under strong selective constraint. Here, we report that some of these loci have been derived from a newly defined family of ancient SINEs (short interspersed repetitive elements). This is a surprising result, as SINEs and other transposable elements are commonly thought to be genomic parasites. We named the ancient SINE family AmnSINE1, for Amniota SINE1, because we found it to be present in mammals as well as in birds, and some copies predate the mammalian-bird split 310 million years ago (Mya). AmnSINE1 has a chimeric structure of a 5S rRNA and a tRNA-derived SINE, and is related to five tRNA-derived SINE families that we characterized here in the coelacanth, dogfish shark, hagfish, and amphioxus genomes. All of the newly described SINE families have a common central domain that is also shared by zebrafish SINE3, and we collectively name them the DeuSINE (Deuterostomia SINE) superfamily. Notably, of the ∼1000 still identifiable copies of AmnSINE1 in the human genome, 105 correspond to loci phylogenetically highly conserved among mammalian orthologs. The conservation is strongest over the central domain. Thus, AmnSINE1 appears to be the best example of a transposable element of which a significant fraction of the copies have acquired genomic functionality. PMID:16717141
The conservation and function of RNA secondary structure in plants
Vandivier, Lee E.; Anderson, Stephen J.; Foley, Shawn W.; Gregory, Brian D.
2016-01-01
RNA transcripts fold into secondary structures via intricate patterns of base pairing. These secondary structures impart catalytic, ligand binding, and scaffolding functions to a wide array of RNAs, forming a critical node of biological regulation. Among their many functions, RNA structural elements modulate epigenetic marks, alter mRNA stability and translation, regulate alternative splicing, transduce signals, and scaffold large macromolecular complexes. Thus, the study of RNA secondary structure is critical to understanding the function and regulation of RNA transcripts. Here, we review the origins, form, and function of RNA secondary structure, focusing on plants. We then provide an overview of methods for probing secondary structure, from physical methods such as X-ray crystallography and nuclear magnetic resonance imaging (NMR) to chemical and nuclease probing methods. Marriage with high-throughput sequencing has enabled these latter methods to scale across whole transcriptomes, yielding tremendous new insights into the form and function of RNA secondary structure. PMID:26865341
A Novel Collection of snRNA-Like Promoters with Tissue-Specific Transcription Properties
Garritano, Sonia; Gigoni, Arianna; Costa, Delfina; Malatesta, Paolo; Florio, Tullio; Cancedda, Ranieri; Pagano, Aldo
2012-01-01
We recently identified a novel dataset of snRNA-like trascriptional units in the human genome. The investigation of a subset of these elements showed that they play relevant roles in physiology and/or pathology. In this work we expand our collection of small RNAs taking advantage of a newly developed algorithm able to identify genome sequence stretches with RNA polymerase (pol) III type 3 promoter features thus constituting putative pol III binding sites. The bioinformatic analysis of a subset of these elements that map in introns of protein-coding genes in antisense configuration suggest their association with alternative splicing, similarly to other recently characterized small RNAs. Interestingly, the analysis of the transcriptional activity of these novel promoters shows that they are active in a cell-type specific manner, in accordance with the emerging body of evidence of a tissue/cell-specific activity of pol III. PMID:23109855
A novel collection of snRNA-like promoters with tissue-specific transcription properties.
Garritano, Sonia; Gigoni, Arianna; Costa, Delfina; Malatesta, Paolo; Florio, Tullio; Cancedda, Ranieri; Pagano, Aldo
2012-01-01
We recently identified a novel dataset of snRNA-like trascriptional units in the human genome. The investigation of a subset of these elements showed that they play relevant roles in physiology and/or pathology. In this work we expand our collection of small RNAs taking advantage of a newly developed algorithm able to identify genome sequence stretches with RNA polymerase (pol) III type 3 promoter features thus constituting putative pol III binding sites. The bioinformatic analysis of a subset of these elements that map in introns of protein-coding genes in antisense configuration suggest their association with alternative splicing, similarly to other recently characterized small RNAs. Interestingly, the analysis of the transcriptional activity of these novel promoters shows that they are active in a cell-type specific manner, in accordance with the emerging body of evidence of a tissue/cell-specific activity of pol III.
Iyer, Lakshminarayan M; Abhiman, Saraswathi; Aravind, L
2008-10-04
Using sequence profile methods and structural comparisons we characterize a previously unknown family of nucleic acid polymerases in a group of mobile elements from genomes of diverse bacteria, an algal plastid and certain DNA viruses, including the recently reported Sputnik virus. Using contextual information from domain architectures and gene-neighborhoods we present evidence that they are likely to possess both primase and DNA polymerase activity, comparable to the previously reported prim-pol proteins. These newly identified polymerases help in defining the minimal functional core of superfamily A DNA polymerases and related RNA polymerases. Thus, they provide a framework to understand the emergence of both DNA and RNA polymerization activity in this class of enzymes. They also provide evidence that enigmatic DNA viruses, such as Sputnik, might have emerged from mobile elements coding these polymerases.
Iyer, Lakshminarayan M; Abhiman, Saraswathi; Aravind, L
2008-01-01
Using sequence profile methods and structural comparisons we characterize a previously unknown family of nucleic acid polymerases in a group of mobile elements from genomes of diverse bacteria, an algal plastid and certain DNA viruses, including the recently reported Sputnik virus. Using contextual information from domain architectures and gene-neighborhoods we present evidence that they are likely to possess both primase and DNA polymerase activity, comparable to the previously reported prim-pol proteins. These newly identified polymerases help in defining the minimal functional core of superfamily A DNA polymerases and related RNA polymerases. Thus, they provide a framework to understand the emergence of both DNA and RNA polymerization activity in this class of enzymes. They also provide evidence that enigmatic DNA viruses, such as Sputnik, might have emerged from mobile elements coding these polymerases. This article was reviewed by Eugene Koonin and Mark Ragan. PMID:18834537
Franco, Bernardo; Hernández, Roberto; López-Villaseñor, Imelda
2012-09-01
Trichomonas vaginalis is a parasitic protozoan of both medical and biological relevance. Transcriptional studies in this organism have focused mainly on type II pol promoters, whereas the elements necessary for transcription by polI or polIII have not been investigated. Here, with the aid of a transient transcription system, we characterised the rDNA intergenic region, defining both the promoter and the terminator sequences required for transcription. We defined the promoter as a compact region of approximately 180 bp. We also identified a potential upstream control element (UCE) that was located 80 bp upstream of the transcription start point (TSP). A transcription termination element was identified within a 34 bp region that was located immediately downstream of the 28S coding sequence. The function of this element depends upon polarity and the presence of both a stretch of uridine residues (U's) and a hairpin structure in the transcript. Our observations provide a strong basis for the study of DNA recognition by the polI transcriptional machinery in this early divergent organism. Copyright © 2012 Elsevier B.V. All rights reserved.
Kresoja-Rakic, Jelena; Felley-Bosco, Emanuela
2018-04-25
The in vitro RNA-pulldown is still largely used in the first steps of protocols aimed at identifying RNA-binding proteins that recognize specific RNA structures and motifs. In this RNA-pulldown protocol, commercially synthesized RNA probes are labeled with a modified form of biotin, desthiobiotin, at the 3' terminus of the RNA strand, which reversibly binds to streptavidin and thus allows elution of proteins under more physiological conditions. The RNA-desthiobiotin is immobilized through interaction with streptavidin on magnetic beads, which are used to pull down proteins that specifically interact with the RNA of interest. Non-denatured and active proteins from the cytosolic fraction of mesothelioma cells are used as the source of proteins. The method described here can be applied to detect the interaction between known RNA binding proteins and a 25-nucleotide (nt) long RNA probe containing a sequence of interest. This is useful to complete the functional characterization of stabilizing or destabilizing elements present in RNA molecules achieved using a reporter vector assay.
Hfq restructures RNA-IN and RNA-OUT and facilitates antisense pairing in the Tn10/IS10 system
Ross, Joseph A.; Ellis, Michael J.; Hossain, Shahan; Haniford, David B.
2013-01-01
Hfq functions in post-transcriptional gene regulation in a wide range of bacteria, usually by promoting base-pairing of mRNAs and trans-encoded sRNAs that share partial sequence complementarity. It is less clear if Hfq is required for pairing of cis-encoded RNAs (i.e., antisense RNAs) with their target mRNAs. In the current work, we have characterized the interactions between Escherichia coli Hfq and the components of the Tn10/IS10 antisense system, RNA-IN and RNA-OUT. We show that Hfq interacts with RNA-OUT through its proximal RNA-binding surface, as is typical for Hfq and trans-encoded sRNAs. In contrast, RNA-IN binds both proximal and distal RNA-binding surfaces in Hfq with a higher affinity for the latter, as is typical for mRNA interactions in canonical sRNA-mRNA pairs. Importantly, an amino acid substitution in Hfq that interferes with RNA binding to the proximal site negatively impacts RNA-IN:OUT pairing in vitro and suppresses the ability of Hfq to negatively regulate IS10 transposition in vivo. We also show that Hfq binding to RNA-IN and RNA-OUT alters secondary structure elements in both of these RNAs and speculate that this could be important in how Hfq facilitates RNA-IN:OUT pairing. Based on the results presented here, we suggest that Hfq could be involved in regulating RNA pairing in other antisense systems, including systems encoded by other transposable elements. PMID:23510801
1993-10-30
hammerhead ribozymes (7-9) and a hairpin ribozyme (10) directed against HIV-l RNA has been shown to confer significant resistance to HIV-I infection...antisense oligodeoxynucleotides (ODN) directed to the Rev Response Element (RRE) and ribozymes that target viral mRNAs. The ribozyme approach, in...particular, has yielded extremely encouraging positive data. We showed that a hairpin ribozyme designed to cleave HIV-1 RNA in the 5’ leader sequence
Genome-wide mapping of autonomous promoter activity in human cells
van Arensbergen, Joris; FitzPatrick, Vincent D.; de Haas, Marcel; Pagie, Ludo; Sluimer, Jasper; Bussemaker, Harmen J.; van Steensel, Bas
2017-01-01
Previous methods to systematically characterize sequence-intrinsic activity of promoters have been limited by relatively low throughput and the length of sequences that could be tested. Here we present Survey of Regulatory Elements (SuRE), a method to assay more than 108 DNA fragments, each 0.2–2kb in size, for their ability to drive transcription autonomously. In SuRE, a plasmid library is constructed of random genomic fragments upstream of a 20bp barcode and decoded by paired-end sequencing. This library is then transfected into cells and transcribed barcodes are quantified in the RNA by high throughput sequencing. When applied to the human genome, we achieved a 55-fold genome coverage, allowing us to map autonomous promoter activity genome-wide. By computational modeling we delineated subregions within promoters that are relevant for their activity. For instance, we show that antisense promoter transcription is generally dependent on the sense core promoter sequences, and that most enhancers and several families of repetitive elements act as autonomous transcription initiation sites. PMID:28024146
Prediction of RNA secondary structures: from theory to models and real molecules
NASA Astrophysics Data System (ADS)
Schuster, Peter
2006-05-01
RNA secondary structures are derived from RNA sequences, which are strings built form the natural four letter nucleotide alphabet, {AUGC}. These coarse-grained structures, in turn, are tantamount to constrained strings over a three letter alphabet. Hence, the secondary structures are discrete objects and the number of sequences always exceeds the number of structures. The sequences built from two letter alphabets form perfect structures when the nucleotides can form a base pair, as is the case with {GC} or {AU}, but the relation between the sequences and structures differs strongly from the four letter alphabet. A comprehensive theory of RNA structure is presented, which is based on the concepts of sequence space and shape space, being a space of structures. It sets the stage for modelling processes in ensembles of RNA molecules like evolutionary optimization or kinetic folding as dynamical phenomena guided by mappings between the two spaces. The number of minimum free energy (mfe) structures is always smaller than the number of sequences, even for two letter alphabets. Folding of RNA molecules into mfe energy structures constitutes a non-invertible mapping from sequence space onto shape space. The preimage of a structure in sequence space is defined as its neutral network. Similarly the set of suboptimal structures is the preimage of a sequence in shape space. This set represents the conformation space of a given sequence. The evolutionary optimization of structures in populations is a process taking place in sequence space, whereas kinetic folding occurs in molecular ensembles that optimize free energy in conformation space. Efficient folding algorithms based on dynamic programming are available for the prediction of secondary structures for given sequences. The inverse problem, the computation of sequences for predefined structures, is an important tool for the design of RNA molecules with tailored properties. Simultaneous folding or cofolding of two or more RNA molecules can be modelled readily at the secondary structure level and allows prediction of the most stable (mfe) conformations of complexes together with suboptimal states. Cofolding algorithms are important tools for efficient and highly specific primer design in the polymerase chain reaction (PCR) and help to explain the mechanisms of small interference RNA (si-RNA) molecules in gene regulation. The evolutionary optimization of RNA structures is illustrated by the search for a target structure and mimics aptamer selection in evolutionary biotechnology. It occurs typically in steps consisting of short adaptive phases interrupted by long epochs of little or no obvious progress in optimization. During these quasi-stationary epochs the populations are essentially confined to neutral networks where they search for sequences that allow a continuation of the adaptive process. Modelling RNA evolution as a simultaneous process in sequence and shape space provides answers to questions of the optimal population size and mutation rates. Kinetic folding is a stochastic process in conformation space. Exact solutions are derived by direct simulation in the form of trajectory sampling or by solving the master equation. The exact solutions can be approximated straightforwardly by Arrhenius kinetics on barrier trees, which represent simplified versions of conformational energy landscapes. The existence of at least one sequence forming any arbitrarily chosen pair of structures is granted by the intersection theorem. Folding kinetics is the key to understanding and designing multistable RNA molecules or RNA switches. These RNAs form two or more long lived conformations, and conformational changes occur either spontaneously or are induced through binding of small molecules or other biopolymers. RNA switches are found in nature where they act as elements in genetic and metabolic regulation. The reliability of RNA secondary structure prediction is limited by the accuracy with which the empirical parameters can be determined and by principal deficiencies, for example by the lack of energy contributions resulting from tertiary interactions. In addition, native structures may be determined by folding kinetics rather than by thermodynamics. We address the first problem by considering base pair probabilities or base pairing entropies, which are derived from the partition function of conformations. A high base pair probability corresponding to a low pairing entropy is taken as an indicator of a high reliability of prediction. Pseudoknots are discussed as an example of a tertiary interaction that is highly important for RNA function. Moreover, pseudoknot formation is readily incorporated into structure prediction algorithms. Some examples of experimental data on RNA secondary structures that are readily explained using the landscape concept are presented. They deal with (i) properties of RNA molecules with random sequences, (ii) RNA molecules from restricted alphabets, (iii) existence of neutral networks, (iv) shape space covering, (v) riboswitches and (vi) evolution of non-coding RNAs as an example of evolution restricted to neutral networks.
Differential display detects host nucleic acid motifs altered in scrapie-infected brain.
Lathe, Richard; Harris, Alyson
2009-09-25
The transmissible spongiform encephalopathies (TSEs) including scrapie have been attributed to an infectious protein or prion. Infectivity is allied to conversion of the endogenous nucleic-acid-binding protein PrP to an infectious modified form known as PrP(sc). The protein-only theory does not easily explain the enigmatic properties of the agent including strain variation. It was previously suggested that a short nucleic acid, perhaps host-encoded, might contribute to the pathoetiology of the TSEs. No candidate host molecules that might explain transmission of strain differences have yet been put forward. Differential display is a robust technique for detecting nucleic acid differences between two populations. We applied this technique to total nucleic acid preparations from scrapie-infected and control brain. Independent RNA preparations from eight normal and eight scrapie-infected (strain 263K) hamster brains were randomly amplified and visualized in parallel. Though the nucleic acid patterns were generally identical in scrapie-infected versus control brain, some rare bands were differentially displayed. Molecular species consistently overrepresented (or underrepresented) in all eight infected brain samples versus all eight controls were excised from the display, sequenced, and assembled into contigs. Only seven ros contigs (RNAs over- or underrepresented in scrapie) emerged, representing <4 kb from the transcriptome. All contained highly stable regions of secondary structure. The most abundant scrapie-only ros sequence was homologous to a repetitive transposable element (LINE; long interspersed nuclear element). Other ros sequences identified cellular RNA 7SL, clathrin heavy chain, visinin-like protein-1, and three highly specific subregions of ribosomal RNA (ros1-3). The ribosomal ros sequences accurately corresponded to LINE; retrotransposon insertion sites in ribosomal DNA (p<0.01). These differential motifs implicate specific host RNAs in the pathoetiology of the TSEs.
Role of transposon-derived small RNAs in the interplay between genomes and parasitic DNA in rice.
Nosaka, Misuzu; Itoh, Jun-Ichi; Nagato, Yasuo; Ono, Akemi; Ishiwata, Aiko; Sato, Yutaka
2012-09-01
RNA silencing is a defense system against "genomic parasites" such as transposable elements (TE), which are potentially harmful to host genomes. In plants, transcripts from TEs induce production of double-stranded RNAs (dsRNAs) and are processed into small RNAs (small interfering RNAs, siRNAs) that suppress TEs by RNA-directed DNA methylation. Thus, the majority of TEs are epigenetically silenced. On the other hand, most of the eukaryotic genome is composed of TEs and their remnants, suggesting that TEs have evolved countermeasures against host-mediated silencing. Under some circumstances, TEs can become active and increase in copy number. Knowledge is accumulating on the mechanisms of TE silencing by the host; however, the mechanisms by which TEs counteract silencing are poorly understood. Here, we show that a class of TEs in rice produces a microRNA (miRNA) to suppress host silencing. Members of the microRNA820 (miR820) gene family are located within CACTA DNA transposons in rice and target a de novo DNA methyltransferase gene, OsDRM2, one of the components of epigenetic silencing. We confirmed that miR820 negatively regulates the expression of OsDRM2. In addition, we found that expression levels of various TEs are increased quite sensitively in response to decreased OsDRM2 expression and DNA methylation at TE loci. Furthermore, we found that the nucleotide sequence of miR820 and its recognition site within the target gene in some Oryza species have co-evolved to maintain their base-pairing ability. The co-evolution of these sequences provides evidence for the functionality of this regulation. Our results demonstrate how parasitic elements in the genome escape the host's defense machinery. Furthermore, our analysis of the regulation of OsDRM2 by miR820 sheds light on the action of transposon-derived small RNAs, not only as a defense mechanism for host genomes but also as a regulator of interactions between hosts and their parasitic elements.
Lloyd, G S; Busby, S J; Savery, N J
1998-01-01
During transcription initiation at bacterial promoters, the C-terminal domain of the RNA polymerase alpha subunit (alphaCTD) can interact with DNA-sequence elements (known as UP elements) and with activator proteins. We have constructed a series of semi-synthetic promoters carrying both an UP element and a consensus DNA-binding site for the Escherichia coli cAMP receptor protein (CRP; a factor that activates transcription by making direct contacts with alphaCTD). At these promoters, the UP element was located at a variety of distances upstream of the CRP-binding site, which was fixed at position -41.5 bp upstream of the transcript start. At some positions, the UP element caused enhanced promoter activity whereas, at other positions, it had very little effect. In no case was the CRP-dependence of the promoter relieved. DNase I and hydroxyl-radical footprinting were used to study ternary RNA polymerase-CRP-promoter complexes formed at two of the most active of these promoters, and co-operativity between the binding of CRP and purified alpha subunits was studied. The footprints show that alphaCTD binds to the UP element as it is displaced upstream but that this displacement does not prevent alphaCTD from being contacted by CRP. Models to account for this are discussed. PMID:9461538
Transfer RNAs with novel cloverleaf structures
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mukai, Takahito; Vargas-Rodriguez, Oscar; Englert, Markus
We report the identification of novel tRNA species with 12-base pair amino-acid acceptor branches composed of longer acceptor stem and shorter Tstem. While canonical tRNAs have a 7/5 configuration of the branch, the novel tRNAs have either 8/4 or 9/3 structure. They were found during the search for selenocysteine tRNAs in terabytes of genome, metagenome and metatranscriptome sequences. Certain bacteria and their phages employ the 8/4 structure for serine and histidine tRNAs, while minor cysteine and selenocysteine tRNA species may have a modified 8/4 structure with one bulge nucleotide. In Acidobacteria, tRNAs with 8/4 and 9/3 structures may function asmore » missense and nonsense suppressor tRNAs and/or regulatory noncod ing RNAs. In δ-proteobacteria, an additional cysteine tRNA with an 8/4 structure mimics selenocysteine tRNA and may function as opal suppressor. We examined the potential translation function of suppressor tRNA species inEscherichia coli; tRNAs with 8/4 or 9/3 structures efficiently inserted serine, alanine and cysteine in response to stop and sense codons, depending on the identity element and anticodon sequence of the tRNA. These findings expand our view of how tRNA, and possibly the genetic code, is diversified in nature.« less
Transfer RNAs with novel cloverleaf structures
Mukai, Takahito; Vargas-Rodriguez, Oscar; Englert, Markus; ...
2016-10-05
We report the identification of novel tRNA species with 12-base pair amino-acid acceptor branches composed of longer acceptor stem and shorter Tstem. While canonical tRNAs have a 7/5 configuration of the branch, the novel tRNAs have either 8/4 or 9/3 structure. They were found during the search for selenocysteine tRNAs in terabytes of genome, metagenome and metatranscriptome sequences. Certain bacteria and their phages employ the 8/4 structure for serine and histidine tRNAs, while minor cysteine and selenocysteine tRNA species may have a modified 8/4 structure with one bulge nucleotide. In Acidobacteria, tRNAs with 8/4 and 9/3 structures may function asmore » missense and nonsense suppressor tRNAs and/or regulatory noncod ing RNAs. In δ-proteobacteria, an additional cysteine tRNA with an 8/4 structure mimics selenocysteine tRNA and may function as opal suppressor. We examined the potential translation function of suppressor tRNA species inEscherichia coli; tRNAs with 8/4 or 9/3 structures efficiently inserted serine, alanine and cysteine in response to stop and sense codons, depending on the identity element and anticodon sequence of the tRNA. These findings expand our view of how tRNA, and possibly the genetic code, is diversified in nature.« less
Polycomb repressive complex 1 modifies transcription of active genes
Pherson, Michelle; Misulovin, Ziva; Gause, Maria; Mihindukulasuriya, Kathie; Swain, Amanda; Dorsett, Dale
2017-01-01
This study examines the role of Polycomb repressive complex 1 (PRC1) at active genes. The PRC1 and PRC2 complexes are crucial for epigenetic silencing during development of an organism. They are recruited to Polycomb response elements (PREs) and establish silenced domains over several kilobases. Recent studies show that PRC1 is also directly recruited to active genes by the cohesin complex. Cohesin participates broadly in control of gene transcription, but it is unknown whether cohesin-recruited PRC1 also plays a role in transcriptional control of active genes. We address this question using genome-wide RNA sequencing (RNA-seq) and chromatin immunoprecipitation sequencing (ChIP-seq). The results show that PRC1 influences transcription of active genes, and a significant fraction of its effects are likely direct. The roles of different PRC1 subunits can also vary depending on the gene. Depletion of PRC1 subunits by RNA interference alters phosphorylation of RNA polymerase II (Pol II) and occupancy by the Spt5 pausing-elongation factor at most active genes. These effects on Pol II phosphorylation and Spt5 are likely linked to changes in elongation and RNA processing detected by nascent RNA-seq, although the mechanisms remain unresolved. The experiments also reveal that PRC1 facilitates association of Spt5 with enhancers and PREs. Reduced Spt5 levels at these regulatory sequences upon PRC1 depletion coincide with changes in Pol II occupancy and phosphorylation. Our findings indicate that, in addition to its repressive roles in epigenetic gene silencing, PRC1 broadly influences transcription of active genes and may suppress transcription of nonpromoter regulatory sequences. PMID:28782042
Matsumoto, Yusuke; Ohta, Keisuke; Goto, Hideo; Nishio, Machiko
2016-07-01
Gene expression of paramyxoviruses is regulated by genome-encoded cis-acting elements; however, whether all the required elements for viral growth have been identified is not clear. Using a mini-replicon system, it has been shown that human parainfluenza virus type 2 (hPIV2) polymerase can recognize the promoter elements of parainfluenza virus type 5 (PIV5), but reporter activity is lower in this case. We constructed a series of luciferase-encoding chimeric PIV2/5 mini-genomes that are basically hPIV2, but whose leader (le), mRNA start signal and trailer sequence are partially replaced with those of PIV5. Studies of the chimeric PIV2/5 mini-replicons demonstrated that replacement of hPIV2 le with PIV5 le results in remarkably weak luciferase expression. Further mutagenesis identified the responsible region as positions 25-30 of the PIV5 le. Using recombinant hPIV2, the impact of this region on viral life cycles was assessed. Insertion of the mutation at this region facilitated viral growth, genomic replication and mRNA transcription at the early stage of infection, which elicited severe cell damage. In contrast, at the late infection stage it caused a reduction in viral transcription. Here, we identify a novel cis-acting element in the internal region of an le sequence that is involved in the regulation of polymerase, and which contributes to maintaining a balance between viral growth and cytotoxicity.
Tedder, Philip; Zubko, Elena; Westhead, David R.; Meyer, Peter
2009-01-01
Two pools of small RNAs were cloned from inflorescences of Petunia hybrida using a 5′-ligation dependent and a 5′-ligation independent approach. The two libraries were integrated into a public website that allows the screening of individual sequences against 359,769 unique clones. The library contains 15 clones with 100% identity and 53 clones with one mismatch to miRNAs described for other plant species. For two conserved miRNAs, miR159 and miR390, we find clear differences in tissue-specific distribution, compared with other species. This shows that evolutionary conservation of miRNA sequences does not necessarily include a conservation of the miRNA expression profile. Almost 60% of all clones in the database are 24-nucleotide clones. In accordance with the role of 24mers in marking repetitive regions, we find them distributed across retroviral and transposable element sequences but other 24mers map to promoter regions and to different transcript regions. For one target region we observe tissue-specific variation of matching 24mers, which demonstrates that, as for 21mers, 24mer concentrations are not necessarily identical in different tissues. Asymmetric distribution of a putative novel miRNA in the two libraries suggests that the cloning method can be selective for the representation of certain small RNAs in a collection. PMID:19369427
McMurchy, Alicia N; Stempor, Przemyslaw; Gaarenstroom, Tessa; Wysolmerski, Brian; Dong, Yan; Aussianikava, Darya; Appert, Alex; Huang, Ni; Kolasinska-Zwierz, Paulina; Sapetschnig, Alexandra; Miska, Eric A; Ahringer, Julie
2017-01-01
Repetitive sequences derived from transposons make up a large fraction of eukaryotic genomes and must be silenced to protect genome integrity. Repetitive elements are often found in heterochromatin; however, the roles and interactions of heterochromatin proteins in repeat regulation are poorly understood. Here we show that a diverse set of C. elegans heterochromatin proteins act together with the piRNA and nuclear RNAi pathways to silence repetitive elements and prevent genotoxic stress in the germ line. Mutants in genes encoding HPL-2/HP1, LIN-13, LIN-61, LET-418/Mi-2, and H3K9me2 histone methyltransferase MET-2/SETDB1 also show functionally redundant sterility, increased germline apoptosis, DNA repair defects, and interactions with small RNA pathways. Remarkably, fertility of heterochromatin mutants could be partially restored by inhibiting cep-1/p53, endogenous meiotic double strand breaks, or the expression of MIRAGE1 DNA transposons. Functional redundancy among factors and pathways underlies the importance of safeguarding the genome through multiple means. DOI: http://dx.doi.org/10.7554/eLife.21666.001 PMID:28294943
Steckelberg, Anna-Lena; Akiyama, Benjamin M; Costantino, David A; Sit, Tim L; Nix, Jay C; Kieft, Jeffrey S
2018-06-19
Folded RNA elements that block processive 5' → 3' cellular exoribonucleases (xrRNAs) to produce biologically active viral noncoding RNAs have been discovered in flaviviruses, potentially revealing a new mode of RNA maturation. However, whether this RNA structure-dependent mechanism exists elsewhere and, if so, whether a singular RNA fold is required, have been unclear. Here we demonstrate the existence of authentic RNA structure-dependent xrRNAs in dianthoviruses, plant-infecting viruses unrelated to animal-infecting flaviviruses. These xrRNAs have no sequence similarity to known xrRNAs; thus, we used a combination of biochemistry and virology to characterize their sequence requirements and mechanism of stopping exoribonucleases. By solving the structure of a dianthovirus xrRNA by X-ray crystallography, we reveal a complex fold that is very different from that of the flavivirus xrRNAs. However, both versions of xrRNAs contain a unique topological feature, a pseudoknot that creates a protective ring around the 5' end of the RNA structure; this may be a defining structural feature of xrRNAs. Single-molecule FRET experiments reveal that the dianthovirus xrRNAs undergo conformational changes and can use "codegradational remodeling," exploiting the exoribonucleases' degradation-linked helicase activity to help form their resistant structure; such a mechanism has not previously been reported. Convergent evolution has created RNA structure-dependent exoribonuclease resistance in different contexts, which establishes it as a general RNA maturation mechanism and defines xrRNAs as an authentic functional class of RNAs.
Transposable elements in TDP-43-mediated neurodegenerative disorders.
Li, Wanhe; Jin, Ying; Prazak, Lisa; Hammell, Molly; Dubnau, Josh
2012-01-01
Elevated expression of specific transposable elements (TEs) has been observed in several neurodegenerative disorders. TEs also can be active during normal neurogenesis. By mining a series of deep sequencing datasets of protein-RNA interactions and of gene expression profiles, we uncovered extensive binding of TE transcripts to TDP-43, an RNA-binding protein central to amyotrophic lateral sclerosis (ALS) and frontotemporal lobar degeneration (FTLD). Second, we find that association between TDP-43 and many of its TE targets is reduced in FTLD patients. Third, we discovered that a large fraction of the TEs to which TDP-43 binds become de-repressed in mouse TDP-43 disease models. We propose the hypothesis that TE mis-regulation contributes to TDP-43 related neurodegenerative diseases.
Link, Gerhard
1984-01-01
A nuclease-treated plastid extract from mustard (Sinapis alba L.) allows efficient transcription of cloned plastid DNA templates. In this in vitro system, the major runoff transcript of the truncated gene for the 32 000 mol. wt. photosystem II protein was accurately initiated from a site close to or identical with the in vivo start site. By using plasmids with deletions in the 5'-flanking region of this gene as templates, a DNA region required for efficient and selective initiation was detected ˜28-35 nucleotides upstream of the transcription start site. This region contains the sequence element TTGACA, which matches the consensus sequence for prokaryotic `−35' promoter elements. In the absence of this region, a region ˜13-27 nucleotides upstream of the start site still enables a basic level of specific transcription. This second region contains the sequence element TATATAA, which matches the consensus sequence for the `TATA' box of genes transcribed by RNA polymerase II (or B). The region between the `TATA'-like element and the transcription start site is not sufficient but may be required for specific transcription of the plastid gene. This latter region contains the sequence element TATACT, which resembles the prokaryotic `−10' (Pribnow) box. Based on the structural and transcriptional features of the 5' upstream region, a `promoter switch' mechanism is proposed, which may account for the developmentally regulated expression of this plastid gene. ImagesFig. 1.Fig. 2.Fig. 3.Fig. 4.Figure 5. PMID:16453540
HATAKEYAMA, YOSHINORI; SHIBUYA, NORIHIRO; NISHIYAMA, TAKASHI; NAKASHIMA, NOBUHIKO
2004-01-01
The intergenic region (IGR) located upstream of the capsid protein gene in dicistroviruses contains an internal ribosome entry site (IRES). Translation initiation mediated by the IRES does not require initiator methionine tRNA. Comparison of the IGRs among dicistroviruses suggested that Taura syndrome virus (TSV) and acute bee paralysis virus have an extra side stem loop in the predicted IRES. We examined whether the side stem is responsible for translation activity mediated by the IGR using constructs with compensatory mutations. In vitro translation analysis showed that TSV has an IGR-IRES that is structurally distinct from those previously described. Because IGR-IRES elements determine the translation initiation site by virtue of their own tertiary structure formation, the discovery of this initiation mechanism suggests the possibility that eukaryotic mRNAs might have more extensive coding regions than previously predicted. To test this hypothesis, we searched full-length cDNA databases and whole genome sequences of eukaryotes using the pattern matching program, Scan For Matches, with parameters that can extract sequences containing secondary structure elements resembling those of IGR-IRES. Our search yielded several sequences, but their predicted secondary structures were suggested to be unstable in comparison to those of dicistroviruses. These results suggest that RNAs structurally similar to dicistroviruses are not common. If some eukaryotic mRNAs are translated independently of an initiator methionine tRNA, their structures are likely to be significantly distinct from those of dicistroviruses. PMID:15100433
RUDI, a short interspersed element of the V-SINE superfamily widespread in molluscan genomes.
Luchetti, Andrea; Šatović, Eva; Mantovani, Barbara; Plohl, Miroslav
2016-06-01
Short interspersed elements (SINEs) are non-autonomous retrotransposons that are widespread in eukaryotic genomes. They exhibit a chimeric sequence structure consisting of a small RNA-related head, an anonymous body and an AT-rich tail. Although their turnover and de novo emergence is rapid, some SINE elements found in distantly related species retain similarity in certain core segments (or highly conserved domains, HCD). We have characterized a new SINE element named RUDI in the bivalve molluscs Ruditapes decussatus and R. philippinarum and found this element to be widely distributed in the genomes of a number of mollusc species. An unexpected structural feature of RUDI is the HCD domain type V, which was first found in non-amniote vertebrate SINEs and in the SINE from one cnidarian species. In addition to the V domain, the overall sequence conservation pattern of RUDI elements resembles that found in ancient AmnSINE (~310 Myr old) and Au SINE (~320 Myr old) families, suggesting that RUDI might be among the most ancient SINE families. Sequence conservation suggests a monophyletic origin of RUDI. Nucleotide variability and phylogenetic analyses suggest long-term vertical inheritance combined with at least one horizontal transfer event as the most parsimonious explanation for the observed taxonomic distribution.
Functional 5′ UTR mRNA structures in eukaryotic translation regulation and how to find them
Leppek, Kathrin; Das, Rhiju; Barna, Maria
2017-01-01
RNA molecules can fold into intricate shapes that can provide an additional layer of control of gene expression beyond that of their sequence. In this Review, we discuss the current mechanistic understanding of structures in 5′ untranslated regions (UTRs) of eukaryotic mRNAs and the emerging methodologies used to explore them. These structures may regulate cap-dependent translation initiation through helicase-mediated remodelling of RNA structures and higher-order RNA interactions, as well as cap-independent translation initiation through internal ribosome entry sites (IRESs), mRNA modifications and other specialized translation pathways. We discuss known 5′ UTR RNA structures and how new structure probing technologies coupled with prospective validation, particularly compensatory mutagenesis, are likely to identify classes of structured RNA elements that shape post-transcriptional control of gene expression and the development of multicellular organisms. PMID:29165424
Analytical applications of aptamers
NASA Astrophysics Data System (ADS)
Tombelli, S.; Minunni, M.; Mascini, M.
2007-05-01
Aptamers are single stranded DNA or RNA ligands which can be selected for different targets starting from a library of molecules containing randomly created sequences. Aptamers have been selected to bind very different targets, from proteins to small organic dyes. Aptamers are proposed as alternatives to antibodies as biorecognition elements in analytical devices with ever increasing frequency. This in order to satisfy the demand for quick, cheap, simple and highly reproducible analytical devices, especially for protein detection in the medical field or for the detection of smaller molecules in environmental and food analysis. In our recent experience, DNA and RNA aptamers, specific for three different proteins (Tat, IgE and thrombin), have been exploited as bio-recognition elements to develop specific biosensors (aptasensors). These recognition elements have been coupled to piezoelectric quartz crystals and surface plasmon resonance (SPR) devices as transducers where the aptamers have been immobilized on the gold surface of the crystals electrodes or on SPR chips, respectively.
Penno, Christophe; Sharma, Virag; Coakley, Arthur; O'Connell Motherway, Mary; van Sinderen, Douwe; Lubkowska, Lucyna; Kireeva, Maria L; Kashlev, Mikhail; Baranov, Pavel V; Atkins, John F
2015-04-21
Escherichia coli and yeast DNA-dependent RNA polymerases are shown to mediate efficient nascent transcript stem loop formation-dependent RNA-DNA hybrid realignment. The realignment was discovered on the heteropolymeric sequence T5C5 and yields transcripts lacking a C residue within a corresponding U5C4. The sequence studied is derived from a Roseiflexus insertion sequence (IS) element where the resulting transcriptional slippage is required for transposase synthesis. The stability of the RNA structure, the proximity of the stem loop to the slippage site, the length and composition of the slippage site motif, and the identity of its 3' adjacent nucleotides (nt) are crucial for transcripts lacking a single C. In many respects, the RNA structure requirements for this slippage resemble those for hairpin-dependent transcription termination. In a purified in vitro system, the slippage efficiency ranges from 5% to 75% depending on the concentration ratios of the nucleotides specified by the slippage sequence and the 3' nt context. The only previous proposal of stem loop mediated slippage, which was in Ebola virus expression, was based on incorrect data interpretation. We propose a mechanical slippage model involving the RNAP translocation state as the main motor in slippage directionality and efficiency. It is distinct from previously described models, including the one proposed for paramyxovirus, where following random movement efficiency is mainly dependent on the stability of the new realigned hybrid. In broadening the scope for utilization of transcription slippage for gene expression, the stimulatory structure provides parallels with programmed ribosomal frameshifting at the translation level.
Booy, Evan P.; McRae, Ewan K. S.; Howard, Ryan; Deo, Soumya R.; Ariyo, Emmanuel O.; Dzananovic, Edis; Meier, Markus; Stetefeld, Jörg; McKenna, Sean A.
2016-01-01
RNA helicase associated with AU-rich element (RHAU) is an ATP-dependent RNA helicase that demonstrates high affinity for quadruplex structures in DNA and RNA. To elucidate the significance of these quadruplex-RHAU interactions, we have performed RNA co-immunoprecipitation screens to identify novel RNAs bound to RHAU and characterize their function. In the course of this study, we have identified the non-coding RNA BC200 (BCYRN1) as specifically enriched upon RHAU immunoprecipitation. Although BC200 does not adopt a quadruplex structure and does not bind the quadruplex-interacting motif of RHAU, it has direct affinity for RHAU in vitro. Specifically designed BC200 truncations and RNase footprinting assays demonstrate that RHAU binds to an adenosine-rich region near the 3′-end of the RNA. RHAU truncations support binding that is dependent upon a region within the C terminus and is specific to RHAU isoform 1. Tests performed to assess whether BC200 interferes with RHAU helicase activity have demonstrated the ability of BC200 to act as an acceptor of unwound quadruplexes via a cytosine-rich region near the 3′-end of the RNA. Furthermore, an interaction between BC200 and the quadruplex-containing telomerase RNA was confirmed by pull-down assays of the endogenous RNAs. This leads to the possibility that RHAU may direct BC200 to bind and exert regulatory functions at quadruplex-containing RNA or DNA sequences. PMID:26740632
Cenik, Can; Chua, Hon Nian; Zhang, Hui; Tarnawsky, Stefan P.; Akef, Abdalla; Derti, Adnan; Tasan, Murat; Moore, Melissa J.; Palazzo, Alexander F.; Roth, Frederick P.
2011-01-01
In higher eukaryotes, messenger RNAs (mRNAs) are exported from the nucleus to the cytoplasm via factors deposited near the 5′ end of the transcript during splicing. The signal sequence coding region (SSCR) can support an alternative mRNA export (ALREX) pathway that does not require splicing. However, most SSCR–containing genes also have introns, so the interplay between these export mechanisms remains unclear. Here we support a model in which the furthest upstream element in a given transcript, be it an intron or an ALREX–promoting SSCR, dictates the mRNA export pathway used. We also experimentally demonstrate that nuclear-encoded mitochondrial genes can use the ALREX pathway. Thus, ALREX can also be supported by nucleotide signals within mitochondrial-targeting sequence coding regions (MSCRs). Finally, we identified and experimentally verified novel motifs associated with the ALREX pathway that are shared by both SSCRs and MSCRs. Our results show strong correlation between 5′ untranslated region (5′UTR) intron presence/absence and sequence features at the beginning of the coding region. They also suggest that genes encoding secretory and mitochondrial proteins share a common regulatory mechanism at the level of mRNA export. PMID:21533221
Assessment of RNAi-induced silencing in banana (Musa spp.).
Dang, Tuong Vi T; Windelinckx, Saskia; Henry, Isabelle M; De Coninck, Barbara; Cammue, Bruno P A; Swennen, Rony; Remy, Serge
2014-09-18
In plants, RNA- based gene silencing mediated by small RNAs functions at the transcriptional or post-transcriptional level to negatively regulate target genes, repetitive sequences, viral RNAs and/or transposon elements. Post-transcriptional gene silencing (PTGS) or the RNA interference (RNAi) approach has been achieved in a wide range of plant species for inhibiting the expression of target genes by generating double-stranded RNA (dsRNA). However, to our knowledge, successful RNAi-application to knock-down endogenous genes has not been reported in the important staple food crop banana. Using embryogenic cell suspension (ECS) transformed with ß-glucuronidase (GUS) as a model system, we assessed silencing of gusAINT using three intron-spliced hairpin RNA (ihpRNA) constructs containing gusAINT sequences of 299-nt, 26-nt and 19-nt, respectively. Their silencing potential was analysed in 2 different experimental set-ups. In the first, Agrobacterium-mediated co-transformation of banana ECS with a gusAINT containing vector and an ihpRNA construct resulted in a significantly reduced GUS enzyme activity 6-8 days after co-cultivation with either the 299-nt and 19-nt ihpRNA vectors. In the second approach, these ihpRNA constructs were transferred to stable GUS-expressing ECS and their silencing potential was evaluated in the regenerated in vitro plants. In comparison to control plants, transgenic plants transformed with the 299-nt gusAINT targeting sequence showed a 4.5 fold down-regulated gusA mRNA expression level, while GUS enzyme activity was reduced by 9 fold. Histochemical staining of plant tissues confirmed these findings. Northern blotting used to detect the expression of siRNA in the 299-nt ihpRNA vector transgenic in vitro plants revealed a negative relationship between siRNA expression and GUS enzyme activity. In contrast, no reduction in GUS activity or GUS mRNA expression occurred in the regenerated lines transformed with either of the two gusAINT oligo target sequences (26-nt and 19-nt). RNAi-induced silencing was achieved in banana, both at transient and stable level, resulting in significant reduction of gene expression and enzyme activity. The success of silencing was dependent on the targeted region of the target gene. The successful generation of transgenic ECS for second transformation with (an)other construct(s) can be of value for functional genomics research in banana.
Kang, J J; Yokoi, T J; Holland, M J
1995-12-01
The 190-base pair (bp) rDNA enhancer within the intergenic spacer sequences of Saccharomyces cerevisiae rRNA cistrons activates synthesis of the 35S-rRNA precursor about 20-fold in vivo (Mestel,, R., Yip, M., Holland, J. P., Wang, E., Kang, J., and Holland, M. J. (1989) Mol. Cell. Biol. 9, 1243-1254). We now report identification and analysis of transcriptional activities mediated by three cis-acting sites within a 90-bp portion of the rDNA enhancer designated the modulator region. In vivo, these sequences mediated termination of transcription by RNA polymerase I and potentiated the activity of the rDNA enhancer element. Two trans-acting factors, REB1 and REB2, bind independently to sites within the modulator region (Morrow, B. E., Johnson, S. P., and Warner, J. R. (1989) J. Biol. Chem. 264, 9061-9068). We show that REB2 is identical to the ABF1 protien. Site-directed mutagenesis of REB1 and ABF1 binding sites demonstrated uncoupling of RNA polymerase I-dependent termination from transcriptional activation in vivo. We conclude that REB1 and ABF1 are required for RNA polymerase I-dependent termination and enhancer function, respectively, Since REB1 and ABF1 proteins also regulate expression of class II genes and other nuclear functions, our results suggest further similarities between RNA polymerase I and II regulatory mechanisms. Two rDNA enhancers flanking a rDNA minigene stimulated RNA polymerase I transcription in a "multiplicative" fashion. Deletion mapping analysis showed that similar cis-acting sequences were required for enhancer function when positioned upstream or downstream from a rDNA minigene.
Tanner, N K; Cech, T R
1985-01-01
The intervening sequence (IVS) excised from the rRNA precursor of Tetrahymena thermophila is converted to a covalently closed circular RNA in the absence of proteins in vitro. This self-catalyzed cyclization reaction is inhibited by the intercalating dye methidiumpropyl.EDTA (MPE; R.P. Hertzberg and P.B. Dervan (1982) J. Am. Chem. Soc. 104, 313-315). The MPE binding sites have been localized by mapping the sites of MPE.Fe(II) cleavage of the IVS RNA. There are three major binding sites within the 414 nucleotide IVS RNA. Two of these sites coincide with the A.B and 9L.2 pairings. These are structural elements that are conserved in all group I introns and are implicated as being functionally important for splicing. We propose that interaction of MPE with these sites is responsible for dye inhibition of cyclization. The reactions of MPE.Fe(II) with an RNA of known structure, tRNAPhe, and with the IVS RNA were studied as a function of temperature, ionic strength and ethidium concentration. Based on the comparison of the reaction with these two RNAs, we conclude that the dye is a very useful probe for structural regions of large RNAs, while it provides more limited structural information about the small, compact tRNA molecule. Images PMID:2415924
Zhang, Wensheng; Edwards, Andrea; Fan, Wei; Fang, Zhide; Deininger, Prescott; Zhang, Kun
2013-08-28
The exonization of transposable elements (TEs) has proven to be a significant mechanism for the creation of novel exons. Existing knowledge of the retention patterns of TE exons in mRNAs were mainly established by the analysis of Expressed Sequence Tag (EST) data and microarray data. This study seeks to validate and extend previous studies on the expression of TE exons by an integrative statistical analysis of high throughput RNA sequencing data. We collected 26 RNA-seq datasets spanning multiple tissues and cancer types. The exon-level digital expressions (indicating retention rates in mRNAs) were quantified by a double normalized measure, called the rescaled RPKM (Reads Per Kilobase of exon model per Million mapped reads). We analyzed the distribution profiles and the variability (across samples and between tissue/disease groups) of TE exon expressions, and compared them with those of other constitutive or cassette exons. We inferred the effects of four genomic factors, including the location, length, cognate TE family and TE nucleotide proportion (RTE, see Methods section) of a TE exon, on the exons' expression level and expression variability. We also investigated the biological implications of an assembly of highly-expressed TE exons. Our analysis confirmed prior studies from the following four aspects. First, with relatively high expression variability, most TE exons in mRNAs, especially those without exact counterparts in the UCSC RefSeq (Reference Sequence) gene tables, demonstrate low but still detectable expression levels in most tissue samples. Second, the TE exons in coding DNA sequences (CDSs) are less highly expressed than those in 3' (5') untranslated regions (UTRs). Third, the exons derived from chronologically ancient repeat elements, such as MIRs, tend to be highly expressed in comparison with those derived from younger TEs. Fourth, the previously observed negative relationship between the lengths of exons and the inclusion levels in transcripts is also true for exonized TEs. Furthermore, our study resulted in several novel findings. They include: (1) for the TE exons with non-zero expression and as shown in most of the studied biological samples, a high TE nucleotide proportion leads to their lower retention rates in mRNAs; (2) the considered genomic features (i.e. a continuous variable such as the exon length or a category indicator such as 3'UTR) influence the expression level and the expression variability (CV) of TE exons in an inverse manner; (3) not only the exons derived from Alu elements but also the exons from the TEs of other families were preferentially established in zinc finger (ZNF) genes.
Uncovering the Repertoire of Endogenous Flaviviral Elements in Aedes Mosquito Genomes
Suzuki, Yasutsugu; Frangeul, Lionel; Dickson, Laura B.; Blanc, Hervé; Verdier, Yann; Vinh, Joelle
2017-01-01
ABSTRACT Endogenous viral elements derived from nonretroviral RNA viruses have been described in various animal genomes. Whether they have a biological function, such as host immune protection against related viruses, is a field of intense study. Here, we investigated the repertoire of endogenous flaviviral elements (EFVEs) in Aedes mosquitoes, the vectors of arboviruses such as dengue and chikungunya viruses. Previous studies identified three EFVEs from Aedes albopictus cell lines and one from Aedes aegypti cell lines. However, an in-depth characterization of EFVEs in wild-type mosquito populations and individual mosquitoes in vivo has not been performed. We detected the full-length DNA sequence of the previously described EFVEs and their respective transcripts in several A. albopictus and A. aegypti populations from geographically distinct areas. However, EFVE-derived proteins were not detected by mass spectrometry. Using deep sequencing, we detected the production of PIWI-interacting RNA-like small RNAs, in an antisense orientation, targeting the EFVEs and their flanking regions in vivo. The EFVEs were integrated in repetitive regions of the mosquito genomes, and their flanking sequences varied among mosquito populations. We bioinformatically predicted several new EFVEs from a Vietnamese A. albopictus population and observed variation in the occurrence of those elements among mosquitoes. Phylogenetic analysis of an A. aegypti EFVE suggested that it integrated prior to the global expansion of the species and subsequently diverged among and within populations. The findings of this study together reveal the substantial structural and nucleotide diversity of flaviviral integrations in Aedes genomes. Unraveling this diversity will help to elucidate the potential biological function of these EFVEs. IMPORTANCE Endogenous viral elements (EVEs) are whole or partial viral sequences integrated in host genomes. Interestingly, some EVEs have important functions for host fitness and antiviral defense. Because mosquitoes also have EVEs in their genomes, characterizing these EVEs is a prerequisite for their potential use to manipulate the mosquito antiviral response. In the study described here, we focused on EVEs related to the Flavivirus genus, to which dengue and Zika viruses belong, in individual Aedes mosquitoes from geographically distinct areas. We show the existence in vivo of flaviviral EVEs previously identified in mosquito cell lines, and we detected new ones. We show that EVEs have evolved differently in each mosquito population. They produce transcripts and small RNAs but not proteins, suggesting a function at the RNA level. Our study uncovers the diverse repertoire of flaviviral EVEs in Aedes mosquito populations and contributes to an understanding of their role in the host antiviral system. PMID:28539440
Uncovering the Repertoire of Endogenous Flaviviral Elements in Aedes Mosquito Genomes.
Suzuki, Yasutsugu; Frangeul, Lionel; Dickson, Laura B; Blanc, Hervé; Verdier, Yann; Vinh, Joelle; Lambrechts, Louis; Saleh, Maria-Carla
2017-08-01
Endogenous viral elements derived from nonretroviral RNA viruses have been described in various animal genomes. Whether they have a biological function, such as host immune protection against related viruses, is a field of intense study. Here, we investigated the repertoire of endogenous flaviviral elements (EFVEs) in Aedes mosquitoes, the vectors of arboviruses such as dengue and chikungunya viruses. Previous studies identified three EFVEs from Aedes albopictus cell lines and one from Aedes aegypti cell lines. However, an in-depth characterization of EFVEs in wild-type mosquito populations and individual mosquitoes in vivo has not been performed. We detected the full-length DNA sequence of the previously described EFVEs and their respective transcripts in several A. albopictus and A. aegypti populations from geographically distinct areas. However, EFVE-derived proteins were not detected by mass spectrometry. Using deep sequencing, we detected the production of PIWI-interacting RNA-like small RNAs, in an antisense orientation, targeting the EFVEs and their flanking regions in vivo The EFVEs were integrated in repetitive regions of the mosquito genomes, and their flanking sequences varied among mosquito populations. We bioinformatically predicted several new EFVEs from a Vietnamese A. albopictus population and observed variation in the occurrence of those elements among mosquitoes. Phylogenetic analysis of an A. aegypti EFVE suggested that it integrated prior to the global expansion of the species and subsequently diverged among and within populations. The findings of this study together reveal the substantial structural and nucleotide diversity of flaviviral integrations in Aedes genomes. Unraveling this diversity will help to elucidate the potential biological function of these EFVEs. IMPORTANCE Endogenous viral elements (EVEs) are whole or partial viral sequences integrated in host genomes. Interestingly, some EVEs have important functions for host fitness and antiviral defense. Because mosquitoes also have EVEs in their genomes, characterizing these EVEs is a prerequisite for their potential use to manipulate the mosquito antiviral response. In the study described here, we focused on EVEs related to the Flavivirus genus, to which dengue and Zika viruses belong, in individual Aedes mosquitoes from geographically distinct areas. We show the existence in vivo of flaviviral EVEs previously identified in mosquito cell lines, and we detected new ones. We show that EVEs have evolved differently in each mosquito population. They produce transcripts and small RNAs but not proteins, suggesting a function at the RNA level. Our study uncovers the diverse repertoire of flaviviral EVEs in Aedes mosquito populations and contributes to an understanding of their role in the host antiviral system. Copyright © 2017 Suzuki et al.
A New Class of SINEs with snRNA Gene-Derived Heads
Kojima, Kenji K.
2015-01-01
Eukaryotic genomes are colonized by various transposons including short interspersed elements (SINEs). The 5′ region (head) of the majority of SINEs is derived from one of the three types of RNA genes—7SL RNA, transfer RNA (tRNA), or 5S ribosomal RNA (rRNA)—and the internal promoter inside the head promotes the transcription of the entire SINEs. Here I report a new group of SINEs whose heads originate from either the U1 or U2 small nuclear RNA gene. These SINEs, named SINEU, are distributed among crocodilians and classified into three families. The structures of the SINEU-1 subfamilies indicate the recurrent addition of a U1- or U2-derived sequence onto the 5′ end of SINEU-1 elements. SINEU-1 and SINEU-3 are ancient and shared among alligators, crocodiles, and gharials, while SINEU-2 is absent in the alligator genome. SINEU-2 is the only SINE family that was active after the split of crocodiles and gharials. All SINEU families, especially SINEU-3, are preferentially inserted into a family of Mariner DNA transposon, Mariner-N4_AMi. A group of Tx1 non-long terminal repeat retrotransposons designated Tx1-Mar also show target preference for Mariner-N4_AMi, indicating that SINEU was mobilized by Tx1-Mar. PMID:26019167
A New Class of SINEs with snRNA Gene-Derived Heads.
Kojima, Kenji K
2015-05-27
Eukaryotic genomes are colonized by various transposons including short interspersed elements (SINEs). The 5' region (head) of the majority of SINEs is derived from one of the three types of RNA genes--7SL RNA, transfer RNA (tRNA), or 5S ribosomal RNA (rRNA)--and the internal promoter inside the head promotes the transcription of the entire SINEs. Here I report a new group of SINEs whose heads originate from either the U1 or U2 small nuclear RNA gene. These SINEs, named SINEU, are distributed among crocodilians and classified into three families. The structures of the SINEU-1 subfamilies indicate the recurrent addition of a U1- or U2-derived sequence onto the 5' end of SINEU-1 elements. SINEU-1 and SINEU-3 are ancient and shared among alligators, crocodiles, and gharials, while SINEU-2 is absent in the alligator genome. SINEU-2 is the only SINE family that was active after the split of crocodiles and gharials. All SINEU families, especially SINEU-3, are preferentially inserted into a family of Mariner DNA transposon, Mariner-N4_AMi. A group of Tx1 non-long terminal repeat retrotransposons designated Tx1-Mar also show target preference for Mariner-N4_AMi, indicating that SINEU was mobilized by Tx1-Mar. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Quarles, Kaycee A; Chadalavada, Durga; Showalter, Scott A
2015-06-01
The prevalence of double-stranded RNA (dsRNA) in eukaryotic cells has only recently been appreciated. Of interest here, RNA silencing begins with dsRNA substrates that are bound by the dsRNA-binding domains (dsRBDs) of their processing proteins. Specifically, processing of microRNA (miRNA) in the nucleus minimally requires the enzyme Drosha and its dsRBD-containing cofactor protein, DGCR8. The smallest recombinant construct of DGCR8 that is sufficient for in vitro dsRNA binding, referred to as DGCR8-Core, consists of its two dsRBDs and a C-terminal tail. As dsRBDs rarely recognize the nucleotide sequence of dsRNA, it is reasonable to hypothesize that DGCR8 function is dependent on the recognition of specific structural features in the miRNA precursor. Previously, we demonstrated that noncanonical structural elements that promote RNA flexibility within the stem of miRNA precursors are necessary for efficient in vitro cleavage by reconstituted Microprocessor complexes. Here, we combine gel shift assays with in vitro processing assays to demonstrate that neither the N-terminal dsRBD of DGCR8 in isolation nor the DGCR8-Core construct is sensitive to the presence of noncanonical structural elements within the stem of miRNA precursors, or to single-stranded segments flanking the stem. Extending DGCR8-Core to include an N-terminal heme-binding region does not change our conclusions. Thus, our data suggest that although the DGCR8-Core region is necessary for dsRNA binding and recruitment to the Microprocessor, it is not sufficient to establish the previously observed connection between RNA flexibility and processing efficiency. © 2015 Wiley Periodicals, Inc.
Burgess, Diane; Freeling, Michael
2014-01-01
In vertebrates, conserved noncoding elements (CNEs) are functionally constrained sequences that can show striking conservation over >400 million years of evolutionary distance and frequently are located megabases away from target developmental genes. Conserved noncoding sequences (CNSs) in plants are much shorter, and it has been difficult to detect conservation among distantly related genomes. In this article, we show not only that CNS sequences can be detected throughout the eudicot clade of flowering plants, but also that a subset of 37 CNSs can be found in all flowering plants (diverging ∼170 million years ago). These CNSs are functionally similar to vertebrate CNEs, being highly associated with transcription factor and development genes and enriched in transcription factor binding sites. Some of the most highly conserved sequences occur in genes encoding RNA binding proteins, particularly the RNA splicing–associated SR genes. Differences in sequence conservation between plants and animals are likely to reflect differences in the biology of the organisms, with plants being much more able to tolerate genomic deletions and whole-genome duplication events due, in part, to their far greater fecundity compared with vertebrates. PMID:24681619
Histone-derived piRNA biogenesis depends on the ping-pong partners Piwi5 and Ago3 in Aedes aegypti
Girardi, Erika; Miesen, Pascal; Pennings, Bas; Frangeul, Lionel; Saleh, Maria-Carla
2017-01-01
Abstract The piRNA pathway is of key importance in controlling transposable elements in most animal species. In the vector mosquito Aedes aegypti, the presence of eight PIWI proteins and the accumulation of viral piRNAs upon arbovirus infection suggest additional functions of the piRNA pathway beyond genome defense. To better understand the regulatory potential of this pathway, we analyzed in detail host-derived piRNAs in A. aegypti Aag2 cells. We show that a large repertoire of protein-coding genes and non-retroviral integrated RNA virus elements are processed into genic piRNAs by different combinations of PIWI proteins. Among these, we identify a class of genes that produces piRNAs from coding sequences in an Ago3- and Piwi5-dependent fashion. We demonstrate that the replication-dependent histone gene family is a genic source of ping-pong dependent piRNAs and that histone-derived piRNAs are dynamically expressed throughout the cell cycle, suggesting a role for the piRNA pathway in the regulation of histone gene expression. Moreover, our results establish the Aag2 cell line as an accessible experimental model to study gene-derived piRNAs. PMID:28115625
Samson, Marie-Laure
2008-01-01
Background The Drosophila gene embryonic lethal abnormal visual system (elav) is the prototype of a gene family present in all metazoans. Its members encode structurally conserved neuronal proteins with three RNA Recognition Motifs (RRM) but they paradoxically act at diverse levels of post-transcriptional regulation. In an attempt to understand the history of this family, we searched for orthologs in eleven completely sequenced genomes, including those of humans, D. melanogaster and C. elegans, for which cDNAs are available. Results We analyzed 23 orthologs/paralogs of elav, and found evidence of gain/loss of gene copy number. For one set of genes, including elav itself, the coding sequences are free of introns and their products most resemble ELAV. The remaining genes show remarkable conservation of their exon organization, and their products most resemble FNE and RBP9, proteins encoded by the two elav paralogs of Drosophila. Remarkably, three of the conserved exon junctions are both close to structural elements, involved respectively in protein-RNA interactions and in the regulation of sub-cellular localization, and in the vicinity of diverse sequence variations. Conclusion The data indicate that the essential elav gene of Drosophila is newly emerged, restricted to dipterans and of retrotransposed origin. We propose that the conserved exon junctions constitute potential sites for sequence/function modifications, and that RRM binding proteins, whose function relies upon plastic RNA-protein interactions, may have played an important role in brain evolution. PMID:18715504
Defining Transcriptional Regulatory Mechanisms for Primary let-7 miRNAs
Gaeta, Xavier; Le, Luat; Lin, Ying; Xie, Yuan; Lowry, William E.
2017-01-01
The let-7 family of miRNAs have been shown to control developmental timing in organisms from C. elegans to humans; their function in several essential cell processes throughout development is also well conserved. Numerous studies have defined several steps of post-transcriptional regulation of let-7 production; from pri-miRNA through pre-miRNA, to the mature miRNA that targets endogenous mRNAs for degradation or translational inhibition. Less-well defined are modes of transcriptional regulation of the pri-miRNAs for let-7. let-7 pri-miRNAs are expressed in polycistronic fashion, in long transcripts newly annotated based on chromatin-associated RNA-sequencing. Upon differentiation, we found that some let-7 pri-miRNAs are regulated at the transcriptional level, while others appear to be constitutively transcribed. Using the Epigenetic Roadmap database, we further annotated regulatory elements of each polycistron identified putative promoters and enhancers. Probing these regulatory elements for transcription factor binding sites identified factors that regulate transcription of let-7 in both promoter and enhancer regions, and identified novel regulatory mechanisms for this important class of miRNAs. PMID:28052101
Su, Zhipeng; Zhu, Jiawen; Xu, Zhuofei; Xiao, Ran; Zhou, Rui; Li, Lu; Chen, Huanchun
2016-01-01
Actinobacillus pleuropneumoniae is the pathogen of porcine contagious pleuropneumoniae, a highly contagious respiratory disease of swine. Although the genome of A. pleuropneumoniae was sequenced several years ago, limited information is available on the genome-wide transcriptional analysis to accurately annotate the gene structures and regulatory elements. High-throughput RNA sequencing (RNA-seq) has been applied to study the transcriptional landscape of bacteria, which can efficiently and accurately identify gene expression regions and unknown transcriptional units, especially small non-coding RNAs (sRNAs), UTRs and regulatory regions. The aim of this study is to comprehensively analyze the transcriptome of A. pleuropneumoniae by RNA-seq in order to improve the existing genome annotation and promote our understanding of A. pleuropneumoniae gene structures and RNA-based regulation. In this study, we utilized RNA-seq to construct a single nucleotide resolution transcriptome map of A. pleuropneumoniae. More than 3.8 million high-quality reads (average length ~90 bp) from a cDNA library were generated and aligned to the reference genome. We identified 32 open reading frames encoding novel proteins that were mis-annotated in the previous genome annotations. The start sites for 35 genes based on the current genome annotation were corrected. Furthermore, 51 sRNAs in the A. pleuropneumoniae genome were discovered, of which 40 sRNAs were never reported in previous studies. The transcriptome map also enabled visualization of 5'- and 3'-UTR regions, in which contained 11 sRNAs. In addition, 351 operons covering 1230 genes throughout the whole genome were identified. The RNA-Seq based transcriptome map validated annotated genes and corrected annotations of open reading frames in the genome, and led to the identification of many functional elements (e.g. regions encoding novel proteins, non-coding sRNAs and operon structures). The transcriptional units described in this study provide a foundation for future studies concerning the gene functions and the transcriptional regulatory architectures of this pathogen. PMID:27018591
Patel, Hardip; Forêt, Sylvain; Karlsen, Bård Ove; Jørgensen, Tor Erik; Hall-Spencer, Jason M
2018-01-01
Abstract Cnidarians harbor a variety of small regulatory RNAs that include microRNAs (miRNAs) and PIWI-interacting RNAs (piRNAs), but detailed information is limited. Here, we report the identification and expression of novel miRNAs and putative piRNAs, as well as their genomic loci, in the symbiotic sea anemone Anemonia viridis. We generated a draft assembly of the A. viridis genome with putative size of 313 Mb that appeared to be composed of about 36% repeats, including known transposable elements. We detected approximately equal fractions of DNA transposons and retrotransposons. Deep sequencing of small RNA libraries constructed from A. viridis adults sampled at a natural CO2 gradient off Vulcano Island, Italy, identified 70 distinct miRNAs. Eight were homologous to previously reported miRNAs in cnidarians, whereas 62 appeared novel. Nine miRNAs were recognized as differentially expressed along the natural seawater pH gradient. We found a highly abundant and diverse population of piRNAs, with a substantial fraction showing ping–pong signatures. We identified nearly 22% putative piRNAs potentially targeting transposable elements within the A. viridis genome. The A. viridis genome appeared similar in size to that of other hexacorals with a very high divergence of transposable elements resembling that of the sea anemone genus Exaiptasia. The genome encodes and expresses a high number of small regulatory RNAs, which include novel miRNAs and piRNAs. Differentially expressed small RNAs along the seawater pH gradient indicated regulatory gene responses to environmental stressors. PMID:29385567
Widespread and evolutionary analysis of a MITE family Monkey King in Brassicaceae.
Dai, Shutao; Hou, Jinna; Long, Yan; Wang, Jing; Li, Cong; Xiao, Qinqin; Jiang, Xiaoxue; Zou, Xiaoxiao; Zou, Jun; Meng, Jinling
2015-06-19
Miniature inverted repeat transposable elements (MITEs) are important components of eukaryotic genomes, with hundreds of families and many copies, which may play important roles in gene regulation and genome evolution. However, few studies have investigated the molecular mechanisms involved. In our previous study, a Tourist-like MITE, Monkey King, was identified from the promoter region of a flowering time gene, BnFLC.A10, in Brassica napus. Based on this MITE, the characteristics and potential roles on gene regulation of the MITE family were analyzed in Brassicaceae. The characteristics of the Tourist-like MITE family Monkey King in Brassicaceae, including its distribution, copies and insertion sites in the genomes of major Brassicaceae species were analyzed in this study. Monkey King was actively amplified in Brassica after divergence from Arabidopsis, which was indicated by the prompt increase in copy number and by phylogenetic analysis. The genomic variations caused by Monkey King insertions, both intra- and inter-species in Brassica, were traced by PCR amplification. Genomic sequence analysis showed that most complete Monkey King elements are located in gene-rich regions, less than 3kb from genes, in both the B. rapa and A. thaliana genomes. Sixty-seven Brassica expressed sequence tags carrying Monkey King fragments were also identified from the NCBI database. Bisulfite sequencing identified specific DNA methylation of cytosine residues in the Monkey King sequence. A fragment containing putative TATA-box motifs in the MITE sequence could bind with nuclear protein(s) extracted from leaves of B. napus plants. A Monkey King-related microRNA, bna-miR6031, was identified in the microRNA database. In transgenic A. thaliana, when the Monkey King element was inserted upstream of 35S promoter, the promoter activity was weakened. Monkey King, a Brassicaceae Tourist-like MITE family, has amplified relatively recently and has induced intra- and inter-species genomic variations in Brassica. Monkey King elements are most abundant in the vicinity of genes and may have a substantial effect on genome-wide gene regulation in Brassicaceae. Monkey King insertions potentially regulate gene expression and genome evolution through epigenetic modification and new regulatory motif production.
Wei, Yunzhou; Chesne, Megan T.; Terns, Rebecca M.; Terns, Michael P.
2015-01-01
CRISPR-Cas systems are RNA-based immune systems that protect prokaryotes from invaders such as phages and plasmids. In adaptation, the initial phase of the immune response, short foreign DNA fragments are captured and integrated into host CRISPR loci to provide heritable defense against encountered foreign nucleic acids. Each CRISPR contains a ∼100–500 bp leader element that typically includes a transcription promoter, followed by an array of captured ∼35 bp sequences (spacers) sandwiched between copies of an identical ∼35 bp direct repeat sequence. New spacers are added immediately downstream of the leader. Here, we have analyzed adaptation to phage infection in Streptococcus thermophilus at the CRISPR1 locus to identify cis-acting elements essential for the process. We show that the leader and a single repeat of the CRISPR locus are sufficient for adaptation in this system. Moreover, we identified a leader sequence element capable of stimulating adaptation at a dormant repeat. We found that sequences within 10 bp of the site of integration, in both the leader and repeat of the CRISPR, are required for the process. Our results indicate that information at the CRISPR leader-repeat junction is critical for adaptation in this Type II-A system and likely other CRISPR-Cas systems. PMID:25589547
Transposable elements and G-quadruplexes.
Kejnovsky, Eduard; Tokan, Viktor; Lexa, Matej
2015-09-01
A significant part of eukaryotic genomes is formed by transposable elements (TEs) containing not only genes but also regulatory sequences. Some of the regulatory sequences located within TEs can form secondary structures like hairpins or three-stranded (triplex DNA) and four-stranded (quadruplex DNA) conformations. This review focuses on recent evidence showing that G-quadruplex-forming sequences in particular are often present in specific parts of TEs in plants and humans. We discuss the potential role of these structures in the TE life cycle as well as the impact of G-quadruplexes on replication, transcription, translation, chromatin status, and recombination. The aim of this review is to emphasize that TEs may serve as vehicles for the genomic spread of G-quadruplexes. These non-canonical DNA structures and their conformational switches may constitute another regulatory system that, together with small and long non-coding RNA molecules and proteins, contribute to the complex cellular network resulting in the large diversity of eukaryotes.
Dissection of affinity captured LINE-1 macromolecular complexes
Mita, Paolo; Jiang, Hua; Adney, Emily M; Wudzinska, Aleksandra; Badri, Sana; Ischenko, Dmitry; Eng, George; Burns, Kathleen H; Fenyö, David; Chait, Brian T; Alexeev, Dmitry; Rout, Michael P; Boeke, Jef D
2018-01-01
Long Interspersed Nuclear Element-1 (LINE-1, L1) is a mobile genetic element active in human genomes. L1-encoded ORF1 and ORF2 proteins bind L1 RNAs, forming ribonucleoproteins (RNPs). These RNPs interact with diverse host proteins, some repressive and others required for the L1 lifecycle. Using differential affinity purifications, quantitative mass spectrometry, and next generation RNA sequencing, we have characterized the proteins and nucleic acids associated with distinctive, enzymatically active L1 macromolecular complexes. Among them, we describe a cytoplasmic intermediate that we hypothesize to be the canonical ORF1p/ORF2p/L1-RNA-containing RNP, and we describe a nuclear population containing ORF2p, but lacking ORF1p, which likely contains host factors participating in target-primed reverse transcription. PMID:29309035
Rizzon, Carène; Marais, Gabriel; Gouy, Manolo; Biémont, Christian
2002-03-01
We analyzed the distribution of 54 families of transposable elements (TEs; transposons, LTR retrotransposons, and non-LTR retrotransposons) in the chromosomes of Drosophila melanogaster, using data from the sequenced genome. The density of LTR and non-LTR retrotransposons (RNA-based elements) was high in regions with low recombination rates, but there was no clear tendency to parallel the recombination rate. However, the density of transposons (DNA-based elements) was significantly negatively correlated with recombination rate. The accumulation of TEs in regions of reduced recombination rate is compatible with selection acting against TEs, as selection is expected to be weaker in regions with lower recombination. The differences in the relationship between recombination rate and TE density that exist between chromosome arms suggest that TE distribution depends on specific characteristics of the chromosomes (chromatin structure, distribution of other sequences), the TEs themselves (transposition mechanism), and the species (reproductive system, effective population size, etc.), that have differing influences on the effect of natural selection acting against the TE insertions.
Hodgetts, Ross
2004-12-01
RNA interference might have evolved to minimize the deleterious impact of transposable elements and viruses on eukaryotic genomes, because mutations in genes within the RNAi pathway cause mobilization of transposons in nematodes and flies. Although the first examples of RNAi involved post-transcriptional gene silencing, recently the pathway has been shown to act at the transcriptional level. It does so by establishing a chromatin configuration on the target DNA that has many of the hallmarks of heterochromatin, thus preventing its transcription. Members of dispersed, repeated sequence families appear to have been utilized by the RNAi machinery to regulate nearby genes in yeast. The unusual genomic distribution of three repeated element families in the chicken, fruit-fly and nematode genomes prompts speculation that some of these repeats have been co-opted to control gene expression, either locally or over extended chromosomal domains.
Wachter, Shaun; Raghavan, Rahul; Wachter, Jenny; Minnick, Michael F
2018-04-11
Coxiella burnetii is a Gram-negative gammaproteobacterium and zoonotic agent of Q fever. C. burnetii's genome contains an abundance of pseudogenes and numerous selfish genetic elements. MITEs (miniature inverted-repeat transposable elements) are non-autonomous transposons that occur in all domains of life and are thought to be insertion sequences (ISs) that have lost their transposase function. Like most transposable elements (TEs), MITEs are thought to play an active role in evolution by altering gene function and expression through insertion and deletion activities. However, information regarding bacterial MITEs is limited. We describe two MITE families discovered during research on small non-coding RNAs (sRNAs) of C. burnetii. Two sRNAs, Cbsr3 and Cbsr13, were found to originate from a novel MITE family, termed QMITE1. Another sRNA, CbsR16, was found to originate from a separate and novel MITE family, termed QMITE2. Members of each family occur ~ 50 times within the strains evaluated. QMITE1 is a typical MITE of 300-400 bp with short (2-3 nt) direct repeats (DRs) of variable sequence and is often found overlapping annotated open reading frames (ORFs). Additionally, QMITE1 elements possess sigma-70 promoters and are transcriptionally active at several loci, potentially influencing expression of nearby genes. QMITE2 is smaller (150-190 bps), but has longer (7-11 nt) DRs of variable sequences and is mainly found in the 3' untranslated region of annotated ORFs and intergenic regions. QMITE2 contains a GTAG repetitive extragenic palindrome (REP) that serves as a target for IS1111 TE insertion. Both QMITE1 and QMITE2 display inter-strain linkage and sequence conservation, suggesting that they are adaptive and existed before divergence of C. burnetii strains. We have discovered two novel MITE families of C. burnetii. Our finding that MITEs serve as a source for sRNAs is novel. QMITE2 has a unique structure and occurs in large or small versions with unique DRs that display linkage and sequence conservation between strains, allowing for tracking of genomic rearrangements. QMITE1 and QMITE2 copies are hypothesized to influence expression of neighboring genes involved in DNA repair and virulence through transcriptional interference and ribonuclease processing.
A proposal to rename the hyperthermophile Pyrococcus woesei as Pyrococcus furiosus subsp. woesei.
Kanoksilapatham, Wirojne; González, Juan M; Maeder, Dennis L; DiRuggiero, Jocelyne; Robb, Frank T
2004-10-01
Pyrococcus species are hyperthermophilic members of the order Thermococcales, with optimal growth temperatures approaching 100 degrees C. All species grow heterotrophically and produce H2 or, in the presence of elemental sulfur (S(o)), H2S. Pyrococcus woesei and P. furiosus were isolated from marine sediments at the same Vulcano Island beach site and share many morphological and physiological characteristics. We report here that the rDNA operons of these strains have identical sequences, including their intergenic spacer regions and part of the 23S rRNA. Both species grow rapidly and produce H2 in the presence of 0.1% maltose and 10-100 microM sodium tungstate in S(o)-free medium. However, P. woesei shows more extensive autolysis than P. furiosus in the stationary phase. Pyrococcus furiosus and P. woesei share three closely related families of insertion sequences (ISs). A Southern blot performed with IS probes showed extensive colinearity between the genomes of P. woesei and P. furiosus. Cloning and sequencing of ISs that were in different contexts in P. woesei and P. furiosus revealed that the napA gene in P. woesei is disrupted by a type III IS element, whereas in P. furiosus, this gene is intact. A type I IS element, closely linked to the napA gene, was observed in the same context in both P. furiosus and P. woesei genomes. Our results suggest that the IS elements are implicated in genomic rearrangements and reshuffling in these closely related strains. We propose to rename P. woesei a subspecies of P. furiosus based on their identical rDNA operon sequences, many common IS elements that are shared genomic markers, and the observation that all P. woesei nucleotide sequences deposited in GenBank to date are > 99% identical to P. furiosus sequences.
SpliceDisease database: linking RNA splicing and disease.
Wang, Juan; Zhang, Jie; Li, Kaibo; Zhao, Wei; Cui, Qinghua
2012-01-01
RNA splicing is an important aspect of gene regulation in many organisms. Splicing of RNA is regulated by complicated mechanisms involving numerous RNA-binding proteins and the intricate network of interactions among them. Mutations in cis-acting splicing elements or its regulatory proteins have been shown to be involved in human diseases. Defects in pre-mRNA splicing process have emerged as a common disease-causing mechanism. Therefore, a database integrating RNA splicing and disease associations would be helpful for understanding not only the RNA splicing but also its contribution to disease. In SpliceDisease database, we manually curated 2337 splicing mutation disease entries involving 303 genes and 370 diseases, which have been supported experimentally in 898 publications. The SpliceDisease database provides information including the change of the nucleotide in the sequence, the location of the mutation on the gene, the reference Pubmed ID and detailed description for the relationship among gene mutations, splicing defects and diseases. We standardized the names of the diseases and genes and provided links for these genes to NCBI and UCSC genome browser for further annotation and genomic sequences. For the location of the mutation, we give direct links of the entry to the respective position/region in the genome browser. The users can freely browse, search and download the data in SpliceDisease at http://cmbi.bjmu.edu.cn/sdisease.
Web-Beagle: a web server for the alignment of RNA secondary structures.
Mattei, Eugenio; Pietrosanto, Marco; Ferrè, Fabrizio; Helmer-Citterich, Manuela
2015-07-01
Web-Beagle (http://beagle.bio.uniroma2.it) is a web server for the pairwise global or local alignment of RNA secondary structures. The server exploits a new encoding for RNA secondary structure and a substitution matrix of RNA structural elements to perform RNA structural alignments. The web server allows the user to compute up to 10 000 alignments in a single run, taking as input sets of RNA sequences and structures or primary sequences alone. In the latter case, the server computes the secondary structure prediction for the RNAs on-the-fly using RNAfold (free energy minimization). The user can also compare a set of input RNAs to one of five pre-compiled RNA datasets including lncRNAs and 3' UTRs. All types of comparison produce in output the pairwise alignments along with structural similarity and statistical significance measures for each resulting alignment. A graphical color-coded representation of the alignments allows the user to easily identify structural similarities between RNAs. Web-Beagle can be used for finding structurally related regions in two or more RNAs, for the identification of homologous regions or for functional annotation. Benchmark tests show that Web-Beagle has lower computational complexity, running time and better performances than other available methods. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Wada, K; Wada, Y; Iwasaki, Y; Ikemura, T
2017-10-01
Oligonucleotides are key elements of nucleic acid therapeutics such as small interfering RNAs (siRNAs). Influenza and Ebolaviruses are zoonotic RNA viruses mutating very rapidly, and their sequence changes must be characterized intensively to design therapeutic oligonucleotides with long utility. Focusing on a total of 182 experimentally validated siRNAs for influenza A, B and Ebolaviruses compiled by the siRNA database, we conducted time-series analyses of occurrences of siRNA targets in these viral genomes. Reflecting their high mutation rates, occurrences of target oligonucleotides evidently fluctuate in viral populations and often disappear. Time-series analysis of the one-base changed sequences derived from each original target identified the oligonucleotide that shows a compensatory increase and will potentially become the 'awaiting-type oligonucleotide'; the combined use of this oligonucleotide with the original can provide therapeutics with long utility. This strategy is also useful for assigning diagnostic reverse transcription-PCR primers with long utility.
Wada, K; Wada, Y; Iwasaki, Y; Ikemura, T
2017-01-01
Oligonucleotides are key elements of nucleic acid therapeutics such as small interfering RNAs (siRNAs). Influenza and Ebolaviruses are zoonotic RNA viruses mutating very rapidly, and their sequence changes must be characterized intensively to design therapeutic oligonucleotides with long utility. Focusing on a total of 182 experimentally validated siRNAs for influenza A, B and Ebolaviruses compiled by the siRNA database, we conducted time-series analyses of occurrences of siRNA targets in these viral genomes. Reflecting their high mutation rates, occurrences of target oligonucleotides evidently fluctuate in viral populations and often disappear. Time-series analysis of the one-base changed sequences derived from each original target identified the oligonucleotide that shows a compensatory increase and will potentially become the ‘awaiting-type oligonucleotide’ the combined use of this oligonucleotide with the original can provide therapeutics with long utility. This strategy is also useful for assigning diagnostic reverse transcription-PCR primers with long utility. PMID:28905886
Miyazaki, Saori; Sato, Yutaka; Asano, Tomoya; Nagamura, Yoshiaki; Nonomura, Ken-Ichi
2015-10-01
Post-transcriptional gene regulation by RNA recognition motif (RRM) proteins through binding to cis-elements in the 3'-untranslated region (3'-UTR) is widely used in eukaryotes to complete various biological processes. Rice MEIOSIS ARRESTED AT LEPTOTENE2 (MEL2) is the RRM protein that functions in the transition to meiosis in proper timing. The MEL2 RRM preferentially associated with the U-rich RNA consensus, UUAGUU[U/A][U/G][A/U/G]U, dependently on sequences and proportionally to MEL2 protein amounts in vitro. The consensus sequences were located in the putative looped structures of the RNA ligand. A genome-wide survey revealed a tendency of MEL2-binding consensus appearing in 3'-UTR of rice genes. Of 249 genes that conserved the consensus in their 3'-UTR, 13 genes spatiotemporally co-expressed with MEL2 in meiotic flowers, and included several genes whose function was supposed in meiosis; such as Replication protein A and OsMADS3. The proteome analysis revealed that the amounts of small ubiquitin-related modifier-like protein and eukaryotic translation initiation factor3-like protein were dramatically altered in mel2 mutant anthers. Taken together with transcriptome and gene ontology results, we propose that the rice MEL2 is involved in the translational regulation of key meiotic genes on 3'-UTRs to achieve the faithful transition of germ cells to meiosis.
Deppdb--DNA electrostatic potential properties database: electrostatic properties of genome DNA.
Osypov, Alexander A; Krutinin, Gleb G; Kamzolova, Svetlana G
2010-06-01
The electrostatic properties of genome DNA influence its interactions with different proteins, in particular, the regulation of transcription by RNA-polymerases. DEPPDB--DNA Electrostatic Potential Properties Database--was developed to hold and provide all available information on the electrostatic properties of genome DNA combined with its sequence and annotation of biological and structural properties of genome elements and whole genomes. Genomes in DEPPDB are organized on a taxonomical basis. Currently, the database contains all the completely sequenced bacterial and viral genomes according to NCBI RefSeq. General properties of the genome DNA electrostatic potential profile and principles of its formation are revealed. This potential correlates with the GC content but does not correspond to it exactly and strongly depends on both the sequence arrangement and its context (flanking regions). Analysis of the promoter regions for bacterial and viral RNA polymerases revealed a correspondence between the scale of these proteins' physical properties and electrostatic profile patterns. We also discovered a direct correlation between the potential value and the binding frequency of RNA polymerase to DNA, supporting the idea of the role of electrostatics in these interactions. This matches a pronounced tendency of the promoter regions to possess higher values of the electrostatic potential.
Sequence Analysis of the Genome of Carnation (Dianthus caryophyllus L.)
Yagi, Masafumi; Kosugi, Shunichi; Hirakawa, Hideki; Ohmiya, Akemi; Tanase, Koji; Harada, Taro; Kishimoto, Kyutaro; Nakayama, Masayoshi; Ichimura, Kazuo; Onozaki, Takashi; Yamaguchi, Hiroyasu; Sasaki, Nobuhiro; Miyahara, Taira; Nishizaki, Yuzo; Ozeki, Yoshihiro; Nakamura, Noriko; Suzuki, Takamasa; Tanaka, Yoshikazu; Sato, Shusei; Shirasawa, Kenta; Isobe, Sachiko; Miyamura, Yoshinori; Watanabe, Akiko; Nakayama, Shinobu; Kishida, Yoshie; Kohara, Mitsuyo; Tabata, Satoshi
2014-01-01
The whole-genome sequence of carnation (Dianthus caryophyllus L.) cv. ‘Francesco’ was determined using a combination of different new-generation multiplex sequencing platforms. The total length of the non-redundant sequences was 568 887 315 bp, consisting of 45 088 scaffolds, which covered 91% of the 622 Mb carnation genome estimated by k-mer analysis. The N50 values of contigs and scaffolds were 16 644 bp and 60 737 bp, respectively, and the longest scaffold was 1 287 144 bp. The average GC content of the contig sequences was 36%. A total of 1050, 13, 92 and 143 genes for tRNAs, rRNAs, snoRNA and miRNA, respectively, were identified in the assembled genomic sequences. For protein-encoding genes, 43 266 complete and partial gene structures excluding those in transposable elements were deduced. Gene coverage was ∼98%, as deduced from the coverage of the core eukaryotic genes. Intensive characterization of the assigned carnation genes and comparison with those of other plant species revealed characteristic features of the carnation genome. The results of this study will serve as a valuable resource for fundamental and applied research of carnation, especially for breeding new carnation varieties. Further information on the genomic sequences is available at http://carnation.kazusa.or.jp. PMID:24344172
Wiegand, Sandra; Dietrich, Sascha; Hertel, Robert; Bongaerts, Johannes; Evers, Stefan; Volland, Sonja; Daniel, Rolf; Liesegang, Heiko
2013-10-01
The production of enzymes by an industrial strain requires a complex adaption of the bacterial metabolism to the conditions within the fermenter. Regulatory events within the process result in a dynamic change of the transcriptional activity of the genome. This complex network of genes is orchestrated by proteins as well as regulatory RNA elements. Here we present an RNA-Seq based study considering selected phases of an industry-oriented fermentation of Bacillus licheniformis. A detailed analysis of 20 strand-specific RNA-Seq datasets revealed a multitude of transcriptionally active genomic regions. 3314 RNA features encoded by such active loci have been identified and sorted into ten functional classes. The identified sequences include the expected RNA features like housekeeping sRNAs, metabolic riboswitches and RNA switches well known from studies on Bacillus subtilis as well as a multitude of completely new candidates for regulatory RNAs. An unexpectedly high number of 855 RNA features are encoded antisense to annotated protein and RNA genes, in addition to 461 independently transcribed small RNAs. These antisense transcripts contain molecules with a remarkable size range variation from 38 to 6348 base pairs in length. The genome of the type strain B. licheniformis DSM13 was completely reannotated using data obtained from RNA-Seq analyses and from public databases. The hereby generated data-sets represent a solid amount of knowledge on the dynamic transcriptional activities during the investigated fermentation stages. The identified regulatory elements enable research on the understanding and the optimization of crucial metabolic activities during a productive fermentation of Bacillus licheniformis strains.
Exosomes Derived from HIV-1-infected Cells Contain Trans-activation Response Element RNA*
Narayanan, Aarthi; Iordanskiy, Sergey; Das, Ravi; Van Duyne, Rachel; Santos, Steven; Jaworski, Elizabeth; Guendel, Irene; Sampey, Gavin; Dalby, Elizabeth; Iglesias-Ussel, Maria; Popratiloff, Anastas; Hakami, Ramin; Kehn-Hall, Kylene; Young, Mary; Subra, Caroline; Gilbert, Caroline; Bailey, Charles; Romerio, Fabio; Kashanchi, Fatah
2013-01-01
Exosomes are nano-sized vesicles produced by healthy and virus-infected cells. Exosomes derived from infected cells have been shown to contain viral microRNAs (miRNAs). HIV-1 encodes its own miRNAs that regulate viral and host gene expression. The most abundant HIV-1-derived miRNA, first reported by us and later by others using deep sequencing, is the trans-activation response element (TAR) miRNA. In this study, we demonstrate the presence of TAR RNA in exosomes from cell culture supernatants of HIV-1-infected cells and patient sera. TAR miRNA was not in Ago2 complexes outside the exosomes but enclosed within the exosomes. We detected the host miRNA machinery proteins Dicer and Drosha in exosomes from infected cells. We report that transport of TAR RNA from the nucleus into exosomes is a CRM1 (chromosome region maintenance 1)-dependent active process. Prior exposure of naive cells to exosomes from infected cells increased susceptibility of the recipient cells to HIV-1 infection. Exosomal TAR RNA down-regulated apoptosis by lowering Bim and Cdk9 proteins in recipient cells. We found 104–106 copies/ml TAR RNA in exosomes derived from infected culture supernatants and 103 copies/ml TAR RNA in the serum exosomes of highly active antiretroviral therapy-treated patients or long term nonprogressors. Taken together, our experiments demonstrated that HIV-1-infected cells produced exosomes that are uniquely characterized by their proteomic and RNA profiles that may contribute to disease pathology in AIDS. PMID:23661700
Transcriptional activity of transposable elements in coelacanth.
Forconi, Mariko; Chalopin, Domitille; Barucca, Marco; Biscotti, Maria Assunta; De Moro, Gianluca; Galiana, Delphine; Gerdol, Marco; Pallavicini, Alberto; Canapa, Adriana; Olmo, Ettore; Volff, Jean-Nicolas
2014-09-01
The morphological stasis of coelacanths has long suggested a slow evolutionary rate. General genomic stasis might also imply a decrease of transposable elements activity. To evaluate the potential activity of transposable elements (TEs) in "living fossil" species, transcriptomic data of Latimeria chalumnae and its Indonesian congener Latimeria menadoensis were compared through the RNA-sequencing mapping procedures in three different organs (liver, testis, and muscle). The analysis of coelacanth transcriptomes highlights a significant percentage of transcribed TEs in both species. Major contributors are LINE retrotransposons, especially from the CR1 family. Furthermore, some particular elements such as a LF-SINE and a LINE2 sequences seem to be more expressed than other elements. The amount of TEs expressed in testis suggests possible transposition burst in incoming generations. Moreover, significant amount of TEs in liver and muscle transcriptomes were also observed. Analyses of elements displaying marked organ-specific expression gave us the opportunity to highlight exaptation cases, that is, the recruitment of TEs as new cellular genes, but also to identify a new Latimeria-specific family of Short Interspersed Nuclear Elements called CoeG-SINEs. Overall, transcriptome results do not seem to be in line with a slow-evolving genome with poor TE activity. © 2013 Wiley Periodicals, Inc.
Alu expression in human cell lines and their retrotranspositional potential.
Oler, Andrew J; Traina-Dorge, Stephen; Derbes, Rebecca S; Canella, Donatella; Cairns, Brad R; Roy-Engel, Astrid M
2012-06-20
The vast majority of the 1.1 million Alu elements are retrotranspositionally inactive, where only a few loci referred to as 'source elements' can generate new Alu insertions. The first step in identifying the active Alu sources is to determine the loci transcribed by RNA polymerase III (pol III). Previous genome-wide analyses from normal and transformed cell lines identified multiple Alu loci occupied by pol III factors, making them candidate source elements. Analysis of the data from these genome-wide studies determined that the majority of pol III-bound Alus belonged to the older subfamilies Alu S and Alu J, which varied between cell lines from 62.5% to 98.7% of the identified loci. The pol III-bound Alus were further scored for estimated retrotransposition potential (ERP) based on the absence or presence of selected sequence features associated with Alu retrotransposition capability. Our analyses indicate that most of the pol III-bound Alu loci candidates identified lack the sequence characteristics important for retrotransposition. These data suggest that Alu expression likely varies by cell type, growth conditions and transformation state. This variation could extend to where the same cell lines in different laboratories present different Alu expression patterns. The vast majority of Alu loci potentially transcribed by RNA pol III lack important sequence features for retrotransposition and the majority of potentially active Alu loci in the genome (scored high ERP) belong to young Alu subfamilies. Our observations suggest that in an in vivo scenario, the contribution of Alu activity on somatic genetic damage may significantly vary between individuals and tissues.
Parker, Brian J; Moltke, Ida; Roth, Adam; Washietl, Stefan; Wen, Jiayu; Kellis, Manolis; Breaker, Ronald; Pedersen, Jakob Skou
2011-11-01
Regulatory RNA structures are often members of families with multiple paralogous instances across the genome. Family members share functional and structural properties, which allow them to be studied as a whole, facilitating both bioinformatic and experimental characterization. We have developed a comparative method, EvoFam, for genome-wide identification of families of regulatory RNA structures, based on primary sequence and secondary structure similarity. We apply EvoFam to a 41-way genomic vertebrate alignment. Genome-wide, we identify 220 human, high-confidence families outside protein-coding regions comprising 725 individual structures, including 48 families with known structural RNA elements. Known families identified include both noncoding RNAs, e.g., miRNAs and the recently identified MALAT1/MEN β lincRNA family; and cis-regulatory structures, e.g., iron-responsive elements. We also identify tens of new families supported by strong evolutionary evidence and other statistical evidence, such as GO term enrichments. For some of these, detailed analysis has led to the formulation of specific functional hypotheses. Examples include two hypothesized auto-regulatory feedback mechanisms: one involving six long hairpins in the 3'-UTR of MAT2A, a key metabolic gene that produces the primary human methyl donor S-adenosylmethionine; the other involving a tRNA-like structure in the intron of the tRNA maturation gene POP1. We experimentally validate the predicted MAT2A structures. Finally, we identify potential new regulatory networks, including large families of short hairpins enriched in immunity-related genes, e.g., TNF, FOS, and CTLA4, which include known transcript destabilizing elements. Our findings exemplify the diversity of post-transcriptional regulation and provide a resource for further characterization of new regulatory mechanisms and families of noncoding RNAs.
Deciphering the role of the Gag-Pol ribosomal frameshift signal in HIV-1 RNA genome packaging.
Nikolaitchik, Olga A; Hu, Wei-Shau
2014-04-01
A key step of retroviral replication is packaging of the viral RNA genome during virus assembly. Specific packaging is mediated by interactions between the viral protein Gag and elements in the viral RNA genome. In HIV-1, similar to most retroviruses, the packaging signal is located within the 5' untranslated region and extends into the gag-coding region. A recent study reported that a region including the Gag-Pol ribosomal frameshift signal plays an important role in HIV-1 RNA packaging; deletions or mutations that affect the RNA structure of this signal lead to drastic decreases (10- to 50-fold) in viral RNA packaging and virus titer. We examined here the role of the ribosomal frameshift signal in HIV-1 RNA packaging by studying the RNA packaging and virus titer in the context of proviruses. Three mutants with altered ribosomal frameshift signal, either through direct deletion of the signal, mutation of the 6U slippery sequence, or alterations of the secondary structure were examined. We found that RNAs from all three mutants were packaged efficiently, and they generate titers similar to that of a virus containing the wild-type ribosomal frameshift signal. We conclude that although the ribosomal frameshift signal plays an important role in regulating the replication cycle, this RNA element is not directly involved in regulating RNA encapsidation. To generate infectious viruses, HIV-1 must package viral RNA genome during virus assembly. The specific HIV-1 genome packaging is mediated by interactions between the structural protein Gag and elements near the 5' end of the viral RNA known as packaging signal. In this study, we examined whether the Gag-Pol ribosomal frameshift signal is important for HIV-1 RNA packaging as recently reported. Our results demonstrated that when Gag/Gag-Pol is supplied in trans, none of the tested ribosomal frameshift signal mutants has defects in RNA packaging or virus titer. These studies provide important information on how HIV-1 regulates its genome packaging and generate infectious viruses necessary for transmission to new hosts.
Deciphering the Role of the Gag-Pol Ribosomal Frameshift Signal in HIV-1 RNA Genome Packaging
Nikolaitchik, Olga A.
2014-01-01
ABSTRACT A key step of retroviral replication is packaging of the viral RNA genome during virus assembly. Specific packaging is mediated by interactions between the viral protein Gag and elements in the viral RNA genome. In HIV-1, similar to most retroviruses, the packaging signal is located within the 5′ untranslated region and extends into the gag-coding region. A recent study reported that a region including the Gag-Pol ribosomal frameshift signal plays an important role in HIV-1 RNA packaging; deletions or mutations that affect the RNA structure of this signal lead to drastic decreases (10- to 50-fold) in viral RNA packaging and virus titer. We examined here the role of the ribosomal frameshift signal in HIV-1 RNA packaging by studying the RNA packaging and virus titer in the context of proviruses. Three mutants with altered ribosomal frameshift signal, either through direct deletion of the signal, mutation of the 6U slippery sequence, or alterations of the secondary structure were examined. We found that RNAs from all three mutants were packaged efficiently, and they generate titers similar to that of a virus containing the wild-type ribosomal frameshift signal. We conclude that although the ribosomal frameshift signal plays an important role in regulating the replication cycle, this RNA element is not directly involved in regulating RNA encapsidation. IMPORTANCE To generate infectious viruses, HIV-1 must package viral RNA genome during virus assembly. The specific HIV-1 genome packaging is mediated by interactions between the structural protein Gag and elements near the 5′ end of the viral RNA known as packaging signal. In this study, we examined whether the Gag-Pol ribosomal frameshift signal is important for HIV-1 RNA packaging as recently reported. Our results demonstrated that when Gag/Gag-Pol is supplied in trans, none of the tested ribosomal frameshift signal mutants has defects in RNA packaging or virus titer. These studies provide important information on how HIV-1 regulates its genome packaging and generate infectious viruses necessary for transmission to new hosts. PMID:24453371
Novel Structure of Ty3 Reverse Transcriptase | Center for Cancer Research
Retrotransposons are mobile genetic elements that self amplify via a single-stranded RNA intermediate, which is converted to double-stranded DNA by an encoded reverse transcriptase (RT) with both DNA polymerase (pol) and ribonuclease H (RNase) activities. Categorized by whether they contain flanking long terminal repeat (LTR) sequences, retrotransposons play a critical role in
Recruitment of CRISPR-Cas systems by Tn7-like transposons.
Peters, Joseph E; Makarova, Kira S; Shmakov, Sergey; Koonin, Eugene V
2017-08-29
A survey of bacterial and archaeal genomes shows that many Tn7-like transposons contain minimal type I-F CRISPR-Cas systems that consist of fused cas8f and cas5f , cas7f , and cas6f genes and a short CRISPR array. Several small groups of Tn7-like transposons encompass similarly truncated type I-B CRISPR-Cas. This minimal gene complement of the transposon-associated CRISPR-Cas systems implies that they are competent for pre-CRISPR RNA (precrRNA) processing yielding mature crRNAs and target binding but not target cleavage that is required for interference. Phylogenetic analysis demonstrates that evolution of the CRISPR-Cas-containing transposons included a single, ancestral capture of a type I-F locus and two independent instances of type I-B loci capture. We show that the transposon-associated CRISPR arrays contain spacers homologous to plasmid and temperate phage sequences and, in some cases, chromosomal sequences adjacent to the transposon. We hypothesize that the transposon-encoded CRISPR-Cas systems generate displacement (R-loops) in the cognate DNA sites, targeting the transposon to these sites and thus facilitating their spread via plasmids and phages. These findings suggest the existence of RNA-guided transposition and fit the guns-for-hire concept whereby mobile genetic elements capture host defense systems and repurpose them for different stages in the life cycle of the element.
Krishna, Srikar; Nair, Aparna; Cheedipudi, Sirisha; Poduval, Deepak; Dhawan, Jyotsna; Palakodeti, Dasaradhi; Ghanekar, Yashoda
2013-01-07
Small non-coding RNAs such as miRNAs, piRNAs and endo-siRNAs fine-tune gene expression through post-transcriptional regulation, modulating important processes in development, differentiation, homeostasis and regeneration. Using deep sequencing, we have profiled small non-coding RNAs in Hydra magnipapillata and investigated changes in small RNA expression pattern during head regeneration. Our results reveal a unique repertoire of small RNAs in hydra. We have identified 126 miRNA loci; 123 of these miRNAs are unique to hydra. Less than 50% are conserved across two different strains of Hydra vulgaris tested in this study, indicating a highly diverse nature of hydra miRNAs in contrast to bilaterian miRNAs. We also identified siRNAs derived from precursors with perfect stem-loop structure and that arise from inverted repeats. piRNAs were the most abundant small RNAs in hydra, mapping to transposable elements, the annotated transcriptome and unique non-coding regions on the genome. piRNAs that map to transposable elements and the annotated transcriptome display a ping-pong signature. Further, we have identified several miRNAs and piRNAs whose expression is regulated during hydra head regeneration. Our study defines different classes of small RNAs in this cnidarian model system, which may play a role in orchestrating gene expression essential for hydra regeneration.
Krishna, Srikar; Nair, Aparna; Cheedipudi, Sirisha; Poduval, Deepak; Dhawan, Jyotsna; Palakodeti, Dasaradhi; Ghanekar, Yashoda
2013-01-01
Small non-coding RNAs such as miRNAs, piRNAs and endo-siRNAs fine-tune gene expression through post-transcriptional regulation, modulating important processes in development, differentiation, homeostasis and regeneration. Using deep sequencing, we have profiled small non-coding RNAs in Hydra magnipapillata and investigated changes in small RNA expression pattern during head regeneration. Our results reveal a unique repertoire of small RNAs in hydra. We have identified 126 miRNA loci; 123 of these miRNAs are unique to hydra. Less than 50% are conserved across two different strains of Hydra vulgaris tested in this study, indicating a highly diverse nature of hydra miRNAs in contrast to bilaterian miRNAs. We also identified siRNAs derived from precursors with perfect stem–loop structure and that arise from inverted repeats. piRNAs were the most abundant small RNAs in hydra, mapping to transposable elements, the annotated transcriptome and unique non-coding regions on the genome. piRNAs that map to transposable elements and the annotated transcriptome display a ping–pong signature. Further, we have identified several miRNAs and piRNAs whose expression is regulated during hydra head regeneration. Our study defines different classes of small RNAs in this cnidarian model system, which may play a role in orchestrating gene expression essential for hydra regeneration. PMID:23166307
Switzer, Blum J.; Stolz, J.F.; Oren, A.; Oremland, R.S.
2001-01-01
We isolated an obligately anaerobic halophilic bacterium from the Dead Sea that grew by respiration of selenate. The isolate, designated strain DSSe-1, was a gram-negative, non-motile rod. It oxidized glycerol or glucose to acetate+CO2 with concomitant reduction of selenate to selenite plus elemental selenium. Other electron acceptors that supported anaerobic growth on glycerol were nitrate and trimethylamine-N-oxide; nitrite, arsenate, fumarate, dimethylsulfoxide, thiosulfate, elemental sulfur, sulfite or sulfate could not serve as electron acceptors. Growth on glycerol in the presence of nitrate occurred over a salinity range from 100 to 240 g/l, with an optimum at 210 g/l. Analysis of the 16S rRNA gene sequence suggests that strain DSSe-1 belongs to the order Halanaerobiales, an order of halophilic anaerobes with a fermentative or homoacetogenic metabolism, in which anaerobic respiratory metabolism has never been documented. The highest 16S rRNA sequence similarity (90%) was found with Acetohalobium arabaticum (X89077). On the basis of physiological properties as well as the relatively low homology of 16S rRNA from strain DSSe-1 with known genera, classification in a new genus within the order Halanaerobiales, family Halobacteroidaceae is warranted. We propose the name Selenihalanaerobacter shriftii. Type strain is strain DSSe-1 (ATCC accession number BAA-73).
Cis-acting elements in its 3′ UTR mediate post-transcriptional regulation of KRAS
Kim, Minlee; Kogan, Nicole; Slack, Frank J.
2016-01-01
Multiple RNA-binding proteins and non-coding RNAs, such as microRNAs (miRNAs), are involved in post-transcriptional gene regulation through recognition motifs in the 3′ untranslated region (UTR) of their target genes. The KRAS gene encodes a key signaling protein, and its messenger RNA (mRNA) contains an exceptionally long 3′ UTR; this suggests that it may be subject to a highly complex set of regulatory processes. However, 3′ UTR-dependent regulation of KRAS expression has not been explored in detail. Using extensive deletion and mutational analyses combined with luciferase reporter assays, we have identified inhibitory and stabilizing cis-acting regions within the KRAS 3′ UTR that may interact with miRNAs and RNA-binding proteins, such as HuR. Particularly, we have identified an AU-rich 49-nt fragment in the KRAS 3′ UTR that is required for KRAS 3′ UTR reporter repression. This element contains a miR-185 complementary element, and we show that overexpression of miR-185 represses endogenous KRAS mRNA and protein in vitro. In addition, we have identified another 49-nt fragment that is required to promote KRAS 3′ UTR reporter expression. These findings indicate that multiple cis-regulatory motifs in the 3′ UTR of KRAS finely modulate its expression, and sequence alterations within a binding motif may disrupt the precise functions of trans-regulatory factors, potentially leading to aberrant KRAS expression. PMID:26930719
Mao, Hongliang; Wang, Hao
2017-03-01
Short Interspersed Nuclear Elements (SINEs) are transposable elements (TEs) that amplify through a copy-and-paste mode via RNA intermediates. The computational identification of new SINEs are challenging because of their weak structural signals and rapid diversification in sequences. Here we report SINE_Scan, a highly efficient program to predict SINE elements in genomic DNA sequences. SINE_Scan integrates hallmark of SINE transposition, copy number and structural signals to identify a SINE element. SINE_Scan outperforms the previously published de novo SINE discovery program. It shows high sensitivity and specificity in 19 plant and animal genome assemblies, of which sizes vary from 120 Mb to 3.5 Gb. It identifies numerous new families and substantially increases the estimation of the abundance of SINEs in these genomes. The code of SINE_Scan is freely available at http://github.com/maohlzj/SINE_Scan , implemented in PERL and supported on Linux. wangh8@fudan.edu.cn. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
Mao, Hongliang
2017-01-01
Abstract Motivation: Short Interspersed Nuclear Elements (SINEs) are transposable elements (TEs) that amplify through a copy-and-paste mode via RNA intermediates. The computational identification of new SINEs are challenging because of their weak structural signals and rapid diversification in sequences. Results: Here we report SINE_Scan, a highly efficient program to predict SINE elements in genomic DNA sequences. SINE_Scan integrates hallmark of SINE transposition, copy number and structural signals to identify a SINE element. SINE_Scan outperforms the previously published de novo SINE discovery program. It shows high sensitivity and specificity in 19 plant and animal genome assemblies, of which sizes vary from 120 Mb to 3.5 Gb. It identifies numerous new families and substantially increases the estimation of the abundance of SINEs in these genomes. Availability and Implementation: The code of SINE_Scan is freely available at http://github.com/maohlzj/SINE_Scan, implemented in PERL and supported on Linux. Contact: wangh8@fudan.edu.cn Supplementary information: Supplementary data are available at Bioinformatics online. PMID:28062442
Liu, Wendy Y Y; Ridgway, Hayley J; James, Trevor K; James, Euan K; Chen, Wen-Ming; Sprent, Janet I; Young, J Peter W; Andrews, Mitchell
2014-10-01
The South African invasive legume Dipogon lignosus (Phaseoleae) produces nodules with both determinate and indeterminate characteristics in New Zealand (NZ) soils. Ten bacterial isolates produced functional nodules on D. lignosus. The 16S ribosomal RNA (rRNA) gene sequences identified one isolate as Bradyrhizobium sp., one isolate as Rhizobium sp. and eight isolates as Burkholderia sp. The Bradyrhizobium sp. and Rhizobium sp. 16S rRNA sequences were identical to those of strains previously isolated from crop plants and may have originated from inocula used on crops. Both 16S rRNA and DNA recombinase A (recA) gene sequences placed the eight Burkholderia isolates separate from previously described Burkholderia rhizobial species. However, the isolates showed a very close relationship to Burkholderia rhizobial strains isolated from South African plants with respect to their nitrogenase iron protein (nifH), N-acyltransferase nodulation protein A (nodA) and N-acetylglucosaminyl transferase nodulation protein C (nodC) gene sequences. Gene sequences and enterobacterial repetitive intergenic consensus (ERIC) PCR and repetitive element palindromic PCR (rep-PCR) banding patterns indicated that the eight Burkholderia isolates separated into five clones of one strain and three of another. One strain was tested and shown to produce functional nodules on a range of South African plants previously reported to be nodulated by Burkholderia tuberum STM678(T) which was isolated from the Cape Region. Thus, evidence is strong that the Burkholderia strains isolated here originated in South Africa and were somehow transported with the plants from their native habitat to NZ. It is possible that the strains are of a new species capable of nodulating legumes.
Structural landscape of base pairs containing post-transcriptional modifications in RNA
Seelam, Preethi P.; Sharma, Purshotam
2017-01-01
Base pairs involving post-transcriptionally modified nucleobases are believed to play important roles in a wide variety of functional RNAs. Here we present our attempts toward understanding the structural and functional role of naturally occurring modified base pairs using a combination of X-ray crystal structure database analysis, sequence analysis, and advanced quantum chemical methods. Our bioinformatics analysis reveals that despite their presence in all major secondary structural elements, modified base pairs are most prevalent in tRNA crystal structures and most commonly involve guanine or uridine modifications. Further, analysis of tRNA sequences reveals additional examples of modified base pairs at structurally conserved tRNA regions and highlights the conservation patterns of these base pairs in three domains of life. Comparison of structures and binding energies of modified base pairs with their unmodified counterparts, using quantum chemical methods, allowed us to classify the base modifications in terms of the nature of their electronic structure effects on base-pairing. Analysis of specific structural contexts of modified base pairs in RNA crystal structures revealed several interesting scenarios, including those at the tRNA:rRNA interface, antibiotic-binding sites on the ribosome, and the three-way junctions within tRNA. These scenarios, when analyzed in the context of available experimental data, allowed us to correlate the occurrence and strength of modified base pairs with their specific functional roles. Overall, our study highlights the structural importance of modified base pairs in RNA and points toward the need for greater appreciation of the role of modified bases and their interactions, in the context of many biological processes involving RNA. PMID:28341704
Yordanova, Martina M; Wu, Cheng; Andreev, Dmitry E; Sachs, Matthew S; Atkins, John F
2015-07-17
The protein antizyme is a negative regulator of cellular polyamine concentrations from yeast to mammals. Synthesis of functional antizyme requires programmed +1 ribosomal frameshifting at the 3' end of the first of two partially overlapping ORFs. The frameshift is the sensor and effector in an autoregulatory circuit. Except for Saccharomyces cerevisiae antizyme mRNA, the frameshift site alone only supports low levels of frameshifting. The high levels usually observed depend on the presence of cis-acting stimulatory elements located 5' and 3' of the frameshift site. Antizyme genes from different evolutionary branches have evolved different stimulatory elements. Prior and new multiple alignments of fungal antizyme mRNA sequences from the Agaricomycetes class of Basidiomycota show a distinct pattern of conservation 5' of the frameshift site consistent with a function at the amino acid level. As shown here when tested in Schizosaccharomyces pombe and mammalian HEK293T cells, the 5' part of this conserved sequence acts at the nascent peptide level to stimulate the frameshifting, without involving stalling detectable by toe-printing. However, the peptide is only part of the signal. The 3' part of the stimulator functions largely independently and acts at least mostly at the nucleotide level. When polyamine levels were varied, the stimulatory effect was seen to be especially responsive in the endogenous polyamine concentration range, and this effect may be more general. A conserved RNA secondary structure 3' of the frameshift site has weaker stimulatory and polyamine sensitizing effects on frameshifting. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.
Man, Michal; Epel, Bernard L
2004-06-01
A replicon based on Tobacco mosaic virus that was engineered to express the open reading frame (ORF) of the green fluorescent protein (GFP) gene in place of the native coat protein (CP) gene from a minimal CP subgenomic (sg) RNA promoter was found to accumulate very low levels of GFP. Regulatory regions within the CP ORF were identified that, when presented as untranslated regions flanking the GFP ORF, enhanced or inhibited sg transcription and GFP expression. Full GFP expression from the CP sgRNA promoter required more than the first 20 nt of the CP ORF but not beyond the first 56 nt. Further analysis indicated the presence of an enhancer element between nt +25 and +55 with respect to the CP translation start site. The inclusion of this enhancer sequence upstream of the GFP ORF led to elevated sg transcription and to a 50-fold increase in GFP accumulation in comparison with a minimal CP promoter in which the entire CP ORF was displaced by the GFP ORF. Inclusion of the 3'-terminal 22 nt had a minor positive effect on GFP accumulation, but the addition of extended untranslated sequences from the 3' terminus of the CP ORF downstream of the GFP ORF was basically found to inhibit sg transcription. Secondary structure analysis programs predicted the CP sgRNA promoter to reside within two stable stem-loop structures, which are followed by an enhancer region.
Pivotal Impacts of Retrotransposon Based Invasive RNAs on Evolution.
Habibi, Laleh; Salmani, Hamzeh
2017-01-01
RNAs have long been described as the mediators of gene expression; they play a vital role in the structure and function of cellular complexes. Although the role of RNAs in the prokaryotes is mainly confined to these basic functions, the effects of these molecules in regulating the gene expression and enzymatic activities have been discovered in eukaryotes. Recently, a high-resolution analysis of the DNA obtained from different organisms has revealed a fundamental impact of the RNAs in shaping the genomes, heterochromatin formation, and gene creation. Deep sequencing of the human genome revealed that about half of our DNA is comprised of repetitive sequences (remnants of transposable element movements) expanded mostly through RNA-mediated processes. ORF2 encoded by L1 retrotransposons is a cellular reverse transcriptase which is mainly responsible for RNA invasion of various transposable elements (L1s, Alus, and SVAs) and cellular mRNAs in to the genomic DNA. In addition to increasing retroelements copy number; genomic expansion in association with centromere, telomere, and heterochromatin formation as well as pseudogene creation are the evolutionary consequences of this RNA-based activity. Threatening DNA integrity by disrupting the genes and forming excessive double strand breaks is another effect of this invasion. Therefore, repressive mechanisms have been evolved to control the activities of these invasive intracellular RNAs. All these mechanisms now have essential roles in the complex cellular functions. Therefore, it can be concluded that without direct action of RNA networks in shaping the genome and in the development of different cellular mechanisms, the evolution of higher eukaryotes would not be possible.
Pivotal Impacts of Retrotransposon Based Invasive RNAs on Evolution
Habibi, Laleh; Salmani, Hamzeh
2017-01-01
RNAs have long been described as the mediators of gene expression; they play a vital role in the structure and function of cellular complexes. Although the role of RNAs in the prokaryotes is mainly confined to these basic functions, the effects of these molecules in regulating the gene expression and enzymatic activities have been discovered in eukaryotes. Recently, a high-resolution analysis of the DNA obtained from different organisms has revealed a fundamental impact of the RNAs in shaping the genomes, heterochromatin formation, and gene creation. Deep sequencing of the human genome revealed that about half of our DNA is comprised of repetitive sequences (remnants of transposable element movements) expanded mostly through RNA-mediated processes. ORF2 encoded by L1 retrotransposons is a cellular reverse transcriptase which is mainly responsible for RNA invasion of various transposable elements (L1s, Alus, and SVAs) and cellular mRNAs in to the genomic DNA. In addition to increasing retroelements copy number; genomic expansion in association with centromere, telomere, and heterochromatin formation as well as pseudogene creation are the evolutionary consequences of this RNA-based activity. Threatening DNA integrity by disrupting the genes and forming excessive double strand breaks is another effect of this invasion. Therefore, repressive mechanisms have been evolved to control the activities of these invasive intracellular RNAs. All these mechanisms now have essential roles in the complex cellular functions. Therefore, it can be concluded that without direct action of RNA networks in shaping the genome and in the development of different cellular mechanisms, the evolution of higher eukaryotes would not be possible. PMID:29067016
Efficient algorithms for probing the RNA mutation landscape.
Waldispühl, Jérôme; Devadas, Srinivas; Berger, Bonnie; Clote, Peter
2008-08-08
The diversity and importance of the role played by RNAs in the regulation and development of the cell are now well-known and well-documented. This broad range of functions is achieved through specific structures that have been (presumably) optimized through evolution. State-of-the-art methods, such as McCaskill's algorithm, use a statistical mechanics framework based on the computation of the partition function over the canonical ensemble of all possible secondary structures on a given sequence. Although secondary structure predictions from thermodynamics-based algorithms are not as accurate as methods employing comparative genomics, the former methods are the only available tools to investigate novel RNAs, such as the many RNAs of unknown function recently reported by the ENCODE consortium. In this paper, we generalize the McCaskill partition function algorithm to sum over the grand canonical ensemble of all secondary structures of all mutants of the given sequence. Specifically, our new program, RNAmutants, simultaneously computes for each integer k the minimum free energy structure MFE(k) and the partition function Z(k) over all secondary structures of all k-point mutants, even allowing the user to specify certain positions required not to mutate and certain positions required to base-pair or remain unpaired. This technically important extension allows us to study the resilience of an RNA molecule to pointwise mutations. By computing the mutation profile of a sequence, a novel graphical representation of the mutational tendency of nucleotide positions, we analyze the deleterious nature of mutating specific nucleotide positions or groups of positions. We have successfully applied RNAmutants to investigate deleterious mutations (mutations that radically modify the secondary structure) in the Hepatitis C virus cis-acting replication element and to evaluate the evolutionary pressure applied on different regions of the HIV trans-activation response element. In particular, we show qualitative agreement between published Hepatitis C and HIV experimental mutagenesis studies and our analysis of deleterious mutations using RNAmutants. Our work also predicts other deleterious mutations, which could be verified experimentally. Finally, we provide evidence that the 3' UTR of the GB RNA virus C has been optimized to preserve evolutionarily conserved stem regions from a deleterious effect of pointwise mutations. We hope that there will be long-term potential applications of RNAmutants in de novo RNA design and drug design against RNA viruses. This work also suggests potential applications for large-scale exploration of the RNA sequence-structure network. Binary distributions are available at http://RNAmutants.csail.mit.edu/.
miRDeep2 accurately identifies known and hundreds of novel microRNA genes in seven animal clades.
Friedländer, Marc R; Mackowiak, Sebastian D; Li, Na; Chen, Wei; Rajewsky, Nikolaus
2012-01-01
microRNAs (miRNAs) are a large class of small non-coding RNAs which post-transcriptionally regulate the expression of a large fraction of all animal genes and are important in a wide range of biological processes. Recent advances in high-throughput sequencing allow miRNA detection at unprecedented sensitivity, but the computational task of accurately identifying the miRNAs in the background of sequenced RNAs remains challenging. For this purpose, we have designed miRDeep2, a substantially improved algorithm which identifies canonical and non-canonical miRNAs such as those derived from transposable elements and informs on high-confidence candidates that are detected in multiple independent samples. Analyzing data from seven animal species representing the major animal clades, miRDeep2 identified miRNAs with an accuracy of 98.6-99.9% and reported hundreds of novel miRNAs. To test the accuracy of miRDeep2, we knocked down the miRNA biogenesis pathway in a human cell line and sequenced small RNAs before and after. The vast majority of the >100 novel miRNAs expressed in this cell line were indeed specifically downregulated, validating most miRDeep2 predictions. Last, a new miRNA expression profiling routine, low time and memory usage and user-friendly interactive graphic output can make miRDeep2 useful to a wide range of researchers.
Odon, Valerie; Luke, Garry A.; Roulston, Claire; Brown, Jeremy D.; Ryan, Martin D.; Sukhodub, Andriy
2013-01-01
2A oligopeptide sequences (“2As”) mediate a cotranslational recoding event termed “ribosome skipping.” Previously we demonstrated the activity of 2As (and “2A-like sequences”) within a wide range of animal RNA virus genomes and non-long terminal repeat retrotransposons (non-LTRs) in the genomes of the unicellular organisms Trypanosoma brucei (Ingi) and T. cruzi (L1Tc). Here, we report the presence of 2A-like sequences in the genomes of a wide range of multicellular organisms and, as in the trypanosome genomes, within non-LTR retrotransposons (non-LTRs)—clustering in the Rex1, Crack, L2, L2A, and CR1 clades, in addition to Ingi. These 2A-like sequences were tested for translational recoding activity, and highly active sequences were found within the Rex1, L2, CR1, and Ingi clades. The presence of 2A-like sequences within non-LTRs may not only represent a method of controlling protein biogenesis but also shows some correlation with such apurinic/apyrimidinic DNA endonuclease-type non-LTRs encoding one, rather than two, open reading frames (ORFs). Interestingly, such non-LTRs cluster with closely related elements lacking 2A-like recoding elements but retaining ORF1. Taken together, these observations suggest that acquisition of 2A-like translational recoding sequences may have played a role in the evolution of these elements. PMID:23728794
Programming mRNA decay to modulate synthetic circuit resource allocation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Venturelli, Ophelia S.; Tei, Mika; Bauer, Stefan
Synthetic circuits embedded in host cells compete with cellular processes for limited intracellular resources. Here we show how funnelling of cellular resources, after global transcriptome degradation by the sequence-dependent endoribonuclease MazF, to a synthetic circuit can increase production. Target genes are protected from MazF activity by recoding the gene sequence to eliminate recognition sites, while preserving the amino acid sequence. The expression of a protected fluorescent reporter and flux of a high-value metabolite are significantly enhanced using this genome-scale control strategy. Proteomics measurements discover a host factor in need of protection to improve resource redistribution activity. A computational model demonstratesmore » that the MazF mRNA-decay feedback loop enables proportional control of MazF in an optimal operating regime. Transcriptional profiling of MazF-induced cells elucidates the dynamic shifts in transcript abundance and discovers regulatory design elements. Altogether, our results suggest that manipulation of cellular resource allocation is a key control parameter for synthetic circuit design.« less
Programming mRNA decay to modulate synthetic circuit resource allocation
Venturelli, Ophelia S.; Tei, Mika; Bauer, Stefan; ...
2017-04-26
Synthetic circuits embedded in host cells compete with cellular processes for limited intracellular resources. Here we show how funnelling of cellular resources, after global transcriptome degradation by the sequence-dependent endoribonuclease MazF, to a synthetic circuit can increase production. Target genes are protected from MazF activity by recoding the gene sequence to eliminate recognition sites, while preserving the amino acid sequence. The expression of a protected fluorescent reporter and flux of a high-value metabolite are significantly enhanced using this genome-scale control strategy. Proteomics measurements discover a host factor in need of protection to improve resource redistribution activity. A computational model demonstratesmore » that the MazF mRNA-decay feedback loop enables proportional control of MazF in an optimal operating regime. Transcriptional profiling of MazF-induced cells elucidates the dynamic shifts in transcript abundance and discovers regulatory design elements. Altogether, our results suggest that manipulation of cellular resource allocation is a key control parameter for synthetic circuit design.« less
Transcriptome Engineering with RNA-Targeting Type VI-D CRISPR Effectors.
Konermann, Silvana; Lotfy, Peter; Brideau, Nicholas J; Oki, Jennifer; Shokhirev, Maxim N; Hsu, Patrick D
2018-04-19
Class 2 CRISPR-Cas systems endow microbes with diverse mechanisms for adaptive immunity. Here, we analyzed prokaryotic genome and metagenome sequences to identify an uncharacterized family of RNA-guided, RNA-targeting CRISPR systems that we classify as type VI-D. Biochemical characterization and protein engineering of seven distinct orthologs generated a ribonuclease effector derived from Ruminococcus flavefaciens XPD3002 (CasRx) with robust activity in human cells. CasRx-mediated knockdown exhibits high efficiency and specificity relative to RNA interference across diverse endogenous transcripts. As one of the most compact single-effector Cas enzymes, CasRx can also be flexibly packaged into adeno-associated virus. We target virally encoded, catalytically inactive CasRx to cis elements of pre-mRNA to manipulate alternative splicing, alleviating dysregulated tau isoform ratios in a neuronal model of frontotemporal dementia. Our results present CasRx as a programmable RNA-binding module for efficient targeting of cellular RNA, enabling a general platform for transcriptome engineering and future therapeutic development. Copyright © 2018 Elsevier Inc. All rights reserved.
Transcriptional activation of short interspersed elements by DNA-damaging agents.
Rudin, C M; Thompson, C B
2001-01-01
Short interspersed elements (SINEs), typified by the human Alu repeat, are RNA polymerase III (pol III)-transcribed sequences that replicate within the genome through an RNA intermediate. Replication of SINEs has been extensive in mammalian evolution: an estimated 5% of the human genome consists of Alu repeats. The mechanisms regulating transcription, reverse transcription, and reinsertion of SINE elements in genomic DNA are poorly understood. Here we report that expression of murine SINE transcripts of both the B1 and B2 classes is strongly upregulated after prolonged exposure to cisplatin, etoposide, or gamma radiation. A similar induction of Alu transcripts in human cells occurs under these conditions. This induction is not due to a general upregulation of pol III activity in either species. Genotoxic treatment of murine cells containing an exogenous human Alu element induced Alu transcription. Concomitant with the increased expression of SINEs, an increase in cellular reverse transcriptase was observed after exposure to these same DNA-damaging agents. These findings suggest that genomic damage may be an important activator of SINEs, and that SINE mobility may contribute to secondary malignancy after exposure to DNA-damaging chemotherapy.
Jones, Christopher P.; Saadatmand, Jenan; Kleiman, Lawrence; Musier-Forsyth, Karin
2013-01-01
The primer for initiating reverse transcription in human immunodeficiency virus type 1 (HIV-1) is tRNALys3. Host cell tRNALys is selectively packaged into HIV-1 through a specific interaction between the major tRNALys-binding protein, human lysyl-tRNA synthetase (hLysRS), and the viral proteins Gag and GagPol. Annealing of the tRNA primer onto the complementary primer-binding site (PBS) in viral RNA is mediated by the nucleocapsid domain of Gag. The mechanism by which tRNALys3 is targeted to the PBS and released from hLysRS prior to annealing is unknown. Here, we show that hLysRS specifically binds to a tRNA anti-codon-like element (TLE) in the HIV-1 genome, which mimics the anti-codon loop of tRNALys and is located proximal to the PBS. Mutation of the U-rich sequence within the TLE attenuates binding of hLysRS in vitro and reduces the amount of annealed tRNALys3 in virions. Thus, LysRS binds specifically to the TLE, which is part of a larger LysRS binding domain in the viral RNA that includes elements of the Psi packaging signal. Our results suggest that HIV-1 uses molecular mimicry of the anti-codon of tRNALys to increase the efficiency of tRNALys3 annealing to viral RNA. PMID:23264568
The role of heterologous chloroplast sequence elements in transgene integration and expression.
Ruhlman, Tracey; Verma, Dheeraj; Samson, Nalapalli; Daniell, Henry
2010-04-01
Heterologous regulatory elements and flanking sequences have been used in chloroplast transformation of several crop species, but their roles and mechanisms have not yet been investigated. Nucleotide sequence identity in the photosystem II protein D1 (psbA) upstream region is 59% across all taxa; similar variation was consistent across all genes and taxa examined. Secondary structure and predicted Gibbs free energy values of the psbA 5' untranslated region (UTR) among different families reflected this variation. Therefore, chloroplast transformation vectors were made for tobacco (Nicotiana tabacum) and lettuce (Lactuca sativa), with endogenous (Nt-Nt, Ls-Ls) or heterologous (Nt-Ls, Ls-Nt) psbA promoter, 5' UTR and 3' UTR, regulating expression of the anthrax protective antigen (PA) or human proinsulin (Pins) fused with the cholera toxin B-subunit (CTB). Unique lettuce flanking sequences were completely eliminated during homologous recombination in the transplastomic tobacco genomes but not unique tobacco sequences. Nt-Ls or Ls-Nt transplastomic lines showed reduction of 80% PA and 97% CTB-Pins expression when compared with endogenous psbA regulatory elements, which accumulated up to 29.6% total soluble protein PA and 72.0% total leaf protein CTB-Pins, 2-fold higher than Rubisco. Transgene transcripts were reduced by 84% in Ls-Nt-CTB-Pins and by 72% in Nt-Ls-PA lines. Transcripts containing endogenous 5' UTR were stabilized in nonpolysomal fractions. Stromal RNA-binding proteins were preferentially associated with endogenous psbA 5' UTR. A rapid and reproducible regeneration system was developed for lettuce commercial cultivars by optimizing plant growth regulators. These findings underscore the need for sequencing complete crop chloroplast genomes, utilization of endogenous regulatory elements and flanking sequences, as well as optimization of plant growth regulators for efficient chloroplast transformation.
Ruhlman, Tracey; Verma, Dheeraj; Samson, Nalapalli; Daniell, Henry
2010-01-01
Heterologous regulatory elements and flanking sequences have been used in chloroplast transformation of several crop species, but their roles and mechanisms have not yet been investigated. Nucleotide sequence identity in the photosystem II protein D1 (psbA) upstream region is 59% across all taxa; similar variation was consistent across all genes and taxa examined. Secondary structure and predicted Gibbs free energy values of the psbA 5′ untranslated region (UTR) among different families reflected this variation. Therefore, chloroplast transformation vectors were made for tobacco (Nicotiana tabacum) and lettuce (Lactuca sativa), with endogenous (Nt-Nt, Ls-Ls) or heterologous (Nt-Ls, Ls-Nt) psbA promoter, 5′ UTR and 3′ UTR, regulating expression of the anthrax protective antigen (PA) or human proinsulin (Pins) fused with the cholera toxin B-subunit (CTB). Unique lettuce flanking sequences were completely eliminated during homologous recombination in the transplastomic tobacco genomes but not unique tobacco sequences. Nt-Ls or Ls-Nt transplastomic lines showed reduction of 80% PA and 97% CTB-Pins expression when compared with endogenous psbA regulatory elements, which accumulated up to 29.6% total soluble protein PA and 72.0% total leaf protein CTB-Pins, 2-fold higher than Rubisco. Transgene transcripts were reduced by 84% in Ls-Nt-CTB-Pins and by 72% in Nt-Ls-PA lines. Transcripts containing endogenous 5′ UTR were stabilized in nonpolysomal fractions. Stromal RNA-binding proteins were preferentially associated with endogenous psbA 5′ UTR. A rapid and reproducible regeneration system was developed for lettuce commercial cultivars by optimizing plant growth regulators. These findings underscore the need for sequencing complete crop chloroplast genomes, utilization of endogenous regulatory elements and flanking sequences, as well as optimization of plant growth regulators for efficient chloroplast transformation. PMID:20130101
Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum.
VanBuren, Robert; Bryant, Doug; Edger, Patrick P; Tang, Haibao; Burgess, Diane; Challabathula, Dinakar; Spittle, Kristi; Hall, Richard; Gu, Jenny; Lyons, Eric; Freeling, Michael; Bartels, Dorothea; Ten Hallers, Boudewijn; Hastie, Alex; Michael, Todd P; Mockler, Todd C
2015-11-26
Plant genomes, and eukaryotic genomes in general, are typically repetitive, polyploid and heterozygous, which complicates genome assembly. The short read lengths of early Sanger and current next-generation sequencing platforms hinder assembly through complex repeat regions, and many draft and reference genomes are fragmented, lacking skewed GC and repetitive intergenic sequences, which are gaining importance due to projects like the Encyclopedia of DNA Elements (ENCODE). Here we report the whole-genome sequencing and assembly of the desiccation-tolerant grass Oropetium thomaeum. Using only single-molecule real-time sequencing, which generates long (>16 kilobases) reads with random errors, we assembled 99% (244 megabases) of the Oropetium genome into 625 contigs with an N50 length of 2.4 megabases. Oropetium is an example of a 'near-complete' draft genome which includes gapless coverage over gene space as well as intergenic sequences such as centromeres, telomeres, transposable elements and rRNA clusters that are typically unassembled in draft genomes. Oropetium has 28,466 protein-coding genes and 43% repeat sequences, yet with 30% more compact euchromatic regions it is the smallest known grass genome. The Oropetium genome demonstrates the utility of single-molecule real-time sequencing for assembling high-quality plant and other eukaryotic genomes, and serves as a valuable resource for the plant comparative genomics community.
Circular RNA biogenesis can proceed through an exon-containing lariat precursor.
Barrett, Steven P; Wang, Peter L; Salzman, Julia
2015-06-09
Pervasive expression of circular RNA is a recently discovered feature of eukaryotic gene expression programs, yet its function remains largely unknown. The presumed biogenesis of these RNAs involves a non-canonical 'backsplicing' event. Recent studies in mammalian cell culture posit that backsplicing is facilitated by inverted repeats flanking the circularized exon(s). Although such sequence elements are common in mammals, they are rare in lower eukaryotes, making current models insufficient to describe circularization. Through systematic splice site mutagenesis and the identification of splicing intermediates, we show that circular RNA in Schizosaccharomyces pombe is generated through an exon-containing lariat precursor. Furthermore, we have performed high-throughput and comprehensive mutagenesis of a circle-forming exon, which enabled us to discover a systematic effect of exon length on RNA circularization. Our results uncover a mechanism for circular RNA biogenesis that may account for circularization in genes that lack noticeable flanking intronic secondary structure.
Recipe for a Busy Bee: MicroRNAs in Honey Bee Caste Determination
Skogerboe, Geir; Dai, Shuanjin; Li, Wenfeng; Li, Zhiguo; Liu, Fang; Ni, Ruifeng; Guo, Yu; Chen, Shenglu; Zhang, Shaowu; Chen, Runsheng
2013-01-01
Social caste determination in the honey bee is assumed to be determined by the dietary status of the young larvae and translated into physiological and epigenetic changes through nutrient-sensing pathways. We have employed Illumina/Solexa sequencing to examine the small RNA content in the bee larval food, and show that worker jelly is enriched in miRNA complexity and abundance relative to royal jelly. The miRNA levels in worker jelly were 7–215 fold higher than in royal jelly, and both jellies showed dynamic changes in miRNA content during the 4th to 6th day of larval development. Adding specific miRNAs to royal jelly elicited significant changes in queen larval mRNA expression and morphological characters of the emerging adult queen bee. We propose that miRNAs in the nurse bee secretions constitute an additional element in the regulatory control of caste determination in the honey bee. PMID:24349106
Regulation of alternative splicing in Drosophila by 56 RNA binding proteins
Brooks, Angela N.; Duff, Michael O.; May, Gemma; ...
2015-08-20
Alternative splicing is regulated by RNA binding proteins (RBPs) that recognize pre-mRNA sequence elements and activate or repress adjacent exons. Here, we used RNA interference and RNA-seq to identify splicing events regulated by 56 Drosophila proteins, some previously unknown to regulate splicing. Nearly all proteins affected alternative first exons, suggesting that RBPs play important roles in first exon choice. Half of the splicing events were regulated by multiple proteins, demonstrating extensive combinatorial regulation. We observed that SR and hnRNP proteins tend to act coordinately with each other, not antagonistically. We also identified a cross-regulatory network where splicing regulators affected themore » splicing of pre-mRNAs encoding other splicing regulators. In conclusion, this large-scale study substantially enhances our understanding of recent models of splicing regulation and provides a resource of thousands of exons that are regulated by 56 diverse RBPs.« less
Antiviral Goes Viral: Harnessing CRISPR/Cas9 to Combat Viruses in Humans.
Soppe, Jasper Adriaan; Lebbink, Robert Jan
2017-10-01
The clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR-associated (Cas) systems are RNA-guided sequence-specific prokaryotic antiviral immune systems. In prokaryotes, small RNA molecules guide Cas effector endonucleases to invading foreign genetic elements in a sequence-dependent manner, resulting in DNA cleavage by the endonuclease upon target binding. A rewired CRISPR/Cas9 system can be used for targeted and precise genome editing in eukaryotic cells. CRISPR/Cas has also been harnessed to target human pathogenic viruses as a potential new antiviral strategy. Here, we review recent CRISPR/Cas9-based approaches to combat specific human viruses in humans and discuss challenges that need to be overcome before CRISPR/Cas9 may be used in the clinic as an antiviral strategy. Copyright © 2017 Elsevier Ltd. All rights reserved.
Zygotic amplification of secondary piRNAs during silkworm embryogenesis
Kawaoka, Shinpei; Arai, Yuji; Kadota, Koji; Suzuki, Yutaka; Hara, Kahori; Sugano, Sumio; Shimizu, Kentaro; Tomari, Yukihide; Shimada, Toru; Katsuma, Susumu
2011-01-01
PIWI-interacting RNAs (piRNAs) are 23–30-nucleotide-long small RNAs that act as sequence-specific silencers of transposable elements in animal gonads. In flies, genetics and deep sequencing data have led to a hypothesis for piRNA biogenesis called the ping-pong cycle, where antisense primary piRNAs initiate an amplification loop to generate sense secondary piRNAs. However, to date, the process of the ping-pong cycle has never been monitored at work. Here, by large-scale profiling of piRNAs from silkworm ovary and embryos of different developmental stages, we demonstrate that maternally inherited antisense-biased piRNAs trigger acute amplification of secondary sense piRNA production in zygotes, at a time coinciding with zygotic transcription of sense transposon mRNAs. These results provide on-site evidence for the ping-pong cycle. PMID:21628432
Lactase non-persistence is directed by DNA variation-dependent epigenetic aging
Labrie, Viviane; Buske, Orion J; Oh, Edward; Jeremian, Richie; Ptak, Carolyn; Gasiūnas, Giedrius; Maleckas, Almantas; Petereit, Rūta; Žvirbliene, Aida; Adamonis, Kęstutis; Kriukienė, Edita; Koncevičius, Karolis; Gordevičius, Juozas; Nair, Akhil; Zhang, Aiping; Ebrahimi, Sasha; Oh, Gabriel; Šikšnys, Virginijus; Kupčinskas, Limas; Brudno, Michael; Petronis, Arturas
2016-01-01
Inability to digest lactose due to lactase non-persistence is a common trait in adult mammals, with the exception of certain human populations that exhibit lactase persistence. It is not clear how the lactase gene can be dramatically downregulated with age in most individuals, but remains active in some. We performed a comprehensive epigenetic study of the human and mouse intestine using chromosome-wide DNA modification profiling and targeted bisulfite sequencing. Epigenetically-controlled regulatory elements were found to account for the differences in lactase mRNA levels between individuals, intestinal cell types and species. The importance of these regulatory elements in modulating lactase mRNA levels was confirmed by CRISPR-Cas9-induced deletions. Genetic factors contribute to epigenetic changes occurring with age at the regulatory elements, as lactase persistence- and non-persistence-DNA haplotypes demonstrated markedly different epigenetic aging. Thus, genetic factors facilitate a gradual accumulation of epigenetic changes with age to affect phenotypic outcome. PMID:27159559
Fenstermacher, Katherine J; Achuthan, Vasudevan; Schneider, Thomas D; DeStefano, Jeffrey J
2018-01-16
DNA polymerases (DNAPs) recognize 3' recessed termini on duplex DNA and carry out nucleotide catalysis. Unlike promoter-specific RNA polymerases (RNAPs), no sequence specificity is required for binding or initiation of catalysis. Despite this, previous results indicate that viral reverse transcriptases bind much more tightly to DNA primers that mimic the polypurine tract. In the current report, primer sequences that bind with high affinity to Taq and Klenow polymerases were identified using a modified Selective Evolution of Ligands by Exponential Enrichment (SELEX) approach. Two Taq -specific primers that bound ∼10 (Taq1) and over 100 (Taq2) times more stably than controls to Taq were identified. Taq1 contained 8 nucleotides (5' -CACTAAAG-3') that matched the phage T3 RNAP "core" promoter. Both primers dramatically outcompeted primers with similar binding thermodynamics in PCR reactions. Similarly, exonuclease minus Klenow polymerase also selected a high affinity primer that contained a related core promoter sequence from phage T7 RNAP (5' -ACTATAG-3'). For both Taq and Klenow, even small modifications to the sequence resulted in large losses in binding affinity suggesting that binding was highly sequence-specific. The results are discussed in the context of possible effects on multi-primer (multiplex) PCR assays, molecular information theory, and the evolution of RNAPs and DNAPs. Importance This work further demonstrates that primer-dependent DNA polymerases can have strong sequence biases leading to dramatically tighter binding to specific sequences. These may be related to biological function, or be a consequences of the structural architecture of the enzyme. New sequence specificity for Taq and Klenow polymerases were uncovered and among them were sequences that contained the core promoter elements from T3 and T7 phage RNA polymerase promoters. This suggests the intriguing possibility that phage RNA polymerases exploited intrinsic binding affinities of ancestral DNA polymerases to develop their promotors. Conversely, DNA polymerases could have evolved from related RNA polymerases and retained the intrinsic binding preference despite there being no clear function for such a preference in DNA biology. Copyright © 2018 American Society for Microbiology.
Cas6 is an endoribonuclease that generates guide RNAs for invader defense in prokaryotes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Carte, Jason; Wang, Ruiying; Li, Hong
An RNA-based gene silencing pathway that protects bacteria and archaea from viruses and other genome invaders is hypothesized to arise from guide RNAs encoded by CRISPR loci and proteins encoded by the cas genes. CRISPR loci contain multiple short invader-derived sequences separated by short repeats. The presence of virus-specific sequences within CRISPR loci of prokaryotic genomes confers resistance against corresponding viruses. The CRISPR loci are transcribed as long RNAs that must be processed to smaller guide RNAs. Here we identified Pyrococcus furiosus Cas6 as a novel endoribonuclease that cleaves CRISPR RNAs within the repeat sequences to release individual invader targetingmore » RNAs. Cas6 interacts with a specific sequence motif in the 5{prime} region of the CRISPR repeat element and cleaves at a defined site within the 3{prime} region of the repeat. The 1.8 angstrom crystal structure of the enzyme reveals two ferredoxin-like folds that are also found in other RNA-binding proteins. The predicted active site of the enzyme is similar to that of tRNA splicing endonucleases, and concordantly, Cas6 activity is metal-independent. cas6 is one of the most widely distributed CRISPR-associated genes. Our findings indicate that Cas6 functions in the generation of CRISPR-derived guide RNAs in numerous bacteria and archaea.« less
The Evolution of Tyrosine-Recombinase Elements in Nematoda
Szitenberg, Amir; Koutsovoulos, Georgios; Blaxter, Mark L.; Lunt, David H.
2014-01-01
Transposable elements can be categorised into DNA and RNA elements based on their mechanism of transposition. Tyrosine recombinase elements (YREs) are relatively rare and poorly understood, despite sharing characteristics with both DNA and RNA elements. Previously, the Nematoda have been reported to have a substantially different diversity of YREs compared to other animal phyla: the Dirs1-like YRE retrotransposon was encountered in most animal phyla but not in Nematoda, and a unique Pat1-like YRE retrotransposon has only been recorded from Nematoda. We explored the diversity of YREs in Nematoda by sampling broadly across the phylum and including 34 genomes representing the three classes within Nematoda. We developed a method to isolate and classify YREs based on both feature organization and phylogenetic relationships in an open and reproducible workflow. We also ensured that our phylogenetic approach to YRE classification identified truncated and degenerate elements, informatively increasing the number of elements sampled. We identified Dirs1-like elements (thought to be absent from Nematoda) in the nematode classes Enoplia and Dorylaimia indicating that nematode model species do not adequately represent the diversity of transposable elements in the phylum. Nematode Pat1-like elements were found to be a derived form of another Pat1-like element that is present more widely in animals. Several sequence features used widely for the classification of YREs were found to be homoplasious, highlighting the need for a phylogenetically-based classification scheme. Nematode model species do not represent the diversity of transposable elements in the phylum. PMID:25197791
The evolution of tyrosine-recombinase elements in Nematoda.
Szitenberg, Amir; Koutsovoulos, Georgios; Blaxter, Mark L; Lunt, David H
2014-01-01
Transposable elements can be categorised into DNA and RNA elements based on their mechanism of transposition. Tyrosine recombinase elements (YREs) are relatively rare and poorly understood, despite sharing characteristics with both DNA and RNA elements. Previously, the Nematoda have been reported to have a substantially different diversity of YREs compared to other animal phyla: the Dirs1-like YRE retrotransposon was encountered in most animal phyla but not in Nematoda, and a unique Pat1-like YRE retrotransposon has only been recorded from Nematoda. We explored the diversity of YREs in Nematoda by sampling broadly across the phylum and including 34 genomes representing the three classes within Nematoda. We developed a method to isolate and classify YREs based on both feature organization and phylogenetic relationships in an open and reproducible workflow. We also ensured that our phylogenetic approach to YRE classification identified truncated and degenerate elements, informatively increasing the number of elements sampled. We identified Dirs1-like elements (thought to be absent from Nematoda) in the nematode classes Enoplia and Dorylaimia indicating that nematode model species do not adequately represent the diversity of transposable elements in the phylum. Nematode Pat1-like elements were found to be a derived form of another Pat1-like element that is present more widely in animals. Several sequence features used widely for the classification of YREs were found to be homoplasious, highlighting the need for a phylogenetically-based classification scheme. Nematode model species do not represent the diversity of transposable elements in the phylum.
Modular structural elements in the replication origin region of Tetrahymena rDNA.
Du, C; Sanzgiri, R P; Shaiu, W L; Choi, J K; Hou, Z; Benbow, R M; Dobbs, D L
1995-01-01
Computer analyses of the DNA replication origin region in the amplified rRNA genes of Tetrahymena thermophila identified a potential initiation zone in the 5'NTS [Dobbs, Shaiu and Benbow (1994), Nucleic Acids Res. 22, 2479-2489]. This region consists of a putative DNA unwinding element (DUE) aligned with predicted bent DNA segments, nuclear matrix or scaffold associated region (MAR/SAR) consensus sequences, and other common modular sequence elements previously shown to be clustered in eukaryotic chromosomal origin regions. In this study, two mung bean nuclease-hypersensitive sites in super-coiled plasmid DNA were localized within the major DUE-like element predicted by thermodynamic analyses. Three restriction fragments of the 5'NTS region predicted to contain bent DNA segments exhibited anomalous migration characteristic of bent DNA during electrophoresis on polyacrylamide gels. Restriction fragments containing the 5'NTS region bound Tetrahymena nuclear matrices in an in vitro binding assay, consistent with an association of the replication origin region with the nuclear matrix in vivo. The direct demonstration in a protozoan origin region of elements previously identified in Drosophila, chick and mammalian origin regions suggests that clusters of modular structural elements may be a conserved feature of eukaryotic chromosomal origins of replication. Images PMID:7784181
Unraveling transcriptional control and cis-regulatory codes using the software suite GeneACT
Cheung, Tom Hiu; Kwan, Yin Lam; Hamady, Micah; Liu, Xuedong
2006-01-01
Deciphering gene regulatory networks requires the systematic identification of functional cis-acting regulatory elements. We present a suite of web-based bioinformatics tools, called GeneACT , that can rapidly detect evolutionarily conserved transcription factor binding sites or microRNA target sites that are either unique or over-represented in differentially expressed genes from DNA microarray data. GeneACT provides graphic visualization and extraction of common regulatory sequence elements in the promoters and 3'-untranslated regions that are conserved across multiple mammalian species. PMID:17064417
Mutations Affecting Expression of the rosy Locus in Drosophila melanogaster
Lee, Chong Sung; Curtis, Daniel; McCarron, Margaret; Love, Carol; Gray, Mark; Bender, Welcome; Chovnick, Arthur
1987-01-01
The rosy locus in Drosophila melanogaster codes for the enzyme xanthine dehydrogenase (XDH). Previous studies defined a "control element" near the 5' end of the gene, where variant sites affected the amount of rosy mRNA and protein produced. We have determined the DNA sequence of this region from both genomic and cDNA clones, and from the ry+10 underproducer strain. This variant strain had many sequence differences, so that the site of the regulatory change could not be fixed. A mutagenesis was also undertaken to isolate new regulatory mutations. We induced 376 new mutations with 1-ethyl-1-nitrosourea (ENU) and screened them to isolate those that reduced the amount of XDH protein produced, but did not change the properties of the enzyme. Genetic mapping was used to find mutations located near the 5' end of the gene. DNA from each of seven mutants was cloned and sequenced through the 5' region. Mutant base changes were identified in all seven; they appear to affect splicing and translation of the rosy mRNA. In a related study (T. P. Keith et al. 1987), the genomic and cDNA sequences are extended through the 3' end of the gene; the combined sequences define the processing pattern of the rosy transcript and predict the amino acid sequence of XDH. PMID:3036645
López-Lastra, M; Gabus, C; Darlix, J L
1997-11-01
The murine leukemia virus (MLV)-related type C viruses constitute a major class of retroviruses that includes numerous endogenous and exogenous mammalian viruses and the related avian spleen necrosis virus (SNV). The MLV-related viruses possess a long and multifunctional 5' untranslated leader involved in key steps of the viral life cycle--splicing, translation, RNA dimerization, encapsidation, and reverse transcription. Recent studies have shown that the 5' leader of Friend murine leukemia virus and Moloney murine leukemia virus can direct cap independent translation of gag precursor proteins (Berlioz et al., 1995; Vagner et al., 1995b). These data, together with structural homology studies (Koning et al., 1992), prompted us to undertake a search for new internal ribosome entry segment (IRES) of retroviral origin. Here we describe an IRES element within the 5' leader of avian reticuloendotheliosis virus type A (REV-A) genomic RNA. Data show that the REV-A 5' IRES element maps downstream of the packaging/dimerization (E/DLS) sequence (Watanabe and Temin, 1982; Darlix et al., 1992) and the minimal IRES sequence appears to be within a 129 nt fragment (nucleotides 452-580) of the 5' leader, immediately upstream of the gag AUG codon. The REV-A IRES has been successfully utilized in the construction of novel high titer MLV-based retroviral vectors, containing one or more IRES elements of retroviral origin. These retroviral constructs, which represent a starting point for the design of novel vectors suitable for gene therapy, are also of interest as a model system of internal translation initiation and its possible regulation during development, cancer, or virus infection.
Luque, Daniel; Gómez-Blanco, Josué; Garriga, Damiá; Brilot, Axel F.; González, José M.; Havens, Wendy M.; Carrascosa, José L.; Trus, Benes L.; Verdaguer, Nuria; Ghabrial, Said A.; Castón, José R.
2014-01-01
Viruses evolve so rapidly that sequence-based comparison is not suitable for detecting relatedness among distant viruses. Structure-based comparisons suggest that evolution led to a small number of viral classes or lineages that can be grouped by capsid protein (CP) folds. Here, we report that the CP structure of the fungal dsRNA Penicillium chrysogenum virus (PcV) shows the progenitor fold of the dsRNA virus lineage and suggests a relationship between lineages. Cryo-EM structure at near-atomic resolution showed that the 982-aa PcV CP is formed by a repeated α-helical core, indicative of gene duplication despite lack of sequence similarity between the two halves. Superimposition of secondary structure elements identified a single “hotspot” at which variation is introduced by insertion of peptide segments. Structural comparison of PcV and other distantly related dsRNA viruses detected preferential insertion sites at which the complexity of the conserved α-helical core, made up of ancestral structural motifs that have acted as a skeleton, might have increased, leading to evolution of the highly varied current structures. Analyses of structural motifs only apparent after systematic structural comparisons indicated that the hallmark fold preserved in the dsRNA virus lineage shares a long (spinal) α-helix tangential to the capsid surface with the head-tailed phage and herpesvirus viral lineage. PMID:24821769
Hirano, Minato; Muto, Memi; Sakai, Mizuki; Kondo, Hirofumi; Kobayashi, Shintaro; Kariwa, Hiroaki; Yoshii, Kentaro
2017-09-12
Neurological diseases caused by encephalitic flaviviruses are severe and associated with high levels of mortality. However, little is known about the detailed mechanisms of viral replication and pathogenicity in the brain. Previously, we reported that the genomic RNA of tick-borne encephalitis virus (TBEV), a member of the genus Flavivirus , is transported and replicated in the dendrites of neurons. In the present study, we analyzed the transport mechanism of the viral genome to dendrites. We identified specific sequences of the 5' untranslated region of TBEV genomic RNA that act as a cis -acting element for RNA transport. Mutated TBEV with impaired RNA transport in dendrites caused a reduction in neurological symptoms in infected mice. We show that neuronal granules, which regulate the transport and local translation of dendritic mRNAs, are involved in TBEV genomic RNA transport. TBEV genomic RNA bound an RNA-binding protein of neuronal granules and disturbed the transport of dendritic mRNAs. These results demonstrated a neuropathogenic virus hijacking the neuronal granule system for the transport of viral genomic RNA in dendrites, resulting in severe neurological disease.
The control of paramyxovirus genome hexamer length and mRNA editing.
Matsumoto, Yusuke; Ohta, Keisuke; Kolakofsky, Daniel; Nishio, Machiko
2018-04-01
The unusual ability of a human parainfluenza virus type 2 (hPIV2) nucleoprotein point mutation (NP Q202A ) to strongly enhance minigenome replication was found to depend on the absence of a functional, internal element of the bipartite replication promoter (CRII). This point mutation allows relatively robust CRII-minus minigenome replication in a CRII-independent manner, under conditions in which NP wt is essentially inactive. The nature of the amino acid at position 202 apparently controls whether viral RNA-dependent RNA polymerase (vRdRp) can, or cannot, initiate RNA synthesis in a CRII-independent manner. By repressing genome synthesis when vRdRp cannot correctly interact with CRII, gln 202 of N, the only residue of the RNA-binding groove that contacts a nucleotide base in the N-RNA, acts as a gatekeeper for wild-type (CRII-dependent) RNA synthesis. This ensures that only hexamer-length genomes are replicated, and that the critical hexamer phase of the cis -acting mRNA editing sequence is maintained. © 2018 Matsumoto et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Precise Maps of RNA Polymerase Reveal How Promoters Direct Initiation and Pausing
Kwak, Hojoong; Fuda, Nicholas J.; Core, Leighton J.; Lis, John T.
2014-01-01
Transcription regulation occurs frequently through promoter-associated pausing of RNA polymerase II (Pol II). We developed a Precision nuclear Run-On and sequencing assay (PRO-seq) to map the genome-wide distribution of transcriptionally-engaged Pol II at base-pair resolution. Pol II accumulates immediately downstream of promoters, at intron-exon junctions that are efficiently used for splicing, and over 3' poly-adenylation sites. Focused analyses of promoters reveal that pausing is not fixed relative to initiation sites nor is it specified directly by the position of a particular core promoter element or the first nucleosome. Core promoter elements function beyond initiation, and when optimally positioned they act collectively to dictate the position and strength of pausing. We test this ‘Complex Interaction’ model with insertional mutagenesis of the Drosophila Hsp70 core promoter. PMID:23430654
Jády, Beáta E.; Ketele, Amandine; Kiss, Tamás
2012-01-01
Alu repetitive sequences are the most abundant short interspersed DNA elements in the human genome. Full-length Alu elements are composed of two tandem sequence monomers, the left and right Alu arms, both derived from the 7SL signal recognition particle RNA. Since Alu elements are common in protein-coding genes, they are frequently transcribed into pre-mRNAs. Here, we demonstrate that the right arms of nascent Alu transcripts synthesized within pre-mRNA introns are processed into metabolically stable small RNAs. The intron-encoded Alu RNAs, termed AluACA RNAs, are structurally highly reminiscent of box H/ACA small Cajal body (CB) RNAs (scaRNAs). They are composed of two hairpin units followed by the essential H (AnAnnA) and ACA box motifs. The mature AluACA RNAs associate with the four H/ACA core proteins: dyskerin, Nop10, Nhp2, and Gar1. Moreover, the 3′ hairpin of AluACA RNAs carries two closely spaced CB localization motifs, CAB boxes (UGAG), which bind Wdr79 in a cumulative fashion. In contrast to canonical H/ACA scaRNPs, which concentrate in CBs, the AluACA RNPs accumulate in the nucleoplasm. Identification of 348 human AluACA RNAs demonstrates that intron-encoded AluACA RNAs represent a novel, large subgroup of H/ACA RNAs, which are apparently confined to human or primate cells. PMID:22892240
Fuchs, Ryan T.; Grundy, Frank J.; Henkin, Tina M.
2007-01-01
The SMK box is a conserved riboswitch motif found in the 5′ untranslated region of metK genes [encoding S-adenosylmethionine (SAM) synthetase] in lactic acid bacteria, including Enterococcus, Streptococcus, and Lactococcus sp. Previous studies showed that this RNA element binds SAM in vitro, and SAM binding causes a structural rearrangement that sequesters the Shine–Dalgarno (SD) sequence by pairing with an anti-SD (ASD) element. A model was proposed in which SAM binding inhibits metK translation by preventing binding of the ribosome to the SD region of the mRNA. In the current work, the addition of SAM was shown to inhibit binding of 30S ribosomal subunits to SMK box RNA; in contrast, the addition of S-adenosylhomocysteine (SAH) had no effect. A mutant RNA, which has a disrupted SD-ASD pairing, was defective in SAM binding and showed no reduction of ribosome binding in the presence of SAM, whereas a compensatory mutation that restored SD-ASD pairing restored the response to SAM. Primer extension inhibition assays provided further evidence for SD-ASD pairing in the presence of SAM. These results strongly support the model that SMK box translational repression operates through occlusion of the ribosome binding site and that SAM binding requires the SD-ASD pairing. PMID:17360376
Succession of splicing regulatory elements determines cryptic 5΄ss functionality
Brillen, Anna-Lena; Schöneweis, Katrin; Walotka, Lara; Hartmann, Linda; Müller, Lisa; Ptok, Johannes; Kaisers, Wolfgang; Poschmann, Gereon; Stühler, Kai; Buratti, Emanuele
2017-01-01
Abstract A critical step in exon definition is the recognition of a proper splice donor (5΄ss) by the 5’ end of U1 snRNA. In the selection of appropriate 5΄ss, cis-acting splicing regulatory elements (SREs) are indispensable. As a model for 5΄ss recognition, we investigated cryptic 5΄ss selection within the human fibrinogen Bβ-chain gene (FGB) exon 7, where we identified several exonic SREs that simultaneously acted on up- and downstream cryptic 5΄ss. In the FGB exon 7 model system, 5΄ss selection iteratively proceeded along an alternating sequence of U1 snRNA binding sites and interleaved SREs which in principle supported different 3’ exon ends. Like in a relay race, SREs either suppressed a potential 5΄ss and passed the splicing baton on or splicing actually occurred. From RNA-Seq data, we systematically selected 19 genes containing exons with silent U1 snRNA binding sites competing with nearby highly used 5΄ss. Extensive SRE analysis by different algorithms found authentic 5΄ss significantly more supported by SREs than silent U1 snRNA binding sites, indicating that our concept may permit generalization to a model for 5΄ss selection and 3’ exon end definition. PMID:28039323
Chao, Mei; Lin, Chia-Chi; Lin, Feng-Ming; Li, Hsin-Pai; Iang, Shan-Bei
2015-12-01
Hepatitis delta virus (HDV) is the only animal RNA virus that has an unbranched rod-like genome with ribozyme activity and is replicated by host RNA polymerase. HDV RNA recombination was previously demonstrated in patients and in cultured cells by analysis of a region corresponding to the C terminus of the delta antigen (HDAg), the only viral-encoded protein. Here, a whole-genome recombination map of HDV was constructed using an experimental system in which two HDV-1 sequences were co-transfected into cultured cells and the recombinants were analysed by sequencing of cloned reverse transcription-PCR products. Fifty homologous recombinants with 60 crossovers mapping to 22 junctions were identified from 200 analysed clones. Small HDAg chimeras harbouring a junction newly detected in the recombination map were then constructed. The results further indicated that the genome-replication level of HDV was sensitive to the sixth amino acid within the N-terminal 22 aa of HDAg. Therefore, the recombination map established in this study provided a tool for not only understanding HDV RNA recombination, but also elucidating the related mechanisms, such as molecular elements responsible for the trans-activation levels of the small HDAg.
Sunita, S; Schwartz, Samantha L; Conn, Graeme L
2015-11-20
Double-stranded RNA (dsRNA)-activated protein kinase (PKR) is an important component of the innate immune system that presents a crucial first line of defense against viral infection. PKR has a modular architecture comprising a regulatory N-terminal dsRNA binding domain and a C-terminal kinase domain interposed by an unstructured ∼80-residue interdomain linker (IDL). Guided by sequence alignment, we created IDL deletions in human PKR (hPKR) and regulatory/kinase domain swap human-rat chimeric PKRs to assess the contributions of each domain and the IDL to regulation of the kinase activity by RNA. Using circular dichroism spectroscopy, limited proteolysis, kinase assays, and isothermal titration calorimetry, we show that each PKR protein is properly folded with similar domain boundaries and that each exhibits comparable polyinosinic-cytidylic (poly(rI:rC)) dsRNA activation profiles and binding affinities for adenoviral virus-associated RNA I (VA RNAI) and HIV-1 trans-activation response (TAR) RNA. From these results we conclude that the IDL of PKR is not required for RNA binding or mediating changes in protein conformation or domain interactions necessary for PKR regulation by RNA. In contrast, inhibition of rat PKR by VA RNAI and TAR RNA was found to be weaker than for hPKR by 7- and >300-fold, respectively, and each human-rat chimeric domain-swapped protein showed intermediate levels of inhibition. These findings indicate that PKR sequence or structural elements in the kinase domain, present in hPKR but absent in rat PKR, are exploited by viral non-coding RNAs to accomplish efficient inhibition of PKR. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.
Chaw, Shu-Miaw; Shih, Arthur Chun-Chieh; Wang, Daryi; Wu, Yu-Wei; Liu, Shu-Mei; Chou, The-Yuan
2008-03-01
The mtDNA of Cycas taitungensis is a circular molecule of 414,903 bp, making it 2- to 6-fold larger than the known mtDNAs of charophytes and bryophytes, but similar to the average of 7 elucidated angiosperm mtDNAs. It is characterized by abundant RNA editing sites (1,084), more than twice the number found in the angiosperm mtDNAs. The A + T content of Cycas mtDNA is 53.1%, the lowest among known land plants. About 5% of the Cycas mtDNA is composed of a novel family of mobile elements, which we designated as "Bpu sequences." They share a consensus sequence of 36 bp with 2 terminal direct repeats (AAGG) and a recognition site for the Bpu 10I restriction endonuclease (CCTGAAGC). Comparison of the Cycas mtDNA with other plant mtDNAs revealed many new insights into the biology and evolution of land plant mtDNAs. For example, the noncoding sequences in mtDNAs have drastically expanded as land plants have evolved, with abrupt increases appearing in the bryophytes, and then in the seed plants. As a result, the genomic organizations of seed plant mtDNAs are much less compact than in other plants. Also, the Cycas mtDNA appears to have been exempted from the frequent gene loss observed in angiosperm mtDNAs. Similar to the angiosperms, the 3 Cycas genes nad1, nad2, and nad5 are disrupted by 5 group II intron squences, which have brought the genes into trans-splicing arrangements. The evolutionary origin and invasion/duplication mechanism of the Bpu sequences in Cycas mtDNA are hypothesized and discussed.
Functional Analysis of Maize Silk-Specific ZmbZIP25 Promoter.
Li, Wanying; Yu, Dan; Yu, Jingjuan; Zhu, Dengyun; Zhao, Qian
2018-03-12
ZmbZIP25 ( Zea mays bZIP (basic leucine zipper) transcription factor 25) is a function-unknown protein that belongs to the D group of the bZIP transcription factor family. RNA-seq data showed that the expression of ZmbZIP25 was tissue-specific in maize silks, and this specificity was confirmed by RT-PCR (reverse transcription-polymerase chain reaction). In situ RNA hybridization showed that ZmbZIP25 was expressed exclusively in the xylem of maize silks. A 5' RACE (rapid amplification of cDNA ends) assay identified an adenine residue as the transcription start site of the ZmbZIP25 gene. To characterize this silk-specific promoter, we isolated and analyzed a 2450 bp (from -2083 to +367) and a 2600 bp sequence of ZmbZIP25 (from -2083 to +517, the transcription start site was denoted +1). Stable expression assays in Arabidopsis showed that the expression of the reporter gene GUS driven by the 2450 bp ZmbZIP25 5'-flanking fragment occurred exclusively in the papillae of Arabidopsis stigmas. Furthermore, transient expression assays in maize indicated that GUS and GFP expression driven by the 2450 bp ZmbZIP25 5'-flanking sequences occurred only in maize silks and not in other tissues. However, no GUS or GFP expression was driven by the 2600 bp ZmbZIP25 5'-flanking sequences in either stable or transient expression assays. A series of deletion analyses of the 2450 bp ZmbZIP25 5'-flanking sequence was performed in transgenic Arabidopsis plants, and probable elements prediction analysis revealed the possible presence of negative regulatory elements within the 161 bp region from -1117 to -957 that were responsible for the specificity of the ZmbZIP25 5'-flanking sequence.
Functional Analysis of Maize Silk-Specific ZmbZIP25 Promoter
Li, Wanying; Yu, Dan; Yu, Jingjuan; Zhu, Dengyun; Zhao, Qian
2018-01-01
ZmbZIP25 (Zea mays bZIP (basic leucine zipper) transcription factor 25) is a function-unknown protein that belongs to the D group of the bZIP transcription factor family. RNA-seq data showed that the expression of ZmbZIP25 was tissue-specific in maize silks, and this specificity was confirmed by RT-PCR (reverse transcription-polymerase chain reaction). In situ RNA hybridization showed that ZmbZIP25 was expressed exclusively in the xylem of maize silks. A 5′ RACE (rapid amplification of cDNA ends) assay identified an adenine residue as the transcription start site of the ZmbZIP25 gene. To characterize this silk-specific promoter, we isolated and analyzed a 2450 bp (from −2083 to +367) and a 2600 bp sequence of ZmbZIP25 (from −2083 to +517, the transcription start site was denoted +1). Stable expression assays in Arabidopsis showed that the expression of the reporter gene GUS driven by the 2450 bp ZmbZIP25 5′-flanking fragment occurred exclusively in the papillae of Arabidopsis stigmas. Furthermore, transient expression assays in maize indicated that GUS and GFP expression driven by the 2450 bp ZmbZIP25 5′-flanking sequences occurred only in maize silks and not in other tissues. However, no GUS or GFP expression was driven by the 2600 bp ZmbZIP25 5′-flanking sequences in either stable or transient expression assays. A series of deletion analyses of the 2450 bp ZmbZIP25 5′-flanking sequence was performed in transgenic Arabidopsis plants, and probable elements prediction analysis revealed the possible presence of negative regulatory elements within the 161 bp region from −1117 to −957 that were responsible for the specificity of the ZmbZIP25 5′-flanking sequence. PMID:29534529
A Structural Overview of RNA-Dependent RNA Polymerases from the Flaviviridae Family
Wu, Jiqin; Liu, Weichi; Gong, Peng
2015-01-01
RNA-dependent RNA polymerases (RdRPs) from the Flaviviridae family are representatives of viral polymerases that carry out RNA synthesis through a de novo initiation mechanism. They share a ≈ 600-residue polymerase core that displays a canonical viral RdRP architecture resembling an encircled right hand with palm, fingers, and thumb domains surrounding the active site. Polymerase catalytic motifs A–E in the palm and motifs F/G in the fingers are shared by all viral RdRPs with sequence and/or structural conservations regardless of the mechanism of initiation. Different from RdRPs carrying out primer-dependent initiation, Flaviviridae and other de novo RdRPs utilize a priming element often integrated in the thumb domain to facilitate primer-independent initiation. Upon the transition to the elongation phase, this priming element needs to undergo currently unresolved conformational rearrangements to accommodate the growth of the template-product RNA duplex. In the genera of Flavivirus and Pestivirus, the polymerase module in the C-terminal part of the RdRP protein may be regulated in cis by the N-terminal region of the same polypeptide. Either being a methyltransferase in Flavivirus or a functionally unclarified module in Pestivirus, this region could play auxiliary roles for the canonical folding and/or the catalysis of the polymerase, through defined intra-molecular interactions. PMID:26062131
A Viral (Arc)hive for Metazoan Memory.
Parrish, Nicholas F; Tomonaga, Keizo
2018-01-11
Arc, a master regulator of synaptic plasticity, contains sequence elements that are evolutionarily related to retrotransposon Gag genes. Two related papers in this issue of Cell show that Arc retains retroviral-like capsid-forming ability and can transmit mRNA between cells in the nervous system, a process that may be important for synaptic function. Copyright © 2017 Elsevier Inc. All rights reserved.
Non-canonical mechanism for translational control in bacteria: synthesis of ribosomal protein S1
Boni, Irina V.; Artamonova, Valentina S.; Tzareva, Nina V.; Dreyfus, Marc
2001-01-01
Translation initiation region (TIR) of the rpsA mRNA encoding ribosomal protein S1 is one of the most efficient in Escherichia coli despite the absence of a canonical Shine–Dalgarno-element. Its high efficiency is under strong negative autogenous control, a puzzling phenomenon as S1 has no strict sequence specificity. To define sequence and structural elements responsible for translational efficiency and autoregulation of the rpsA mRNA, a series of rpsA′–′lacZ chromosomal fusions bearing various mutations in the rpsA TIR was created and tested for β-galactosidase activity in the absence and presence of excess S1. These in vivo results, as well as data obtained by in vitro techniques and phylogenetic comparison, allow us to propose a model for the structural and functional organization of the rpsA TIR specific for proteobacteria related to E.coli. According to the model, the high efficiency of translation initiation is provided by a specific fold of the rpsA leader forming a non-contiguous ribosome entry site, which is destroyed upon binding of free S1 when it acts as an autogenous repressor. PMID:11483525
RNA Editing and Its Molecular Mechanism in Plant Organelles
Ichinose, Mizuho; Sugita, Mamoru
2016-01-01
RNA editing by cytidine (C) to uridine (U) conversions is widespread in plant mitochondria and chloroplasts. In some plant taxa, “reverse” U-to-C editing also occurs. However, to date, no instance of RNA editing has yet been reported in green algae and the complex thalloid liverworts. RNA editing may have evolved in early land plants 450 million years ago. However, in some plant species, including the liverwort, Marchantia polymorpha, editing may have been lost during evolution. Most RNA editing events can restore the evolutionarily conserved amino acid residues in mRNAs or create translation start and stop codons. Therefore, RNA editing is an essential process to maintain genetic information at the RNA level. Individual RNA editing sites are recognized by plant-specific pentatricopeptide repeat (PPR) proteins that are encoded in the nuclear genome. These PPR proteins are characterized by repeat elements that bind specifically to RNA sequences upstream of target editing sites. In flowering plants, non-PPR proteins also participate in multiple RNA editing events as auxiliary factors. C-to-U editing can be explained by cytidine deamination. The proteins discovered to date are important factors for RNA editing but a bona fide RNA editing enzyme has yet to be identified. PMID:28025543
Hamm, Jorg; Alessi, Dario R; Biondi, Ricardo M
2002-11-29
The design of specific inhibitors for protein kinases is an important step toward elucidation of intracellular signal transduction pathways and to guide drug discovery programs. We devised a model approach to generate specific, competitive kinase inhibitors by isolating substrate mimics containing two independent binding sites with an anti-idiotype strategy from combinatorial RNA libraries. As a general test for the ability to generate highly specific kinase inhibitors, we selected the transcription factor cAMP-response element-binding protein (CREB) that is phosphorylated on the same serine residue by the protein kinase MSK1 as well as by RSK1. The sequences and structures of these kinases are very similar, about 60% of their amino acids are identical. Nevertheless, we can demonstrate that the selected RNA inhibitors inhibit specifically CREB phosphorylation by MSK1 but do not affect CREB phosphorylation by RSK1. The inhibitors interact preferentially with the inactive form of MSK1. Furthermore, we demonstrate that RNA ligands can be conformation-specific probes, and this feature allowed us to describe magnesium ion-dependent conformational changes of MSK1 upon activation.
Bioinformatic Analysis Reveals Archaeal tRNATyr and tRNATrp Identities in Bacteria
Mukai, Takahito; Reynolds, Noah M.; Crnković, Ana; Söll, Dieter
2017-01-01
The tRNA identity elements for some amino acids are distinct between the bacterial and archaeal domains. Searching in recent genomic and metagenomic sequence data, we found some candidate phyla radiation (CPR) bacteria with archaeal tRNA identity for Tyr-tRNA and Trp-tRNA synthesis. These bacteria possess genes for tyrosyl-tRNA synthetase (TyrRS) and tryptophanyl-tRNA synthetase (TrpRS) predicted to be derived from DPANN superphylum archaea, while the cognate tRNATyr and tRNATrp genes reveal bacterial or archaeal origins. We identified a trace of domain fusion and swapping in the archaeal-type TyrRS gene of a bacterial lineage, suggesting that CPR bacteria may have used this mechanism to create diverse proteins. Archaeal-type TrpRS of bacteria and a few TrpRS species of DPANN archaea represent a new phylogenetic clade (named TrpRS-A). The TrpRS-A open reading frames (ORFs) are always associated with another ORF (named ORF1) encoding an unknown protein without global sequence identity to any known protein. However, our protein structure prediction identified a putative HIGH-motif and KMSKS-motif as well as many α-helices that are characteristic of class I aminoacyl-tRNA synthetase (aaRS) homologs. These results provide another example of the diversity of molecular components that implement the genetic code and provide a clue to the early evolution of life and the genetic code. PMID:28230768
Conifers have a unique small RNA silencing signature
Dolgosheina, Elena V.; Morin, Ryan D.; Aksay, Gozde; Sahinalp, S. Cenk; Magrini, Vincent; Mardis, Elaine R.; Mattsson, Jim; Unrau, Peter J.
2008-01-01
Plants produce small RNAs to negatively regulate genes, viral nucleic acids, and repetitive elements at either the transcriptional or post-transcriptional level in a process that is referred to as RNA silencing. While RNA silencing has been extensively studied across the different phyla of the animal kingdom (e.g., mouse, fly, worm), similar studies in the plant kingdom have focused primarily on angiosperms, thus limiting evolutionary studies of RNA silencing in plants. Here we report on an unexpected phylogenetic difference in the size distribution of small RNAs among the vascular plants. By extracting total RNA from freshly growing shoot tissue, we conducted a survey of small RNAs in 24 vascular plant species. We find that conifers, which radiated from the other seed-bearing plants ∼260 million years ago, fail to produce significant amounts of 24-nucleotide (nt) RNAs that are known to guide DNA methylation and heterochromatin formation in angiosperms. Instead, they synthesize a diverse population of small RNAs that are exactly 21-nt long. This finding was confirmed by high-throughput sequencing of the small RNA sequences from a conifer, Pinus contorta. A conifer EST search revealed the presence of a novel Dicer-like (DCL) family, which may be responsible for the observed change in small RNA expression. No evidence for DCL3, an enzyme that matures 24-nt RNAs in angiosperms, was found. We hypothesize that the diverse class of 21-nt RNAs found in conifers may help to maintain organization of their unusually large genomes. PMID:18566193
Conifers have a unique small RNA silencing signature.
Dolgosheina, Elena V; Morin, Ryan D; Aksay, Gozde; Sahinalp, S Cenk; Magrini, Vincent; Mardis, Elaine R; Mattsson, Jim; Unrau, Peter J
2008-08-01
Plants produce small RNAs to negatively regulate genes, viral nucleic acids, and repetitive elements at either the transcriptional or post-transcriptional level in a process that is referred to as RNA silencing. While RNA silencing has been extensively studied across the different phyla of the animal kingdom (e.g., mouse, fly, worm), similar studies in the plant kingdom have focused primarily on angiosperms, thus limiting evolutionary studies of RNA silencing in plants. Here we report on an unexpected phylogenetic difference in the size distribution of small RNAs among the vascular plants. By extracting total RNA from freshly growing shoot tissue, we conducted a survey of small RNAs in 24 vascular plant species. We find that conifers, which radiated from the other seed-bearing plants approximately 260 million years ago, fail to produce significant amounts of 24-nucleotide (nt) RNAs that are known to guide DNA methylation and heterochromatin formation in angiosperms. Instead, they synthesize a diverse population of small RNAs that are exactly 21-nt long. This finding was confirmed by high-throughput sequencing of the small RNA sequences from a conifer, Pinus contorta. A conifer EST search revealed the presence of a novel Dicer-like (DCL) family, which may be responsible for the observed change in small RNA expression. No evidence for DCL3, an enzyme that matures 24-nt RNAs in angiosperms, was found. We hypothesize that the diverse class of 21-nt RNAs found in conifers may help to maintain organization of their unusually large genomes.
RNA structural constraints in the evolution of the influenza A virus genome NP segment
Gultyaev, Alexander P; Tsyganov-Bodounov, Anton; Spronken, Monique IJ; van der Kooij, Sander; Fouchier, Ron AM; Olsthoorn, René CL
2014-01-01
Conserved RNA secondary structures were predicted in the nucleoprotein (NP) segment of the influenza A virus genome using comparative sequence and structure analysis. A number of structural elements exhibiting nucleotide covariations were identified over the whole segment length, including protein-coding regions. Calculations of mutual information values at the paired nucleotide positions demonstrate that these structures impose considerable constraints on the virus genome evolution. Functional importance of a pseudoknot structure, predicted in the NP packaging signal region, was confirmed by plaque assays of the mutant viruses with disrupted structure and those with restored folding using compensatory substitutions. Possible functions of the conserved RNA folding patterns in the influenza A virus genome are discussed. PMID:25180940
qPMS9: An Efficient Algorithm for Quorum Planted Motif Search
NASA Astrophysics Data System (ADS)
Nicolae, Marius; Rajasekaran, Sanguthevar
2015-01-01
Discovering patterns in biological sequences is a crucial problem. For example, the identification of patterns in DNA sequences has resulted in the determination of open reading frames, identification of gene promoter elements, intron/exon splicing sites, and SH RNAs, location of RNA degradation signals, identification of alternative splicing sites, etc. In protein sequences, patterns have led to domain identification, location of protease cleavage sites, identification of signal peptides, protein interactions, determination of protein degradation elements, identification of protein trafficking elements, discovery of short functional motifs, etc. In this paper we focus on the identification of an important class of patterns, namely, motifs. We study the (l, d) motif search problem or Planted Motif Search (PMS). PMS receives as input n strings and two integers l and d. It returns all sequences M of length l that occur in each input string, where each occurrence differs from M in at most d positions. Another formulation is quorum PMS (qPMS), where the motif appears in at least q% of the strings. We introduce qPMS9, a parallel exact qPMS algorithm that offers significant runtime improvements on DNA and protein datasets. qPMS9 solves the challenging DNA (l, d)-instances (28, 12) and (30, 13). The source code is available at https://code.google.com/p/qpms9/.
Ruchko, Mykhaylo V; Gorodnya, Olena M; Pastukh, Viktor M; Swiger, Brad M; Middleton, Natavia S; Wilson, Glenn L; Gillespie, Mark N
2009-02-01
Reactive oxygen species (ROS) generated in hypoxic pulmonary artery endothelial cells cause transient oxidative base modifications in the hypoxia-response element (HRE) of the VEGF gene that bear a conspicuous relationship to induction of VEGF mRNA expression (K.A. Ziel et al., FASEB J. 19, 387-394, 2005). If such base modifications are indeed linked to transcriptional regulation, then they should be detected in HRE sequences associated with transcriptionally active nucleosomes. Southern blot analysis of the VEGF HRE associated with nucleosome fractions prepared by micrococcal nuclease digestion indicated that hypoxia redistributed some HRE sequences from multinucleosomes to transcriptionally active mono- and dinucleosome fractions. A simple PCR method revealed that VEGF HRE sequences harboring oxidative base modifications were found exclusively in mononucleosomes. Inhibition of hypoxia-induced ROS generation with myxathiozol prevented formation of oxidative base modifications but not the redistribution of HRE sequences into mono- and dinucleosome fractions. The histone deacetylase inhibitor trichostatin A caused retention of HRE sequences in compacted nucleosome fractions and prevented formation of oxidative base modifications. These findings suggest that the hypoxia-induced oxidant stress directed at the VEGF HRE requires the sequence to be repositioned into mononucleosomes and support the prospect that oxidative modifications in this sequence are an important step in transcriptional activation.
Kaplan, Oktay I; Berber, Burak; Hekim, Nezih; Doluca, Osman
2016-11-02
Many studies show that short non-coding sequences are widely conserved among regulatory elements. More and more conserved sequences are being discovered since the development of next generation sequencing technology. A common approach to identify conserved sequences with regulatory roles relies on topological changes such as hairpin formation at the DNA or RNA level. G-quadruplexes, non-canonical nucleic acid topologies with little established biological roles, are increasingly considered for conserved regulatory element discovery. Since the tertiary structure of G-quadruplexes is strongly dependent on the loop sequence which is disregarded by the generally accepted algorithm, we hypothesized that G-quadruplexes with similar topology and, indirectly, similar interaction patterns, can be determined using phylogenetic clustering based on differences in the loop sequences. Phylogenetic analysis of 52 G-quadruplex forming sequences in the Escherichia coli genome revealed two conserved G-quadruplex motifs with a potential regulatory role. Further analysis revealed that both motifs tend to form hairpins and G quadruplexes, as supported by circular dichroism studies. The phylogenetic analysis as described in this work can greatly improve the discovery of functional G-quadruplex structures and may explain unknown regulatory patterns. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Transcriptomic analysis of rice aleurone cells identified a novel abscisic acid response element.
Watanabe, Kenneth A; Homayouni, Arielle; Gu, Lingkun; Huang, Kuan-Ying; Ho, Tuan-Hua David; Shen, Qingxi J
2017-09-01
Seeds serve as a great model to study plant responses to drought stress, which is largely mediated by abscisic acid (ABA). The ABA responsive element (ABRE) is a key cis-regulatory element in ABA signalling. However, its consensus sequence (ACGTG(G/T)C) is present in the promoters of only about 40% of ABA-induced genes in rice aleurone cells, suggesting other ABREs may exist. To identify novel ABREs, RNA sequencing was performed on aleurone cells of rice seeds treated with 20 μM ABA. Gibbs sampling was used to identify enriched elements, and particle bombardment-mediated transient expression studies were performed to verify the function. Gene ontology analysis was performed to predict the roles of genes containing the novel ABREs. This study revealed 2443 ABA-inducible genes and a novel ABRE, designated as ABREN, which was experimentally verified to mediate ABA signalling in rice aleurone cells. Many of the ABREN-containing genes are predicted to be involved in stress responses and transcription. Analysis of other species suggests that the ABREN may be monocot specific. This study also revealed interesting expression patterns of genes involved in ABA metabolism and signalling. Collectively, this study advanced our understanding of diverse cis-regulatory sequences and the transcriptomes underlying ABA responses in rice aleurone cells. © 2017 John Wiley & Sons Ltd.
Conservation of CD44 exon v3 functional elements in mammals
Vela, Elena; Hilari, Josep M; Delclaux, María; Fernández-Bellon, Hugo; Isamat, Marcos
2008-01-01
Background The human CD44 gene contains 10 variable exons (v1 to v10) that can be alternatively spliced to generate hundreds of different CD44 protein isoforms. Human CD44 variable exon v3 inclusion in the final mRNA depends on a multisite bipartite splicing enhancer located within the exon itself, which we have recently described, and provides the protein domain responsible for growth factor binding to CD44. Findings We have analyzed the sequence of CD44v3 in 95 mammalian species to report high conservation levels for both its splicing regulatory elements (the 3' splice site and the exonic splicing enhancer), and the functional glycosaminglycan binding site coded by v3. We also report the functional expression of CD44v3 isoforms in peripheral blood cells of different mammalian taxa with both consensus and variant v3 sequences. Conclusion CD44v3 mammalian sequences maintain all functional splicing regulatory elements as well as the GAG binding site with the same relative positions and sequence identity previously described during alternative splicing of human CD44. The sequence within the GAG attachment site, which in turn contains the Y motif of the exonic splicing enhancer, is more conserved relative to the rest of exon. Amplification of CD44v3 sequence from mammalian species but not from birds, fish or reptiles, may lead to classify CD44v3 as an exclusive mammalian gene trait. PMID:18710510
A U-Rich Element in the 5′ Untranslated Region Is Necessary for the Translation of p27 mRNA
Millard, S. Sean; Vidal, Anxo; Markus, Maurice; Koff, Andrew
2000-01-01
Increased translation of p27 mRNA correlates with withdrawal of cells from the cell cycle. This raised the possibility that antimitogenic signals might mediate their effects on p27 expression by altering complexes that formed on p27 mRNA, regulating its translation. In this report, we identify a U-rich sequence in the 5′ untranslated region (5′UTR) of p27 mRNA that is necessary for efficient translation in proliferating and nonproliferating cells. We show that a number of factors bind to the 5′UTR in vitro in a manner dependent on the U-rich element, and their availability in the cytosol is controlled in a growth- and cell cycle-dependent fashion. One of these factors is HuR, a protein previously implicated in mRNA stability, transport, and translation. Another is hnRNP C1 and C2, proteins implicated in mRNA processing and the translation of a specific subset of mRNAs expressed in differentiated cells. In lovastatin-treated MDA468 cells, the mobility of the associated hnRNP C1 and C2 proteins changed, and this correlated with increased p27 expression. Together, these data suggest that the U-rich dependent RNP complex on the 5′UTR may regulate the translation of p27 mRNA and may be a target of antimitogenic signals. PMID:10913178
Cross-Species Functionality of Pararetroviral Elements Driving Ribosome Shunting
Pooggin, Mikhail M.; Fütterer, Johannes; Hohn, Thomas
2008-01-01
Background Cauliflower mosaic virus (CaMV) and Rice tungro bacilliform virus (RTBV) belong to distinct genera of pararetroviruses infecting dicot and monocot plants, respectively. In both viruses, polycistronic translation of pregenomic (pg) RNA is initiated by shunting ribosomes that bypass a large region of the pgRNA leader with several short (s)ORFs and a stable stem-loop structure. The shunt requires translation of a 5′-proximal sORF terminating near the stem. In CaMV, mutations knocking out this sORF nearly abolish shunting and virus viability. Methodology/Principal Findings Here we show that two distant regions of the CaMV leader that form a minimal shunt configuration comprising the sORF, a bottom part of the stem, and a shunt landing sequence can be replaced by heterologous sequences that form a structurally similar configuration in RTBV without any dramatic effect on shunt-mediated translation and CaMV infectivity. The CaMV-RTBV chimeric leader sequence was largely stable over five viral passages in turnip plants: a few alterations that did eventually occur in the virus progenies are indicative of fine tuning of the chimeric sequence during adaptation to a new host. Conclusions/Significance Our findings demonstrate cross-species functionality of pararetroviral cis-elements driving ribosome shunting and evolutionary conservation of the shunt mechanism. We are grateful to Matthias Müller and Sandra Pauli for technical assistance. This work was initiated at Friedrich Miescher Institute (Basel, Switzerland). We thank Prof. Thomas Boller for hosting the group at the Institute of Botany. PMID:18286203
Sequence analysis of the genome of carnation (Dianthus caryophyllus L.).
Yagi, Masafumi; Kosugi, Shunichi; Hirakawa, Hideki; Ohmiya, Akemi; Tanase, Koji; Harada, Taro; Kishimoto, Kyutaro; Nakayama, Masayoshi; Ichimura, Kazuo; Onozaki, Takashi; Yamaguchi, Hiroyasu; Sasaki, Nobuhiro; Miyahara, Taira; Nishizaki, Yuzo; Ozeki, Yoshihiro; Nakamura, Noriko; Suzuki, Takamasa; Tanaka, Yoshikazu; Sato, Shusei; Shirasawa, Kenta; Isobe, Sachiko; Miyamura, Yoshinori; Watanabe, Akiko; Nakayama, Shinobu; Kishida, Yoshie; Kohara, Mitsuyo; Tabata, Satoshi
2014-06-01
The whole-genome sequence of carnation (Dianthus caryophyllus L.) cv. 'Francesco' was determined using a combination of different new-generation multiplex sequencing platforms. The total length of the non-redundant sequences was 568,887,315 bp, consisting of 45,088 scaffolds, which covered 91% of the 622 Mb carnation genome estimated by k-mer analysis. The N50 values of contigs and scaffolds were 16,644 bp and 60,737 bp, respectively, and the longest scaffold was 1,287,144 bp. The average GC content of the contig sequences was 36%. A total of 1050, 13, 92 and 143 genes for tRNAs, rRNAs, snoRNA and miRNA, respectively, were identified in the assembled genomic sequences. For protein-encoding genes, 43 266 complete and partial gene structures excluding those in transposable elements were deduced. Gene coverage was ∼ 98%, as deduced from the coverage of the core eukaryotic genes. Intensive characterization of the assigned carnation genes and comparison with those of other plant species revealed characteristic features of the carnation genome. The results of this study will serve as a valuable resource for fundamental and applied research of carnation, especially for breeding new carnation varieties. Further information on the genomic sequences is available at http://carnation.kazusa.or.jp. © The Author 2013. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Villacreses, Javier; Rojas-Herrera, Marcelo; Sánchez, Carolina; Hewstone, Nicole; Undurraga, Soledad F.; Alzate, Juan F.; Manque, Patricio; Maracaja-Coutinho, Vinicius; Polanco, Victor
2015-01-01
Here, we report the genome sequence and evidence for transcriptional activity of a virus-like element in the native Chilean berry tree Aristotelia chilensis. We propose to name the endogenous sequence as Aristotelia chilensis Virus 1 (AcV1). High-throughput sequencing of the genome of this tree uncovered an endogenous viral element, with a size of 7122 bp, corresponding to the complete genome of AcV1. Its sequence contains three open reading frames (ORFs): ORFs 1 and 2 shares 66%–73% amino acid similarity with members of the Caulimoviridae virus family, especially the Petunia vein clearing virus (PVCV), Petuvirus genus. ORF1 encodes a movement protein (MP); ORF2 a Reverse Transcriptase (RT) and a Ribonuclease H (RNase H) domain; and ORF3 showed no amino acid sequence similarity with any other known virus proteins. Analogous to other known endogenous pararetrovirus sequences (EPRVs), AcV1 is integrated in the genome of Maqui Berry and showed low viral transcriptional activity, which was detected by deep sequencing technology (DNA and RNA-seq). Phylogenetic analysis of AcV1 and other pararetroviruses revealed a closer resemblance with Petuvirus. Overall, our data suggests that AcV1 could be a new member of Caulimoviridae family, genus Petuvirus, and the first evidence of this kind of virus in a fruit plant. PMID:25855242
RNA Interference for improving the Outcome of Islet Transplantation
Li, Feng; Mahato, Ram I
2010-01-01
Islet transplantation has the potential to cure type 1 diabetes. Despite recent therapeutic success, it is still not common because a large number of transpanted islets get damaged by multiple challenges including instant blood mediated inflammatory reaction, hypoxia/reperfusion injury, inflammatory cytokines, and immune rejection. RNA interference (RNAi) is an novel strategy to selectively degrade target mRNA. The use of RNAi technologies to downregulate the expression of harmful genes has the potential to improve the outcome of islet transplantation. The aim of this review is to gain a thorough understanding of biological obstacles to islet transplantation and discuss how to overcome these barriers using different RNAi technologies. This eventually will help improve islet survival and function post transplantaion. Chemically synthesized small interferring RNA (siRNA), vector based short haripin RNA (shRNA), and their critical design elements (such as sequences, promoters, backbone) are discussed. The application of combinatorial RNAi in islet transplantation is also discussed. Last but not the least, several delivery strategies for enhanced gene silencing are discussed, including chemical modification of siRNA, complex formation, bioconjugation, and viral vectors. PMID:21156190
Li, Jin-Xue; Hou, Xiao-Jin; Zhu, Jiao; Zhou, Jing-Jing; Huang, Hua-Bin; Yue, Jian-Qiang; Gao, Jun-Yan; Du, Yu-Xia; Hu, Cheng-Xiao; Hu, Chun-Gen; Zhang, Jin-Zhi
2017-01-01
Water deficit is a key factor to induce flowering in many woody plants, but reports on the molecular mechanisms of floral induction and flowering by water deficit are scarce. Here, we analyzed the morphology, cytology, and different hormone levels of lemon buds during floral inductive water deficits. Higher levels of ABA were observed, and the initiation of floral bud differentiation was examined by paraffin sections analysis. A total of 1638 differentially expressed genes (DEGs) were identified by RNA sequencing. DEGs were related to flowering, hormone biosynthesis, or metabolism. The expression of some DEGs was associated with floral induction by real-time PCR analysis. However, some DEGs may not have anything to do with flowering induction/flower development; they may be involved in general stress/drought response. Four genes from the phosphatidylethanolamine-binding protein family were further investigated. Ectopic expression of these genes in Arabidopsis changed the flowering time of transgenic plants. Furthermore, the 5′ flanking region of these genes was also isolated and sequence analysis revealed the presence of several putative cis-regulatory elements, including basic elements and hormone regulation elements. The spatial and temporal expression patterns of these promoters were investigated under water deficit treatment. Based on these findings, we propose a model for citrus flowering under water deficit conditions, which will enable us to further understand the molecular mechanism of water deficit-regulated flowering in citrus. Highlight: Based on gene activity during floral inductive water deficits identified by RNA sequencing and genes associated with lemon floral transition, a model for citrus flowering under water deficit conditions is proposed. PMID:28659956
Characterization of the human gene (TBXAS1) encoding thromboxane synthase.
Miyata, A; Yokoyama, C; Ihara, H; Bandoh, S; Takeda, O; Takahashi, E; Tanabe, T
1994-09-01
The gene encoding human thromboxane synthase (TBXAS1) was isolated from a human EMBL3 genomic library using human platelet thromboxane synthase cDNA as a probe. Nucleotide sequencing revealed that the human thromboxane synthase gene spans more than 75 kb and consists of 13 exons and 12 introns, of which the splice donor and acceptor sites conform to the GT/AG rule. The exon-intron boundaries of the thromboxane synthase gene were similar to those of the human cytochrome P450 nifedipine oxidase gene (CYP3A4) except for introns 9 and 10, although the primary sequences of these enzymes exhibited 35.8% identity each other. The 1.2-kb of the 5'-flanking region sequence contained potential binding sites for several transcription factors (AP-1, AP-2, GATA-1, CCAAT box, xenobiotic-response element, PEA-3, LF-A1, myb, basic transcription element and cAMP-response element). Primer-extension analysis indicated the multiple transcription-start sites, and the major start site was identified as an adenine residue located 142 bases upstream of the translation-initiation site. However, neither a typical TATA box nor a typical CAAT box is found within the 100-b upstream of the translation-initiation site. Southern-blot analysis revealed the presence of one copy of the thromboxane synthase gene per haploid genome. Furthermore, a fluorescence in situ hybridization study revealed that the human gene for thromboxane synthase is localized to band q33-q34 of the long arm of chromosome 7. A tissue-distribution study demonstrated that thromboxane synthase mRNA is widely expressed in human tissues and is particularly abundant in peripheral blood leukocyte, spleen, lung and liver. The low but significant levels of mRNA were observed in kidney, placenta and thymus.
MicroRNA-Dependent Transcriptional Silencing of Transposable Elements in Drosophila Follicle Cells.
Mugat, Bruno; Akkouche, Abdou; Serrano, Vincent; Armenise, Claudia; Li, Blaise; Brun, Christine; Fulga, Tudor A; Van Vactor, David; Pélisson, Alain; Chambeyron, Séverine
2015-05-01
RNA interference-related silencing mechanisms concern very diverse and distinct biological processes, from gene regulation (via the microRNA pathway) to defense against molecular parasites (through the small interfering RNA and the Piwi-interacting RNA pathways). Small non-coding RNAs serve as specificity factors that guide effector proteins to ribonucleic acid targets via base-pairing interactions, to achieve transcriptional or post-transcriptional regulation. Because of the small sequence complementarity required for microRNA-dependent post-transcriptional regulation, thousands of microRNA (miRNA) putative targets have been annotated in Drosophila. In Drosophila somatic ovarian cells, genomic parasites, such as transposable elements (TEs), are transcriptionally repressed by chromatin changes induced by Piwi-interacting RNAs (piRNAs) that prevent them from invading the germinal genome. Here we show, for the first time, that a functional miRNA pathway is required for the piRNA-mediated transcriptional silencing of TEs in this tissue. Global miRNA depletion, caused by tissue- and stage-specific knock down of drosha (involved in miRNA biogenesis), AGO1 or gawky (both responsible for miRNA activity), resulted in loss of TE-derived piRNAs and chromatin-mediated transcriptional de-silencing of TEs. This specific TE de-repression was also observed upon individual titration (by expression of the complementary miRNA sponge) of two miRNAs (miR-14 and miR-34) as well as in a miR-14 loss-of-function mutant background. Interestingly, the miRNA defects differentially affected TE- and 3' UTR-derived piRNAs. To our knowledge, this is the first indication of possible differences in the biogenesis or stability of TE- and 3' UTR-derived piRNAs. This work is one of the examples of detectable phenotypes caused by loss of individual miRNAs in Drosophila and the first genetic evidence that miRNAs have a role in the maintenance of genome stability via piRNA-mediated TE repression.
Huiet, L; Feldstein, P A; Tsai, J H; Falk, B W
1993-12-01
Primer extension analyses and a PCR-based cloning strategy were used to identify and characterize 5' nucleotide sequences on the maize stripe virus (MStV) RNA4 mRNA transcripts encoding the major noncapsid protein (NCP). Direct RNA sequence analysis by primer extension showed that the NCP mRNA transcripts had 10-15 nucleotides beyond the 5' terminus of the MStV RNA4 nucleotide sequence. MStV genomic RNAs isolated from ribonucleoprotein particles (RNPs) lacked the additional 5' nucleotides. cDNA clones representing the 5' region of the mRNA transcripts were constructed, and the nucleotide sequences of the 5' regions were determined for 16 clones. Each was found to have a distinct 10-15 nucleotide sequence immediately 5' of the MStV RNA4 sequence. Eleven of 16 clones had the correct MStV RNA4 5' nucleotide sequence, while five showed minor variations at or near the 5' most MStV RNA4 nucleotide. These characteristics show strong similarities to other viral mRNA transcripts which are synthesized by cap snatching.
DIP1 modulates stem cell homeostasis in Drosophila through regulation of sisR-1.
Wong, Jing Ting; Akhbar, Farzanah; Ng, Amanda Yunn Ee; Tay, Mandy Li-Ian; Loi, Gladys Jing En; Pek, Jun Wei
2017-10-02
Stable intronic sequence RNAs (sisRNAs) are by-products of splicing and regulate gene expression. How sisRNAs are regulated is unclear. Here we report that a double-stranded RNA binding protein, Disco-interacting protein 1 (DIP1) regulates sisRNAs in Drosophila. DIP1 negatively regulates the abundance of sisR-1 and INE-1 sisRNAs. Fine-tuning of sisR-1 by DIP1 is important to maintain female germline stem cell homeostasis by modulating germline stem cell differentiation and niche adhesion. Drosophila DIP1 localizes to a nuclear body (satellite body) and associates with the fourth chromosome, which contains a very high density of INE-1 transposable element sequences that are processed into sisRNAs. DIP1 presumably acts outside the satellite bodies to regulate sisR-1, which is not on the fourth chromosome. Thus, our study identifies DIP1 as a sisRNA regulatory protein that controls germline stem cell self-renewal in Drosophila.Stable intronic sequence RNAs (sisRNAs) are by-products of splicing from introns with roles in embryonic development in Drosophila. Here, the authors show that the RNA binding protein DIP1 regulates sisRNAs in Drosophila, which is necessary for germline stem cell homeostasis.
Schwientek, Patrick; Wendler, Sergej; Neshat, Armin; Eirich, Christina; Rückert, Christian; Klein, Andreas; Wehmeier, Udo F; Kalinowski, Jörn; Stoye, Jens; Pühler, Alfred
2013-08-20
Actinoplanes sp. SE50/110 is known as the producer of the alpha-glucosidase inhibitor acarbose, a potent drug in the treatment of type-2 diabetes mellitus. We conducted the first whole transcriptome analysis of Actinoplanes sp. SE50/110, using RNA-sequencing technology for comparative gene expression studies between cells grown in maltose minimal medium, maltose minimal medium with trace elements, and glucose complex medium. We first studied the behavior of Actinoplanes sp. SE50/110 cultivations in these three media and found that the different media had significant impact on growth rate and in particular on acarbose production. It was demonstrated that Actinoplanes sp. SE50/110 grew well in all three media, but acarbose biosynthesis was only observed in cultures grown in maltose minimal medium with and without trace elements. When comparing the expression profiles between the maltose minimal media with and without trace elements, only few significantly differentially expressed genes were found, which mainly code for uptake systems of metal ions provided in the trace element solution. In contrast, the comparison of expression profiles from maltose minimal medium and glucose complex medium revealed a large number of differentially expressed genes, of which the most conspicuous genes account for iron storage and uptake. Furthermore, the acarbose gene cluster was found to be highly expressed in maltose-containing media and almost silent in the glucose-containing medium. In addition, a putative antibiotic biosynthesis gene cluster was found to be similarly expressed as the acarbose cluster. Copyright © 2012 Elsevier B.V. All rights reserved.
Cold shock protein YB-1 is involved in hypoxia-dependent gene transcription
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rauen, Thomas; Frye, Bjoern C.; Pneumology, University Medical Center, University of Freiburg, Freiburg
Hypoxia-dependent gene regulation is largely orchestrated by hypoxia-inducible factors (HIFs), which associate with defined nucleotide sequences of hypoxia-responsive elements (HREs). Comparison of the regulatory HRE within the 3′ enhancer of the human erythropoietin (EPO) gene with known binding motifs for cold shock protein Y-box (YB) protein-1 yielded strong similarities within the Y-box element and 3′ adjacent sequences. DNA binding assays confirmed YB-1 binding to both, single- and double-stranded HRE templates. Under hypoxia, we observed nuclear shuttling of YB-1 and co-immunoprecipitation assays demonstrated that YB-1 and HIF-1α physically interact with each other. Cellular YB-1 depletion using siRNA significantly induced hypoxia-dependent EPOmore » production at both, promoter and mRNA level. Vice versa, overexpressed YB-1 significantly reduced EPO-HRE-dependent gene transcription, whereas this effect was minor under normoxia. HIF-1α overexpression induced hypoxia-dependent gene transcription through the same element and accordingly, co-expression with YB-1 reduced HIF-1α-mediated EPO induction under hypoxic conditions. Taken together, we identified YB-1 as a novel binding factor for HREs that participates in fine-tuning of the hypoxia transcriptome. - Highlights: • Hypoxia drives nuclear translocation of cold shock protein YB-1. • YB-1 physically interacts with hypoxia-inducible factor (HIF)-1α. • YB-1 binds to the hypoxia-responsive element (HRE) within the erythropoietin (EPO) 3′ enhancer. • YB-1 trans-regulates transcription of hypoxia-dependent genes such as EPO and VEGF.« less
Diene, Seydina M; Merhej, Vicky; Henry, Mireille; El Filali, Adil; Roux, Véronique; Robert, Catherine; Azza, Saïd; Gavory, Frederick; Barbe, Valérie; La Scola, Bernard; Raoult, Didier; Rolain, Jean-Marc
2013-02-01
Here, we sequenced the 5,419,609 bp circular genome of an Enterobacter aerogenes clinical isolate that killed a patient and was resistant to almost all current antibiotics (except gentamicin) commonly used to treat Enterobacterial infections, including colistin. Genomic and phylogenetic analyses explain the discrepancies of this bacterium and show that its core genome originates from another genus, Klebsiella. Atypical characteristics of this bacterium (i.e., motility, presence of ornithine decarboxylase, and lack of urease activity) are attributed to genomic mosaicism, by acquisition of additional genes, such as the complete 60,582 bp flagellar assembly operon acquired "en bloc" from the genus Serratia. The genealogic tree of the 162,202 bp multidrug-resistant conjugative plasmid shows that it is a chimera of transposons and integrative conjugative elements from various bacterial origins, resembling a rhizome. Moreover, we demonstrate biologically that a G53S mutation in the pmrA gene results in colistin resistance. E. aerogenes has a large RNA population comprising 8 rRNA operons and 87 cognate tRNAs that have the ability to translate transferred genes that use different codons, as exemplified by the significantly different codon usage between genes from the core genome and the "mobilome." On the basis of our findings, the evolution of this bacterium to become a "killer bug" with new genomic repertoires was from three criteria that are "opportunity, power, and usage" to indicate a sympatric lifestyle: "opportunity" to meet other bacteria and exchange foreign sequences since this bacteria was similar to sympatric bacteria; "power" to integrate these foreign sequences such as the acquisition of several mobile genetic elements (plasmids, integrative conjugative element, prophages, transposons, flagellar assembly system, etc.) found in his genome; and "usage" to have the ability to translate these sequences including those from rare codons to serve as a translator of foreign languages.
Structure and mechanism of the T-box riboswitches
Zhang, Jinwei
2015-01-01
In most Gram-positive bacteria, including many clinically devastating pathogens from genera such as Bacillus, Clostridium, Listeria and Staphylococcus, T-box riboswitches sense and regulate intracellular availability of amino acids through a multipartite mRNA-tRNA interaction. The T-box mRNA leaders respond to nutrient starvation by specifically binding cognate tRNAs and sensing whether the bound tRNA is aminoacylated, as a proxy for amino acid availability. Based on this readout, T-boxes direct a transcriptional or translational switch to control the expression of downstream genes involved in various aspects of amino acid metabolism: biosynthesis, transport, aminoacylation, transamidation, etc. Two decades after its discovery, the structural and mechanistic underpinnings of the T-box riboswitch were recently elucidated, producing a wealth of insights into how two structured RNAs can recognize each other with robust affinity and exquisite selectivity. The T-box paradigm exemplifies how natural non-coding RNAs can interact not just through sequence complementarity, but can add molecular specificity by precisely juxtaposing RNA structural motifs, exploiting inherently flexible elements and the biophysical properties of post-transcriptional modifications, ultimately achieving a high degree of shape complementarity through mutually induced fit. The T-box also provides a proof-of-principle that compact RNA domains can recognize minute chemical changes (such as tRNA aminoacylation) on another RNA. The unveiling of the structure and mechanism of the T-box system thus expands our appreciation of the range of capabilities and modes of action of structured non-coding RNAs, and hints at the existence of networks of non-coding RNAs that communicate through both, structural and sequence specificity. PMID:25959893
Long interspersed element-1 (LINE-1): passenger or driver in human neoplasms?
Rodić, Nemanja; Burns, Kathleen H
2013-03-01
LINE-1 (L1) retrotransposons make up a significant portion of human genomes, with an estimated 500,000 copies per genome. Like other retrotransposons, L1 retrotransposons propagate through RNA sequences that are reverse transcribed into DNA sequences, which are integrated into new genomic loci. L1 somatic insertions have the potential to disrupt the transcriptome by inserting into or nearby genes. By mutating genes and playing a role in epigenetic dysregulation, L1 transposons may contribute to tumorigenesis. Studies of the "mobilome" have lagged behind other tumor characterizations at the sequence, transcript, and epigenetic levels. Here, we consider evidence that L1 retrotransposons may sometimes drive human tumorigenesis.
Comparison of ribosomal RNA removal methods for transcriptome sequencing workflows in teleost fish
USDA-ARS?s Scientific Manuscript database
RNA sequencing (RNA-Seq) is becoming the standard for transcriptome analysis. Removal of contaminating ribosomal RNA (rRNA) is a priority in the preparation of libraries suitable for sequencing. rRNAs are commonly removed from total RNA via either mRNA selection or rRNA depletion. These methods have...
Regulatory effects of cotranscriptional RNA structure formation and transitions.
Liu, Sheng-Rui; Hu, Chun-Gen; Zhang, Jin-Zhi
2016-09-01
RNAs, which play significant roles in many fundamental biological processes of life, fold into sophisticated and precise structures. RNA folding is a dynamic and intricate process, which conformation transition of coding and noncoding RNAs form the primary elements of genetic regulation. The cellular environment contains various intrinsic and extrinsic factors that potentially affect RNA folding in vivo, and experimental and theoretical evidence increasingly indicates that the highly flexible features of the RNA structure are affected by these factors, which include the flanking sequence context, physiochemical conditions, cis RNA-RNA interactions, and RNA interactions with other molecules. Furthermore, distinct RNA structures have been identified that govern almost all steps of biological processes in cells, including transcriptional activation and termination, transcriptional mutagenesis, 5'-capping, splicing, 3'-polyadenylation, mRNA export and localization, and translation. Here, we briefly summarize the dynamic and complex features of RNA folding along with a wide variety of intrinsic and extrinsic factors that affect RNA folding. We then provide several examples to elaborate RNA structure-mediated regulation at the transcriptional and posttranscriptional levels. Finally, we illustrate the regulatory roles of RNA structure and discuss advances pertaining to RNA structure in plants. WIREs RNA 2016, 7:562-574. doi: 10.1002/wrna.1350 For further resources related to this article, please visit the WIREs website. © 2016 Wiley Periodicals, Inc.
antaRNA: ant colony-based RNA sequence design.
Kleinkauf, Robert; Mann, Martin; Backofen, Rolf
2015-10-01
RNA sequence design is studied at least as long as the classical folding problem. Although for the latter the functional fold of an RNA molecule is to be found ,: inverse folding tries to identify RNA sequences that fold into a function-specific target structure. In combination with RNA-based biotechnology and synthetic biology ,: reliable RNA sequence design becomes a crucial step to generate novel biochemical components. In this article ,: the computational tool antaRNA is presented. It is capable of compiling RNA sequences for a given structure that comply in addition with an adjustable full range objective GC-content distribution ,: specific sequence constraints and additional fuzzy structure constraints. antaRNA applies ant colony optimization meta-heuristics and its superior performance is shown on a biological datasets. http://www.bioinf.uni-freiburg.de/Software/antaRNA CONTACT: backofen@informatik.uni-freiburg.de Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.
Accetto, Tomaž; Avguštin, Gorazd
2011-01-01
The Shine-Dalgarno (SD) sequence is a key element directing the translation to initiate at the authentic start codons and also enabling translation initiation to proceed in 5′ untranslated mRNA regions (5′-UTRs) containing moderately strong secondary structures. Bioinformatic analysis of almost forty genomes from the major bacterial phylum Bacteroidetes revealed, however, a general absence of SD sequence, drop in GC content and consequently reduced tendency to form secondary structures in 5′-UTRs. The experiments using the Prevotella bryantii TC1-1 expression system were in agreement with these findings: neither addition nor omission of SD sequence in the unstructured 5′-UTR affected the level of the reporter protein, non-specific nuclease NucB. Further, NucB level in P. bryantii TC1-1, contrary to hMGFP level in Escherichia coli, was five times lower when SD sequence formed part of the secondary structure with a folding energy -5,2 kcal/mol. Also, the extended SD sequences did not affect protein levels as in E. coli. It seems therefore that a functional SD interaction does not take place during the translation initiation in P. bryanttii TC1-1 and possibly other members of phylum Bacteroidetes although the anti SD sequence is present in 16S rRNA genes of their genomes. We thus propose that in the absence of the SD sequence interaction, the selection of genuine start codons in Bacteroidetes is accomplished by binding of ribosomal protein S1 to unstructured 5′-UTR as opposed to coding region which is inaccessible due to mRNA secondary structure. Additionally, we found that sequence logos of region preceding the start codons may be used as taxonomical markers. Depending on whether complete sequence logo or only part of it, such as information content and base proportion at specific positions, is used, bacterial genera or families and in some cases even bacterial phyla can be distinguished. PMID:21857964
The Drosophila Tis11 protein and its effects on mRNA expression in flies.
Choi, Youn-Jeong; Lai, Wi S; Fedic, Robert; Stumpo, Deborah J; Huang, Weichun; Li, Leping; Perera, Lalith; Brewer, Brandy Y; Wilson, Gerald M; Mason, James M; Blackshear, Perry J
2014-12-19
Members of the mammalian tristetraprolin family of CCCH tandem zinc finger proteins can bind to certain AU-rich elements (AREs) in mRNAs, leading to their deadenylation and destabilization. Mammals express three or four members of this family, but Drosophila melanogaster and other insects appear to contain a single gene, Tis11. We found that recombinant Drosophila Tis11 protein could bind to ARE-containing RNA oligonucleotides with low nanomolar affinity. Remarkably, co-expression in mammalian cells with "target" RNAs demonstrated that Tis11 could promote destabilization of ARE-containing mRNAs and that this was partially dependent on a conserved C-terminal sequence resembling the mammalian NOT1 binding domain. Drosophila Tis11 promoted both deadenylation and decay of a target transcript in this heterologous cell system. We used chromosome deletion/duplication and P element insertion to produce two types of Tis11 deficiency in adult flies, both of which were viable and fertile. To address the hypothesis that Tis11 deficiency would lead to the abnormal accumulation of potential target transcripts, we analyzed gene expression in adult flies by deep mRNA sequencing. We identified 69 transcripts from 56 genes that were significantly up-regulated more than 1.5-fold in both types of Tis11-deficient flies. Ten of the up-regulated transcripts encoded probable proteases, but many other functional classes of proteins were represented. Many of the up-regulated transcripts contained potential binding sites for tristetraprolin family member proteins that were conserved in other Drosophila species. Tis11 is thus an ARE-binding, mRNA-destabilizing protein that may play a role in post-transcriptional gene expression in Drosophila and other insects. © 2014 by The American Society for Biochemistry and Molecular Biology, Inc.
The Drosophila Tis11 Protein and Its Effects on mRNA Expression in Flies*
Choi, Youn-Jeong; Lai, Wi S.; Fedic, Robert; Stumpo, Deborah J.; Huang, Weichun; Li, Leping; Perera, Lalith; Brewer, Brandy Y.; Wilson, Gerald M.; Mason, James M.; Blackshear, Perry J.
2014-01-01
Members of the mammalian tristetraprolin family of CCCH tandem zinc finger proteins can bind to certain AU-rich elements (AREs) in mRNAs, leading to their deadenylation and destabilization. Mammals express three or four members of this family, but Drosophila melanogaster and other insects appear to contain a single gene, Tis11. We found that recombinant Drosophila Tis11 protein could bind to ARE-containing RNA oligonucleotides with low nanomolar affinity. Remarkably, co-expression in mammalian cells with “target” RNAs demonstrated that Tis11 could promote destabilization of ARE-containing mRNAs and that this was partially dependent on a conserved C-terminal sequence resembling the mammalian NOT1 binding domain. Drosophila Tis11 promoted both deadenylation and decay of a target transcript in this heterologous cell system. We used chromosome deletion/duplication and P element insertion to produce two types of Tis11 deficiency in adult flies, both of which were viable and fertile. To address the hypothesis that Tis11 deficiency would lead to the abnormal accumulation of potential target transcripts, we analyzed gene expression in adult flies by deep mRNA sequencing. We identified 69 transcripts from 56 genes that were significantly up-regulated more than 1.5-fold in both types of Tis11-deficient flies. Ten of the up-regulated transcripts encoded probable proteases, but many other functional classes of proteins were represented. Many of the up-regulated transcripts contained potential binding sites for tristetraprolin family member proteins that were conserved in other Drosophila species. Tis11 is thus an ARE-binding, mRNA-destabilizing protein that may play a role in post-transcriptional gene expression in Drosophila and other insects. PMID:25342740
Wei, Liya; Gu, Lianfeng; Song, Xianwei; Cui, Xiekui; Lu, Zhike; Zhou, Ming; Wang, Lulu; Hu, Fengyi; Zhai, Jixian; Meyers, Blake C.; Cao, Xiaofeng
2014-01-01
Transposable elements (TEs) and repetitive sequences make up over 35% of the rice (Oryza sativa) genome. The host regulates the activity of different TEs by different epigenetic mechanisms, including DNA methylation, histone H3K9 methylation, and histone H3K4 demethylation. TEs can also affect the expression of host genes. For example, miniature inverted repeat TEs (MITEs), dispersed high copy-number DNA TEs, can influence the expression of nearby genes. In plants, 24-nt small interfering RNAs (siRNAs) are mainly derived from repeats and TEs. However, the extent to which TEs, particularly MITEs associated with 24-nt siRNAs, affect gene expression remains elusive. Here, we show that the rice Dicer-like 3 homolog OsDCL3a is primarily responsible for 24-nt siRNA processing. Impairing OsDCL3a expression by RNA interference caused phenotypes affecting important agricultural traits; these phenotypes include dwarfism, larger flag leaf angle, and fewer secondary branches. We used small RNA deep sequencing to identify 535,054 24-nt siRNA clusters. Of these clusters, ∼82% were OsDCL3a-dependent and showed significant enrichment of MITEs. Reduction of OsDCL3a function reduced the 24-nt siRNAs predominantly from MITEs and elevated expression of nearby genes. OsDCL3a directly targets genes involved in gibberellin and brassinosteroid homeostasis; OsDCL3a deficiency may affect these genes, thus causing the phenotypes of dwarfism and enlarged flag leaf angle. Our work identifies OsDCL3a-dependent 24-nt siRNAs derived from MITEs as broadly functioning regulators for fine-tuning gene expression, which may reflect a conserved epigenetic mechanism in higher plants with genomes rich in dispersed repeats or TEs. PMID:24554078
DOE Office of Scientific and Technical Information (OSTI.GOV)
von Arnim, Albrecht G.
2015-02-04
Protein synthesis, or translation, consumes a sizable fraction of the cell’s energy budget, estimated at 5% and up to 50% in differentiated and growing cells, respectively. Plants also invest significant energy and biomass to construct and maintain the translation apparatus. Translation is regulated by a variety of external stimuli. Compared to transcriptional control, attributes of translational control include reduced sensitivity to stochastic fluctuation, a finer gauge of control, and more rapid responsiveness to environmental stimuli. Yet, our murky understanding of translational control allows few generalizations. Consequently, translational regulation is underutilized in the context of transgene regulation, although synthetic biologists aremore » now beginning to appropriate RNA-level gene regulation into their regulatory circuits. We also know little about how translational control contributes to the diversity of plant form and function. This project explored how an emerging regulatory mRNA sequence element, upstream open reading frames (uORFs), is integrated with the general translation initiation machinery to permit translational regulation on specific mRNAs.« less
2010-01-01
Background Adenosine to inosine (A-to-I) RNA-editing is an essential post-transcriptional mechanism that occurs in numerous sites in the human transcriptome, mainly within Alu repeats. It has been shown to have consistent levels of editing across individuals in a few targets in the human brain and altered in several human pathologies. However, the variability across human individuals of editing levels in other tissues has not been studied so far. Results Here, we analyzed 32 skin samples, looking at A-to-I editing level in three genes within coding sequences and in the Alu repeats of six different genes. We observed highly consistent editing levels across different individuals as well as across tissues, not only in coding targets but, surprisingly, also in the non evolutionary conserved Alu repeats. Conclusions Our findings suggest that A-to-I RNA-editing of Alu elements is a tightly regulated process and, as such, might have been recruited in the course of primate evolution for post-transcriptional regulatory mechanisms. PMID:21029430
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pallan, Pradeep S.; Marshall, William S.; Harp, Joel
To understand the role of structural elements of RNA pseudoknots in controlling the extent of -1-type ribosomal frameshifting, we determined the crystal structure of a high-efficiency frameshifting mutant of the pseudoknot from potato leaf roll virus (PLRV). Correlations of the structure with available in vitro frameshifting data for PLRV pseudoknot mutants implicate sequence and length of a stem-loop linker as modulators of frameshifting efficiency. Although the sequences and overall structures of the RNA pseudoknots from PLRV and beet western yellow virus (BWYV) are similar, nucleotide deletions in the linker and adjacent minor groove loop abolish frameshifting only with the latter.more » Conversely, mutant PLRV pseudoknots with up to four nucleotides deleted in this region exhibit nearly wild-type frameshifting efficiencies. The crystal structure helps rationalize the different tolerances for deletions in the PLRV and BWYV RNAs, and we have used it to build a three-dimensional model of the PRLV pseudoknot with a four-nucleotide deletion. The resulting structure defines a minimal RNA pseudoknot motif composed of 22 nucleotides capable of stimulating -1-type ribosomal frameshifts.« less
“Guest list” or “Black list”? Heritable Small RNAs as Immunogenic Memories
Rechavi, Oded
2016-01-01
Small RNA-mediated gene silencing plays a pivotal role in genome immunity by recognizing and eliminating viruses and transposons which otherwise may colonize the genome. However, this can be challenging since individual genomic parasites are highly diverse, and employ multiple immune evasion techniques. In this review, I discuss a new theory proposing that the integrity of the germline is maintained by transgenerationally-transmitted RNA “memories” that record ancestral gene expression patterns, and delineate “Self” from “Foreign” sequences. To maintain such recollection two tactics are employed in parallel: “black listing” of invading nucleic acids, and “guest listing” of endogenous genes. Studies in a number of organisms have shown that this memorization is used by the next generation small RNAs to act as “Inherited Vaccines” that ambush invading elements, or as “Inherited Licenses” that grant the transcription of autogenous sequences. PMID:24231398
Genome analysis of the platypus reveals unique signatures of evolution.
Warren, Wesley C; Hillier, LaDeana W; Marshall Graves, Jennifer A; Birney, Ewan; Ponting, Chris P; Grützner, Frank; Belov, Katherine; Miller, Webb; Clarke, Laura; Chinwalla, Asif T; Yang, Shiaw-Pyng; Heger, Andreas; Locke, Devin P; Miethke, Pat; Waters, Paul D; Veyrunes, Frédéric; Fulton, Lucinda; Fulton, Bob; Graves, Tina; Wallis, John; Puente, Xose S; López-Otín, Carlos; Ordóñez, Gonzalo R; Eichler, Evan E; Chen, Lin; Cheng, Ze; Deakin, Janine E; Alsop, Amber; Thompson, Katherine; Kirby, Patrick; Papenfuss, Anthony T; Wakefield, Matthew J; Olender, Tsviya; Lancet, Doron; Huttley, Gavin A; Smit, Arian F A; Pask, Andrew; Temple-Smith, Peter; Batzer, Mark A; Walker, Jerilyn A; Konkel, Miriam K; Harris, Robert S; Whittington, Camilla M; Wong, Emily S W; Gemmell, Neil J; Buschiazzo, Emmanuel; Vargas Jentzsch, Iris M; Merkel, Angelika; Schmitz, Juergen; Zemann, Anja; Churakov, Gennady; Kriegs, Jan Ole; Brosius, Juergen; Murchison, Elizabeth P; Sachidanandam, Ravi; Smith, Carly; Hannon, Gregory J; Tsend-Ayush, Enkhjargal; McMillan, Daniel; Attenborough, Rosalind; Rens, Willem; Ferguson-Smith, Malcolm; Lefèvre, Christophe M; Sharp, Julie A; Nicholas, Kevin R; Ray, David A; Kube, Michael; Reinhardt, Richard; Pringle, Thomas H; Taylor, James; Jones, Russell C; Nixon, Brett; Dacheux, Jean-Louis; Niwa, Hitoshi; Sekita, Yoko; Huang, Xiaoqiu; Stark, Alexander; Kheradpour, Pouya; Kellis, Manolis; Flicek, Paul; Chen, Yuan; Webber, Caleb; Hardison, Ross; Nelson, Joanne; Hallsworth-Pepin, Kym; Delehaunty, Kim; Markovic, Chris; Minx, Pat; Feng, Yucheng; Kremitzki, Colin; Mitreva, Makedonka; Glasscock, Jarret; Wylie, Todd; Wohldmann, Patricia; Thiru, Prathapan; Nhan, Michael N; Pohl, Craig S; Smith, Scott M; Hou, Shunfeng; Nefedov, Mikhail; de Jong, Pieter J; Renfree, Marilyn B; Mardis, Elaine R; Wilson, Richard K
2008-05-08
We present a draft genome sequence of the platypus, Ornithorhynchus anatinus. This monotreme exhibits a fascinating combination of reptilian and mammalian characters. For example, platypuses have a coat of fur adapted to an aquatic lifestyle; platypus females lactate, yet lay eggs; and males are equipped with venom similar to that of reptiles. Analysis of the first monotreme genome aligned these features with genetic innovations. We find that reptile and platypus venom proteins have been co-opted independently from the same gene families; milk protein genes are conserved despite platypuses laying eggs; and immune gene family expansions are directly related to platypus biology. Expansions of protein, non-protein-coding RNA and microRNA families, as well as repeat elements, are identified. Sequencing of this genome now provides a valuable resource for deep mammalian comparative analyses, as well as for monotreme biology and conservation.
Genome analysis of the platypus reveals unique signatures of evolution
Warren, Wesley C.; Hillier, LaDeana W.; Marshall Graves, Jennifer A.; Birney, Ewan; Ponting, Chris P.; Grützner, Frank; Belov, Katherine; Miller, Webb; Clarke, Laura; Chinwalla, Asif T.; Yang, Shiaw-Pyng; Heger, Andreas; Locke, Devin P.; Miethke, Pat; Waters, Paul D.; Veyrunes, Frédéric; Fulton, Lucinda; Fulton, Bob; Graves, Tina; Wallis, John; Puente, Xose S.; López-Otín, Carlos; Ordóñez, Gonzalo R.; Eichler, Evan E.; Chen, Lin; Cheng, Ze; Deakin, Janine E.; Alsop, Amber; Thompson, Katherine; Kirby, Patrick; Papenfuss, Anthony T.; Wakefield, Matthew J.; Olender, Tsviya; Lancet, Doron; Huttley, Gavin A.; Smit, Arian F. A.; Pask, Andrew; Temple-Smith, Peter; Batzer, Mark A.; Walker, Jerilyn A.; Konkel, Miriam K.; Harris, Robert S.; Whittington, Camilla M.; Wong, Emily S. W.; Gemmell, Neil J.; Buschiazzo, Emmanuel; Vargas Jentzsch, Iris M.; Merkel, Angelika; Schmitz, Juergen; Zemann, Anja; Churakov, Gennady; Kriegs, Jan Ole; Brosius, Juergen; Murchison, Elizabeth P.; Sachidanandam, Ravi; Smith, Carly; Hannon, Gregory J.; Tsend-Ayush, Enkhjargal; McMillan, Daniel; Attenborough, Rosalind; Rens, Willem; Ferguson-Smith, Malcolm; Lefèvre, Christophe M.; Sharp, Julie A.; Nicholas, Kevin R.; Ray, David A.; Kube, Michael; Reinhardt, Richard; Pringle, Thomas H.; Taylor, James; Jones, Russell C.; Nixon, Brett; Dacheux, Jean-Louis; Niwa, Hitoshi; Sekita, Yoko; Huang, Xiaoqiu; Stark, Alexander; Kheradpour, Pouya; Kellis, Manolis; Flicek, Paul; Chen, Yuan; Webber, Caleb; Hardison, Ross; Nelson, Joanne; Hallsworth-Pepin, Kym; Delehaunty, Kim; Markovic, Chris; Minx, Pat; Feng, Yucheng; Kremitzki, Colin; Mitreva, Makedonka; Glasscock, Jarret; Wylie, Todd; Wohldmann, Patricia; Thiru, Prathapan; Nhan, Michael N.; Pohl, Craig S.; Smith, Scott M.; Hou, Shunfeng; Renfree, Marilyn B.; Mardis, Elaine R.; Wilson, Richard K.
2009-01-01
We present a draft genome sequence of the platypus, Ornithorhynchus anatinus. This monotreme exhibits a fascinating combination of reptilian and mammalian characters. For example, platypuses have a coat of fur adapted to an aquatic lifestyle; platypus females lactate, yet lay eggs; and males are equipped with venom similar to that of reptiles. Analysis of the first monotreme genome aligned these features with genetic innovations. We find that reptile and platypus venom proteins have been co-opted independently from the same gene families; milk protein genes are conserved despite platypuses laying eggs; and immune gene family expansions are directly related to platypus biology. Expansions of protein, non-protein-coding RNA and microRNA families, as well as repeat elements, are identified. Sequencing of this genome now provides a valuable resource for deep mammalian comparative analyses, as well as for monotreme biology and conservation. PMID:18464734
Sela, Dotan; Chen, Lu; Martin-Brown, Skylar; Washburn, Michael P; Florens, Laurence; Conaway, Joan Weliky; Conaway, Ronald C
2012-06-29
The basic leucine zipper transcription factor ATF6α functions as a master regulator of endoplasmic reticulum (ER) stress response genes. Previous studies have established that, in response to ER stress, ATF6α translocates to the nucleus and activates transcription of ER stress response genes upon binding sequence specifically to ER stress response enhancer elements in their promoters. In this study, we investigate the biochemical mechanism by which ATF6α activates transcription. By exploiting a combination of biochemical and multidimensional protein identification technology-based mass spectrometry approaches, we have obtained evidence that ATF6α functions at least in part by recruiting to the ER stress response enhancer elements of ER stress response genes a collection of RNA polymerase II coregulatory complexes, including the Mediator and multiple histone acetyltransferase complexes, among which are the Spt-Ada-Gcn5 acetyltransferase (SAGA) and Ada-Two-A-containing (ATAC) complexes. Our findings shed new light on the mechanism of action of ATF6α, and they outline a straightforward strategy for applying multidimensional protein identification technology mass spectrometry to determine which RNA polymerase II transcription factors and coregulators are recruited to promoters and other regulatory elements to control transcription.
Kakou, Bidénam; Angers, Bernard; Glémet, Hélène
2016-03-01
The intergenic spacer (IGS) is located between ribosomal RNA (rRNA) gene copies. Within the IGS, regulatory elements for rRNA gene transcription are found, as well as a varying number of other repetitive elements that are at the root of IGS length heterogeneity. This heterogeneity has been shown to have a functional significance through its effect on growth rate. Here, we present the structural organization of yellow perch (Perca flavescens) IGS based on its entire sequence, as well as the IGS length variation within a natural population. Yellow perch IGS structure has four discrete regions containing tandem repeat elements. For three of these regions, no specific length class was detected as allele size was seemingly normally distributed. However, for one repeat region, PCR amplification uncovered the presence of two distinctive IGS variants representing a length difference of 1116 bp. This repeat region was also devoid of any CpG sites despite a high GC content. Balanced selection may be holding the alleles in the population and would account for the high diversity of length variants observed for adjacent regions. Our study is an important precursor for further work aiming to assess the role of IGS length variation in influencing growth rate in fish.
Shen, Yingjia; Venu, R.C.; Nobuta, Kan; Wu, Xiaohui; Notibala, Varun; Demirci, Caghan; Meyers, Blake C.; Wang, Guo-Liang; Ji, Guoli; Li, Qingshun Q.
2011-01-01
Polyadenylation sites mark the ends of mRNA transcripts. Alternative polyadenylation (APA) may alter sequence elements and/or the coding capacity of transcripts, a mechanism that has been demonstrated to regulate gene expression and transcriptome diversity. To study the role of APA in transcriptome dynamics, we analyzed a large-scale data set of RNA “tags” that signify poly(A) sites and expression levels of mRNA. These tags were derived from a wide range of tissues and developmental stages that were mutated or exposed to environmental treatments, and generated using digital gene expression (DGE)–based protocols of the massively parallel signature sequencing (MPSS-DGE) and the Illumina sequencing-by-synthesis (SBS-DGE) sequencing platforms. The data offer a global view of APA and how it contributes to transcriptome dynamics. Upon analysis of these data, we found that ∼60% of Arabidopsis genes have multiple poly(A) sites. Likewise, ∼47% and 82% of rice genes use APA, supported by MPSS-DGE and SBS-DGE tags, respectively. In both species, ∼49%–66% of APA events were mapped upstream of annotated stop codons. Interestingly, 10% of the transcriptomes are made up of APA transcripts that are differentially distributed among developmental stages and in tissues responding to environmental stresses, providing an additional level of transcriptome dynamics. Examples of pollen-specific APA switching and salicylic acid treatment-specific APA clearly demonstrated such dynamics. The significance of these APAs is more evident in the 3034 genes that have conserved APA events between rice and Arabidopsis. PMID:21813626
Two alternative ways of start site selection in human norovirus reinitiation of translation.
Luttermann, Christine; Meyers, Gregor
2014-04-25
The calicivirus minor capsid protein VP2 is expressed via termination/reinitiation. This process depends on an upstream sequence element denoted termination upstream ribosomal binding site (TURBS). We have shown for feline calicivirus and rabbit hemorrhagic disease virus that the TURBS contains three sequence motifs essential for reinitiation. Motif 1 is conserved among caliciviruses and is complementary to a sequence in the 18 S rRNA leading to the model that hybridization between motif 1 and 18 S rRNA tethers the post-termination ribosome to the mRNA. Motif 2 and motif 2* are proposed to establish a secondary structure positioning the ribosome relative to the start site of the terminal ORF. Here, we analyzed human norovirus (huNV) sequences for the presence and importance of these motifs. The three motifs were identified by sequence analyses in the region upstream of the VP2 start site, and we showed that these motifs are essential for reinitiation of huNV VP2 translation. More detailed analyses revealed that the site of reinitiation is not fixed to a single codon and does not need to be an AUG, even though this codon is clearly preferred. Interestingly, we were able to show that reinitiation can occur at AUG codons downstream of the canonical start/stop site in huNV and feline calicivirus but not in rabbit hemorrhagic disease virus. Although reinitiation at the original start site is independent of the Kozak context, downstream initiation exhibits requirements for start site sequence context known for linear scanning. These analyses on start codon recognition give a more detailed insight into this fascinating mechanism of gene expression.