Panicker, Indu S.; Browning, Glenn F.; Markham, Philip F.
2015-01-01
While the genomes of many Mycoplasma species have been sequenced, there are no collated data on translational start codon usage, and the effects of alternate start codons on gene expression have not been studied. Analysis of the annotated genomes found that ATG was the most prevalent translational start codon among Mycoplasma spp. However in Mycoplasma gallisepticum a GTG start codon is commonly used in the vlhA multigene family, which encodes a highly abundant, phase variable lipoprotein adhesin. Therefore, the effect of this alternate start codon on expression of a reporter PhoA lipoprotein was examined in M. gallisepticum. Mutation of the start codon from ATG to GTG resulted in a 2.5 fold reduction in the level of transcription of the phoA reporter, but the level of PhoA activity in the transformants containing phoA with a GTG start codon was only 63% of that of the transformants with a phoA with an ATG start codon, suggesting that GTG was a more efficient translational initiation codon. The effect of swapping the translational start codon in phoA reporter gene expression was less in M. gallisepticum than has been seen previously in Escherichia coli or Bacillus subtilis, suggesting the process of translational initiation in mycoplasmas may have some significant differences from those used in other bacteria. This is the first study of translational start codon usage in mycoplasmas and the impact of the use of an alternate start codon on expression in these bacteria. PMID:26010086
Montealegre, Maria Camila; La Rosa, Sabina Leanti; Roh, Jung Hyeob; Harvey, Barrett R.
2015-01-01
ABSTRACT The endocarditis and biofilm-associated pili (Ebp) are important in Enterococcus faecalis pathogenesis, and the pilus tip, EbpA, has been shown to play a major role in pilus biogenesis, biofilm formation, and experimental infections. Based on in silico analyses, we previously predicted that ATT is the EbpA translational start codon, not the ATG codon, 120 bp downstream of ATT, which is annotated as the translational start. ATT is rarely used to initiate protein synthesis, leading to our hypothesis that this codon participates in translational regulation of Ebp production. To investigate this possibility, site-directed mutagenesis was used to introduce consecutive stop codons in place of two lysines at positions 5 and 6 from the ATT, to replace the ATT codon in situ with ATG, and then to revert this ATG to ATT; translational fusions of ebpA to lacZ were also constructed to investigate the effect of these start codons on translation. Our results showed that the annotated ATG does not start translation of EbpA, implicating ATT as the start codon; moreover, the presence of ATT, compared to the engineered ATG, resulted in significantly decreased EbpA surface display, attenuated biofilm, and reduced adherence to fibrinogen. Corroborating these findings, the translational fusion with the native ATT as the initiation codon showed significantly decreased expression of β-galactosidase compared to the construct with ATG in place of ATT. Thus, these results demonstrate that the rare initiation codon of EbpA negatively regulates EbpA surface display and negatively affects Ebp-associated functions, including biofilm and adherence to fibrinogen. PMID:26015496
Wu, C J; Janssen, G R
1996-10-01
The Streptomyces vinaceus viomycin phosphotransferase (vph) mRNA contains an untranslated leader with a conventional Shine-Dalgarno homology. The vph leader was removed by ligation of the vph coding sequence to the transcriptional start site of a Streptomyces or an Escherichia coli promoter, such that transcription would initiate at the first position of the vph start codon. Analysis of mRNA demonstrated that transcription initiated primarily at the A of the vph AUG translational start codon in both Streptomyces lividans and E. coli; cells expressing the unleadered vph mRNA were resistant to viomycin indicating that the Shine-Dalgarno sequence, or other features contained within the leader, was not necessary for vph translation. Addition of four nucleotides (5'-AUGC-3') onto the 5' end of the unleadered vph mRNA resulted in translation initiation from the vph start codon and the AUG triplet contained within the added sequence. Translational fusions of vph sequence to a Tn5 neo reporter gene indicated that the first 16 codons of vph coding sequence were sufficient to specify the translational start site and reading frame for expression of neomycin resistance in both E. coli and S. lividans.
New Universal Rules of Eukaryotic Translation Initiation Fidelity
Zur, Hadas; Tuller, Tamir
2013-01-01
The accepted model of eukaryotic translation initiation begins with the scanning of the transcript by the pre-initiation complex from the 5′end until an ATG codon with a specific nucleotide (nt) context surrounding it is recognized (Kozak rule). According to this model, ATG codons upstream to the beginning of the ORF should affect translation. We perform for the first time, a genome-wide statistical analysis, uncovering a new, more comprehensive and quantitative, set of initiation rules for improving the cost of translation and its efficiency. Analyzing dozens of eukaryotic genomes, we find that in all frames there is a universal trend of selection for low numbers of ATG codons; specifically, 16–27 codons upstream, but also 5–11 codons downstream of the START ATG, include less ATG codons than expected. We further suggest that there is selection for anti optimal ATG contexts in the vicinity of the START ATG. Thus, the efficiency and fidelity of translation initiation is encoded in the 5′UTR as required by the scanning model, but also at the beginning of the ORF. The observed nt patterns suggest that in all the analyzed organisms the pre-initiation complex often misses the START ATG of the ORF, and may start translation from an alternative initiation start-site. Thus, to prevent the translation of undesired proteins, there is selection for nucleotide sequences with low affinity to the pre-initiation complex near the beginning of the ORF. With the new suggested rules we were able to obtain a twice higher correlation with ribosomal density and protein levels in comparison to the Kozak rule alone (e.g. for protein levels r = 0.7 vs. r = 0.31; p<10−12). PMID:23874179
Ivanov, Ivaylo P.; Loughran, Gary; Atkins, John F.
2008-01-01
In a minority of eukaryotic mRNAs, a small functional upstream ORF (uORF), often performing a regulatory role, precedes the translation start site for the main product(s). Here, conserved uORFs in numerous ornithine decarboxylase homologs are identified from yeast to mammals. Most have noncanonical evolutionarily conserved start codons, the main one being AUU, which has not been known as an initiator for eukaryotic chromosomal genes. The AUG-less uORF present in mouse antizyme inhibitor, one of the ornithine decarboxylase homologs in mammals, mediates polyamine-induced repression of the downstream main ORF. This repression is part of an autoregulatory circuit, and one of its sensors is the AUU codon, which suggests that translation initiation codon identity is likely used for regulation in eukaryotes. PMID:18626014
Ma, X X; Feng, Y P; Gu, Y X; Zhou, J H; Ma, Z R
2016-06-01
As for the alternative AUGs in foot-and-mouth disease virus (FMDV), nucleotide bias of the context flanking the AUG(2nd) could be used as a strong signal to initiate translation. To determine the role of the specific nucleotide context, dicistronic reporter constructs were engineered to contain different versions of nucleotide context linking between internal ribosome entry site (IRES) and downstream gene. The results indicate that under FMDV IRES-dependent mechanism, the nucleotide contexts flanking start codon can influence the translation initiation efficiencies. The most optimal sequences for both start codons have proved to be UUU AUG(1st) AAC and AAG AUG(2nd) GAA.
Hui, A; Hayflick, J; Dinkelspiel, K; de Boer, H A
1984-01-01
The effect on the translation efficiency of various mutations in the three bases (the -1 triplet) that precede the AUG start codon of the beta-galactosidase mRNA in Escherichia coli was studied. Of the 39 mutants examined, the level of expression varies over a 20-fold range. The most favorable combinations of bases in the -1 triplet are UAU and CUU. The expression levels in the mutants with UUC, UCA or AGG as the -1 triplet are 20-fold lower than those with UAU or CUU. In general, a U residue immediately preceding the start codon is more favorable for expression than any other base; furthermore, an A residue at the -2 position enhances the translation efficiency in most instances. In both cases, however, the degree of enhancement depends on its context, i.e. the neighboring bases. Although the rules derived from this study are complex, the results show that mutations in any of the three bases preceding the start codon can strongly affect the translational efficiency of the beta-galactosidase mRNA. PMID:6425057
Non-AUG translation: a new start for protein synthesis in eukaryotes
Kearse, Michael G.; Wilusz, Jeremy E.
2017-01-01
Although it was long thought that eukaryotic translation almost always initiates at an AUG start codon, recent advancements in ribosome footprint mapping have revealed that non-AUG start codons are used at an astonishing frequency. These non-AUG initiation events are not simply errors but instead are used to generate or regulate proteins with key cellular functions; for example, during development or stress. Misregulation of non-AUG initiation events contributes to multiple human diseases, including cancer and neurodegeneration, and modulation of non-AUG usage may represent a novel therapeutic strategy. It is thus becoming increasingly clear that start codon selection is regulated by many trans-acting initiation factors as well as sequence/structural elements within messenger RNAs and that non-AUG translation has a profound impact on cellular states. PMID:28982758
PreTIS: A Tool to Predict Non-canonical 5’ UTR Translational Initiation Sites in Human and Mouse
Reuter, Kerstin; Helms, Volkhard
2016-01-01
Translation of mRNA sequences into proteins typically starts at an AUG triplet. In rare cases, translation may also start at alternative non–AUG codons located in the annotated 5’ UTR which leads to an increased regulatory complexity. Since ribosome profiling detects translational start sites at the nucleotide level, the properties of these start sites can then be used for the statistical evaluation of functional open reading frames. We developed a linear regression approach to predict in–frame and out–of–frame translational start sites within the 5’ UTR from mRNA sequence information together with their translation initiation confidence. Predicted start codons comprise AUG as well as near–cognate codons. The underlying datasets are based on published translational start sites for human HEK293 and mouse embryonic stem cells that were derived by the original authors from ribosome profiling data. The average prediction accuracy of true vs. false start sites for HEK293 cells was 80%. When applied to mouse mRNA sequences, the same model predicted translation initiation sites observed in mouse ES cells with an accuracy of 76%. Moreover, we illustrate the effect of in silico mutations in the flanking sequence context of a start site on the predicted initiation confidence. Our new webservice PreTIS visualizes alternative start sites and their respective ORFs and predicts their ability to initiate translation. Solely, the mRNA sequence is required as input. PreTIS is accessible at http://service.bioinformatik.uni-saarland.de/pretis. PMID:27768687
Feeney, Morgan A.; Chandra, Govind; Findlay, Kim C.; Paget, Mark S. B.
2017-01-01
ABSTRACT The major oxidative stress response in Streptomyces is controlled by the sigma factor SigR and its cognate antisigma factor RsrA, and SigR activity is tightly controlled through multiple mechanisms at both the transcriptional and posttranslational levels. Here we show that sigR has a highly unusual GTC start codon and that this leads to another level of SigR regulation, in which SigR translation is repressed by translation initiation factor 3 (IF3). Changing the GTC to a canonical start codon causes SigR to be overproduced relative to RsrA, resulting in unregulated and constitutive expression of the SigR regulon. Similarly, introducing IF3* mutations that impair its ability to repress SigR translation has the same effect. Thus, the noncanonical GTC sigR start codon and its repression by IF3 are critical for the correct and proper functioning of the oxidative stress regulatory system. sigR and rsrA are cotranscribed and translationally coupled, and it had therefore been assumed that SigR and RsrA are produced in stoichiometric amounts. Here we show that RsrA can be transcribed and translated independently of SigR, present evidence that RsrA is normally produced in excess of SigR, and describe the factors that determine SigR-RsrA stoichiometry. PMID:28611250
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hipp, Katharina, E-mail: katharina.hipp@bio.uni-st
Plant infecting geminiviruses encode a small (A)C4 protein within the open reading frame of the replication-initiator protein. In African cassava mosaic virus, two in-frame start codons may be used for the translation of a longer and a shorter AC4 variant. Both were fused to green fluorescent protein or glutathione-S-transferase genes and expressed in fission yeast. The longer variant accumulated in discrete spots in the cytoplasm, whereas the shorter variant localized to the plasma membrane. A similar expression pattern was found in plants. A myristoylation motif may promote a targeting of the shorter variant to the plasma membrane. Mass spectrometry analysismore » of the yeast-expressed shorter variant detected the corresponding myristoylation. The biological relevance of the second start codon was confirmed using mutated infectious clones. Whereas mutating the first start codon had no effect on the infectivity in Nicotiana benthamiana plants, the second start codon proved to be essential. -- Highlights: •The ACMV AC4 may be translated from one or the other in-frame start codon. •Both AC4 variants are translated in fission yeast. •The long AC4 protein localizes to the cytoplasm, the short to the plasma membrane. •The short variant is myristoylated in yeast and may promote membrane localization. •Only the shorter AC4 variant has an impact on viral infections in plants.« less
Problem-Solving Test: The Effect of Synonymous Codons on Gene Expression
ERIC Educational Resources Information Center
Szeberenyi, Jozsef
2009-01-01
Terms to be familiar with before you start to solve the test: the genetic code, codon, degenerate codons, protein synthesis, aminoacyl-tRNA, anticodon, antiparallel orientation, wobble, unambiguous codons, ribosomes, initiation, elongation and termination of translation, peptidyl transferase, translocation, degenerate oligonucleotides, green…
Studtmann, Katrin; Ölschläger-Schütt, Janin; Buck, Friedrich; Richter, Dietmar; Sala, Carlo; Bockmann, Jürgen; Kindler, Stefan; Kreienkamp, Hans-Jürgen
2014-01-01
Local protein synthesis in dendrites enables neurons to selectively change the protein complement of individual postsynaptic sites. Though it is generally assumed that this mechanism requires tight translational control of dendritically transported mRNAs, it is unclear how translation of dendritic mRNAs is regulated. We have analyzed here translational control elements of the dendritically localized mRNA coding for the postsynaptic scaffold protein Shank1. In its 5′ region, the human Shank1 mRNA exhibits two alternative translation initiation sites (AUG+1 and AUG+214), three canonical upstream open reading frames (uORFs1-3) and a high GC content. In reporter assays, fragments of the 5′UTR with high GC content inhibit translation, suggesting a contribution of secondary structures. uORF3 is most relevant to translation control as it overlaps with the first in frame start codon (AUG+1), directing translation initiation to the second in frame start codon (AUG+214). Surprisingly, our analysis points to an additional uORF initiated at a non-canonical ACG start codon. Mutation of this start site leads to an almost complete loss of translation initiation at AUG+1, demonstrating that this unconventional uORF is required for Shank1 synthesis. Our data identify a novel mechanism whereby initiation at a non-canonical site allows for translation of the main Shank1 ORF despite a highly structured 5′UTR. PMID:24533096
CCC CGA is a weak translational recoding site in Escherichia coli.
Shu, Ping; Dai, Huacheng; Mandecki, Wlodek; Goldman, Emanuel
2004-12-08
Previously published experiments had indicated unexpected expression of a control vector in which a beta-galactosidase reporter was in the +1 reading frame relative to the translation start. This control vector contained the codon pair CCC CGA in the zero reading frame, raising the possibility that ribosomes rephased on this sequence, with peptidyl-tRNA(Pro) pairing with CCC in the +1 frame. This putative rephasing might also be exacerbated by the rare CGA Arg codon in the second position due to increased vacancy of the ribosomal A-site. To test this hypothesis, a series of site-directed mutants was constructed, including mutations in both the first and second codons of this codon pair. The results show that interrupting the continuous run of C residues with synonymous codon changes essentially abolishes the frameshift. Further, changing the rare Arg codon to a common Arg codon also reduces the frequency of the frameshift. These results provide strong support for the hypothesis that CCC CGA in the zero frame is indeed a weak translational frameshift site in Escherichia coli, with a 1-2% efficiency. Because the vector sequence also contains another CCC triplet in the +1 reading frame starting within the next codon after the CGA, our data also support possible contribution to expression of a +7 nucleotide ribosome hop into the same +1 reading frame. We also confirm here a previous report that CCC UGA is a translational frameshift site, in these experiments, with about 5% efficiency.
Molecular Structure and Transformation of the Glucose Dehydrogenase Gene in Drosophila Melanogaster
Whetten, R.; Organ, E.; Krasney, P.; Cox-Foster, D.; Cavener, D.
1988-01-01
We have precisely mapped and sequenced the three 5' exons of the Drosophila melanogaster Gld gene and have identified the start sites for transcription and translation. The first exon is composed of 335 nucleotides and does not contain any putative translation start codons. The second exon is separated from the first exon by 8 kb and contains the Gld translation start codon. The inferred amino acid sequence of the amino terminus contains two unusual features: three tandem repeats of serine-alanine, and a relatively high density of cysteine residues. P element-mediated transformation experiments demonstrated that a 17.5-kb genomic fragment contains the functional and regulatory components of the Gld gene. PMID:3143620
The helicase Ded1p controls use of near-cognate translation initiation codons in 5' UTRs.
Guenther, Ulf-Peter; Weinberg, David E; Zubradt, Meghan M; Tedeschi, Frank A; Stawicki, Brittany N; Zagore, Leah L; Brar, Gloria A; Licatalosi, Donny D; Bartel, David P; Weissman, Jonathan S; Jankowsky, Eckhard
2018-06-27
The conserved and essential DEAD-box RNA helicase Ded1p from yeast and its mammalian orthologue DDX3 are critical for the initiation of translation 1 . Mutations in DDX3 are linked to tumorigenesis 2-4 and intellectual disability 5 , and the enzyme is targeted by a range of viruses 6 . How Ded1p and its orthologues engage RNAs during the initiation of translation is unknown. Here we show, by integrating transcriptome-wide analyses of translation, RNA structure and Ded1p-RNA binding, that the effects of Ded1p on the initiation of translation are connected to near-cognate initiation codons in 5' untranslated regions. Ded1p associates with the translation pre-initiation complex at the mRNA entry channel and repressing the activity of Ded1p leads to the accumulation of RNA structure in 5' untranslated regions, the initiation of translation from near-cognate start codons immediately upstream of these structures and decreased protein synthesis from the corresponding main open reading frames. The data reveal a program for the regulation of translation that links Ded1p, the activation of near-cognate start codons and mRNA structure. This program has a role in meiosis, in which a marked decrease in the levels of Ded1p is accompanied by the activation of the alternative translation initiation sites that are seen when the activity of Ded1p is repressed. Our observations indicate that Ded1p affects translation initiation by controlling the use of near-cognate initiation codons that are proximal to mRNA structure in 5' untranslated regions.
Wang, Weixia; Guo, Qinglan; Xu, Xiaogang; Sheng, Zi-ke; Ye, Xinyu; Wang, Minggui
2014-11-01
Efflux is the most common mechanism of tetracycline resistance. Class A tetracycline efflux pumps, which often have high prevalence in Enterobacteriaceae, are encoded by tet(A) and tet(A)-1 genes. These genes have two potential start codons, GTG and ATG, located upstream of the genes. The purpose of this study was to determine the start codon(s) of the class A tetracycline resistance (tet) determinants tet(A) and tet(A)-1, and the tetracycline resistance level they mediated. Conjugation, transformation and cloning experiments were performed and the genetic environment of tet(A)-1 was analysed. The start codons in class A tet determinants were investigated by site-directed mutagenesis of ATG and GTG, the putative translation initiation codons. High-level tetracycline resistance was transferred from the clinical strain of Klebsiella pneumoniae 10-148 containing tet(A)-1 plasmid pHS27 to Escherichia coli J53 by conjugation. The transformants harbouring recombinant plasmids that carried tet(A) or tet(A)-1 exhibited tetracycline MICs of 256-512 µg ml(-1), with or without tetR(A). Once the ATG was mutated to a non-start codon, the tetracycline MICs were not changed, while the tetracycline MICs decreased from 512 to 64 µg ml(-1) following GTG mutation, and to ≤4 µg ml(-1) following mutation of both GTG and ATG. It was presumed that class A tet determinants had two start codons, which are the primary start codon GTG and secondary start codon ATG. Accordingly, two putative promoters were predicted. In conclusion, class A tet determinants can confer high-level tetracycline resistance and have two start codons. © 2014 The Authors.
Seligmann, Hervé
2018-05-01
Genetic codes mainly evolve by reassigning punctuation codons, starts and stops. Previous analyses assuming that undefined amino acids translate stops showed greater divergence between nuclear and mitochondrial genetic codes. Here, three independent methods converge on which amino acids translated stops at split between nuclear and mitochondrial genetic codes: (a) alignment-free genetic code comparisons inserting different amino acids at stops; (b) alignment-based blast analyses of hypothetical peptides translated from non-coding mitochondrial sequences, inserting different amino acids at stops; (c) biases in amino acid insertions at stops in proteomic data. Hence short-term protein evolution models reconstruct long-term genetic code evolution. Mitochondria reassign stops to amino acids otherwise inserted at stops by codon-anticodon mismatches (near-cognate tRNAs). Hence dual function (translation termination and translation by codon-anticodon mismatch) precedes mitochondrial reassignments of stops to amino acids. Stop ambiguity increases coded information, compensates endocellular mitogenome reduction. Mitochondrial codon reassignments might prevent viral infections. Copyright © 2018 Elsevier B.V. All rights reserved.
HCV IRES domain IIb affects the configuration of coding RNA in the 40S subunit's decoding groove
Filbin, Megan E.; Kieft, Jeffrey S.
2011-01-01
Hepatitis C virus (HCV) uses a structured internal ribosome entry site (IRES) RNA to recruit the translation machinery to the viral RNA and begin protein synthesis without the ribosomal scanning process required for canonical translation initiation. Different IRES structural domains are used in this process, which begins with direct binding of the 40S ribosomal subunit to the IRES RNA and involves specific manipulation of the translational machinery. We have found that upon initial 40S subunit binding, the stem–loop domain of the IRES that contains the start codon unwinds and adopts a stable configuration within the subunit's decoding groove. This configuration depends on the sequence and structure of a different stem–loop domain (domain IIb) located far from the start codon in sequence, but spatially proximal in the IRES•40S complex. Mutation of domain IIb results in misconfiguration of the HCV RNA in the decoding groove that includes changes in the placement of the AUG start codon, and a substantial decrease in the ability of the IRES to initiate translation. Our results show that two distal regions of the IRES are structurally communicating at the initial step of 40S subunit binding and suggest that this is an important step in driving protein synthesis. PMID:21606179
HCV IRES domain IIb affects the configuration of coding RNA in the 40S subunit's decoding groove.
Filbin, Megan E; Kieft, Jeffrey S
2011-07-01
Hepatitis C virus (HCV) uses a structured internal ribosome entry site (IRES) RNA to recruit the translation machinery to the viral RNA and begin protein synthesis without the ribosomal scanning process required for canonical translation initiation. Different IRES structural domains are used in this process, which begins with direct binding of the 40S ribosomal subunit to the IRES RNA and involves specific manipulation of the translational machinery. We have found that upon initial 40S subunit binding, the stem-loop domain of the IRES that contains the start codon unwinds and adopts a stable configuration within the subunit's decoding groove. This configuration depends on the sequence and structure of a different stem-loop domain (domain IIb) located far from the start codon in sequence, but spatially proximal in the IRES•40S complex. Mutation of domain IIb results in misconfiguration of the HCV RNA in the decoding groove that includes changes in the placement of the AUG start codon, and a substantial decrease in the ability of the IRES to initiate translation. Our results show that two distal regions of the IRES are structurally communicating at the initial step of 40S subunit binding and suggest that this is an important step in driving protein synthesis.
Lalaouna, David; Morissette, Audrey; Carrier, Marie-Claude; Massé, Eric
2015-10-01
The 87 nucleotide long DsrA sRNA has been mostly studied for its translational activation of the transcriptional regulator RpoS. However, it also represses hns mRNA, which encodes H-NS, a major regulator that affects expression of nearly 5% of Escherichia coli genes. A speculative model previously suggested that DsrA would block hns mRNA translation by binding simultaneously to start and stop codon regions of hns mRNA (coaxial model). Here, we show that DsrA efficiently blocked translation of hns mRNA by base-pairing immediately downstream of the start codon. In addition, DsrA induced hns mRNA degradation by actively recruiting the RNA degradosome complex. Data presented here led to a model of DsrA action on hns mRNA, which supports a canonical mechanism of sRNA-induced mRNA degradation by binding to the translation initiation region. Furthermore, using MS2-affinity purification coupled with RNA sequencing technology (MAPS), we also demonstrated that DsrA targets rbsD mRNA, involved in ribose utilization. Surprisingly, DsrA base pairs far downstream of rbsD start codon and induces rapid degradation of the transcript. Thus, our study enables us to draw an extended DsrA targetome. © 2015 John Wiley & Sons Ltd.
Parsons, Michael T.; Whiley, Phillip J.; Beesley, Jonathan; Drost, Mark; de Wind, Niels; Thompson, Bryony A.; Marquart, Louise; Hopper, John L.; Jenkins, Mark A.; Brown, Melissa A.; Tucker, Kathy; Warwick, Linda; Buchanan, Daniel D.; Spurdle, Amanda B.
2014-01-01
Variants that disrupt the translation initiation sequences in cancer predisposition genes are generally assumed to be deleterious. However few studies have validated these assumptions with functional and clinical data. Two cancer syndrome gene variants likely to affect native translation initiation were identified by clinical genetic testing: MLH1:c.1A>G p.(Met1?) and BRCA2:c.67+3A>G. In vitro GFP-reporter assays were conducted to assess the consequences of translation initiation disruption on alternative downstream initiation codon usage. Analysis of MLH1:c.1A>G p.(Met1?) showed that translation was mostly initiated at an in-frame position 103 nucleotides downstream, but also at two ATG sequences downstream. The protein product encoded by the in-frame transcript initiating from position c.103 showed loss of in vitro mismatch repair activity comparable to known pathogenic mutations. BRCA2:c.67+3A>G was shown by mRNA analysis to result in an aberrantly spliced transcript deleting exon 2 and the consensus ATG site. In the absence of exon 2, translation initiated mostly at an out-of-frame ATG 323 nucleotides downstream, and to a lesser extent at an in-frame ATG 370 nucleotides downstream. Initiation from any of the downstream alternative sites tested in both genes would lead to loss of protein function, but further clinical data is required to confirm if these variants are associated with a high cancer risk. Importantly, our results highlight the need for caution in interpreting the functional and clinical consequences of variation that leads to disruption of the initiation codon, since translation may not necessarily occur from the first downstream alternative start site, or from a single alternative start site. PMID:24302565
Beyond the Triplet Code: Context Cues Transform Translation.
Brar, Gloria A
2016-12-15
The elucidation of the genetic code remains among the most influential discoveries in biology. While innumerable studies have validated the general universality of the code and its value in predicting and analyzing protein coding sequences, established and emerging work has also suggested that full genome decryption may benefit from a greater consideration of a codon's neighborhood within an mRNA than has been broadly applied. This Review examines the evidence for context cues in translation, with a focus on several recent studies that reveal broad roles for mRNA context in programming translation start sites, the rate of translation elongation, and stop codon identity. Copyright © 2016 Elsevier Inc. All rights reserved.
Au, Hilda H T; Jan, Eric
2012-01-01
The intergenic region internal ribosome entry site (IGR IRES) of the Dicistroviridae family adopts an overlapping triple pseudoknot structure to directly recruit the 80S ribosome in the absence of initiation factors. The pseudoknot I (PKI) domain of the IRES mimics a tRNA-like codon:anticodon interaction in the ribosomal P site to direct translation initiation from a non-AUG initiation codon in the A site. In this study, we have performed a comprehensive mutational analysis of this region to delineate the molecular parameters that drive IRES translation. We demonstrate that IRES-mediated translation can initiate at an alternate adjacent and overlapping start site, provided that basepairing interactions within PKI remain intact. Consistent with this, IGR IRES translation tolerates increases in the variable loop region that connects the anticodon- and codon-like elements within the PKI domain, as IRES activity remains relatively robust up to a 4-nucleotide insertion in this region. Finally, elements from an authentic tRNA anticodon stem-loop can functionally supplant corresponding regions within PKI. These results verify the importance of the codon:anticodon interaction of the PKI domain and further define the specific elements within the tRNA-like domain that contribute to optimal initiator Met-tRNA(i)-independent IRES translation.
Insights into Factorless Translational Initiation by the tRNA-Like Pseudoknot Domain of a Viral IRES
Au, Hilda H. T.; Jan, Eric
2012-01-01
The intergenic region internal ribosome entry site (IGR IRES) of the Dicistroviridae family adopts an overlapping triple pseudoknot structure to directly recruit the 80S ribosome in the absence of initiation factors. The pseudoknot I (PKI) domain of the IRES mimics a tRNA-like codon:anticodon interaction in the ribosomal P site to direct translation initiation from a non-AUG initiation codon in the A site. In this study, we have performed a comprehensive mutational analysis of this region to delineate the molecular parameters that drive IRES translation. We demonstrate that IRES-mediated translation can initiate at an alternate adjacent and overlapping start site, provided that basepairing interactions within PKI remain intact. Consistent with this, IGR IRES translation tolerates increases in the variable loop region that connects the anticodon- and codon-like elements within the PKI domain, as IRES activity remains relatively robust up to a 4-nucleotide insertion in this region. Finally, elements from an authentic tRNA anticodon stem-loop can functionally supplant corresponding regions within PKI. These results verify the importance of the codon:anticodon interaction of the PKI domain and further define the specific elements within the tRNA-like domain that contribute to optimal initiator Met-tRNAi-independent IRES translation. PMID:23236506
Two alternative ways of start site selection in human norovirus reinitiation of translation.
Luttermann, Christine; Meyers, Gregor
2014-04-25
The calicivirus minor capsid protein VP2 is expressed via termination/reinitiation. This process depends on an upstream sequence element denoted termination upstream ribosomal binding site (TURBS). We have shown for feline calicivirus and rabbit hemorrhagic disease virus that the TURBS contains three sequence motifs essential for reinitiation. Motif 1 is conserved among caliciviruses and is complementary to a sequence in the 18 S rRNA leading to the model that hybridization between motif 1 and 18 S rRNA tethers the post-termination ribosome to the mRNA. Motif 2 and motif 2* are proposed to establish a secondary structure positioning the ribosome relative to the start site of the terminal ORF. Here, we analyzed human norovirus (huNV) sequences for the presence and importance of these motifs. The three motifs were identified by sequence analyses in the region upstream of the VP2 start site, and we showed that these motifs are essential for reinitiation of huNV VP2 translation. More detailed analyses revealed that the site of reinitiation is not fixed to a single codon and does not need to be an AUG, even though this codon is clearly preferred. Interestingly, we were able to show that reinitiation can occur at AUG codons downstream of the canonical start/stop site in huNV and feline calicivirus but not in rabbit hemorrhagic disease virus. Although reinitiation at the original start site is independent of the Kozak context, downstream initiation exhibits requirements for start site sequence context known for linear scanning. These analyses on start codon recognition give a more detailed insight into this fascinating mechanism of gene expression.
Nakamura, Masayuki; Sugiura, Masahiro
2007-01-01
Codon usage in chloroplasts is different from that in prokaryotic and eukaryotic nuclear genomes. However, no experimental approach has been made to analyse the translation efficiency of individual codons in chloroplasts. We devised an in vitro assay for translation efficiencies using synthetic mRNAs, and measured the translation efficiencies of five synonymous codon groups in tobacco chloroplasts. Among four alanine codons (GCN, where N is U, C, A or G), GCU was the most efficient for translation, whereas the chloroplast genome lacks tRNA genes corresponding to GCU. Phenylalanine and tyrosine are each encoded by two codons (UUU/C and UAU/C, respectively). Phenylalanine UUC and tyrosine UAC were translated more than twice as efficiently than UUU and UAU, respectively, contrary to their codon usage, whereas translation efficiencies of synonymous codons for alanine, aspartic acid and asparagine were parallel to their codon usage. These observations indicate that translation efficiencies of individual codons are not always correlated with codon usage in vitro in chloroplasts. This raises an important issue for foreign gene expression in chloroplasts.
Antony A, Charles; Alone, Pankaj V
2017-05-13
In eukaryotes, the eIF5 protein plays an important role in translation start site selection by providing the GAP (GTPase activating protein) function. However, in yeast translation initiation fidelity defective eIF5 G31R mutant causes preferential utilization of UUG as initiation codon and is termed as Suppressor of initiation codon (Sui - ) phenotype due to its hyper GTPase activity. The eIF5 G31R mutant dominantly represses GCN4 expression and confers sensitivity to 3-Amino-1,2,4-Trizole (3AT) induced starvation. The down-regulation of the GCN4 expression (Gcn - phenotype) in the eIF5 G31R mutant was not because of leaky scanning defects; rather was due to the utilization of upUUG initiation codons at the 5' regulatory region present between uORF1 and the main GCN4 ORF. Copyright © 2017 Elsevier Inc. All rights reserved.
Differences in codon bias cannot explain differences in translational power among microbes.
Dethlefsen, Les; Schmidt, Thomas M
2005-01-06
Translational power is the cellular rate of protein synthesis normalized to the biomass invested in translational machinery. Published data suggest a previously unrecognized pattern: translational power is higher among rapidly growing microbes, and lower among slowly growing microbes. One factor known to affect translational power is biased use of synonymous codons. The correlation within an organism between expression level and degree of codon bias among genes of Escherichia coli and other bacteria capable of rapid growth is commonly attributed to selection for high translational power. Conversely, the absence of such a correlation in some slowly growing microbes has been interpreted as the absence of selection for translational power. Because codon bias caused by translational selection varies between rapidly growing and slowly growing microbes, we investigated whether observed differences in translational power among microbes could be explained entirely by differences in the degree of codon bias. Although the data are not available to estimate the effect of codon bias in other species, we developed an empirically-based mathematical model to compare the translation rate of E. coli to the translation rate of a hypothetical strain which differs from E. coli only by lacking codon bias. Our reanalysis of data from the scientific literature suggests that translational power can differ by a factor of 5 or more between E. coli and slowly growing microbial species. Using empirical codon-specific in vivo translation rates for 29 codons, and several scenarios for extrapolating from these data to estimates over all codons, we find that codon bias cannot account for more than a doubling of the translation rate in E. coli, even with unrealistic simplifying assumptions that exaggerate the effect of codon bias. With more realistic assumptions, our best estimate is that codon bias accelerates translation in E. coli by no more than 60% in comparison to microbes with very little codon bias. While codon bias confers a substantial benefit of faster translation and hence greater translational power, the magnitude of this effect is insufficient to explain observed differences in translational power among bacterial and archaeal species, particularly the differences between slowly growing and rapidly growing species. Hence, large differences in translational power suggest that the translational apparatus itself differs among microbes in ways that influence translational performance.
Possibilities for the evolution of the genetic code from a preceding form
NASA Technical Reports Server (NTRS)
Jukes, T. H.
1973-01-01
Analysis of the interaction between mRNA codons and tRNA anticodons suggests a model for the evolution of the genetic code. Modification of the nucleic acid following the anticodon is at present essential in both eukaryotes and prokaryotes to ensure fidelity of translation of codons starting with A, and the amino acids which could be coded for before the evolution of the modifying enzymes can be deduced.
Balanced Codon Usage Optimizes Eukaryotic Translational Efficiency
Qian, Wenfeng; Yang, Jian-Rong; Pearson, Nathaniel M.; Maclean, Calum; Zhang, Jianzhi
2012-01-01
Cellular efficiency in protein translation is an important fitness determinant in rapidly growing organisms. It is widely believed that synonymous codons are translated with unequal speeds and that translational efficiency is maximized by the exclusive use of rapidly translated codons. Here we estimate the in vivo translational speeds of all sense codons from the budding yeast Saccharomyces cerevisiae. Surprisingly, preferentially used codons are not translated faster than unpreferred ones. We hypothesize that this phenomenon is a result of codon usage in proportion to cognate tRNA concentrations, the optimal strategy in enhancing translational efficiency under tRNA shortage. Our predicted codon–tRNA balance is indeed observed from all model eukaryotes examined, and its impact on translational efficiency is further validated experimentally. Our study reveals a previously unsuspected mechanism by which unequal codon usage increases translational efficiency, demonstrates widespread natural selection for translational efficiency, and offers new strategies to improve synthetic biology. PMID:22479199
Zhao, Fangzhou; Yu, Chien-Hung; Liu, Yi
2017-08-21
Codon usage biases are found in all eukaryotic and prokaryotic genomes and have been proposed to regulate different aspects of translation process. Codon optimality has been shown to regulate translation elongation speed in fungal systems, but its effect on translation elongation speed in animal systems is not clear. In this study, we used a Drosophila cell-free translation system to directly compare the velocity of mRNA translation elongation. Our results demonstrate that optimal synonymous codons speed up translation elongation while non-optimal codons slow down translation. In addition, codon usage regulates ribosome movement and stalling on mRNA during translation. Finally, we show that codon usage affects protein structure and function in vitro and in Drosophila cells. Together, these results suggest that the effect of codon usage on translation elongation speed is a conserved mechanism from fungi to animals that can affect protein folding in eukaryotic organisms. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
The primary structure of the Saccharomyces cerevisiae gene for 3-phosphoglycerate kinase.
Hitzeman, R A; Hagie, F E; Hayflick, J S; Chen, C Y; Seeburg, P H; Derynck, R
1982-01-01
The DNA sequence of the gene for the yeast glycolytic enzyme, 3-phosphoglycerate kinase (PGK), has been obtained by sequencing part of a 3.1 kbp HindIII fragment obtained from the yeast genome. The structural gene sequence corresponds to a reading frame of 1251 bp coding for 416 amino acids with no intervening DNA sequences. The amino acid sequence is approximately 65 percent homologous with human and horse PGK protein sequences and is in general agreement with the published protein sequence for yeast PGK. As for other highly expressed structural genes in yeast, the coding sequence is highly codon biased with 95 percent of the amino acids coded for by a select 25 codons (out of 61 possible). Besides structural DNA sequence, 291 bp of 5'-flanking sequence and 286 bp of 3'-flanking sequence were determined. Transcription starts 36 nucleotides upstream from the translational start and stops 86-93 nucleotides downstream from the translational stop. These results suggest a non-polyadenylated mRNA length of 1373 to 1380 nucleotides, which is consistent with the observed length of 1500 nucleotides for polyadenylated PGK mRNA. A sequence TATATATAAA is found at 145 nucleotides upstream from the translational start. This sequence resembles the TATAAA box that is possibly associated with RNA polymerase II binding. Images PMID:6296791
Structure of a human cap-dependent 48S translation pre-initiation complex
Eliseev, Boris; Yeramala, Lahari; Leitner, Alexander; Karuppasamy, Manikandan; Raimondeau, Etienne; Huard, Karine; Alkalaeva, Elena; Aebersold, Ruedi
2018-01-01
Abstract Eukaryotic translation initiation is tightly regulated, requiring a set of conserved initiation factors (eIFs). Translation of a capped mRNA depends on the trimeric eIF4F complex and eIF4B to load the mRNA onto the 43S pre-initiation complex comprising 40S and initiation factors 1, 1A, 2, 3 and 5 as well as initiator-tRNA. Binding of the mRNA is followed by mRNA scanning in the 48S pre-initiation complex, until a start codon is recognised. Here, we use a reconstituted system to prepare human 48S complexes assembled on capped mRNA in the presence of eIF4B and eIF4F. The highly purified h-48S complexes are used for cross-linking/mass spectrometry, revealing the protein interaction network in this complex. We report the electron cryo-microscopy structure of the h-48S complex at 6.3 Å resolution. While the majority of eIF4B and eIF4F appear to be flexible with respect to the ribosome, additional density is detected at the entrance of the 40S mRNA channel which we attribute to the RNA-recognition motif of eIF4B. The eight core subunits of eIF3 are bound at the 40S solvent-exposed side, as well as the subunits eIF3d, eIF3b and eIF3i. elF2 and initiator-tRNA bound to the start codon are present at the 40S intersubunit side. This cryo-EM structure represents a molecular snap-shot revealing the h-48S complex following start codon recognition. PMID:29401259
Ribosome stalling and peptidyl-tRNA drop-off during translational delay at AGA codons
Cruz-Vera, Luis Rogelio; Magos-Castro, Marco Antonio; Zamora-Romo, Efraín; Guarneros, Gabriel
2004-01-01
Minigenes encoding the peptide Met–Arg–Arg have been used to study the mechanism of toxicity of AGA codons proximal to the start codon or prior to the termination codon in bacteria. The codon sequences of the ‘mini-ORFs’ employed were initiator, combinations of AGA and CGA, and terminator. Both, AGA and CGA are low-usage Arg codons in ORFs of Escherichia coli but, whilst AGA is translated by the scarce tRNAArg4, CGA is recognized by the abundant tRNAArg2. Overexpression of minigenes harbouring AGA in the third position, next to a termination codon, was deleterious to the cell and led to the accumulation of peptidyl-tRNAArg4 and of the peptidyl-tRNA cognate to the preceding CGA or AGA Arg triplet. The minigenes carrying CGA in the third position were not toxic. Minigene-mediated toxicity and peptidyl-tRNA accumulation were suppressed by overproduction of tRNAArg4 but not by overproduction of peptidyl-tRNA hydrolase, an enzyme that is only active on substrates that have been released from the ribosome. Consistent with these findings, peptidyl-tRNAArg4 was identified to be mainly associated with ribosomes in a stand-by complex. These and previous results support the hypothesis that the primary mechanism of inhibition of protein synthesis by AGA triplets in pth+ cells involves sequestration of tRNAs as peptidyl-tRNA on the stalled ribosome. PMID:15317870
tRNA-mediated codon-biased translation in mycobacterial hypoxic persistence
NASA Astrophysics Data System (ADS)
Chionh, Yok Hian; McBee, Megan; Babu, I. Ramesh; Hia, Fabian; Lin, Wenwei; Zhao, Wei; Cao, Jianshu; Dziergowska, Agnieszka; Malkiewicz, Andrzej; Begley, Thomas J.; Alonso, Sylvie; Dedon, Peter C.
2016-11-01
Microbial pathogens adapt to the stress of infection by regulating transcription, translation and protein modification. We report that changes in gene expression in hypoxia-induced non-replicating persistence in mycobacteria--which models tuberculous granulomas--are partly determined by a mechanism of tRNA reprogramming and codon-biased translation. Mycobacterium bovis BCG responded to each stage of hypoxia and aerobic resuscitation by uniquely reprogramming 40 modified ribonucleosides in tRNA, which correlate with selective translation of mRNAs from families of codon-biased persistence genes. For example, early hypoxia increases wobble cmo5U in tRNAThr(UGU), which parallels translation of transcripts enriched in its cognate codon, ACG, including the DosR master regulator of hypoxic bacteriostasis. Codon re-engineering of dosR exaggerates hypoxia-induced changes in codon-biased DosR translation, with altered dosR expression revealing unanticipated effects on bacterial survival during hypoxia. These results reveal a coordinated system of tRNA modifications and translation of codon-biased transcripts that enhance expression of stress response proteins in mycobacteria.
tRNA-mediated codon-biased translation in mycobacterial hypoxic persistence
Chionh, Yok Hian; McBee, Megan; Babu, I. Ramesh; Hia, Fabian; Lin, Wenwei; Zhao, Wei; Cao, Jianshu; Dziergowska, Agnieszka; Malkiewicz, Andrzej; Begley, Thomas J.; Alonso, Sylvie; Dedon, Peter C.
2016-01-01
Microbial pathogens adapt to the stress of infection by regulating transcription, translation and protein modification. We report that changes in gene expression in hypoxia-induced non-replicating persistence in mycobacteria—which models tuberculous granulomas—are partly determined by a mechanism of tRNA reprogramming and codon-biased translation. Mycobacterium bovis BCG responded to each stage of hypoxia and aerobic resuscitation by uniquely reprogramming 40 modified ribonucleosides in tRNA, which correlate with selective translation of mRNAs from families of codon-biased persistence genes. For example, early hypoxia increases wobble cmo5U in tRNAThr(UGU), which parallels translation of transcripts enriched in its cognate codon, ACG, including the DosR master regulator of hypoxic bacteriostasis. Codon re-engineering of dosR exaggerates hypoxia-induced changes in codon-biased DosR translation, with altered dosR expression revealing unanticipated effects on bacterial survival during hypoxia. These results reveal a coordinated system of tRNA modifications and translation of codon-biased transcripts that enhance expression of stress response proteins in mycobacteria. PMID:27834374
Birikh, K R; Lebedenko, E N; Boni, I V; Berlin, Y A
1995-10-27
Synthetic intronless genes, coding for human interleukin 1 alpha (IL 1 alpha) and interleukin 1 receptor antagonist (IL1ra), have been expressed efficiently in a specially designed prokaryotic vector, pGMCE (a pGEM1 derivative), where the target gene forms the second part of a two-cistron system. The first part of the system is a translation enhancer-containing mini-cistron, whose termination codon overlaps the start codon of the target gene. In the case of the IL1 alpha gene, the high expression level is largely due to the direct efficient translation initiation at the second cistron, whereas with the IL1ra gene in the same system, the proximal translation initiation region (TIR) provides a high level of coupled expression of the target gene. Thus, pGMCE is a potentially versatile vector for direct prokaryotic expression.
How the Sequence of a Gene Specifies Structural Symmetry in Proteins
Shen, Xiaojuan; Huang, Tongcheng; Wang, Guanyu; Li, Guanglin
2015-01-01
Internal symmetry is commonly observed in the majority of fundamental protein folds. Meanwhile, sufficient evidence suggests that nascent polypeptide chains of proteins have the potential to start the co-translational folding process and this process allows mRNA to contain additional information on protein structure. In this paper, we study the relationship between gene sequences and protein structures from the viewpoint of symmetry to explore how gene sequences code for structural symmetry in proteins. We found that, for a set of two-fold symmetric proteins from left-handed beta-helix fold, intragenic symmetry always exists in their corresponding gene sequences. Meanwhile, codon usage bias and local mRNA structure might be involved in modulating translation speed for the formation of structural symmetry: a major decrease of local codon usage bias in the middle of the codon sequence can be identified as a common feature; and major or consecutive decreases in local mRNA folding energy near the boundaries of the symmetric substructures can also be observed. The results suggest that gene duplication and fusion may be an evolutionarily conserved process for this protein fold. In addition, the usage of rare codons and the formation of higher order of secondary structure near the boundaries of symmetric substructures might have coevolved as conserved mechanisms to slow down translation elongation and to facilitate effective folding of symmetric substructures. These findings provide valuable insights into our understanding of the mechanisms of translation and its evolution, as well as the design of proteins via symmetric modules. PMID:26641668
NASA Astrophysics Data System (ADS)
Sharma, Ajeet K.; Ahmed, Nabeel; O'Brien, Edward P.
2018-02-01
Ribosome profiling experiments have found greater than 100-fold variation in ribosome density along mRNA transcripts, indicating that individual codon elongation rates can vary to a similar degree. This wide range of elongation times, coupled with differences in codon usage between transcripts, suggests that the average codon translation-rate per gene can vary widely. Yet, ribosome run-off experiments have found that the average codon translation rate for different groups of transcripts in mouse stem cells is constant at 5.6 AA/s. How these seemingly contradictory results can be reconciled is the focus of this study. Here, we combine knowledge of the molecular factors shown to influence translation speed with genomic information from Escherichia coli, Saccharomyces cerevisiae and Homo sapiens to simulate the synthesis of cytosolic proteins in these organisms. The model recapitulates a near constant average translation rate, which we demonstrate arises because the molecular determinants of translation speed are distributed nearly randomly amongst most of the transcripts. Consequently, codon translation rates are also randomly distributed and fast-translating segments of a transcript are likely to be offset by equally probable slow-translating segments, resulting in similar average elongation rates for most transcripts. We also show that the codon usage bias does not significantly affect the near random distribution of codon translation rates because only about 10 % of the total transcripts in an organism have high codon usage bias while the rest have little to no bias. Analysis of Ribo-Seq data and an in vivo fluorescent assay supports these conclusions.
A Major Controversy in Codon-Anticodon Adaptation Resolved by a New Codon Usage Index
Xia, Xuhua
2015-01-01
Two alternative hypotheses attribute different benefits to codon-anticodon adaptation. The first assumes that protein production is rate limited by both initiation and elongation and that codon-anticodon adaptation would result in higher elongation efficiency and more efficient and accurate protein production, especially for highly expressed genes. The second claims that protein production is rate limited only by initiation efficiency but that improved codon adaptation and, consequently, increased elongation efficiency have the benefit of increasing ribosomal availability for global translation. To test these hypotheses, a recent study engineered a synthetic library of 154 genes, all encoding the same protein but differing in degrees of codon adaptation, to quantify the effect of differential codon adaptation on protein production in Escherichia coli. The surprising conclusion that “codon bias did not correlate with gene expression” and that “translation initiation, not elongation, is rate-limiting for gene expression” contradicts the conclusion reached by many other empirical studies. In this paper, I resolve the contradiction by reanalyzing the data from the 154 sequences. I demonstrate that translation elongation accounts for about 17% of total variation in protein production and that the previous conclusion is due to the use of a codon adaptation index (CAI) that does not account for the mutation bias in characterizing codon adaptation. The effect of translation elongation becomes undetectable only when translation initiation is unrealistically slow. A new index of translation elongation ITE is formulated to facilitate studies on the efficiency and evolution of the translation machinery. PMID:25480780
Problem-Based Test: An "In Vitro" Experiment to Analyze the Genetic Code
ERIC Educational Resources Information Center
Szeberenyi, Jozsef
2010-01-01
Terms to be familiar with before you start to solve the test: genetic code, translation, synthetic polynucleotide, leucine, serine, filter precipitation, radioactivity measurement, template, mRNA, tRNA, rRNA, aminoacyl-tRNA synthesis, ribosomes, degeneration of the code, wobble, initiation, and elongation of protein synthesis, initiation codon.…
Llácer, Jose L.; Hussain, Tanweer; Marler, Laura; Aitken, Colin Echeverría; Thakur, Anil; Lorsch, Jon R.; Hinnebusch, Alan G.; Ramakrishnan, V.
2015-01-01
Summary Translation initiation in eukaryotes begins with the formation of a pre-initiation complex (PIC) containing the 40S ribosomal subunit, eIF1, eIF1A, eIF3, ternary complex (eIF2-GTP-Met-tRNAi), and eIF5. The PIC, in an open conformation, attaches to the 5′ end of the mRNA and scans to locate the start codon, whereupon it closes to arrest scanning. We present single particle cryo-electron microscopy (cryo-EM) reconstructions of 48S PICs from yeast in these open and closed states, at 6.0 Å and 4.9 Å, respectively. These reconstructions show eIF2β as well as a configuration of eIF3 that appears to encircle the 40S, occupying part of the subunit interface. Comparison of the complexes reveals a large conformational change in the 40S head from an open mRNA latch conformation to a closed one that constricts the mRNA entry channel and narrows the P site to enclose tRNAi, thus elucidating key events in start codon recognition. PMID:26212456
Hwang, Shin-Rong; Garza, Christina Z; Wegrzyn, Jill; Hook, Vivian Y H
2004-08-16
This study demonstrates utilization of the novel GTG initiation codon for translation of a human mRNA transcript that encodes the serpin endopin 2B, a protease inhibitor. Molecular cloning revealed the nucleotide sequence of the human endopin 2B cDNA. Its deduced primary sequence shows high homology to bovine endopin 2A that possesses cross-class protease inhibition of elastase and papain. Notably, the human endopin 2B cDNA sequence revealed GTG as the predicted translation initiation codon; the predicted translation product of 46 kDa endopin 2B was produced by in vitro translation of 35S-endopin 2B with mammalian (rabbit) protein translation components. Importantly, bioinformatic studies demonstrated the presence of the entire human endopin 2B cDNA sequence with GTG as initiation codon within the human genome on chromosome 14. Further evidence for GTG as a functional initiation codon was illustrated by GTG-mediated in vitro translation of the heterologous protein EGFP, and by GTG-mediated expression of EGFP in mammalian PC12 cells. Mutagenesis of GTG to GTC resulted in the absence of EGFP expression in PC12 cells, indicating the function of GTG as an initiation codon. In addition, it was apparent that the GTG initiation codon produces lower levels of translated protein compared to ATG as initiation codon. Significantly, GTG-mediated translation of endopin 2B demonstrates a functional human gene product not previously predicted from initial analyses of the human genome. Further analyses based on GTG as an alternative initiation codon may predict new candidate genes of the human genome.
Accetto, Tomaž; Avguštin, Gorazd
2011-01-01
The Shine-Dalgarno (SD) sequence is a key element directing the translation to initiate at the authentic start codons and also enabling translation initiation to proceed in 5′ untranslated mRNA regions (5′-UTRs) containing moderately strong secondary structures. Bioinformatic analysis of almost forty genomes from the major bacterial phylum Bacteroidetes revealed, however, a general absence of SD sequence, drop in GC content and consequently reduced tendency to form secondary structures in 5′-UTRs. The experiments using the Prevotella bryantii TC1-1 expression system were in agreement with these findings: neither addition nor omission of SD sequence in the unstructured 5′-UTR affected the level of the reporter protein, non-specific nuclease NucB. Further, NucB level in P. bryantii TC1-1, contrary to hMGFP level in Escherichia coli, was five times lower when SD sequence formed part of the secondary structure with a folding energy -5,2 kcal/mol. Also, the extended SD sequences did not affect protein levels as in E. coli. It seems therefore that a functional SD interaction does not take place during the translation initiation in P. bryanttii TC1-1 and possibly other members of phylum Bacteroidetes although the anti SD sequence is present in 16S rRNA genes of their genomes. We thus propose that in the absence of the SD sequence interaction, the selection of genuine start codons in Bacteroidetes is accomplished by binding of ribosomal protein S1 to unstructured 5′-UTR as opposed to coding region which is inaccessible due to mRNA secondary structure. Additionally, we found that sequence logos of region preceding the start codons may be used as taxonomical markers. Depending on whether complete sequence logo or only part of it, such as information content and base proportion at specific positions, is used, bacterial genera or families and in some cases even bacterial phyla can be distinguished. PMID:21857964
Wald, Naama; Alroy, Maya; Botzman, Maya; Margalit, Hanah
2012-01-01
Synonymous codons are unevenly distributed among genes, a phenomenon termed codon usage bias. Understanding the patterns of codon bias and the forces shaping them is a major step towards elucidating the adaptive advantage codon choice can confer at the level of individual genes and organisms. Here, we perform a large-scale analysis to assess codon usage bias pattern of pyrimidine-ending codons in highly expressed genes in prokaryotes. We find a bias pattern linked to the degeneracy of the encoded amino acid. Specifically, we show that codon-pairs that encode two- and three-fold degenerate amino acids are biased towards the C-ending codon while codons encoding four-fold degenerate amino acids are biased towards the U-ending codon. This codon usage pattern is widespread in prokaryotes, and its strength is correlated with translational selection both within and between organisms. We show that this bias is associated with an improved correspondence with the tRNA pool, avoidance of mis-incorporation errors during translation and moderate stability of codon–anticodon interaction, all consistent with more efficient translation. PMID:22581775
Global analysis of translation termination in E. coli.
Baggett, Natalie E; Zhang, Yan; Gross, Carol A
2017-03-01
Terminating protein translation accurately and efficiently is critical for both protein fidelity and ribosome recycling for continued translation. The three bacterial release factors (RFs) play key roles: RF1 and 2 recognize stop codons and terminate translation; and RF3 promotes disassociation of bound release factors. Probing release factors mutations with reporter constructs containing programmed frameshifting sequences or premature stop codons had revealed a propensity for readthrough or frameshifting at these specific sites, but their effects on translation genome-wide have not been examined. We performed ribosome profiling on a set of isogenic strains with well-characterized release factor mutations to determine how they alter translation globally. Consistent with their known defects, strains with increasingly severe release factor defects exhibit increasingly severe accumulation of ribosomes over stop codons, indicative of an increased duration of the termination/release phase of translation. Release factor mutant strains also exhibit increased occupancy in the region following the stop codon at a significant number of genes. Our global analysis revealed that, as expected, translation termination is generally efficient and accurate, but that at a significant number of genes (≥ 50) the ribosome signature after the stop codon is suggestive of translation past the stop codon. Even native E. coli K-12 exhibits the ribosome signature suggestive of protein extension, especially at UGA codons, which rely exclusively on the reduced function RF2 variant of the K-12 strain for termination. Deletion of RF3 increases the severity of the defect. We unambiguously demonstrate readthrough and frameshifting protein extensions and their further accumulation in mutant strains for a few select cases. In addition to enhancing recoding, ribosome accumulation over stop codons disrupts attenuation control of biosynthetic operons, and may alter expression of some overlapping genes. Together, these functional alterations may either augment the protein repertoire or produce deleterious proteins.
Levin-Karp, Ayelet; Barenholz, Uri; Bareia, Tasneem; Dayagi, Michal; Zelcbuch, Lior; Antonovsky, Niv; Noor, Elad; Milo, Ron
2013-06-21
Translational coupling is the interdependence of translation efficiency of neighboring genes encoded within an operon. The degree of coupling may be quantified by measuring how the translation rate of a gene is modulated by the translation rate of its upstream gene. Translational coupling was observed in prokaryotic operons several decades ago, but the quantitative range of modulation translational coupling leads to and the factors governing this modulation were only partially characterized. In this study, we systematically quantify and characterize translational coupling in E. coli synthetic operons using a library of plasmids carrying fluorescent reporter genes that are controlled by a set of different ribosome binding site (RBS) sequences. The downstream gene expression level is found to be enhanced by the upstream gene expression via translational coupling with the enhancement level varying from almost no coupling to over 10-fold depending on the upstream gene's sequence. Additionally, we find that the level of translational coupling in our system is similar between the second and third locations in the operon. The coupling depends on the distance between the stop codon of the upstream gene and the start codon of the downstream gene. This study is the first to systematically and quantitatively characterize translational coupling in a synthetic E. coli operon. Our analysis will be useful in accurate manipulation of gene expression in synthetic biology and serves as a step toward understanding the mechanisms involved in translational expression modulation.
Nissley, Daniel A.; Sharma, Ajeet K.; Ahmed, Nabeel; Friedrich, Ulrike A.; Kramer, Günter; Bukau, Bernd; O'Brien, Edward P.
2016-01-01
The rates at which domains fold and codons are translated are important factors in determining whether a nascent protein will co-translationally fold and function or misfold and malfunction. Here we develop a chemical kinetic model that calculates a protein domain's co-translational folding curve during synthesis using only the domain's bulk folding and unfolding rates and codon translation rates. We show that this model accurately predicts the course of co-translational folding measured in vivo for four different protein molecules. We then make predictions for a number of different proteins in yeast and find that synonymous codon substitutions, which change translation-elongation rates, can switch some protein domains from folding post-translationally to folding co-translationally—a result consistent with previous experimental studies. Our approach explains essential features of co-translational folding curves and predicts how varying the translation rate at different codon positions along a transcript's coding sequence affects this self-assembly process. PMID:26887592
Leaderless mRNAs are circularized in Chlamydomonas reinhardtii mitochondria.
Cahoon, A Bruce; Qureshi, Ali A
2018-06-01
The mitochondrial genome of Chlamydomonas reinhardtii encodes eight protein coding genes transcribed on two polycistronic primary transcripts. The mRNAs are endonucleolytically cleaved from these transcripts directly upstream of their AUG start codons, creating leaderless mRNAs with 3' untranslated regions (UTR) comprised of most or all of their downstream intergenic regions. In this report, we provide evidence that these processed linear mRNAs are circularized, which places the 3' UTR upstream of the 5' start codon, creating a leader sequence ex post facto. The circular mRNAs were found to be ribosome associate by polysome profiling experiments suggesting they are translated. Sequencing of the 3'-5' junctions of the circularized mRNAs found the intra-molecular ligations occurred between fully processed 5' ends (the start AUG) and a variable 3' terminus. For five genes (cob, cox, nd2, nd4, and nd6), some of the 3' ends maintained an oligonucleotide addition during ligation, and for two of them, cob and nd6, these 3' termini were the most commonly recovered sequence. Previous reports have shown that after cleavage, three untemplated oligonucleotide additions may occur on the 3' termini of these mRNAs-adenylation, uridylylation, or cytidylation. These results suggest oligo(U) and oligo(C) additions may be part of the maturation process since they are maintained in the circular mRNAs. Circular RNAs occur in organisms across the biological spectrum, but their purpose in some systems, such as organelles (mitochondria and chloroplasts) is unclear. We hypothesize, that in C. reinhardtii mitochondria it may create a leader sequence to facilitate translation initiation, which may negate the need for an alternative translation initiation mechanism in this system, as previously speculated. In addition, circularization may play a protective role against exonucleases, and/or increase translational productivity.
Zhang, Xin; Huang, Danping; Jia, Xiwei; Zou, Zhihua; Wang, Yilei; Zhang, Ziping
2018-04-01
In this study, the 5'-flanking region of molt-inhibiting hormone (MIH) gene was cloned by Tail-PCR. It is 2024 bp starting from the translation initiation site, and 1818 bp starting from the predicted transcription start site. Forecast analysis results by the bioinformatics software showed that the transcription start site is located at 207 bp upstream of the start codon ATG, and TATA box is located at 240 bp upstream of the start codon ATG. Potential transcription factor binding sites include Sp1, NF-1, Oct-1, Sox-2, RAP1, and so on. There are two CpG islands, located at -25- +183 bp and -1451- -1316 bp respectively. The transfection results of luciferase reporter constructs showed that the core promoter region was located in the fragment -308 bp to -26 bp. NF-kappaB and RAP1 were essential for mih basal transcriptional activity. There are three kinds of polymorphism CA in the 5'-flanking sequence, and they can influence mih promoter activity. These findings provide a genetic foundation of the further research of mih transcription regulation. Copyright © 2017 Elsevier Inc. All rights reserved.
Efficient initiation of mammalian mRNA translation at a CUG codon.
Dasso, M C; Jackson, R J
1989-01-01
Nucleotide substitutions were made at the initiation codon of an influenza virus NS cDNA clone in a vector carrying the bacteriophage T7 promoter. When capped mRNA transcripts of these constructs were translated in the rabbit reticulocyte lysate, a change in the initiation codon from...AUAAUGG...to...AUACUGG...reduced the in vitro translational efficiency by only 50-60%, and resulted in only a small increase in the yield of short products presumed to be initiated at downstream sites. Synthesis of the full-length product was initiated exclusively at the mutated codon, with negligible use either of in-frame upstream CUG or GUG codons, or of an in-frame downstream GUG codon. We conclude that CUG has the potential to function as an efficient initiation codon in mammalian systems, at least in certain contexts. Images PMID:2780285
Demonstration of GTG as an alternative initiation codon for the serpin endopin 2B-2.
Hwang, Shin-Rong; Garza, Christina Z; Wegrzyn, Jill L; Hook, Vivian Y H
2005-02-18
This study demonstrates GTG as a novel, alternative initiation codon for translation of bovine endopin 2B-2, a serpin protease inhibitor. Molecular cDNA cloning revealed the endopin 2B-1 and endopin 2B-2 isoforms that are predicted to inhibit papain and elastase. Notably, GTG was demonstrated as the initiation codon for endopin 2B-2, whereas endopin 2B-1 possesses ATG as its initiation codon. GTG mediated in vitro translation of 46kDa endopin 2B-2. GTG also mediated translation of EGFP by in vitro translation and by expression in mammalian cells. Notably, mutagenesis of GTG to GTC resulted in the absence of EGFP expression in cells. GTG produced a lower level of protein expression compared to ATG. The use of GTG as an initiation codon to direct translation of endopin 2B, as well as the heterologous protein EGFP, demonstrates the role of GTG in the regulation of mRNA translation in mammalian cells. Significantly, further analyses of mammalian genomes based on GTG as an alternative initiation codon may predict new candidate gene products expressed by mammalian and human genomes.
Williams, N P; Mueller, P P; Hinnebusch, A G
1988-01-01
Translational control of GCN4 expression in the yeast Saccharomyces cerevisiae is mediated by multiple AUG codons present in the leader of GCN4 mRNA, each of which initiates a short open reading frame of only two or three codons. Upstream AUG codons 3 and 4 are required to repress GCN4 expression in normal growth conditions; AUG codons 1 and 2 are needed to overcome this repression in amino acid starvation conditions. We show that the regulatory function of AUG codons 1 and 2 can be qualitatively mimicked by the AUG codons of two heterologous upstream open reading frames (URFs) containing the initiation regions of the yeast genes PGK and TRP1. These AUG codons inhibit GCN4 expression when present singly in the mRNA leader; however, they stimulate GCN4 expression in derepressing conditions when inserted upstream from AUG codons 3 and 4. This finding supports the idea that AUG codons 1 and 2 function in the control mechanism as translation initiation sites and further suggests that suppression of the inhibitory effects of AUG codons 3 and 4 is a general consequence of the translation of URF 1 and 2 sequences upstream. Several observations suggest that AUG codons 3 and 4 are efficient initiation sites; however, these sequences do not act as positive regulatory elements when placed upstream from URF 1. This result suggests that efficient translation is only one of the important properties of the 5' proximal URFs in GCN4 mRNA. We propose that a second property is the ability to permit reinitiation following termination of translation and that URF 1 is optimized for this regulatory function. Images PMID:3065626
Global analysis of translation termination in E. coli
Baggett, Natalie E.
2017-01-01
Terminating protein translation accurately and efficiently is critical for both protein fidelity and ribosome recycling for continued translation. The three bacterial release factors (RFs) play key roles: RF1 and 2 recognize stop codons and terminate translation; and RF3 promotes disassociation of bound release factors. Probing release factors mutations with reporter constructs containing programmed frameshifting sequences or premature stop codons had revealed a propensity for readthrough or frameshifting at these specific sites, but their effects on translation genome-wide have not been examined. We performed ribosome profiling on a set of isogenic strains with well-characterized release factor mutations to determine how they alter translation globally. Consistent with their known defects, strains with increasingly severe release factor defects exhibit increasingly severe accumulation of ribosomes over stop codons, indicative of an increased duration of the termination/release phase of translation. Release factor mutant strains also exhibit increased occupancy in the region following the stop codon at a significant number of genes. Our global analysis revealed that, as expected, translation termination is generally efficient and accurate, but that at a significant number of genes (≥ 50) the ribosome signature after the stop codon is suggestive of translation past the stop codon. Even native E. coli K-12 exhibits the ribosome signature suggestive of protein extension, especially at UGA codons, which rely exclusively on the reduced function RF2 variant of the K-12 strain for termination. Deletion of RF3 increases the severity of the defect. We unambiguously demonstrate readthrough and frameshifting protein extensions and their further accumulation in mutant strains for a few select cases. In addition to enhancing recoding, ribosome accumulation over stop codons disrupts attenuation control of biosynthetic operons, and may alter expression of some overlapping genes. Together, these functional alterations may either augment the protein repertoire or produce deleterious proteins. PMID:28301469
Kim, Younghyun; Lee, Goeun; Jeon, Eunhyun; Sohn, Eun ju; Lee, Yongjik; Kang, Hyangju; Lee, Dong wook; Kim, Dae Heon; Hwang, Inhwan
2014-01-01
The nucleotide sequence around the translational initiation site is an important cis-acting element for post-transcriptional regulation. However, it has not been fully understood how the sequence context at the 5′-untranslated region (5′-UTR) affects the translational efficiency of individual mRNAs. In this study, we provide evidence that the 5′-UTRs of Arabidopsis genes showing a great difference in the nucleotide sequence vary greatly in translational efficiency with more than a 200-fold difference. Of the four types of nucleotides, the A residue was the most favourable nucleotide from positions −1 to −21 of the 5′-UTRs in Arabidopsis genes. In particular, the A residue in the 5′-UTR from positions −1 to −5 was required for a high-level translational efficiency. In contrast, the T residue in the 5′-UTR from positions −1 to −5 was the least favourable nucleotide in translational efficiency. Furthermore, the effect of the sequence context in the −1 to −21 region of the 5′-UTR was conserved in different plant species. Based on these observations, we propose that the sequence context immediately upstream of the AUG initiation codon plays a crucial role in determining the translational efficiency of plant genes. PMID:24084084
β-Glucuronidase as a Sensitive and Versatile Reporter in Actinomycetes ▿
Myronovskyi, Maksym; Welle, Elisabeth; Fedorenko, Viktor; Luzhetskyy, Andriy
2011-01-01
Here we describe a versatile and sensitive reporter system for actinomycetes that is based on gusA, which encodes the β-glucuronidase enzyme. A series of gusA-containing transcriptional and translational fusion vectors were constructed and utilized to study the regulatory cascade of the phenalinolactone biosynthetic gene cluster. Furthermore, these vectors were used to study the efficiency of translation initiation at the ATG, GTG, TTG, and CTG start codons. Surprisingly, constructs using a TTG start codon showed the best activity, whereas those using ATG or GTG were approximately one-half or one-third as active, respectively. The CTG fusion showed only 5% of the activity of the TTG fusion. A suicide vector, pKGLP2, carrying gusA in its backbone was used to visually detect merodiploid formation and resolution, making gene targeting in actinomycetes much faster and easier. Three regulatory genes, plaR1, plaR2, and plaR3, involved in phenalinolactone biosynthesis were efficiently replaced with an apramycin resistance marker using this system. Finally, we expanded the genetic code of actinomycetes by introducing the nonproteinogenic amino acid N-epsilon-cyclopentyloxycarbonyl-l-lysine with the GusA protein as a reporter. PMID:21685164
Richter, Lubna V.; Sandler, Steven J.
2012-01-01
Type IV pili of Geobacter sulfurreducens are composed of PilA monomers and are essential for long-range extracellular electron transfer to insoluble Fe(III) oxides and graphite anodes. A previous analysis of pilA expression indicated that transcription was initiated at two positions, with two predicted ribosome-binding sites and translation start codons, potentially producing two PilA preprotein isoforms. The present study supports the existence of two functional translation start codons for pilA and identifies two isoforms (short and long) of the PilA preprotein. The short PilA isoform is found predominantly in an intracellular fraction. It seems to stabilize the long isoform and to influence the secretion of several outer-surface c-type cytochromes. The long PilA isoform is required for secretion of PilA to the outer cell surface, a process that requires coexpression of pilA with nine downstream genes. The long isoform was determined to be essential for biofilm formation on certain surfaces, for optimum current production in microbial fuel cells, and for growth on insoluble Fe(III) oxides. PMID:22408162
Genome-wide analysis of codon usage bias in four sequenced cotton species.
Wang, Liyuan; Xing, Huixian; Yuan, Yanchao; Wang, Xianlin; Saeed, Muhammad; Tao, Jincai; Feng, Wei; Zhang, Guihua; Song, Xianliang; Sun, Xuezhen
2018-01-01
Codon usage bias (CUB) is an important evolutionary feature in a genome which provides important information for studying organism evolution, gene function and exogenous gene expression. The CUB and its shaping factors in the nuclear genomes of four sequenced cotton species, G. arboreum (A2), G. raimondii (D5), G. hirsutum (AD1) and G. barbadense (AD2) were analyzed in the present study. The effective number of codons (ENC) analysis showed the CUB was weak in these four species and the four subgenomes of the two tetraploids. Codon composition analysis revealed these four species preferred to use pyrimidine-rich codons more frequently than purine-rich codons. Correlation analysis indicated that the base content at the third position of codons affect the degree of codon preference. PR2-bias plot and ENC-plot analyses revealed that the CUB patterns in these genomes and subgenomes were influenced by combined effects of translational selection, directional mutation and other factors. The translational selection (P2) analysis results, together with the non-significant correlation between GC12 and GC3, further revealed that translational selection played the dominant role over mutation pressure in the codon usage bias. Through relative synonymous codon usage (RSCU) analysis, we detected 25 high frequency codons preferred to end with T or A, and 31 low frequency codons inclined to end with C or G in these four species and four subgenomes. Finally, 19 to 26 optimal codons with 19 common ones were determined for each species and subgenomes, which preferred to end with A or T. We concluded that the codon usage bias was weak and the translation selection was the main shaping factor in nuclear genes of these four cotton genomes and four subgenomes.
Neymotin, Benjamin; Ettorre, Victoria; Gresham, David
2016-01-01
Degradation of mRNA contributes to variation in transcript abundance. Studies of individual mRNAs have shown that both cis and trans factors affect mRNA degradation rates. However, the factors underlying transcriptome-wide variation in mRNA degradation rates are poorly understood. We investigated the contribution of different transcript properties to transcriptome-wide degradation rate variation in the budding yeast, Saccharomyces cerevisiae, using multiple regression analysis. We find that multiple transcript properties are significantly associated with variation in mRNA degradation rates, and that a model incorporating these properties explains ∼50% of the genome-wide variance. Predictors of mRNA degradation rates include transcript length, ribosome density, biased codon usage, and GC content of the third position in codons. To experimentally validate these factors, we studied individual transcripts expressed from identical promoters. We find that decreasing ribosome density by mutating the first translational start site of a transcript increases its degradation rate. Using coding sequence variants of green fluorescent protein (GFP) that differ only at synonymous sites, we show that increased GC content of the third position of codons results in decreased rates of mRNA degradation. Thus, in steady-state conditions, a large fraction of genome-wide variation in mRNA degradation rates is determined by inherent properties of transcripts, many of which are related to translation, rather than specific regulatory mechanisms. PMID:27633789
Musto, H; Romero, H; Zavala, A; Jabbari, K; Bernardi, G
1999-07-01
We have analyzed the patterns of synonymous codon preferences of the nuclear genes of Plasmodium falciparum, a unicellular parasite characterized by an extremely GC-poor genome. When all genes are considered, codon usage is strongly biased toward A and T in third codon positions, as expected, but multivariate statistical analysis detects a major trend among genes. At one end genes display codon choices determined mainly by the extreme genome composition of this parasite, and very probably their expression level is low. At the other end a few genes exhibit an increased relative usage of a particular subset of codons, many of which are C-ending. Since the majority of these few genes is putatively highly expressed, we postulate that the increased C-ending codons are translationally optimal. In conclusion, while codon usage of the majority of P. falciparum genes is determined mainly by compositional constraints, a small number of genes exhibit translational selection.
Chen, Augustine; Kao, Y. F.; Brown, Chris M.
2005-01-01
The human hepatitis B virus (HBV) has a compact genome encoding four major overlapping coding regions: the core, polymerase, surface and X. The polymerase initiation codon is preceded by the partially overlapping core and four or more upstream initiation codons. There is evidence that several mechanisms are used to enable the synthesis of the polymerase protein, including leaky scanning and ribosome reinitiation. We have examined the first AUG in the pregenomic RNA, it precedes that of the core. It initiates an uncharacterized short upstream open reading frame (uORF), highly conserved in all HBV subtypes, we designated the C0 ORF. This arrangement suggested that expression of the core and polymerase may be affected by this uORF. Initiation at the C0 ORF was confirmed in reporter constructs in transfected cells. The C0 ORF had an inhibitory role in downstream expression from the core initiation site in HepG2 cells and in vitro, but also stimulated reinitiation at the polymerase start when in an optimal context. Our results indicate that the C0 ORF is a determinant in balancing the synthesis of the core and polymerase proteins. PMID:15731337
Butler, J S; Springer, M; Grunberg-Manago, M
1987-01-01
We previously showed that Escherichia coli translation initiation factor IF3 regulates the expression of its own gene infC at the translational level in vivo. Here we create two alterations in the infC gene and test their effects on translational autocontrol of infC expression in vivo by measuring beta-galactosidase activity expressed from infC-lacZ gene fusions under conditions of up to 4-fold derepression or 3-fold repression of infC expression. Replacement of the infC promoter with the trp promoter deletes 120 nucleotides of the infC mRNA 5' to the translation initiation site without affecting autogenous translational control. Mutation of the unusual AUU initiator codon of infC to the more common AUG initiator codon abolishes translation initiation factor IF3-dependent repression and derepression of infC expression in vivo. These results establish the AUU initiator codon of infC as an essential cis-acting element in autogenous translational control of translation initiation factor IF3 expression in vivo. PMID:2954162
Butler, J S; Springer, M; Grunberg-Manago, M
1987-06-01
We previously showed that Escherichia coli translation initiation factor IF3 regulates the expression of its own gene infC at the translational level in vivo. Here we create two alterations in the infC gene and test their effects on translational autocontrol of infC expression in vivo by measuring beta-galactosidase activity expressed from infC-lacZ gene fusions under conditions of up to 4-fold derepression or 3-fold repression of infC expression. Replacement of the infC promoter with the trp promoter deletes 120 nucleotides of the infC mRNA 5' to the translation initiation site without affecting autogenous translational control. Mutation of the unusual AUU initiator codon of infC to the more common AUG initiator codon abolishes translation initiation factor IF3-dependent repression and derepression of infC expression in vivo. These results establish the AUU initiator codon of infC as an essential cis-acting element in autogenous translational control of translation initiation factor IF3 expression in vivo.
Complex codon usage pattern and compositional features of retroviruses.
RoyChoudhury, Sourav; Mukherjee, Debaprasad
2013-01-01
Retroviruses infect a wide range of organisms including humans. Among them, HIV-1, which causes AIDS, has now become a major threat for world health. Some of these viruses are also potential gene transfer vectors. In this study, the patterns of synonymous codon usage in retroviruses have been studied through multivariate statistical methods on ORFs sequences from the available 56 retroviruses. The principal determinant for evolution of the codon usage pattern in retroviruses seemed to be the compositional constraints, while selection for translation of the viral genes plays a secondary role. This was further supported by multivariate analysis on relative synonymous codon usage. Thus, it seems that mutational bias might have dominated role over translational selection in shaping the codon usage of retroviruses. Codon adaptation index was used to identify translationally optimal codons among genes from retroviruses. The comparative analysis of the preferred and optimal codons among different retroviral groups revealed that four codons GAA, AAA, AGA, and GGA were significantly more frequent in most of the retroviral genes inspite of some differences. Cluster analysis also revealed that phylogenetically related groups of retroviruses have probably evolved their codon usage in a concerted manner under the influence of their nucleotide composition.
Codon Optimization to Enhance Expression Yields Insights into Chloroplast Translation1[OPEN
Chan, Hui-Ting; Williams-Carrier, Rosalind; Barkan, Alice
2016-01-01
Codon optimization based on psbA genes from 133 plant species eliminated 105 (human clotting factor VIII heavy chain [FVIII HC]) and 59 (polio VIRAL CAPSID PROTEIN1 [VP1]) rare codons; replacement with only the most highly preferred codons decreased transgene expression (77- to 111-fold) when compared with the codon usage hierarchy of the psbA genes. Targeted proteomic quantification by parallel reaction monitoring analysis showed 4.9- to 7.1-fold or 22.5- to 28.1-fold increase in FVIII or VP1 codon-optimized genes when normalized with stable isotope-labeled standard peptides (or housekeeping protein peptides), but quantitation using western blots showed 6.3- to 8-fold or 91- to 125-fold increase of transgene expression from the same batch of materials, due to limitations in quantitative protein transfer, denaturation, solubility, or stability. Parallel reaction monitoring, to our knowledge validated here for the first time for in planta quantitation of biopharmaceuticals, is especially useful for insoluble or multimeric proteins required for oral drug delivery. Northern blots confirmed that the increase of codon-optimized protein synthesis is at the translational level rather than any impact on transcript abundance. Ribosome footprints did not increase proportionately with VP1 translation or even decreased after FVIII codon optimization but is useful in diagnosing additional rate-limiting steps. A major ribosome pause at CTC leucine codons in the native gene of FVIII HC was eliminated upon codon optimization. Ribosome stalls observed at clusters of serine codons in the codon-optimized VP1 gene provide an opportunity for further optimization. In addition to increasing our understanding of chloroplast translation, these new tools should help to advance this concept toward human clinical studies. PMID:27465114
Ribosomes slide on lysine-encoding homopolymeric A stretches
Koutmou, Kristin S; Schuller, Anthony P; Brunelle, Julie L; Radhakrishnan, Aditya; Djuranovic, Sergej; Green, Rachel
2015-01-01
Protein output from synonymous codons is thought to be equivalent if appropriate tRNAs are sufficiently abundant. Here we show that mRNAs encoding iterated lysine codons, AAA or AAG, differentially impact protein synthesis: insertion of iterated AAA codons into an ORF diminishes protein expression more than insertion of synonymous AAG codons. Kinetic studies in E. coli reveal that differential protein production results from pausing on consecutive AAA-lysines followed by ribosome sliding on homopolymeric A sequence. Translation in a cell-free expression system demonstrates that diminished output from AAA-codon-containing reporters results from premature translation termination on out of frame stop codons following ribosome sliding. In eukaryotes, these premature termination events target the mRNAs for Nonsense-Mediated-Decay (NMD). The finding that ribosomes slide on homopolymeric A sequences explains bioinformatic analyses indicating that consecutive AAA codons are under-represented in gene-coding sequences. Ribosome ‘sliding’ represents an unexpected type of ribosome movement possible during translation. DOI: http://dx.doi.org/10.7554/eLife.05534.001 PMID:25695637
Yamada, Yuko; Matsugi, Jitsuhiro; Ishikura, Hisayuki
2003-04-15
The tRNA1Ser (anticodon VGA, V=uridin-5-oxyacetic acid) is essential for translation of the UCA codon in Escherichia coli. Here, we studied the translational abilities of serine tRNA derivatives, which have different bases from wild type at the first positions of their anticodons, using synthetic mRNAs containing the UCN (N=A, G, C, or U) codon. The tRNA1Ser(G34) having the anticodon GGA was able to read not only UCC and UCU codons but also UCA and UCG codons. This means that the formation of G-A or G-G pair allowed at the wobble position and these base pairs are noncanonical. The translational efficiency of the tRNA1Ser(G34) for UCA or UCG codon depends on the 2'-O-methylation of the C32 (Cm). The 2'-O-methylation of C32 may give rise to the space necessary for G-A or G-G base pair formation between the first position of anticodon and the third position of codon.
Evolutionary conservation of codon optimality reveals hidden signatures of cotranslational folding.
Pechmann, Sebastian; Frydman, Judith
2013-02-01
The choice of codons can influence local translation kinetics during protein synthesis. Whether codon preference is linked to cotranslational regulation of polypeptide folding remains unclear. Here, we derive a revised translational efficiency scale that incorporates the competition between tRNA supply and demand. Applying this scale to ten closely related yeast species, we uncover the evolutionary conservation of codon optimality in eukaryotes. This analysis reveals universal patterns of conserved optimal and nonoptimal codons, often in clusters, which associate with the secondary structure of the translated polypeptides independent of the levels of expression. Our analysis suggests an evolved function for codon optimality in regulating the rhythm of elongation to facilitate cotranslational polypeptide folding, beyond its previously proposed role of adapting to the cost of expression. These findings establish how mRNA sequences are generally under selection to optimize the cotranslational folding of corresponding polypeptides.
Speed Controls in Translating Secretory Proteins in Eukaryotes - an Evolutionary Perspective
Mahlab, Shelly; Linial, Michal
2014-01-01
Protein translation is the most expensive operation in dividing cells from bacteria to humans. Therefore, managing the speed and allocation of resources is subject to tight control. From bacteria to humans, clusters of relatively rare tRNA codons at the N′-terminal of mRNAs have been implicated in attenuating the process of ribosome allocation, and consequently the translation rate in a broad range of organisms. The current interpretation of “slow” tRNA codons does not distinguish between protein translations mediated by free- or endoplasmic reticulum (ER)-bound ribosomes. We demonstrate that proteins translated by free- or ER-bound ribosomes exhibit different overall properties in terms of their translation efficiency and speed in yeast, fly, plant, worm, bovine and human. We note that only secreted or membranous proteins with a Signal peptide (SP) are specified by segments of “slow” tRNA at the N′-terminal, followed by abundant codons that are considered “fast.” Such profiles apply to 3100 proteins of the human proteome that are composed of secreted and signal peptide (SP)-assisted membranous proteins. Remarkably, the bulks of the proteins (12,000), or membranous proteins lacking SP (3400), do not have such a pattern. Alternation of “fast” and “slow” codons was found also in proteins that translocate to mitochondria through transit peptides (TP). The differential clusters of tRNA adapted codons is not restricted to the N′-terminal of transcripts. Specifically, Glycosylphosphatidylinositol (GPI)-anchored proteins are unified by clusters of low adapted tRNAs codons at the C′-termini. Furthermore, selection of amino acids types and specific codons was shown as the driving force which establishes the translation demands for the secretory proteome. We postulate that “hard-coded” signals within the secretory proteome assist the steps of protein maturation and folding. Specifically, “speed control” signals for delaying the translation of a nascent protein fulfill the co- and post-translational stages such as membrane translocation, proteins processing and folding. PMID:24391480
Mathew, Suneeth F; Crowe-McAuliffe, Caillan; Graves, Ryan; Cardno, Tony S; McKinney, Cushla; Poole, Elizabeth S; Tate, Warren P
2015-01-01
HIV-1 utilises -1 programmed ribosomal frameshifting to translate structural and enzymatic domains in a defined proportion required for replication. A slippery sequence, U UUU UUA, and a stem-loop are well-defined RNA features modulating -1 frameshifting in HIV-1. The GGG glycine codon immediately following the slippery sequence (the 'intercodon') contributes structurally to the start of the stem-loop but has no defined role in current models of the frameshift mechanism, as slippage is inferred to occur before the intercodon has reached the ribosomal decoding site. This GGG codon is highly conserved in natural isolates of HIV. When the natural intercodon was replaced with a stop codon two different decoding molecules-eRF1 protein or a cognate suppressor tRNA-were able to access and decode the intercodon prior to -1 frameshifting. This implies significant slippage occurs when the intercodon is in the (perhaps distorted) ribosomal A site. We accommodate the influence of the intercodon in a model of frame maintenance versus frameshifting in HIV-1.
Abad, Francisco; de la Morena-Barrio, María Eugenia; Fernández-Breis, Jesualdo Tomás; Corral, Javier
2018-06-01
Translation is a key biological process controlled in eukaryotes by the initiation AUG codon. Variations affecting this codon may have pathological consequences by disturbing the correct initiation of translation. Unfortunately, there is no systematic study describing these variations in the human genome. Moreover, we aimed to develop new tools for in silico prediction of the pathogenicity of gene variations affecting AUG codons, because to date, these gene defects have been wrongly classified as missense. Whole-exome analysis revealed the mean of 12 gene variations per person affecting initiation codons, mostly with high (> 0:01) minor allele frequency (MAF). Moreover, analysis of Ensembl data (December 2017) revealed 11,261 genetic variations affecting the initiation AUG codon of 7,205 genes. Most of these variations (99.5%) have low or unknown MAF, probably reflecting deleterious consequences. Only 62 variations had high MAF. Genetic variations with high MAF had closer alternative AUG downstream codons than did those with low MAF. Besides, the high-MAF group better maintained both the signal peptide and reading frame. These differentiating elements could help to determine the pathogenicity of this kind of variation. Data and scripts in Perl and R are freely available at https://github.com/fanavarro/hemodonacion. jfernand@um.es. Supplementary data are available at Bioinformatics online.
Hypothesis Formation and Qualitative Reasoning in Molecular Biology
1989-06-01
presents studies of the trp operon in the bacterium S . Marcescens . In vitro transcription studies showed that transcription termination does occur in...observed was that there are two 4.4. ANNOTATED CHRONOLOGY OF THE RESEARCH 135 translation-start codons in the S . marcescens leader region. The authors...of leader-region mRNA secondary structures in attenuation in the S . marcescens trp operon. A different bac- terium was used because it included
Translation efficiency is determined by both codon bias and folding energy
Tuller, Tamir; Waldman, Yedael Y.; Kupiec, Martin; Ruppin, Eytan
2010-01-01
Synonymous mutations do not alter the protein produced yet can have a significant effect on protein levels. The mechanisms by which this effect is achieved are controversial; although some previous studies have suggested that codon bias is the most important determinant of translation efficiency, a recent study suggested that mRNA folding at the beginning of genes is the dominant factor via its effect on translation initiation. Using the Escherichia coli and Saccharomyces cerevisiae transcriptomes, we conducted a genome-scale study aiming at dissecting the determinants of translation efficiency. There is a significant association between codon bias and translation efficiency across all endogenous genes in E. coli and S. cerevisiae but no association between folding energy and translation efficiency, demonstrating the role of codon bias as an important determinant of translation efficiency. However, folding energy does modulate the strength of association between codon bias and translation efficiency, which is maximized at very weak mRNA folding (i.e., high folding energy) levels. We find a strong correlation between the genomic profiles of ribosomal density and genomic profiles of folding energy across mRNA, suggesting that lower folding energies slow down the ribosomes and decrease translation efficiency. Accordingly, we find that selection forces act near uniformly to decrease the folding energy at the beginning of genes. In summary, these findings testify that in endogenous genes, folding energy affects translation efficiency in a global manner that is not related to the expression levels of individual genes, and thus cannot be detected by correlation with their expression levels. PMID:20133581
Sarrazin, Sandrine; Starck, Joëlle; Gonnet, Colette; Doubeikovski, Alexandre; Melet, Fabrice; Morle, François
2000-01-01
The proto-oncogene Fli-1 encodes a transcription factor of the ets family whose overexpression is associated with multiple virally induced leukemias in mouse, inhibits murine and avian erythroid cell differentiation, and induces drastic perturbations of early development in Xenopus. This study demonstrates the surprisingly sophisticated regulation of Fli-1 mRNA translation. We establish that two FLI-1 protein isoforms (of 51 and 48 kDa) detected by Western blotting in vivo are synthesized by alternative translation initiation through the use of two highly conserved in-frame initiation codons, AUG +1 and AUG +100. Furthermore, we show that the synthesis of these two FLI-1 isoforms is regulated by two short overlapping 5′ upstream open reading frames (uORF) beginning at two highly conserved upstream initiation codons, AUG −41 and GUG −37, and terminating at two highly conserved stop codons, UGA +35 and UAA +15. The mutational analysis of these two 5′ uORF revealed that each of them negatively regulates FLI-1 protein synthesis by precluding cap-dependent scanning to the 48- and 51-kDa AUG codons. Simultaneously, the translation termination of the two 5′ uORF appears to enhance 48-kDa protein synthesis, by allowing downstream reinitiation at the 48-kDa AUG codon, and 51-kDa protein synthesis, by allowing scanning ribosomes to pile up and consequently allowing upstream initiation at the 51-kDa AUG codon. To our knowledge, this is the first example of a cellular mRNA displaying overlapping 5′ uORF whose translation termination appears to be involved in the positive control of translation initiation at both downstream and upstream initiation codons. PMID:10757781
Hussmann, Jeffrey A; Patchett, Stephanie; Johnson, Arlen; Sawyer, Sara; Press, William H
2015-12-01
Ribosome profiling produces snapshots of the locations of actively translating ribosomes on messenger RNAs. These snapshots can be used to make inferences about translation dynamics. Recent ribosome profiling studies in yeast, however, have reached contradictory conclusions regarding the average translation rate of each codon. Some experiments have used cycloheximide (CHX) to stabilize ribosomes before measuring their positions, and these studies all counterintuitively report a weak negative correlation between the translation rate of a codon and the abundance of its cognate tRNA. In contrast, some experiments performed without CHX report strong positive correlations. To explain this contradiction, we identify unexpected patterns in ribosome density downstream of each type of codon in experiments that use CHX. These patterns are evidence that elongation continues to occur in the presence of CHX but with dramatically altered codon-specific elongation rates. The measured positions of ribosomes in these experiments therefore do not reflect the amounts of time ribosomes spend at each position in vivo. These results suggest that conclusions from experiments in yeast using CHX may need reexamination. In particular, we show that in all such experiments, codons decoded by less abundant tRNAs were in fact being translated more slowly before the addition of CHX disrupted these dynamics.
Hussmann, Jeffrey A.; Patchett, Stephanie; Johnson, Arlen; Sawyer, Sara; Press, William H.
2015-01-01
Ribosome profiling produces snapshots of the locations of actively translating ribosomes on messenger RNAs. These snapshots can be used to make inferences about translation dynamics. Recent ribosome profiling studies in yeast, however, have reached contradictory conclusions regarding the average translation rate of each codon. Some experiments have used cycloheximide (CHX) to stabilize ribosomes before measuring their positions, and these studies all counterintuitively report a weak negative correlation between the translation rate of a codon and the abundance of its cognate tRNA. In contrast, some experiments performed without CHX report strong positive correlations. To explain this contradiction, we identify unexpected patterns in ribosome density downstream of each type of codon in experiments that use CHX. These patterns are evidence that elongation continues to occur in the presence of CHX but with dramatically altered codon-specific elongation rates. The measured positions of ribosomes in these experiments therefore do not reflect the amounts of time ribosomes spend at each position in vivo. These results suggest that conclusions from experiments in yeast using CHX may need reexamination. In particular, we show that in all such experiments, codons decoded by less abundant tRNAs were in fact being translated more slowly before the addition of CHX disrupted these dynamics. PMID:26656907
Conserved small mRNA with an unique, extended Shine-Dalgarno sequence
Hahn, Julia; Migur, Anzhela; von Boeselager, Raphael Freiherr; Kubatova, Nina; Kubareva, Elena; Schwalbe, Harald
2017-01-01
ABSTRACT Up to now, very small protein-coding genes have remained unrecognized in sequenced genomes. We identified an mRNA of 165 nucleotides (nt), which is conserved in Bradyrhizobiaceae and encodes a polypeptide with 14 amino acid residues (aa). The small mRNA harboring a unique Shine-Dalgarno sequence (SD) with a length of 17 nt was localized predominantly in the ribosome-containing P100 fraction of Bradyrhizobium japonicum USDA 110. Strong interaction between the mRNA and 30S ribosomal subunits was demonstrated by their co-sedimentation in sucrose density gradient. Using translational fusions with egfp, we detected weak translation and found that it is impeded by both the extended SD and the GTG start codon (instead of ATG). Biophysical characterization (CD- and NMR-spectroscopy) showed that synthesized polypeptide remained unstructured in physiological puffer. Replacement of the start codon by a stop codon increased the stability of the transcript, strongly suggesting additional posttranscriptional regulation at the ribosome. Therefore, the small gene was named rreB (ribosome-regulated expression in Bradyrhizobiaceae). Assuming that the unique ribosome binding site (RBS) is a hallmark of rreB homologs or similarly regulated genes, we looked for similar putative RBS in bacterial genomes and detected regions with at least 16 nt complementarity to the 3′-end of 16S rRNA upstream of sORFs in Caulobacterales, Rhizobiales, Rhodobacterales and Rhodospirillales. In the Rhodobacter/Roseobacter lineage of α-proteobacteria the corresponding gene (rreR) is conserved and encodes an 18 aa protein. This shows how specific RBS features can be used to identify new genes with presumably similar control of expression at the RNA level. PMID:27834614
Position-dependent termination and widespread obligatory frameshifting in Euplotes translation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lobanov, Alexei V.; Heaphy, Stephen M.; Turanov, Anton A.
2016-11-21
The ribosome can change its reading frame during translation in a process known as programmed ribosomal frameshifting. These rare events are supported by complex mRNA signals. However, we found that the ciliates Euplotes crassus and Euplotes focardii exhibit widespread frameshifting at stop codons. 47 different codons preceding stop signals resulted in either +1 or +2 frameshifts, and +1 frameshifting at AAA was the most frequent. The frameshifts showed unusual plasticity and rapid evolution, and had little influence on translation rates. The proximity of a stop codon to the 3' mRNA end, rather than its occurrence or sequence context, appeared tomore » designate termination. Thus, a ‘stop codon’ is not a sufficient signal for translation termination, and the default function of stop codons in Euplotes is frameshifting, whereas termination is specific to certain mRNA positions and probably requires additional factors.« less
Mechanism and Regulation of Protein Synthesis in Saccharomyces cerevisiae
Dever, Thomas E.; Kinzy, Terri Goss; Pavitt, Graham D.
2016-01-01
In this review, we provide an overview of protein synthesis in the yeast Saccharomyces cerevisiae. The mechanism of protein synthesis is well conserved between yeast and other eukaryotes, and molecular genetic studies in budding yeast have provided critical insights into the fundamental process of translation as well as its regulation. The review focuses on the initiation and elongation phases of protein synthesis with descriptions of the roles of translation initiation and elongation factors that assist the ribosome in binding the messenger RNA (mRNA), selecting the start codon, and synthesizing the polypeptide. We also examine mechanisms of translational control highlighting the mRNA cap-binding proteins and the regulation of GCN4 and CPA1 mRNAs. PMID:27183566
Yatawara, Lalani; Wickramasinghe, Susiji; Rajapakse, R P V J; Agatsuma, Takeshi
2010-09-01
In the present study, we determined the complete mitochondrial (mt) genome sequence (13,839bp) of parasitic nematode Setaria digitata and its structure and organization compared with Onchocerca volvulus, Dirofilaria immitis and Brugia malayi. The mt genome of S. digitata is slightly larger than the mt genomes of other filarial nematodes. S. digitata mt genome contains 36 genes (12 protein-coding genes, 22 transfer RNAs and 2 ribosomal RNAs) that are typically found in metazoans. This genome contains a high A+T (75.1%) content and low G+C content (24.9%). The mt gene order for S. digitata is the same as those for O. volvulus, D. immitis and B. malayi but it is distinctly different from other nematodes compared. The start codons inferred in the mt genome of S. digitata are TTT, ATT, TTG, ATG, GTT and ATA. Interestingly, the initiation codon TTT is unique to S. digitata mt genome and four protein-coding genes use this codon as a translation initiation codon. Five protein-coding genes use TAG as a stop codon whereas three genes use TAA and four genes use T as a termination codon. Out of 64 possible codons, only 57 are used for mitochondrial protein-coding genes of S. digitata. T-rich codons such as TTT (18.9%), GTT (7.9%), TTG (7.8%), TAT (7%), ATT (5.7%), TCT (4.8%) and TTA (4.1%) are used more frequently. This pattern of codon usage reflects the strong bias for T in the mt genome of S. digitata. In conclusion, the present investigation provides new molecular data for future studies of the comparative mitochondrial genomics and systematic of parasitic nematodes of socio-economic importance. 2010 Elsevier B.V. All rights reserved.
Mühlhausen, Stefanie; Findeisen, Peggy; Plessmann, Uwe; Urlaub, Henning; Kollmar, Martin
2016-01-01
The genetic code is the cellular translation table for the conversion of nucleotide sequences into amino acid sequences. Changes to the meaning of sense codons would introduce errors into almost every translated message and are expected to be highly detrimental. However, reassignment of single or multiple codons in mitochondria and nuclear genomes, although extremely rare, demonstrates that the code can evolve. Several models for the mechanism of alteration of nuclear genetic codes have been proposed (including “codon capture,” “genome streamlining,” and “ambiguous intermediate” theories), but with little resolution. Here, we report a novel sense codon reassignment in Pachysolen tannophilus, a yeast related to the Pichiaceae. By generating proteomics data and using tRNA sequence comparisons, we show that Pachysolen translates CUG codons as alanine and not as the more usual leucine. The Pachysolen tRNACAG is an anticodon-mutated tRNAAla containing all major alanine tRNA recognition sites. The polyphyly of the CUG-decoding tRNAs in yeasts is best explained by a tRNA loss driven codon reassignment mechanism. Loss of the CUG-tRNA in the ancient yeast is followed by gradual decrease of respective codons and subsequent codon capture by tRNAs whose anticodon is not part of the aminoacyl-tRNA synthetase recognition region. Our hypothesis applies to all nuclear genetic code alterations and provides several testable predictions. We anticipate more codon reassignments to be uncovered in existing and upcoming genome projects. PMID:27197221
Efficient Reassignment of a Frequent Serine Codon in Wild-Type Escherichia coli.
Ho, Joanne M; Reynolds, Noah M; Rivera, Keith; Connolly, Morgan; Guo, Li-Tao; Ling, Jiqiang; Pappin, Darryl J; Church, George M; Söll, Dieter
2016-02-19
Expansion of the genetic code through engineering the translation machinery has greatly increased the chemical repertoire of the proteome. This has been accomplished mainly by read-through of UAG or UGA stop codons by the noncanonical aminoacyl-tRNA of choice. While stop codon read-through involves competition with the translation release factors, sense codon reassignment entails competition with a large pool of endogenous tRNAs. We used an engineered pyrrolysyl-tRNA synthetase to incorporate 3-iodo-l-phenylalanine (3-I-Phe) at a number of different serine and leucine codons in wild-type Escherichia coli. Quantitative LC-MS/MS measurements of amino acid incorporation yields carried out in a selected reaction monitoring experiment revealed that the 3-I-Phe abundance at the Ser208AGU codon in superfolder GFP was 65 ± 17%. This method also allowed quantification of other amino acids (serine, 33 ± 17%; phenylalanine, 1 ± 1%; threonine, 1 ± 1%) that compete with 3-I-Phe at both the aminoacylation and decoding steps of translation for incorporation at the same codon position. Reassignments of different serine (AGU, AGC, UCG) and leucine (CUG) codons with the matching tRNA(Pyl) anticodon variants were met with varying success, and our findings provide a guideline for the choice of sense codons to be reassigned. Our results indicate that the 3-iodo-l-phenylalanyl-tRNA synthetase (IFRS)/tRNA(Pyl) pair can efficiently outcompete the cellular machinery to reassign select sense codons in wild-type E. coli.
Tsotakos, Nikolaos; Silveyra, Patricia; Lin, Zhenwu; Thomas, Neal; Vaid, Mudit
2014-01-01
Surfactant protein A (SP-A), a molecule with roles in lung innate immunity and surfactant-related functions, is encoded by two genes in humans: SFTPA1 (SP-A1) and SFTPA2 (SP-A2). The mRNAs from these genes differ in their 5′-untranslated regions (5′-UTR) due to differential splicing. The 5′-UTR variant ACD′ is exclusively found in transcripts of SP-A1, but not in those of SP-A2. Its unique exon C contains two upstream AUG codons (uAUGs) that may affect SP-A1 translation efficiency. The first uAUG (u1) is in frame with the primary start codon (p), but the second one (u2) is not. The purpose of this study was to assess the impact of uAUGs on SP-A1 expression. We employed RT-qPCR to determine the presence of exon C-containing SP-A1 transcripts in human RNA samples. We also used in vitro techniques including mutagenesis, reporter assays, and toeprinting analysis, as well as in silico analyses to determine the role of uAUGs. Exon C-containing mRNA is present in most human lung tissue samples and its expression can, under certain conditions, be regulated by factors such as dexamethasone or endotoxin. Mutating uAUGs resulted in increased luciferase activity. The mature protein size was not affected by the uAUGs, as shown by a combination of toeprint and in silico analysis for Kozak sequence, secondary structure, and signal peptide and in vitro translation in the presence of microsomes. In conclusion, alternative splicing may introduce uAUGs in SP-A1 transcripts, which in turn negatively affect SP-A1 translation, possibly affecting SP-A1/SP-A2 ratio, with potential for clinical implication. PMID:25326576
Mutations That Affect the Efficiency of Translation of mRNA for the cII Gene of Coliphage Lambda
Dul, Ed; Mahoney, Michael E.; Wulff, Daniel L.
1987-01-01
Starting with the λ pRE- strain λctr1 cy3008, which forms clear plaques, we have isolated two mutant strains, λdya2 ctr1 cy3008 and λ dya3 ctr1 cy3008, that form plaques with very slightly turbid centers. The dya2 and dya3 mutations lie in the region of overlap between the PRE promoter and the ribosome recognition region of the cII gene, and have nucleotide alterations at positions -1 and +5 of pRE, and alterations of cII mRNA at -16 and -21 nucleotides before the initial AUG codon of the gene. Both mutations destabilize a stem structure that may be formed by cII mRNA, and dya2 also changes the sequence on cII mRNA that is complementary to the 3'-end of 16 S rRNA from 5'-UAAGGA-3' to 5'-UGAGGA-3'.—The dya2 and dya3 mutations, along with the ctr1 mutation, which destabilizes either of two alternate stem structures which may be formed by cII mRNA (these being more stable stem structures than the one affected by dya2 and dya3), were tested for their ability to reverse two cII- mutations that are characterized by inefficient translation of cII mRNA. These are cII3088, an A → G mutation four bases before the initial AUG codon, and cII3059 , a GUU → GAU (Val2 → Asp) second codon mutation. It was found that ctr1 completely reverses the translation defects of these two mutations, while dya2 partially reverses these translation defects. The dya3 mutation has no effect on translation efficiency under any condition tested. However neither the ctr1 mutation nor the dya2 mutation has much effect on translation efficiency in an otherwise cII+ background, indicating that other factors must limit the rate of translation of cII mRNA under these conditions. PMID:2953647
Moustakas, A; Sonstegard, T S; Hackett, P B
1993-01-01
The Rous sarcoma virus (RSV) leader RNA has three short open reading frames (ORF1 to ORF3) which are conserved in all avian sarcoma-leukosis retroviruses. Effects on virus propagation were determined following three types of alterations in the ORFs: (i) replacement of AUG initiation codons in order to prohibit ORF translation, (ii) alterations of the codon context around the AUG initiation codon to enhance translation of the normally silent ORF3, and (iii) elongation of the ORF coding sequences. Mutagenesis of the AUG codons for ORF1 and ORF2 (AUG1 and AUG2) singly or together delayed the onset of viral replication and cell transformation. In contrast, mutagenesis of AUG3 almost completely suppressed these viral activities. Mutagenesis of ORF3 to enhance its translation inhibited viral propagation. When the mutant ORF3 included an additional frameshift mutation which extended the ORF beyond the initiation site for the gag, gag-pol, and env proteins, host cells were initially transformed but died soon thereafter. Elongation of ORF1 from 7 to 62 codons led to the accumulation of transformation-defective virus with a delayed onset of replication. In contrast, viruses with elongation of ORF1 from 7 to 30 codons, ORF2 from 16 to 48 codons, or ORF3 from 9 to 64 codons, without any alterations in the AUG context, exhibited wild-type phenotypes. These results are consistent with a model that translation of the ORFs is necessary to facilitate virus production. Images PMID:7685415
Takahara, Michiyo; Sakaue, Haruka; Onishi, Yukiko; Yamagishi, Marifu; Kida, Yuichiro; Sakaguchi, Masao
2013-01-11
Nascent chain release from membrane-bound ribosomes by the termination codon was investigated using a cell-free translation system from rabbit supplemented with rough microsomal membrane vesicles. Chain release was extremely slow when mRNA ended with only the termination codon. Tail extension after the termination codon enhanced the release of the nascent chain. Release reached plateau levels with tail extension of 10 bases. This requirement was observed with all termination codons: TAA, TGA and TAG. Rapid release was also achieved by puromycin even in the absence of the extension. Efficient translation termination cannot be achieved in the presence of only a termination codon on the mRNA. Tail extension might be required for correct positioning of the termination codon in the ribosome and/or efficient recognition by release factors. Copyright © 2012. Published by Elsevier Inc.
Trotta, Edoardo
2016-05-17
The three stop codons UAA, UAG, and UGA signal the termination of mRNA translation. As a result of a mechanism that is not adequately understood, they are normally used with unequal frequencies. In this work, we showed that selective forces and mutational biases drive stop codon usage in the human genome. We found that, in respect to sense codons, stop codon usage was affected by stronger selective forces but was less influenced by neutral mutational biases. UGA is the most frequent termination codon in human genome. However, UAA was the preferred stop codon in genes with high breadth of expression, high level of expression, AT-rich coding sequences, housekeeping functions, and in gene ontology categories with the largest deviation from expected stop codon usage. Selective forces associated with the breadth and the level of expression favoured AT-rich sequences in the mRNA region including the stop site and its proximal 3'-UTR, but acted with scarce effects on sense codons, generating two regions, upstream and downstream of the stop codon, with strongly different base composition. By favouring low levels of GC-content, selection promoted labile local secondary structures at the stop site and its proximal 3'-UTR. The compositional and structural context favoured by selection was surprisingly emphasized in the class of ribosomal proteins and was consistent with sequence elements that increase the efficiency of translational termination. Stop codons were also heterogeneously distributed among chromosomes by a mechanism that was strongly correlated with the GC-content of coding sequences. In human genome, the nucleotide composition and the thermodynamic stability of stop codon site and its proximal 3'-UTR are correlated with the GC-content of coding sequences and with the breadth and the level of gene expression. In highly expressed genes stop codon usage is compositionally and structurally consistent with highly efficient translation termination signals.
Circ-ZNF609 Is a Circular RNA that Can Be Translated and Functions in Myogenesis.
Legnini, Ivano; Di Timoteo, Gaia; Rossi, Francesca; Morlando, Mariangela; Briganti, Francesca; Sthandier, Olga; Fatica, Alessandro; Santini, Tiziana; Andronache, Adrian; Wade, Mark; Laneve, Pietro; Rajewsky, Nikolaus; Bozzoni, Irene
2017-04-06
Circular RNAs (circRNAs) constitute a family of transcripts with unique structures and still largely unknown functions. Their biogenesis, which proceeds via a back-splicing reaction, is fairly well characterized, whereas their role in the modulation of physiologically relevant processes is still unclear. Here we performed expression profiling of circRNAs during in vitro differentiation of murine and human myoblasts, and we identified conserved species regulated in myogenesis and altered in Duchenne muscular dystrophy. A high-content functional genomic screen allowed the study of their functional role in muscle differentiation. One of them, circ-ZNF609, resulted in specifically controlling myoblast proliferation. Circ-ZNF609 contains an open reading frame spanning from the start codon, in common with the linear transcript, and terminating at an in-frame STOP codon, created upon circularization. Circ-ZNF609 is associated with heavy polysomes, and it is translated into a protein in a splicing-dependent and cap-independent manner, providing an example of a protein-coding circRNA in eukaryotes. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Mühlhausen, Stefanie; Findeisen, Peggy; Plessmann, Uwe; Urlaub, Henning; Kollmar, Martin
2016-07-01
The genetic code is the cellular translation table for the conversion of nucleotide sequences into amino acid sequences. Changes to the meaning of sense codons would introduce errors into almost every translated message and are expected to be highly detrimental. However, reassignment of single or multiple codons in mitochondria and nuclear genomes, although extremely rare, demonstrates that the code can evolve. Several models for the mechanism of alteration of nuclear genetic codes have been proposed (including "codon capture," "genome streamlining," and "ambiguous intermediate" theories), but with little resolution. Here, we report a novel sense codon reassignment in Pachysolen tannophilus, a yeast related to the Pichiaceae. By generating proteomics data and using tRNA sequence comparisons, we show that Pachysolen translates CUG codons as alanine and not as the more usual leucine. The Pachysolen tRNACAG is an anticodon-mutated tRNA(Ala) containing all major alanine tRNA recognition sites. The polyphyly of the CUG-decoding tRNAs in yeasts is best explained by a tRNA loss driven codon reassignment mechanism. Loss of the CUG-tRNA in the ancient yeast is followed by gradual decrease of respective codons and subsequent codon capture by tRNAs whose anticodon is not part of the aminoacyl-tRNA synthetase recognition region. Our hypothesis applies to all nuclear genetic code alterations and provides several testable predictions. We anticipate more codon reassignments to be uncovered in existing and upcoming genome projects. © 2016 Mühlhausen et al.; Published by Cold Spring Harbor Laboratory Press.
Xu, Yi; Ju, Ho-Jong; DeBlasio, Stacy; Carino, Elizabeth J; Johnson, Richard; MacCoss, Michael J; Heck, Michelle; Miller, W Allen; Gray, Stewart M
2018-06-01
Translational readthrough of the stop codon of the capsid protein (CP) open reading frame (ORF) is used by members of the Luteoviridae to produce their minor capsid protein as a readthrough protein (RTP). The elements regulating RTP expression are not well understood, but they involve long-distance interactions between RNA domains. Using high-resolution mass spectrometry, glutamine and tyrosine were identified as the primary amino acids inserted at the stop codon of Potato leafroll virus (PLRV) CP ORF. We characterized the contributions of a cytidine-rich domain immediately downstream and a branched stem-loop structure 600 to 700 nucleotides downstream of the CP stop codon. Mutations predicted to disrupt and restore the base of the distal stem-loop structure prevented and restored stop codon readthrough. Motifs in the downstream readthrough element (DRTE) are predicted to base pair to a site within 27 nucleotides (nt) of the CP ORF stop codon. Consistent with a requirement for this base pairing, the DRTE of Cereal yellow dwarf virus was not compatible with the stop codon-proximal element of PLRV in facilitating readthrough. Moreover, deletion of the complementary tract of bases from the stop codon-proximal region or the DRTE of PLRV prevented readthrough. In contrast, the distance and sequence composition between the two domains was flexible. Mutants deficient in RTP translation moved long distances in plants, but fewer infection foci developed in systemically infected leaves. Selective 2'-hydroxyl acylation and primer extension (SHAPE) probing to determine the secondary structure of the mutant DRTEs revealed that the functional mutants were more likely to have bases accessible for long-distance base pairing than the nonfunctional mutants. This study reveals a heretofore unknown combination of RNA structure and sequence that reduces stop codon efficiency, allowing translation of a key viral protein. IMPORTANCE Programmed stop codon readthrough is used by many animal and plant viruses to produce key viral proteins. Moreover, such "leaky" stop codons are used in host mRNAs or can arise from mutations that cause genetic disease. Thus, it is important to understand the mechanism(s) of stop codon readthrough. Here, we shed light on the mechanism of readthrough of the stop codon of the coat protein ORFs of viruses in the Luteoviridae by identifying the amino acids inserted at the stop codon and RNA structures that facilitate this "leakiness" of the stop codon. Members of the Luteoviridae encode a C-terminal extension to the capsid protein known as the readthrough protein (RTP). We characterized two RNA domains in Potato leafroll virus (PLRV), located 600 to 700 nucleotides apart, that are essential for efficient RTP translation. We further determined that the PLRV readthrough process involves both local structures and long-range RNA-RNA interactions. Genetic manipulation of the RNA structure altered the ability of PLRV to translate RTP and systemically infect the plant. This demonstrates that plant virus RNA contains multiple layers of information beyond the primary sequence and extends our understanding of stop codon readthrough. Strategic targets that can be exploited to disrupt the virus life cycle and reduce its ability to move within and between plant hosts were revealed. Copyright © 2018 American Society for Microbiology.
Aubee, Joseph I.; Olu, Morenike
2016-01-01
The translation of rpoS (σS), the general stress/stationary phase sigma factor, is tightly regulated at the post-transcriptional level by several factors via mechanisms that are not clearly defined. One of these factors is MiaA, the enzyme necessary for the first step in the N6-isopentyl-2-thiomethyladenosinemethyladenosine 37 (ms2i6A37) tRNA modification. We tested the hypothesis that an elevated UUX-Leucine/total leucine codon ratio can be used to identify transcripts whose translation would be sensitive to loss of the MiaA-dependent modification. We identified iraP as another candidate MiaA-sensitive gene, based on the UUX-Leucine/total leucine codon ratio. An iraP-lacZ fusion was significantly decreased in the absence of MiaA, consistent with our predictive model. To determine the role of MiaA in UUX-Leucine decoding in rpoS and iraP, we measured β-galactosidase-specific activity of miaA− rpoS and iraP translational fusions upon overexpression of leucine tRNAs. We observed suppression of the MiaA effect on rpoS, and not iraP, via overexpression of tRNALeuX but not tRNALeuZ. We also tested the hypothesis that the MiaA requirement for rpoS and iraP translation is due to decoding of UUX-Leucine codons within the rpoS and iraP transcripts, respectively. We observed a partial suppression of the MiaA requirement for rpoS and iraP translational fusions containing one or both UUX-Leucine codons removed. Taken together, this suggests that MiaA is necessary for rpoS and iraP translation through proper decoding of UUX-Leucine codons and that rpoS and iraP mRNAs are both modification tunable transcripts (MoTTs) via the presence of the modification. PMID:26979278
James, D; Varga, A; Croft, H
2007-01-01
The entire genome of peach chlorotic mottle virus (PCMV), originally identified as Prunus persica cv. Agua virus (4N6), was sequenced and analysed. PCMV cross-reacts with antisera to diverse viruses, such as plum pox virus (PPV), genus Potyvirus, family Potyviridae; and apple stem pitting virus (ASPV), genus Foveavirus, family Flexiviridae. The PCMV genome consists of 9005 nucleotides (nts), excluding a poly(A) tail at the 3' end of the genome. Five open reading frames (ORFs) were identified with four untranslated regions (UTR) including a 5', a 3', and two intergenic UTRs. The genome organisation of PCMV is similar to that of ASPV and the two genomes share a nucleotide (nt) sequence identity of 58%. PCMV ORF1 encodes the replication-associated protein complex (Mr 241,503), ORF2-ORF4 code for the triple gene block proteins (TGBp; Mr 24,802, 12,370, and 7320, respectively), and ORF5 encodes the coat protein (CP) (Mr 42,505). Two non-AUG start codons participate in the initiation of translation: 35AUC and 7676AUA initiate translation of ORF1 and ORF5. In vitro expression with subsequent Western blot analysis confirmed ORF5 as the CP-encoding gene and confirmed that the codon AUA is able to initiate translation of the CP. Expression of a truncated CP fragment (Mr 39, 689) was demonstrated, and both proteins are expressed in vivo, since both were observed in Western blot analysis of PCMV-infected peach and Nicotiana occidentalis. The expressed proteins cross-reacted with an antiserum against ASPV. The amino acid sequences of the CPs of PCMV and ASPV CP share only 37% identity, but there are 11 shared peptides 4-8 aa residues long. These may constitute linear epitopes responsible for ASPV antiserum cross reactions. No significant common linear epitopes were associated with PPV. Extensive phylogenetic analysis indicates that PCMV is closely related to ASPV and is a new and distinct member of the genus Foveavirus.
Codon influence on protein expression in E. coli correlates with mRNA levels
Boël, Grégory; Wong, Kam-Ho; Su, Min; Luff, Jon; Valecha, Mayank; Everett, John K.; Acton, Thomas B.; Xiao, Rong; Montelione, Gaetano T.; Aalberts, Daniel P.; Hunt, John F.
2016-01-01
Degeneracy in the genetic code, which enables a single protein to be encoded by a multitude of synonymous gene sequences, has an important role in regulating protein expression, but substantial uncertainty exists concerning the details of this phenomenon. Here we analyze the sequence features influencing protein expression levels in 6,348 experiments using bacteriophage T7 polymerase to synthesize messenger RNA in Escherichia coli. Logistic regression yields a new codon-influence metric that correlates only weakly with genomic codon-usage frequency, but strongly with global physiological protein concentrations and also mRNA concentrations and lifetimes in vivo. Overall, the codon content influences protein expression more strongly than mRNA-folding parameters, although the latter dominate in the initial ~16 codons. Genes redesigned based on our analyses are transcribed with unaltered efficiency but translated with higher efficiency in vitro. The less efficiently translated native sequences show greatly reduced mRNA levels in vivo. Our results suggest that codon content modulates a kinetic competition between protein elongation and mRNA degradation that is a central feature of the physiology and also possibly the regulation of translation in E. coli. PMID:26760206
Cenik, Can; Cenik, Elif Sarinay; Byeon, Gun W.; Grubert, Fabian; Candille, Sophie I.; Spacek, Damek; Alsallakh, Bilal; Tilgner, Hagen; Araya, Carlos L.; Tang, Hua; Ricci, Emiliano; Snyder, Michael P.
2015-01-01
Elucidating the consequences of genetic differences between humans is essential for understanding phenotypic diversity and personalized medicine. Although variation in RNA levels, transcription factor binding, and chromatin have been explored, little is known about global variation in translation and its genetic determinants. We used ribosome profiling, RNA sequencing, and mass spectrometry to perform an integrated analysis in lymphoblastoid cell lines from a diverse group of individuals. We find significant differences in RNA, translation, and protein levels suggesting diverse mechanisms of personalized gene expression control. Combined analysis of RNA expression and ribosome occupancy improves the identification of individual protein level differences. Finally, we identify genetic differences that specifically modulate ribosome occupancy—many of these differences lie close to start codons and upstream ORFs. Our results reveal a new level of gene expression variation among humans and indicate that genetic variants can cause changes in protein levels through effects on translation. PMID:26297486
2008-10-13
Furthermore, the encoded protein of this gene is only 30 kDa. A potential GTG start codon at position 625 also encodes a protein that is too small...horizontal bar and putative alternate translation initiation sites (ATG, GTG , and TTG) are indicated. The sizes and locations of the proteins encoded... gray line with rounded rectangles showing sequence features and motifs, including the Ala- and Pro-rich N-terminal region and the C-terminal Cys and
Pal, Shilpee; Sarkar, Indrani; Roy, Ayan; Mohapatra, Pradeep K Das; Mondal, Keshab C; Sen, Arnab
2018-02-01
The present study has been aimed to the comparative analysis of high GC composition containing Corynebacterium genomes and their evolutionary study by exploring codon and amino acid usage patterns. Phylogenetic study by MLSA approach, indel analysis and BLAST matrix differentiated Corynebacterium species in pathogenic and non-pathogenic clusters. Correspondence analysis on synonymous codon usage reveals that, gene length, optimal codon frequencies and tRNA abundance affect the gene expression of Corynebacterium. Most of the optimal codons as well as translationally optimal codons are C ending i.e. RNY (R-purine, N-any nucleotide base, and Y-pyrimidine) and reveal translational selection pressure on codon bias of Corynebacterium. Amino acid usage is affected by hydrophobicity, aromaticity, protein energy cost, etc. Highly expressed genes followed the cost minimization hypothesis and are less diverged at their synonymous positions of codons. Functional analysis of core genes shows significant difference in pathogenic and non-pathogenic Corynebacterium. The study reveals close relationship between non-pathogenic and opportunistic pathogenic Corynebaterium as well as between molecular evolution and survival niches of the organism.
Translational Redefinition of UGA Codons Is Regulated by Selenium Availability*
Howard, Michael T.; Carlson, Bradley A.; Anderson, Christine B.; Hatfield, Dolph L.
2013-01-01
Incorporation of selenium into ∼25 mammalian selenoproteins occurs by translational recoding whereby in-frame UGA codons are redefined to encode the selenium containing amino acid, selenocysteine (Sec). Here we applied ribosome profiling to examine the effect of dietary selenium levels on the translational mechanisms controlling selenoprotein synthesis in mouse liver. Dietary selenium levels were shown to control gene-specific selenoprotein expression primarily at the translation level by differential regulation of UGA redefinition and Sec incorporation efficiency, although effects on translation initiation and mRNA abundance were also observed. Direct evidence is presented that increasing dietary selenium causes a vast increase in ribosome density downstream of UGA-Sec codons for a subset of selenoprotein mRNAs and that the selenium-dependent effects on Sec incorporation efficiency are mediated in part by the degree of Sec-tRNA[Ser]Sec Um34 methylation. Furthermore, we find evidence for translation in the 5′-UTRs for a subset of selenoproteins and for ribosome pausing near the UGA-Sec codon in those mRNAs encoding the selenoproteins most affected by selenium availability. These data illustrate how dietary levels of the trace element selenium can alter the readout of the genetic code to affect the expression of an entire class of proteins. PMID:23696641
Salvatori, Francesca; Pappadà, Mariangela; Breveglieri, Giulia; D'Aversa, Elisabetta; Finotti, Alessia; Lampronti, Ilaria; Gambari, Roberto; Borgatti, Monica
2018-05-15
Nonsense mutations promote premature translational termination, introducing stop codons within the coding region of mRNAs and causing inherited diseases, including thalassemia. For instance, in β 0 39 thalassemia the CAG (glutamine) codon is mutated to the UAG stop codon, leading to premature translation termination and to mRNA destabilization through the well described NMD (nonsense-mediated mRNA decay). In order to develop an approach facilitating translation and, therefore, protection from NMD, ribosomal read-through molecules, such as aminoglycoside antibiotics, have been tested on mRNAs carrying premature stop codons. These findings have introduced new hopes for the development of a pharmacological approach to the β 0 39 thalassemia therapy. While several strategies, designed to enhance translational read-through, have been reported to inhibit NMD efficiency concomitantly, experimental tools for systematic analysis of mammalian NMD inhibition by translational read-through are lacking. We developed a human cellular model of the β 0 39 thalassemia mutation with UPF-1 suppressed and showing a partial NMD suppression. This novel cellular model could be used for the screening of molecules exhibiting preferential read-through activity allowing a great rescue of the mutated transcripts.
Krefft, Daria; Papkov, Aliaksei; Zylicz-Stachula, Agnieszka; Skowron, Piotr M
2017-01-01
Obtaining thermostable enzymes (thermozymes) is an important aspect of biotechnology. As thermophiles have adapted their genomes to high temperatures, their cloned genes' expression in mesophiles is problematic. This is mainly due to their high GC content, which leads to the formation of unfavorable secondary mRNA structures and codon usage in Escherichia coli (E. coli). RM.TthHB27I is a member of a family of bifunctional thermozymes, containing a restriction endonuclease (REase) and a methyltransferase (MTase) in a single polypeptide. Thermus thermophilus HB27 (T. thermophilus) produces low amounts of RM.TthHB27I with a unique DNA cleavage specificity. We have previously cloned the wild type (wt) gene into E. coli, which increased the production of RM.TthHB27I over 100-fold. However, its enzymatic activities were extremely low for an ORF expressed under a T7 promoter. We have designed and cloned a fully synthetic tthHB27IRM gene, using a modified 'codon randomization' strategy. Codons with a high GC content and of low occurrence in E. coli were eliminated. We incorporated a stem-loop circuit, devised to negatively control the expression of this highly toxic gene by partially hiding the ribosome-binding site (RBS) and START codon in mRNA secondary structures. Despite having optimized 59% of codons, the amount of produced RM.TthHB27I protein was similar for both recombinant tthHB27IRM gene variants. Moreover, the recombinant wt RM.TthHB27I is very unstable, while the RM.TthHB27I resulting from the expression of the synthetic gene exhibited enzymatic activities and stability equal to the native thermozyme isolated from T. thermophilus. Thus, we have developed an efficient purification protocol using the synthetic tthHB27IRM gene variant only. This suggests the effect of co-translational folding kinetics, possibly affected by the frequency of translational errors. The availability of active RM.TthHB27I is of practical importance in molecular biotechnology, extending the palette of available REase specificities.
Ma, Yan-Ping; Ke, Hao; Liang, Zhi-Ling; Liu, Zhen-Xing; Hao, Le; Ma, Jiang-Yao; Li, Yu-Gu
2016-02-24
Streptococcus agalactiae is an important human and animal pathogen. To better understand the genetic features and evolution of S. agalactiae, multiple factors influencing synonymous codon usage patterns in S. agalactiae were analyzed in this study. A- and U-ending rich codons were used in S. agalactiae function genes through the overall codon usage analysis, indicating that Adenine (A)/Thymine (T) compositional constraints might contribute an important role to the synonymous codon usage pattern. The GC3% against the effective number of codon (ENC) value suggested that translational selection was the important factor for codon bias in the microorganism. Principal component analysis (PCA) showed that (i) mutational pressure was the most important factor in shaping codon usage of all open reading frames (ORFs) in the S. agalactiae genome; (ii) strand specific mutational bias was not capable of influencing the codon usage bias in the leading and lagging strands; and (iii) gene length was not the important factor in synonymous codon usage pattern in this organism. Additionally, the high correlation between tRNA adaptation index (tAI) value and codon adaptation index (CAI), frequency of optimal codons (Fop) value, reinforced the role of natural selection for efficient translation in S. agalactiae. Comparison of synonymous codon usage pattern between S. agalactiae and susceptible hosts (human and tilapia) showed that synonymous codon usage of S. agalactiae was independent of the synonymous codon usage of susceptible hosts. The study of codon usage in S. agalactiae may provide evidence about the molecular evolution of the bacterium and a greater understanding of evolutionary relationships between S. agalactiae and its hosts.
Ma, Yan-Ping; Ke, Hao; Liang, Zhi-Ling; Liu, Zhen-Xing; Hao, Le; Ma, Jiang-Yao; Li, Yu-Gu
2016-01-01
Streptococcus agalactiae is an important human and animal pathogen. To better understand the genetic features and evolution of S. agalactiae, multiple factors influencing synonymous codon usage patterns in S. agalactiae were analyzed in this study. A- and U-ending rich codons were used in S. agalactiae function genes through the overall codon usage analysis, indicating that Adenine (A)/Thymine (T) compositional constraints might contribute an important role to the synonymous codon usage pattern. The GC3% against the effective number of codon (ENC) value suggested that translational selection was the important factor for codon bias in the microorganism. Principal component analysis (PCA) showed that (i) mutational pressure was the most important factor in shaping codon usage of all open reading frames (ORFs) in the S. agalactiae genome; (ii) strand specific mutational bias was not capable of influencing the codon usage bias in the leading and lagging strands; and (iii) gene length was not the important factor in synonymous codon usage pattern in this organism. Additionally, the high correlation between tRNA adaptation index (tAI) value and codon adaptation index (CAI), frequency of optimal codons (Fop) value, reinforced the role of natural selection for efficient translation in S. agalactiae. Comparison of synonymous codon usage pattern between S. agalactiae and susceptible hosts (human and tilapia) showed that synonymous codon usage of S. agalactiae was independent of the synonymous codon usage of susceptible hosts. The study of codon usage in S. agalactiae may provide evidence about the molecular evolution of the bacterium and a greater understanding of evolutionary relationships between S. agalactiae and its hosts. PMID:26927064
Vertebrate codon bias indicates a highly GC-rich ancestral genome.
Nabiyouni, Maryam; Prakash, Ashwin; Fedorov, Alexei
2013-04-25
Two factors are thought to have contributed to the origin of codon usage bias in eukaryotes: 1) genome-wide mutational forces that shape overall GC-content and create context-dependent nucleotide bias, and 2) positive selection for codons that maximize efficient and accurate translation. Particularly in vertebrates, these two explanations contradict each other and cloud the origin of codon bias in the taxon. On the one hand, mutational forces fail to explain GC-richness (~60%) of third codon positions, given the GC-poor overall genomic composition among vertebrates (~40%). On the other hand, positive selection cannot easily explain strict regularities in codon preferences. Large-scale bioinformatic assessment, of nucleotide composition of coding and non-coding sequences in vertebrates and other taxa, suggests a simple possible resolution for this contradiction. Specifically, we propose that the last common vertebrate ancestor had a GC-rich genome (~65% GC). The data suggest that whole-genome mutational bias is the major driving force for generating codon bias. As the bias becomes prominent, it begins to affect translation and can result in positive selection for optimal codons. The positive selection can, in turn, significantly modulate codon preferences. Copyright © 2013 Elsevier B.V. All rights reserved.
Schlesinger, Orr; Chemla, Yonatan; Heltberg, Mathias; Ozer, Eden; Marshall, Ryan; Noireaux, Vincent; Jensen, Mogens Høgh; Alfonta, Lital
2017-06-16
Protein synthesis in cells has been thoroughly investigated and characterized over the past 60 years. However, some fundamental issues remain unresolved, including the reasons for genetic code redundancy and codon bias. In this study, we changed the kinetics of the Eschrichia coli transcription and translation processes by mutating the promoter and ribosome binding domains and by using genetic code expansion. The results expose a counterintuitive phenomenon, whereby an increase in the initiation rates of transcription and translation lead to a decrease in protein expression. This effect can be rescued by introducing slow translating codons into the beginning of the gene, by shortening gene length or by reducing initiation rates. On the basis of the results, we developed a biophysical model, which suggests that the density of co-transcriptional-translation plays a role in bacterial protein synthesis. These findings indicate how cells use codon bias to tune translation speed and protein synthesis.
Koh, Dora Chin-Yen; Wang, Xiaoxing; Wong, Sek-Man; Liu, D X
2006-12-01
Viruses depend heavily on host cells for replication and exploit the host translation machinery for its gene expression using various unorthodox translation mechanisms. According to the conventional scanning model, only the 5'-proximal gene in the viral RNA is accessible to the ribosomes whereas other genes are silent. In this study, we use a model plant RNA virus, Hibiscus chlorotic ringspot virus (HCRSV), to investigate various translation mechanisms involved in regulation of the expression of internal genes. The 3'-end 1.2kb region of HCRSV genomic and subgenomic RNAs were shown to encode four polypeptides of 38, 27, 25 and 22.5kDa. Mutagenesis studies revealed that a CUG codon ((2570)CUG) is the initiation codon for p27, the longest of the three co-C-terminal products (p27, p25 and p22.5), and translation of p25 and p22.5 was initiated at (2603)AUG and (2666)AUG, respectively. Translation initiation of the p27 expression at the (2570)CUG codon regulates the expression of p38, the viral coat protein through a leaky scanning mechanism and mutational analysis of an upstream open reading frame (ORF) demonstrated that initiation of the p27 expression at this CUG codon (instead of an AUG) may play a role in maintaining the ratio of p27 and p38. In addition, a previously identified internal ribosome entry site was shown to control the expression of p27 and p38 in the subgenomic RNA 2.
Masuda, Isao; Matsuzaki, Motomichi; Kita, Kiyoshi
2010-10-01
Diverse mitochondrial (mt) genetic systems have evolved independently of the more uniform nuclear system and often employ modified genetic codes. The organization and genetic system of dinoflagellate mt genomes are particularly unusual and remain an evolutionary enigma. We determined the sequence of full-length cytochrome c oxidase subunit 1 (cox1) mRNA of the earliest diverging dinoflagellate Perkinsus and show that this gene resides in the mt genome. Apparently, this mRNA is not translated in a single reading frame with standard codon usage. Our examination of the nucleotide sequence and three-frame translation of the mRNA suggest that the reading frame must be shifted 10 times, at every AGG and CCC codon, to yield a consensus COX1 protein. We suggest two possible mechanisms for these translational frameshifts: a ribosomal frameshift in which stalled ribosomes skip the first bases of these codons or specialized tRNAs recognizing non-triplet codons, AGGY and CCCCU. Regardless of the mechanism, active and efficient machinery would be required to tolerate the frameshifts predicted in Perkinsus mitochondria. To our knowledge, this is the first evidence of translational frameshifts in protist mitochondria and, by far, is the most extensive case in mitochondria.
The Role of +4U as an Extended Translation Termination Signal in Bacteria
Wei, Yulong; Xia, Xuhua
2017-01-01
Termination efficiency of stop codons depends on the first 3′ flanking (+4) base in bacteria and eukaryotes. In both Escherichia coli and Saccharomyces cerevisiae, termination read-through is reduced in the presence of +4U; however, the molecular mechanism underlying +4U function is poorly understood. Here, we perform comparative genomics analysis on 25 bacterial species (covering Actinobacteria, Bacteriodetes, Cyanobacteria, Deinococcus-Thermus, Firmicutes, Proteobacteria, and Spirochaetae) with bioinformatics approaches to examine the influence of +4U in bacterial translation termination by contrasting highly- and lowly-expressed genes (HEGs and LEGs, respectively). We estimated gene expression using the recently formulated Index of Translation Elongation, ITE, and identified stop codon near-cognate transfer RNAs (tRNAs) from well-annotated genomes. We show that +4U was consistently overrepresented in UAA-ending HEGs relative to LEGs. The result is consistent with the interpretation that +4U enhances termination mainly for UAA. Usage of +4U decreases in GC-rich species where most stop codons are UGA and UAG, with few UAA-ending genes, which is expected if UAA usage in HEGs drives up +4U usage. In HEGs, +4U usage increases significantly with abundance of UAA nc_tRNAs (near-cognate tRNAs that decode codons differing from UAA by a single nucleotide), particularly those with a mismatch at the first stop codon site. UAA is always the preferred stop codon in HEGs, and our results suggest that UAAU is the most efficient translation termination signal in bacteria. PMID:27903612
Properties and determinants of codon decoding time distributions
2014-01-01
Background Codon decoding time is a fundamental property of mRNA translation believed to affect the abundance, function, and properties of proteins. Recently, a novel experimental technology--ribosome profiling--was developed to measure the density, and thus the speed, of ribosomes at codon resolution. Specifically, this method is based on next-generation sequencing, which theoretically can provide footprint counts that correspond to the probability of observing a ribosome in this position for each nucleotide in each transcript. Results In this study, we report for the first time various novel properties of the distribution of codon footprint counts in five organisms, based on large-scale analysis of ribosomal profiling data. We show that codons have distinctive footprint count distributions. These tend to be preserved along the inner part of the ORF, but differ at the 5' and 3' ends of the ORF, suggesting that the translation-elongation stage actually includes three biophysical sub-steps. In addition, we study various basic properties of the codon footprint count distributions and show that some of them correlate with the abundance of the tRNA molecule types recognizing them. Conclusions Our approach emphasizes the advantages of analyzing ribosome profiling and similar types of data via a comparative genomic codon-distribution-centric view. Thus, our methods can be used in future studies related to translation and even transcription elongation. PMID:25572668
Evolution of Nucleotide Punctuation Marks: From Structural to Linear Signals.
El Houmami, Nawal; Seligmann, Hervé
2017-01-01
We present an evolutionary hypothesis assuming that signals marking nucleotide synthesis (DNA replication and RNA transcription) evolved from multi- to unidimensional structures, and were carried over from transcription to translation. This evolutionary scenario presumes that signals combining secondary and primary nucleotide structures are evolutionary transitions. Mitochondrial replication initiation fits this scenario. Some observations reported in the literature corroborate that several signals for nucleotide synthesis function in translation, and vice versa. (a) Polymerase-induced frameshift mutations occur preferentially at translational termination signals (nucleotide deletion is interpreted as termination of nucleotide polymerization, paralleling the role of stop codons in translation). (b) Stem-loop hairpin presence/absence modulates codon-amino acid assignments, showing that translational signals sometimes combine primary and secondary nucleotide structures (here codon and stem-loop). (c) Homopolymer nucleotide triplets (AAA, CCC, GGG, TTT) cause transcriptional and ribosomal frameshifts. Here we find in recently described human mitochondrial RNAs that systematically lack mono-, dinucleotides after each trinucleotide (delRNAs) that delRNA triplets include 2x more homopolymers than mitogenome regions not covered by delRNA. Further analyses of delRNAs show that the natural circular code X (a little-known group of 20 translational signals enabling ribosomal frame retrieval consisting of 20 codons {AAC, AAT, ACC, ATC, ATT, CAG, CTC, CTG, GAA, GAC, GAG, GAT, GCC, GGC, GGT, GTA, GTC, GTT, TAC, TTC} universally overrepresented in coding versus other frames of gene sequences), regulates frameshift in transcription and translation. This dual transcription and translation role confirms for X the hypothesis that translational signals were carried over from transcriptional signals.
Beck, Heather J.; Fleming, Ian M. C.
2016-01-01
Analysis of the Escherichia coli transcriptome identified a unique subset of messenger RNAs (mRNAs) that contain a conventional untranslated leader and Shine-Dalgarno (SD) sequence upstream of the gene’s start codon while also containing an AUG triplet at the mRNA’s 5’- terminus (5’-uAUG). Fusion of the coding sequence specified by the 5’-terminal putative AUG start codon to a lacZ reporter gene, as well as primer extension inhibition assays, reveal that the majority of the 5’-terminal upstream open reading frames (5’-uORFs) tested support some level of lacZ translation, indicating that these mRNAs can function both as leaderless and canonical SD-leadered mRNAs. Although some of the uORFs were expressed at low levels, others were expressed at levels close to that of the respective downstream genes and as high as the naturally leaderless cI mRNA of bacteriophage λ. These 5’-terminal uORFs potentially encode peptides of varying lengths, but their functions, if any, are unknown. In an effort to determine whether expression from the 5’-terminal uORFs impact expression of the immediately downstream cistron, we examined expression from the downstream coding sequence after mutations were introduced that inhibit efficient 5’-uORF translation. These mutations were found to affect expression from the downstream cistrons to varying degrees, suggesting that some 5’-uORFs may play roles in downstream regulation. Since the 5’-uAUGs found on these conventionally leadered mRNAs can function to bind ribosomes and initiate translation, this indicates that canonical mRNAs containing 5’-uAUGs should be examined for their potential to function also as leaderless mRNAs. PMID:27467758
Origin, antigenicity, and function of a secreted form of ORF2 in hepatitis E virus infection.
Yin, Xin; Ying, Dong; Lhomme, Sébastien; Tang, Zimin; Walker, Christopher M; Xia, Ningshao; Zheng, Zizheng; Feng, Zongdi
2018-05-01
The enterically transmitted hepatitis E virus (HEV) adopts a unique strategy to exit cells by cloaking its capsid (encoded by the viral ORF2 gene) and circulating in the blood as "quasi-enveloped" particles. However, recent evidence suggests that the majority of the ORF2 protein present in the patient serum and supernatants of HEV-infected cell culture exists in a free form and is not associated with virus particles. The origin and biological functions of this secreted form of ORF2 (ORF2 S ) are unknown. Here we show that production of ORF2 S results from translation initiated at the previously presumed AUG start codon for the capsid protein, whereas translation of the actual capsid protein (ORF2 C ) is initiated at a previously unrecognized internal AUG codon (15 codons downstream of the first AUG). The addition of 15 amino acids to the N terminus of the capsid protein creates a signal sequence that drives ORF2 S secretion via the secretory pathway. Unlike ORF2 C , ORF2 S is glycosylated and exists as a dimer. Nonetheless, ORF2 S exhibits substantial antigenic overlap with the capsid, but the epitopes predicted to bind the putative cell receptor are lost. Consistent with this, ORF2 S does not block HEV cell entry but inhibits antibody-mediated neutralization. These results reveal a previously unrecognized aspect in HEV biology and shed new light on the immune evasion mechanisms and pathogenesis of this virus.
Selenocysteine incorporation: A trump card in the game of mRNA decay
Shetty, Sumangala P.; Copeland, Paul R.
2015-01-01
The incorporation of the 21st amino acid, selenocysteine (Sec), occurs on mRNAs that harbor in-frame stop codons because the Sec-tRNASec recognizes a UGA codon. This sets up an intriguing interplay between translation elongation, translation termination and the complex machinery that marks mRNAs that contain premature termination codons for degradation, leading to nonsense mediated mRNA decay (NMD). In this review we discuss the intricate and complex relationship between this key quality control mechanism and the process of Sec incorporation in mammals. PMID:25622574
Pi, J; Wookey, P J; Pittard, A J
1991-01-01
The phenylalanine-specific permease gene (pheP) of Escherichia coli has been cloned and sequenced. The gene was isolated on a 6-kb Sau3AI fragment from a chromosomal library, and its presence was verified by complementation of a mutant lacking the functional phenylalanine-specific permease. Subcloning from this fragment localized the pheP gene on a 2.7-kb HindIII-HindII fragment. The nucleotide sequence of this 2.7-kb region was determined. An open reading frame was identified which extends from a putative start point of translation (GTG at position 636) to a termination signal (TAA at position 2010). The assignment of the GTG as the initiation codon was verified by site-directed mutagenesis of the initiation codon and by introducing a chain termination mutation into the pheP-lacZ fusion construct. A single initiation site of transcription 30 bp upstream of the start point of translation was identified by the primer extension analysis. The pheP structural gene consists of 1,374 nucleotides specifying a protein of 458 amino acid residues. The PheP protein is very hydrophobic (71% nonpolar residues). A topological model predicted from the sequence analysis defines 12 transmembrane segments. This protein is highly homologous with the AroP (general aromatic transport) system of E. coli (59.6% identity) and to a lesser extent with the yeast permeases CAN1 (arginine), PUT4 (proline), and HIP1 (histidine) of Saccharomyces cerevisiae. Images PMID:1711024
Renovell, Agueda; Gago, Selma; Ruiz-Ruiz, Susana; Velázquez, Karelia; Navarro, Luis; Moreno, Pedro; Vives, Mari Carmen; Guerri, José
2010-10-25
Citrus leaf blotch virus has a single-stranded positive-sense genomic RNA (gRNA) of 8747 nt organized in three open reading frames (ORFs). The ORF1, encoding a polyprotein involved in replication, is translated directly from the gRNA, whereas ORFs encoding the movement (MP) and coat (CP) proteins are expressed via 3' coterminal subgenomic RNAs (sgRNAs). We characterized the minimal promoter region critical for the CP-sgRNA expression in infected cells by deletion analyses using Agrobacterium-mediated infection of Nicotiana benthamiana plants. The minimal CP-sgRNA promoter was mapped between nucleotides -67 and +50 nt around the transcription start site. Surprisingly, larger deletions in the region between the CP-sgRNA transcription start site and the CP translation initiation codon resulted in increased CP-sgRNA accumulation, suggesting that this sequence could modulate the CP-sgRNA transcription. Site-specific mutational analysis of the transcription start site revealed that the +1 guanylate and the +2 adenylate are important for CP-sgRNA synthesis. Copyright © 2010 Elsevier Inc. All rights reserved.
Takai, Kazuyuki
2017-01-21
Codon adaptation index (CAI) has been widely used for prediction of expression of recombinant genes in Escherichia coli and other organisms. However, CAI has no mechanistic basis that rationalizes its application to estimation of translational efficiency. Here, I propose a model based on which we could consider how codon usage is related to the level of expression during exponential growth of bacteria. In this model, translation of a gene is considered as an analog of electric current, and an analog of electric resistance corresponding to each gene is considered. "Translational resistance" is dependent on the steady-state concentration and the sequence of the mRNA species, and "translational resistivity" is dependent only on the mRNA sequence. The latter is the sum of two parts: one is the resistivity for the elongation reaction (coding sequence resistivity), and the other comes from all of the other steps of the decoding reaction. This electric circuit model clearly shows that some conditions should be met for codon composition of a coding sequence to correlate well with its expression level. On the other hand, I calculated relative frequency of each of the 61 sense codon triplets translated during exponential growth of E. coli from a proteomic dataset covering over 2600 proteins. A tentative method for estimating relative coding sequence resistivity based on the data is presented. Copyright © 2016. Published by Elsevier Ltd.
Meinicke, Peter; Tech, Maike; Morgenstern, Burkhard; Merkl, Rainer
2004-01-01
Background Kernel-based learning algorithms are among the most advanced machine learning methods and have been successfully applied to a variety of sequence classification tasks within the field of bioinformatics. Conventional kernels utilized so far do not provide an easy interpretation of the learnt representations in terms of positional and compositional variability of the underlying biological signals. Results We propose a kernel-based approach to datamining on biological sequences. With our method it is possible to model and analyze positional variability of oligomers of any length in a natural way. On one hand this is achieved by mapping the sequences to an intuitive but high-dimensional feature space, well-suited for interpretation of the learnt models. On the other hand, by means of the kernel trick we can provide a general learning algorithm for that high-dimensional representation because all required statistics can be computed without performing an explicit feature space mapping of the sequences. By introducing a kernel parameter that controls the degree of position-dependency, our feature space representation can be tailored to the characteristics of the biological problem at hand. A regularized learning scheme enables application even to biological problems for which only small sets of example sequences are available. Our approach includes a visualization method for transparent representation of characteristic sequence features. Thereby importance of features can be measured in terms of discriminative strength with respect to classification of the underlying sequences. To demonstrate and validate our concept on a biochemically well-defined case, we analyze E. coli translation initiation sites in order to show that we can find biologically relevant signals. For that case, our results clearly show that the Shine-Dalgarno sequence is the most important signal upstream a start codon. The variability in position and composition we found for that signal is in accordance with previous biological knowledge. We also find evidence for signals downstream of the start codon, previously introduced as transcriptional enhancers. These signals are mainly characterized by occurrences of adenine in a region of about 4 nucleotides next to the start codon. Conclusions We showed that the oligo kernel can provide a valuable tool for the analysis of relevant signals in biological sequences. In the case of translation initiation sites we could clearly deduce the most discriminative motifs and their positional variation from example sequences. Attractive features of our approach are its flexibility with respect to oligomer length and position conservation. By means of these two parameters oligo kernels can easily be adapted to different biological problems. PMID:15511290
Johansson, Magnus; Zhang, Jingji; Ehrenberg, Måns
2012-01-03
Rapid and accurate translation of the genetic code into protein is fundamental to life. Yet due to lack of a suitable assay, little is known about the accuracy-determining parameters and their correlation with translational speed. Here, we develop such an assay, based on Mg(2+) concentration changes, to determine maximal accuracy limits for a complete set of single-mismatch codon-anticodon interactions. We found a simple, linear trade-off between efficiency of cognate codon reading and accuracy of tRNA selection. The maximal accuracy was highest for the second codon position and lowest for the third. The results rationalize the existence of proofreading in code reading and have implications for the understanding of tRNA modifications, as well as of translation error-modulating ribosomal mutations and antibiotics. Finally, the results bridge the gap between in vivo and in vitro translation and allow us to calibrate our test tube conditions to represent the environment inside the living cell.
Kim, Julie J; Yu, Jaeju; Bag, Jnanankur; Bakovic, Marica; Cant, John P
2015-01-01
The rate of secretion of αs2-casein into bovine milk is approximately 25% of that of β-casein, yet mammary expression of their respective mRNA transcripts (csn1s2 and csn2) is not different. Our objective was to identify molecular mechanisms that explain the difference in translation efficiency between csn1s2 and csn2. Cell-free translational efficiency of csn2 was 5 times that of csn1s2. Transcripts of csn1s2 distributed into heavier polysomes than csn2 transcripts, indicating an attenuation of elongation and/or termination. Stimulatory and inhibitory effects of the 5′ and 3′ UTRs on translational efficiency were different with luciferase and casein sequences in the coding regions. Substituting the 5′ and 3′ UTRs from csn2 into csn1s2 did not improve csn1s2 translation, implicating the coding region itself in the translation difference. Deletion of a 28-codon fragment from the 3′ terminus of the csn1s2 coding region, which displays codons with low correlations to cell fitness, increased translation to a par with csn2. We conclude that the usage of the last 28 codons of csn1s2 is the main regulatory element that attenuates its expression and is responsible for the differential translational expression of csn1s2 and csn2. PMID:25826667
Pek, Han Bin; Klement, Maximilian; Ang, Kok Siong; Chung, Bevan Kai-Sheng; Ow, Dave Siak-Wei; Lee, Dong-Yup
2015-01-01
Various isoforms of invertases from prokaryotes, fungi, and higher plants has been expressed in Escherichia coli, and codon optimisation is a widely-adopted strategy for improvement of heterologous enzyme expression. Successful synthetic gene design for recombinant protein expression can be done by matching its translational elongation rate against heterologous host organisms via codon optimization. Amongst the various design parameters considered for the gene synthesis, codon context bias has been relatively overlooked compared to individual codon usage which is commonly adopted in most of codon optimization tools. In addition, matching the rates of transcription and translation based on secondary structure may lead to enhanced protein folding. In this study, we evaluated codon context fitness as design criterion for improving the expression of thermostable invertase from Thermotoga maritima in Escherichia coli and explored the relevance of secondary structure regions for folding and expression. We designed three coding sequences by using (1) a commercial vendor optimized gene algorithm, (2) codon context for the whole gene, and (3) codon context based on the secondary structure regions. Then, the codon optimized sequences were transformed and expressed in E. coli. From the resultant enzyme activities and protein yield data, codon context fitness proved to have the highest activity as compared to the wild-type control and other criteria while secondary structure-based strategy is comparable to the control. Codon context bias was shown to be a relevant parameter for enhancing enzyme production in Escherichia coli by codon optimization. Thus, we can effectively design synthetic genes within heterologous host organisms using this criterion. Copyright © 2015 Elsevier Inc. All rights reserved.
Reassigning stop codons via translation termination: How a few eukaryotes broke the dogma.
Alkalaeva, Elena; Mikhailova, Tatiana
2017-03-01
The genetic code determines how amino acids are encoded within mRNA. It is universal among the vast majority of organisms, although several exceptions are known. Variant genetic codes are found in ciliates, mitochondria, and numerous other organisms. All revealed genetic codes (standard and variant) have at least one codon encoding a translation stop signal. However, recently two new genetic codes with a reassignment of all three stop codons were revealed in studies examining the protozoa transcriptomes. Here, we discuss this finding and the recent studies of variant genetic codes in eukaryotes. We consider the possible molecular mechanisms allowing the use of certain codons as sense and stop signals simultaneously. The results obtained by studying these amazing organisms represent a new and exciting insight into the mechanism of stop codon decoding in eukaryotes. Also see the video abstract here. © 2017 WILEY Periodicals, Inc.
Rezgui, Vanessa Anissa Nathalie; Tyagi, Kshitiz; Ranjan, Namit; Konevega, Andrey L; Mittelstaet, Joerg; Rodnina, Marina V; Peter, Matthias; Pedrioli, Patrick G A
2013-07-23
tRNA modifications are crucial to ensure translation efficiency and fidelity. In eukaryotes, the URM1 and ELP pathways increase cellular resistance to various stress conditions, such as nutrient starvation and oxidative agents, by promoting thiolation and methoxycarbonylmethylation, respectively, of the wobble uridine of cytoplasmic (tK(UUU)), (tQ(UUG)), and (tE(UUC)). Although in vitro experiments have implicated these tRNA modifications in modulating wobbling capacity and translation efficiency, their exact in vivo biological roles remain largely unexplored. Using a combination of quantitative proteomics and codon-specific translation reporters, we find that translation of a specific gene subset enriched for AAA, CAA, and GAA codons is impaired in the absence of URM1- and ELP-dependent tRNA modifications. Moreover, in vitro experiments using native tRNAs demonstrate that both modifications enhance binding of tK(UUU) to the ribosomal A-site. Taken together, our data suggest that tRNA thiolation and methoxycarbonylmethylation regulate translation of genes with specific codon content.
Analysis of Synonymous Codon Usage Bias of Zika Virus and Its Adaption to the Hosts
Wang, Hongju; Liu, Siqing; Zhang, Bo
2016-01-01
Zika virus (ZIKV) is a mosquito-borne virus (arbovirus) in the family Flaviviridae, and the symptoms caused by ZIKV infection in humans include rash, fever, arthralgia, myalgia, asthenia and conjunctivitis. Codon usage bias analysis can reveal much about the molecular evolution and host adaption of ZIKV. To gain insight into the evolutionary characteristics of ZIKV, we performed a comprehensive analysis on the codon usage pattern in 46 ZIKV strains by calculating the effective number of codons (ENc), codon adaptation index (CAI), relative synonymous codon usage (RSCU), and other indicators. The results indicate that the codon usage bias of ZIKV is relatively low. Several lines of evidence support the hypothesis that translational selection plays a role in shaping the codon usage pattern of ZIKV. The results from a correspondence analysis (CA) indicate that other factors, such as base composition, aromaticity, and hydrophobicity may also be involved in shaping the codon usage pattern of ZIKV. Additionally, the results from a comparative analysis of RSCU between ZIKV and its hosts suggest that ZIKV tends to evolve codon usage patterns that are comparable to those of its hosts. Moreover, selection pressure from Homo sapiens on the ZIKV RSCU patterns was found to be dominant compared with that from Aedes aegypti and Aedes albopictus. Taken together, both natural translational selection and mutation pressure are important for shaping the codon usage pattern of ZIKV. Our findings contribute to understanding the evolution of ZIKV and its adaption to its hosts. PMID:27893824
Stringent Nucleotide Recognition by the Ribosome at the Middle Codon Position.
Liu, Wei; Shin, Dongwon; Ng, Martin; Sanbonmatsu, Karissa Y; Tor, Yitzhak; Cooperman, Barry S
2017-08-29
Accurate translation of the genetic code depends on mRNA:tRNA codon:anticodon base pairing. Here we exploit an emissive, isosteric adenosine surrogate that allows direct measurement of the kinetics of codon:anticodon University of California base formation during protein synthesis. Our results suggest that codon:anticodon base pairing is subject to tighter constraints at the middle position than at the 5'- and 3'-positions, and further suggest a sequential mechanism of formation of the three base pairs in the codon:anticodon helix.
Circuitry linking the global Csr and σE-dependent cell envelope stress response systems.
Yakhnin, Helen; Aichele, Robert; Ades, Sarah E; Romeo, Tony; Babitzke, Paul
2017-09-18
CsrA of Escherichia coli is an RNA-binding protein that globally regulates a wide variety of cellular processes and behaviors including carbon metabolism, motility, biofilm formation, and the stringent response. CsrB and CsrC are sRNAs that sequester CsrA, thereby preventing CsrA-mRNA interaction. RpoE (σ E ) is the extracytoplasmic stress response sigma factor of E. coli Previous RNA-seq studies identified rpoE mRNA as a CsrA target. Here we explored the regulation of rpoE by CsrA and found that CsrA represses rpoE translation. Gel mobility shift, footprint and toeprint studies identified three CsrA binding sites in the rpoE leader transcript, one of which overlaps the rpoE Shine-Dalgarno (SD) sequence, while another overlaps the rpoE translation initiation codon. Coupled in vitro transcription-translation experiments showed that CsrA represses rpoE translation by binding to these sites. We further demonstrate that σ E indirectly activates transcription of csrB and csrC , leading to increased sequestration of CsrA such that repression of rpoE by CsrA is reduced. We propose that the Csr system fine-tunes the σ E -dependent cell envelope stress response. We also identified a 51 amino acid coding sequence whose stop codon overlaps the rpoE start codon, and demonstrate that rpoE is translationally coupled with this upstream open reading frame (ORF51). Loss of coupling reduces rpoE translation by more than 50%. Identification of a translationally coupled ORF upstream of rpoE suggests that this previously unannotated protein may participate in the cell envelope stress response. In keeping with existing nomenclature, we name ORF51 as rseD , resulting in an operon arrangement of rseD-rpoE-rseA-rseB-rseC IMPORTANCE CsrA posttranscriptionally represses genes required for bacterial stress responses, including the stringent response, catabolite repression, and the RpoS (σ S )-mediated general stress response. We show that CsrA represses translation of rpoE , encoding the extracytoplasmic stress response sigma factor and that σ E indirectly activates transcription of csrB and csrC , resulting in reciprocal regulation of these two global regulatory systems. These findings suggest that extracytoplasmic stress leads to derepression of rpoE translation by CsrA, and CsrA-mediated repression helps to reset RpoE abundance to pre-stress levels once envelope damage is repaired. The discovery of an ORF, RseD, translationally coupled with rpoE adds further complexity to translational control of rpoE . Copyright © 2017 American Society for Microbiology.
2013-01-01
Background Segment 6 of the ISA virus codes for hemoagglutinin-esterase (HE). This segment is highly variable, with more than 26 variants identified. The major variation is observed in what is called the high polymorphism region (HPR). The role of the different HPR zones in the viral cycle or evolution remains unknown. However viruses that present the HPR0 are avirulent, while viruses with important deletions in this region have been responsible for outbreaks with high mortality rates. In this work, using bioinformatic tools, we examined the influence of different HPRs on the adaptation of HE genes to the host translational machinery and the relationship to observed virulence. Methods Translational efficiency of HE genes and their HPR were estimated analyzing codon-pair bias (CPB), adaptation to host codon use (codon adaptation index - CAI) and the adaptation to available tRNAs (tAI). These values were correlated with reported mortality for the respective ISA virus and the ΔG of RNA folding. tRNA abundance was inferred from tRNA gene numbers identified in the Salmo salar genome using tRNAScan-SE. Statistical correlation between data was performed using a non-parametric test. Results We found that HPR0 contains zones with codon pairs of low frequency and low availability of tRNA with respect to salmon codon-pair usage, suggesting that HPR modifies HE translational efficiency. Although calculating tAI was impossible because one third of tRNAs (~60.000) were tRNA-ala, translational efficiency measured by CPB shows that as HPR size increases, the CPB value of the HE gene decreases (P = 2x10-7, ρ = −0.675, n = 63) and that these values correlate positively with the mortality rates caused by the virus (ρ = 0.829, P = 2x10-7, n = 11). The mortality associated with different virus isolates or their corresponding HPR sizes were not related with the ΔG of HPR RNA folding, suggesting that the secondary structure of HPR RNA does not modify virulence. Conclusions Our results suggest that HPR size affects the efficiency of gene translation, which modulates the virulence of the virus by a mechanism similar to that observed in production of live attenuated vaccines through deoptimization of codon-pair usage. PMID:23742749
Quach, Tommy; Brooks, Daniel M; Miranda, Hector C
2016-01-01
The complete mitochondrial genome of the Palawan peacock-pheasant Polyplectron napoleonis is 16,710 bp and contains 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes and a control-region. All protein-coding genes use the standard ATG start codon, except for cox1 which has GTG start codon. Seven out of 13 PCGs have TAA stop codons, two have AGG (cox1 and nd6), and three PCGs (nd2, cox2 and nd4) have incomplete stop codon of just T- - nucleotide.
Three stages during the evolution of the genetic code. [Abstract only
NASA Technical Reports Server (NTRS)
Baumann, U.; Oro, J.
1994-01-01
A diversification of the genetic code based on the number of codons available for the proteinous amino acids is established. Three groups of amino acids during evolution of the code are distinguished. On the basis of their chemical complexity and a small codon number those amino acids emerging later in a translation process are derived. Both criteria indicate that His, Phe, Tyr, Cys and either Lys or Asn were introduced in the second stage, whereas the number of codons alone gives evidence that Trp and Met were introduced in the third stage. The amino acids of stage one use purines rich codons, thus purines have been retained in their third codon position. All the amino acids introduced in the second stage, in contrast, use pyrimidines in this codon position. A low abundance of pyrimidines during early translation is derived. This assumption is supported by experiments on non enzymatic replication and interactions of DNA hairpin loops with a complementary strand. A back extrapolation concludes a high purine content of the first nucleic acids which gradually decreased during their evolution. Amino acids independently available form prebiotic synthesis were thus correlated to purine rich codons. Conclusions on prebiotic replication are discussed also in the light of recent codon usage data.
Three stages in the evolution of the genetic code
NASA Technical Reports Server (NTRS)
Baumann, U.; Oro, J.
1993-01-01
A diversification of the genetic code based on the number of codons available for the proteinous amino acids is established. Three groups of amino acids during evolution of the code are distinguished. On the basis of their chemical complexity those amino acids emerging later in a translation process are derived. Codon number and chemical complexity indicate that His, Phe, Tyr, Cys and either Lys or Asn were introduced in the second stage, whereas the number of codons alone gives evidence that Trp and Met were introduced in the third stage. The amino acids of stage 1 use purine-rich codons, while all the amino acids introduced in the second stage, in contrast, use pyrimidines in the third position of their codons. A low abundance of pyrimidines during early translation is derived. This assumption is supported by experiments on non-enzymatic replication and interactions of hairpin loops with a complementary strand. A back extrapolation concludes a high purine content of the first nucleic acids, which gradually decreased during their evolution. Amino acids independently available from prebiotic synthesis were thus correlated to purine-rich codons. Implications on the prebiotic replication are discussed also in the light of recent codon usage data.
Dunne, Matthew; Leicht, Stefan; Krichel, Boris; Thompson, Andrew; Gómez-Torres, Natalia; Garde, Sonia; Narbad, Arjan; Mayer, Melinda J.
2016-01-01
Bacteriophages produce endolysins, which lyse the bacterial host cell to release newly produced virions. The timing of lysis is regulated and is thought to involve the activation of a molecular switch. We present a crystal structure of the activated endolysin CTP1L that targets Clostridium tyrobutyricum, consisting of a complex between the full-length protein and an N-terminally truncated C-terminal cell wall binding domain (CBD). The truncated CBD is produced through an internal translation start site within the endolysin gene. Mutants affecting the internal translation site change the oligomeric state of the endolysin and reduce lytic activity. The activity can be modulated by reconstitution of the full-length endolysin-CBD complex with free CBD. The same oligomerization mechanism applies to the CD27L endolysin that targets Clostridium difficile and the CS74L endolysin that targets Clostridium sporogenes. When the CTP1L endolysin gene is introduced into the commensal bacterium Lactococcus lactis, the truncated CBD is also produced, showing that the alternative start codon can be used in other bacterial species. The identification of a translational switch affecting oligomerization presented here has implications for the design of effective endolysins for the treatment of bacterial infections. PMID:26683375
Tidwell, Timothy; Wechsler, Jeremy; Nayak, Ramesh C.; Trump, Lisa; Salipante, Stephen J.; Cheng, Jerry C.; Donadieu, Jean; Glaubach, Taly; Corey, Seth J.; Grimes, H. Leighton; Lutzko, Carolyn; Cancelas, Jose A.
2014-01-01
Hereditary neutropenia is usually caused by heterozygous germline mutations in the ELANE gene encoding neutrophil elastase (NE). How mutations cause disease remains uncertain, but two hypotheses have been proposed. In one, ELANE mutations lead to mislocalization of NE. In the other, ELANE mutations disturb protein folding, inducing an unfolded protein response in the endoplasmic reticulum (ER). In this study, we describe new types of mutations that disrupt the translational start site. At first glance, they should block translation and are incompatible with either the mislocalization or misfolding hypotheses, which require mutant protein for pathogenicity. We find that start-site mutations, instead, force translation from downstream in-frame initiation codons, yielding amino-terminally truncated isoforms lacking ER-localizing (pre) and zymogen-maintaining (pro) sequences, yet retain essential catalytic residues. Patient-derived induced pluripotent stem cells recapitulate hematopoietic and molecular phenotypes. Expression of the amino-terminally deleted isoforms in vitro reduces myeloid cell clonogenic capacity. We define an internal ribosome entry site (IRES) within ELANE and demonstrate that adjacent mutations modulate IRES activity, independently of protein-coding sequence alterations. Some ELANE mutations, therefore, appear to cause neutropenia via the production of amino-terminally deleted NE isoforms rather than by altering the coding sequence of the full-length protein. PMID:24184683
José, Marco V; Morgado, Eberto R; Govezensky, Tzipe
2011-07-01
Herein, we rigorously develop novel 3-dimensional algebraic models called Genetic Hotels of the Standard Genetic Code (SGC). We start by considering the primeval RNA genetic code which consists of the 16 codons of type RNY (purine-any base-pyrimidine). Using simple algebraic operations, we show how the RNA code could have evolved toward the current SGC via two different intermediate evolutionary stages called Extended RNA code type I and II. By rotations or translations of the subset RNY, we arrive at the SGC via the former (type I) or via the latter (type II), respectively. Biologically, the Extended RNA code type I, consists of all codons of the type RNY plus codons obtained by considering the RNA code but in the second (NYR type) and third (YRN type) reading frames. The Extended RNA code type II, comprises all codons of the type RNY plus codons that arise from transversions of the RNA code in the first (YNY type) and third (RNR) nucleotide bases. Since the dimensions of remarkable subsets of the Genetic Hotels are not necessarily integer numbers, we also introduce the concept of algebraic fractal dimension. A general decoding function which maps each codon to its corresponding amino acid or the stop signals is also derived. The Phenotypic Hotel of amino acids is also illustrated. The proposed evolutionary paths are discussed in terms of the existing theories of the evolution of the SGC. The adoption of 3-dimensional models of the Genetic and Phenotypic Hotels will facilitate the understanding of the biological properties of the SGC.
Cenik, Can; Cenik, Elif Sarinay; Byeon, Gun W; Grubert, Fabian; Candille, Sophie I; Spacek, Damek; Alsallakh, Bilal; Tilgner, Hagen; Araya, Carlos L; Tang, Hua; Ricci, Emiliano; Snyder, Michael P
2015-11-01
Elucidating the consequences of genetic differences between humans is essential for understanding phenotypic diversity and personalized medicine. Although variation in RNA levels, transcription factor binding, and chromatin have been explored, little is known about global variation in translation and its genetic determinants. We used ribosome profiling, RNA sequencing, and mass spectrometry to perform an integrated analysis in lymphoblastoid cell lines from a diverse group of individuals. We find significant differences in RNA, translation, and protein levels suggesting diverse mechanisms of personalized gene expression control. Combined analysis of RNA expression and ribosome occupancy improves the identification of individual protein level differences. Finally, we identify genetic differences that specifically modulate ribosome occupancy--many of these differences lie close to start codons and upstream ORFs. Our results reveal a new level of gene expression variation among humans and indicate that genetic variants can cause changes in protein levels through effects on translation. © 2015 Cenik et al.; Published by Cold Spring Harbor Laboratory Press.
A universal strategy for regulating mRNA translation in prokaryotic and eukaryotic cells
Cao, Jicong; Arha, Manish; Sudrik, Chaitanya; Mukherjee, Abhirup; Wu, Xia; Kane, Ravi S.
2015-01-01
We describe a simple strategy to control mRNA translation in both prokaryotic and eukaryotic cells which relies on a unique protein–RNA interaction. Specifically, we used the Pumilio/FBF (PUF) protein to repress translation by binding in between the ribosome binding site (RBS) and the start codon (in Escherichia coli), or by binding to the 5′ untranslated region of target mRNAs (in mammalian cells). The design principle is straightforward, the extent of translational repression can be tuned and the regulator is genetically encoded, enabling the construction of artificial signal cascades. We demonstrate that this approach can also be used to regulate polycistronic mRNAs; such regulation has rarely been achieved in previous reports. Since the regulator used in this study is a modular RNA-binding protein, which can be engineered to target different 8-nucleotide RNA sequences, our strategy could be used in the future to target endogenous mRNAs for regulating metabolic flows and signaling pathways in both prokaryotic and eukaryotic cells. PMID:25845589
Herrera, Victoria L M; Steffen, Martin; Moran, Ann Marie; Tan, Glaiza A; Pasion, Khristine A; Rivera, Keith; Pappin, Darryl J; Ruiz-Opazo, Nelson
2016-06-14
In contrast to rat and mouse databases, the NCBI gene database lists the human dual-endothelin1/VEGFsp receptor (DEspR, formerly Dear) as a unitary transcribed pseudogene due to a stop [TGA]-codon at codon#14 in automated DNA and RNA sequences. However, re-analysis is needed given prior single gene studies detected a tryptophan [TGG]-codon#14 by manual Sanger sequencing, demonstrated DEspR translatability and functionality, and since the demonstration of actual non-translatability through expression studies, the standard-of-excellence for pseudogene designation, has not been performed. Re-analysis must meet UNIPROT criteria for demonstration of a protein's existence at the highest (protein) level, which a priori, would override DNA- or RNA-based deductions. To dissect the nucleotide sequence discrepancy, we performed Maxam-Gilbert sequencing and reviewed 727 RNA-seq entries. To comply with the highest level multiple UNIPROT criteria for determining DEspR's existence, we performed various experiments using multiple anti-DEspR monoclonal antibodies (mAbs) targeting distinct DEspR epitopes with one spanning the contested tryptophan [TGG]-codon#14, assessing: (a) DEspR protein expression, (b) predicted full-length protein size, (c) sequence-predicted protein-specific properties beyond codon#14: receptor glycosylation and internalization, (d) protein-partner interactions, and (e) DEspR functionality via DEspR-inhibition effects. Maxam-Gilbert sequencing and some RNA-seq entries demonstrate two guanines, hence a tryptophan [TGG]-codon#14 within a compression site spanning an error-prone compression sequence motif. Western blot analysis using anti-DEspR mAbs targeting distinct DEspR epitopes detect the identical glycosylated 17.5 kDa pull-down protein. Decrease in DEspR-protein size after PNGase-F digest demonstrates post-translational glycosylation, concordant with the consensus-glycosylation site beyond codon#14. Like other small single-transmembrane proteins, mass spectrometry analysis of anti-DEspR mAb pull-down proteins do not detect DEspR, but detect DEspR-protein interactions with proteins implicated in intracellular trafficking and cancer. FACS analyses also detect DEspR-protein in different human cancer stem-like cells (CSCs). DEspR-inhibition studies identify DEspR-roles in CSC survival and growth. Live cell imaging detects fluorescently-labeled anti-DEspR mAb targeted-receptor internalization, concordant with the single internalization-recognition sequence also located beyond codon#14. Data confirm translatability of DEspR, the full-length DEspR protein beyond codon#14, and elucidate DEspR-specific functionality. Along with detection of the tryptophan [TGG]-codon#14 within an error-prone compression site, cumulative data demonstrating DEspR protein existence fulfill multiple UNIPROT criteria, thus refuting its pseudogene designation.
Schuster, W; Brennicke, A
1991-01-01
An intact gene for the ribosomal protein S19 (rps19) is absent from Oenothera mitochondria. The conserved rps19 reading frame found in the mitochondrial genome is interrupted by a termination codon. This rps19 pseudogene is cotranscribed with the downstream rps3 gene and is edited on both sides of the translational stop. Editing, however, changes the amino acid sequence at positions that were well conserved before editing. Other strange editings create translational stops in open reading frames coding for functional proteins. In coxI and rps3 mRNAs CGA codons are edited to UGA stop codons only five and three codons, respectively, downstream to the initiation codon. These aberrant editings in essential open reading frames and in the rps19 pseudogene appear to have been shifted to these positions from other editing sites. These observations suggest a requirement for a continuous evolutionary constraint on the editing specificities in plant mitochondria. Images PMID:1762921
Ederveen, Thomas H. A.; Overmars, Lex; van Hijum, Sacha A. F. T.
2013-01-01
Nowadays, prokaryotic genomes are sequenced faster than the capacity to manually curate gene annotations. Automated genome annotation engines provide users a straight-forward and complete solution for predicting ORF coordinates and function. For many labs, the use of AGEs is therefore essential to decrease the time necessary for annotating a given prokaryotic genome. However, it is not uncommon for AGEs to provide different and sometimes conflicting predictions. Combining multiple AGEs might allow for more accurate predictions. Here we analyzed the ab initio open reading frame (ORF) calling performance of different AGEs based on curated genome annotations of eight strains from different bacterial species with GC% ranging from 35–52%. We present a case study which demonstrates a novel way of comparative genome annotation, using combinations of AGEs in a pre-defined order (or path) to predict ORF start codons. The order of AGE combinations is from high to low specificity, where the specificity is based on the eight genome annotations. For each AGE combination we are able to derive a so-called projected confidence value, which is the average specificity of ORF start codon prediction based on the eight genomes. The projected confidence enables estimating likeliness of a correct prediction for a particular ORF start codon by a particular AGE combination, pinpointing ORFs notoriously difficult to predict start codons. We correctly predict start codons for 90.5±4.8% of the genes in a genome (based on the eight genomes) with an accuracy of 81.1±7.6%. Our consensus-path methodology allows a marked improvement over majority voting (9.7±4.4%) and with an optimal path ORF start prediction sensitivity is gained while maintaining a high specificity. PMID:23675487
Hashem, Yaser; Frank, Joachim
2018-03-01
Translation initiation in eukaryotes is a highly regulated and rate-limiting process. It results in the assembly and disassembly of numerous transient and intermediate complexes involving over a dozen eukaryotic initiation factors (eIFs). This process culminates in the accommodation of a start codon marking the beginning of an open reading frame at the appropriate ribosomal site. Although this process has been extensively studied by hundreds of groups for nearly half a century, it has been only recently, especially during the last decade, that we have gained deeper insight into the mechanics of the eukaryotic translation initiation process. This advance in knowledge is due in part to the contributions of structural biology, which have shed light on the molecular mechanics underlying the different functions of various eukaryotic initiation factors. In this review, we focus exclusively on the contribution of structural biology to the understanding of the eukaryotic initiation process, a long-standing jigsaw puzzle that is just starting to yield the bigger picture. Expected final online publication date for the Annual Review of Biophysics Volume 47 is May 20, 2018. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
2016-01-01
Cells respond to stress by controlling gene expression at several levels, with little known about the role of translation. Here, we demonstrate a coordinated translational stress response system involving stress-specific reprogramming of tRNA wobble modifications that leads to selective translation of codon-biased mRNAs representing different classes of critical response proteins. In budding yeast exposed to four oxidants and five alkylating agents, tRNA modification patterns accurately distinguished among chemically similar stressors, with 14 modified ribonucleosides forming the basis for a data-driven model that predicts toxicant chemistry with >80% sensitivity and specificity. tRNA modification subpatterns also distinguish SN1 from SN2 alkylating agents, with SN2-induced increases in m3C in tRNA mechanistically linked to selective translation of threonine-rich membrane proteins from genes enriched with ACC and ACT degenerate codons for threonine. These results establish tRNA modifications as predictive biomarkers of exposure and illustrate a novel regulatory mechanism for translational control of cell stress response. PMID:25772370
2014-01-01
Background mRNA translation involves simultaneous movement of multiple ribosomes on the mRNA and is also subject to regulatory mechanisms at different stages. Translation can be described by various codon-based models, including ODE, TASEP, and Petri net models. Although such models have been extensively used, the overlap and differences between these models and the implications of the assumptions of each model has not been systematically elucidated. The selection of the most appropriate modelling framework, and the most appropriate way to develop coarse-grained/fine-grained models in different contexts is not clear. Results We systematically analyze and compare how different modelling methodologies can be used to describe translation. We define various statistically equivalent codon-based simulation algorithms and analyze the importance of the update rule in determining the steady state, an aspect often neglected. Then a novel probabilistic Boolean network (PBN) model is proposed for modelling translation, which enjoys an exact numerical solution. This solution matches those of numerical simulation from other methods and acts as a complementary tool to analytical approximations and simulations. The advantages and limitations of various codon-based models are compared, and illustrated by examples with real biological complexities such as slow codons, premature termination and feedback regulation. Our studies reveal that while different models gives broadly similiar trends in many cases, important differences also arise and can be clearly seen, in the dependence of the translation rate on different parameters. Furthermore, the update rule affects the steady state solution. Conclusions The codon-based models are based on different levels of abstraction. Our analysis suggests that a multiple model approach to understanding translation allows one to ascertain which aspects of the conclusions are robust with respect to the choice of modelling methodology, and when (and why) important differences may arise. This approach also allows for an optimal use of analysis tools, which is especially important when additional complexities or regulatory mechanisms are included. This approach can provide a robust platform for dissecting translation, and results in an improved predictive framework for applications in systems and synthetic biology. PMID:24576337
Heinemann, Ilka U.; Rovner, Alexis J.; Aerni, Hans R.; Rogulina, Svetlana; Cheng, Laura; Olds, William; Fischer, Jonathan T.; Söll, Dieter; Isaacs, Farren J.; Rinehart, Jesse
2012-01-01
Genetically encoded phosphoserine incorporation programmed by the UAG codon was achieved by addition of engineered elongation factor and an archaeal aminoacyl-tRNA synthetase to the normal Escherichia coli translation machinery (Park (2011) Science 333, 1151). However, protein yield suffers from expression of the orthogonal phosphoserine translation system and competition with release factor 1 (RF-1). In a strain lacking RF-1, phosphoserine phosphatase, and where 7 UAG codons residing in essential genes were converted to UAA, phosphoserine incorporation into GFP and WNK4 was significantly elevated, but with an accompanying loss in cellular fitness and viability. PMID:22982858
AbouHaidar, Mounir Georges; Venkataraman, Srividhya; Golshani, Ashkan; Liu, Bolin; Ahmad, Tauqeer
2014-10-07
The highly structured (64% GC) covalently closed circular (CCC) RNA (220 nt) of the virusoid associated with rice yellow mottle virus codes for a 16-kDa highly basic protein using novel modalities for coding, translation, and gene expression. This CCC RNA is the smallest among all known viroids and virusoids and the only one that codes proteins. Its sequence possesses an internal ribosome entry site and is directly translated through two (or three) completely overlapping ORFs (shifting to a new reading frame at the end of each round). The initiation and termination codons overlap UGAUGA (underline highlights the initiation codon AUG within the combined initiation-termination sequence). Termination codons can be ignored to obtain larger read-through proteins. This circular RNA with no noncoding sequences is a unique natural supercompact "nanogenome."
Stringent Nucleotide Recognition by the Ribosome at the Middle Codon Position
Liu, Wei; Shin, Dongwon; Ng, Martin; Sanbonmatsu, Karissa Y.; Tor, Yitzhak; Cooperman, Barry S.
2017-01-01
Accurate translation of the genetic code depends on mRNA:tRNA codon:anticodon base pairing. Here we exploit an emissive, isosteric adenosine surrogate that allows direct measurement of the kinetics of codon:anticodon base formation during protein synthesis. Our results suggest that codon:anticodon base pairing is subject to tighter constraints at the middle position than at the 5′- and 3′-positions, and further suggest a sequential mechanism of formation of the three base pairs in the codon:anticodon helix. PMID:28850078
Behura, Susanta K; Severson, David W
2013-02-01
Codon usage bias refers to the phenomenon where specific codons are used more often than other synonymous codons during translation of genes, the extent of which varies within and among species. Molecular evolutionary investigations suggest that codon bias is manifested as a result of balance between mutational and translational selection of such genes and that this phenomenon is widespread across species and may contribute to genome evolution in a significant manner. With the advent of whole-genome sequencing of numerous species, both prokaryotes and eukaryotes, genome-wide patterns of codon bias are emerging in different organisms. Various factors such as expression level, GC content, recombination rates, RNA stability, codon position, gene length and others (including environmental stress and population size) can influence codon usage bias within and among species. Moreover, there has been a continuous quest towards developing new concepts and tools to measure the extent of codon usage bias of genes. In this review, we outline the fundamental concepts of evolution of the genetic code, discuss various factors that may influence biased usage of synonymous codons and then outline different principles and methods of measurement of codon usage bias. Finally, we discuss selected studies performed using whole-genome sequences of different insect species to show how codon bias patterns vary within and among genomes. We conclude with generalized remarks on specific emerging aspects of codon bias studies and highlight the recent explosion of genome-sequencing efforts on arthropods (such as twelve Drosophila species, species of ants, honeybee, Nasonia and Anopheles mosquitoes as well as the recent launch of a genome-sequencing project involving 5000 insects and other arthropods) that may help us to understand better the evolution of codon bias and its biological significance. © 2012 The Authors. Biological Reviews © 2012 Cambridge Philosophical Society.
AbouHaidar, Mounir Georges; Venkataraman, Srividhya; Golshani, Ashkan; Liu, Bolin; Ahmad, Tauqeer
2014-01-01
The highly structured (64% GC) covalently closed circular (CCC) RNA (220 nt) of the virusoid associated with rice yellow mottle virus codes for a 16-kDa highly basic protein using novel modalities for coding, translation, and gene expression. This CCC RNA is the smallest among all known viroids and virusoids and the only one that codes proteins. Its sequence possesses an internal ribosome entry site and is directly translated through two (or three) completely overlapping ORFs (shifting to a new reading frame at the end of each round). The initiation and termination codons overlap UGAUGA (underline highlights the initiation codon AUG within the combined initiation-termination sequence). Termination codons can be ignored to obtain larger read-through proteins. This circular RNA with no noncoding sequences is a unique natural supercompact “nanogenome.” PMID:25253891
Hockenberry, Adam J; Pah, Adam R; Jewett, Michael C; Amaral, Luís A N
2017-01-01
Studies dating back to the 1970s established that sequence complementarity between the anti-Shine-Dalgarno (aSD) sequence on prokaryotic ribosomes and the 5' untranslated region of mRNAs helps to facilitate translation initiation. The optimal location of aSD sequence binding relative to the start codon, the full extents of the aSD sequence and the functional form of the relationship between aSD sequence complementarity and translation efficiency have not been fully resolved. Here, we investigate these relationships by leveraging the sequence diversity of endogenous genes and recently available genome-wide estimates of translation efficiency. We show that-after accounting for predicted mRNA structure-aSD sequence complementarity increases the translation of endogenous mRNAs by roughly 50%. Further, we observe that this relationship is nonlinear, with translation efficiency maximized for mRNAs with intermediate levels of aSD sequence complementarity. The mechanistic insights that we observe are highly robust: we find nearly identical results in multiple datasets spanning three distantly related bacteria. Further, we verify our main conclusions by re-analysing a controlled experimental dataset. © 2017 The Authors.
Takaoka, N; Fukuzawa, M; Saito, T; Sakaitani, T; Ochiai, H
1999-10-28
We cloned a genomic fragment of the membrane protein gp64 gene of the cellular slime mold Polysphondylium pallidum by inverse PCR. Primer extension analysis identified a major transcription start site 65 bp upstream of the translation start codon. The promoter region of the gp64 gene contains sequences homologous to a TATA box at position -47 to -37 and to an initiator (Inr, PyPyCAPyPyPyPy) at position -3 to +5 from the transcription start site. Successively truncated segments of the promoter were tested for their ability to drive expression of the beta-galactosidase reporter gene in transformed cells; also the difference in activity between growth conditions was compared. The results indicated that there are two positive vegetative regulatory elements extending between -187 and -62 bp from the transcription start site of the gp64 promoter; also their activity was two to three times higher in the cells grown with bacteria in shaken suspension than in the cells grown in an axenic medium.
Wu, Yi-Hsuan; Taggart, Janet; Song, Pamela Xiyao; MacDiarmid, Colin; Eide, David J.
2016-01-01
The Msc2 and Zrg17 proteins of Saccharomyces cerevisiae form a complex to transport zinc into the endoplasmic reticulum. ZRG17 is transcriptionally induced in zinc-limited cells by the Zap1 transcription factor. In this report, we show that MSC2 mRNA also increases (~1.5 fold) in zinc-limited cells. The MSC2 gene has two in-frame ATG codons at its 5’ end, ATG1 and ATG2; ATG2 is the predicted initiation codon. When the MSC2 promoter was fused at ATG2 to the lacZ gene, we found that unlike the chromosomal gene this reporter showed a 4-fold decrease in lacZ mRNA in zinc-limited cells. Surprisingly, β-galactosidase activity generated by this fusion gene increased ~7 fold during zinc deficiency suggesting the influence of post-transcriptional factors. Transcription of MSC2ATG2-lacZ was found to start upstream of ATG1 in zinc-replete cells. In zinc-limited cells, transcription initiation shifted to sites just upstream of ATG2. From the results of mutational and polysome profile analyses, we propose the following explanation for these effects. In zinc-replete cells, MSC2ATG2-lacZ mRNA with long 5’ UTRs fold into secondary structures that inhibit translation. In zinc-limited cells, transcripts with shorter unstructured 5’ UTRs are generated that are more efficiently translated. Surprisingly, chromosomal MSC2 did not show start site shifts in response to zinc status and only shorter 5’ UTRs were observed. However, the shifts that occur in the MSC2ATG2-lacZ construct led us to identify significant transcription start site changes affecting the expression of ~3% of all genes. Therefore, zinc status can profoundly alter transcription initiation across the yeast genome. PMID:27657924
Chloroplast DNA codon use: evidence for selection at the psb A locus based on tRNA availability.
Morton, B R
1993-09-01
Codon use in the three sequenced chloroplast genomes (Marchantia, Oryza, and Nicotiana) is examined. The chloroplast has a bias in that codons NNA and NNT are favored over synonymous NNC and NNG codons. This appears to be a consequence of an overall high A + T content of the genome. This pattern of codon use is not followed by the psb A gene of all three genomes and other psb A sequences examined. In this gene, the codon use favors NNC over NNT for twofold degenerate amino acids. In each case the only tRNA coded by the genome is complementary to the NNC codon. This codon use is similar to the codon use by chloroplast genes examined from Chlamydomonas reinhardtii. Since psb A is the major translation product of the chloroplast, this suggests that selection is acting on the codon use of this gene to adapt codons to tRNA availability, as previously suggested for unicellular organisms.
Romero, H; Zavala, A; Musto, H
2000-01-25
It is widely accepted that the compositional pressure is the only factor shaping codon usage in unicellular species displaying extremely biased genomic compositions. This seems to be the case in the prokaryotes Mycoplasma capricolum, Rickettsia prowasekii and Borrelia burgdorferi (GC-poor), and in Micrococcus luteus (GC-rich). However, in the GC-poor unicellular eukaryotes Dictyostelium discoideum and Plasmodium falciparum, there is evidence that selection, acting at the level of translation, influences codon choices. This is a twofold intriguing finding, since (1) the genomic GC levels of the above mentioned eukaryotes are lower than the GC% of any studied bacteria, and (2) bacteria usually have larger effective population sizes than eukaryotes, and hence natural selection is expected to overcome more efficiently the randomizing effects of genetic drift among prokaryotes than among eukaryotes. In order to gain a new insight about this problem, we analysed the patterns of codon preferences of the nuclear genes of Entamoeba histolytica, a unicellular eukaryote characterised by an extremely AT-rich genome (GC = 25%). The overall codon usage is strongly biased towards A and T in the third codon positions, and among the presumed highly expressed sequences, there is an increased relative usage of a subset of codons, many of which are C-ending. Since an increase in C in third codon positions is 'against' the compositional bias, we conclude that codon usage in E. histolytica, as happens in D. discoideum and P. falciparum, is the result of an equilibrium between compositional pressure and selection. These findings raise the question of why strongly compositionally biased eukaryotic cells may be more sensitive to the (presumed) slight differences among synonymous codons than compositionally biased bacteria.
On origin of genetic code and tRNA before translation
2011-01-01
Background Synthesis of proteins is based on the genetic code - a nearly universal assignment of codons to amino acids (aas). A major challenge to the understanding of the origins of this assignment is the archetypal "key-lock vs. frozen accident" dilemma. Here we re-examine this dilemma in light of 1) the fundamental veto on "foresight evolution", 2) modular structures of tRNAs and aminoacyl-tRNA synthetases, and 3) the updated library of aa-binding sites in RNA aptamers successfully selected in vitro for eight amino acids. Results The aa-binding sites of arginine, isoleucine and tyrosine contain both their cognate triplets, anticodons and codons. We have noticed that these cases might be associated with palindrome-dinucleotides. For example, one-base shift to the left brings arginine codons CGN, with CG at 1-2 positions, to the respective anticodons NCG, with CG at 2-3 positions. Formally, the concomitant presence of codons and anticodons is also expected in the reverse situation, with codons containing palindrome-dinucleotides at their 2-3 positions, and anticodons exhibiting them at 1-2 positions. A closer analysis reveals that, surprisingly, RNA binding sites for Arg, Ile and Tyr "prefer" (exactly as in the actual genetic code) the anticodon(2-3)/codon(1-2) tetramers to their anticodon(1-2)/codon(2-3) counterparts, despite the seemingly perfect symmetry of the latter. However, since in vitro selection of aa-specific RNA aptamers apparently had nothing to do with translation, this striking preference provides a new strong support to the notion of the genetic code emerging before translation, in response to catalytic (and possibly other) needs of ancient RNA life. Consistently with the pre-translation origin of the code, we propose here a new model of tRNA origin by the gradual, Fibonacci process-like, elongation of a tRNA molecule from a primordial coding triplet and 5'DCCA3' quadruplet (D is a base-determinator) to the eventual 76 base-long cloverleaf-shaped molecule. Conclusion Taken together, our findings necessarily imply that primordial tRNAs, tRNA aminoacylating ribozymes, and (later) the translation machinery in general have been co-evolving to ''fit'' the (likely already defined) genetic code, rather than the opposite way around. Coding triplets in this primal pre-translational code were likely similar to the anticodons, with second and third nucleotides being more important than the less specific first one. Later, when the code was expanding in co-evolution with the translation apparatus, the importance of 2-3 nucleotides of coding triplets "transferred" to the 1-2 nucleotides of their complements, thus distinguishing anticodons from codons. This evolutionary primacy of anticodons in genetic coding makes the hypothesis of primal stereo-chemical affinity between amino acids and cognate triplets, the hypothesis of coding coenzyme handles for amino acids, the hypothesis of tRNA-like genomic 3' tags suggesting that tRNAs originated in replication, and the hypothesis of ancient ribozymes-mediated operational code of tRNA aminoacylation not mutually contradicting but rather co-existing in harmony. Reviewers This article was reviewed by Eugene V. Koonin, Wentao Ma (nominated by Juergen Brosius) and Anthony Poole. PMID:21342520
Cenik, Can; Chua, Hon Nian; Singh, Guramrit; Akef, Abdalla; Snyder, Michael P; Palazzo, Alexander F; Moore, Melissa J; Roth, Frederick P
2017-03-01
Introns are found in 5' untranslated regions (5'UTRs) for 35% of all human transcripts. These 5'UTR introns are not randomly distributed: Genes that encode secreted, membrane-bound and mitochondrial proteins are less likely to have them. Curiously, transcripts lacking 5'UTR introns tend to harbor specific RNA sequence elements in their early coding regions. To model and understand the connection between coding-region sequence and 5'UTR intron status, we developed a classifier that can predict 5'UTR intron status with >80% accuracy using only sequence features in the early coding region. Thus, the classifier identifies transcripts with 5 ' proximal- i ntron- m inus-like-coding regions ("5IM" transcripts). Unexpectedly, we found that the early coding sequence features defining 5IM transcripts are widespread, appearing in 21% of all human RefSeq transcripts. The 5IM class of transcripts is enriched for non-AUG start codons, more extensive secondary structure both preceding the start codon and near the 5' cap, greater dependence on eIF4E for translation, and association with ER-proximal ribosomes. 5IM transcripts are bound by the exon junction complex (EJC) at noncanonical 5' proximal positions. Finally, N 1 -methyladenosines are specifically enriched in the early coding regions of 5IM transcripts. Taken together, our analyses point to the existence of a distinct 5IM class comprising ∼20% of human transcripts. This class is defined by depletion of 5' proximal introns, presence of specific RNA sequence features associated with low translation efficiency, N 1 -methyladenosines in the early coding region, and enrichment for noncanonical binding by the EJC. © 2017 Cenik et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
A universal strategy for regulating mRNA translation in prokaryotic and eukaryotic cells.
Cao, Jicong; Arha, Manish; Sudrik, Chaitanya; Mukherjee, Abhirup; Wu, Xia; Kane, Ravi S
2015-04-30
We describe a simple strategy to control mRNA translation in both prokaryotic and eukaryotic cells which relies on a unique protein-RNA interaction. Specifically, we used the Pumilio/FBF (PUF) protein to repress translation by binding in between the ribosome binding site (RBS) and the start codon (in Escherichia coli), or by binding to the 5' untranslated region of target mRNAs (in mammalian cells). The design principle is straightforward, the extent of translational repression can be tuned and the regulator is genetically encoded, enabling the construction of artificial signal cascades. We demonstrate that this approach can also be used to regulate polycistronic mRNAs; such regulation has rarely been achieved in previous reports. Since the regulator used in this study is a modular RNA-binding protein, which can be engineered to target different 8-nucleotide RNA sequences, our strategy could be used in the future to target endogenous mRNAs for regulating metabolic flows and signaling pathways in both prokaryotic and eukaryotic cells. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Architecture of human translation initiation factor 3
Querol-Audi, Jordi; Sun, Chaomin; Vogan, Jacob M.; Smith, Duane; Gu, Yu; Cate, Jamie; Nogales, Eva
2013-01-01
SUMMARY Eukaryotic translation initiation factor 3 (eIF3) plays a central role in protein synthesis by organizing the formation of the 43S preinitiation complex. Using genetic tag visualization by electron microscopy, we reveal the molecular organization of ten human eIF3 subunits, including an octameric core. The structure of eIF3 bears a close resemblance to that of the proteasome lid, with a conserved spatial organization of eight core subunits containing PCI and MPN domains that coordinate functional interactions in both complexes. We further show that eIF3 subunits a and c interact with initiation factors eIF1 and eIF1A, which control the stringency of start codon selection. Finally, we find that subunit j, which modulates messenger RNA interactions with the small ribosomal subunit, makes multiple independent interactions with the eIF3 octameric core. These results highlight the conserved architecture of eIF3 and how it scaffolds key factors that control translation initiation in higher eukaryotes, including humans. PMID:23623729
Characterization of a novel RNA-binding region of eIF4GI critical for ribosomal scanning
Prévôt, Déborah; Décimo, Didier; Herbreteau, Cécile H.; Roux, Florence; Garin, Jérôme; Darlix, Jean-Luc; Ohlmann, Théophile
2003-01-01
The eukaryotic translation initiation factor eIF4GI binds several proteins and acts as a scaffold to promote preinitiation complex formation on the mRNA molecule (48S). Following mRNA attachment this complex scans along the messenger in a 5′ to 3′ direction until it locates and recognizes the initiation start codon. By using a combination of retroviral and picornaviral proteases (HIV-2 and L respectively) in the reticulocyte lysate system, we have characterized a 40 amino acid (aa) region of eIF4GI (aa 642–681) that exhibits general RNA-binding properties. Removal of this domain by proteolytic processing followed by translational assays showed virtually no inhibition of internal ribosome entry on the encephalomyocarditis virus, but resulted in drastic impairment of ribosome scanning as demonstrated by studying poliovirus and foot-and-mouth disease virus translation. Based on these findings, we propose that this 40 aa motif of eIF4GI is critical for ribosome scanning. PMID:12682023
Révay, T; Villagómez, D A F; Brewer, D; Chenier, T; King, W A
2012-01-01
Genetic sex in mammals is determined by the sex chromosomal composition of the zygote. The X and Y chromosomes are responsible for numerous factors that must work in close concert for the proper development of a healthy sexual phenotype. The role of androgens in case of XY chromosomal constitution is crucial for normal male sex differentiation. The intracellular androgenic action is mediated by the androgen receptor (AR), and its impaired function leads to a myriad of syndromes with severe clinical consequences, most notably androgen insensitivity syndrome and prostate cancer. In this paper, we investigated the possibility that an alteration of the equine AR gene explains a recently described familial XY, SRY + disorder of sex development. We uncovered a transition in the first nucleotide of the AR start codon (c.1A>G). To our knowledge, this represents the first causative AR mutation described in domestic animals. It is also a rarely observed mutation in eukaryotes and is unique among the >750 entries of the human androgen receptor mutation database. In addition, we found another quiet missense mutation in exon 1 (c.322C>T). Transcription of AR was confirmed by RT-PCR amplification of several exons. Translation of the full-length AR protein from the initiating GTG start codon was confirmed by Western blot using N- and C-terminal-specific antibodies. Two smaller peptides (25 and 14 amino acids long) were identified from the middle of exon 1 and across exons 5 and 6 by mass spectrometry. Based upon our experimental data and the supporting literature, it appears that the AR is expressed as a full-length protein and in a functional form, and the observed phenotype is the result of reduced AR protein expression levels. Copyright © 2011 S. Karger AG, Basel.
Abolition of Peroxiredoxin-5 Mitochondrial Targeting during Canid Evolution
Van der Eecken, Valérie; Clippe, André; Dekoninck, Sophie; Goemaere, Julie; Walbrecq, Geoffroy; Van Veldhoven, Paul P.; Knoops, Bernard
2013-01-01
In human, the subcellular targeting of peroxiredoxin-5 (PRDX5), a thioredoxin peroxidase, is dependent on the use of multiple alternative transcription start sites and two alternative in-frame translation initiation sites, which determine whether or not the region encoding a mitochondrial targeting sequence (MTS) is translated. In the present study, the abolition of PRDX5 mitochondrial targeting in dog is highlighted and the molecular mechanism underlying the loss of mitochondrial PRDX5 during evolution is examined. Here, we show that the absence of mitochondrial PRDX5 is generalized among the extant canids and that the first events leading to PRDX5 MTS abolition in canids involve a mutation in the more 5′ translation initiation codon as well as the appearance of a STOP codon. Furthermore, we found that PRDX5 MTS functionality is maintained in giant panda and northern elephant seal, which are phylogenetically closely related to canids. Also, the functional consequences of the restoration of mitochondrial PRDX5 in dog Madin-Darby canine kidney (MDCK) cells were investigated. The restoration of PRDX5 mitochondrial targeting in MDCK cells, instead of protecting, provokes deleterious effects following peroxide exposure independently of its peroxidase activity, indicating that mitochondrial PRDX5 gains cytotoxic properties under acute oxidative stress in MDCK cells. Altogether our results show that, although mitochondrial PRDX5 cytoprotective function against oxidative stress has been clearly demonstrated in human and rodents, PRDX5 targeting to mitochondria has been evolutionary lost in canids. Moreover, restoration of mitochondrial PRDX5 in dog MDCK cells, instead of conferring protection against peroxide exposure, makes them more vulnerable. PMID:24023783
Shin, Young C.; Desrosiers, Ronald C.
2011-01-01
Open reading frame 57 (ORF57) of gamma-2 herpesviruses is a key regulator of viral gene expression. It has been reported to enhance the expression of viral genes by transcriptional, posttranscriptional, or translational activation mechanisms. Previously we have shown that the expression of gH and gL of rhesus monkey rhadinovirus (RRV), a close relative of the human Kaposi's sarcoma-associated herpesvirus (KSHV), could be dramatically rescued by codon optimization as well as by ORF57 coexpression (J. P. Bilello, J. S. Morgan, and R. C. Desrosiers, J. Virol. 82:7231–7237, 2008). We show here that ORF57 coexpression and codon optimization had similar effects, except that the rescue of expression by codon optimization was temporally delayed relative to that of ORF57 coexpression. The transfection of gL mRNA directly into cells with or without ORF57 coexpression and with or without codon optimization recapitulated the effects of these modes of induction on transfected DNA. These findings suggested an important role for the enhancement of mRNA stability and/or the translation of mRNA for these very different modes of induced expression. This conclusion was confirmed by several different measures of gH and gL mRNA stability and accumulation with or without ORF57 coexpression and with or without codon optimization. Our results indicate that RRV gH and gL expression is severely limited by the stability of the mRNA and that ORF57 coexpression and codon optimization independently induce gH and gL expression principally by allowing accumulation and translation of these mRNAs. PMID:21613403
2'-O-methylation in mRNA disrupts tRNA decoding during translation elongation.
Choi, Junhong; Indrisiunaite, Gabriele; DeMirci, Hasan; Ieong, Ka-Weng; Wang, Jinfan; Petrov, Alexey; Prabhakar, Arjun; Rechavi, Gideon; Dominissini, Dan; He, Chuan; Ehrenberg, Måns; Puglisi, Joseph D
2018-03-01
Chemical modifications of mRNA may regulate many aspects of mRNA processing and protein synthesis. Recently, 2'-O-methylation of nucleotides was identified as a frequent modification in translated regions of human mRNA, showing enrichment in codons for certain amino acids. Here, using single-molecule, bulk kinetics and structural methods, we show that 2'-O-methylation within coding regions of mRNA disrupts key steps in codon reading during cognate tRNA selection. Our results suggest that 2'-O-methylation sterically perturbs interactions of ribosomal-monitoring bases (G530, A1492 and A1493) with cognate codon-anticodon helices, thereby inhibiting downstream GTP hydrolysis by elongation factor Tu (EF-Tu) and A-site tRNA accommodation, leading to excessive rejection of cognate aminoacylated tRNAs in initial selection and proofreading. Our current and prior findings highlight how chemical modifications of mRNA tune the dynamics of protein synthesis at different steps of translation elongation.
Recent evidence for evolution of the genetic code
NASA Technical Reports Server (NTRS)
Osawa, S.; Jukes, T. H.; Watanabe, K.; Muto, A.
1992-01-01
The genetic code, formerly thought to be frozen, is now known to be in a state of evolution. This was first shown in 1979 by Barrell et al. (G. Barrell, A. T. Bankier, and J. Drouin, Nature [London] 282:189-194, 1979), who found that the universal codons AUA (isoleucine) and UGA (stop) coded for methionine and tryptophan, respectively, in human mitochondria. Subsequent studies have shown that UGA codes for tryptophan in Mycoplasma spp. and in all nonplant mitochondria that have been examined. Universal stop codons UAA and UAG code for glutamine in ciliated protozoa (except Euplotes octacarinatus) and in a green alga, Acetabularia. E. octacarinatus uses UAA for stop and UGA for cysteine. Candida species, which are yeasts, use CUG (leucine) for serine. Other departures from the universal code, all in nonplant mitochondria, are CUN (leucine) for threonine (in yeasts), AAA (lysine) for asparagine (in platyhelminths and echinoderms), UAA (stop) for tyrosine (in planaria), and AGR (arginine) for serine (in several animal orders) and for stop (in vertebrates). We propose that the changes are typically preceded by loss of a codon from all coding sequences in an organism or organelle, often as a result of directional mutation pressure, accompanied by loss of the tRNA that translates the codon. The codon reappears later by conversion of another codon and emergence of a tRNA that translates the reappeared codon with a different assignment. Changes in release factors also contribute to these revised assignments. We also discuss the use of UGA (stop) as a selenocysteine codon and the early history of the code.
2014-01-01
Background Heterologous gene expression is an important tool for synthetic biology that enables metabolic engineering and the production of non-natural biologics in a variety of host organisms. The translational efficiency of heterologous genes can often be improved by optimizing synonymous codon usage to better match the host organism. However, traditional approaches for optimization neglect to take into account many factors known to influence synonymous codon distributions. Results Here we define an alternative approach for codon optimization that utilizes systems level information and codon context for the condition under which heterologous genes are being expressed. Furthermore, we utilize a probabilistic algorithm to generate multiple variants of a given gene. We demonstrate improved translational efficiency using this condition-specific codon optimization approach with two heterologous genes, the fluorescent protein-encoding eGFP and the catechol 1,2-dioxygenase gene CatA, expressed in S. cerevisiae. For the latter case, optimization for stationary phase production resulted in nearly 2.9-fold improvements over commercial gene optimization algorithms. Conclusions Codon optimization is now often a standard tool for protein expression, and while a variety of tools and approaches have been developed, they do not guarantee improved performance for all hosts of applications. Here, we suggest an alternative method for condition-specific codon optimization and demonstrate its utility in Saccharomyces cerevisiae as a proof of concept. However, this technique should be applicable to any organism for which gene expression data can be generated and is thus of potential interest for a variety of applications in metabolic and cellular engineering. PMID:24636000
Methylation of class I translation termination factors: structural and functional aspects.
Graille, Marc; Figaro, Sabine; Kervestin, Stéphanie; Buckingham, Richard H; Liger, Dominique; Heurgué-Hamard, Valérie
2012-07-01
During protein synthesis, release of polypeptide from the ribosome occurs when an in frame termination codon is encountered. Contrary to sense codons, which are decoded by tRNAs, stop codons present in the A-site are recognized by proteins named class I release factors, leading to the release of newly synthesized proteins. Structures of these factors bound to termination ribosomal complexes have recently been obtained, and lead to a better understanding of stop codon recognition and its coordination with peptidyl-tRNA hydrolysis in bacteria. Release factors contain a universally conserved GGQ motif which interacts with the peptidyl-transferase centre to allow peptide release. The Gln side chain from this motif is methylated, a feature conserved from bacteria to man, suggesting an important biological role. However, methylation is catalysed by completely unrelated enzymes. The function of this motif and its post-translational modification will be discussed in the context of recent structural and functional studies. Copyright © 2012 Elsevier Masson SAS. All rights reserved.
Tetrahymena thermophila acidic ribosomal protein L37 contains an archaebacterial type of C-terminus.
Hansen, T S; Andreasen, P H; Dreisig, H; Højrup, P; Nielsen, H; Engberg, J; Kristiansen, K
1991-09-15
We have cloned and characterized a Tetrahymena thermophila macronuclear gene (L37) encoding the acidic ribosomal protein (A-protein) L37. The gene contains a single intron located in the 3'-part of the coding region. Two major and three minor transcription start points (tsp) were mapped 39 to 63 nucleotides upstream from the translational start codon. The uppermost tsp mapped to the first T in a putative T. thermophila RNA polymerase II initiator element, TATAA. The coding region of L37 predicts a protein of 109 amino acid (aa) residues. A substantial part of the deduced aa sequence was verified by protein sequencing. The T. thermophila L37 clearly belongs to the P1-type family of eukaryotic A-proteins, but the C-terminal region has the hallmarks of archaebacterial A-proteins.
Transcriptome Analysis of Core Dinoflagellates Reveals a Universal Bias towards "GC" Rich Codons.
Williams, Ernest; Place, Allen; Bachvaroff, Tsvetan
2017-04-27
Although dinoflagellates are a potential source of pharmaceuticals and natural products, the mechanisms for regulating and producing these compounds are largely unknown because of extensive post-transcriptional control of gene expression. One well-documented mechanism for controlling gene expression during translation is codon bias, whereby specific codons slow or even terminate protein synthesis. Approximately 10,000 annotatable genes from fifteen "core" dinoflagellate transcriptomes along a range of overall guanine and cytosine (GC) content were used for codonW analysis to determine the relative synonymous codon usage (RSCU) and the GC content at each codon position. GC bias in the analyzed dataset and at the third codon position varied from 51% and 54% to 66% and 88%, respectively. Codons poor in GC were observed to be universally absent, but bias was most pronounced for codons ending in uracil followed by adenine (UA). GC bias at the third codon position was able to explain low abundance codons as well as the low effective number of codons. Thus, we propose that a bias towards codons rich in GC bases is a universal feature of core dinoflagellates, possibly relating to their unique chromosome structure, and not likely a major mechanism for controlling gene expression.
Unbiased Quantitative Models of Protein Translation Derived from Ribosome Profiling Data
Gritsenko, Alexey A.; Hulsman, Marc; Reinders, Marcel J. T.; de Ridder, Dick
2015-01-01
Translation of RNA to protein is a core process for any living organism. While for some steps of this process the effect on protein production is understood, a holistic understanding of translation still remains elusive. In silico modelling is a promising approach for elucidating the process of protein synthesis. Although a number of computational models of the process have been proposed, their application is limited by the assumptions they make. Ribosome profiling (RP), a relatively new sequencing-based technique capable of recording snapshots of the locations of actively translating ribosomes, is a promising source of information for deriving unbiased data-driven translation models. However, quantitative analysis of RP data is challenging due to high measurement variance and the inability to discriminate between the number of ribosomes measured on a gene and their speed of translation. We propose a solution in the form of a novel multi-scale interpretation of RP data that allows for deriving models with translation dynamics extracted from the snapshots. We demonstrate the usefulness of this approach by simultaneously determining for the first time per-codon translation elongation and per-gene translation initiation rates of Saccharomyces cerevisiae from RP data for two versions of the Totally Asymmetric Exclusion Process (TASEP) model of translation. We do this in an unbiased fashion, by fitting the models using only RP data with a novel optimization scheme based on Monte Carlo simulation to keep the problem tractable. The fitted models match the data significantly better than existing models and their predictions show better agreement with several independent protein abundance datasets than existing models. Results additionally indicate that the tRNA pool adaptation hypothesis is incomplete, with evidence suggesting that tRNA post-transcriptional modifications and codon context may play a role in determining codon elongation rates. PMID:26275099
Unbiased Quantitative Models of Protein Translation Derived from Ribosome Profiling Data.
Gritsenko, Alexey A; Hulsman, Marc; Reinders, Marcel J T; de Ridder, Dick
2015-08-01
Translation of RNA to protein is a core process for any living organism. While for some steps of this process the effect on protein production is understood, a holistic understanding of translation still remains elusive. In silico modelling is a promising approach for elucidating the process of protein synthesis. Although a number of computational models of the process have been proposed, their application is limited by the assumptions they make. Ribosome profiling (RP), a relatively new sequencing-based technique capable of recording snapshots of the locations of actively translating ribosomes, is a promising source of information for deriving unbiased data-driven translation models. However, quantitative analysis of RP data is challenging due to high measurement variance and the inability to discriminate between the number of ribosomes measured on a gene and their speed of translation. We propose a solution in the form of a novel multi-scale interpretation of RP data that allows for deriving models with translation dynamics extracted from the snapshots. We demonstrate the usefulness of this approach by simultaneously determining for the first time per-codon translation elongation and per-gene translation initiation rates of Saccharomyces cerevisiae from RP data for two versions of the Totally Asymmetric Exclusion Process (TASEP) model of translation. We do this in an unbiased fashion, by fitting the models using only RP data with a novel optimization scheme based on Monte Carlo simulation to keep the problem tractable. The fitted models match the data significantly better than existing models and their predictions show better agreement with several independent protein abundance datasets than existing models. Results additionally indicate that the tRNA pool adaptation hypothesis is incomplete, with evidence suggesting that tRNA post-transcriptional modifications and codon context may play a role in determining codon elongation rates.
Termination and read-through proteins encoded by genome segment 9 of Colorado tick fever virus.
Mohd Jaafar, Fauziah; Attoui, Houssam; De Micco, Philippe; De Lamballerie, Xavier
2004-08-01
Genome segment 9 (Seg-9) of Colorado tick fever virus (CTFV) is 1884 bp long and contains a large open reading frame (ORF; 1845 nt in length overall), although a single in-frame stop codon (at nt 1052-1054) reduces the ORF coding capacity by approximately 40 %. However, analyses of highly conserved RNA sequences in the vicinity of the stop codon indicate that it belongs to a class of 'leaky terminators'. The third nucleotide positions in codons situated both before and after the stop codon, shows the highest variability, suggesting that both regions are translated during virus replication. This also suggests that the stop signal is functionally leaky, allowing read-through translation to occur. Indeed, both the truncated 'termination' protein and the full-length 'read-through' protein (VP9 and VP9', respectively) were detected in CTFV-infected cells, in cells transfected with a plasmid expressing only Seg-9 protein products, and in the in vitro translation products from undenatured Seg-9 ssRNA. The ratios of full-length and truncated proteins generated suggest that read-through may be down-regulated by other viral proteins. Western blot analysis of infected cells and purified CTFV showed that VP9 is a structural component of the virion, while VP9' is a non-structural protein.
Baca, A M; Hol, W G
2000-02-01
Parasite genes often use codons which are rarely used in the highly expressed genes of Escherichia coli, possibly resulting in translational stalling and lower yields of recombinant protein. We have constructed the "RIG" plasmid to overcome the potential codon-bias problem seen in Plasmodium genes. RIG contains the genes that encode three tRNAs (Arg, Ile, Gly), which recognise rare codons found in parasite genes. When co-transformed into E. coli along with expression plasmids containing parasite genes, RIG can greatly increase levels of overexpressed protein. Codon frequency analysis suggests that RIG may be applied to a variety of protozoan and helminth genes.
Site-specific incorporation of 4-iodo-L-phenylalanine through opal suppression.
Kodama, Koichiro; Nakayama, Hiroshi; Sakamoto, Kensaku; Fukuzawa, Seketsu; Kigawa, Takanori; Yabuki, Takashi; Kitabatake, Makoto; Takio, Koji; Yokoyama, Shigeyuki
2010-08-01
A variety of unique codons have been employed to expand the genetic code. The use of the opal (UGA) codon is promising, but insufficient information is available about the UGA suppression approach, which facilitates the incorporation of non-natural amino acids through suppression of the UGA codon. In this study, the UGA codon was used to incorporate 4-iodo-l-phenylalanine into position 32 of the Ras protein in an Escherichia coli cell-free translation system. The undesired incorporation of tryptophan in response to the UGA codon was completely repressed by the addition of indolmycin. The minor amount (3%) of contaminating 4-bromo-l-phenylalanine in the building block 4-iodo-l-phenylalanine led to the significant incorporation of 4-bromo-l-phenylalanine (21%), and this problem was solved by using a purified 4-iodo-l-phenylalanine sample. Optimization of the incubation time was also important, since the undesired incorporation of free phenylalanine increased during the cell-free translation reaction. The 4-iodo-l-phenylalanine residue can be used for the chemoselective modification of proteins. This method will contribute to advancements in protein engineering studies with non-natural amino acid substitutions.
Levitin, Anastasia; Yanofsky, Charles
2010-01-01
Tryptophan, phenylalanine, tyrosine, and several other metabolites are all synthesized from a common precursor, chorismic acid. Since tryptophan is a product of an energetically expensive biosynthetic pathway, bacteria have developed sensing mechanisms to downregulate synthesis of the enzymes of tryptophan formation when synthesis of the amino acid is not needed. In Bacillus subtilis and some other Gram-positive bacteria, trp operon expression is regulated by two proteins, TRAP (the tryptophan-activated RNA binding protein) and AT (the anti-TRAP protein). TRAP is activated by bound tryptophan, and AT synthesis is increased upon accumulation of uncharged tRNATrp. Tryptophan-activated TRAP binds to trp operon leader RNA, generating a terminator structure that promotes transcription termination. AT binds to tryptophan-activated TRAP, inhibiting its RNA binding ability. In B. subtilis, AT synthesis is upregulated both transcriptionally and translationally in response to the accumulation of uncharged tRNATrp. In this paper, we focus on explaining the differences in organization and regulatory functions of the at operon's leader peptide-coding region, rtpLP, of B. subtilis and Bacillus licheniformis. Our objective was to correlate the greater growth sensitivity of B. licheniformis to tryptophan starvation with the spacing of the three Trp codons in its at operon leader peptide-coding region. Our findings suggest that the Trp codon location in rtpLP of B. licheniformis is designed to allow a mild charged-tRNATrp deficiency to expose the Shine-Dalgarno sequence and start codon for the AT protein, leading to increased AT synthesis. PMID:20061467
The effect of tRNA levels on decoding times of mRNA codons.
Dana, Alexandra; Tuller, Tamir
2014-08-01
The possible effect of transfer ribonucleic acid (tRNA) concentrations on codons decoding time is a fundamental biomedical research question; however, due to a large number of variables affecting this process and the non-direct relation between them, a conclusive answer to this question has eluded so far researchers in the field. In this study, we perform a novel analysis of the ribosome profiling data of four organisms which enables ranking the decoding times of different codons while filtering translational phenomena such as experimental biases, extreme ribosomal pauses and ribosome traffic jams. Based on this filtering, we show for the first time that there is a significant correlation between tRNA concentrations and the codons estimated decoding time both in prokaryotes and in eukaryotes in natural conditions (-0.38 to -0.66, all P values <0.006); in addition, we show that when considering tRNA concentrations, codons decoding times are not correlated with aminoacyl-tRNA levels. The reported results support the conjecture that translation efficiency is directly influenced by the tRNA levels in the cell. Thus, they should help to understand the evolution of synonymous aspects of coding sequences via the adaptation of their codons to the tRNA pool. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Iben, James R.; Maraia, Richard J.
2012-01-01
tRNA genes are interspersed throughout eukaryotic DNA, contributing to genome architecture and evolution in addition to translation of the transcriptome. Codon use correlates with tRNA gene copy number in noncomplex organisms including yeasts. Synonymous codons impact translation with various outcomes, dependent on relative tRNA abundances. Availability of whole-genome sequences allowed us to examine tRNA gene copy number variation (tgCNV) and codon use in four Schizosaccharomyces species and Saccharomyces cerevisiae. tRNA gene numbers vary from 171 to 322 in the four Schizosaccharomyces despite very high similarity in other features of their genomes. In addition, we performed whole-genome sequencing of several related laboratory strains of Schizosaccharomyces pombe and found tgCNV at a cluster of tRNA genes. We examined for the first time effects of wobble rules on correlation of tRNA gene number and codon use and showed improvement for S. cerevisiae and three of the Schizosaccharomyces species. In contrast, correlation in Schizosaccharomyces japonicus is poor due to markedly divergent tRNA gene content, and much worsened by the wobble rules. In japonicus, some tRNA iso-acceptor genes are absent and others are greatly reduced relative to the other yeasts, while genes for synonymous wobble iso-acceptors are amplified, indicating wobble use not apparent in any other eukaryote. We identified a subset of japonicus-specific wobbles that improves correlation of codon use and tRNA gene content in japonicus. We conclude that tgCNV is high among Schizo species and occurs in related laboratory strains of S. pombe (and expectedly other species), and tRNAome-codon analyses can provide insight into species-specific wobble decoding. PMID:22586155
Transcriptome Analysis of Core Dinoflagellates Reveals a Universal Bias towards “GC” Rich Codons
Williams, Ernest; Place, Allen; Bachvaroff, Tsvetan
2017-01-01
Although dinoflagellates are a potential source of pharmaceuticals and natural products, the mechanisms for regulating and producing these compounds are largely unknown because of extensive post-transcriptional control of gene expression. One well-documented mechanism for controlling gene expression during translation is codon bias, whereby specific codons slow or even terminate protein synthesis. Approximately 10,000 annotatable genes from fifteen “core” dinoflagellate transcriptomes along a range of overall guanine and cytosine (GC) content were used for codonW analysis to determine the relative synonymous codon usage (RSCU) and the GC content at each codon position. GC bias in the analyzed dataset and at the third codon position varied from 51% and 54% to 66% and 88%, respectively. Codons poor in GC were observed to be universally absent, but bias was most pronounced for codons ending in uracil followed by adenine (UA). GC bias at the third codon position was able to explain low abundance codons as well as the low effective number of codons. Thus, we propose that a bias towards codons rich in GC bases is a universal feature of core dinoflagellates, possibly relating to their unique chromosome structure, and not likely a major mechanism for controlling gene expression. PMID:28448468
Deciphering mRNA Sequence Determinants of Protein Production Rate
NASA Astrophysics Data System (ADS)
Szavits-Nossan, Juraj; Ciandrini, Luca; Romano, M. Carmen
2018-03-01
One of the greatest challenges in biophysical models of translation is to identify coding sequence features that affect the rate of translation and therefore the overall protein production in the cell. We propose an analytic method to solve a translation model based on the inhomogeneous totally asymmetric simple exclusion process, which allows us to unveil simple design principles of nucleotide sequences determining protein production rates. Our solution shows an excellent agreement when compared to numerical genome-wide simulations of S. cerevisiae transcript sequences and predicts that the first 10 codons, which is the ribosome footprint length on the mRNA, together with the value of the initiation rate, are the main determinants of protein production rate under physiological conditions. Finally, we interpret the obtained analytic results based on the evolutionary role of the codons' choice for regulating translation rates and ribosome densities.
Vladimirov, N V; Likhoshvaĭ, V A; Matushkin, Iu G
2007-01-01
Gene expression is known to correlate with degree of codon bias in many unicellular organisms. However, such correlation is absent in some organisms. Recently we demonstrated that inverted complementary repeats within coding DNA sequence must be considered for proper estimation of translation efficiency, since they may form secondary structures that obstruct ribosome movement. We have developed a program for estimation of potential coding DNA sequence expression in defined unicellular organism using its genome sequence. The program computes elongation efficiency index. Computation is based on estimation of coding DNA sequence elongation efficiency, taking into account three key factors: codon bias, average number of inverted complementary repeats, and free energy of potential stem-loop structures formed by the repeats. The influence of these factors on translation is numerically estimated. An optimal proportion of these factors is computed for each organism individually. Quantitative translational characteristics of 384 unicellular organisms (351 bacteria, 28 archaea, 5 eukaryota) have been computed using their annotated genomes from NCBI GenBank. Five potential evolutionary strategies of translational optimization have been determined among studied organisms. A considerable difference of preferred translational strategies between Bacteria and Archaea has been revealed. Significant correlations between elongation efficiency index and gene expression levels have been shown for two organisms (S. cerevisiae and H. pylori) using available microarray data. The proposed method allows to estimate numerically the coding DNA sequence translation efficiency and to optimize nucleotide composition of heterologous genes in unicellular organisms. http://www.mgs.bionet.nsc.ru/mgs/programs/eei-calculator/.
DNA Translator and Aligner: HyperCard utilities to aid phylogenetic analysis of molecules.
Eernisse, D J
1992-04-01
DNA Translator and Aligner are molecular phylogenetics HyperCard stacks for Macintosh computers. They manipulate sequence data to provide graphical gene mapping, conversions, translations and manual multiple-sequence alignment editing. DNA Translator is able to convert documented GenBank or EMBL documented sequences into linearized, rescalable gene maps whose gene sequences are extractable by clicking on the corresponding map button or by selection from a scrolling list. Provided gene maps, complete with extractable sequences, consist of nine metazoan, one yeast, and one ciliate mitochondrial DNAs and three green plant chloroplast DNAs. Single or multiple sequences can be manipulated to aid in phylogenetic analysis. Sequences can be translated between nucleic acids and proteins in either direction with flexible support of alternate genetic codes and ambiguous nucleotide symbols. Multiple aligned sequence output from diverse sources can be converted to Nexus, Hennig86 or PHYLIP format for subsequent phylogenetic analysis. Input or output alignments can be examined with Aligner, a convenient accessory stack included in the DNA Translator package. Aligner is an editor for the manual alignment of up to 100 sequences that toggles between display of matched characters and normal unmatched sequences. DNA Translator also generates graphic displays of amino acid coding and codon usage frequency relative to all other, or only synonymous, codons for approximately 70 select organism-organelle combinations. Codon usage data is compatible with spreadsheet or UWGCG formats for incorporation of additional molecules of interest. The complete package is available via anonymous ftp and is free for non-commercial uses.
Thakur, Anil; Hinnebusch, Alan G
2018-05-01
The eukaryotic 43S preinitiation complex (PIC), bearing initiator methionyl transfer RNA (Met-tRNA i ) in a ternary complex (TC) with eukaryotic initiation factor 2 (eIF2)-GTP, scans the mRNA leader for an AUG codon in favorable context. AUG recognition evokes rearrangement from an open PIC conformation with TC in a "P OUT " state to a closed conformation with TC more tightly bound in a "P IN " state. eIF1 binds to the 40S subunit and exerts a dual role of enhancing TC binding to the open PIC conformation while antagonizing the P IN state, necessitating eIF1 dissociation for start codon selection. Structures of reconstituted PICs reveal juxtaposition of eIF1 Loop 2 with the Met-tRNA i D loop in the P IN state and predict a distortion of Loop 2 from its conformation in the open complex to avoid a clash with Met-tRNA i We show that Ala substitutions in Loop 2 increase initiation at both near-cognate UUG codons and AUG codons in poor context. Consistently, the D71A-M74A double substitution stabilizes TC binding to 48S PICs reconstituted with mRNA harboring a UUG start codon, without affecting eIF1 affinity for 40S subunits. Relatively stronger effects were conferred by arginine substitutions; and no Loop 2 substitutions perturbed the rate of TC loading on scanning 40S subunits in vivo. Thus, Loop 2-D loop interactions specifically impede Met-tRNA i accommodation in the P IN state without influencing the P OUT mode of TC binding; and Arg substitutions convert the Loop 2-tRNA i clash to an electrostatic attraction that stabilizes P IN and enhances selection of poor start codons in vivo.
Identification of Conflicting Selective Effects on Highly Expressed Genes
Higgs, Paul G.; Hao, Weilong; Golding, G. Brian
2007-01-01
Many different selective effects on DNA and proteins influence the frequency of codons and amino acids in coding sequences. Selection is often stronger on highly expressed genes. Hence, by comparing high- and low-expression genes it is possible to distinguish the factors that are selected by evolution. It has been proposed that highly expressed genes should (i) preferentially use codons matching abundant tRNAs (translational efficiency), (ii) preferentially use amino acids with low cost of synthesis, (iii) be under stronger selection to maintain the required amino acid content, and (iv) be selected for translational robustness. These effects act simultaneously and can be contradictory. We develop a model that combines these factors, and use Akaike’s Information Criterion for model selection. We consider pairs of paralogues that arose by whole-genome duplication in Saccharmyces cerevisiae. A codon-based model is used that includes asymmetric effects due to selection on highly expressed genes. The largest effect is translational efficiency, which is found to strongly influence synonymous, but not non-synonymous rates. Minimization of the cost of amino acid synthesis is implicated. However, when a more general measure of selection for amino acid usage is used, the cost minimization effect becomes redundant. Small effects that we attribute to selection for translational robustness can be identified as an improvement in the model fit on top of the effects of translational efficiency and amino acid usage. PMID:19430600
Complete mitochondrial genome of the Yellownose skate: Zearaja chilensis (Rajiformes, Rajidae).
Jeong, Dageum; Lee, Youn-Ho
2016-01-01
The complete sequence of mitochondrial DNA of a Yellownose skate, Zearaja chilensis was determined for the first time. It is 16,909 bp in length covering 2 rRNA, 22 tRNA and 13 protein coding genes with the identical gene order and structure as those of other Rajidae species. The nucleotide of L-strand is composed of low G (14.3%), and slightly high A + T (58.9%) nucleotides. The strong codon usage bias against the use of G (6.0%) is found at the third codon positions. Twelve of the 13 protein coding genes use ATG as the start codon while COX1 starts with GTG. As for the stop codon, only ND4 shows an incomplete stop codon TA. This is the first report of the mitogenome for a species in the genus Zearaja, providing a valuable source of genetic information on the evolution of the family Rajidae and the genus Zearaja as well as for establishment of a sustainble fishery management plan of the species.
Ribosome Profiling Reveals a Cell-Type-Specific Translational Landscape in Brain Tumors
Gonzalez, Christian; Sims, Jennifer S.; Hornstein, Nicholas; Mela, Angeliki; Garcia, Franklin; Lei, Liang; Gass, David A.; Amendolara, Benjamin; Bruce, Jeffrey N.
2014-01-01
Glioma growth is driven by signaling that ultimately regulates protein synthesis. Gliomas are also complex at the cellular level and involve multiple cell types, including transformed and reactive cells in the brain tumor microenvironment. The distinct functions of the various cell types likely lead to different requirements and regulatory paradigms for protein synthesis. Proneural gliomas can arise from transformation of glial progenitors that are driven to proliferate via mitogenic signaling that affects translation. To investigate translational regulation in this system, we developed a RiboTag glioma mouse model that enables cell-type-specific, genome-wide ribosome profiling of tumor tissue. Infecting glial progenitors with Cre-recombinant retrovirus simultaneously activates expression of tagged ribosomes and delivers a tumor-initiating mutation. Remarkably, we find that although genes specific to transformed cells are highly translated, their translation efficiencies are low compared with normal brain. Ribosome positioning reveals sequence-dependent regulation of ribosomal activity in 5′-leaders upstream of annotated start codons, leading to differential translation in glioma compared with normal brain. Additionally, although transformed cells express a proneural signature, untransformed tumor-associated cells, including reactive astrocytes and microglia, express a mesenchymal signature. Finally, we observe the same phenomena in human disease by combining ribosome profiling of human proneural tumor and non-neoplastic brain tissue with computational deconvolution to assess cell-type-specific translational regulation. PMID:25122893
Lunina, Natalia A; Agafonova, Elena V; Chekanovskaya, Lyudmila A; Dvortsov, Igor A; Berezina, Oksana V; Shedova, Ekaterina N; Kostrov, Sergey V; Velikodvorskaya, Galina A
2007-07-01
A cluster of Thermotoga neapolitana genes participating in starch degradation includes the malG gene of sugar transport protein and the aglB gene of cyclomaltodextrinase. The start and stop codons of these genes share a common overlapping sequence, aTGAtg. Here, we compared properties of expression products of three different constructs with aglB from T. neapolitana. The first expression vector contained the aglB gene linked to an upstream 90-bp 3'-terminal region of the malG gene with the stop codon overlapping with the start codon of aglB. The second construct included the isolated coding sequence of aglB with two tandem potential start codons. The expression product of this construct in Escherichia coli had two tandem Met residues at its N terminus and was characterized by low thermostability and high tendency to aggregate. In contrast, co-expression of aglB and the 3'-terminal region of malG (the first construct) resulted in AglB with only one N-terminal Met residue and a much higher specific activity of cyclomaltodextrinase. Moreover, the enzyme expressed by such a construct was more thermostable and less prone to aggregation. The third construct was the same as the second one except that it contained only one ATG start codon. The product of its expression had kinetic and other properties similar to those of the enzyme with only one N-terminal Met residue.
Khaĭrulina, Iu S; Molotkov, M V; Bulygin, K N; Graĭfer, D M; Ven'yaminova, A G; Frolova, L Iu; Stahl, J; Karpova, G G
2008-01-01
Protein S3 fragments were determined that crosslink to modified mRNA analogues in positions +5 to +12 relative to the first nucleotide in the P-site binding codon in model complexes mimicking states of ribosomes at the elongation and translation termination steps. The mRNA analogues contained a Phe codon UUU/UUC at the 5'-termini that could predetermine the position of the tRNA(Phe) on the ribosome by the location of P-site binding and perfluorophenylazidobenzoyl group at a nucleotide in various positions 3' of the UUU/UUC codon. The crosslinked S3 protein was isolated from 80S ribosomal complexes irradiated with mild UV light and subjected to cyanogen bromide-induced cleavage at methionine residues with subsequent identification of the crosslinked oligopeptides. An analysis of the positions of modified oligopeptides resulting from the cleavage showed that, in dependence on the positions of modified nucleotides in the mRNA analogue, the crosslinking sites were found in the N-terminal half of the protein (fragment 2-127) and/or in the C-terminal fragment 190-236; the latter reflects a new peculiarity in the structure of the mRNA binding center in the ribosome, unknown to date. The results of crosslinking did not depend on the type of A-site codon or on the presence of translation termination factor eRF1.
Ribosome hijacking: a role for small protein B during trans-translation.
Nonin-Lecomte, Sylvie; Germain-Amiot, Noella; Gillet, Reynald; Hallier, Marc; Ponchon, Luc; Dardel, Frédéric; Felden, Brice
2009-02-01
Tight recognition of codon-anticodon pairings by the ribosome ensures the accuracy and fidelity of protein synthesis. In eubacteria, translational surveillance and ribosome rescue are performed by the 'tmRNA-SmpB' system (transfer messenger RNA-small protein B). Remarkably, entry and accommodation of aminoacylated-tmRNA into stalled ribosomes occur without a codon-anticodon interaction but in the presence of SmpB. Here, we show that within a stalled ribosome, SmpB interacts with the three universally conserved bases G530, A1492 and A1493 that form the 30S subunit decoding centre, in which canonical codon-anticodon pairing occurs. The footprints at positions A1492 and A1493 of a small decoding centre, as well as on a set of conserved SmpB amino acids, were identified by nuclear magnetic resonance. Mutants at these residues display the same growth defects as for DeltasmpB strains. The SmpB protein has functional and structural similarities with initiation factor 1, and is proposed to be a functional mimic of the pairing between a codon and an anticodon.
Man, Orna; Pilpel, Yitzhak
2007-03-01
A major challenge in comparative genomics is to understand how phenotypic differences between species are encoded in their genomes. Phenotypic divergence may result from differential transcription of orthologous genes, yet less is known about the involvement of differential translation regulation in species phenotypic divergence. In order to assess translation effects on divergence, we analyzed approximately 2,800 orthologous genes in nine yeast genomes. For each gene in each species, we predicted translation efficiency, using a measure of the adaptation of its codons to the organism's tRNA pool. Mining this data set, we found hundreds of genes and gene modules with correlated patterns of translational efficiency across the species. One signal encompassed entire modules that are either needed for oxidative respiration or fermentation and are efficiently translated in aerobic or anaerobic species, respectively. In addition, the efficiency of translation of the mRNA splicing machinery strongly correlates with the number of introns in the various genomes. Altogether, we found extensive selection on synonymous codon usage that modulates translation according to gene function and organism phenotype. We conclude that, like factors such as transcription regulation, translation efficiency affects and is affected by the process of species divergence.
Mutations in eukaryotic release factors 1 and 3 act as general nonsense suppressors in Drosophila.
Chao, Anna T; Dierick, Herman A; Addy, Tracie M; Bejsovec, Amy
2003-01-01
In a screen for suppressors of the Drosophila wingless(PE4) nonsense allele, we isolated mutations in the two components that form eukaryotic release factor. eRF1 and eRF3 comprise the translation termination complex that recognizes stop codons and catalyzes the release of nascent polypeptide chains from ribosomes. Mutations disrupting the Drosophila eRF1 and eRF3 show a strong maternal-effect nonsense suppression due to readthrough of stop codons and are zygotically lethal during larval stages. We tested nonsense mutations in wg and in other embryonically acting genes and found that different stop codons can be suppressed but only a subset of nonsense alleles are subject to suppression. We suspect that the context of the stop codon is significant: nonsense alleles sensitive to suppression by eRF1 and eRF3 encode stop codons that are immediately followed by a cytidine. Such suppressible alleles appear to be intrinsically weak, with a low level of readthrough that is enhanced when translation termination is disrupted. Thus the eRF1 and eRF3 mutations provide a tool for identifying nonsense alleles that are leaky. Our findings have important implications for assigning null mutant phenotypes and for selecting appropriate alleles to use in suppressor screens. PMID:14573473
Zaborske, John M.; Bauer DuMont, Vanessa L.; Wallace, Edward W. J.; Pan, Tao; Aquadro, Charles F.; Drummond, D. Allan
2014-01-01
Natural selection favors efficient expression of encoded proteins, but the causes, mechanisms, and fitness consequences of evolved coding changes remain an area of aggressive inquiry. We report a large-scale reversal in the relative translational accuracy of codons across 12 fly species in the Drosophila/Sophophora genus. Because the reversal involves pairs of codons that are read by the same genomically encoded tRNAs, we hypothesize, and show by direct measurement, that a tRNA anticodon modification from guanosine to queuosine has coevolved with these genomic changes. Queuosine modification is present in most organisms but its function remains unclear. Modification levels vary across developmental stages in D. melanogaster, and, consistent with a causal effect, genes maximally expressed at each stage display selection for codons that are most accurate given stage-specific queuosine modification levels. In a kinetic model, the known increased affinity of queuosine-modified tRNA for ribosomes increases the accuracy of cognate codons while reducing the accuracy of near-cognate codons. Levels of queuosine modification in D. melanogaster reflect bioavailability of the precursor queuine, which eukaryotes scavenge from the tRNAs of bacteria and absorb in the gut. These results reveal a strikingly direct mechanism by which recoding of entire genomes results from changes in utilization of a nutrient. PMID:25489848
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rouha, Harald; Hoenninger, Verena M.; Thurner, Caroline
2011-08-15
Flavivirus gene expression is modulated by RNA secondary structure elements at the terminal ends of the viral RNA molecule. For tick-borne encephalitis virus (TBEV), four stem-loop (SL) elements have been predicted in the first 180 nucleotides of the viral genome: 5'-SL1, 5'-SL2, 5'-SL3 and 5'-SL4. The last three of these appear to be unique to tick-borne flaviviruses. Here, we report their characterization by mutagenesis in a TBEV luciferase reporter system. By manipulating their thermodynamic properties, we found that an optimal stability of the 5'-SL2 is required for efficient RNA replication. 5'-SL3 formation is also important for viral RNA replication, butmore » although it contains the viral start codon, its formation is dispensable for RNA translation. 5'-SL4 appears to facilitate both RNA translation and replication. Our data suggest that maintenance of the balanced thermodynamic stability of these SL elements is important for temporal regulation of its different functions.« less
Translation regulation of mammalian selenoproteins.
Vindry, Caroline; Ohlmann, Théophile; Chavatte, Laurent
2018-05-09
Interest in selenium research has considerably grown over the last decades owing to the association of selenium deficiencies with an increased risk of several human diseases, including cancers, cardiovascular disorders and infectious diseases. The discovery of a genetically encoded 21 st amino acid, selenocysteine, is a fascinating breakthrough in molecular biology as it is the first addition to the genetic code deciphered in the 1960s. Selenocysteine is a structural and functional analog of cysteine, where selenium replaces sulfur, and its presence is critical for the catalytic activity of selenoproteins. The insertion of selenocysteine is a non-canonical translational event, based on the recoding of a UGA codon in selenoprotein mRNAs, normally used as a stop codon in other cellular mRNAs. Two RNA molecules and associated partners are crucial components of the selenocysteine insertion machinery, the Sec-tRNA [Ser]Sec devoted to UGA codon recognition and the SECIS elements located in the 3'UTR of selenoprotein mRNAs. The translational UGA recoding event is a limiting stage of selenoprotein expression and its efficiency is regulated by several factors. The control of selenoproteome expression is crucial for redox homeostasis and antioxidant defense of mammalian organisms. In this review, we summarize current knowledge on the co-translational insertion of selenocysteine into selenoproteins, and its layers of regulation. Copyright © 2018. Published by Elsevier B.V.
Photo-dependent protein biosynthesis using a caged aminoacyl-tRNA.
Akahoshi, Akiya; Doi, Yoshio; Sisido, Masahiko; Watanabe, Kazunori; Ohtsuki, Takashi
2014-12-01
Translation systems with four-base codons provide a powerful strategy for protein engineering and protein studies because they enable site-specific incorporation of non-natural amino acids into proteins. In this study, a caged aminoacyl-tRNA with a four-base anticodon was synthesized. The caged aminoacyl-tRNA contains a photocleavable nitroveratryloxycarbonyl (NVOC) group. This study showed that the caged aminoacyl-tRNA was not deacylated, did not bind to EF-Tu, and was activated by light. Photo-dependent translation of an mRNA containing the four-base codon was demonstrated using the caged aminoacyl-tRNA.
Gene Model Annotations for Drosophila melanogaster: The Rule-Benders
Crosby, Madeline A.; Gramates, L. Sian; dos Santos, Gilberto; Matthews, Beverley B.; St. Pierre, Susan E.; Zhou, Pinglei; Schroeder, Andrew J.; Falls, Kathleen; Emmert, David B.; Russo, Susan M.; Gelbart, William M.
2015-01-01
In the context of the FlyBase annotated gene models in Drosophila melanogaster, we describe the many exceptional cases we have curated from the literature or identified in the course of FlyBase analysis. These range from atypical but common examples such as dicistronic and polycistronic transcripts, noncanonical splices, trans-spliced transcripts, noncanonical translation starts, and stop-codon readthroughs, to single exceptional cases such as ribosomal frameshifting and HAC1-type intron processing. In FlyBase, exceptional genes and transcripts are flagged with Sequence Ontology terms and/or standardized comments. Because some of the rule-benders create problems for handlers of high-throughput data, we discuss plans for flagging these cases in bulk data downloads. PMID:26109356
The Quantum Workings of the Rotating 64-Grid Genetic Code
Castro-Chavez, Fernando
2011-01-01
In this article, the pattern learned from the classic or conventional rotating circular genetic code is transferred to a 64-grid model. In this non-static representation, the codons for the same amino acid within each quadrant could be exchanged, wobbling or rotating in a quantic way similar to the electrons within an atomic orbit. Represented in this 64-grid format are the three rules of variation encompassing 4, 2, or 1 quadrant, respectively: 1) same position in four quadrants for the essential hydrophobic amino acids that have U at the center, 2) same or contiguous position for the same or related amino acids in two quadrants, and 3) equivalent amino acids within one quadrant. Also represented is the mathematical balance of the odd and even codons, and the most used codons per amino acid in humans compared to one diametrically opposed organism: the plant Arabidopsis thaliana, a comparison that depicts the difference in third nucleotide preferences: a C/U exchange for 11 amino acids, a G/A and a G/U exchange for 2 amino acids, respectively, and a C/A exchange for one amino acid; by studying these codon usage preferences per amino acid we present our two hypotheses: 1) A slower translation in vertebrates and 2) a faster translation in invertebrates, possibly due to the aqueous environments where they live. These codon usage preferences may also be able to determine genomic compatibility by comparing individual mRNAs and their functional third dimensional structure, transport and translation within cells and organisms. These observations are aimed to the design of bioinformatics computational tools to compare human genomes and to determine the exchange between compatible codons and amino acids, to preserve and/or to bring back extinct biodiversity, and for the early detection of incompatible changes that lead to genetic diseases. PMID:22308074
Choi, Sang Ki; Olsen, DeAnne S.; Roll-Mecak, Antonina; Martung, Agnes; Remo, Keith L.; Burley, Stephen K.; Hinnebusch, Alan G.; Dever, Thomas E.
2000-01-01
To initiate protein synthesis, a ribosome with bound initiator methionyl-tRNA must be assembled at the start codon of an mRNA. This process requires the coordinated activities of three translation initiation factors (IF) in prokaryotes and at least 12 translation initiation factors in eukaryotes (eIF). The factors eIF1A and eIF5B from eukaryotes show extensive amino acid sequence similarity to the factors IF1 and IF2 from prokaryotes. By a combination of two-hybrid, coimmunoprecipitation, and in vitro binding assays eIF1A and eIF5B were found to interact directly, and the eIF1A binding site was mapped to the C-terminal region of eIF5B. This portion of eIF5B was found to be critical for growth in vivo and for translation in vitro. Overexpression of eIF1A exacerbated the slow-growth phenotype of yeast strains expressing C-terminally truncated eIF5B. These findings indicate that the physical interaction between the evolutionarily conserved factors eIF1A and eIF5B plays an important role in translation initiation, perhaps to direct or stabilize the binding of methionyl-tRNA to the ribosomal P site. PMID:10982835
Codon optimization underpins generalist parasitism in fungi
Badet, Thomas; Peyraud, Remi; Mbengue, Malick; Navaud, Olivier; Derbyshire, Mark; Oliver, Richard P; Barbacci, Adelin; Raffaele, Sylvain
2017-01-01
The range of hosts that parasites can infect is a key determinant of the emergence and spread of disease. Yet, the impact of host range variation on the evolution of parasite genomes remains unknown. Here, we show that codon optimization underlies genome adaptation in broad host range parasites. We found that the longer proteins encoded by broad host range fungi likely increase natural selection on codon optimization in these species. Accordingly, codon optimization correlates with host range across the fungal kingdom. At the species level, biased patterns of synonymous substitutions underpin increased codon optimization in a generalist but not a specialist fungal pathogen. Virulence genes were consistently enriched in highly codon-optimized genes of generalist but not specialist species. We conclude that codon optimization is related to the capacity of parasites to colonize multiple hosts. Our results link genome evolution and translational regulation to the long-term persistence of generalist parasitism. DOI: http://dx.doi.org/10.7554/eLife.22472.001 PMID:28157073
Synonymous codon usage of genes in polymerase complex of Newcastle disease virus.
Kumar, Chandra Shekhar; Kumar, Sachin
2017-06-01
Newcastle disease virus (NDV) is pathogenic to both avian and non-avian species but extensively finds poultry as its primary host and causes heavy economic losses in the poultry industry. In this study, a total of 186 polymerase complex comprising of nucleoprotein (N), phosphoprotein (P), and large polymerase (L) genes of NDV was analyzed for synonymous codon usage. The relative synonymous codon usage and effective number of codons (ENC) values were used to estimate codon usage variation in each gene. Correspondence analysis (COA) was used to study the major trend in codon usage variation. Analyzing the ENC plot values against GC3s (at synonymous third codon position) we concluded that mutational pressure was the main factor determining codon usage bias than translational selection in NDV N, P, and L genes. Moreover, correlation analysis indicated, that aromaticity of N, P, and L genes also influenced the codon usage variation. The varied distribution of pathotypes for N, P, and L gene clearly suggests that change in codon usage for NDV is pathotype specific. The codon usage preference similarity in N, P, and L gene might be detrimental for polymerase complex functioning. The study represents a comprehensive analysis to date of N, P, and L genes codon usage pattern of NDV and provides a basic understanding of the mechanisms for codon usage bias. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Evolution of Synonymous Codon Usage in Neurospora tetrasperma and Neurospora discreta
Whittle, C. A.; Sun, Y.; Johannesson, H.
2011-01-01
Neurospora comprises a primary model system for the study of fungal genetics and biology. In spite of this, little is known about genome evolution in Neurospora. For example, the evolution of synonymous codon usage is largely unknown in this genus. In the present investigation, we conducted a comprehensive analysis of synonymous codon usage and its relationship to gene expression and gene length (GL) in Neurospora tetrasperma and Neurospora discreta. For our analysis, we examined codon usage among 2,079 genes per organism and assessed gene expression using large-scale expressed sequenced tag (EST) data sets (279,323 and 453,559 ESTs for N. tetrasperma and N. discreta, respectively). Data on relative synonymous codon usage revealed 24 codons (and two putative codons) that are more frequently used in genes with high than with low expression and thus were defined as optimal codons. Although codon-usage bias was highly correlated with gene expression, it was independent of selectively neutral base composition (introns); thus demonstrating that translational selection drives synonymous codon usage in these genomes. We also report that GL (coding sequences [CDS]) was inversely associated with optimal codon usage at each gene expression level, with highly expressed short genes having the greatest frequency of optimal codons. Optimal codon frequency was moderately higher in N. tetrasperma than in N. discreta, which might be due to variation in selective pressures and/or mating systems. PMID:21402862
Lal, Devi; Verma, Mansi; Behura, Susanta K; Lal, Rup
2016-10-01
Actinobacteria are Gram-positive bacteria commonly found in soil, freshwater and marine ecosystems. In this investigation, bias in codon usages of ninety actinobacterial genomes was analyzed by estimating different indices of codon bias such as Nc (effective number of codons), SCUO (synonymous codon usage order), RSCU (relative synonymous codon usage), as well as sequence patterns of codon contexts. The results revealed several characteristic features of codon usage in Actinobacteria, as follows: 1) C- or G-ending codons are used frequently in comparison with A- and U ending codons; 2) there is a direct relationship of GC content with use of specific amino acids such as alanine, proline and glycine; 3) there is an inverse relationship between GC content and Nc estimates, 4) there is low SCUO value (<0.5) for most genes; and 5) GCC-GCC, GCC-GGC, GCC-GAG and CUC-GAC are the frequent context sequences among codons. This study highlights the fact that: 1) in Actinobacteria, extreme GC content and codon bias are driven by mutation rather than natural selection; (2) traits like aerobicity are associated with effective natural selection and therefore low GC content and low codon bias, demonstrating the role of both mutational bias and translational selection in shaping the habitat and phenotype of actinobacterial species. Copyright © 2016 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
1994-01-01
The 40-S subunit of eukaryotic ribosomes binds to the capped 5'-end of mRNA and scans for the first AUG in a favorable sequence context to initiate translation. Most eukaryotic mRNAs therefore have a short 5'- untranslated region (5'-UTR) and no AUGs upstream of the translational start site; features that seem to assure efficient translation. However, approximately 5-10% of all eukaryotic mRNAs, particularly those encoding for regulatory proteins, have complex leader sequences that seem to compromise translational initiation. The retinoic-acid- receptor-beta 2 (RAR beta 2) mRNA is such a transcript with a long (461 nucleotides) 5'-UTR that contains five, partially overlapping, upstream open reading frames (uORFs) that precede the major ORF. We have begun to investigate the function of this complex 5'-UTR in transgenic mice, by introducing mutations in the start/stop codons of the uORFs in RAR beta 2-lacZ reporter constructs. When we compared the expression patterns of mutant and wild-type constructs we found that these mutations affected expression of the downstream RAR beta 2-ORF, resulting in an altered regulation of RAR beta 2-lacZ expression in heart and brain. Other tissues were unaffected. RNA analysis of adult tissues demonstrated that the uORFs act at the level of translation; adult brains and hearts of transgenic mice carrying a construct with either the wild-type or a mutant UTR, had the same levels of mRNA, but only the mutant produced protein. Our study outlines an unexpected role for uORFs: control of tissue-specific and developmentally regulated gene expression. PMID:7962071
Popov, Georgy; Majhi, Bharat Bhusan; Sessa, Guido
2018-05-21
The type III effector XopAE from the Xanthomonas euvesicatoria strain 85-10 ( Xe 85-10) was previously shown to inhibit plant immunity and enhance pathogen-induced disease symptoms. Evolutionary analysis of 60 xopAE alleles ( AEal ) revealed that the xopAE locus is conserved in multiple Xanthomonas species. The majority of xopAE alleles (55 out of 60) encodes a single ORF ( xopAE ), while in 5 alleles, including AEal 37 of the Xe 85-10 strain, a frame-shift splits the locus into two ORFs ( hpaF and a truncated xopAE ). To test whether the second ORF of AEal 37 ( xopAE 85-10 ) is translated, we examined expression of YFP fused downstream to truncated or mutant forms of the locus in Xanthomonas bacteria. YFP fluorescence was detected at maximal levels when the reporter was in proximity of an internal ribosome-binding site upstream to a rare ATT start codon in the xopAE 85-10 ORF, but severely reduced when these elements were abolished. In agreement with the notion that xopAE 85- 10 is a functional gene, its protein product was translocated into plant cells by the type III secretion system and translocation was dependent on its upstream ORF hpaF. Homology modeling predicted that XopAE 85-10 contains an E3 ligase XL-box domain at the C-terminus, and in vitro assays demonstrated that this domain displays mono-ubiquitination activity. Remarkably, the XL-box was essential for XopAE 85-10 to inhibit PAMP-induced gene expression in Arabidopsis protoplasts. Together, these results indicate that the xopAE 85-10 gene resides in a functional operon, which utilizes the alternative start codon ATT, and encodes a novel XL-box E3 ligase. Importance Xanthomonas bacteria utilize a type III secretion system to cause disease in many crops. This study provides insights into evolution, translocation and biochemical function of the XopAE type III secreted effector contributing to the understanding of Xanthomonas-host interactions. We establish XopAE as core effector of seven Xanthomonas species and elucidate evolution of the Xanthomonas euvesicatoria xopAE locus, which contains an operon encoding a truncated effector. Our findings indicate that this operon evolved from the split of a multi-domains gene into two ORFs that conserved the original domain function. Analysis of xopAE 85-10 translation provides the first evidence for translation initiation from an ATT codon in Xanthomonas Our data demonstrate that XopAE 85-10 is an XL-box E3 ubiquitin ligase and provide insights into structure and function of this effector family. Copyright © 2018 American Society for Microbiology.
Inhibition of Non-ATG Translational Events in Cells via Covalent Small Molecules Targeting RNA.
Yang, Wang-Yong; Wilson, Henry D; Velagapudi, Sai Pradeep; Disney, Matthew D
2015-04-29
One major class of disease-causing RNAs is expanded repeating transcripts. These RNAs cause diseases via multiple mechanisms, including: (i) gain-of-function, in which repeating RNAs bind and sequester proteins involved in RNA biogenesis and (ii) repeat associated non-ATG (RAN) translation, in which repeating transcripts are translated into toxic proteins without use of a canonical, AUG, start codon. Herein, we develop and study chemical probes that bind and react with an expanded r(CGG) repeat (r(CGG)(exp)) present in a 5' untranslated region that causes fragile X-associated tremor/ataxia syndrome (FXTAS). Reactive compounds bind to r(CGG)(exp) in cellulo as shown with Chem-CLIP-Map, an approach to map small molecule binding sites within RNAs in cells. Compounds also potently improve FXTAS-associated pre-mRNA splicing and RAN translational defects, while not affecting translation of the downstream open reading frame. In contrast, oligonucleotides affect both RAN and canonical translation when they bind to r(CGG)(exp), which is mechanistically traced to a decrease in polysome loading. Thus, designer small molecules that react with RNA targets can be used to profile the RNAs to which they bind in cells, including identification of binding sites, and can modulate several aspects of RNA-mediated disease pathology in a manner that may be more beneficial than oligonucleotides.
The control of lambda DNA terminase synthesis.
Murialdo, H; Davidson, A; Chow, S; Gold, M
1987-01-01
Nu1 and A, the genes coding for bacteriophage lambda DNA terminase, rank among the most poorly translated genes expressed in E. coli. To understand the reason for this low level of translation the genes were cloned into plasmids and their expression measured. In addition, the wild type DNA sequences immediately preceding the genes were reduced and modified. It was found that the elements that control translation are contained in the 100 base pairs upstream from the initiation codon. Interchanging these upstream sequences with those of an efficiently translated gene dramatically increased the translation of terminase subunits. It seems unlikely that the rare codons present in the genes, and any feature of their mRNA secondary structure play a role in the control of their translation. The elimination of cos from plasmids containing Nu1 and A also resulted in an increase in terminase production. This result suggests a role for cos in the control of late gene expression. The terminase subunit overproducer strains are potentially very useful for the design of improved DNA packaging and cosmid mapping techniques. Images PMID:3029667
The Relation of Codon Bias to Tissue-Specific Gene Expression in Arabidopsis thaliana
Camiolo, Salvatore; Farina, Lorenzo; Porceddu, Andrea
2012-01-01
The codon composition of coding sequences plays an important role in the regulation of gene expression. Herein, we report systematic differences in the usage of synonymous codons among Arabidopsis thaliana genes that are expressed specifically in distinct tissues. Although we observed that both regionally and transcriptionally associated mutational biases were associated significantly with codon bias, they could not explain the observed differences fully. Similarly, given that transcript abundances did not account for the differences in codon usage, it is unlikely that selection for translational efficiency can account exclusively for the observed codon bias. Thus, we considered the possible evolution of codon bias as an adaptive response to the different abundances of tRNAs in different tissues. Our analysis demonstrated that in some cases, codon usage in genes that were expressed in a broad range of tissues was influenced primarily by the tissue in which the gene was expressed maximally. On the basis of this finding we propose that genes that are expressed in certain tissues might show a tissue-specific compositional signature in relation to codon usage. These findings might have implications for the design of transgenes in relation to optimizing their expression. PMID:22865738
Analysis of the synonymous codon usage bias in recently emerged enterovirus D68 strains.
Karniychuk, Uladzimir U
2016-09-02
Understanding the codon usage pattern of a pathogen and relationship between pathogen and host's codon usage patterns has fundamental and applied interests. Enterovirus D68 (EV-D68) is an emerging pathogen with a potentially high public health significance. In the present study, the synonymous codon usage bias of 27 recently emerged, and historical EV-D68 strains was analyzed. In contrast to previously studied enteroviruses (enterovirus 71 and poliovirus), EV-D68 and human host have a high discrepancy between favored codons. Analysis of viral synonymous codon usage bias metrics, viral nucleotide/dinucleotide compositional parameters, and viral protein properties showed that mutational pressure is more involved in shaping the synonymous codon usage bias of EV-D68 than translation selection. Computation of codon adaptation indices allowed to estimate expression potential of the EV-D68 genome in several commonly used laboratory animals. This approach requires experimental validation and may provide an auxiliary tool for the rational selection of laboratory animals to model emerging viral diseases. Enterovirus D68 genome compositional and codon usage data can be useful for further pathogenesis, animal model, and vaccine design studies. Copyright © 2016 Elsevier B.V. All rights reserved.
Köhrer, Caroline; Mandal, Debabrata; Gaston, Kirk W.; Grosjean, Henri; Limbach, Patrick A.; RajBhandary, Uttam L.
2014-01-01
Translation of the isoleucine codon AUA in most prokaryotes requires a modified C (lysidine or agmatidine) at the wobble position of tRNA2Ile to base pair specifically with the A of the AUA codon but not with the G of AUG. Recently, a Bacillus subtilis strain was isolated in which the essential gene encoding tRNAIle-lysidine synthetase was deleted for the first time. In such a strain, C34 at the wobble position of tRNA2Ile is expected to remain unmodified and cells depend on a mutant suppressor tRNA derived from tRNA1Ile, in which G34 has been changed to U34. An important question, therefore, is how U34 base pairs with A without also base pairing with G. Here, we show (i) that unlike U34 at the wobble position of all B. subtilis tRNAs of known sequence, U34 in the mutant tRNA is not modified, and (ii) that the mutant tRNA binds strongly to the AUA codon on B. subtilis ribosomes but only weakly to AUG. These in vitro data explain why the suppressor strain displays only a low level of misreading AUG codons in vivo and, as shown here, grows at a rate comparable to that of the wild-type strain. PMID:24194599
Bera, Bidhan Ch; Virmani, Nitin; Kumar, Naveen; Anand, Taruna; Pavulraj, S; Rash, Adam; Elton, Debra; Rash, Nicola; Bhatia, Sandeep; Sood, Richa; Singh, Raj Kumar; Tripathi, Bhupendra Nath
2017-08-23
Equine influenza is a major health problem of equines worldwide. The polymerase genes of influenza virus have key roles in virus replication, transcription, transmission between hosts and pathogenesis. Hence, the comprehensive genetic and codon usage bias of polymerase genes of equine influenza virus (EIV) were analyzed to elucidate the genetic and evolutionary relationships in a novel perspective. The group - specific consensus amino acid substitutions were identified in all polymerase genes of EIVs that led to divergence of EIVs into various clades. The consistent amino acid changes were also detected in the Florida clade 2 EIVs circulating in Europe and Asia since 2007. To study the codon usage patterns, a total of 281,324 codons of polymerase genes of EIV H3N8 isolates from 1963 to 2015 were systemically analyzed. The polymerase genes of EIVs exhibit a weak codon usage bias. The ENc-GC3s and Neutrality plots indicated that natural selection is the major influencing factor of codon usage bias, and that the impact of mutation pressure is comparatively minor. The methods for estimating host imposed translation pressure suggested that the polymerase acidic (PA) gene seems to be under less translational pressure compared to polymerase basic 1 (PB1) and polymerase basic 2 (PB2) genes. The multivariate statistical analysis of polymerase genes divided EIVs into four evolutionary diverged clusters - Pre-divergent, Eurasian, Florida sub-lineage 1 and 2. Various lineage specific amino acid substitutions observed in all polymerase genes of EIVs and especially, clade 2 EIVs underwent major variations which led to the emergence of a phylogenetically distinct group of EIVs originating from Richmond/1/07. The codon usage bias was low in all the polymerase genes of EIVs that was influenced by the multiple factors such as the nucleotide compositions, mutation pressure, aromaticity and hydropathicity. However, natural selection was the major influencing factor in defining the codon usage patterns and evolution of polymerase genes of EIVs.
Lapierre, Pascal; Mir, Mushtaq; Chase, Michael R.; Pyle, Margaret M.; Gawande, Richa; Ahmad, Rushdy; Sarracino, David A.; Ioerger, Thomas R.; Fortune, Sarah M.; Derbyshire, Keith M.; Wade, Joseph T.; Gray, Todd A.
2015-01-01
RNA-seq technologies have provided significant insight into the transcription networks of mycobacteria. However, such studies provide no definitive information on the translational landscape. Here, we use a combination of high-throughput transcriptome and proteome-profiling approaches to more rigorously understand protein expression in two mycobacterial species. RNA-seq and ribosome profiling in Mycobacterium smegmatis, and transcription start site (TSS) mapping and N-terminal peptide mass spectrometry in Mycobacterium tuberculosis, provide complementary, empirical datasets to examine the congruence of transcription and translation in the Mycobacterium genus. We find that nearly one-quarter of mycobacterial transcripts are leaderless, lacking a 5’ untranslated region (UTR) and Shine-Dalgarno ribosome-binding site. Our data indicate that leaderless translation is a major feature of mycobacterial genomes and is comparably robust to leadered initiation. Using translational reporters to systematically probe the cis-sequence requirements of leaderless translation initiation in mycobacteria, we find that an ATG or GTG at the mRNA 5’ end is both necessary and sufficient. This criterion, together with our ribosome occupancy data, suggests that mycobacteria encode hundreds of small, unannotated proteins at the 5’ ends of transcripts. The conservation of small proteins in both mycobacterial species tested suggests that some play important roles in mycobacterial physiology. Our translational-reporter system further indicates that mycobacterial leadered translation initiation requires a Shine Dalgarno site in the 5’ UTR and that ATG, GTG, TTG, and ATT codons can robustly initiate translation. Our combined approaches provide the first comprehensive view of mycobacterial gene structures and their non-canonical mechanisms of protein expression. PMID:26536359
Shell, Scarlet S; Wang, Jing; Lapierre, Pascal; Mir, Mushtaq; Chase, Michael R; Pyle, Margaret M; Gawande, Richa; Ahmad, Rushdy; Sarracino, David A; Ioerger, Thomas R; Fortune, Sarah M; Derbyshire, Keith M; Wade, Joseph T; Gray, Todd A
2015-11-01
RNA-seq technologies have provided significant insight into the transcription networks of mycobacteria. However, such studies provide no definitive information on the translational landscape. Here, we use a combination of high-throughput transcriptome and proteome-profiling approaches to more rigorously understand protein expression in two mycobacterial species. RNA-seq and ribosome profiling in Mycobacterium smegmatis, and transcription start site (TSS) mapping and N-terminal peptide mass spectrometry in Mycobacterium tuberculosis, provide complementary, empirical datasets to examine the congruence of transcription and translation in the Mycobacterium genus. We find that nearly one-quarter of mycobacterial transcripts are leaderless, lacking a 5' untranslated region (UTR) and Shine-Dalgarno ribosome-binding site. Our data indicate that leaderless translation is a major feature of mycobacterial genomes and is comparably robust to leadered initiation. Using translational reporters to systematically probe the cis-sequence requirements of leaderless translation initiation in mycobacteria, we find that an ATG or GTG at the mRNA 5' end is both necessary and sufficient. This criterion, together with our ribosome occupancy data, suggests that mycobacteria encode hundreds of small, unannotated proteins at the 5' ends of transcripts. The conservation of small proteins in both mycobacterial species tested suggests that some play important roles in mycobacterial physiology. Our translational-reporter system further indicates that mycobacterial leadered translation initiation requires a Shine Dalgarno site in the 5' UTR and that ATG, GTG, TTG, and ATT codons can robustly initiate translation. Our combined approaches provide the first comprehensive view of mycobacterial gene structures and their non-canonical mechanisms of protein expression.
Subramanian, Abhishek; Sarkar, Ram Rup
2015-10-01
Understanding the variations in gene organization and its effect on the phenotype across different Leishmania species, and to study differential clinical manifestations of parasite within the host, we performed large scale analysis of codon usage patterns between Leishmania and other known Trypanosomatid species. We present the causes and consequences of codon usage bias in Leishmania genomes with respect to mutational pressure, translational selection and amino acid composition bias. We establish GC bias at wobble position that governs codon usage bias across Leishmania species, rather than amino acid composition bias. We found that, within Leishmania, homogenous codon context coding for less frequent amino acid pairs and codons avoiding formation of folding structures in mRNA are essentially chosen. We predicted putative differences in global expression between genes belonging to specific pathways across Leishmania. This explains the role of evolution in shaping the otherwise conserved genome to demonstrate species-specific function-level differences for efficient survival. Copyright © 2015 Elsevier Inc. All rights reserved.
Analysis of amino acid and codon usage in Paramecium bursaria.
Dohra, Hideo; Fujishima, Masahiro; Suzuki, Haruo
2015-10-07
The ciliate Paramecium bursaria harbors the green-alga Chlorella symbionts. We reassembled the P. bursaria transcriptome to minimize falsely fused transcripts, and investigated amino acid and codon usage using the transcriptome data. Surface proteins preferentially use smaller amino acid residues like cysteine. Unusual synonymous codon and amino acid usage in highly expressed genes can reflect a balance between translational selection and other factors. A correlation of gene expression level with synonymous codon or amino acid usage is emphasized in genes down-regulated in symbiont-bearing cells compared to symbiont-free cells. Our results imply that the selection is associated with P. bursaria-Chlorella symbiosis. Copyright © 2015 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Identification of the likely translational start of Mycobacterium tuberculosis GyrB.
Karkare, Shantanu; Brown, Amanda C; Parish, Tanya; Maxwell, Anthony
2013-07-15
Bacterial DNA gyrase is a validated target for antibacterial chemotherapy. It consists of two subunits, GyrA and GyrB, which form an A₂B₂ complex in the active enzyme. Sequence alignment of Mycobacterium tuberculosis GyrB with other bacterial GyrBs predicts the presence of 40 potential additional amino acids at the GyrB N-terminus. There are discrepancies between the M. tuberculosis GyrB sequences retrieved from different databases, including sequences annotated with or without the additional 40 amino acids. This has resulted in differences in the GyrB sequence numbering that has led to the reporting of previously known fluoroquinolone-resistance mutations as novel mutations. We have expressed M. tuberculosis GyrB with and without the extra 40 amino acids in Escherichia coli and shown that both can be produced as soluble, active proteins. Supercoiling and other assays of the two proteins show no differences, suggesting that the additional 40 amino acids have no effect on the enzyme in vitro. RT-PCR analysis of M. tuberculosis mRNA shows that transcripts that could yield both the longer and shorter protein are present. However, promoter analysis showed that only the promoter elements leading to the shorter GyrB (lacking the additional 40 amino acids) had significant activity. We conclude that the most probable translational start codon for M. tuberculosis GyrB is GTG (Val) which results in translation of a protein of 674 amino acids (74 kDa).
Analyses of frameshifting at UUU-pyrimidine sites.
Schwartz, R; Curran, J F
1997-05-15
Others have recently shown that the UUU phenylalanine codon is highly frameshift-prone in the 3'(rightward) direction at pyrimidine 3'contexts. Here, several approaches are used to analyze frameshifting at such sites. The four permutations of the UUU/C (phenylalanine) and CGG/U (arginine) codon pairs were examined because they vary greatly in their expected frameshifting tendencies. Furthermore, these synonymous sites allow direct tests of the idea that codon usage can control frameshifting. Frameshifting was measured for these dicodons embedded within each of two broader contexts: the Escherichia coli prfB (RF2 gene) programmed frameshift site and a 'normal' message site. The principal difference between these contexts is that the programmed frameshift contains a purine-rich sequence upstream of the slippery site that can base pair with the 3'end of 16 S rRNA (the anti-Shine-Dalgarno) to enhance frameshifting. In both contexts frameshift frequencies are highest if the slippery tRNAPhe is capable of stable base pairing in the shifted reading frame. This requirement is less stringent in the RF2 context, as if the Shine-Dalgarno interaction can help stabilize a quasi-stable rephased tRNA:message complex. It was previously shown that frameshifting in RF2 occurs more frequently if the codon 3'to the slippery site is read by a rare tRNA. Consistent with that earlier work, in the RF2 context frameshifting occurs substantially more frequently if the arginine codon is CGG, which is read by a rare tRNA. In contrast, in the 'normal' context frameshifting is only slightly greater at CGG than at CGU. It is suggested that the Shine-Dalgarno-like interaction elevates frameshifting specifically during the pause prior to translation of the second codon, which makes frameshifting exquisitely sensitive to the rate of translation of that codon. In both contexts frameshifting increases in a mutant strain that fails to modify tRNA base A37, which is 3'of the anticodon. Thus, those base modifications may limit frameshifting at UUU codons. Finally, statistical analyses show that UUU Ynn dicodons are extremely rare in E.coli genes that have highly biased codon usage.
Analyses of frameshifting at UUU-pyrimidine sites.
Schwartz, R; Curran, J F
1997-01-01
Others have recently shown that the UUU phenylalanine codon is highly frameshift-prone in the 3'(rightward) direction at pyrimidine 3'contexts. Here, several approaches are used to analyze frameshifting at such sites. The four permutations of the UUU/C (phenylalanine) and CGG/U (arginine) codon pairs were examined because they vary greatly in their expected frameshifting tendencies. Furthermore, these synonymous sites allow direct tests of the idea that codon usage can control frameshifting. Frameshifting was measured for these dicodons embedded within each of two broader contexts: the Escherichia coli prfB (RF2 gene) programmed frameshift site and a 'normal' message site. The principal difference between these contexts is that the programmed frameshift contains a purine-rich sequence upstream of the slippery site that can base pair with the 3'end of 16 S rRNA (the anti-Shine-Dalgarno) to enhance frameshifting. In both contexts frameshift frequencies are highest if the slippery tRNAPhe is capable of stable base pairing in the shifted reading frame. This requirement is less stringent in the RF2 context, as if the Shine-Dalgarno interaction can help stabilize a quasi-stable rephased tRNA:message complex. It was previously shown that frameshifting in RF2 occurs more frequently if the codon 3'to the slippery site is read by a rare tRNA. Consistent with that earlier work, in the RF2 context frameshifting occurs substantially more frequently if the arginine codon is CGG, which is read by a rare tRNA. In contrast, in the 'normal' context frameshifting is only slightly greater at CGG than at CGU. It is suggested that the Shine-Dalgarno-like interaction elevates frameshifting specifically during the pause prior to translation of the second codon, which makes frameshifting exquisitely sensitive to the rate of translation of that codon. In both contexts frameshifting increases in a mutant strain that fails to modify tRNA base A37, which is 3'of the anticodon. Thus, those base modifications may limit frameshifting at UUU codons. Finally, statistical analyses show that UUU Ynn dicodons are extremely rare in E.coli genes that have highly biased codon usage. PMID:9115369
Liu, Yanbin; Koh, Chong Mei John; Ngoh, Si Te; Ji, Lianghui
2015-10-26
Rhodosporidium and Rhodotorula are two genera of oleaginous red yeast with great potential for industrial biotechnology. To date, there is no effective method for inducible expression of proteins and RNAs in these hosts. We have developed a luciferase gene reporter assay based on a new codon-optimized LUC2 reporter gene (RtLUC2), which is flanked with CAR2 homology arms and can be integrated into the CAR2 locus in the nuclear genome at >90 % efficiency. We characterized the upstream DNA sequence of a D-amino acid oxidase gene (DAO1) from R. toruloides ATCC 10657 by nested deletions. By comparing the upstream DNA sequences of several putative DAO1 homologs of Basidiomycetous fungi, we identified a conserved DNA motif with a consensus sequence of AGGXXGXAGX11GAXGAXGG within a 0.2 kb region from the mRNA translation initiation site. Deletion of this motif led to strong mRNA transcription under non-inducing conditions. Interestingly, DAO1 promoter activity was enhanced about fivefold when the 108 bp intron 1 was included in the reporter construct. We identified a conserved CT-rich motif in the intron with a consensus sequence of TYTCCCYCTCCYCCCCACWYCCGA, deletion or point mutations of which drastically reduced promoter strength under both inducing and non-inducing conditions. Additionally, we created a selection marker-free DAO1-null mutant (∆dao1e) which displayed greatly improved inducible gene expression, particularly when both glucose and nitrogen were present in high levels. To avoid adding unwanted peptide to proteins to be expressed, we converted the original translation initiation codon to ATC and re-created a translation initiation codon at the start of exon 2. This promoter, named P DAO1-in1m1 , showed very similar luciferase activity to the wild-type promoter upon induction with D-alanine. The inducible system was tunable by adjusting the levels of inducers, carbon source and nitrogen source. The intron 1-containing DAO1 promoters coupled with a DAO1 null mutant makes an efficient and tight D-amino acid-inducible gene expression system in Rhodosporidium and Rhodotorula genera. The system will be a valuable tool for metabolic engineering and enzyme expression in these yeast hosts.
Physical Model for the Evolution of the Genetic Code
NASA Astrophysics Data System (ADS)
Yamashita, Tatsuro; Narikiyo, Osamu
2011-12-01
Using the shape space of codons and tRNAs we give a physical description of the genetic code evolution on the basis of the codon capture and ambiguous intermediate scenarios in a consistent manner. In the lowest dimensional version of our description, a physical quantity, codon level is introduced. In terms of the codon levels two scenarios are typically classified into two different routes of the evolutional process. In the case of the ambiguous intermediate scenario we perform an evolutional simulation implemented cost selection of amino acids and confirm a rapid transition of the code change. Such rapidness reduces uncomfortableness of the non-unique translation of the code at intermediate state that is the weakness of the scenario. In the case of the codon capture scenario the survival against mutations under the mutational pressure minimizing GC content in genomes is simulated and it is demonstrated that cells which experience only neutral mutations survive.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rubinsztein, D.C.; Leggo, J.; Crow, T.J.
A new class of disease (including Huntington disease, Kennedy disease, and spinocerebellar ataxias types 1 and 3) results from abnormal expansions of CAG trinucleotides in the coding regions of genes. In all of these diseases the CAG repeats are thought to be translated into polyglutamine tracts. There is accumulating evidence arguing for CAG trinucleotide expansions as one of the causative disease mutations in schizophrenia and bipolar affective disorder. We and others believe that the TATA-binding protein (TBP) is an important candidate to investigate in these diseases as it contains a highly polymorphic stretch of glutamine codons, which are close tomore » the threshold length where the polyglutamine tracts start to be associated with disease. Thus, we examined the lengths of this polyglutamine repeat in normal unrelated East Anglians, South African Blacks, sub-Saharan Africans mainly from Nigeria, and Asian Indians. We also examined 43 bipolar affective disorder patients and 65 schizophrenic patients. The range of polyglutamine tract-lengths that we found in humans was from 26-42 codons. No patients with bipolar affective disorder and schizophrenia had abnormal expansions at this locus. 22 refs., 1 tab.« less
[Organization and expression of poliovirus genome].
Vevcherenko, S G
1984-01-01
In the present paper on the basis of analysis of literary data it is postulated that along with the AUG codon at N743 there exists a second initiation codon in the poliovirus RNA (the AUG codon at N586). The translation initiated at N586 can be transferred to the phase of the major reading frame by removing the small hairpin N732-N744 formed near the first initiation site, or by removing the small region N739-N745. In the first case at the boundary between the hypothetical leader peptide encoded by the 5'-terminus of the long, open reading frame of the spliced poliovirus RNA and the capsid protein VP4 must be the Gln-Gly proteolytic cleavage signal, and in the second case--the Tyr-Gly signal. In both cases the leader peptide can be chipped off by the virus specific proteinase. It is supposed that the exon-intronic structure of the poliovirus genome is needed for coordination of translation and transcription during the poliovirus reproduction cycle.
Emergent rules for codon choice elucidated by editing rare arginine codons in Escherichia coli
Napolitano, Michael G.; Landon, Matthieu; Gregg, Christopher J.; Lajoie, Marc J.; Govindarajan, Lakshmi; Mosberg, Joshua A.; Kuznetsov, Gleb; Goodman, Daniel B.; Vargas-Rodriguez, Oscar; Isaacs, Farren J.; Söll, Dieter; Church, George M.
2016-01-01
The degeneracy of the genetic code allows nucleic acids to encode amino acid identity as well as noncoding information for gene regulation and genome maintenance. The rare arginine codons AGA and AGG (AGR) present a case study in codon choice, with AGRs encoding important transcriptional and translational properties distinct from the other synonymous alternatives (CGN). We created a strain of Escherichia coli with all 123 instances of AGR codons removed from all essential genes. We readily replaced 110 AGR codons with the synonymous CGU codons, but the remaining 13 “recalcitrant” AGRs required diversification to identify viable alternatives. Successful replacement codons tended to conserve local ribosomal binding site-like motifs and local mRNA secondary structure, sometimes at the expense of amino acid identity. Based on these observations, we empirically defined metrics for a multidimensional “safe replacement zone” (SRZ) within which alternative codons are more likely to be viable. To evaluate synonymous and nonsynonymous alternatives to essential AGRs further, we implemented a CRISPR/Cas9-based method to deplete a diversified population of a wild-type allele, allowing us to evaluate exhaustively the fitness impact of all 64 codon alternatives. Using this method, we confirmed the relevance of the SRZ by tracking codon fitness over time in 14 different genes, finding that codons that fall outside the SRZ are rapidly depleted from a growing population. Our unbiased and systematic strategy for identifying unpredicted design flaws in synthetic genomes and for elucidating rules governing codon choice will be crucial for designing genomes exhibiting radically altered genetic codes. PMID:27601680
Leskiw, B K; Lawlor, E J; Fernandez-Abalos, J M; Chater, K F
1991-01-01
In Streptomyces coelicolor A3(2) and the related species Streptomyces lividans 66, aerial mycelium formation and antibiotic production are blocked by mutations in bldA, which specifies a tRNA(Leu)-like gene product which would recognize the UUA codon. Here we show that phenotypic expression of three disparate genes (carB, lacZ, and ampC) containing TTA codons depends strongly on bldA. Site-directed mutagenesis of carB, changing its two TTA codons to CTC (leucine) codons, resulted in bldA-independent expression; hence the bldA product is the principal tRNA for the UUA codon. Two other genes (hyg and aad) containing TTA codons show a medium-dependent reduction in phenotypic expression (hygromycin resistance and spectinomycin resistance, respectively) in bldA mutants. For hyg, evidence is presented that the UUA codon is probably being translated by a tRNA with an imperfectly matched anticodon, giving very low levels of gene product but relatively high resistance to hygromycin. It is proposed that TTA codons may be generally absent from genes expressed during vegetative growth and from the structural genes for differentiation and antibiotic production but present in some regulatory and resistance genes associated with the latter processes. The codon may therefore play a role in developmental regulation. Images PMID:1826053
The complete mitochondrial genome of the Longnose skate: Raja rhina (Rajiformes, Rajidae).
Jeong, Dageum; Lee, Youn-Ho
2015-02-01
The complete sequence of mitochondrial DNA of a longnose skate, Raja rhina was determined for the first time. It is 16,910 bp in length containing 2 rRNA, 22 tRNA and 13 protein coding genes with the same gene order and structure as those of other Rajidae species. The nucleotide of L-strand is composed of 30.1% A, 27.2% C, 28.5% T and 14.2% G, showing a slight A + T bias. The G is the least used base and markedly lower at the third codon position (5.4%). Twelve of the 13 protein coding genes use ATG as their start codon while the COX1 starts with GTG. As for stop codon, only ND4 shows incomplete stop codon TA. This mitogenome is the first report for a species of the genus Raja, and providing a valuable resource of genetic information for understanding the phylogenetic relationship and the evolution of the genus Raja as well as the family, Rajidae.
Garver, K.A.; Conway, C.M.; Kurath, G.
2006-01-01
A highly efficacious DNA vaccine against a fish rhabdovirus, infectious hematopoietic necrosis virus (IHNV), was mutated to introduce two stop codons to prevent glycoprotein translation while maintaining the plasmid DNA integrity and RNA transcription ability. The mutated plasmid vaccine, denoted pIHNw-G2stop, when injected intramuscularly into fish at high doses, lacked detectable glycoprotein expression in the injection site muscle, and did not provide protection against lethal virus challenge 7 days post-vaccination. These results suggest that the G-protein itself is required to stimulate the early protective antiviral response observed after vaccination with the nonmutated parental DNA vaccine. ?? Springer Science+Business Media, Inc. 2006.
Translational control of Nrf2 within the open reading frame
DOE Office of Scientific and Technical Information (OSTI.GOV)
Perez-Leal, Oscar, E-mail: operez@temple.edu; Barrero, Carlos A.; Merali, Salim, E-mail: smerali@temple.edu
2013-07-19
Highlights: •Identification of a novel Nrf2 translational repression mechanism. •The repressor is within the 3′ portion of the Nrf2 ORF. •The translation of Nrf2 or eGFP is reduced by the regulatory element. •The translational repression can be reversed with synonymous codon substitutions. •The molecular mechanism requires the mRNA sequence, but not the encoded amino acids. -- Abstract: Nuclear Factor Erythroid 2-Related Factor 2 (Nrf2) is a transcription factor that is essential for the regulation of an effective antioxidant and detoxifying response. The regulation of its activity can occur at transcription, translation and post-translational levels. Evidence suggests that under environmental stressmore » conditions, new synthesis of Nrf2 is required – a process that is regulated by translational control and is not fully understood. Here we described the identification of a novel molecular process that under basal conditions strongly represses the translation of Nrf2 within the open reading frame (ORF). This mechanism is dependent on the mRNA sequence within the 3′ portion of the ORF of Nrf2 but not in the encoded amino acid sequence. The Nrf2 translational repression can be reversed with the use of synonymous codon substitutions. This discovery suggests an additional layer of control to explain the reason for the low Nrf2 concentration under quiescent state.« less
Peroxisomal lactate dehydrogenase is generated by translational readthrough in mammals
Schueren, Fabian; Lingner, Thomas; George, Rosemol; Hofhuis, Julia; Dickel, Corinna; Gärtner, Jutta; Thoms, Sven
2014-01-01
Translational readthrough gives rise to low abundance proteins with C-terminal extensions beyond the stop codon. To identify functional translational readthrough, we estimated the readthrough propensity (RTP) of all stop codon contexts of the human genome by a new regression model in silico, identified a nucleotide consensus motif for high RTP by using this model, and analyzed all readthrough extensions in silico with a new predictor for peroxisomal targeting signal type 1 (PTS1). Lactate dehydrogenase B (LDHB) showed the highest combined RTP and PTS1 probability. Experimentally we show that at least 1.6% of the total cellular LDHB is targeted to the peroxisome by a conserved hidden PTS1. The readthrough-extended lactate dehydrogenase subunit LDHBx can also co-import LDHA, the other LDH subunit, into peroxisomes. Peroxisomal LDH is conserved in mammals and likely contributes to redox equivalent regeneration in peroxisomes. DOI: http://dx.doi.org/10.7554/eLife.03640.001 PMID:25247702
Upstream open reading frames regulate the expression of the nuclear Wnt13 isoforms
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tang Tao; Rector, Kyle; Barnett, Corey D.
2008-02-22
Wnt proteins control cell survival and cell fate during development. Although Wnt expression is tightly regulated in a spatio-temporal manner, the mechanisms involved both at the transcriptional and translational levels are poorly defined. We have identified a downstream translation initiation codon, AUG(+74), in Wnt13B and Wnt13C mRNAs responsible for the expression of Wnt13 nuclear forms. In this report, we demonstrate that the expression of the nuclear Wnt13C form is translationally regulated in response to stress and apoptosis. Though the 5'-leaders of both Wnt13C and Wnt13B mRNAs have an inhibitory effect on translation, they did not display an internal ribosome entrymore » site activity as demonstrated by dicistronic reporter assays. However, mutations or deletions of the upstream AUG(-99) and AUG(+1) initiation codons abrogate these translation inhibitory effects, demonstrating that Wnt13C expression is controlled by upstream open reading frames. Since long 5'-untranslated region with short upstream open reading frames characterize other Wnt transcripts, our present data on the translational control of Wnt13 expression open the way to further studies on the translation control of Wnt expression as a modulator of their subcellular localization and activity.« less
Massive programmed translational jumping in mitochondria
Lang, B. Franz; Jakubkova, Michaela; Hegedusova, Eva; Daoud, Rachid; Forget, Lise; Brejova, Brona; Vinar, Tomas; Kosa, Peter; Fricova, Dominika; Nebohacova, Martina; Griac, Peter; Tomaska, Lubomir; Burger, Gertraud; Nosek, Jozef
2014-01-01
Programmed translational bypassing is a process whereby ribosomes “ignore” a substantial interval of mRNA sequence. Although discovered 25 y ago, the only experimentally confirmed example of this puzzling phenomenon is expression of the bacteriophage T4 gene 60. Bypassing requires translational blockage at a “takeoff codon” immediately upstream of a stop codon followed by a hairpin, which causes peptidyl-tRNA dissociation and reassociation with a matching “landing triplet” 50 nt downstream, where translation resumes. Here, we report 81 translational bypassing elements (byps) in mitochondria of the yeast Magnusiomyces capitatus and demonstrate in three cases, by transcript analysis and proteomics, that byps are retained in mitochondrial mRNAs but not translated. Although mitochondrial byps resemble the bypass sequence in the T4 gene 60, they utilize unused codons instead of stops for translational blockage and have relaxed matching rules for takeoff/landing sites. We detected byp-like sequences also in mtDNAs of several Saccharomycetales, indicating that byps are mobile genetic elements. These byp-like sequences lack bypassing activity and are tolerated when inserted in-frame in variable protein regions. We hypothesize that byp-like elements have the potential to contribute to evolutionary diversification of proteins by adding new domains that allow exploration of new structures and functions. PMID:24711422
Translation fidelity coevolves with longevity.
Ke, Zhonghe; Mallik, Pramit; Johnson, Adam B; Luna, Facundo; Nevo, Eviatar; Zhang, Zhengdong D; Gladyshev, Vadim N; Seluanov, Andrei; Gorbunova, Vera
2017-10-01
Whether errors in protein synthesis play a role in aging has been a subject of intense debate. It has been suggested that rare mistakes in protein synthesis in young organisms may result in errors in the protein synthesis machinery, eventually leading to an increasing cascade of errors as organisms age. Studies that followed generally failed to identify a dramatic increase in translation errors with aging. However, whether translation fidelity plays a role in aging remained an open question. To address this issue, we examined the relationship between translation fidelity and maximum lifespan across 17 rodent species with diverse lifespans. To measure translation fidelity, we utilized sensitive luciferase-based reporter constructs with mutations in an amino acid residue critical to luciferase activity, wherein misincorporation of amino acids at this mutated codon re-activated the luciferase. The frequency of amino acid misincorporation at the first and second codon positions showed strong negative correlation with maximum lifespan. This correlation remained significant after phylogenetic correction, indicating that translation fidelity coevolves with longevity. These results give new life to the role of protein synthesis errors in aging: Although the error rate may not significantly change with age, the basal rate of translation errors is important in defining lifespan across mammals. © 2017 The Authors. Aging Cell published by the Anatomical Society and John Wiley & Sons Ltd.
Guo, Kun; Chen, Jun; Xu, Chang-Qing; Qiao, Hai-Li; Xu, Rong; Zhao, Xiang-Jian
2016-05-01
We sequenced the complete mitochondrial genome of the longhorn beetle, Xylotrechus grayii. The total length of the X. grayii mitogenome was 15,540 bp with an A + T content of 75.29%, consisting of 13 protein-coding genes (PCGs), 22 tRNA genes, 2 rRNA genes and an A + T-rich region. All the genes were arranged in the same order as that of the ancestral insect. All PCGs started with a typical ATN codon except for cox1 and nad1, which used TTG as start codon. Ten out of 13 PCGs terminated with incomplete codons (TA or T). The A + T-rich region was 893 bp in length with an A + T content of 85.89 %.
Wada, Takahito; Haddad, Marie Reine; Yi, Ling; Murakami, Tomomi; Sasaki, Akiko; Shimbo, Hiroko; Kodama, Hiroko; Osaka, Hitoshi; Kaler, Stephen G
2014-04-01
Determining the relationship between clinical phenotype and genotype in genetic diseases is important in clinical practice. In general, frameshift mutations are expected to produce premature termination codons, leading to production of mutant transcripts destined for degradation by nonsense-mediated decay. In X-linked recessive diseases, male patients with frameshift mutations typically have a severe or even lethal phenotype. We report a case of a 17-month-old boy with Menkes disease (NIM #309400), an X-linked recessive copper metabolism disorder caused by mutations in the ATP7A copper transporter gene. He exhibited an unexpectedly late onset and experienced milder symptoms. His genomic DNA showed a de novo two-nucleotide deletion in exon 4 of ATP7A, predicting a translational frameshift and premature stop codon, and a classic severe phenotype. Characterization of his ATP7A mRNA showed no abnormal splicing. We speculate that translation reinitiation could occur downstream to the premature termination codon and produce a partially functional ATP7A protein. Study of the child's fibroblasts found no evidence of translation reinitiation; however, the possibility remains that this phenomenon occurred in neural tissues and influenced the clinical phenotype. Copyright © 2014 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Baserga, S.J.; Benz, E.J. Jr.
A number of premature translation termination mutations (nonsense mutations) have been described in the human {alpha}- and {beta}-globin genes. Studies on mRNA isolated from patients with {beta}{sup 0}-thalassemia have shown that for both the {beta}-17 and the {beta}-39 mutations less than normal levels of {beta}-globin mRNA accumulate in peripheral blood cells. (The codon at which the mutation occurs designates the name of the mutation; there are 146 codons in human {beta}-globin mRNA). In vitro studies using the cloned {beta}-39 gene have reproduced this effect in a heterologous transfection system and have suggested that the defect resides in intranuclear metabolism. Themore » authors have asked if this phenomenon of decreased mRNA accumulation is a general property of nonsense mutations and if the effect depends on the location or the type of mutation. Toward this end, they have studied the effect of five nonsense mutations and two missense mutations on the expression of human {beta}-globin mRNA in a heterologous transfection system. In all cases studied, the presence of a translation termination codon correlates with a decrease in the steady-state level of mRNA. The data suggest that the metabolism of a mammalian mRNA is affected by the presence of a mutation that affects translation.« less
Sroubek, Jakub; Krishnan, Yamini; McDonald, Thomas V.
2013-01-01
Human ether-á-gogo-related gene (HERG) encodes a potassium channel that is highly susceptible to deleterious mutations resulting in susceptibility to fatal cardiac arrhythmias. Most mutations adversely affect HERG channel assembly and trafficking. Why the channel is so vulnerable to missense mutations is not well understood. Since nothing is known of how mRNA structural elements factor in channel processing, we synthesized a codon-modified HERG cDNA (HERG-CM) where the codons were synonymously changed to reduce GC content, secondary structure, and rare codon usage. HERG-CM produced typical IKr-like currents; however, channel synthesis and processing were markedly different. Translation efficiency was reduced for HERG-CM, as determined by heterologous expression, in vitro translation, and polysomal profiling. Trafficking efficiency to the cell surface was greatly enhanced, as assayed by immunofluorescence, subcellular fractionation, and surface labeling. Chimeras of HERG-NT/CM indicated that trafficking efficiency was largely dependent on 5′ sequences, while translation efficiency involved multiple areas. These results suggest that HERG translation and trafficking rates are independently governed by noncoding information in various regions of the mRNA molecule. Noncoding information embedded within the mRNA may play a role in the pathogenesis of hereditary arrhythmia syndromes and could provide an avenue for targeted therapeutics.—Sroubek, J., Krishnan, Y., McDonald, T V. Sequence- and structure-specific elements of HERG mRNA determine channel synthesis and trafficking efficiency. PMID:23608144
Minigene-like inhibition of protein synthesis mediated by hungry codons near the start codon
Jacinto-Loeza, Eva; Vivanco-Domínguez, Serafín; Guarneros, Gabriel; Hernández-Sánchez, Javier
2008-01-01
Rare AGA or AGG codons close to the initiation codon inhibit protein synthesis by a tRNA-sequestering mechanism as toxic minigenes do. To further understand this mechanism, a parallel analysis of protein synthesis and peptidyl-tRNA accumulation was performed using both a set of lacZ constructs where AGAAGA codons were moved codon by codon from +2, +3 up to +7, +8 positions and a series of 3–8 codon minigenes containing AGAAGA codons before the stop codon. β-Galactosidase synthesis from the AGAAGA lacZ constructs (in a Pth defective in vitro system without exogenous tRNA) diminished as the AGAAGA codons were closer to AUG codon. Likewise, β-galactosidase expression from the reporter +7 AGA lacZ gene (plus tRNA, 0.25 μg/μl) waned as the AGAAGAUAA minigene shortened. Pth counteracted both the length-dependent minigene effect on the expression of β-galactosidase from the +7 AGA lacZ reporter gene and the positional effect from the AGAAGA lacZ constructs. The +2, +3 AGAAGA lacZ construct and the shortest +2, +3 AGAAGAUAA minigene accumulated the highest percentage of peptidyl-tRNAArg4. These observations lead us to propose that hungry codons at early positions, albeit with less strength, inhibit protein synthesis by a minigene-like mechanism involving accumulation of peptidyl-tRNA. PMID:18583364
Global translational impacts of the loss of the tRNA modification t6A in yeast.
Thiaville, Patrick C; Legendre, Rachel; Rojas-Benítez, Diego; Baudin-Baillieu, Agnès; Hatin, Isabelle; Chalancon, Guilhem; Glavic, Alvaro; Namy, Olivier; de Crécy-Lagard, Valérie
2016-01-01
The universal tRNA modification t 6 A is found at position 37 of nearly all tRNAs decoding ANN codons. The absence of t 6 A 37 leads to severe growth defects in baker's yeast, phenotypes similar to those caused by defects in mcm 5 s 2 U 34 synthesis. Mutants in mcm 5 s 2 U 34 can be suppressed by overexpression of tRNA Lys UUU , but we show t 6 A phenotypes could not be suppressed by expressing any individual ANN decoding tRNA, and t 6 A and mcm 5 s 2 U are not determinants for each other's formation. Our results suggest that t 6 A deficiency, like mcm 5 s 2 U deficiency, leads to protein folding defects, and show that the absence of t 6 A led to stress sensitivities (heat, ethanol, salt) and sensitivity to TOR pathway inhibitors. Additionally, L-homoserine suppressed the slow growth phenotype seen in t 6 A-deficient strains, and proteins aggregates and Advanced Glycation End-products (AGEs) were increased in the mutants. The global consequences on translation caused by t 6 A absence were examined by ribosome profiling. Interestingly, the absence of t 6 A did not lead to global translation defects, but did increase translation initiation at upstream non-AUG codons and increased frame-shifting in specific genes. Analysis of codon occupancy rates suggests that one of the major roles of t 6 A is to homogenize the process of elongation by slowing the elongation rate at codons decoded by high abundance tRNAs and I 34 :C 3 pairs while increasing the elongation rate of rare tRNAs and G 34 :U 3 pairs. This work reveals that the consequences of t 6 A absence are complex and multilayered and has set the stage to elucidate the molecular basis of the observed phenotypes.
Carbone, Alessandra; Madden, Richard
2005-10-01
Codon bias is related to metabolic functions in translationally biased organisms, and two facts are argued about. First, genes with high codon bias describe in meaningful ways the metabolic characteristics of the organism; important metabolic pathways corresponding to crucial characteristics of the lifestyle of an organism, such as photosynthesis, nitrification, anaerobic versus aerobic respiration, sulfate reduction, methanogenesis, and others, happen to involve especially biased genes. Second, gene transcriptional levels of sets of experiments representing a significant variation of biological conditions strikingly confirm, in the case of Saccharomyces cerevisiae, that metabolic preferences are detectable by purely statistical analysis: the high metabolic activity of yeast during fermentation is encoded in the high bias of enzymes involved in the associated pathways, suggesting that this genome was affected by a strong evolutionary pressure that favored a predominantly fermentative metabolism of yeast in the wild. The ensemble of metabolic pathways involving enzymes with high codon bias is rather well defined and remains consistent across many species, even those that have not been considered as translationally biased, such as Helicobacter pylori, for instance, reveal some weak form of translational bias for this genome. We provide numerical evidence, supported by experimental data, of these facts and conclude that the metabolic networks of translationally biased genomes, observable today as projections of eons of evolutionary pressure, can be analyzed numerically and predictions of the role of specific pathways during evolution can be derived. The new concepts of Comparative Pathway Index, used to compare organisms with respect to their metabolic networks, and Evolutionary Pathway Index, used to detect evolutionarily meaningful bias in the genetic code from transcriptional data, are introduced.
Hda Monomerization by ADP Binding Promotes Replicase Clamp-mediated DnaA-ATP Hydrolysis*S⃞
Su'etsugu, Masayuki; Nakamura, Kenta; Keyamura, Kenji; Kudo, Yuka; Katayama, Tsutomu
2008-01-01
ATP-DnaA is the initiator of chromosomal replication in Escherichia coli, and the activity of DnaA is regulated by the regulatory inactivation of the DnaA (RIDA) system. In this system, the Hda protein promotes DnaA-ATP hydrolysis to produce inactive ADP-DnaA in a mechanism that is mediated by the DNA-loaded form of the replicase sliding clamp. In this study, we first revealed that hda translation uses an unusual initiation codon, CUG, located downstream of the annotated initiation codon. The CUG initiation codon could be used for restricting the Hda level, as this initiation codon has a low translation efficiency, and the cellular Hda level is only ∼100 molecules per cell. Hda translated using the correct reading frame was purified and found to have a high RIDA activity in vitro. Moreover, we found that Hda has a high affinity for ADP but not for other nucleotides, including ATP. ADP-Hda was active in the RIDA system in vitro and stable in a monomeric state, whereas apo-Hda formed inactive homomultimers. Both ADP-Hda and apo-Hda could form complexes with the DNA-loaded clamp; however, only ADP-Hda-DNA-clamp complexes were highly functional in the following interaction with DnaA. Formation of ADP-Hda was also observed in vivo, and mutant analysis suggested that ADP binding is crucial for cellular Hda activity. Thus, we propose that ADP is a crucial Hda ligand that promotes the activated conformation of the protein. ADP-dependent monomerization might enable the arginine finger of the Hda AAA+ domain to be accessible to ATP bound to the DnaA AAA+ domain. PMID:18977760
Hda monomerization by ADP binding promotes replicase clamp-mediated DnaA-ATP hydrolysis.
Su'etsugu, Masayuki; Nakamura, Kenta; Keyamura, Kenji; Kudo, Yuka; Katayama, Tsutomu
2008-12-26
ATP-DnaA is the initiator of chromosomal replication in Escherichia coli, and the activity of DnaA is regulated by the regulatory inactivation of the DnaA (RIDA) system. In this system, the Hda protein promotes DnaA-ATP hydrolysis to produce inactive ADP-DnaA in a mechanism that is mediated by the DNA-loaded form of the replicase sliding clamp. In this study, we first revealed that hda translation uses an unusual initiation codon, CUG, located downstream of the annotated initiation codon. The CUG initiation codon could be used for restricting the Hda level, as this initiation codon has a low translation efficiency, and the cellular Hda level is only approximately 100 molecules per cell. Hda translated using the correct reading frame was purified and found to have a high RIDA activity in vitro. Moreover, we found that Hda has a high affinity for ADP but not for other nucleotides, including ATP. ADP-Hda was active in the RIDA system in vitro and stable in a monomeric state, whereas apo-Hda formed inactive homomultimers. Both ADP-Hda and apo-Hda could form complexes with the DNA-loaded clamp; however, only ADP-Hda-DNA-clamp complexes were highly functional in the following interaction with DnaA. Formation of ADP-Hda was also observed in vivo, and mutant analysis suggested that ADP binding is crucial for cellular Hda activity. Thus, we propose that ADP is a crucial Hda ligand that promotes the activated conformation of the protein. ADP-dependent monomerization might enable the arginine finger of the Hda AAA+ domain to be accessible to ATP bound to the DnaA AAA+ domain.
The complete mitochondrial genome of Hydra vulgaris (Hydroida: Hydridae).
Pan, Hong-Chun; Fang, Hong-Yan; Li, Shi-Wei; Liu, Jun-Hong; Wang, Ying; Wang, An-Tai
2014-12-01
The complete mitochondrial genome of Hydra vulgaris (Hydroida: Hydridae) is composed of two linear DNA molecules. The mitochondrial DNA (mtDNA) molecule 1 is 8010 bp long and contains six protein-coding genes, large subunit rRNA, methionine and tryptophan tRNAs, two pseudogenes consisting respectively of a partial copy of COI, and terminal sequences at two ends of the linear mtDNA, while the mtDNA molecule 2 is 7576 bp long and contains seven protein-coding genes, small subunit rRNA, methionine tRNA, a pseudogene consisting of a partial copy of COI and terminal sequences at two ends of the linear mtDNA. COI gene begins with GTG as start codon, whereas other 12 protein-coding genes start with a typical ATG initiation codon. In addition, all protein-coding genes are terminated with TAA as stop codon.
Voigt, Karsten; Sharma, Cynthia M; Mitschke, Jan; Joke Lambrecht, S; Voß, Björn; Hess, Wolfgang R; Steglich, Claudia
2014-01-01
Prochlorococcus is a genus of abundant and ecologically important marine cyanobacteria. Here, we present a comprehensive comparison of the structure and composition of the transcriptomes of two Prochlorococcus strains, which, despite their similarities, have adapted their gene pool to specific environmental constraints. We present genome-wide maps of transcriptional start sites (TSS) for both organisms, which are representatives of the two most diverse clades within the two major ecotypes adapted to high- and low-light conditions, respectively. Our data suggest antisense transcription for three-quarters of all genes, which is substantially more than that observed in other bacteria. We discovered hundreds of TSS within genes, most notably within 16 of the 29 prochlorosin genes, in strain MIT9313. A direct comparison revealed very little conservation in the location of TSS and the nature of non-coding transcripts between both strains. We detected extremely short 5′ untranslated regions with a median length of only 27 and 29 nt for MED4 and MIT9313, respectively, and for 8% of all protein-coding genes the median distance to the start codon is only 10 nt or even shorter. These findings and the absence of an obvious Shine–Dalgarno motif suggest that leaderless translation and ribosomal protein S1-dependent translation constitute alternative mechanisms for translation initiation in Prochlorococcus. We conclude that genome-wide antisense transcription is a major component of the transcriptional output from these relatively small genomes and that a hitherto unrecognized high degree of complexity and variability of gene expression exists in their transcriptional architecture. PMID:24739626
Time to Translate: Deciphering the Codon in the Classroom
ERIC Educational Resources Information Center
Firooznia, Fardad
2015-01-01
I describe and evaluate a fun and simple role-playing exercise that allows students to actively work through the process of translation. This exercise can easily be completed during a 50-minute class period, with time to review the steps and contemplate complications such as the effects of various types of mutations.
Translational autocontrol of the Escherichia coli ribosomal protein S15.
Portier, C; Dondon, L; Grunberg-Manago, M
1990-01-20
When rpsO, the gene encoding the ribosomal protein S15 in Escherichia coli, is carried by a multicopy plasmid, the mRNA synthesis rate of S15 increases with the gene dosage but the rate of synthesis of S15 does not rise. A translational fusion between S15 and beta-galactosidase was introduced on the chromosome in a delta lac strain and the expression of beta-galactosidase studied under different conditions. The presence of S15 in trans represses the beta-galactosidase level five- to sixfold, while the synthesis rate of the S15-beta-galactosidase mRNA decreases by only 30 to 50%. These data indicate that S15 is subject to autogenous translational control. Derepressed mutants were isolated and sequenced. All the point mutations map in the second codon of S15, suggesting a location for the operator site that is very near to the translation initiation codon. However, the creation of deletion mutations shows that the operator extends into the 5' non-coding part of the message, thus overlapping the ribosome loading site.
The complete mitochondrial genome of the Korean skate: Hongeo koreana (Rajiformes, Rajidae).
Jeong, Dageum; Kim, Sung; Kim, Choong-Gon; Lee, Youn-Ho
2014-12-01
The complete mitochondrial genome of the Korean skate, Hongeo koreana, the sole member of its genus, is investigated for the first time. The genome consists of 16,906 bp in length including 2 rRNA, 22 tRNA and 13 protein coding genes with the same gene order and structure of the genome as those of other Rajidae species. The overall nucleotide composition of the L-strand is A = 29.8%, C = 27.9%, T = 27.9% and G = 14.3%, showing a high A + T bias. The anti-G bias (6.0%) is more significant in the third codon position. Twelve of the 13 protein-coding genes use ATG as their start codon while the COX1 gene starts with GTG. For stop codon, ND3 and ND4 genes show incomplete stop codon T. The mitogenome sequence of H. koreana will provide important information on the evolution and the phylogenetic relation of the genus Hongeo in relation to the other genera of the family Rajidae.
Rozov, Alexey; Demeshkina, Natalia; Khusainov, Iskander; Westhof, Eric; Yusupov, Marat; Yusupova, Gulnara
2016-01-01
Posttranscriptional modifications at the wobble position of transfer RNAs play a substantial role in deciphering the degenerate genetic code on the ribosome. The number and variety of modifications suggest different mechanisms of action during messenger RNA decoding, of which only a few were described so far. Here, on the basis of several 70S ribosome complex X-ray structures, we demonstrate how Escherichia coli tRNALysUUU with hypermodified 5-methylaminomethyl-2-thiouridine (mnm5s2U) at the wobble position discriminates between cognate codons AAA and AAG, and near-cognate stop codon UAA or isoleucine codon AUA, with which it forms pyrimidine–pyrimidine mismatches. We show that mnm5s2U forms an unusual pair with guanosine at the wobble position that expands general knowledge on the degeneracy of the genetic code and specifies a powerful role of tRNA modifications in translation. Our models consolidate the translational fidelity mechanism proposed previously where the steric complementarity and shape acceptance dominate the decoding mechanism. PMID:26791911
Rozov, Alexey; Demeshkina, Natalia; Khusainov, Iskander; Westhof, Eric; Yusupov, Marat; Yusupova, Gulnara
2016-01-21
Posttranscriptional modifications at the wobble position of transfer RNAs play a substantial role in deciphering the degenerate genetic code on the ribosome. The number and variety of modifications suggest different mechanisms of action during messenger RNA decoding, of which only a few were described so far. Here, on the basis of several 70S ribosome complex X-ray structures, we demonstrate how Escherichia coli tRNA(Lys)(UUU) with hypermodified 5-methylaminomethyl-2-thiouridine (mnm(5)s(2)U) at the wobble position discriminates between cognate codons AAA and AAG, and near-cognate stop codon UAA or isoleucine codon AUA, with which it forms pyrimidine-pyrimidine mismatches. We show that mnm(5)s(2)U forms an unusual pair with guanosine at the wobble position that expands general knowledge on the degeneracy of the genetic code and specifies a powerful role of tRNA modifications in translation. Our models consolidate the translational fidelity mechanism proposed previously where the steric complementarity and shape acceptance dominate the decoding mechanism.
NASA Astrophysics Data System (ADS)
Rozov, Alexey; Demeshkina, Natalia; Khusainov, Iskander; Westhof, Eric; Yusupov, Marat; Yusupova, Gulnara
2016-01-01
Posttranscriptional modifications at the wobble position of transfer RNAs play a substantial role in deciphering the degenerate genetic code on the ribosome. The number and variety of modifications suggest different mechanisms of action during messenger RNA decoding, of which only a few were described so far. Here, on the basis of several 70S ribosome complex X-ray structures, we demonstrate how Escherichia coli tRNALysUUU with hypermodified 5-methylaminomethyl-2-thiouridine (mnm5s2U) at the wobble position discriminates between cognate codons AAA and AAG, and near-cognate stop codon UAA or isoleucine codon AUA, with which it forms pyrimidine-pyrimidine mismatches. We show that mnm5s2U forms an unusual pair with guanosine at the wobble position that expands general knowledge on the degeneracy of the genetic code and specifies a powerful role of tRNA modifications in translation. Our models consolidate the translational fidelity mechanism proposed previously where the steric complementarity and shape acceptance dominate the decoding mechanism.
Yakhnin, Helen; Baker, Carol S.; Berezin, Igor; Evangelista, Michael A.; Rassin, Alisa; Romeo, Tony; Babitzke, Paul
2011-01-01
The RNA binding protein CsrA is the central component of a conserved global regulatory system that activates or represses gene expression posttranscriptionally. In every known example of CsrA-mediated translational control, CsrA binds to the 5′ untranslated region of target transcripts, thereby repressing translation initiation and/or altering the stability of the RNA. Furthermore, with few exceptions, repression by CsrA involves binding directly to the Shine-Dalgarno sequence and blocking ribosome binding. sdiA encodes the quorum-sensing receptor for N-acyl-l-homoserine lactone in Escherichia coli. Because sdiA indirectly stimulates transcription of csrB, which encodes a small RNA (sRNA) antagonist of CsrA, we further explored the relationship between sdiA and the Csr system. Primer extension analysis revealed four putative transcription start sites within 85 nucleotides of the sdiA initiation codon. Potential σ70-dependent promoters were identified for each of these primer extension products. In addition, two CsrA binding sites were predicted in the initially translated region of sdiA. Expression of chromosomally integrated sdiA′-′lacZ translational fusions containing the entire promoter and CsrA binding site regions indicates that CsrA represses sdiA expression. The results from gel shift and footprint studies demonstrate that tight binding of CsrA requires both of these sites. Furthermore, the results from toeprint and in vitro translation experiments indicate that CsrA represses translation of sdiA by directly competing with 30S ribosomal subunit binding. Thus, this represents the first example of CsrA preventing translation by interacting solely within the coding region of an mRNA target. PMID:21908661
Steel, L F; Telly, D L; Leonard, J; Rice, B A; Monks, B; Sawicki, J A
1996-10-01
Murine c-mos transcripts isolated from testes have 5'-untranslated regions (5'UTRs) of approximately 300 nucleotides with a series of four overlapping open reading frames (ORFs) upstream of the AUG codon that initiates the Mos ORF. Ovarian c-mos transcripts have shorter 5'UTRs (70-80 nucleotides) and contain only 1-2 of the upstream ORFs (uORFs). To test whether these 5'UTRs affect translational efficiency, we have constructed plasmids for the expression of chimeric transcripts with a mos-derived 5'UTR fused to the Escherichia coli beta-galactosidase coding region. Translational efficiency has been evaluated by measuring beta-galactosidase activity NIH3T3 cells transiently transfected with these plasmids and with plasmids where various mutations have been introduced into the 5'UTR. We show that the 5'UTR characteristic of testis-specific c-mos mRNA strongly represses translation relative to the translation of transcripts that contain a 5'UTR derived from beta-globin mRNA, and this is mainly due to the four uORFs. Each of the four upstream AUG triplets can be recognized as a start site for translation, and no single uAUG dominates the repressive effect. The uORFs repress translation by a mechanism that is not affected by the amino acid sequence in the COOH-terminal region of the uORF-encoded peptides. The very short uORF (AUGUGA) present in ovary-specific transcripts does not repress translation. Staining of testis sections from transgenic mice carrying chimeric beta-galactosidase transgene constructs, which contain a mos 5'UTR with or without the uATGs, suggests that the uORFs can dramatically change the pattern of expression in spermatogenic cells.
Evaluating Sense Codon Reassignment with a Simple Fluorescence Screen.
Biddle, Wil; Schmitt, Margaret A; Fisk, John D
2015-12-22
Understanding the interactions that drive the fidelity of the genetic code and the limits to which modifications can be made without breaking the translational system has practical implications for understanding the molecular mechanisms of evolution as well as expanding the set of encodable amino acids, particularly those with chemistries not provided by Nature. Because 61 sense codons encode 20 amino acids, reassigning the meaning of sense codons provides an avenue for biosynthetic modification of proteins, furthering both fundamental and applied biochemical research. We developed a simple screen that exploits the absolute requirement for fluorescence of an active site tyrosine in green fluorescent protein (GFP) to probe the pliability of the degeneracy of the genetic code. Our screen monitors the restoration of the fluorophore of GFP by incorporation of a tyrosine in response to a sense codon typically assigned another meaning in the genetic code. We evaluated sense codon reassignment at four of the 21 sense codons read through wobble interactions in Escherichia coli using the Methanocaldococcus jannaschii orthogonal tRNA/aminoacyl tRNA synthetase pair originally developed and commonly used for amber stop codon suppression. By changing only the anticodon of the orthogonal tRNA, we achieved sense codon reassignment efficiencies between 1% (Phe UUU) and 6% (Lys AAG). Each of the orthogonal tRNAs preferentially decoded the codon traditionally read via a wobble interaction in E. coli with the exception of the orthogonal tRNA with an AUG anticodon, which incorporated tyrosine in response to both the His CAU and His CAC codons with approximately equal frequencies. We applied our screen in a high-throughput manner to evaluate a 10(9)-member combined tRNA/aminoacyl tRNA synthetase library to identify improved sense codon reassigning variants for the Lys AAG codon. A single rapid screen with the ability to broadly evaluate reassignable codons will facilitate identification and improvement of the combinations of sense codons and orthogonal pairs that display efficient reassignment.
Henke, Nadja A; Heider, Sabine A E; Peters-Wendisch, Petra; Wendisch, Volker F
2016-06-30
Astaxanthin, a red C40 carotenoid, is one of the most abundant marine carotenoids. It is currently used as a food and feed additive in a hundred-ton scale and is furthermore an attractive component for pharmaceutical and cosmetic applications with antioxidant activities. Corynebacterium glutamicum, which naturally synthesizes the yellow C50 carotenoid decaprenoxanthin, is an industrially relevant microorganism used in the million-ton amino acid production. In this work, engineering of a genome-reduced C. glutamicum with optimized precursor supply for astaxanthin production is described. This involved expression of heterologous genes encoding for lycopene cyclase CrtY, β-carotene ketolase CrtW, and hydroxylase CrtZ. For balanced expression of crtW and crtZ their translation initiation rates were varied in a systematic approach using different ribosome binding sites, spacing, and translational start codons. Furthermore, β-carotene ketolases and hydroxylases from different marine bacteria were tested with regard to efficient astaxanthin production in C. glutamicum. In shaking flasks, the C. glutamicum strains developed here overproduced astaxanthin with volumetric productivities up to 0.4 mg·L(-1)·h(-1) which are competitive with current algae-based production. Since C. glutamicum can grow to high cell densities of up to 100 g cell dry weight (CDW)·L(-1), the recombinant strains developed here are a starting point for astaxanthin production by C. glutamicum.
Henke, Nadja A.; Heider, Sabine A. E.; Peters-Wendisch, Petra; Wendisch, Volker F.
2016-01-01
Astaxanthin, a red C40 carotenoid, is one of the most abundant marine carotenoids. It is currently used as a food and feed additive in a hundred-ton scale and is furthermore an attractive component for pharmaceutical and cosmetic applications with antioxidant activities. Corynebacterium glutamicum, which naturally synthesizes the yellow C50 carotenoid decaprenoxanthin, is an industrially relevant microorganism used in the million-ton amino acid production. In this work, engineering of a genome-reduced C. glutamicum with optimized precursor supply for astaxanthin production is described. This involved expression of heterologous genes encoding for lycopene cyclase CrtY, β-carotene ketolase CrtW, and hydroxylase CrtZ. For balanced expression of crtW and crtZ their translation initiation rates were varied in a systematic approach using different ribosome binding sites, spacing, and translational start codons. Furthermore, β-carotene ketolases and hydroxylases from different marine bacteria were tested with regard to efficient astaxanthin production in C. glutamicum. In shaking flasks, the C. glutamicum strains developed here overproduced astaxanthin with volumetric productivities up to 0.4 mg·L−1·h−1 which are competitive with current algae-based production. Since C. glutamicum can grow to high cell densities of up to 100 g cell dry weight (CDW)·L−1, the recombinant strains developed here are a starting point for astaxanthin production by C. glutamicum. PMID:27376307
Liu, Yingqi; Zhu, Zixiang; Zhang, Miaotao; Zheng, Haixue
2015-10-28
Foot-and-mouth disease virus (FMDV) leader protein (L(pro)) is a papain-like proteinase, which plays an important role in FMDV pathogenesis. L(pro) exists as two forms, Lab and Lb, due to translation being initiated from two different start codons separated by 84 nucleotides. L(pro) self-cleaves from the nascent viral polyprotein precursor as the first mature viral protein. In addition to its role as a viral proteinase, L(pro) also has the ability to antagonize host antiviral effects. To promote FMDV replication, L(pro) can suppress host antiviral responses by three different mechanisms: (1) cleavage of eukaryotic translation initiation factor 4 γ (eIF4G) to shut off host protein synthesis; (2) inhibition of host innate immune responses through restriction of interferon-α/β production; and (3) L(pro) can also act as a deubiquitinase and catalyze deubiquitination of innate immune signaling molecules. In the light of recent functional and biochemical findings regarding L(pro), this review introduces the basic properties of L(pro) and the mechanisms by which it antagonizes host antiviral responses.
Rewiring protein synthesis: From natural to synthetic amino acids.
Fan, Yongqiang; Evans, Christopher R; Ling, Jiqiang
2017-11-01
The protein synthesis machinery uses 22 natural amino acids as building blocks that faithfully decode the genetic information. Such fidelity is controlled at multiple steps and can be compromised in nature and in the laboratory to rewire protein synthesis with natural and synthetic amino acids. This review summarizes the major quality control mechanisms during protein synthesis, including aminoacyl-tRNA synthetases, elongation factors, and the ribosome. We will discuss evolution and engineering of such components that allow incorporation of natural and synthetic amino acids at positions that deviate from the standard genetic code. The protein synthesis machinery is highly selective, yet not fixed, for the correct amino acids that match the mRNA codons. Ambiguous translation of a codon with multiple amino acids or complete reassignment of a codon with a synthetic amino acid diversifies the proteome. Expanding the genetic code with synthetic amino acids through rewiring protein synthesis has broad applications in synthetic biology and chemical biology. Biochemical, structural, and genetic studies of the translational quality control mechanisms are not only crucial to understand the physiological role of translational fidelity and evolution of the genetic code, but also enable us to better design biological parts to expand the proteomes of synthetic organisms. This article is part of a Special Issue entitled "Biochemistry of Synthetic Biology - Recent Developments" Guest Editor: Dr. Ilka Heinemann and Dr. Patrick O'Donoghue. Copyright © 2017 Elsevier B.V. All rights reserved.
2007-01-01
Background The usage of synonymous codons shows considerable variation among mammalian genes. How and why this usage is non-random are fundamental biological questions and remain controversial. It is also important to explore whether mammalian genes that are selectively expressed at different developmental stages bear different molecular features. Results In two models of mouse stem cell differentiation, we established correlations between codon usage and the patterns of gene expression. We found that the optimal codons exhibited variation (AT- or GC-ending codons) in different cell types within the developmental hierarchy. We also found that genes that were enriched (developmental-pivotal genes) or specifically expressed (developmental-specific genes) at different developmental stages had different patterns of codon usage and local genomic GC (GCg) content. Moreover, at the same developmental stage, developmental-specific genes generally used more GC-ending codons and had higher GCg content compared with developmental-pivotal genes. Further analyses suggest that the model of translational selection might be consistent with the developmental stage-related patterns of codon usage, especially for the AT-ending optimal codons. In addition, our data show that after human-mouse divergence, the influence of selective constraints is still detectable. Conclusion Our findings suggest that developmental stage-related patterns of gene expression are correlated with codon usage (GC3) and GCg content in stem cell hierarchies. Moreover, this paper provides evidence for the influence of natural selection at synonymous sites in the mouse genome and novel clues for linking the molecular features of genes to their patterns of expression during mammalian ontogenesis. PMID:17349061
Gubbens, Jacob; Kim, Soo Jung; Yang, Zhongying; Johnson, Arthur E.; Skach, William R.
2010-01-01
Amber suppressor tRNAs are widely used to incorporate nonnatural amino acids into proteins to serve as probes of structure, environment, and function. The utility of this approach would be greatly enhanced if multiple probes could be simultaneously incorporated at different locations in the same protein without other modifications. Toward this end, we have developed amber, opal, and ochre suppressor tRNAs derived from Escherichia coli, and yeast tRNACys that incorporate a chemically modified cysteine residue with high selectivity at the cognate UAG, UGA, and UAA stop codons in an in vitro translation system. These synthetic tRNAs were aminoacylated in vitro, and the labile aminoacyl bond was stabilized by covalently attaching a fluorescent dye to the cysteine sulfhydryl group. Readthrough efficiency (amber > opal > ochre) was substantially improved by eRF1/eRF3 inhibition with an RNA aptamer, thus overcoming an intrinsic hierarchy in stop codon selection that limits UGA and UAA termination suppression in higher eukaryotic translation systems. This approach now allows concurrent incorporation of two different modified amino acids at amber and opal codons with a combined apparent readthrough efficiency of up to 25% when compared with the parent protein lacking a stop codon. As such, it significantly expands the possibilities for incorporating nonnative amino acids for protein structure/function studies. PMID:20581130
Codon Usage Bias and Determining Forces in Taenia solium Genome.
Yang, Xing; Ma, Xusheng; Luo, Xuenong; Ling, Houjun; Zhang, Xichen; Cai, Xuepeng
2015-12-01
The tapeworm Taenia solium is an important human zoonotic parasite that causes great economic loss and also endangers public health. At present, an effective vaccine that will prevent infection and chemotherapy without any side effect remains to be developed. In this study, codon usage patterns in the T. solium genome were examined through 8,484 protein-coding genes. Neutrality analysis showed that T. solium had a narrow GC distribution, and a significant correlation was observed between GC12 and GC3. Examination of an NC (ENC vs GC3s)-plot showed a few genes on or close to the expected curve, but the majority of points with low-ENC (the effective number of codons) values were detected below the expected curve, suggesting that mutational bias plays a major role in shaping codon usage. The Parity Rule 2 plot (PR2) analysis showed that GC and AT were not used proportionally. We also identified 26 optimal codons in the T. solium genome, all of which ended with either a G or C residue. These optimal codons in the T. solium genome are likely consistent with tRNAs that are highly expressed in the cell, suggesting that mutational and translational selection forces are probably driving factors of codon usage bias in the T. solium genome.
Ribosome reinitiation at leader peptides increases translation of bacterial proteins.
Korolev, Semen A; Zverkov, Oleg A; Seliverstov, Alexandr V; Lyubetsky, Vassily A
2016-04-16
Short leader genes usually do not encode stable proteins, although their importance in expression control of bacterial genomes is widely accepted. Such genes are often involved in the control of attenuation regulation. However, the abundance of leader genes suggests that their role in bacteria is not limited to regulation. Specifically, we hypothesize that leader genes increase the expression of protein-coding (structural) genes via ribosome reinitiation at the leader peptide in the case of a short distance between the stop codon of the leader gene and the start codon of the structural gene. For instance, in Actinobacteria, the frequency of leader genes at a distance of 10-11 bp is about 70 % higher than the mean frequency within the 1 to 65 bp range; and it gradually decreases as the range grows longer. A pronounced peak of this frequency-distance relationship is also observed in Proteobacteria, Bacteroidetes, Spirochaetales, Acidobacteria, the Deinococcus-Thermus group, and Planctomycetes. In contrast, this peak falls to the distance of 15-16 bp and is not very pronounced in Firmicutes; and no such peak is observed in cyanobacteria and tenericutes. Generally, this peak is typical for many bacteria. Some leader genes located close to a structural gene probably play a regulatory role as well.
Schematic for efficient computation of GC, GC3, and AT3 bias spectra of genome
Rizvi, Ahsan Z; Venu Gopal, T; Bhattacharya, C
2012-01-01
Selection of synonymous codons for an amino acid is biased in protein translation process. This biased selection causes repetition of synonymous codons in structural parts of genome that stands for high N/3 peaks in DNA spectrum. Period-3 spectral property is utilized here to produce a 3-phase network model based on polyphase filterbank concepts for derivation of codon bias spectra (CBS). Modification of parameters in this model can produce GC, GC3, and AT3 bias spectra. Complete schematic in LabVIEW platform is presented here for efficient and parallel computation of GC, GC3, and AT3 bias spectra of genomes alongwith results of CBS patterns. We have performed the correlation coefficient analysis of GC, GC3, and AT3 bias spectra with codon bias patterns of CBS for biological and statistical significance of this model. PMID:22368390
Schematic for efficient computation of GC, GC3, and AT3 bias spectra of genome.
Rizvi, Ahsan Z; Venu Gopal, T; Bhattacharya, C
2012-01-01
Selection of synonymous codons for an amino acid is biased in protein translation process. This biased selection causes repetition of synonymous codons in structural parts of genome that stands for high N/3 peaks in DNA spectrum. Period-3 spectral property is utilized here to produce a 3-phase network model based on polyphase filterbank concepts for derivation of codon bias spectra (CBS). Modification of parameters in this model can produce GC, GC3, and AT3 bias spectra. Complete schematic in LabVIEW platform is presented here for efficient and parallel computation of GC, GC3, and AT3 bias spectra of genomes alongwith results of CBS patterns. We have performed the correlation coefficient analysis of GC, GC3, and AT3 bias spectra with codon bias patterns of CBS for biological and statistical significance of this model.
Engqvist, Martin K M; Nielsen, Jens
2015-08-21
The Ambiguous Nucleotide Tool (ANT) is a desktop application that generates and evaluates degenerate codons. Degenerate codons are used to represent DNA positions that have multiple possible nucleotide alternatives. This is useful for protein engineering and directed evolution, where primers specified with degenerate codons are used as a basis for generating libraries of protein sequences. ANT is intuitive and can be used in a graphical user interface or by interacting with the code through a defined application programming interface. ANT comes with full support for nonstandard, user-defined, or expanded genetic codes (translation tables), which is important because synthetic biology is being applied to an ever widening range of natural and engineered organisms. The Python source code for ANT is freely distributed so that it may be used without restriction, modified, and incorporated in other software or custom data pipelines.
The complete mitochondrial genome of Chinese green hydra, Hydra sinensis (Hydroida: Hydridae).
Pan, Hong-Chun; Qian, Xiao-Cheng; Li, Ping; Li, Xiao-Fei; Wang, An-Tai
2014-02-01
The complete mitochondrial genome of Chinese green hydra, Hydra sinensis (Hydroida: Hydridae) is a linear molecule of 16,189 bp in length, containing 13 protein-coding genes, small and large subunit ribosomal RNAs, methionine and tryptophan transfer RNAs, a pseudogene consisting of a partial copy of COI and terminal sequences at two ends of the linear mitochondrial DNA. The A + T content of the overall base composition of H-strand is 77.2% (T: 41.7%; C: 10.9%; A: 35.5%; and G: 11.9%). COI and ND1 genes begin with GTG as start codon, while other 11 protein-coding genes start with a typical ATG initiation codon. COII, ATP8, ATP6, COIII, ND5, ND6, ND3, ND1, ND4 and COI genes are terminated with TAA as stop codon, ND4L ends with TAG, ND2 ends with TA and Cyt b ends with T.
Ribosome A and P sites revealed by length analysis of ribosome profiling data
Martens, Andrew T.; Taylor, James; Hilser, Vincent J.
2015-01-01
The high-throughput sequencing of nuclease-protected mRNA fragments bound to ribosomes, a technique known as ribosome profiling, quantifies the relative frequencies with which different regions of transcripts are translated. This technique has revealed novel translation initiation sites with unprecedented scope and has furthered investigations into the connections between codon biases and translation rates. Yet the location of the codon being decoded in ribosome footprints is still unknown, and has been complicated by the recent observation of footprints with non-canonical lengths. Here we show how taking into account the variations in ribosome footprint lengths can reveal the ribosome aminoacyl (A) and peptidyl (P) site locations. These location assignments are in agreement with the proposed mechanisms for various ribosome pauses and further enhance the resolution of the profiling data. We also show that GC-rich motifs at the 5′ ends of footprints are found in yeast, calling into question the anti-Shine-Dalgarno effect's role in ribosome pausing. PMID:25805170
Nonneutral GC3 and retroelement codon mimicry in Phytophthora.
Jiang, Rays H Y; Govers, Francine
2006-10-01
Phytophthora is a genus entirely comprised of destructive plant pathogens. It belongs to the Stramenopila, a unique branch of eukaryotes, phylogenetically distinct from plants, animals, or fungi. Phytophthora genes show a strong preference for usage of codons ending with G or C (high GC3). The presence of high GC3 in genes can be utilized to differentiate coding regions from noncoding regions in the genome. We found that both selective pressure and mutation bias drive codon bias in Phytophthora. Indicative for selection pressure is the higher GC3 value of highly expressed genes in different Phytophthora species. Lineage specific GC increase of noncoding regions is reminiscent of whole-genome mutation bias, whereas the elevated Phytophthora GC3 is primarily a result of translation efficiency-driven selection. Heterogeneous retrotransposons exist in Phytophthora genomes and many of them vary in their GC content. Interestingly, the most widespread groups of retroelements in Phytophthora show high GC3 and a codon bias that is similar to host genes. Apparently, selection pressure has been exerted on the retroelement's codon usage, and such mimicry of host codon bias might be beneficial for the propagation of retrotransposons.
Genome-Wide Analysis of Translational Control in Tuberous Sclerosis Complex
2012-07-01
particular non-AUG codons in the 5’UTR. However, these data was “noisy” and required a machine-learning algorithm to identify TIS codons. We develop...To investigate how nutrient signaling affects the folding of nascent chains, we used firefly luciferase (Luc) as a reporter because of its high...folding as the structural basis for the rapid de novo folding of firefly luciferase. Nat Struct Biol 6(7):697-705. 12. Gupta R, Kasturi P, Bracher A
Stop Codon Reassignment in the Wild
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ivanova, Natalia; Schwientek, Patrick; Tripp, H. James
Since the discovery of the genetic code and protein translation mechanisms (1), a limited number of variations of the standard assignment between unique base triplets (codons) and their encoded amino acids and translational stop signals have been found in bacteria and phages (2-3). Given the apparent ubiquity of the canonical genetic code, the design of genomically recoded organisms with non-canonical codes has been suggested as a means to prevent horizontal gene transfer between laboratory and environmental organisms (4). It is also predicted that genomically recoded organisms are immune to infection by viruses, under the assumption that phages and their hostsmore » must share a common genetic code (5). This paradigm is supported by the observation of increased resistance of genomically recoded bacteria to phages with a canonical code (4). Despite these assumptions and accompanying lines of evidence, it remains unclear whether differential and non-canonical codon usage represents an absolute barrier to phage infection and genetic exchange between organisms. Our knowledge of the diversity of genetic codes and their use by viruses and their hosts is primarily derived from the analysis of cultivated organisms. Advances in single-cell sequencing and metagenome assembly technologies have enabled the reconstruction of genomes of uncultivated bacterial and archaeal lineages (6). These initial findings suggest that large scale systematic studies of uncultivated microorganisms and viruses may reveal the extent and modes of divergence from the canonical genetic code operating in nature. To explore alternative genetic codes, we carried out a systematic analysis of stop codon reassignments from the canonical TAG amber, TGA opal, and TAA ochre codons in assembled metagenomes from environmental and host-associated samples, single-cell genomes of uncultivated bacteria and archaea, and a collection of phage sequences« less
Optimizing doped libraries by using genetic algorithms
NASA Astrophysics Data System (ADS)
Tomandl, Dirk; Schober, Andreas; Schwienhorst, Andreas
1997-01-01
The insertion of random sequences into protein-encoding genes in combination with biologicalselection techniques has become a valuable tool in the design of molecules that have usefuland possibly novel properties. By employing highly effective screening protocols, a functionaland unique structure that had not been anticipated can be distinguished among a hugecollection of inactive molecules that together represent all possible amino acid combinations.This technique is severely limited by its restriction to a library of manageable size. Oneapproach for limiting the size of a mutant library relies on `doping schemes', where subsetsof amino acids are generated that reveal only certain combinations of amino acids in a proteinsequence. Three mononucleotide mixtures for each codon concerned must be designed, suchthat the resulting codons that are assembled during chemical gene synthesis represent thedesired amino acid mixture on the level of the translated protein. In this paper we present adoping algorithm that `reverse translates' a desired mixture of certain amino acids into threemixtures of mononucleotides. The algorithm is designed to optimally bias these mixturestowards the codons of choice. This approach combines a genetic algorithm with localoptimization strategies based on the downhill simplex method. Disparate relativerepresentations of all amino acids (and stop codons) within a target set can be generated.Optional weighing factors are employed to emphasize the frequencies of certain amino acidsand their codon usage, and to compensate for reaction rates of different mononucleotidebuilding blocks (synthons) during chemical DNA synthesis. The effect of statistical errors thataccompany an experimental realization of calculated nucleotide mixtures on the generatedmixtures of amino acids is simulated. These simulations show that the robustness of differentoptima with respect to small deviations from calculated values depends on their concomitantfitness. Furthermore, the calculations probe the fitness landscape locally and allow apreliminary assessment of its structure.
1996-01-01
An increasing amount of evidence has shown that epitopes restricted to MHC class I molecules and recognized by CTL need not be encoded in a primary open reading frame (ORF). Such epitopes have been demonstrated after stop codons, in alternative reading frames (RF) and within introns. We have used a series of frameshifts (FS) introduced into the Influenza A/PR/8 /34 nucleoprotein (NP) gene to confirm the previous in vitro observations of cryptic epitope expression, and show that they are sufficiently expressed to prime immune responses in vivo. This presentation is not due to sub-dominant epitopes, transcription from cryptic promoters beyond the point of the FS, or internal initiation of translation. By introducing additional mutations to the construct exhibiting the most potent presentation, we have identified initiation codon readthrough (termed scanthrough here, where the scanning ribosome bypasses the conventional initiation codon, initiating translation further downstream) as the likely mechanism of epitope production. Further mutational analysis demonstrated that, while it should operate during the expression of wild-type (WT) protein, scanthrough does not provide a major source of processing substrate in our system. These findings suggest (i) that the full array of self- and pathogen-derived epitopes available during thymic selection and infection has not been fully appreciated and (ii) that cryptic epitope expression should be considered when the specificity of a CTL response cannot be identified or in therapeutic situations when conventional CTL targets are limited, as may be the case with latent viral infections and transformed cells. Finally, initiation codon readthrough provides a plausible explanation for the presentation of exocytic proteins by MHC class I molecules. PMID:8879204
2016-02-11
the White- head Genome Technology Core for sequencing . This work was supported by the UCSF Program for Breakthrough Biomedical Research (funded in...landscape of the yeast genome defined by RNA sequencing . Science 320, 1344–1349. Nedialkova, D.D., and Leidel, S.A. (2015). Optimization of Codon Translation... the CC BY license (http://creativecommons.org/licenses/by/4.0/). SUMMARY Ribosome-footprint profiling provides genome -wide snapshots of translation
Hara, A; Ueda, M; Misawa, S; Matsui, T; Furuhashi, K; Tanaka, A
2000-03-01
Development of a transformation system in the n-alkane-assimilating diploid yeast Candida tropicalis requires an antibiotic resistance gene in order to establish a selectable marker. The resistance gene for hygromycin B has often been used as a selectable marker in yeast transformation. However, C. tropicalis harboring the hygromycin resistance gene (HYG) was as sensitive to hygromycin B as the wild-type strain. Nine CTG codons were found in the ORF of the HYG gene. This codon has been reported to be translated as serine rather than leucine in Candida species. Analysis of the tRNA gene in C. tropicalis with the anticodon CAG [tRNA(CAG) gene], which is complementary to the codon CTG, showed that the sequence was highly similar to that of the C. maltosa tRNA(CAG) gene. In C. maltosa, the codon CTG is read as serine and not leucine. These results suggested that the HYG gene was not functional due to the nonuniversal usage of the CTG codon. Each of the nine CTG codons in the ORF of the HYG gene was changed to a CTC codon, which is read as leucine, by site-directed mutagenesis. When a plasmid containing the mutated HYG gene (HYG#) was constructed and introduced into C. tropicalis, hygromycin-resistant transformants were successfully obtained. This mutated hygromycin resistance gene may be useful for direct selection of C. tropicalis transformants.
Hofhuis, Julia; Schueren, Fabian; Nötzel, Christopher; Lingner, Thomas; Gärtner, Jutta; Jahn, Olaf
2016-01-01
Translational readthrough gives rise to C-terminally extended proteins, thereby providing the cell with new protein isoforms. These may have different properties from the parental proteins if the extensions contain functional domains. While for most genes amino acid incorporation at the stop codon is far lower than 0.1%, about 4% of malate dehydrogenase (MDH1) is physiologically extended by translational readthrough and the actual ratio of MDH1x (extended protein) to ‘normal' MDH1 is dependent on the cell type. In human cells, arginine and tryptophan are co-encoded by the MDH1x UGA stop codon. Readthrough is controlled by the 7-nucleotide high-readthrough stop codon context without contribution of the subsequent 50 nucleotides encoding the extension. All vertebrate MDH1x is directed to peroxisomes via a hidden peroxisomal targeting signal (PTS) in the readthrough extension, which is more highly conserved than the extension of lactate dehydrogenase B. The hidden PTS of non-mammalian MDH1x evolved to be more efficient than the PTS of mammalian MDH1x. These results provide insight into the genetic and functional co-evolution of these dually localized dehydrogenases. PMID:27881739
Codon-Anticodon Recognition in the Bacillus subtilis glyQS T Box Riboswitch
Caserta, Enrico; Liu, Liang-Chun; Grundy, Frank J.; Henkin, Tina M.
2015-01-01
Many amino acid-related genes in Gram-positive bacteria are regulated by the T box riboswitch. The leader RNA of genes in the T box family controls the expression of downstream genes by monitoring the aminoacylation status of the cognate tRNA. Previous studies identified a three-nucleotide codon, termed the “Specifier Sequence,” in the riboswitch that corresponds to the amino acid identity of the downstream genes. Pairing of the Specifier Sequence with the anticodon of the cognate tRNA is the primary determinant of specific tRNA recognition. This interaction mimics codon-anticodon pairing in translation but occurs in the absence of the ribosome. The goal of the current study was to determine the effect of a full range of mismatches for comparison with codon recognition in translation. Mutations were individually introduced into the Specifier Sequence of the glyQS leader RNA and tRNAGly anticodon to test the effect of all possible pairing combinations on tRNA binding affinity and antitermination efficiency. The functional role of the conserved purine 3′ of the Specifier Sequence was also verifiedin this study. We found that substitutions at the Specifier Sequence resulted in reduced binding, the magnitude of which correlates well with the predicted stability of the RNA-RNA pairing. However, the tolerance for specific mismatches in antitermination was generally different from that during decoding, which reveals a unique tRNA recognition pattern in the T box antitermination system. PMID:26229106
Factors affecting expression of the recF gene of Escherichia coli K-12.
Sandler, S J; Clark, A J
1990-01-31
This report describes four factors which affect expression of the recF gene from strong upstream lambda promoters under temperature-sensitive cIAt2-encoded repressor control. The first factor was the long mRNA leader sequence consisting of the Escherichia coli dnaN gene and 95% of the dnaA gene and lambda bet, N (double amber) and 40% of the exo gene. When most of this DNA was deleted, RecF became detectable in maxicells. The second factor was the vector, pBEU28, a runaway replication plasmid. When we substituted pUC118 for pBEU28, RecF became detectable in whole cells by the Coomassie blue staining technique. The third factor was the efficiency of initiation of translation. We used site-directed mutagenesis to change the mRNA leader, ribosome-binding site and the 3 bp before and after the translational start codon. Monitoring the effect of these mutational changes by translational fusion to lacZ, we discovered that the efficiency of initiation of translation was increased 30-fold. Only an estimated two- or threefold increase in accumulated levels of RecF occurred, however. This led us to discover the fourth factor, namely sequences in the recF gene itself. These sequences reduce expression of the recF-lacZ fusion genes 100-fold. The sequences responsible for this decrease in expression occur in four regions in the N-terminal half of recF. Expression is reduced by some sequences at the transcriptional level and by others at the translational level.
Chi, Xiaojuan; Wang, Song; Ma, Yanmei; Chen, Jilong
2017-01-01
The classical swine fever virus (CSFV), circulating worldwide, is a highly contagious virus. Since the emergence of CSFV, it has caused great economic loss in swine industry. The envelope glycoprotein E2 gene of the CSFV is an immunoprotective antigen that induces the immune system to produce neutralizing antibodies. Therefore, it is essential to study the codon usage of the E2 gene of the CSFV. In this study, 140 coding sequences of the E2 gene were analyzed. The value of effective number of codons (ENC) showed low codon usage bias in the E2 gene. Our study showed that codon usage could be described mainly by mutation pressure ENC plot analysis combined with principal component analysis (PCA) and translational selection-correlation analysis between the general average hydropathicity (Gravy) and aromaticity (Aroma), and nucleotides at the third position of codons (A3s, T3s, G3s, C3s and GC3s). Furthermore, the neutrality analysis, which explained the relationship between GC12s and GC3s, revealed that natural selection had a key role compared with mutational bias during the evolution of the E2 gene. These results lay a foundation for further research on the molecular evolution of CSFV. PMID:28880881
Chen, Ye; Li, Xinxin; Chi, Xiaojuan; Wang, Song; Ma, Yanmei; Chen, Jilong
2017-01-01
The classical swine fever virus (CSFV), circulating worldwide, is a highly contagious virus. Since the emergence of CSFV, it has caused great economic loss in swine industry. The envelope glycoprotein E2 gene of the CSFV is an immunoprotective antigen that induces the immune system to produce neutralizing antibodies. Therefore, it is essential to study the codon usage of the E2 gene of the CSFV. In this study, 140 coding sequences of the E2 gene were analyzed. The value of effective number of codons (ENC) showed low codon usage bias in the E2 gene. Our study showed that codon usage could be described mainly by mutation pressure ENC plot analysis combined with principal component analysis (PCA) and translational selection-correlation analysis between the general average hydropathicity (Gravy) and aromaticity (Aroma), and nucleotides at the third position of codons (A3s, T3s, G3s, C3s and GC3s). Furthermore, the neutrality analysis, which explained the relationship between GC12s and GC3s, revealed that natural selection had a key role compared with mutational bias during the evolution of the E2 gene. These results lay a foundation for further research on the molecular evolution of CSFV.
Modification of orthogonal tRNAs: unexpected consequences for sense codon reassignment.
Biddle, Wil; Schmitt, Margaret A; Fisk, John D
2016-12-01
Breaking the degeneracy of the genetic code via sense codon reassignment has emerged as a way to incorporate multiple copies of multiple non-canonical amino acids into a protein of interest. Here, we report the modification of a normally orthogonal tRNA by a host enzyme and show that this adventitious modification has a direct impact on the activity of the orthogonal tRNA in translation. We observed nearly equal decoding of both histidine codons, CAU and CAC, by an engineered orthogonal M. jannaschii tRNA with an AUG anticodon: tRNA Opt We suspected a modification of the tRNA Opt AUG anticodon was responsible for the anomalous lack of codon discrimination and demonstrate that adenosine 34 of tRNA Opt AUG is converted to inosine. We identified tRNA Opt AUG anticodon loop variants that increase reassignment of the histidine CAU codon, decrease incorporation in response to the histidine CAC codon, and improve cell health and growth profiles. Recognizing tRNA modification as both a potential pitfall and avenue of directed alteration will be important as the field of genetic code engineering continues to infiltrate the genetic codes of diverse organisms. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Barendt, Pamela A.; Shah, Najaf A.; Barendt, Gregory A.; Kothari, Parth A.; Sarkar, Casim A.
2013-01-01
While the ribosome has evolved to function in complex intracellular environments, these contexts do not easily allow for the study of its inherent capabilities. We have used a synthetic, well-defined, Escherichia coli (E. coli)-based translation system in conjunction with ribosome display, a powerful in vitro selection method, to identify ribosome binding sites (RBSs) that can promote the efficient translation of messenger RNAs (mRNAs) with a leader length representative of natural E. coli mRNAs. In previous work, we used a longer leader sequence and unexpectedly recovered highly efficient cytosine-rich sequences with complementarity to the 16S ribosomal RNA (rRNA) and similarity to eukaryotic RBSs. In the current study, Shine-Dalgarno (SD) sequences were prevalent but non-SD sequences were also heavily enriched and were dominated by novel guanine- and uracil-rich motifs which showed statistically significant complementarity to the 16S rRNA. Additionally, only SD motifs exhibited position-dependent decreases in sequence entropy, indicating that non-SD motifs likely operate by increasing the local concentration of ribosomes in the vicinity of the start codon, rather than by a position-dependent mechanism. These results further support the putative generality of mRNA-rRNA complementarity in facilitating mRNA translation, but also suggest that context (e.g., leader length and composition) dictates the specific subset of possible RBSs that are used for efficient translation of a given transcript. PMID:23427812
Castro-Chavez, Fernando
2012-01-01
Background Three binary representations of the genetic code according to the ancient I Ching of Fu-Xi will be presented, depending on their defragging capabilities by pairing based on three biochemical properties of the nucleic acids: H-bonds, Purine/Pyrimidine rings, and the Keto-enol/Amino-imino tautomerism, yielding the last pair a 32/32 single-strand self-annealed genetic code and I Ching tables. Methods Our working tool is the ancient binary I Ching's resulting genetic code chromosomes defragged by vertical and by horizontal pairing, reverse engineered into non-binaries of 2D rotating 4×4×4 circles and 8×8 squares and into one 3D 100% symmetrical 16×4 tetrahedron coupled to a functional tetrahedron with apical signaling and central hydrophobicity (codon formula: 4[1(1)+1(3)+1(4)+4(2)]; 5:5, 6:6 in man) forming a stella octangula, and compared to Nirenberg's 16×4 codon table (1965) pairing the first two nucleotides of the 64 codons in axis y. Results One horizontal and one vertical defragging had the start Met at the center. Two, both horizontal and vertical pairings produced two pairs of 2×8×4 genetic code chromosomes naturally arranged (M and I), rearranged by semi-introversion of central purines or pyrimidines (M' and I') and by clustering hydrophobic amino acids; their quasi-identity was disrupted by amino acids with odd codons (Met and Tyr pairing to Ile and TGA Stop); in all instances, the 64-grid 90° rotational ability was restored. Conclusions We defragged three I Ching representations of the genetic code while emphasizing Nirenberg's historical finding. The synthetic genetic code chromosomes obtained reflect the protective strategy of enzymes with a similar function, having both humans and mammals a biased G-C dominance of three H-bonds in the third nucleotide of their most used codons per amino acid, as seen in one chromosome of the i, M and M' genetic codes, while a two H-bond A-T dominance was found in their complementary chromosome, as seen in invertebrates and plants. The reverse engineering of chromosome I' into 2D rotating circles and squares was undertaken, yielding a 100% symmetrical 3D geometry which was coupled to a previously obtained genetic code tetrahedron in order to differentiate the start methionine from the methionine that is acting as a codifying non-start codon. PMID:23431415
Scherer, N M; Basso, D M
2008-09-16
DNATagger is a web-based tool for coloring and editing DNA, RNA and protein sequences and alignments. It is dedicated to the visualization of protein coding sequences and also protein sequence alignments to facilitate the comprehension of evolutionary processes in sequence analysis. The distinctive feature of DNATagger is the use of codons as informative units for coloring DNA and RNA sequences. The codons are colored according to their corresponding amino acids. It is the first program that colors codons in DNA sequences without being affected by "out-of-frame" gaps of alignments. It can handle single gaps and gaps inside the triplets. The program also provides the possibility to edit the alignments and change color patterns and translation tables. DNATagger is a JavaScript application, following the W3C guidelines, designed to work on standards-compliant web browsers. It therefore requires no installation and is platform independent. The web-based DNATagger is available as free and open source software at http://www.inf.ufrgs.br/~dmbasso/dnatagger/.
Yao, Peng; Potdar, Alka A.; Arif, Abul; Ray, Partho Sarothi; Mukhopadhyay, Rupak; Willard, Belinda; Xu, Yichi; Yan, Jun; Saidel, Gerald M.; Fox, Paul L.
2012-01-01
SUMMARY Post-transcriptional regulatory mechanisms superimpose “fine-tuning” control upon “on-off” switches characteristic of gene transcription. We have exploited computational modeling with experimental validation to resolve an anomalous relationship between mRNA expression and protein synthesis. Differential GAIT (Gamma-interferon Activated Inhibitor of Translation) complex activation repressed VEGF-A synthesis to a low, constant rate despite high, variable VEGFA mRNA expression. Dynamic model simulations indicated the presence of an unidentified, inhibitory GAIT element-interacting factor. We discovered a truncated form of glutamyl-prolyl tRNA synthetase (EPRS), the GAIT constituent that binds the 3’-UTR GAIT element in target transcripts. The truncated protein, EPRSN1, prevents binding of functional GAIT complex. EPRSN1 mRNA is generated by a remarkable polyadenylation-directed conversion of a Tyr codon in the EPRS coding sequence to a stop codon (PAY*). By low-level protection of GAIT element-bearing transcripts, EPRSN1 imposes a robust “translational trickle” of target protein expression. Genome-wide analysis shows PAY* generates multiple truncated transcripts thereby contributing to transcriptome expansion. PMID:22386318
Dass, J Febin Prabhu; Sudandiradoss, C
2012-07-15
5-HT (5-Hydroxy-tryptamine) or serotonin receptors are found both in central and peripheral nervous system as well as in non-neuronal tissues. In the animal and human nervous system, serotonin produces various functional effects through a variety of membrane bound receptors. In this study, we focus on 5-HT receptor family from different mammals and examined the factors that account for codon and nucleotide usage variation. A total of 110 homologous coding sequences from 11 different mammalian species were analyzed using relative synonymous codon usage (RSCU), correspondence analysis (COA) and hierarchical cluster analysis together with nucleotide base usage frequency of chemically similar amino acid codons. The mean effective number of codon (ENc) value of 37.06 for 5-HT(6) shows very high codon bias within the family and may be due to high selective translational efficiency. The COA and Spearman's rank correlation reveals that the nucleotide compositional mutation bias as the major factors influencing the codon usage in serotonin receptor genes. The hierarchical cluster analysis suggests that gene function is another dominant factor that affects the codon usage bias, while species is a minor factor. Nucleotide base usage was reported using Goldman, Engelman, Stietz (GES) scale reveals the presence of high uracil (>45%) content at functionally important hydrophobic regions. Our in silico approach will certainly help for further investigations on critical inference on evolution, structure, function and gene expression aspects of 5-HT receptors family which are potential antipsychotic drug targets. Copyright © 2012 Elsevier B.V. All rights reserved.
Onofre, Cláudia; Tomé, Filipa; Barbosa, Cristina; Silva, Ana Luísa
2015-01-01
The gene encoding human hemojuvelin (HJV) is one of the genes that, when mutated, can cause juvenile hemochromatosis, an early-onset inherited disorder associated with iron overload. The 5′ untranslated region of the human HJV mRNA has two upstream open reading frames (uORFs), with 28 and 19 codons formed by two upstream AUGs (uAUGs) sharing the same in-frame stop codon. Here we show that these uORFs decrease the translational efficiency of the downstream main ORF in HeLa and HepG2 cells. Indeed, ribosomal access to the main AUG is conditioned by the strong uAUG context, which results in the first uORF being translated most frequently. The reach of the main ORF is then achieved by ribosomes that resume scanning after uORF translation. Furthermore, the amino acid sequences of the uORF-encoded peptides also reinforce the translational repression of the main ORF. Interestingly, when iron levels increase, translational repression is relieved specifically in hepatic cells. The upregulation of protein levels occurs along with phosphorylation of the eukaryotic initiation factor 2α. Nevertheless, our results support a model in which the increasing recognition of the main AUG is mediated by a tissue-specific factor that promotes uORF bypass. These results support a tight HJV translational regulation involved in iron homeostasis. PMID:25666510
Borggren, Marie; Vinner, Lasse; Andresen, Betina Skovgaard; Grevstad, Berit; Repits, Johanna; Melchers, Mark; Elvang, Tara Laura; Sanders, Rogier W; Martinon, Frédéric; Dereuddre-Bosquet, Nathalie; Bowles, Emma Joanne; Stewart-Jones, Guillaume; Biswas, Priscilla; Scarlatti, Gabriella; Jansson, Marianne; Heyndrickx, Leo; Grand, Roger Le; Fomsgaard, Anders
2013-07-19
HIV-1 DNA vaccines have many advantageous features. Evaluation of HIV-1 vaccine candidates often starts in small animal models before macaque and human trials. Here, we selected and optimized DNA vaccine candidates through systematic testing in rabbits for the induction of broadly neutralizing antibodies (bNAb). We compared three different animal models: guinea pigs, rabbits and cynomolgus macaques. Envelope genes from the prototype isolate HIV-1 Bx08 and two elite neutralizers were included. Codon-optimized genes, encoded secreted gp140 or membrane bound gp150, were modified for expression of stabilized soluble trimer gene products, and delivered individually or mixed. Specific IgG after repeated i.d. inoculations with electroporation confirmed in vivo expression and immunogenicity. Evaluations of rabbits and guinea pigs displayed similar results. The superior DNA construct in rabbits was a trivalent mix of non-modified codon-optimized gp140 envelope genes. Despite NAb responses with some potency and breadth in guinea pigs and rabbits, the DNA vaccinated macaques displayed less bNAb activity. It was concluded that a trivalent mix of non-modified gp140 genes from rationally selected clinical isolates was, in this study, the best option to induce high and broad NAb in the rabbit model, but this optimization does not directly translate into similar responses in cynomolgus macaques.
Borggren, Marie; Vinner, Lasse; Andresen, Betina Skovgaard; Grevstad, Berit; Repits, Johanna; Melchers, Mark; Elvang, Tara Laura; Sanders, Rogier W; Martinon, Frédéric; Dereuddre-Bosquet, Nathalie; Bowles, Emma Joanne; Stewart-Jones, Guillaume; Biswas, Priscilla; Scarlatti, Gabriella; Jansson, Marianne; Heyndrickx, Leo; Le Grand, Roger; Fomsgaard, Anders
2013-01-01
HIV-1 DNA vaccines have many advantageous features. Evaluation of HIV-1 vaccine candidates often starts in small animal models before macaque and human trials. Here, we selected and optimized DNA vaccine candidates through systematic testing in rabbits for the induction of broadly neutralizing antibodies (bNAb). We compared three different animal models: guinea pigs, rabbits and cynomolgus macaques. Envelope genes from the prototype isolate HIV-1 Bx08 and two elite neutralizers were included. Codon-optimized genes, encoded secreted gp140 or membrane bound gp150, were modified for expression of stabilized soluble trimer gene products, and delivered individually or mixed. Specific IgG after repeated i.d. inoculations with electroporation confirmed in vivo expression and immunogenicity. Evaluations of rabbits and guinea pigs displayed similar results. The superior DNA construct in rabbits was a trivalent mix of non-modified codon-optimized gp140 envelope genes. Despite NAb responses with some potency and breadth in guinea pigs and rabbits, the DNA vaccinated macaques displayed less bNAb activity. It was concluded that a trivalent mix of non-modified gp140 genes from rationally selected clinical isolates was, in this study, the best option to induce high and broad NAb in the rabbit model, but this optimization does not directly translate into similar responses in cynomolgus macaques. PMID:26344115
Cell illustrator 4.0: a computational platform for systems biology.
Nagasaki, Masao; Saito, Ayumu; Jeong, Euna; Li, Chen; Kojima, Kaname; Ikeda, Emi; Miyano, Satoru
2011-01-01
Cell Illustrator is a software platform for Systems Biology that uses the concept of Petri net for modeling and simulating biopathways. It is intended for biological scientists working at bench. The latest version of Cell Illustrator 4.0 uses Java Web Start technology and is enhanced with new capabilities, including: automatic graph grid layout algorithms using ontology information; tools using Cell System Markup Language (CSML) 3.0 and Cell System Ontology 3.0; parameter search module; high-performance simulation module; CSML database management system; conversion from CSML model to programming languages (FORTRAN, C, C++, Java, Python and Perl); import from SBML, CellML, and BioPAX; and, export to SVG and HTML. Cell Illustrator employs an extension of hybrid Petri net in an object-oriented style so that biopathway models can include objects such as DNA sequence, molecular density, 3D localization information, transcription with frame-shift, translation with codon table, as well as biochemical reactions.
Cell Illustrator 4.0: a computational platform for systems biology.
Nagasaki, Masao; Saito, Ayumu; Jeong, Euna; Li, Chen; Kojima, Kaname; Ikeda, Emi; Miyano, Satoru
2010-01-01
Cell Illustrator is a software platform for Systems Biology that uses the concept of Petri net for modeling and simulating biopathways. It is intended for biological scientists working at bench. The latest version of Cell Illustrator 4.0 uses Java Web Start technology and is enhanced with new capabilities, including: automatic graph grid layout algorithms using ontology information; tools using Cell System Markup Language (CSML) 3.0 and Cell System Ontology 3.0; parameter search module; high-performance simulation module; CSML database management system; conversion from CSML model to programming languages (FORTRAN, C, C++, Java, Python and Perl); import from SBML, CellML, and BioPAX; and, export to SVG and HTML. Cell Illustrator employs an extension of hybrid Petri net in an object-oriented style so that biopathway models can include objects such as DNA sequence, molecular density, 3D localization information, transcription with frame-shift, translation with codon table, as well as biochemical reactions.
Johnston, Christopher; Douarre, Pierre E; Soulimane, Tewfik; Pletzer, Daniel; Weingart, Helge; MacSharry, John; Coffey, Aidan; Sleator, Roy D; O'Mahony, Jim
2013-06-01
Subunit and DNA-based vaccines against Mycobacterium avium ssp. paratuberculosis (MAP) attempt to overcome inherent issues associated with whole-cell formulations. However, these vaccines can be hampered by poor expression of recombinant antigens from a number of disparate hosts. The high G+C content of MAP invariably leads to a codon bias throughout gene expression. To investigate if the codon bias affects recombinant MAP antigen expression, the open reading frame of a MAP-specific antigen MptD (MAP3733c) was codon optimised for expression against a Lactobacillus salivarius host. Of the total 209 codons which constitute MAP3733c, 172 were modified resulting in a reduced G+C content from 61% for the native gene to 32.7% for the modified form. Both genes were placed under the transcriptional control of the PnisA promoter; allowing controlled heterologous expression in L. salivarius. Expression was monitored using fluorescence microscopy and microplate fluorometry via GFP tags translationally fused to the C-termini of the two MptD genes. A > 37-fold increase in expression was observed for the codon-optimised MAP3733synth variant over the native gene. Due to the low cost and improved expression achieved, codon optimisation significantly improves the potential of L. salivarius as an oral vaccine stratagem against Johne's disease. © 2013 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.
The ribosome uses two active mechanisms to unwind messenger RNA during translation.
Qu, Xiaohui; Wen, Jin-Der; Lancaster, Laura; Noller, Harry F; Bustamante, Carlos; Tinoco, Ignacio
2011-07-06
The ribosome translates the genetic information encoded in messenger RNA into protein. Folded structures in the coding region of an mRNA represent a kinetic barrier that lowers the peptide elongation rate, as the ribosome must disrupt structures it encounters in the mRNA at its entry site to allow translocation to the next codon. Such structures are exploited by the cell to create diverse strategies for translation regulation, such as programmed frameshifting, the modulation of protein expression levels, ribosome localization and co-translational protein folding. Although strand separation activity is inherent to the ribosome, requiring no exogenous helicases, its mechanism is still unknown. Here, using a single-molecule optical tweezers assay on mRNA hairpins, we find that the translation rate of identical codons at the decoding centre is greatly influenced by the GC content of folded structures at the mRNA entry site. Furthermore, force applied to the ends of the hairpin to favour its unfolding significantly speeds translation. Quantitative analysis of the force dependence of its helicase activity reveals that the ribosome, unlike previously studied helicases, uses two distinct active mechanisms to unwind mRNA structure: it destabilizes the helical junction at the mRNA entry site by biasing its thermal fluctuations towards the open state, increasing the probability of the ribosome translocating unhindered; and it mechanically pulls apart the mRNA single strands of the closed junction during the conformational changes that accompany ribosome translocation. The second of these mechanisms ensures a minimal basal rate of translation in the cell; specialized, mechanically stable structures are required to stall the ribosome temporarily. Our results establish a quantitative mechanical basis for understanding the mechanism of regulation of the elongation rate of translation by structured mRNAs. ©2011 Macmillan Publishers Limited. All rights reserved
A comparative study of prebiotic and present day translational models
NASA Technical Reports Server (NTRS)
Rein, R.; Raghunathan, G.; Mcdonald, J.; Shibata, M.; Srinivasan, S.
1986-01-01
It is generally recognized that the understanding of the molecular basis of primitive translation is a fundamental step in developing a theory of the origin of life. However, even in modern molecular biology, the mechanism for the decoding of messenger RNA triplet codons into an amino acid sequence of a protein on the ribosome is understood incompletely. Most of the proposed models for prebiotic translation lack, not only experimental support, but also a careful theoretical scrutiny of their compatibility with well understood stereochemical and energetic principles of nucleic acid structure, molecular recognition principles, and the chemistry of peptide bond formation. Present studies are concerned with comparative structural modelling and mechanistic simulation of the decoding apparatus ranging from those proposed for prebiotic conditions to the ones involved in modern biology. Any primitive decoding machinery based on nucleic acids and proteins, and most likely the modern day system, has to satisfy certain geometrical constraints. The charged amino acyl and the peptidyl termini of successive adaptors have to be adjacent in space in order to satisfy the stereochemical requirements for amide bond formation. Simultaneously, the same adaptors have to recognize successive codons on the messenger. This translational complex has to be realized by components that obey nucleic acid conformational principles, stabilities, and specificities. This generalized condition greatly restricts the number of acceptable adaptor structures.
Targeting Nonsense Mutations in Diseases with Translational Read-Through-Inducing Drugs (TRIDs).
Nagel-Wolfrum, Kerstin; Möller, Fabian; Penner, Inessa; Baasov, Timor; Wolfrum, Uwe
2016-04-01
In recent years, remarkable advances in the ability to diagnose genetic disorders have been made. The identification of disease-causing genes allows the development of gene-specific therapies with the ultimate goal to develop personalized medicines for each patient according to their own specific genetic defect. In-depth genotyping of many different genes has revealed that ~12% of inherited genetic disorders are caused by in-frame nonsense mutations. Nonsense (non-coding) mutations are caused by point mutations, which generate premature termination codons (PTCs) that cause premature translational termination of the mRNA, and subsequently inhibit normal full-length protein expression. Recently, a gene-based therapeutic approach for genetic diseases caused by nonsense mutations has emerged, namely the so-called translational read-through (TR) therapy. Read-through therapy is based on the discovery that small molecules, known as TR-inducing drugs (TRIDs), allow the translation machinery to suppress a nonsense codon, elongate the nascent peptide chain, and consequently result in the synthesis of full-length protein. Several TRIDs are currently under investigation and research has been performed on several genetic disorders caused by nonsense mutations over the years. These findings have raised hope for the usage of TR therapy as a gene-based pharmacogenetic therapy for nonsense mutations in various genes responsible for a variety of genetic diseases.
Isolation and characterization of the gene coding for Escherichia coli arginyl-tRNA synthetase.
Eriani, G; Dirheimer, G; Gangloff, J
1989-01-01
The gene coding for Escherichia coli arginyl-tRNA synthetase (argS) was isolated as a fragment of 2.4 kb after analysis and subcloning of recombinant plasmids from the Clarke and Carbon library. The clone bearing the gene overproduces arginyl-tRNA synthetase by a factor 100. This means that the enzyme represents more than 20% of the cellular total protein content. Sequencing revealed that the fragment contains a unique open reading frame of 1734 bp flanked at its 5' and 3' ends respectively by 247 bp and 397 bp. The length of the corresponding protein (577 aa) is well consistent with earlier Mr determination (about 70 kd). Primer extension analysis of the ArgRS mRNA by reverse transcriptase, located its 5' end respectively at 8 and 30 nucleotides downstream of a TATA and a TTGAC like element (CTGAC) and 60 nucleotides upstream of the unusual translation initiation codon GUG; nuclease S1 analysis located the 3'-end at 48 bp downstream of the translation termination codon. argS has a codon usage pattern typical for highly expressed E. coli genes. With the exception of the presence of a HVGH sequence similar to the HIGH consensus element, ArgRS has no relevant sequence homologies with other aminoacyl-tRNA synthetases. Images PMID:2668891
Genetic coding and gene expression - new Quadruplet genetic coding model
NASA Astrophysics Data System (ADS)
Shankar Singh, Rama
2012-07-01
Successful demonstration of human genome project has opened the door not only for developing personalized medicine and cure for genetic diseases, but it may also answer the complex and difficult question of the origin of life. It may lead to making 21st century, a century of Biological Sciences as well. Based on the central dogma of Biology, genetic codons in conjunction with tRNA play a key role in translating the RNA bases forming sequence of amino acids leading to a synthesized protein. This is the most critical step in synthesizing the right protein needed for personalized medicine and curing genetic diseases. So far, only triplet codons involving three bases of RNA, transcribed from DNA bases, have been used. Since this approach has several inconsistencies and limitations, even the promise of personalized medicine has not been realized. The new Quadruplet genetic coding model proposed and developed here involves all four RNA bases which in conjunction with tRNA will synthesize the right protein. The transcription and translation process used will be the same, but the Quadruplet codons will help overcome most of the inconsistencies and limitations of the triplet codes. Details of this new Quadruplet genetic coding model and its subsequent potential applications including relevance to the origin of life will be presented.
Sun, Liying; Andika, Ida Bagus; Shen, Jiangfeng; Yang, Di; Ratti, Claudio; Chen, Jianping
2013-10-01
Some viruses use alternative translation initiation at non-AUG codons as a strategy to produce multiple proteins during gene expression. Here we show that, using this strategy, Chinese wheat mosaic virus (CWMV; Furovirus) expresses a larger form of coat protein (N-ext/CP) in infected plants. Site-directed mutagenesis and transient expression analysis confirmed that CWMV N-ext/CP is initiated at an upstream in-frame CUG codon at nucleotide position 207-209 of RNA 2, which adds a 39 amino acid (aa) N-terminal extension to the major CP. Interestingly, in planta and in vitro analyses indicated that CWMV N-ext/CP but not CP interacts with the CWMV cysteine-rich protein (CRP), an RNA silencing suppressor. We further determined that the N-terminal 39 aa extension, particularly the 10 aa region immediately upstream of the major CP coding region is responsible for the interaction of N-ext/CP with CRP. In an Agrobacterium co-infiltration assay, co-expression with N-ext/CP did not affect CRP silencing suppression activity. Thus the alternative translation initiation at a CUG codon provides the CWMV N-ext/CP with the ability to bind to the viral silencing suppressor. Copyright © 2013 Elsevier B.V. All rights reserved.
Foamy virus reverse transcriptase is expressed independently from the Gag protein.
Enssle, J; Jordan, I; Mauer, B; Rethwilm, A
1996-01-01
In the foamy virus (FV) subgroup of retroviruses the pol genes are located in the +1 reading frame relative to the gag genes and possess potential ATG initiation codons in their 5' regions. This genome organization suggests either a + 1 ribosomal frameshift to generate a Gag-Pol fusion protein, similar to all other retroviruses studied so far, or new initiation of Pol translation, as used by pararetroviruses, to express the Pol protein. By using a genetic approach we have ruled out the former possibility and provide evidence for the latter. Two down-mutations (M53 and M54) of the pol ATG codon were found to abolish replication and Pol protein expression of the human FV isolate. The introduction of a new ATG in mutation M55, 3' to the down-mutated ATG of mutation M53, restored replication competence, indicating that the pol ATG functions as a translational initiation codon. Two nonsense mutants (M56 and M57), which functionally separated gag and pol with respect to potential frame-shifting sites, were also replication-competent, providing further genetic evidence that FVs express the Pol protein independently from Gag. Our results show that during a particular step of the replication cycle, FVs differ fundamentally from all other retroviruses. Images Fig. 3 PMID:8633029
Effects of tRNA modification on translational accuracy depend on intrinsic codon-anticodon strength.
Manickam, Nandini; Joshi, Kartikeya; Bhatt, Monika J; Farabaugh, Philip J
2016-02-29
Cellular health and growth requires protein synthesis to be both efficient to ensure sufficient production, and accurate to avoid producing defective or unstable proteins. The background of misreading error frequency by individual tRNAs is as low as 2 × 10(-6) per codon but is codon-specific with some error frequencies above 10(-3) per codon. Here we test the effect on error frequency of blocking post-transcriptional modifications of the anticodon loops of four tRNAs in Escherichia coli. We find two types of responses to removing modification. Blocking modification of tRNA(UUC)(Glu) and tRNA(QUC)(Asp) increases errors, suggesting that the modifications act at least in part to maintain accuracy. Blocking even identical modifications of tRNA(UUU)(Lys) and tRNA(QUA)(Tyr) has the opposite effect of decreasing errors. One explanation could be that the modifications play opposite roles in modulating misreading by the two classes of tRNAs. Given available evidence that modifications help preorder the anticodon to allow it to recognize the codons, however, the simpler explanation is that unmodified 'weak' tRNAs decode too inefficiently to compete against cognate tRNAs that normally decode target codons, which would reduce the frequency of misreading. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Omeire, Destiny; Abdin, Shaunte; Brooks, Daniel M; Miranda, Hector C
2015-04-01
The Germain's Peacock-Pheasant Polyplectron germaini (Aves, Galliformes, Phasianidae) is classified as Near Threatened on the IUCN Red List. The complete mitochondrial genome of P. germaini is 16,699 bp, consisting of 13 protein-coding genes, 2 rRNA, 22 tRNA genes and 1 control region. All of the 13 protein-coding genes have ATG as start codon. Eight of the 13 protein-coding genes have TAA as stop codon.
Rules of UGA-N decoding by near-cognate tRNAs and analysis of readthrough on short uORFs in yeast.
Beznosková, Petra; Gunišová, Stanislava; Valášek, Leoš Shivaya
2016-03-01
The molecular mechanism of stop codon recognition by the release factor eRF1 in complex with eRF3 has been described in great detail; however, our understanding of what determines the difference in termination efficiencies among various stop codon tetranucleotides and how near-cognate (nc) tRNAs recode stop codons during programmed readthrough in Saccharomyces cerevisiae is still poor. Here, we show that UGA-C as the only tetranucleotide of all four possible combinations dramatically exacerbated the readthrough phenotype of the stop codon recognition-deficient mutants in eRF1. Since the same is true also for UAA-C and UAG-C, we propose that the exceptionally high readthrough levels that all three stop codons display when followed by cytosine are partially caused by the compromised sampling ability of eRF1, which specifically senses cytosine at the +4 position. The difference in termination efficiencies among the remaining three UGA-N tetranucleotides is then given by their varying preferences for nc-tRNAs. In particular, UGA-A allows increased incorporation of Trp-tRNA whereas UGA-G and UGA-C favor Cys-tRNA. Our findings thus expand the repertoire of general decoding rules by showing that the +4 base determines the preferred selection of nc-tRNAs and, in the case of cytosine, it also genetically interacts with eRF1. Finally, using an example of the GCN4 translational control governed by four short uORFs, we also show how the evolution of this mechanism dealt with undesirable readthrough on those uORFs that serve as the key translation reinitiation promoting features of the GCN4 regulation, as both of these otherwise counteracting activities, readthrough versus reinitiation, are mediated by eIF3. © 2016 Beznosková et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Towards a Model for Protein Production Rates
NASA Astrophysics Data System (ADS)
Dong, J. J.; Schmittmann, B.; Zia, R. K. P.
2007-07-01
In the process of translation, ribosomes read the genetic code on an mRNA and assemble the corresponding polypeptide chain. The ribosomes perform discrete directed motion which is well modeled by a totally asymmetric simple exclusion process (TASEP) with open boundaries. Using Monte Carlo simulations and a simple mean-field theory, we discuss the effect of one or two "bottlenecks" (i.e., slow codons) on the production rate of the final protein. Confirming and extending previous work by Chou and Lakatos, we find that the location and spacing of the slow codons can affect the production rate quite dramatically. In particular, we observe a novel "edge" effect, i.e., an interaction of a single slow codon with the system boundary. We focus in detail on ribosome density profiles and provide a simple explanation for the length scale which controls the range of these interactions.
The scanning model for translation: an update
1989-01-01
The small (40S) subunit of eukaryotic ribosomes is believed to bind initially at the capped 5'-end of messenger RNA and then migrate, stopping at the first AUG codon in a favorable context for initiating translation. The first-AUG rule is not absolute, but there are rules for breaking the rule. Some anomalous observations that seemed to contradict the scanning mechanism now appear to be artifacts. A few genuine anomalies remain unexplained. PMID:2645293
Bellantuono, I; Lashford, L S; Rafferty, J A; Fairbairn, L J
2000-05-01
As a single gene defect in mature bone marrow cells, chronic granulomatous disease (X-CGD) represents a disorder which may be amenable to gene therapy by the transfer of the missing subunit into hemopoietic stem cells. In the majority of cases lack of Gp91-phox causes the disease. So far, studies involving transfer of Gp91-phox cDNA, including a phase I clinical trial, have yielded disappointing results. Most often, low titers of virus have been reported. In the present study we investigated the possible reasons for low titer amphotropic viral production. To investigate the effect of Gp91 cDNA on the efficiency of retroviral production from the packaging cell line, GP+envAm12, we constructed vectors containing either the native cDNA, truncated versions of the cDNA or a mutated form (LATG) in which the natural translational start codon was changed to a stop codon. Following derivation of clonal packaging cell lines, these were assessed for viral titer by RNA slot blot and analyzed by non-parametrical statistical analysis (Whitney-Mann U-test). An improvement in viral titer of just over two-fold was found in packaging cells containing the start-codon mutant of Gp91 and no evidence of truncated viral RNA was seen in these cells. Further analysis revealed the presence of rearranged forms of the provirus in Gp91-expressing cells, and the production of truncated, unpackaged viral RNA. Protein analysis revealed that LATG-transduced cells did not express full-length Gp91-phox, whereas those containing the wild-type cDNA did. However, a truncated protein was seen in ATG-transduced cells which was also present in wild type cells. No evidence for the presence of a negative transcriptional regulatory element was found from studies with the deletion mutants. A statistically significant effect of protein production on the production of virus from Gp91-expressing cells was found. Our data point to a need to restrict expression of the Gp91-phox protein and its derivatives in order to enhance retroviral production and suggest that improvements in current vectors for CGD gene therapy may need to include controlled, directed expression only in mature neutrophils.
2012-01-01
Background To evaluate the value of KRAS codon 13 mutations in patients with advanced colorectal cancer (advanced CRC) treated with oxaliplatin and fluoropyrimidines. Methods Tumor specimens from 201 patients with advanced CRC from a randomized, phase III trial comparing oxaliplatin/5-FU vs. oxaliplatin/capecitabine were retrospectively analyzed for KRAS mutations. Mutation data were correlated to response data (Overall response rate, ORR), progression-free survival (PFS) and overall survival (OS). Results 201 patients were analysed for KRAS mutation (61.2% males; mean age 64.2 ± 8.6 years). KRAS mutations were identified in 36.3% of tumors (28.8% in codon 12, 7.4% in codon 13). The ORR in codon 13 patients compared to codon 12 and wild type patients was significantly lower (p = 0.008). There was a tendency for a better overall survival in KRAS wild type patients compared to mutants (p = 0.085). PFS in all patients was not different in the three KRAS genetic groups (p = 0.72). However, we found a marked difference in PFS between patients with codon 12 and 13 mutant tumors treated with infusional 5-FU versus capecitabine based regimens. Conclusions Our data suggest that the type of KRAS mutation may be of clinical relevance under oxaliplatin combination chemotherapies without the addition of monoclonal antibodies in particular when overall response rates are important. Trial registration number 2002-04-017 PMID:22876876
Bacillus anthracis genome organization in light of whole transcriptome sequencing
DOE Office of Scientific and Technical Information (OSTI.GOV)
Martin, Jeffrey; Zhu, Wenhan; Passalacqua, Karla D.
2010-03-22
Emerging knowledge of whole prokaryotic transcriptomes could validate a number of theoretical concepts introduced in the early days of genomics. What are the rules connecting gene expression levels with sequence determinants such as quantitative scores of promoters and terminators? Are translation efficiency measures, e.g. codon adaptation index and RBS score related to gene expression? We used the whole transcriptome shotgun sequencing of a bacterial pathogen Bacillus anthracis to assess correlation of gene expression level with promoter, terminator and RBS scores, codon adaptation index, as well as with a new measure of gene translational efficiency, average translation speed. We compared computationalmore » predictions of operon topologies with the transcript borders inferred from RNA-Seq reads. Transcriptome mapping may also improve existing gene annotation. Upon assessment of accuracy of current annotation of protein-coding genes in the B. anthracis genome we have shown that the transcriptome data indicate existence of more than a hundred genes missing in the annotation though predicted by an ab initio gene finder. Interestingly, we observed that many pseudogenes possess not only a sequence with detectable coding potential but also promoters that maintain transcriptional activity.« less
Komatsu, Ken; Hirata, Hisae; Fukagawa, Takako; Yamaji, Yasuyuki; Okano, Yukari; Ishikawa, Kazuya; Adachi, Tatsushi; Maejima, Kensaku; Hashimoto, Masayoshi; Namba, Shigetou
2012-07-01
The first open-reading frame (ORF) of apple stem grooving virus (ASGV), of the genus Capillovirus, encodes an apparently chimeric polyprotein containing conserved regions for replicase (Rep) and coat protein (CP). However, our previous study revealed that ASGV mutants with distinct and discontinuous Rep- and CP-coding regions successfully infect plants, indicating that CP expressed via a subgenomic RNA (sgRNA) is sufficient for viability of the virus. Here we identified a transcription start site of the CP sgRNA and revealed that CP translated from the sgRNA is essential for ASGV infection. We mapped the transcription start sites of both the CP and the movement protein (MP) sgRNAs of ASGV and found a hexanucleotide motif, UUAGGU, conserved upstream from both sgRNA transcription start sites. Mutational analysis of the putative CP initiation codon and of the UUAGGU sequence upstream from the transcription start site of CP sgRNA demonstrated their importance for ASGV accumulation. Our results also demonstrated that potato virus T (PVT), an unassigned species closely related to ASGV, produces two sgRNAs putatively deployed for the CP and MP expression and that the same hexanucleotide motif as found in ASGV is located upstream from the transcription start sites of both sgRNAs. This motif, which constituted putative core elements of the sgRNA promoter, is broadly conserved among viruses in the families Alphaflexiviridae and Betaflexiviridae, suggesting that the gene expression strategy of the viruses in both families has been conserved throughout evolution. Copyright © 2012 Elsevier B.V. All rights reserved.
Novel mutation at the initiation codon in the Norrie disease gene in two Japanese families.
Isashiki, Y; Ohba, N; Yanagita, T; Hokita, N; Doi, N; Nakagawa, M; Ozawa, M; Kuroda, N
1995-01-01
We have identified a new mutation of Norrie disease (ND) gene in two Japanese males from unrelated families; they showed typical ocular features of ND but no mental retardation or hearing impairment. A mutation was found in both patients at the initiation codon of exon 2 of the ND gene (ATG to GTG), with otherwise normal nucleotide sequences. Their mothers had the normal and mutant types of the gene, which was expected for heterozygotes of the disease. The mutation of the initiation codon would cause the failure of ND gene expression or a defect in translation thereby truncating the amino terminus of ND protein. In view of the rarity and marked heterogeneity of mutations in the ND gene, the present apparently unrelated Japanese families who have lived in the same area for over two centuries presumably share the origin of the mutation.
Prevost, Luanna B.; Smith, Michelle K.; Knight, Jennifer K.
2016-01-01
Previous work has shown that students have persistent difficulties in understanding how central dogma processes can be affected by a stop codon mutation. To explore these difficulties, we modified two multiple-choice questions from the Genetics Concept Assessment into three open-ended questions that asked students to write about how a stop codon mutation potentially impacts replication, transcription, and translation. We then used computer-assisted lexical analysis combined with human scoring to categorize student responses. The lexical analysis models showed high agreement with human scoring, demonstrating that this approach can be successfully used to analyze large numbers of student written responses. The results of this analysis show that students’ ideas about one process in the central dogma can affect their thinking about subsequent and previous processes, leading to mixed models of conceptual understanding. PMID:27909016
Romero, Héctor; Zavala, Alejandro; Musto, Héctor
2000-01-01
The patterns of synonymous codon choices of the completely sequenced genome of the bacterium Chlamydia trachomatis were analysed. We found that the most important source of variation among the genes results from whether the sequence is located on the leading or lagging strand of replication, resulting in an over representation of G or C, respectively. This can be explained by different mutational biases associated to the different enzymes that replicate each strand. Next we found that most highly expressed sequences are located on the leading strand of replication. From this result, replicational-transcriptional selection can be invoked. Then, when the genes located on the leading strand are studied separately, the correspondence analysis detects a principal trend which discriminates between lowly and highly expressed sequences, the latter displaying a different codon usage pattern than the former, suggesting selection for translation, which is reinforced by the fact that Ks values between orthologous sequences from C.trachomatis and Chlamydia pneumoniae are much smaller in highly expressed genes. Finally, synonymous codon choices appear to be influenced by the hydropathy of each encoded protein and by the degree of amino acid conservation. Therefore, synonymous codon usage in C.trachomatis seems to be the result of a very complex balance among different factors, which rises the problem of whether the forces driving codon usage patterns among microorganisms are rather more complex than generally accepted. PMID:10773076
Romero, H; Zavala, A; Musto, H
2000-05-15
The patterns of synonymous codon choices of the completely sequenced genome of the bacterium Chlamydia trachomatis were analysed. We found that the most important source of variation among the genes results from whether the sequence is located on the leading or lagging strand of replication, resulting in an over representation of G or C, respectively. This can be explained by different mutational biases associated to the different enzymes that replicate each strand. Next we found that most highly expressed sequences are located on the leading strand of replication. From this result, replicational-transcriptional selection can be invoked. Then, when the genes located on the leading strand are studied separately, the correspondence analysis detects a principal trend which discriminates between lowly and highly expressed sequences, the latter displaying a different codon usage pattern than the former, suggesting selection for translation, which is reinforced by the fact that Ks values between orthologous sequences from C. trachomatis and Chlamydia pneumoniae are much smaller in highly expressed genes. Finally, synonymous codon choices appear to be influenced by the hydropathy of each encoded protein and by the degree of amino acid conservation. Therefore, synonymous codon usage in C.trachomatis seems to be the result of a very complex balance among different factors, which rises the problem of whether the forces driving codon usage patterns among microorganisms are rather more complex than generally accepted.
Fu, Hsu-Yuan; Lu, Yen-Hsu; Yi, Hsiu-Ping; Yang, Chii-Shen
2013-04-05
Microbial sensory rhodopsins are known to mediate phototaxis, and all of the known sensory rhodopsins execute this function with a specific cognate transducer that has two-transmembrane (2-TM) regions. In the genome of Haloarcula marismortui, a total of six rhodopsin genes were annotated, and we previously showed three of them to be the ion type and suggested the other three as sensory type, even though the candidate transducer gene, htr, for HmSRI was missing the 2-TM region that is found in all of the other known transducers. Here we showed this htr gene featured a preceding 2-TM region when the alternative start codon GTG located 291 nucleotides upstream of the original annotated open reading frame (ORF) was introduced and it is named as htrI in this study. Overexpression of HmHtrI exhibited it existed as a membrane protein and several biophysical assays confirmed it functionally interacted with HmSRI. Together with our previous reverse-transcriptase-PCR results and phototaxis measurements, the new ORF of original predicted soluble htr gene product was a membrane protein with a 2-TM region, HmHtrI; and it serves as the cognate transducer for HmSRI. HmHtrI therefore is the first transducer for the sensory rhodopsin adopted start codon other than ATG. Copyright © 2013 Elsevier B.V. All rights reserved.
Bergeron, Danny; Lapointe, Catherine; Bissonnette, Cyntia; Tremblay, Guillaume; Motard, Julie; Roucou, Xavier
2013-01-01
Spinocerebellar ataxia type 1 is an autosomal dominant cerebellar ataxia associated with the expansion of a polyglutamine tract within the ataxin-1 (ATXN1) protein. Recent studies suggest that understanding the normal function of ATXN1 in cellular processes is essential to decipher the pathogenesis mechanisms in spinocerebellar ataxia type 1. We found an alternative translation initiation ATG codon in the +3 reading frame of human ATXN1 starting 30 nucleotides downstream of the initiation codon for ATXN1 and ending at nucleotide 587. This novel overlapping open reading frame (ORF) encodes a 21-kDa polypeptide termed Alt-ATXN1 (Alternative ATXN1) with a completely different amino acid sequence from ATXN1. We introduced a hemagglutinin tag in-frame with Alt-ATXN1 in ATXN1 cDNA and showed in cell culture the co-expression of both ATXN1 and Alt-ATXN1. Remarkably, Alt-ATXN1 colocalized and interacted with ATXN1 in nuclear inclusions. In contrast, in the absence of ATXN1 expression, Alt-ATXN1 displays a homogenous nucleoplasmic distribution. Alt-ATXN1 interacts with poly(A)+ RNA, and its nuclear localization is dependent on RNA transcription. Polyclonal antibodies raised against Alt-ATXN1 confirmed the expression of Alt-ATXN1 in human cerebellum expressing ATXN1. These results demonstrate that human ATXN1 gene is a dual coding sequence and that ATXN1 interacts with and controls the subcellular distribution of Alt-ATXN1. PMID:23760502
Control of total GFP expression by alterations to the 3′ region nucleotide sequence
2013-01-01
Background Previously, we distinguished the Escherichia coli type II cytoplasmic membrane translocation pathways of Tat, Yid, and Sec for unfolded and folded soluble target proteins. The translocation of folded protein to the periplasm for soluble expression via the Tat pathway was controlled by an N-terminal hydrophilic leader sequence. In this study, we investigated the effect of the hydrophilic C-terminal end and its nucleotide sequence on total and soluble protein expression. Results The native hydrophilic C-terminal end of GFP was obtained by deleting the C-terminal peptide LeuGlu-6×His, derived from pET22b(+). The corresponding clones induced total and soluble GFP expression that was either slightly increased or dramatically reduced, apparently through reconstruction of the nucleotide sequence around the stop codon in the 3′ region. In the expression-induced clones, the hydrophilic C-terminus showed increased Tat pathway specificity for soluble expression. However, in the expression-reduced clone, after analyzing the role of the 5′ poly(A) coding sequence with a substituted synonymous codon, we proved that the longer 5′ poly(A) coding sequence interacted with the reconstructed 3′ region nucleotide sequence to create a new mRNA tertiary structure between the 5′ and 3′ regions, which resulted in reduced total GFP expression. Further, to recover the reduced expression by changing the 3′ nucleotide sequence, after replacing selected C-terminal 5′ codons and the stop codon in the ORF with synonymous codons, total GFP expression in most of the clones was recovered to the undeleted control level. The insertion of trinucleotides after the stop codon in the 3′-UTR recovered or reduced total GFP expression. RT-PCR revealed that the level of total protein expression was controlled by changes in translational or transcriptional regulation, which were induced or reduced by the substitution or insertion of 3′ region nucleotides. Conclusions We found that the hydrophilic C-terminal end of GFP increased Tat pathway specificity and that the 3′ nucleotide sequence played an important role in total protein expression through translational and transcriptional regulation. These findings may be useful for efficiently producing recombinant proteins as well as for potentially controlling the expression level of specific genes in the body for therapeutic purposes. PMID:23834827
Increasing the fidelity of noncanonical amino acid incorporation in cell-free protein synthesis.
Gan, Qinglei; Fan, Chenguang
2017-11-01
Cell-free protein synthesis provides a robust platform for co-translational incorporation of noncanonical amino acid (ncAA) into proteins to facilitate biological studies and biotechnological applications. Recently, eliminating the activity of release factor 1 has been shown to increase ncAA incorporation in response to amber codons. However, this approach could promote mis-incorporation of canonical amino acids by near cognate suppression. We performed a facile protocol to remove near cognate tRNA isoacceptors of the amber codon from total tRNAs, and used the phosphoserine (Sep) incorporation system as validation. By manipulating codon usage of target genes and tRNA species introduced into the cell-free protein synthesis system, we increased the fidelity of Sep incorporation at a specific position. By removing three near cognate tRNA isoacceptors of the amber stop codon [tRNA Lys , tRNA Tyr , and tRNA Gln (CUG)] from the total tRNA, the near cognate suppression decreased by 5-fold without impairing normal protein synthesis in the cell-free protein synthesis system. Mass spectrometry analyses indicated that the fidelity of ncAA incorporation was improved. Removal of near cognate tRNA isoacceptors of the amber codon could increase ncAA incorporation fidelity towards the amber stop codon in release factor deficiency systems. We provide a general strategy to improve fidelity of ncAA incorporation towards stop, quadruplet and sense codons in cell-free protein synthesis systems. This article is part of a Special Issue entitled "Biochemistry of Synthetic Biology - Recent Developments" Guest Editor: Dr. Ilka Heinemann and Dr. Patrick O'Donoghue. Copyright © 2016 Elsevier B.V. All rights reserved.
Vera-Otarola, Jorge; Solis, Loretto; Soto-Rifo, Ricardo; Ricci, Emiliano P; Pino, Karla; Tischler, Nicole D; Ohlmann, Théophile; Darlix, Jean-Luc; López-Lastra, Marcelo
2012-02-01
The small mRNA (SmRNA) of all Bunyaviridae encodes the nucleocapsid (N) protein. In 4 out of 5 genera in the Bunyaviridae, the smRNA encodes an additional nonstructural protein denominated NSs. In this study, we show that Andes hantavirus (ANDV) SmRNA encodes an NSs protein. Data show that the NSs protein is expressed in the context of an ANDV infection. Additionally, our results suggest that translation initiation from the NSs initiation codon is mediated by ribosomal subunits that have bypassed the upstream N protein initiation codon through a leaky scanning mechanism.
Vera-Otarola, Jorge; Solis, Loretto; Soto-Rifo, Ricardo; Ricci, Emiliano P.; Pino, Karla; Tischler, Nicole D.; Ohlmann, Théophile; Darlix, Jean-Luc
2012-01-01
The small mRNA (SmRNA) of all Bunyaviridae encodes the nucleocapsid (N) protein. In 4 out of 5 genera in the Bunyaviridae, the smRNA encodes an additional nonstructural protein denominated NSs. In this study, we show that Andes hantavirus (ANDV) SmRNA encodes an NSs protein. Data show that the NSs protein is expressed in the context of an ANDV infection. Additionally, our results suggest that translation initiation from the NSs initiation codon is mediated by ribosomal subunits that have bypassed the upstream N protein initiation codon through a leaky scanning mechanism. PMID:22156529
Wang, Huiyan; Fang, Liangyan; Wen, Qing; Lin, Jianqun; Liu, Xiangmei
2017-04-01
Acidithiobacillus ferrooxidans is a model organism for investigating metal sulfide bioleaching. The regulatory mechanism of gene expression by metabolizing various substrates is critical for understanding its role in bioleaching processes. However, no reporter has been successfully employed to study gene expression in A. ferrooxidans to date. In this study, a sensitive and robust reporter system based on β-glucuronidase (GusA) was described for feasible application in A. ferrooxidans. A set of vectors, which contained the transcriptional and translational fusions of gusA, were constructed and employed to analyze promoter activity and efficiency of translation initiation in A. ferrooxidans. Ptac and P2811 were screened out from ten tested promoters and could be used as strong promoters for gene overexpression in A. ferrooxidans. Among the four translational fusions of gusA with different start codons, ATG was most active, followed by TTG and GTG, while CTG showed the least activity. The transcriptional inhibition effect of an IclR-like transcription factor was also observed on its own encoding gene AFE_1668 as well as its neighboring AFE_1667. In addition, the specific chromogenic reaction of GusA could be detected and visualized by colonies of A. ferrooxidans containing gusA expression plasmids. Generally, the established GusA reporter system would be applied not only for quantitative analysis of promoter strength and its transcriptional regulation but also for qualitative colony screening in A. ferrooxidans in the future.
Ochs, Kerstin; Rust, René C.; Niepmann, Michael
1999-01-01
Most eukaryotic initiation factors (eIFs) are required for internal translation initiation at the internal ribosome entry site (IRES) of picornaviruses. eIF4B is incorporated into ribosomal 48S initiation complexes with the IRES RNA of foot-and-mouth disease virus (FMDV). In contrast to the weak interaction of eIF4B with capped cellular mRNAs and its release upon entry of the ribosomal 60S subunit, eIF4B remains tightly associated with the FMDV IRES during formation of complete 80S ribosomes. Binding of eIF4B to the IRES is energy dependent, and binding of the small ribosomal subunit to the IRES requires the previous energy-dependent association of initiation factors with the IRES. The interaction of eIF4B with the IRES in 48S and 80S complexes is independent of the location of the initiator AUG and thus independent of the mechanism by which the small ribosomal subunit is placed at the actual start codon, either by direct internal ribosomal entry or by scanning. eIF4B does not greatly rearrange its binding to the IRES upon entry of the ribosomal subunits, and the interaction of eIF4B with the IRES is independent of the polypyrimidine tract-binding protein, which enhances FMDV translation. PMID:10438840
Sala, Claudia; Forti, Francesca; Magnoni, Francesca; Ghisotti, Daniela
2008-01-01
Background In Mycobacterium tuberculosis and in Mycobacterium smegmatis the furA-katG loci, encoding the FurA regulatory protein and the KatG catalase-peroxidase, are highly conserved. In M. tuberculosis furA-katG constitute a single operon, whereas in M. smegmatis a single mRNA covering both genes could not be found. In both species, specific 5' ends have been identified: the first one, located upstream of the furA gene, corresponds to transcription initiation from the furA promoter; the second one is the katG mRNA 5' end, located in the terminal part of furA. Results In this work we demonstrate by in vitro transcription and by RNA polymerase Chromatin immunoprecipitation that no promoter is present in the M. smegmatis region covering the latter 5' end, suggesting that it is produced by specific processing of longer transcripts. Several DNA fragments of M. tuberculosis and M. smegmatis were inserted in a plasmid between the sigA promoter and the lacZ reporter gene, and expression of the reporter gene was measured. A polypurine sequence, located four bp upstream of the katG translation start codon, increased beta-galactosidase activity and stabilized the lacZ transcript. Mutagenesis of this sequence led to destabilization of the mRNA. Analysis of constructs, in which the polypurine sequence of M. smegmatis was followed by an increasing number of katG codons, demonstrated that mRNA stability requires translation of at least 20 amino acids. In order to define the requirements for the 5' processing of the katG transcript, we created several mutations in this region and analyzed the 5' ends of the transcripts: the distance from the polypurine sequence does not seem to influence the processing, neither the sequence around the cutting point. Only mutations which create a double stranded region around the processing site prevented RNA processing. Conclusion This is the first reported case in mycobacteria, in which both a polypurine sequence and translation initiation are shown to contribute to mRNA stability. The furA-katG mRNA is transcribed from the furA promoter and immediately processed; this processing is prevented by a double stranded RNA at the cutting site, suggesting that the endoribonuclease responsible for the cleavage cuts single stranded RNA. PMID:18394163
A novel start codon mutation of the MERTK gene in a patient with retinitis pigmentosa
Jinda, Worapoj; Poungvarin, Naravat; Taylor, Todd D.; Suzuki, Yutaka; Thongnoppakhun, Wanna; Limwongse, Chanin; Lertrit, Patcharee; Suriyaphol, Prapat
2016-01-01
Purpose Retinitis pigmentosa (RP) is a clinically and genetically heterogeneous group of inherited retinal degenerations characterized by progressive loss of photoreceptor cells and RPE functions. More than 70 causative genes are known to be responsible for RP. This study aimed to identify the causative gene in a patient from a consanguineous family with childhood-onset severe retinal dystrophy. Methods To identify the defective gene, whole exome sequencing was performed. Candidate causative variants were selected and validated using Sanger sequencing. Segregation analysis of the causative gene was performed in additional family members. To verify that the mutation has an effect on protein synthesis, an expression vector containing the first ten amino acids of the mutant protein fused with the DsRed2 fluorescent protein was constructed and transfected into HEK293T cells. Expression of the fusion protein in the transfected cells was measured using fluorescence microscopy. Results By filtering against public variant databases, a novel homozygous missense mutation (c.3G>A) localized in the start codon of the MERTK gene was detected as a potentially pathogenic mutation for autosomal recessive RP. The c.3G>A mutation cosegregated with the disease phenotype in the family. No expression of the first ten amino acids of the MerTK mutant fused with the DsRed2 fluorescent protein was detected in HEK293T cells, indicating that the mutation affects the translation initiation site of the gene that may lead to loss of function of the MerTK signaling pathway. Conclusions We report a novel missense mutation (c.3G>A, p.0?) in the MERTK gene that causes severe vision impairment in a patient. Taken together with previous reports, our results expand the spectrum of MERTK mutations and extend our understanding of the role of the MerTK protein in the pathogenesis of retinitis pigmentosa. PMID:27122965
Dai, Li-Shang; Zhu, Bao-Jian; Qian, Cen; Zhang, Cong-Fen; Li, Jun; Wang, Lei; Wei, Guo-Qing; Liu, Chao-Liang
2016-01-01
The complete mitochondrial genome (mitogenome) of Plutella xylostella (Lepidoptera: Plutellidae) was determined (GenBank accession No. KM023645). The length of this mitogenome is 16,014 bp with 13 protein-coding genes (PCGs), 2 rRNA genes, 22 tRNA genes and an A + T-rich region. It presents the typical gene organization and order for completely sequenced lepidopteran mitogenomes. The nucleotide composition of the genome is highly A + T biased, accounting for 81.48%, with a slightly positive AT skewness (0.005). All PCGs are initiated by typical ATN codons, except for the gene cox1, which uses CGA as its start codon. Some PCGs harbor TA (nad5) or incomplete termination codon T (cox1, cox2, nad2 and nad4), while others use TAA as their termination codons. The A + T-rich region is located between rrnS and trnM with a length of 888 bp.
Brunak, S; Engelbrecht, J
1996-06-01
A direct comparison of experimentally determined protein structures and their corresponding protein coding mRNA sequences has been performed. We examine whether real world data support the hypothesis that clusters of rare codons correlate with the location of structural units in the resulting protein. The degeneracy of the genetic code allows for a biased selection of codons which may control the translational rate of the ribosome, and may thus in vivo have a catalyzing effect on the folding of the polypeptide chain. A complete search for GenBank nucleotide sequences coding for structural entries in the Brookhaven Protein Data Bank produced 719 protein chains with matching mRNA sequence, amino acid sequence, and secondary structure assignment. By neural network analysis, we found strong signals in mRNA sequence regions surrounding helices and sheets. These signals do not originate from the clustering of rare codons, but from the similarity of codons coding for very abundant amino acid residues at the N- and C-termini of helices and sheets. No correlation between the positioning of rare codons and the location of structural units was found. The mRNA signals were also compared with conserved nucleotide features of 16S-like ribosomal RNA sequences and related to mechanisms for maintaining the correct reading frame by the ribosome.
Accuracy of genetic code translation and its orthogonal corruption by aminoglycosides and Mg2+ ions.
Zhang, Jingji; Pavlov, Michael Y; Ehrenberg, Måns
2018-02-16
We studied the effects of aminoglycosides and changing Mg2+ ion concentration on the accuracy of initial codon selection by aminoacyl-tRNA in ternary complex with elongation factor Tu and GTP (T3) on mRNA programmed ribosomes. Aminoglycosides decrease the accuracy by changing the equilibrium constants of 'monitoring bases' A1492, A1493 and G530 in 16S rRNA in favor of their 'activated' state by large, aminoglycoside-specific factors, which are the same for cognate and near-cognate codons. Increasing Mg2+ concentration decreases the accuracy by slowing dissociation of T3 from its initial codon- and aminoglycoside-independent binding state on the ribosome. The distinct accuracy-corrupting mechanisms for aminoglycosides and Mg2+ ions prompted us to re-interpret previous biochemical experiments and functional implications of existing high resolution ribosome structures. We estimate the upper thermodynamic limit to the accuracy, the 'intrinsic selectivity' of the ribosome. We conclude that aminoglycosides do not alter the intrinsic selectivity but reduce the fraction of it that is expressed as the accuracy of initial selection. We suggest that induced fit increases the accuracy and speed of codon reading at unaltered intrinsic selectivity of the ribosome.
Simulations Using Random-Generated DNA and RNA Sequences
ERIC Educational Resources Information Center
Bryce, C. F. A.
1977-01-01
Using a very simple computer program written in BASIC, a very large number of random-generated DNA or RNA sequences are obtained. Students use these sequences to predict complementary sequences and translational products, evaluate base compositions, determine frequencies of particular triplet codons, and suggest possible secondary structures.…
eIF2β is critical for eIF5-mediated GDP-dissociation inhibitor activity and translational control
Jennings, Martin D.; Kershaw, Christopher J.; White, Christopher; Hoyle, Danielle; Richardson, Jonathan P.; Costello, Joseph L.; Donaldson, Ian J.; Zhou, Yu; Pavitt, Graham D.
2016-01-01
In protein synthesis translation factor eIF2 binds initiator tRNA to ribosomes and facilitates start codon selection. eIF2 GDP/GTP status is regulated by eIF5 (GAP and GDI functions) and eIF2B (GEF and GDF activities), while eIF2α phosphorylation in response to diverse signals is a major point of translational control. Here we characterize a growth suppressor mutation in eIF2β that prevents eIF5 GDI and alters cellular responses to reduced eIF2B activity, including control of GCN4 translation. By monitoring the binding of fluorescent nucleotides and initiator tRNA to purified eIF2 we show that the eIF2β mutation does not affect intrinsic eIF2 affinities for these ligands, neither does it interfere with eIF2 binding to 43S pre-initiation complex components. Instead we show that the eIF2β mutation prevents eIF5 GDI stabilizing nucleotide binding to eIF2, thereby altering the off-rate of GDP from eIF2•GDP/eIF5 complexes. This enables cells to grow with reduced eIF2B GEF activity but impairs activation of GCN4 targets in response to amino acid starvation. These findings provide support for the importance of eIF5 GDI activity in vivo and demonstrate that eIF2β acts in concert with eIF5 to prevent premature release of GDP from eIF2γ and thereby ensure tight control of protein synthesis initiation. PMID:27458202
eIF2β is critical for eIF5-mediated GDP-dissociation inhibitor activity and translational control.
Jennings, Martin D; Kershaw, Christopher J; White, Christopher; Hoyle, Danielle; Richardson, Jonathan P; Costello, Joseph L; Donaldson, Ian J; Zhou, Yu; Pavitt, Graham D
2016-11-16
In protein synthesis translation factor eIF2 binds initiator tRNA to ribosomes and facilitates start codon selection. eIF2 GDP/GTP status is regulated by eIF5 (GAP and GDI functions) and eIF2B (GEF and GDF activities), while eIF2α phosphorylation in response to diverse signals is a major point of translational control. Here we characterize a growth suppressor mutation in eIF2β that prevents eIF5 GDI and alters cellular responses to reduced eIF2B activity, including control of GCN4 translation. By monitoring the binding of fluorescent nucleotides and initiator tRNA to purified eIF2 we show that the eIF2β mutation does not affect intrinsic eIF2 affinities for these ligands, neither does it interfere with eIF2 binding to 43S pre-initiation complex components. Instead we show that the eIF2β mutation prevents eIF5 GDI stabilizing nucleotide binding to eIF2, thereby altering the off-rate of GDP from eIF2•GDP/eIF5 complexes. This enables cells to grow with reduced eIF2B GEF activity but impairs activation of GCN4 targets in response to amino acid starvation. These findings provide support for the importance of eIF5 GDI activity in vivo and demonstrate that eIF2β acts in concert with eIF5 to prevent premature release of GDP from eIF2γ and thereby ensure tight control of protein synthesis initiation. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Cimdins, Annika; Klinkert, Birgit; Aschke-Sonnenborn, Ursula; Kaiser, Friederike M; Kortmann, Jens; Narberhaus, Franz
2014-01-01
Cyanobacteria constitute a heterogeneous phylum of oxygen-producing, photosynthetic prokaryotes. They are susceptible to various stress conditions like heat, salt, or light stress, all inducing the cyanobacterial heat shock response (HSR). Cyanobacterial small heat shock proteins (sHsps) are known to preserve thylakoid membrane integrity under stress conditions, thereby protecting the photosynthesis machinery. In Synechocystis sp PCC 6803, synthesis of the sHsp Hsp17 is regulated by an RNA thermometer (RNAT) in the 5′-untranslated region (5′-UTR) of the hsp17 mRNA. RNATs are direct temperature sensors that control expression of many bacterial heat shock and virulence genes. They hinder translation at low temperatures by base pairing, thus blocking ribosome access to the mRNA. To explore the temperature range in which RNATs act, we studied various RNAT candidates upstream of sHsp genes from mesophilic and thermophilic cyanobacteria. The mesophilic cyanobacteria Anabaena variabilis and Nostoc sp chromosomally encode two sHsps each. Reporter gene studies suggested RNAT-mediated post-transcriptional regulation of shsp expression in both organisms. Detailed structural analysis of the two A. variabilis candidates revealed two novel RNAT types. The first, avashort, regulates translation primarily by masking of the AUG translational start codon. The second, featuring an extended initial hairpin, thus named avalong, presumably makes use of complex tertiary interaction. The 5′-UTR of the small heat shock gene hspA in the thermophile Thermosynechococcus elongatus is predicted to adopt an extended secondary structure. Structure probing revealed that the ribosome binding site was blocked at temperatures below 55 °C. The results of this study demonstrate that cyanobacteria commonly use RNATs to control expression of their small heat shock genes. PMID:24755616
A model for competition for ribosomes in the cell
Raveh, Alon; Margaliot, Michael; Sontag, Eduardo D.; Tuller, Tamir
2016-01-01
A single mammalian cell includes an order of 104–105 mRNA molecules and as many as 105–106 ribosomes. Large-scale simultaneous mRNA translation induces correlations between the mRNA molecules, as they all compete for the finite pool of available ribosomes. This has important implications for the cell's functioning and evolution. Developing a better understanding of the intricate correlations between these simultaneous processes, rather than focusing on the translation of a single isolated transcript, should help in gaining a better understanding of mRNA translation regulation and the way elongation rates affect organismal fitness. A model of simultaneous translation is specifically important when dealing with highly expressed genes, as these consume more resources. In addition, such a model can lead to more accurate predictions that are needed in the interconnection of translational modules in synthetic biology. We develop and analyse a general dynamical model for large-scale simultaneous mRNA translation and competition for ribosomes. This is based on combining several ribosome flow models (RFMs) interconnected via a pool of free ribosomes. We use this model to explore the interactions between the various mRNA molecules and ribosomes at steady state. We show that the compound system always converges to a steady state and that it always entrains or phase locks to periodically time-varying transition rates in any of the mRNA molecules. We then study the effect of changing the transition rates in one mRNA molecule on the steady-state translation rates of the other mRNAs that results from the competition for ribosomes. We show that increasing any of the codon translation rates in a specific mRNA molecule yields a local effect, an increase in the translation rate of this mRNA, and also a global effect, the translation rates in the other mRNA molecules all increase or all decrease. These results suggest that the effect of codon decoding rates of endogenous and heterologous mRNAs on protein production is more complicated than previously thought. In addition, we show that increasing the length of an mRNA molecule decreases the production rate of all the mRNAs. PMID:26962028
Sequences 5' to translation start regulate expression of petunia rbcS genes.
Dean, C; Favreau, M; Bedbrook, J; Dunsmuir, P
1989-01-01
The promoter sequences that contribute to quantitative differences in expression of the petunia genes (rbcS) encoding the small subunit of ribulose bisphosphate carboxylase have been characterized. The promoter regions of the two most abundantly expressed petunia rbcS genes, SSU301 and SSU611, show sequence similarity not present in other rbcS genes. We investigated the significance of these and other sequences by adding specific regions from the SSU301 promoter (the most strongly expressed gene) to equivalent regions in the SSU911 promoter (the least strongly expressed gene) and assaying the expression of the fusions in transgenic tobacco plants. In this way, we characterized an SSU301 promoter region (either from -285 to -178 or -291 to -204) which, when added to SSU911, in either orientation, increased SSU911 expression 25-fold. This increase was equivalent to that caused by addition of the entire SSU301 5'-flanking region. Replacement of SSU911 promoter sequences between -198 and the start codon with sequences from the equivalent region of SSU301 did not increase SSU911 expression significantly. The -291 to -204 SSU301 promoter fragment contributes significantly to quantitative differences in expression between the petunia rbcS genes. PMID:2535543
Exon 2-mediated c-myc mRNA decay in vivo is independent of its translation.
Pistoi, S; Roland, J; Babinet, C; Morello, D
1996-01-01
We have previously shown that the steady-state level of c-myc mRNA in vivo is primarily controlled by posttranscriptional regulatory mechanisms. To identify the sequences involved in this process, we constructed a series of H-2/myc transgenic lines in which various regions of the human c-MYC gene were placed under the control of the quasi-ubiquitous H-2K class I regulatory sequences. We demonstrated that the presence of one of the two coding exons, exon 2 or exon 3, is sufficient to confer a level of expression of transgene mRNA similar to that of endogenous c-myc in various adult tissues as well as after partial hepatectomy or after protein synthesis inhibition. We now focus on the molecular mechanisms involved in modulation of expression of mRNAs containing c-myc exon 2 sequences, with special emphasis on the coupling between translation and c-myc mRNA turnover. We have undertaken an analysis of expression, both at the mRNA level and at the protein level, of new transgenic constructs in which the translation is impaired either by disruption of the initiation codon or by addition of stop codons upstream of exon 2. Our results show that the translation of c-myc exon 2 is not required for regulated expression of the transgene in the different situations analyzed, and therefore they indicate that the mRNA destabilizing function of exon 2 is independent of translation by ribosomes. Our investigations also reveal that, in the thymus, some H-2/myc transgenes express high levels of mRNA but low levels of protein. Besides the fact that these results suggest the existence of tissue-specific mechanisms that control c-myc translatability in vivo, they also bring another indication of the uncoupling of c-myc mRNA translation and degradation. PMID:8756668
USDA-ARS?s Scientific Manuscript database
Potato leafroll virus (PLRV) produces a readthrough protein (RTP) via translational readthrough of the coat protein amber stop codon. The RTP functions as a structural component of the virion and as a non-incorporated protein in concert with numerous insect and plant proteins to regulate virus movem...
Zhang, Yulong; Shao, Dandan; Cai, Miao; Yin, Hong; Zhang, Daochuan
2016-01-01
The complete mitochondrial genome of Gryllotalpa unispina was 15,513 bp in length and contained 70.9% AT. All G. unispina protein-coding sequences except for the nad2 started with a typical ATN codon. The usual termination codons (TAA) and incomplete stop codons (T) were found from 13 protein-coding genes. All tRNA genes were folded into the typical cloverleaf secondary structure, except trnS(AGN) lacking the dihydrouridine arm. The sizes of the large and small ribosomal RNA genes were 1245 and 725 bp, respectively. The A + T-rich region was 917 bp in length with 76.8%. The orientation and gene order of the G. unispina mitogenome were identical to the G. orientalis and G. pluvialis, there was no phenomenon of "DK rearrangement" which has been widely reported in Caelifera.
Felsenstein, K M; Goff, S P
1992-01-01
The gag-pol polyprotein of the murine and feline leukemia viruses is expressed by translational readthrough of a UAG terminator codon at the 3' end of the gag gene. To explore the cis-acting sequence requirements for the readthrough event in vivo, we generated a library of mutants of the Moloney murine leukemia virus with point mutations near the terminator codon and tested the mutant viral DNAs for the ability to direct synthesis of the gag-pol fusion protein and formation of infectious virus. The analysis showed that sequences 3' to the terminator are necessary and sufficient for the process. The results do not support a role for one proposed stem-loop structure that includes the terminator but are consistent with the involvement of another stem-loop 3' to the terminator. One mutant, containing two compensatory changes in this stem structure, was temperature sensitive for replication and for formation of the gag-pol protein. The results suggest that RNA sequence and structure are critical determinants of translational readthrough in vivo. Images PMID:1404606
Ensemble cryo-EM elucidates the mechanism of translation fidelity
Loveland, Anna B.; Demo, Gabriel; Grigorieff, Nikolaus; Korostelev, Andrei A.
2017-01-01
SUMMARY Faithful gene translation depends on accurate decoding, whose structural mechanism remains a matter of debate. Ribosomes decode mRNA codons by selecting cognate aminoacyl-tRNAs delivered by EF-Tu. We present high-resolution structural ensembles of ribosomes with cognate or near-cognate aminoacyl-tRNAs delivered by EF-Tu. Both cognate and near-cognate tRNA anticodons explore the A site of an open 30S subunit, while inactive EF-Tu is separated from the 50S subunit. A transient conformation of decoding-center nucleotide G530 stabilizes the cognate codon-anticodon helix, initiating step-wise “latching” of the decoding center. The resulting 30S domain closure docks EF-Tu at the sarcin-ricin loop of the 50S subunit, activating EF-Tu for GTP hydrolysis and ensuing aminoacyl-tRNA accommodation. By contrast, near-cognate complexes fail to induce the G530 latch, thus favoring open 30S pre-accommodation intermediates with inactive EF-Tu. This work unveils long-sought structural differences between the pre-accommodation of cognate and near-cognate tRNA that elucidate the mechanism of accurate decoding. PMID:28538735
Structural insights into translational fidelity.
Ogle, James M; Ramakrishnan, V
2005-01-01
The underlying basis for the accuracy of protein synthesis has been the subject of over four decades of investigation. Recent biochemical and structural data make it possible to understand at least in outline the structural basis for tRNA selection, in which codon recognition by cognate tRNA results in the hydrolysis of GTP by EF-Tu over 75 A away. The ribosome recognizes the geometry of codon-anticodon base pairing at the first two positions but monitors the third, or wobble position, less stringently. Part of the additional binding energy of cognate tRNA is used to induce conformational changes in the ribosome that stabilize a transition state for GTP hydrolysis by EF-Tu and subsequently result in accelerated accommodation of tRNA into the peptidyl transferase center. The transition state for GTP hydrolysis is characterized, among other things, by a distorted tRNA. This picture explains a large body of data on the effect of antibiotics and mutations on translational fidelity. However, many fundamental questions remain, such as the mechanism of activation of GTP hydrolysis by EF-Tu, and the relationship between decoding and frameshifting.
Molecular Mechanism of Scanning and Start Codon Selection in Eukaryotes
Hinnebusch, Alan G.
2011-01-01
Summary: The correct translation of mRNA depends critically on the ability to initiate at the right AUG codon. For most mRNAs in eukaryotic cells, this is accomplished by the scanning mechanism, wherein the small (40S) ribosomal subunit attaches to the 5′ end of the mRNA and then inspects the leader base by base for an AUG in a suitable context, using complementarity with the anticodon of methionyl initiator tRNA (Met-tRNAiMet) as the key means of identifying AUG. Over the past decade, a combination of yeast genetics, biochemical analysis in reconstituted systems, and structural biology has enabled great progress in deciphering the mechanism of ribosomal scanning. A robust molecular model now exists, describing the roles of initiation factors, notably eukaryotic initiation factor 1 (eIF1) and eIF1A, in stabilizing an “open” conformation of the 40S subunit with Met-tRNAiMet bound in a low-affinity state conducive to scanning and in triggering rearrangement into a “closed” conformation incompatible with scanning, which features Met-tRNAiMet more tightly bound to the “P” site and base paired with AUG. It has also emerged that multiple DEAD-box RNA helicases participate in producing a single-stranded “landing pad” for the 40S subunit and in removing the secondary structure to enable the mRNA to traverse the 40S mRNA-binding channel in the single-stranded form for base-by-base inspection in the P site. PMID:21885680
Dean, Kimberly M; Grayhack, Elizabeth J
2012-12-01
We have developed a robust and sensitive method, called RNA-ID, to screen for cis-regulatory sequences in RNA using fluorescence-activated cell sorting (FACS) of yeast cells bearing a reporter in which expression of both superfolder green fluorescent protein (GFP) and yeast codon-optimized mCherry red fluorescent protein (RFP) is driven by the bidirectional GAL1,10 promoter. This method recapitulates previously reported progressive inhibition of translation mediated by increasing numbers of CGA codon pairs, and restoration of expression by introduction of a tRNA with an anticodon that base pairs exactly with the CGA codon. This method also reproduces effects of paromomycin and context on stop codon read-through. Five key features of this method contribute to its effectiveness as a selection for regulatory sequences: The system exhibits greater than a 250-fold dynamic range, a quantitative and dose-dependent response to known inhibitory sequences, exquisite resolution that allows nearly complete physical separation of distinct populations, and a reproducible signal between different cells transformed with the identical reporter, all of which are coupled with simple methods involving ligation-independent cloning, to create large libraries. Moreover, we provide evidence that there are sequences within a 9-nt library that cause reduced GFP fluorescence, suggesting that there are novel cis-regulatory sequences to be found even in this short sequence space. This method is widely applicable to the study of both RNA-mediated and codon-mediated effects on expression.
Bioinformatic analysis suggests that the Orbivirus VP6 cistron encodes an overlapping gene
Firth, Andrew E
2008-01-01
Background The genus Orbivirus includes several species that infect livestock – including Bluetongue virus (BTV) and African horse sickness virus (AHSV). These viruses have linear dsRNA genomes divided into ten segments, all of which have previously been assumed to be monocistronic. Results Bioinformatic evidence is presented for a short overlapping coding sequence (CDS) in the Orbivirus genome segment 9, overlapping the VP6 cistron in the +1 reading frame. In BTV, a 77–79 codon AUG-initiated open reading frame (hereafter ORFX) is present in all 48 segment 9 sequences analysed. The pattern of base variations across the 48-sequence alignment indicates that ORFX is subject to functional constraints at the amino acid level (even when the constraints due to coding in the overlapping VP6 reading frame are taken into account; MLOGD software). In fact the translated ORFX shows greater amino acid conservation than the overlapping region of VP6. The ORFX AUG codon has a strong Kozak context in all 48 sequences. Each has only one or two upstream AUG codons, always in the VP6 reading frame, and (with a single exception) always with weak or medium Kozak context. Thus, in BTV, ORFX may be translated via leaky scanning. A long (83–169 codon) ORF is present in a corresponding location and reading frame in all other Orbivirus species analysed except Saint Croix River virus (SCRV; the most divergent). Again, the pattern of base variations across sequence alignments indicates multiple coding in the VP6 and ORFX reading frames. Conclusion At ~9.5 kDa, the putative ORFX product in BTV is too small to appear on most published protein gels. Nonetheless, a review of past literature reveals a number of possible detections. We hope that presentation of this bioinformatic analysis will stimulate an attempt to experimentally verify the expression and functional role of ORFX, and hence lead to a greater understanding of the molecular biology of these important pathogens. PMID:18489030
L-MPZ, a Novel Isoform of Myelin P0, Is Produced by Stop Codon Readthrough*
Yamaguchi, Yoshihide; Hayashi, Akiko; Campagnoni, Celia W.; Kimura, Akio; Inuzuka, Takashi; Baba, Hiroko
2012-01-01
Myelin protein zero (P0 or MPZ) is a major myelin protein (∼30 kDa) expressed in the peripheral nervous system (PNS) in terrestrial vertebrates. Several groups have detected a P0-related 36-kDa (or 35-kDa) protein that is expressed in the PNS as an antigen for the serum IgG of patients with neuropathy. The molecular structure and function of this 36-kDa protein are, however, still unknown. We hypothesized that the 36-kDa protein may be derived from P0 mRNA by stop codon readthrough. We found a highly conserved region after the regular stop codon in predicted sequences from the 3′-UTR of P0 in higher animals. MS of the 36-kDa protein revealed that both P0 peptides and peptides deduced from the P0 3′-UTR sequence were found among the tryptic fragments. In transfected cells and in an in vitro transcription/translation system, the 36-kDa molecule was also produced from the identical mRNA that produced P0. We designated this 36-kDa molecule as large myelin protein zero (L-MPZ), a novel isoform of P0 that contains an additional domain at the C terminus. In the PNS, L-MPZ was localized in compact myelin. In transfected cells, just like P0, L-MPZ was localized at cell-cell adhesion sites in the plasma membrane. These results suggest that L-MPZ produced by the stop codon readthrough mechanism is potentially involved in myelination. Since this is the first finding of stop codon readthrough in a common mammalian protein, detailed analysis of L-MPZ expression will help to understand the mechanism of stop codon readthrough in mammals. PMID:22457349
Analysis of codon usage in beta-tubulin sequences of helminths.
von Samson-Himmelstjerna, G; Harder, A; Failing, K; Pape, M; Schnieder, T
2003-07-01
Codon usage bias has been shown to be correlated with gene expression levels in many organisms, including the nematode Caenorhabditis elegans. Here, the codon usage (cu) characteristics for a set of currently available beta-tubulin coding sequences of helminths were assessed by calculating several indices, including the effective codon number (Nc), the intrinsic codon deviation index (ICDI), the P2 value and the mutational response index (MRI). The P2 value gives a measure of translational pressure, which has been shown to be correlated to high gene expression levels in some organisms, but it has not yet been analysed in that respect in helminths. For all but two of the C. elegans beta-tubulin coding sequences investigated, the P2 value was the only index that indicated the presence of codon usage bias. Therefore, we propose that in general the helminth beta-tubulin sequences investigated here are not expressed at high levels. Furthermore, we calculated the correlation coefficients for the cu patterns of the helminth beta-tubulin sequences compared with those of highly expressed genes in organisms such as Escherichia coli and C. elegans. It was found that beta-tubulin cu patterns for all sequences of members of the Strongylida were significantly correlated to those for highly expressed C. elegans genes. This approach provides a new measure for comparing the adaptation of cu of a particular coding sequence with that of highly expressed genes in possible expression systems.Finally, using the cu patterns of the sequences studied, a phylogenetic tree was constructed. The topology of this tree was very much in concordance with that of a phylogeny based on small subunit ribosomal DNA sequence alignments.
Shao, Yuan-jun; Hu, Xian-qiong; Peng, Guang-da; Wang, Rui-xian; Gao, Rui-na; Lin, Chao; Shen, Wei-de; Li, Rui; Li, Bing
2012-12-01
The first complete mitochondrial genome (mitogenome) of Tachinidae Exorista sorbillans (Diptera) is sequenced by PCR-based approach. The circular mitogenome is 14,960 bp long and has the representative mitochondrial gene (mt gene) organization and order of Diptera. All protein-coding sequences are initiated with ATN codon; however, the only exception is Cox I gene, which has a 4-bp ATCG putative start codon. Ten of the thirteen protein-coding genes have a complete termination codon (TAA), but the rest are seated on the H strand with incomplete codons. The mitogenome of E. sorbillans is biased toward A+T content at 78.4 %, and the strand-specific bias is in reflection of the third codon positions of mt genes, and their T/C ratios as strand indictor are higher on the H strand more than those on the L strand pointing at any strain of seven Diptera flies. The length of the A+T-rich region of E. sorbillans is 106 bp, including a tandem triple copies of a13-bp fragment. Compared to Haematobia irritans, E. sorbillans holds distant relationship with Drosophila. Phylogenetic topologies based on the amino acid sequences, supporting that E. sorbillans (Tachinidae) is clustered with strains of Calliphoridae and Oestridae, and superfamily Oestroidea are polyphyletic groups with Muscidae in a clade.
Lintner, Nathanael G.; McClure, Kim F.; Petersen, Donna; Londregan, Allyn T.; Piotrowski, David W.; Wei, Liuqing; Xiao, Jun; Bolt, Michael; Loria, Paula M.; Maguire, Bruce; Geoghegan, Kieran F.; Huang, Austin; Rolph, Tim; Liras, Spiros; Doudna, Jennifer A.; Dullea, Robert G.
2017-01-01
Proprotein convertase subtilisin/kexin type 9 (PCSK9) plays a key role in regulating the levels of plasma low-density lipoprotein cholesterol (LDL-C). Here, we demonstrate that the compound PF-06446846 inhibits translation of PCSK9 by inducing the ribosome to stall around codon 34, mediated by the sequence of the nascent chain within the exit tunnel. We further show that PF-06446846 reduces plasma PCSK9 and total cholesterol levels in rats following oral dosing. Using ribosome profiling, we demonstrate that PF-06446846 is highly selective for the inhibition of PCSK9 translation. The mechanism of action employed by PF-06446846 reveals a previously unexpected tunability of the human ribosome that allows small molecules to specifically block translation of individual transcripts. PMID:28323820
Papadopoulos, Evangelos; Jenni, Simon; Kabha, Eihab; Takrouri, Khuloud J; Yi, Tingfang; Salvi, Nicola; Luna, Rafael E; Gavathiotis, Evripidis; Mahalingam, Poornachandran; Arthanari, Haribabu; Rodriguez-Mias, Ricard; Yefidoff-Freedman, Revital; Aktas, Bertal H; Chorev, Michael; Halperin, Jose A; Wagner, Gerhard
2014-08-05
The interaction of the eukaryotic translation initiation factor eIF4E with the initiation factor eIF4G recruits the 40S ribosomal particle to the 5' end of mRNAs, facilitates scanning to the AUG start codon, and is crucial for eukaryotic translation of nearly all genes. Efficient recruitment of the 40S particle is particularly important for translation of mRNAs encoding oncoproteins and growth-promoting factors, which often harbor complex 5' UTRs and require efficient initiation. Thus, inhibiting the eIF4E/eIF4G interaction has emerged as a previously unpursued route for developing anticancer agents. Indeed, we discovered small-molecule inhibitors of this eIF4E/eIF4G interaction (4EGIs) that inhibit translation initiation both in vitro and in vivo and were used successfully in numerous cancer-biology and neurobiology studies. However, their detailed molecular mechanism of action has remained elusive. Here, we show that the eIF4E/eIF4G inhibitor 4EGI-1 acts allosterically by binding to a site on eIF4E distant from the eIF4G binding epitope. Data from NMR mapping and high-resolution crystal structures are congruent with this mechanism, where 4EGI-1 attaches to a hydrophobic pocket of eIF4E between β-sheet2 (L60-T68) and α-helix1 (E69-N77), causing localized conformational changes mainly in the H78-L85 region. It acts by unfolding a short 310-helix (S82-L85) while extending α-helix1 by one turn (H78-S82). This unusual helix rearrangement has not been seen in any previous eIF4E structure and reveals elements of an allosteric inhibition mechanism leading to the dislocation of eIF4G from eIF4E.
Diene, Seydina M; Merhej, Vicky; Henry, Mireille; El Filali, Adil; Roux, Véronique; Robert, Catherine; Azza, Saïd; Gavory, Frederick; Barbe, Valérie; La Scola, Bernard; Raoult, Didier; Rolain, Jean-Marc
2013-02-01
Here, we sequenced the 5,419,609 bp circular genome of an Enterobacter aerogenes clinical isolate that killed a patient and was resistant to almost all current antibiotics (except gentamicin) commonly used to treat Enterobacterial infections, including colistin. Genomic and phylogenetic analyses explain the discrepancies of this bacterium and show that its core genome originates from another genus, Klebsiella. Atypical characteristics of this bacterium (i.e., motility, presence of ornithine decarboxylase, and lack of urease activity) are attributed to genomic mosaicism, by acquisition of additional genes, such as the complete 60,582 bp flagellar assembly operon acquired "en bloc" from the genus Serratia. The genealogic tree of the 162,202 bp multidrug-resistant conjugative plasmid shows that it is a chimera of transposons and integrative conjugative elements from various bacterial origins, resembling a rhizome. Moreover, we demonstrate biologically that a G53S mutation in the pmrA gene results in colistin resistance. E. aerogenes has a large RNA population comprising 8 rRNA operons and 87 cognate tRNAs that have the ability to translate transferred genes that use different codons, as exemplified by the significantly different codon usage between genes from the core genome and the "mobilome." On the basis of our findings, the evolution of this bacterium to become a "killer bug" with new genomic repertoires was from three criteria that are "opportunity, power, and usage" to indicate a sympatric lifestyle: "opportunity" to meet other bacteria and exchange foreign sequences since this bacteria was similar to sympatric bacteria; "power" to integrate these foreign sequences such as the acquisition of several mobile genetic elements (plasmids, integrative conjugative element, prophages, transposons, flagellar assembly system, etc.) found in his genome; and "usage" to have the ability to translate these sequences including those from rare codons to serve as a translator of foreign languages.
Akhmaloka; Susilowati, Prima Endang; Subandi; Madayanti, Fida
2008-01-01
Termination translation in Saccharomyces cerevisiae is controlled by two interacting polypeptide chain release factors, eRF1 and eRF3. Two regions in human eRF1, position at 281-305 and position at 411-415, were proposed to be involved on the interaction to eRF3. In this study we have constructed and characterized yeast eRF1 mutant at position 410 (correspond to 415 human eRF1) from tyrosine to serine residue resulting eRF1(Y410S). The mutations did not affect the viability and temperature sensitivity of the cell. The stop codons suppression of the mutant was analyzed in vivo using PGK-stop codon-LACZ gene fusion and showed that the suppression of the mutant was significantly increased in all of codon terminations. The suppression on UAG codon was the highest increased among the stop codons by comparing the suppression of the wild type respectively. In vitro interaction between eRF1 (mutant and wild type) to eRF3 were carried out using eRF1-(His)6 and eRF1(Y410S)-(His)6 expressed in Escherichia coli and indigenous Saccharomyces cerevisiae eRF3. The results showed that the binding affinity of eRF1(Y410S) to eRF3 was decreased up to 20% of the wild type binding affinity. Computer modeling analysis using Swiss-Prot and Amber version 9.0 programs revealed that the overall structure of eRF1(Y410S) has no significant different with the wild type. However, substitution of tyrosine to serine triggered the structural change on the other motif of C-terminal domain of eRF1. The data suggested that increasing stop codon suppression and decreasing of the binding affinity of eRF1(Y410S) were probably due to the slight modification on the structure of the C-terminal domain. PMID:18463713
Floquet, Célia; Hatin, Isabelle; Rousset, Jean-Pierre; Bidou, Laure
2012-01-01
The efficiency of translation termination depends on the nature of the stop codon and the surrounding nucleotides. Some molecules, such as aminoglycoside antibiotics (gentamicin), decrease termination efficiency and are currently being evaluated for diseases caused by premature termination codons. However, the readthrough response to treatment is highly variable and little is known about the rules governing readthrough level and response to aminoglycosides. In this study, we carried out in-depth statistical analysis on a very large set of nonsense mutations to decipher the elements of nucleotide context responsible for modulating readthrough levels and gentamicin response. We quantified readthrough for 66 sequences containing a stop codon, in the presence and absence of gentamicin, in cultured mammalian cells. We demonstrated that the efficiency of readthrough after treatment is determined by the complex interplay between the stop codon and a larger sequence context. There was a strong positive correlation between basal and induced readthrough levels, and a weak negative correlation between basal readthrough level and gentamicin response (i.e. the factor of increase from basal to induced readthrough levels). The identity of the stop codon did not affect the response to gentamicin treatment. In agreement with a previous report, we confirm that the presence of a cytosine in +4 position promotes higher basal and gentamicin-induced readthrough than other nucleotides. We highlight for the first time that the presence of a uracil residue immediately upstream from the stop codon is a major determinant of the response to gentamicin. Moreover, this effect was mediated by the nucleotide itself, rather than by the amino-acid or tRNA corresponding to the −1 codon. Finally, we point out that a uracil at this position associated with a cytosine at +4 results in an optimal gentamicin-induced readthrough, which is the therapeutically relevant variable. PMID:22479203
Accuracy of genetic code translation and its orthogonal corruption by aminoglycosides and Mg2+ ions
Zhang, Jingji
2018-01-01
Abstract We studied the effects of aminoglycosides and changing Mg2+ ion concentration on the accuracy of initial codon selection by aminoacyl-tRNA in ternary complex with elongation factor Tu and GTP (T3) on mRNA programmed ribosomes. Aminoglycosides decrease the accuracy by changing the equilibrium constants of ‘monitoring bases’ A1492, A1493 and G530 in 16S rRNA in favor of their ‘activated’ state by large, aminoglycoside-specific factors, which are the same for cognate and near-cognate codons. Increasing Mg2+ concentration decreases the accuracy by slowing dissociation of T3 from its initial codon- and aminoglycoside-independent binding state on the ribosome. The distinct accuracy-corrupting mechanisms for aminoglycosides and Mg2+ ions prompted us to re-interpret previous biochemical experiments and functional implications of existing high resolution ribosome structures. We estimate the upper thermodynamic limit to the accuracy, the ‘intrinsic selectivity’ of the ribosome. We conclude that aminoglycosides do not alter the intrinsic selectivity but reduce the fraction of it that is expressed as the accuracy of initial selection. We suggest that induced fit increases the accuracy and speed of codon reading at unaltered intrinsic selectivity of the ribosome. PMID:29267976
Chen, Siyu; Li, Ke; Cao, Wenqing; Wang, Jia; Zhao, Tong; Huan, Qing; Yang, Yu-Fei; Wu, Shaohuan; Qian, Wenfeng
2017-01-01
Abstract Codon usage bias (CUB) refers to the observation that synonymous codons are not used equally frequently in a genome. CUB is stronger in more highly expressed genes, a phenomenon commonly explained by stronger natural selection on translational accuracy and/or efficiency among these genes. Nevertheless, this phenomenon could also occur if CUB regulates gene expression at the mRNA level, a hypothesis that has not been tested until recently. Here, we attempt to quantify the impact of synonymous mutations on mRNA level in yeast using 3,556 synonymous variants of a heterologous gene encoding green fluorescent protein (GFP) and 523 synonymous variants of an endogenous gene TDH3. We found that mRNA level was positively correlated with CUB among these synonymous variants, demonstrating a direct role of CUB in regulating transcript concentration, likely via regulating mRNA degradation rate, as our additional experiments suggested. More importantly, we quantified the effects of individual synonymous mutations on mRNA level and found them dependent on 1) CUB and 2) mRNA secondary structure, both in proximal sequence contexts. Our study reveals the pleiotropic effects of synonymous codon usage and provides an additional explanation for the well-known correlation between CUB and gene expression level. PMID:28961875
Jacobo, Sarah Melissa P; Deangelis, Margaret M; Kim, Ivana K; Kazlauskas, Andrius
2013-05-01
Synonymous single nucleotide polymorphisms (SNPs) within a transcript's coding region produce no change in the amino acid sequence of the protein product and are therefore intuitively assumed to have a neutral effect on protein function. We report that two common variants of high-temperature requirement A1 (HTRA1) that increase the inherited risk of neovascular age-related macular degeneration (NvAMD) harbor synonymous SNPs within exon 1 of HTRA1 that convert common codons for Ala34 and Gly36 to less frequently used codons. The frequent-to-rare codon conversion reduced the mRNA translation rate and appeared to compromise HtrA1's conformation and function. The protein product generated from the SNP-containing cDNA displayed enhanced susceptibility to proteolysis and a reduced affinity for an anti-HtrA1 antibody. The NvAMD-associated synonymous polymorphisms lie within HtrA1's putative insulin-like growth factor 1 (IGF-1) binding domain. They reduced HtrA1's abilities to associate with IGF-1 and to ameliorate IGF-1-stimulated signaling events and cellular responses. These observations highlight the relevance of synonymous codon usage to protein function and implicate homeostatic protein quality control mechanisms that may go awry in NvAMD.
Sjöholm, Johannes; Oliveira, Paulo; Lindblad, Peter
2007-01-01
The filamentous, heterocystous cyanobacterium Nostoc sp. strain PCC 7120 (Anabaena sp. strain PCC 7120) possesses an uptake hydrogenase and a bidirectional enzyme, the latter being capable of catalyzing both H2 production and evolution. The completely sequenced genome of Nostoc sp. strain PCC 7120 reveals that the five structural genes encoding the bidirectional hydrogenase (hoxEFUYH) are separated in two clusters at a distance of approximately 8.8 kb. The transcription of the hox genes was examined under nitrogen-fixing conditions, and the results demonstrate that the cluster containing hoxE and hoxF can be transcribed as one polycistronic unit together with the open reading frame alr0750. The second cluster, containing hoxU, hoxY, and hoxH, is transcribed together with alr0763 and alr0765, located between the hox genes. Moreover, alr0760 and alr0761 form an additional larger operon. Nevertheless, Northern blot hybridizations revealed a rather complex transcription pattern in which the different hox genes are expressed differently. Transcriptional start points (TSPs) were identified 66 and 57 bp upstream from the start codon of alr0750 and hoxU, respectively. The transcriptions of the two clusters containing the hox genes are both induced under anaerobic conditions concomitantly with the induction of a higher level of hydrogenase activity. An additional TSP, within the annotated alr0760, 244 bp downstream from the suggested translation start codon, was identified. Electrophoretic mobility shift assays with purified LexA from Nostoc sp. strain PCC 7120 demonstrated specific interactions between the transcriptional regulator and both hox promoter regions. However, when LexA from Synechocystis sp. strain PCC 6803 was used, the purified protein interacted only with the promoter region of the alr0750-hoxE-hoxF operon. A search of the whole Nostoc sp. strain PCC 7120 genome demonstrated the presence of 216 putative LexA binding sites in total, including recA and recF. This indicates that, in addition to the bidirectional hydrogenase gene, a number of other genes, including open reading frames connected to DNA replication, recombination, and repair, may be part of the LexA regulatory network in Nostoc sp. strain PCC 7120. PMID:17630298
Genomic adaptation of the ISA virus to Salmo salar codon usage
2013-01-01
Background The ISA virus (ISAV) is an Orthomyxovirus whose genome encodes for at least 10 proteins. Low protein identity and lack of genetic tools have hampered the study of the molecular mechanism behind its virulence. It has been shown that viral codon usage controls several processes such as translational efficiency, folding, tuning of protein expression, antigenicity and virulence. Despite this, the possible role that adaptation to host codon usage plays in virulence and viral evolution has not been studied in ISAV. Methods Intergenomic adaptation between viral and host genomes was calculated using the codon adaptation index score with EMBOSS software and the Kazusa database. Classification of host genes according to GeneOnthology was performed using Blast2go. A non parametric test was applied to determine the presence of significant correlations among CAI, mortality and time. Results Using the codon adaptation index (CAI) score, we found that the encoding genes for nucleoprotein, matrix protein M1 and antagonist of Interferon I signaling (NS1) are the ISAV genes that are more adapted to host codon usage, in agreement with their requirement for production of viral particles and inactivation of antiviral responses. Comparison to host genes showed that ISAV shares CAI values with less than 0.45% of Salmo salar genes. GeneOntology classification of host genes showed that ISAV genes share CAI values with genes from less than 3% of the host biological process, far from the 14% shown by Influenza A viruses and closer to the 5% shown by Influenza B and C. As well, we identified a positive correlation (p<0.05) between CAI values of a virus and the duration of the outbreak disease in given salmon farms, as well as a weak relationship between codon adaptation values of PB1 and the mortality rates of a set of ISA viruses. Conclusions Our analysis shows that ISAV is the least adapted viral Salmo salar pathogen and Orthomyxovirus family member less adapted to host codon usage, avoiding the general behavior of host genes. This is probably due to its recent emergence among farmed Salmon populations. PMID:23829271
Genomic adaptation of the ISA virus to Salmo salar codon usage.
Tello, Mario; Vergara, Francisco; Spencer, Eugenio
2013-07-05
The ISA virus (ISAV) is an Orthomyxovirus whose genome encodes for at least 10 proteins. Low protein identity and lack of genetic tools have hampered the study of the molecular mechanism behind its virulence. It has been shown that viral codon usage controls several processes such as translational efficiency, folding, tuning of protein expression, antigenicity and virulence. Despite this, the possible role that adaptation to host codon usage plays in virulence and viral evolution has not been studied in ISAV. Intergenomic adaptation between viral and host genomes was calculated using the codon adaptation index score with EMBOSS software and the Kazusa database. Classification of host genes according to GeneOnthology was performed using Blast2go. A non parametric test was applied to determine the presence of significant correlations among CAI, mortality and time. Using the codon adaptation index (CAI) score, we found that the encoding genes for nucleoprotein, matrix protein M1 and antagonist of Interferon I signaling (NS1) are the ISAV genes that are more adapted to host codon usage, in agreement with their requirement for production of viral particles and inactivation of antiviral responses. Comparison to host genes showed that ISAV shares CAI values with less than 0.45% of Salmo salar genes. GeneOntology classification of host genes showed that ISAV genes share CAI values with genes from less than 3% of the host biological process, far from the 14% shown by Influenza A viruses and closer to the 5% shown by Influenza B and C. As well, we identified a positive correlation (p<0.05) between CAI values of a virus and the duration of the outbreak disease in given salmon farms, as well as a weak relationship between codon adaptation values of PB1 and the mortality rates of a set of ISA viruses. Our analysis shows that ISAV is the least adapted viral Salmo salar pathogen and Orthomyxovirus family member less adapted to host codon usage, avoiding the general behavior of host genes. This is probably due to its recent emergence among farmed Salmon populations.
Detection of signals in mRNAs that influence translation.
Brown, Chris M; Jacobs, Grant; Stockwell, Peter; Schreiber, Mark
2003-01-01
Genome sequencing efforts mean that we now have extensive data from a wide range of organisms to study. Understanding the differing natures of the biology of these organisms is an important aim of genome analysis. We are interested in signals that affect translation of mRNAs. Some signals in the mRNA influence how efficiently it is translated into protein. Previous studies have indicated that many important signals are located around the initiation and termination codons. We have developed tools described here to extract the relevant sequence regions from GenBank. To create databases organised by species, or higher taxonomic groupings (eg planta), a program was developed to dynamically view and edit the taxonomy database. Data from relevant species were then extracted using our Genbank feature table parser. We analysed all available sequences, particularly those from complete genomes. Patterns were then identified using information theory. The software is available from http://transterm.otago.ac.nz. Patterns around the initiation codons for most of the organisms fall into two groups, containing the previously known Shine-Dalgarno and Kozaks efficiency signals. However, we have identified several organisms that appear to utilise novel systems. Our analysis indicates that some organisms with extremely high GC% genomes do not have a strong dependence on base pairing ribosome binding sites, as the complementary sequence is absent from many genes.
Evidence of translation efficiency adaptation of the coding regions of the bacteriophage lambda.
Goz, Eli; Mioduser, Oriah; Diament, Alon; Tuller, Tamir
2017-08-01
Deciphering the way gene expression regulatory aspects are encoded in viral genomes is a challenging mission with ramifications related to all biomedical disciplines. Here, we aimed to understand how the evolution shapes the bacteriophage lambda genes by performing a high resolution analysis of ribosomal profiling data and gene expression related synonymous/silent information encoded in bacteriophage coding regions.We demonstrated evidence of selection for distinct compositions of synonymous codons in early and late viral genes related to the adaptation of translation efficiency to different bacteriophage developmental stages. Specifically, we showed that evolution of viral coding regions is driven, among others, by selection for codons with higher decoding rates; during the initial/progressive stages of infection the decoding rates in early/late genes were found to be superior to those in late/early genes, respectively. Moreover, we argued that selection for translation efficiency could be partially explained by adaptation to Escherichia coli tRNA pool and the fact that it can change during the bacteriophage life cycle.An analysis of additional aspects related to the expression of viral genes, such as mRNA folding and more complex/longer regulatory signals in the coding regions, is also reported. The reported conclusions are likely to be relevant also to additional viruses. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Yang, Wang-Yong; He, Fang; Strack, Rita L; Oh, Seok Yoon; Frazer, Michelle; Jaffrey, Samie R; Todd, Peter K; Disney, Matthew D
2016-09-16
RNA transcripts containing expanded nucleotide repeats cause many incurable diseases via various mechanisms. One such disorder, fragile X-associated tremor ataxia syndrome (FXTAS), is caused by a noncoding r(CGG) repeat expansion (r(CGG)(exp)) that (i) sequesters proteins involved in RNA metabolism in nuclear foci, causing dysregulation of alternative pre-mRNA splicing, and (ii) undergoes repeat associated non-ATG translation (RANT), which produces toxic homopolymeric proteins without using a start codon. Here, we describe the design of two small molecules that inhibit both modes of toxicity and the implementation of various tools to study perturbation of these cellular events. Competitive Chemical Cross Linking and Isolation by Pull Down (C-Chem-CLIP) established that compounds bind r(CGG)(exp) and defined small molecule occupancy of r(CGG)(exp) in cells, the first approach to do so. Using an RNA GFP mimic, r(CGG)(exp)-Spinach2, we observe that our optimal designed compound binds r(CGG)(exp) and affects RNA localization by disrupting preformed RNA foci. These events correlate with an improvement of pre-mRNA splicing defects caused by RNA gain of function. In addition, the compounds reduced levels of toxic homopolymeric proteins formed via RANT. Polysome profiling studies showed that small molecules decreased loading of polysomes onto r(CGG)(exp), explaining decreased translation.
Zhang, Fan; Saini, Adesh K.; Shin, Byung-Sik; Nanda, Jagpreet; Hinnebusch, Alan G.
2015-01-01
The translation preinitiation complex (PIC) is thought to assume an open conformation when scanning the mRNA leader, with AUG recognition evoking a closed conformation and more stable P site interaction of Met-tRNAi; however, physical evidence is lacking that AUG recognition constrains interaction of mRNA with the 40S binding cleft. We compared patterns of hydroxyl radical cleavage of rRNA by Fe(II)-BABE tethered to unique sites in eIF1A in yeast PICs reconstituted with mRNA harboring an AUG or near-cognate (AUC) start codon. rRNA residues in the P site display reduced cleavage in AUG versus AUC PICs; and enhanced cleavage in the AUC complexes was diminished by mutations of scanning enhancer elements of eIF1A that increase near-cognate recognition in vivo. This suggests that accessibility of these rRNA residues is reduced by accommodation of Met-tRNAi in the P site (PIN state) and by their interactions with the anticodon stem of Met-tRNAi. Our cleavage data also provide evidence that AUG recognition evokes dissociation of eIF1 from its 40S binding site, ejection of the eIF1A-CTT from the P-site and rearrangement to a closed conformation of the entry channel with reduced mobility of mRNA. PMID:25670678
Olivares, Eduardo; Landry, Dori M.; Cáceres, C. Joaquín; Pino, Karla; Rossi, Federico; Navarrete, Camilo; Huidobro-Toro, Juan Pablo; Thompson, Sunnie R.
2014-01-01
ABSTRACT The human T-cell leukemia virus type 1 (HTLV-1) is a complex human retrovirus that causes adult T cell leukemia and of HTLV-associated myelopathy/tropical spastic paraparesis. The mRNA of some complex retroviruses, including the human and simian immunodeficiency viruses (HIV and SIV), can initiate translation using a canonical cap-dependent mechanism or through an internal ribosome entry site (IRES). In this study, we present strong evidence showing that like HIV-1 and SIV, the 5′-untranslated region (5′UTR) of the HTLV-1 full-length mRNA harbors an IRES. Cap-independent translational activity was evaluated and demonstrated using dual luciferase bicistronic mRNAs in rabbit reticulocyte lysate, in mammalian cell culture, and in Xenopus laevis oocytes. Characterization of the HTLV-1 IRES shows that its activity is dependent on the ribosomal protein S25 (RPS25) and that its function is highly sensitive to the drug edeine. Together, these findings suggest that the 5′UTR of the HTLV-1 full-length mRNA enables internal recruitment of the eukaryotic translation initiation complex. However, the recognition of the initiation codon requires ribosome scanning. These results suggest that, after internal recruitment by the HTLV-1 IRES, a scanning step takes place for the 40S ribosomal subunit to be positioned at the translation initiation codon. IMPORTANCE The mechanism by which retroviral mRNAs recruit the 40S ribosomal subunit internally is not understood. This study provides new insights into the mechanism of translation initiation used by the human T-cell lymphotropic virus type 1 (HTLV-1). The results show that the HTLV-1 mRNA can initiate translation via a noncanonical mechanism mediated by an internal ribosome entry site (IRES). This study also provides evidence showing the involvement of cellular proteins in HTLV-1 IRES-mediated translation initiation. Together, the data presented in this report significantly contribute to the understanding of HTLV-1 gene expression. PMID:24623421
Does the Genetic Code Have A Eukaryotic Origin?
Zhang, Zhang; Yu, Jun
2013-01-01
In the RNA world, RNA is assumed to be the dominant macromolecule performing most, if not all, core “house-keeping” functions. The ribo-cell hypothesis suggests that the genetic code and the translation machinery may both be born of the RNA world, and the introduction of DNA to ribo-cells may take over the informational role of RNA gradually, such as a mature set of genetic code and mechanism enabling stable inheritance of sequence and its variation. In this context, we modeled the genetic code in two content variables—GC and purine contents—of protein-coding sequences and measured the purine content sensitivities for each codon when the sensitivity (% usage) is plotted as a function of GC content variation. The analysis leads to a new pattern—the symmetric pattern—where the sensitivity of purine content variation shows diagonally symmetry in the codon table more significantly in the two GC content invariable quarters in addition to the two existing patterns where the table is divided into either four GC content sensitivity quarters or two amino acid diversity halves. The most insensitive codon sets are GUN (valine) and CAN (CAR for asparagine and CAY for aspartic acid) and the most biased amino acid is valine (always over-estimated) followed by alanine (always under-estimated). The unique position of valine and its codons suggests its key roles in the final recruitment of the complete codon set of the canonical table. The distinct choice may only be attributable to sequence signatures or signals of splice sites for spliceosomal introns shared by all extant eukaryotes. PMID:23402863
Ko, Jae-hyeong; Llopis, Paula Montero; Heinritz, Jennifer; Jacobs-Wagner, Christine; Söll, Dieter
2013-01-01
While translational read-through of stop codons by suppressor tRNAs is common in many bacteria, archaea and eukaryotes, this phenomenon has not yet been observed in the α-proteobacterium Caulobacter crescentus. Based on a previous report that C. crescentus and Escherichia coli tRNAHis have distinctive identity elements, we constructed E. coli tRNAHis CUA, a UAG suppressor tRNA for C. crescentus. By examining the expression of three UAG codon- containing reporter genes (encoding a β-lactamase, the fluorescent mCherry protein, or the C. crescentus xylonate dehydratase), we demonstrated that the E. coli histidyl-tRNA synthetase/tRNAHis CUA pair enables in vivo UAG suppression in C. crescentus. E. coli histidyl-tRNA synthetase (HisRS) or tRNAHis CUA alone did not achieve suppression; this indicates that the E. coli HisRS/tRNAHis CUA pair is orthogonal in C. crescentus. These results illustrate that UAG suppression can be achieved in C. crescentus with an orthogonal aminoacyl-tRNA synthetase/suppressor tRNA pair. PMID:24386240
Self-organizing approach for meta-genomes.
Zhu, Jianfeng; Zheng, Wei-Mou
2014-12-01
We extend the self-organizing approach for annotation of a bacterial genome to analyze the raw sequencing data of the human gut metagenome without sequence assembling. The original approach divides the genomic sequence of a bacterium into non-overlapping segments of equal length and assigns to each segment one of seven 'phases', among which one is for the noncoding regions, three for the direct coding regions to indicate the three possible codon positions of the segment starting site, and three for the reverse coding regions. The noncoding phase and the six coding phases are described by two frequency tables of the 64 triplet types or 'codon usages'. A set of codon usages can be used to update the phase assignment and vice versa. An iteration after an initialization leads to a convergent phase assignment to give an annotation of the genome. In the extension of the approach to a metagenome, we consider a mixture model of a number of categories described by different codon usages. The Illumina Genome Analyzer sequencing data of the total DNA from faecal samples are then examined to understand the diversity of the human gut microbiome. Copyright © 2014 Elsevier Ltd. All rights reserved.
Expression of different functional isoforms in haematopoiesis.
Grech, Godfrey; Pollacco, Joel; Portelli, Mark; Sacco, Keith; Baldacchino, Shawn; Grixti, Justine; Saliba, Christian
2014-01-01
Haematopoiesis is a complex process regulated at various levels facilitating rapid responses to external factors including stress, modulation of lineage commitment and terminal differentiation of progenitors. Although the transcription program determines the RNA pool of a cell, various mRNA strands can be obtained from the same template, giving rise to multiple protein isoforms. The majority of variants and isoforms co-occur in normal haematopoietic cells or are differentially expressed at various maturity stages of progenitor maturation and cellular differentiation within the same lineage or across lineages. Genetic aberrations or specific cellular states result in the predominant expression of abnormal isoforms leading to deregulation and disease. The presence of upstream open reading frames (uORF) in 5' untranslated regions (UTRs) of a transcript, couples the utilization of start codons with the cellular status and availability of translation initiation factors (eIFs). In addition, tissue-specific and cell lineage-specific alternative promoter use, regulates several transcription factors producing transcript variants with variable 5' exons. In this review, we propose to give a detailed account of the differential isoform formation, causing haematological malignancies.
Emerging genetic therapies to treat Duchenne muscular dystrophy
Nelson, Stanley F.; Crosbie, Rachelle H.; Miceli, M. Carrie; Spencer, Melissa J.
2010-01-01
Purpose of review Duchenne muscular dystrophy is a progressive muscle degenerative disease caused by dystrophin mutations. The purpose of this review is to highlight two emerging therapies designed to repair the primary genetic defect, called `exon skipping' and `nonsense codon suppression'. Recent findings A drug, PTC124, was identified that suppresses nonsense codon translation termination. PTC124 can lead to restoration of some dystrophin expression in human Duchenne muscular dystrophy muscles with mutations resulting in premature stops. Two drugs developed for exon skipping, PRO051 and AVI-4658, result in the exclusion of exon 51 from mature mRNA. They can restore the translational reading frame to dystrophin transcripts from patients with a particular subset of dystrophin gene deletions and lead to some restoration of dystrophin expression in affected boys' muscle in vivo. Both approaches have concluded phase I trials with no serious adverse events. Summary These novel therapies that act to correct the primary genetic defect of dystrophin deficiency are among the first generation of therapies tailored to correct specific mutations in humans. Thus, they represent paradigm forming approaches to personalized medicine with the potential to lead to life changing treatment for those affected by Duchenne muscular dystrophy. PMID:19745732
Van Haute, Lindsey; Powell, Christopher A; Minczuk, Michal
2017-03-02
Human mitochondria contain their own genome, which uses an unconventional genetic code. In addition to the standard AUG methionine codon, the single mitochondrial tRNA Methionine (mt-tRNAMet) also recognises AUA during translation initiation and elongation. Post-transcriptional modifications of tRNAs are important for structure, stability, correct folding and aminoacylation as well as decoding. The unique 5-formylcytosine (f5C) modification of position 34 in mt-tRNAMet has been long postulated to be crucial for decoding of unconventional methionine codons and efficient mitochondrial translation. However, the enzymes responsible for the formation of mitochondrial f5C have been identified only recently. The first step of the f5C pathway consists of methylation of cytosine by NSUN3. This is followed by further oxidation by ABH1. Here, we review the role of f5C, the latest breakthroughs in our understanding of the biogenesis of this unique mitochondrial tRNA modification and its involvement in human disease.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wang, S.P.; Robert, M.F.; Mitchell, G.A.
1996-04-01
3-hydroxy-3-methylglutaryl CoA lyase (HL, EC 4.1.3.4) catalyzes the cleavage of 3-hydroxy-3-methylglutaryl CoA to acetoacetic acid and acetyl CoA, the final reaction of both ketogenesis and leucine catabolism. Autosomal-recessive HL deficiency in humans results in episodes of hypoketotic hypoglycemia and coma. Using a mouse HL cDNA as a probe, we isolated a clone containing the full-length mouse HL gene that spans about 15 kb of mouse chromosome 4 and contains nine exons. The promoter region of the mouse HL gene contains elements characteristic of a housekeeping gene: a CpG island containing multiple Sp1 binding sites surrounds exon 1, and neither amore » TATA nor a CAAT box are present. We identified multiple transcription start sites in the mouse HL gene, 35 to 9 bases upstream of the translation start codon. We also isolated two human HL genomic clones that include HL exons 2 to 9 within 18 kb. The mouse and human HL genes (HGMW-approved symbol HMGCL) are highly homologous, with identical locations of intron-exon junctions. By genomic Southern blot analysis and exonic PCR, was found 2 of 33 HL-deficient probands to be homozygous for large deletions in the HL gene. 26 refs., 4 figs., 2 tabs.« less
Liu, Qiu-Ning; Chai, Xin-Yue; Bian, Dan-Dan; Zhou, Chun-Lin; Tang, Bo-Ping
2016-01-01
The mitochondrial (mt) genome can provide important information for the understanding of phylogenetic relationships. The complete mt genome of Plodia interpunctella (Lepidoptera: Pyralidae) has been sequenced. The circular genome is 15 287 bp in size, encoding 13 protein-coding genes (PCGs), 2 rRNA genes, 22 tRNA genes, and a control region. The AT skew of this mt genome is slightly negative, and the nucleotide composition is biased toward A+T nucleotides (80.15%). All PCGs start with the typical ATN (ATA, ATC, ATG, and ATT) codons, except for the cox1 gene which may start with the CGA codon. Four of the 13 PCGs harbor the incomplete termination codon T or TA. All the tRNA genes are folded into the typical clover-leaf structure of mitochondrial tRNA, except for trnS1 (AGN) in which the DHU arm fails to form a stable stem-loop structure. The overlapping sequences are 35 bp in total and are found in seven different locations. A total of 240 bp of intergenic spacers are scattered in 16 regions. The control region of the mt genome is 327 bp in length and consisted of several features common to the sequenced lepidopteran insects. Phylogenetic analysis based on 13 PCGs using the Maximum Likelihood method shows that the placement of P. interpunctella was within the Pyralidae.
Ou, Jing; Liu, Jin-Bo; Yao, Fu-Jiao; Wang, Xin-Guo; Wei, Zhao-Ming
2016-01-01
Flour beetles of the genus Tribolium are all pests of stored products and cause severe economic losses every year. The American black flour beetle Tribolium audax is one of the important pest species of flour beetle, and it is also an important quarantine insect. Here we sequenced and characterized the complete mitochondrial genome of T. audax, which was intercepted by Huangpu Custom in maize from America. The complete circular mitochondrial genome (mitogenome) of T. audax was 15,924 bp in length, containing 37 typical coding genes and one non-coding AT-rich region. The mitogenome of T. audax exhibits a gene arrangement and content identical to the most common type in insects. All protein coding genes (PCGs) are start with a typical ATN initiation codon, except for the cox1, which use AAC as its start codon instead of ATN. Eleven genes use standard complete termination codon (nine TAA, two TAG), whereas the nad4 and nad5 genes end with single T. Except for trnS1 (AGN), all tRNA genes display typical secondary cloverleaf structures as those of other insects. The sizes of the large and small ribosomal RNA genes are 1288 and 780 bp, respectively. The AT content of the AT-rich region is 81.36%. The 5 bp conserved motif TACTA was found in the intergenic region between trnS2 (UCN) and nad1.
Novel mutations of the AGXT gene causing primary hyperoxaluria type 1.
Yuen, Yuet-Ping; Lai, Chi-Kong; Tong, Gensy Mei-Wah; Wong, Ping-Nam; Wong, Francis Kim-Ming; Mak, Siu-Ka; Lo, Kin-Yee; Wong, Andrew Kui-Man; Tong, Sui-Fan; Chan, Yan-Wo; Lam, Ching-Wan
2004-01-01
Primary hyperoxaluria type 1 (PH1), an inherited cause of nephrolithiasis, is due to a functional defect of the liver-specific peroxisomal enzyme alanine:glyoxylate aminotransferase (AGT). A definitive PH1 diagnosis can be established by analyzing AGT activity in liver tissue or mutation analysis of the AGXT gene. The molecular basis of PH1 in three Chinese patients, two with adult-onset and one with childhood-onset recurrent nephrolithiasis, was established by analyzing the entire AGXT gene. Three novel mutations (c2T>C, c817insAG and c844C>T) and two previously reported mutations (c33insC and 679-IVS6+2delAAgt) were identified. c2T>C converts the initiation codon from ATG to ACG, which predicts significant reduction, if not complete abolition, of protein translation. c817insAG leads to a frameshift and changes the amino acid sequence after codon 274. c844C>T changes glutamine at codon 282 to a termination codon, resulting in protein truncation. This is the first report describing AGXT gene mutations in Chinese patients with PH1. AGXT genotypes cannot fully explain the clinical heterogeneity of PH1, and other factors involved in disease pathogenesis remain to be identified. Our experience emphasizes the importance of excluding PH1 in patients with recurrent nephrolithiasis to avoid delay or inappropriate management.
Genes for cytochrome c oxidase subunit I, URF2, and three tRNAs in Drosophila mitochondrial DNA.
Clary, D O; Wolstenholme, D R
1983-01-01
Genes for URF2, tRNAtrp, tRNAcys, tRNAtyr and cytochrome c oxidase subunit I (COI) have been identified within a sequenced segment of the Drosophila yakuba mtDNA molecule. The five genes are arranged in the order given. Transcription of the tRNAcys and tRNAtyr genes is in the same direction as replication, while transcription of the URF2, tRNAtrp and COI genes is in the opposite direction. A similar arrangement of these genes is found in mammalian mtDNA except that in the latter, the tRNAala and tRNAasn genes are located between the tRNAtrp and tRNAcys genes. Also, a sequence found between the tRNAasn and tRNAcys genes in mammalian mtDNA, which is associated with the initiation of second strand DNA synthesis, is not found in this region of the D. yakuba mtDNA molecule. As the D. yakuba COI gene lacks a standard translation initiation codon, we consider the possibility that the quadruplet ATAA may serve this function. As in other D. yakuba mitochondrial polypeptide genes, AGA codons in the URF2 and COI genes do not correspond in position to arginine-specifying codons in the equivalent genes of mouse and yeast mtDNAs, but do most frequently correspond to serine-specifying codons. PMID:6314262
Suetomi, Yuta; Matsuda, Fuko; Uenoyama, Yoshihisa; Maeda, Kei-ichiro; Tsukamura, Hiroko; Ohkura, Satoshi
2013-10-01
Neurokinin B (NKB), encoded by TAC3, is thought to be an important accelerator of pulsatile gonadotropin-releasing hormone release. This study aimed to clarify the transcriptional regulatory mechanism of goat TAC3. First, we determined the full-length mRNA sequence of goat TAC3 from the hypothalamus to be 820 b, including a 381 b coding region, with the putative transcription start site located 143-b upstream of the start codon. The deduced amino acid sequence of NKB, which is produced from preproNKB, was completely conserved among goat, cattle, and human. Next, we cloned 5'-upstream region of goat TAC3 up to 3400 b from the translation initiation site, and this region was highly homologous with cattle TAC3 (89%). We used this goat TAC3 5'-upstream region to perform luciferase assays. We created a luciferase reporter vector containing DNA constructs from -2706, -1837, -834, -335, or -197 to +166 bp (the putative transcription start site was designated as +1) of goat TAC3 and these were transiently transfected into mouse hypothalamus-derived N7 cells and human neuroblastoma-derived SK-N-AS cells. The luciferase activity gradually increased with the deletion of the 5'-upstream region, suggesting that the transcriptional suppressive region is located between -2706 and -336 bp and that the core promoter exists downstream of -197 bp. Estradiol treatment did not lead to significant suppression of luciferase activity of any constructs, suggesting the existence of other factor(s) that regulate goat TAC3 transcription.
Branny, P; de la Torre, F; Garel, J R
1998-04-01
The structural genes gap, pgk and tpi encoding three glycolytic enzymes, glyceraldehyde-3-phosphate dehydrogenase (GAPDH), 3-phosphoglycerate kinase (PGK) and triosephosphate isomerase (TPI), respectively, have been cloned and sequenced from Lactobacillus delbrueckii subsp. bulgaricus (L. bulgaricus). The genes were isolated after screening genomic sublibraries with specific gap and pgk probes obtained by PCR amplification of chromosomal DNA with degenerate primers corresponding to amino acid sequences highly conserved in GAPDHs and PGKs. Nucleotide sequencing revealed that the three genes were organized in the order gap-pgk-tpi. The translation start codons of the three genes were identified by alignment of the N-terminal sequences. These genes predicted polypeptide chains of 338, 403 and 252 amino acids for GAPDH, PGK and TPI, respectively, and they were separated by 96 bp between gap and pgk, and by only 18 bp between pgk and tpi. The codon usage in gap, pgk, tpi and three other glycolytic genes from L. bulgaricus differed, noticeably from that in other chromosomal genes. The site of transcriptional initiation was located by primer extension, and a probable promoter was identified for the gap-pgk-tpi operon. Northern hybridization of total RNA with specific probes showed two transcripts, an mRNA of 1.4 kb corresponding to the gap gene, and a less abundant mRNA of 3.4 kb corresponding to the gap-pgk-tpi cluster. The absence of a visible terminator in the 3'-end of the shorter transcript and the location of this 3'-end inside the pgk gene indicated that this shorter transcript was produced by degradation of the longer one, rather than by an early termination of transcription after the gap gene.
Durantel, D; Croizier, L; Ayres, M D; Croizier, G; Possee, R D; López-Ferber, M
1998-03-01
Autographa californica nucleopolyhedrovirus (AcMNPV) ORF 86, located within the HindIII C fragment, potentially encodes a protein which shares sequence similarity with two T4 bacteriophage gene products, RNA ligase and polynucleotide kinase. This AcMNPV gene has been designated pnk/pnl but has yet to be assigned a function in virus replication. It has been classified as an immediate early virus gene, since the promoter was active in uninfected insect cells and mRNA transcripts were detectable from 4 to 48 h post-infection and in the presence of cycloheximide or aphidicolin in virus-infected cells. The extremities of the transcript have been mapped by primer extension and 3' RACE-PCR to positions -18 from the translational start codon and +15 downstream of the stop codon. The function of pnk/pnl was investigated by producing a recombinant virus (Acdel86lacZ) with the coding region replaced with that of lacZ. This virus replicated normally in Spodoptera frugiperda (Sf 21) cells, indicating that pnk/pnl is not essential for propagation in these cells. Virus protein production in Acdel86lacZ-infected Sf 21 cells also appeared to be unaffected, with normal synthesis of the IE-1, GP64, VP39 and polyhedrin proteins. Shut-down of host protein synthesis was not abolished in recombinant infection. When other baculovirus genomes were examined for the presence of pnk/pnl by restriction enzyme digestion and PCR, a deletion was found in AcMNPV 1.2, Galleria mellonella NPV (GmMNPV) and Bombyx mori NPV (BmNPV), suggesting that in many isolates this gene has either never been acquired or has been lost during genome evolution. This is one of the first baculovirus immediate early genes that appears to be nonessential for virus survival.
A mutated dph3 gene causes sensitivity of Schizosaccharomyces pombe cells to cytotoxic agents.
Villahermosa, Desirée; Knapp, Karen; Fleck, Oliver
2017-12-01
Dph3 is involved in diphthamide modification of the eukaryotic translation elongation factor eEF2 and in Elongator-mediated modifications of tRNAs, where a 5-methoxycarbonyl-methyl moiety is added to wobble uridines. Lack of such modifications affects protein synthesis due to inaccurate translation of mRNAs at ribosomes. We have discovered that integration of markers at the msh3 locus of Schizosaccharomyces pombe impaired the function of the nearby located dph3 gene. Such integrations rendered cells sensitive to the cytotoxic drugs hydroxyurea and methyl methanesulfonate. We constructed dph3 and msh3 strains with mutated ATG start codons (ATGmut), which allowed investigating drug sensitivity without potential interference by marker insertions. The dph3-ATGmut and a dph3::loxP-ura4-loxM gene disruption strain, but not msh3-ATGmut, turned out to be sensitive to hydroxyurea and methyl methanesulfonate, likewise the strains with cassettes integrated at the msh3 locus. The fungicide sordarin, which inhibits diphthamide modified eEF2 of Saccharomyces cerevisiae, barely affected survival of wild type and msh3Δ S. pombe cells, while the dph3Δ mutant was sensitive. The msh3-ATG mutation, but not dph3Δ or the dph3-ATG mutation caused a defect in mating-type switching, indicating that the ura4 marker at the dph3 locus did not interfere with Msh3 function. We conclude that Dph3 is required for cellular resistance to the fungicide sordarin and to the cytotoxic drugs hydroxyurea and methyl methanesulfonate. This is likely mediated by efficient translation of proteins in response to DNA damage and replication stress.
López-Lastra, M; Gabus, C; Darlix, J L
1997-11-01
The murine leukemia virus (MLV)-related type C viruses constitute a major class of retroviruses that includes numerous endogenous and exogenous mammalian viruses and the related avian spleen necrosis virus (SNV). The MLV-related viruses possess a long and multifunctional 5' untranslated leader involved in key steps of the viral life cycle--splicing, translation, RNA dimerization, encapsidation, and reverse transcription. Recent studies have shown that the 5' leader of Friend murine leukemia virus and Moloney murine leukemia virus can direct cap independent translation of gag precursor proteins (Berlioz et al., 1995; Vagner et al., 1995b). These data, together with structural homology studies (Koning et al., 1992), prompted us to undertake a search for new internal ribosome entry segment (IRES) of retroviral origin. Here we describe an IRES element within the 5' leader of avian reticuloendotheliosis virus type A (REV-A) genomic RNA. Data show that the REV-A 5' IRES element maps downstream of the packaging/dimerization (E/DLS) sequence (Watanabe and Temin, 1982; Darlix et al., 1992) and the minimal IRES sequence appears to be within a 129 nt fragment (nucleotides 452-580) of the 5' leader, immediately upstream of the gag AUG codon. The REV-A IRES has been successfully utilized in the construction of novel high titer MLV-based retroviral vectors, containing one or more IRES elements of retroviral origin. These retroviral constructs, which represent a starting point for the design of novel vectors suitable for gene therapy, are also of interest as a model system of internal translation initiation and its possible regulation during development, cancer, or virus infection.
Complete mitochondrial genome of yellow meal worm(Tenebrio molitor)
LIU, Li-Na; WANG, Cheng-Ye
2014-01-01
The yellow meal worm(Tenebrio molitor L.) is an important resource insect typically used as animal feed additive. It is also widely used for biological research. The first complete mitochondrial genome of T. molitor was determined for the first time by long PCR and conserved primer walking approaches. The results showed that the entire mitogenome of T. molitor was 15 785 bp long, with 72.35% A+T content [deposited in GenBank with accession number KF418153]. The gene order and orientation were the same as the most common type suggested as ancestral for insects. Two protein-coding genes used atypical start codons(CTA in ND2 and AAT in COX1), and the remaining 11 protein-coding genes started with a typical insect initiation codon ATN. All tRNAs showed standard clover-leaf structure, except for tRNASer(AGN), which lacked a dihydrouridine(DHU) arm. The newly added T. molitor mitogenome could provide information for future studies on yellow meal worm. PMID:25465087
Complete mitochondrial genome of yellow meal worm (Tenebrio molitor).
Liu, Li-Na; Wang, Cheng-Ye
2014-11-18
The yellow meal worm (Tenebrio molitor L.) is an important resource insect typically used as animal feed additive. It is also widely used for biological research. The first complete mitochondrial genome of T. molitor was determined for the first time by long PCR and conserved primer walking approaches. The results showed that the entire mitogenome of T. molitor was 15 785 bp long, with 72.35% A+T content [deposited in GenBank with accession number KF418153]. The gene order and orientation were the same as the most common type suggested as ancestral for insects. Two protein-coding genes used atypical start codons (CTA in ND2 and AAT in COX1), and the remaining 11 protein-coding genes started with a typical insect initiation codon ATN. All tRNAs showed standard clover-leaf structure, except for tRNA(Ser) (AGN), which lacked a dihydrouridine (DHU) arm. The newly added T. molitor mitogenome could provide information for future studies on yellow meal worm.
Fletcher, Simon P; Ali, Iraj K; Kaminski, Ann; Digard, Paul; Jackson, Richard J
2002-01-01
Classical swine fever virus (CSFV) is a member of the pestivirus family, which shares many features in common with hepatitis C virus (HCV). It is shown here that CSFV has an exceptionally efficient cis-acting internal ribosome entry segment (IRES), which, like that of HCV, is strongly influenced by the sequences immediately downstream of the initiation codon, and is optimal with viral coding sequences in this position. Constructs that retained 17 or more codons of viral coding sequence exhibited full IRES activity, but with only 12 codons, activity was approximately 66% of maximum in vitro (though close to maximum in transfected BHK cells), whereas with just 3 codons or fewer, the activity was only approximately 15% of maximum. The minimal coding region elements required for high activity were exchanged between HCV and CSFV. Although maximum activity was observed in each case with the homologous combination of coding region and 5' UTR, the heterologous combinations were sufficiently active to rule out a highly specific functional interplay between the 5' UTR and coding sequences. On the other hand, inversion of the coding sequences resulted in low IRES activity, particularly with the HCV coding sequences. RNA structure probing showed that the efficiency of internal initiation of these chimeric constructs correlated most closely with the degree of single-strandedness of the region around and immediately downstream of the initiation codon. The low activity IRESs could not be rescued by addition of supplementary eIF4A (the initiation factor with ATP-dependent RNA helicase activity). The extreme sensitivity to secondary structure around the initiation codon is likely to be due to the fact that the eIF4F complex (which has eIF4A as one of its subunits) is not required for and does not participate in initiation on these IRESs. PMID:12515388
Gene Composer: database software for protein construct design, codon engineering, and gene synthesis
Lorimer, Don; Raymond, Amy; Walchli, John; Mixon, Mark; Barrow, Adrienne; Wallace, Ellen; Grice, Rena; Burgin, Alex; Stewart, Lance
2009-01-01
Background To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. Results An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. Conclusion We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene assembly procedure with mis-match specific endonuclease error correction in combination with PIPE cloning. In a sister manuscript we present data on how Gene Composer designed genes and protein constructs can result in improved protein production for structural studies. PMID:19383142
Lorimer, Don; Raymond, Amy; Walchli, John; Mixon, Mark; Barrow, Adrienne; Wallace, Ellen; Grice, Rena; Burgin, Alex; Stewart, Lance
2009-04-21
To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene assembly procedure with mis-match specific endonuclease error correction in combination with PIPE cloning. In a sister manuscript we present data on how Gene Composer designed genes and protein constructs can result in improved protein production for structural studies.
The Purine Bias of Coding Sequences is Determined by Physicochemical Constraints on Proteins.
Ponce de Leon, Miguel; de Miranda, Antonio Basilio; Alvarez-Valin, Fernando; Carels, Nicolas
2014-01-01
For this report, we analyzed protein secondary structures in relation to the statistics of three nucleotide codon positions. The purpose of this investigation was to find which properties of the ribosome, tRNA or protein level, could explain the purine bias (Rrr) as it is observed in coding DNA. We found that the Rrr pattern is the consequence of a regularity (the codon structure) resulting from physicochemical constraints on proteins and thermodynamic constraints on ribosomal machinery. The physicochemical constraints on proteins mainly come from the hydropathy and molecular weight (MW) of secondary structures as well as the energy cost of amino acid synthesis. These constraints appear through a network of statistical correlations, such as (i) the cost of amino acid synthesis, which is in favor of a higher level of guanine in the first codon position, (ii) the constructive contribution of hydropathy alternation in proteins, (iii) the spatial organization of secondary structure in proteins according to solvent accessibility, (iv) the spatial organization of secondary structure according to amino acid hydropathy, (v) the statistical correlation of MW with protein secondary structures and their overall hydropathy, (vi) the statistical correlation of thymine in the second codon position with hydropathy and the energy cost of amino acid synthesis, and (vii) the statistical correlation of adenine in the second codon position with amino acid complexity and the MW of secondary protein structures. Amino acid physicochemical properties and functional constraints on proteins constitute a code that is translated into a purine bias within the coding DNA via tRNAs. In that sense, the Rrr pattern within coding DNA is the effect of information transfer on nucleotide composition from protein to DNA by selection according to the codon positions. Thus, coding DNA structure and ribosomal machinery co-evolved to minimize the energy cost of protein coding given the functional constraints on proteins.
Dimitrieva, Slavica; Anisimova, Maria
2014-01-01
In protein-coding genes, synonymous mutations are often thought not to affect fitness and therefore are not subject to natural selection. Yet increasingly, cases of non-neutral evolution at certain synonymous sites were reported over the last decade. To evaluate the extent and the nature of site-specific selection on synonymous codons, we computed the site-to-site synonymous rate variation (SRV) and identified gene properties that make SRV more likely in a large database of protein-coding gene families and protein domains. To our knowledge, this is the first study that explores the determinants and patterns of the SRV in real data. We show that the SRV is widespread in the evolution of protein-coding sequences, putting in doubt the validity of the synonymous rate as a standard neutral proxy. While protein domains rarely undergo adaptive evolution, the SRV appears to play important role in optimizing the domain function at the level of DNA. In contrast, protein families are more likely to evolve by positive selection, but are less likely to exhibit SRV. Stronger SRV was detected in genes with stronger codon bias and tRNA reusage, those coding for proteins with larger number of interactions or forming larger number of structures, located in intracellular components and those involved in typically conserved complex processes and functions. Genes with extreme SRV show higher expression levels in nearly all tissues. This indicates that codon bias in a gene, which often correlates with gene expression, may often be a site-specific phenomenon regulating the speed of translation along the sequence, consistent with the co-translational folding hypothesis. Strikingly, genes with SRV were strongly overrepresented for metabolic pathways and those associated with several genetic diseases, particularly cancers and diabetes.
Mora, Liliana; Heurgué-Hamard, Valérie; de Zamaroczy, Miklos; Kervestin, Stephanie; Buckingham, Richard H
2007-12-07
Bacterial release factors RF1 and RF2 are methylated on the Gln residue of a universally conserved tripeptide motif GGQ, which interacts with the peptidyl transferase center of the large ribosomal subunit, triggering hydrolysis of the ester bond in peptidyl-tRNA and releasing the newly synthesized polypeptide from the ribosome. In vitro experiments have shown that the activity of RF2 is stimulated by Gln methylation. The viability of Escherichia coli K12 strains depends on the integrity of the release factor methyltransferase PrmC, because K12 strains are partially deficient in RF2 activity due to the presence of a Thr residue at position 246 instead of Ala. Here, we study in vivo RF1 and RF2 activity at termination codons in competition with programmed frameshifting and the effect of the Ala-246 --> Thr mutation. PrmC inactivation reduces the specific termination activity of RF1 and RF2(Ala-246) by approximately 3- to 4-fold. The mutation Ala-246 --> Thr in RF2 reduces the termination activity in cells approximately 5-fold. After correction for the decrease in level of RF2 due to the autocontrol of RF2 synthesis, the mutation Ala-246 --> Thr reduced RF2 termination activity by approximately 10-fold at UGA codons and UAA codons. PrmC inactivation had no effect on cell growth in rich media but reduced growth considerably on poor carbon sources. This suggests that the expression of some genes needed for optimal growth under such conditions can become growth limiting as a result of inefficient translation termination.
Modification of tRNALys UUU by Elongator Is Essential for Efficient Translation of Stress mRNAs
Sansó, Miriam; Buhne, Karin; Carmona, Mercè; Paulo, Esther; Hermand, Damien; Rodríguez-Gabriel, Miguel; Ayté, José; Leidel, Sebastian; Hidalgo, Elena
2013-01-01
The Elongator complex, including the histone acetyl transferase Sin3/Elp3, was isolated as an RNA polymerase II-interacting complex, and cells deficient in Elongator subunits display transcriptional defects. However, it has also been shown that Elongator mediates the modification of some tRNAs, modulating translation efficiency. We show here that the fission yeast Sin3/Elp3 is important for oxidative stress survival. The stress transcriptional program, governed by the Sty1-Atf1-Pcr1 pathway, is affected in mutant cells, but not severely. On the contrary, cells lacking Sin3/Elp3 cannot modify the uridine wobble nucleoside of certain tRNAs, and other tRNA modifying activities such as Ctu1-Ctu2 are also essential for normal tolerance to H2O2. In particular, a plasmid over-expressing the tRNALys UUU complements the stress-related phenotypes of Sin3/Elp3 mutant cells. We have determined that the main H2O2-dependent genes, including those coding for the transcription factors Atf1 and Pcr1, are highly expressed mRNAs containing a biased number of lysine-coding codons AAA versus AAG. Thus, their mRNAs are poorly translated after stress in cells lacking Sin3/Elp3 or Ctu2, whereas a mutated atf1 transcript with AAA-to-AAG lysine codons is efficiently translated in all strain backgrounds. Our study demonstrates that the lack of a functional Elongator complex results in stress phenotypes due to its contribution to tRNA modification and subsequent translation inefficiency of certain stress-induced, highly expressed mRNAs. These results suggest that the transcriptional defects of these strain backgrounds may be a secondary consequence of the deficient expression of a transcription factor, Atf1-Pcr1, and other components of the transcriptional machinery. PMID:23874237
Modification of tRNA(Lys) UUU by elongator is essential for efficient translation of stress mRNAs.
Fernández-Vázquez, Jorge; Vargas-Pérez, Itzel; Sansó, Miriam; Buhne, Karin; Carmona, Mercè; Paulo, Esther; Hermand, Damien; Rodríguez-Gabriel, Miguel; Ayté, José; Leidel, Sebastian; Hidalgo, Elena
2013-01-01
The Elongator complex, including the histone acetyl transferase Sin3/Elp3, was isolated as an RNA polymerase II-interacting complex, and cells deficient in Elongator subunits display transcriptional defects. However, it has also been shown that Elongator mediates the modification of some tRNAs, modulating translation efficiency. We show here that the fission yeast Sin3/Elp3 is important for oxidative stress survival. The stress transcriptional program, governed by the Sty1-Atf1-Pcr1 pathway, is affected in mutant cells, but not severely. On the contrary, cells lacking Sin3/Elp3 cannot modify the uridine wobble nucleoside of certain tRNAs, and other tRNA modifying activities such as Ctu1-Ctu2 are also essential for normal tolerance to H2O2. In particular, a plasmid over-expressing the tRNA(Lys) UUU complements the stress-related phenotypes of Sin3/Elp3 mutant cells. We have determined that the main H2O2-dependent genes, including those coding for the transcription factors Atf1 and Pcr1, are highly expressed mRNAs containing a biased number of lysine-coding codons AAA versus AAG. Thus, their mRNAs are poorly translated after stress in cells lacking Sin3/Elp3 or Ctu2, whereas a mutated atf1 transcript with AAA-to-AAG lysine codons is efficiently translated in all strain backgrounds. Our study demonstrates that the lack of a functional Elongator complex results in stress phenotypes due to its contribution to tRNA modification and subsequent translation inefficiency of certain stress-induced, highly expressed mRNAs. These results suggest that the transcriptional defects of these strain backgrounds may be a secondary consequence of the deficient expression of a transcription factor, Atf1-Pcr1, and other components of the transcriptional machinery.
Kobayashi, Kan; Ishitani, Ryuichiro; Nureki, Osamu
2013-01-01
In the translation process, translating ribosomes usually move on an mRNA until they reach the stop codon. However, when ribosomes translate an aberrant mRNA, they stall. Then, ribosomes are rescued from the aberrant mRNA, and the aberrant mRNA is subsequently degraded. In eukaryotes, Pelota (Dom34 in yeast) and Hbs1 are responsible for solving general problems of ribosomal stall in translation. In archaea, aPelota and aEF1α, homologous to Pelota and Hbs1, respectively, are considered to be involved in that process. In recent years, great progress has been made in determining structures of Dom34/aPelota and Hbs1/aEF1α. In this review, we focus on the functional roles of Dom34/aPelota and Hbs1/aEF1α in ribosome rescue, based on recent structural studies of them. We will also present questions to be answered by future work. PMID:27493551
Insulin Signaling Augments eIF4E-Dependent Nonsense-Mediated mRNA Decay in Mammalian Cells.
Park, Jungyun; Ahn, Seyoung; Jayabalan, Aravinth K; Ohn, Takbum; Koh, Hyun Chul; Hwang, Jungwook
2016-07-01
Nonsense-mediated mRNA decay (NMD) modulates the level of mRNA harboring a premature termination codon (PTC) in a translation-dependent manner. Inhibition of translation is known to impair NMD; however, few studies have investigated the correlation between enhanced translation and increased NMD. Here, we demonstrate that insulin signaling events increase translation, leading to an increase in NMD of eIF4E-bound transcripts. We provide evidence that (i) insulin-mediated enhancement of translation augments NMD and rapamycin abrogates this enhancement; (ii) an increase in AKT phosphorylation due to inhibition of PTEN facilitates NMD; (iii) insulin stimulation increases the binding of up-frameshift factor 1 (UPF1), most likely to eIF4E-bound PTC-containing transcripts; and (iv) insulin stimulation induces the colocalization of UPF1 and eIF4E in processing bodies. These results illustrate how extracellular signaling promotes the removal of eIF4E-bound NMD targets. Copyright © 2015 Elsevier B.V. All rights reserved.
Gu, Wei; Cui, Yizhi; Zhong, Jiayong; Jin, Jingjie; He, Qing-Yu; Wang, Tong; Zhang, Gong
2016-01-01
In the process of translation, ribosomes first assemble on mRNAs (translation initiation) and then translate along the mRNA (elongation) to synthesize proteins. Elongation pausing is deemed highly relevant to co-translational folding of nascent peptides and the functionality of protein products, which positioned the evaluation of elongation speed as one of the central questions in the field of translational control. By integrating three types of RNA-seq methods, we experimentally and computationally resolved elongation speed, with our proposed elongation velocity index (EVI), a relative measure at individual gene level and under physiological condition in human cells. We successfully distinguished slow-translating genes from the background translatome. We demonstrated that low-EVI genes encoded more stable proteins. We further identified cell-specific slow-translating codons, which might serve as a causal factor of elongation deceleration. As an example for the biological relevance, we showed that the relatively slow-translating genes tended to be associated with the maintenance of malignant phenotypes per pathway analyses. In conclusion, EVI opens a new view to understand why human cells tend to avoid simultaneously speeding up translation initiation and decelerating elongation, and the possible cancer relevance of translating low-EVI genes to gain better protein quality. PMID:26926465
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shimron-Abarbanell, D.; Harms, H.; Erdmann, J.
1996-04-09
Using single strand conformational analysis we screened the complete coding sequence of the serotonin 1F (5-HT{sub 1F}) receptor gene for the presence of DNA sequence variation in a sample of 137 unrelated individuals including 45 schizophrenic patients, 46 bipolar patients, as well as 46 healthy controls. We detected only three rare sequence variants which are characterized by single base pair substitutions, namely a silent T{r_arrow}A transversion in the third position of codon 261 (encoding isoleucine), a silent C{r_arrow}T transition in the third position of codon 176 (encoding histidine), and a C{r_arrow}T transition in position -78 upstream from the start codon.more » The lack of significant mutations in patients suffering from schizophrenia and bipolar affective disorder indicates that the 5-HT{sub 1F} receptor is not commonly involved in the etiology of these diseases. 12 refs., 1 fig., 2 tabs.« less
Sequence Analysis of Mitochondrial Genome of Toxascaris leonina from a South China Tiger.
Li, Kangxin; Yang, Fang; Abdullahi, A Y; Song, Meiran; Shi, Xianli; Wang, Minwei; Fu, Yeqi; Pan, Weida; Shan, Fang; Chen, Wu; Li, Guoqing
2016-12-01
Toxascaris leonina is a common parasitic nematode of wild mammals and has significant impacts on the protection of rare wild animals. To analyze population genetic characteristics of T. leonina from South China tiger, its mitochondrial (mt) genome was sequenced. Its complete circular mt genome was 14,277 bp in length, including 12 protein-coding genes, 22 tRNA genes, 2 rRNA genes, and 2 non-coding regions. The nucleotide composition was biased toward A and T. The most common start codon and stop codon were TTG and TAG, and 4 genes ended with an incomplete stop codon. There were 13 intergenic regions ranging 1 to 10 bp in size. Phylogenetically, T. leonina from a South China tiger was close to canine T. leonina . This study reports for the first time a complete mt genome sequence of T. leonina from the South China tiger, and provides a scientific basis for studying the genetic diversity of nematodes between different hosts.
Chu, Dominique; Barnes, David J.; von der Haar, Tobias
2011-01-01
Protein synthesis translates information from messenger RNAs into functional proteomes. Because of the finite nature of the resources required by the translational machinery, both the overall protein synthesis activity of a cell and activity on individual mRNAs are controlled by the allocation of limiting resources. Upon introduction of heterologous sequences into an organism—for example for the purposes of bioprocessing or synthetic biology—limiting resources may also become overstretched, thus negatively affecting both endogenous and heterologous gene expression. In this study, we present a mean-field model of translation in Saccharomyces cerevisiae for the investigation of two particular translational resources, namely ribosomes and aminoacylated tRNAs. We firstly use comparisons of experiments with heterologous sequences and simulations of the same conditions to calibrate our model, and then analyse the behaviour of the translational system in yeast upon introduction of different types of heterologous sequences. Our main findings are that: competition for ribosomes, rather than tRNAs, limits global translation in this organism; that tRNA aminoacylation levels exert, at most, weak control over translational activity; and that decoding speeds and codon adaptation exert strong control over local (mRNA specific) translation rates. PMID:21558172
Swenson, Dana L; Warfield, Kelly L; Warren, Travis K; Lovejoy, Candace; Hassinger, Jed N; Ruthel, Gordon; Blouch, Robert E; Moulton, Hong M; Weller, Dwight D; Iversen, Patrick L; Bavari, Sina
2009-05-01
Phosphorodiamidate morpholino oligomers (PMOs) are uncharged nucleic acid-like molecules designed to inactivate the expression of specific genes via the antisense-based steric hindrance of mRNA translation. PMOs have been successful at knocking out viral gene expression and replication in the case of acute viral infections in animal models and have been well tolerated in human clinical trials. We propose that antisense PMOs represent a promising class of therapeutic agents that may be useful for combating filoviral infections. We have previously shown that mice treated with a PMO whose sequence is complementary to a region spanning the start codon of VP24 mRNA were protected against lethal Ebola virus challenge. In the present study, we report on the abilities of two additional VP24-specific PMOs to reduce the cell-free translation of a VP24 reporter, to inhibit the in vitro replication of Ebola virus, and to protect mice against lethal challenge when the PMOs are delivered prior to infection. Additionally, structure-activity relationship evaluations were conducted to assess the enhancement of antiviral efficacy associated with PMO chemical modifications that included conjugation with peptides of various lengths and compositions, positioning of conjugated peptides to either the 5' or the 3' terminus, and the conferring of charge modifications by the addition of piperazine moieties. Conjugation with arginine-rich peptides greatly enhanced the antiviral efficacy of VP24-specific PMOs in infected cells and mice during lethal Ebola virus challenge.
Rubinstein, M; Mogil, J S; Japón, M; Chan, E C; Allen, R G; Low, M J
1996-04-30
A physiological role for beta-endorphin in endogenous pain inhibition was investigated by targeted mutagenesis of the proopiomelanocortin gene in mouse embryonic stem cells. The tyrosine codon at position 179 of the proopiomelanocortin gene was converted to a premature translational stop codon. The resulting transgenic mice display no overt developmental or behavioral alterations and have a normally functioning hypothalamic-pituitary-adrenal axis. Homozygous transgenic mice with a selective deficiency of beta-endorphin exhibit normal analgesia in response to morphine, indicating the presence of functional mu-opiate receptors. However, these mice lack the opioid (naloxone reversible) analgesia induced by mild swim stress. Mutant mice also display significantly greater nonopioid analgesia in response to cold water swim stress compared with controls and display paradoxical naloxone-induced analgesia. These changes may reflect compensatory upregulation of alternative pain inhibitory mechanisms.
BANERJI, JULIAN
2015-01-01
The present treatment of childhood T-cell leukemias involves the systemic administration of prokary-otic L-asparaginase (ASNase), which depletes plasma Asparagine (Asn) and inhibits protein synthesis. The mechanism of therapeutic action of ASNase is poorly understood, as are the etiologies of the side-effects incurred by treatment. Protein expression from genes bearing Asn homopolymeric coding regions (N-hCR) may be particularly susceptible to Asn level fluctuation. In mammals, N-hCR are rare, short and conserved. In humans, misfunctions of genes encoding N-hCR are associated with a cluster of disorders that mimic ASNase therapy side-effects which include impaired glycemic control, dislipidemia, pancreatitis, compromised vascular integrity, and neurological dysfunction. This paper proposes that dysregulation of Asn homeostasis, potentially even by ASNase produced by the microbiome, may contribute to several clinically important syndromes by altering expression of N-hCR bearing genes. By altering amino acid abundance and modulating ribosome translocation rates at codon repeats, the microbiomic environment may contribute to genome decoding and to shaping the proteome. We suggest that impaired translation at poly Asn codons elevates diabetes risk and severity. PMID:26178806
Banerji, Julian
2015-09-01
The present treatment of childhood T-cell leukemias involves the systemic administration of prokaryotic L-asparaginase (ASNase), which depletes plasma Asparagine (Asn) and inhibits protein synthesis. The mechanism of therapeutic action of ASNase is poorly understood, as are the etiologies of the side-effects incurred by treatment. Protein expression from genes bearing Asn homopolymeric coding regions (N-hCR) may be particularly susceptible to Asn level fluctuation. In mammals, N-hCR are rare, short and conserved. In humans, misfunctions of genes encoding N-hCR are associated with a cluster of disorders that mimic ASNase therapy side-effects which include impaired glycemic control, dislipidemia, pancreatitis, compromised vascular integrity, and neurological dysfunction. This paper proposes that dysregulation of Asn homeostasis, potentially even by ASNase produced by the microbiome, may contribute to several clinically important syndromes by altering expression of N-hCR bearing genes. By altering amino acid abundance and modulating ribosome translocation rates at codon repeats, the microbiomic environment may contribute to genome decoding and to shaping the proteome. We suggest that impaired translation at poly Asn codons elevates diabetes risk and severity.
Sequence Polishing Library (SPL) v10.0
DOE Office of Scientific and Technical Information (OSTI.GOV)
Oberortner, Ernst
The Sequence Polishing Library (SPL) is a suite of software tools in order to automate "Design for Synthesis and Assembly" workflows. Specifically: The SPL "Converter" tool converts files among the following sequence data exchange formats: CSV, FASTA, GenBank, and Synthetic Biology Open Language (SBOL); The SPL "Juggler" tool optimizes the codon usages of DNA coding sequences according to an optimization strategy, a user-specific codon usage table and genetic code. In addition, the SPL "Juggler" can translate amino acid sequences into DNA sequences.:The SPL "Polisher" verifies NA sequences against DNA synthesis constraints, such as GC content, repeating k-mers, and restriction sites.more » In case of violations, the "Polisher" reports the violations in a comprehensive manner. The "Polisher" tool can also modify the violating regions according to an optimization strategy, a user-specific codon usage table and genetic code;The SPL "Partitioner" decomposes large DNA sequences into smaller building blocks with partial overlaps that enable an efficient assembly. The "Partitioner" enables the user to configure the characteristics of the overlaps, which are mostly determined by the utilized assembly protocol, such as length, GC content, or melting temperature.« less
Structural insights into eRF3 and stop codon recognition by eRF1
Cheng, Zhihong; Saito, Kazuki; Pisarev, Andrey V.; Wada, Miki; Pisareva, Vera P.; Pestova, Tatyana V.; Gajda, Michal; Round, Adam; Kong, Chunguang; Lim, Mengkiat; Nakamura, Yoshikazu; Svergun, Dmitri I.; Ito, Koichi; Song, Haiwei
2009-01-01
Eukaryotic translation termination is mediated by two interacting release factors, eRF1 and eRF3, which act cooperatively to ensure efficient stop codon recognition and fast polypeptide release. The crystal structures of human and Schizosaccharomyces pombe full-length eRF1 in complex with eRF3 lacking the GTPase domain revealed details of the interaction between these two factors and marked conformational changes in eRF1 that occur upon binding to eRF3, leading eRF1 to resemble a tRNA molecule. Small-angle X-ray scattering analysis of the eRF1/eRF3/GTP complex suggested that eRF1's M domain contacts eRF3's GTPase domain. Consistently, mutation of Arg192, which is predicted to come in close contact with the switch regions of eRF3, revealed its important role for eRF1's stimulatory effect on eRF3's GTPase activity. An ATP molecule used as a crystallization additive was bound in eRF1's putative decoding area. Mutational analysis of the ATP-binding site shed light on the mechanism of stop codon recognition by eRF1. PMID:19417105
On fuzzy semantic similarity measure for DNA coding.
Ahmad, Muneer; Jung, Low Tang; Bhuiyan, Md Al-Amin
2016-02-01
A coding measure scheme numerically translates the DNA sequence to a time domain signal for protein coding regions identification. A number of coding measure schemes based on numerology, geometry, fixed mapping, statistical characteristics and chemical attributes of nucleotides have been proposed in recent decades. Such coding measure schemes lack the biologically meaningful aspects of nucleotide data and hence do not significantly discriminate coding regions from non-coding regions. This paper presents a novel fuzzy semantic similarity measure (FSSM) coding scheme centering on FSSM codons׳ clustering and genetic code context of nucleotides. Certain natural characteristics of nucleotides i.e. appearance as a unique combination of triplets, preserving special structure and occurrence, and ability to own and share density distributions in codons have been exploited in FSSM. The nucleotides׳ fuzzy behaviors, semantic similarities and defuzzification based on the center of gravity of nucleotides revealed a strong correlation between nucleotides in codons. The proposed FSSM coding scheme attains a significant enhancement in coding regions identification i.e. 36-133% as compared to other existing coding measure schemes tested over more than 250 benchmarked and randomly taken DNA datasets of different organisms. Copyright © 2015 Elsevier Ltd. All rights reserved.
Bekliz, Meriem; Azza, Said; Seligmann, Hervé; Decloquement, Philippe; Raoult, Didier; La Scola, Bernard
2018-05-15
The Acanthamoeba polyphaga mimivirus is the first giant virus ever described, with a 1.2-Mb genome which encodes 979 proteins, including central components of the translation apparatus. One of these proteins, R458, was predicted to initiate translation, although its specific role remains unknown. We silenced the R458 gene using small interfering RNA (siRNA) and compared levels of viral fitness and protein expression in silenced versus wild-type mimivirus. Silencing decreased the growth rate, but viral particle production at the end of the viral cycle was unaffected. A comparative proteomic approach using two-dimensional difference-in-gel electrophoresis (2D-DIGE) revealed deregulation of the expression of 32 proteins in silenced mimivirus, which were defined as up- or downregulated. Besides revealing proteins with unknown functions, silencing R458 also revealed deregulation in proteins associated with viral particle structures, transcriptional machinery, oxidative pathways, modification of proteins/lipids, and DNA topology/repair. Most of these proteins belong to genes transcribed at the end of the viral cycle. Overall, our data suggest that the R458 protein regulates the expression of mimivirus proteins and, thus, that mimivirus translational proteins may not be strictly redundant in relation to those from the amoeba host. As is the case for eukaryotic initiation factor 4a (eIF4a), the R458 protein is the prototypical member of the ATP-dependent DEAD box RNA helicase mechanism. We suggest that the R458 protein is required to unwind the secondary structures at the 5' ends of mRNAs and to bind the mRNA to the ribosome, making it possible to scan for the start codon. These data are the first experimental evidence of mimivirus translation-related genes, predicted to initiate protein biosynthesis. IMPORTANCE The presence in the genome of a mimivirus of genes coding for many translational processes, with the exception of ribosome constituents, has been the subject of debate since its discovery in 2003. In this work, we focused on the R458 mimivirus gene, predicted to initiate protein biosynthesis. After silencing was performed, we observed that it has no major effect on mimivirus multiplication but that it affects protein expression and fitness. This suggests that it is effectively used by mimivirus during its developmental cycle. Until large-scale genetic manipulation of giant viruses becomes possible, the silencing strategy used here on mimivirus translation-related factors will open the way to understanding the functions of these translational genes. Copyright © 2018 American Society for Microbiology.
Théberge, M; Lacaze, P; Shareck, F; Morosoli, R; Kluepfel, D
1992-01-01
The endoglucanase isolated from culture filtrates of Streptomyces lividans IAF74 was shown to have an Mr of 46,000 and a pI of 3.3. The specific enzyme activity of 539 IU/mg, determined by the reducing assay method on carboxymethyl cellulose, is among the highest reported in the literature. The cellulase showed typical endo-type activity when reacting on oligocellodextrins. Optimal enzyme activity was obtained at 50 degrees C and pH 5.5. The kinetic constants for this endoglucanase, determined with carboxymethyl cellulose as the substrate, were a Vmax of 24.9 IU/mg of enzyme and a Km of 4.2 mg/ml. Activity was found against neither methylumbelliferyl- nor p-nitrophenyl-cellobiopyranoside nor with xylan. The DNA sequence contains one possible reading frame validated by the N terminus of the mature purified protein. However, neither ATG nor GTG starting codons were identified near the ribosome-binding site. A putative TTG codon was found as a good candidate for the start codon. Comparison of the primary amino acid sequence of the endoglucanase of S. lividans revealed that the N terminus contains a bacterial cellulose-binding domain. The catalytic domain at the C terminus showed similarity to endoglucanases from a Bacillus sp. Thus, the endoglucanase CelA belongs to family A of cellulases as described before (N. R. Gilkes, B. Henrissat, D. G. Kilburn, R. C. Miller, Jr., and R. A. J. Warren, Microbiol. Rev. 55:303-315, 1991. Images PMID:1575483
Cao, Jin-Jun; Li, Wei-Hai
2018-01-01
Stoneflies comprise an ancient group of insects, but the phylogenetic position of Plecoptera and phylogenetic relations within Plecoptera have long been controversial, and more molecular data is required to reconstruct precise phylogeny. Herein, we present the complete mitogenome of a stonefly, Suwallia teleckojensis, which is 16146 bp in length and consists of 13 protein-coding genes (PCGs), 2 ribosomal RNAs (rRNAs), 22 transfer RNAs (tRNAs) and a control region (CR). Most PCGs initiate with the standard start codon ATN. However, ND5 and ND1 started with GTG and TTG. Typical termination codons TAA and TAG were found in eleven PCGs, and the remaining two PCGs (COII and ND5) have incomplete termination codons. All transfer RNA genes (tRNAs) have the classic cloverleaf secondary structures, with the exception of tRNASer(AGN), which lacks the dihydrouridine (DHU) arm. Secondary structures of the two ribosomal RNAs were shown referring to previous models. A large tandem repeat region, two potential stem-loop (SL) structures, Poly N structure (2 poly-A, 1 poly-T and 1 poly-C), and four conserved sequence blocks (CSBs) were detected in the control region. Finally, both maximum likelihood (ML) and Bayesian inference (BI) analyses suggested that the Capniidae was monophyletic, and the other five stonefly families form a monophyletic group. In this study, S. teleckojensis was closely related to Sweltsa longistyla, and Chloroperlidae and Perlidae were herein supported to be a sister group. PMID:29495588
Wang, Ying; Cao, Jin-Jun; Li, Wei-Hai
2018-02-28
Stoneflies comprise an ancient group of insects, but the phylogenetic position of Plecoptera and phylogenetic relations within Plecoptera have long been controversial, and more molecular data is required to reconstruct precise phylogeny. Herein, we present the complete mitogenome of a stonefly, Suwallia teleckojensis , which is 16146 bp in length and consists of 13 protein-coding genes (PCGs), 2 ribosomal RNAs (rRNAs), 22 transfer RNAs (tRNAs) and a control region (CR). Most PCGs initiate with the standard start codon ATN. However, ND5 and ND1 started with GTG and TTG. Typical termination codons TAA and TAG were found in eleven PCGs, and the remaining two PCGs ( COII and ND5 ) have incomplete termination codons. All transfer RNA genes (tRNAs) have the classic cloverleaf secondary structures, with the exception of tRNA Ser(AGN) , which lacks the dihydrouridine (DHU) arm. Secondary structures of the two ribosomal RNAs were shown referring to previous models. A large tandem repeat region, two potential stem-loop (SL) structures, Poly N structure (2 poly-A, 1 poly-T and 1 poly-C), and four conserved sequence blocks (CSBs) were detected in the control region. Finally, both maximum likelihood (ML) and Bayesian inference (BI) analyses suggested that the Capniidae was monophyletic, and the other five stonefly families form a monophyletic group. In this study, S. teleckojensis was closely related to Sweltsa longistyla , and Chloroperlidae and Perlidae were herein supported to be a sister group.
Kim, Seong Ryeol; Kim, Man Il; Hong, Mee Yeon; Kim, Kee Young; Kang, Pil Don; Hwang, Jae Sam; Han, Yeon Soo; Jin, Byung Rae; Kim, Iksoo
2009-09-01
The 15,338-bp long complete mitochondrial genome (mitogenome) of the Japanese oak silkmoth, Antheraea yamamai (Lepidoptera: Saturniidae) was determined. This genome has a gene arrangement identical to those of all other sequenced lepidopteran insects, but differs from the most common type, as the result of the movement of tRNA(Met) to a position 5'-upstream of tRNA(Ile). No typical start codon of the A. yamamai COI gene is available. Instead, a tetranucleotide, TTAG, which is found at the beginning context of all sequenced lepidopteran insects was tentatively designated as the start codon for A. yamamai COI gene. Three of the 13 protein-coding genes (PCGs) harbor the incomplete termination codon, T or TA. All tRNAs formed stable stem-and-loop structures, with the exception of tRNA(Ser)(AGN), the DHU arm of which formed a simple loop as has been observed in many other metazoan mt tRNA(Ser)(AGN). The 334-bp long A + T-rich region is noteworthy in that it harbors tRNA-like structures, as has also been seen in the A + T-rich regions of other insect mitogenomes. Phylogenetic analyses of the available species of Bombycoidea, Pyraloidea, and Tortricidea bolstered the current morphology-based hypothesis that Bombycoidea and Pyraloidea are monophyletic (Obtectomera). As has been previously suggested, Bombycidae (Bombyx mori and B. mandarina) and Saturniidae (A. yamamai and Caligula boisduvalii) formed a reciprocal monophyletic group.
Chen, Wanping; Xie, Ting; Shao, Yanchun; Chen, Fusheng
2012-04-10
Filamentous fungi are widely exploited in food industry due to their abilities to secrete large amounts of enzymes and metabolites. The recent availability of fungal genome sequences has provided an opportunity to explore the genomic characteristics of these food-related filamentous fungi. In this paper, we selected 12 representative filamentous fungi in the areas of food processing and safety, which were Aspergillus clavatus, A. flavus, A. fumigatus, A. nidulans, A. niger, A. oryzae, A. terreus, Monascus ruber, Neurospora crassa, Penicillium chrysogenum, Rhizopus oryzae and Trichoderma reesei, and did the comparative studies of their genomic characteristics of tRNA gene distribution, codon usage pattern and amino acid composition. The results showed that the copy numbers greatly differed among isoaccepting tRNA genes and the distribution seemed to be related with translation process. The results also revealed that genome compositional variation probably constrained the base choice at the third codon, and affected the overall amino acid composition but seemed to have little effect on the integrated physicochemical characteristics of overall amino acids. The further analysis suggested that the wobble pairing and base modification were the important mechanisms in codon-anticodon interaction. In the scope of authors' knowledge, it is the first report about the genomic characteristics analysis of food-related filamentous fungi, which would be informative for the analysis of filamentous fungal genome evolution and their practical application in food industry. Copyright © 2012 Elsevier B.V. All rights reserved.
Therapy for Duchenne muscular dystrophy: renewed optimism from genetic approaches.
Fairclough, Rebecca J; Wood, Matthew J; Davies, Kay E
2013-06-01
Duchenne muscular dystrophy (DMD) is a devastating progressive disease for which there is currently no effective treatment except palliative therapy. There are several promising genetic approaches, including viral delivery of the missing dystrophin gene, read-through of translation stop codons, exon skipping to restore the reading frame and increased expression of the compensatory utrophin gene. The lessons learned from these approaches will be applicable to many other disorders.
MS-READ: Quantitative measurement of amino acid incorporation.
Mohler, Kyle; Aerni, Hans-Rudolf; Gassaway, Brandon; Ling, Jiqiang; Ibba, Michael; Rinehart, Jesse
2017-11-01
Ribosomal protein synthesis results in the genetically programmed incorporation of amino acids into a growing polypeptide chain. Faithful amino acid incorporation that accurately reflects the genetic code is critical to the structure and function of proteins as well as overall proteome integrity. Errors in protein synthesis are generally detrimental to cellular processes yet emerging evidence suggest that proteome diversity generated through mistranslation may be beneficial under certain conditions. Cumulative translational error rates have been determined at the organismal level, however codon specific error rates and the spectrum of misincorporation errors from system to system remain largely unexplored. In particular, until recently technical challenges have limited the ability to detect and quantify comparatively rare amino acid misincorporation events, which occur orders of magnitude less frequently than canonical amino acid incorporation events. We now describe a technique for the quantitative analysis of amino acid incorporation that provides the sensitivity necessary to detect mistranslation events during translation of a single codon at frequencies as low as 1 in 10,000 for all 20 proteinogenic amino acids, as well as non-proteinogenic and modified amino acids. This article is part of a Special Issue entitled "Biochemistry of Synthetic Biology - Recent Developments" Guest Editor: Dr. Ilka Heinemann and Dr. Patrick O'Donoghue. Copyright © 2017 Elsevier B.V. All rights reserved.
Transimulation - protein biosynthesis web service.
Siwiak, Marlena; Zielenkiewicz, Piotr
2013-01-01
Although translation is the key step during gene expression, it remains poorly characterized at the level of individual genes. For this reason, we developed Transimulation - a web service measuring translational activity of genes in three model organisms: Escherichia coli, Saccharomyces cerevisiae and Homo sapiens. The calculations are based on our previous computational model of translation and experimental data sets. Transimulation quantifies mean translation initiation and elongation time (expressed in SI units), and the number of proteins produced per transcript. It also approximates the number of ribosomes that typically occupy a transcript during translation, and simulates their propagation. The simulation of ribosomes' movement is interactive and allows modifying the coding sequence on the fly. It also enables uploading any coding sequence and simulating its translation in one of three model organisms. In such a case, ribosomes propagate according to mean codon elongation times of the host organism, which may prove useful for heterologous expression. Transimulation was used to examine evolutionary conservation of translational parameters of orthologous genes. Transimulation may be accessed at http://nexus.ibb.waw.pl/Transimulation (requires Java version 1.7 or higher). Its manual and source code, distributed under the GPL-2.0 license, is freely available at the website.
Raimondeau, Etienne; Bufton, Joshua C; Schaffitzel, Christiane
2018-06-19
Faulty mRNAs with a premature stop codon (PTC) are recognized and degraded by nonsense-mediated mRNA decay (NMD). Recognition of a nonsense mRNA depends on translation and on the presence of NMD-enhancing or the absence of NMD-inhibiting factors in the 3'-untranslated region. Our review summarizes our current understanding of the molecular function of the conserved NMD factors UPF3B and UPF1, and of the anti-NMD factor Poly(A)-binding protein, and their interactions with ribosomes translating PTC-containing mRNAs. Our recent discovery that UPF3B interferes with human translation termination and enhances ribosome dissociation in vitro , whereas UPF1 is inactive in these assays, suggests a re-interpretation of previous experiments and modification of prevalent NMD models. Moreover, we discuss recent work suggesting new functions of the key NMD factor UPF1 in ribosome recycling, inhibition of translation re-initiation and nascent chain ubiquitylation. These new findings suggest that the interplay of UPF proteins with the translation machinery is more intricate than previously appreciated, and that this interplay quality-controls the efficiency of termination, ribosome recycling and translation re-initiation. © 2018 The Author(s).
Ren, Qian; Au, Hilda H.T.; Wang, Qing S.; Lee, Seonghoon; Jan, Eric
2014-01-01
The dicistrovirus intergenic internal ribosome entry site (IGR IRES) directly recruits the ribosome and initiates translation using a non-AUG codon. A subset of IGR IRESs initiates translation in either of two overlapping open reading frames (ORFs), resulting in expression of the 0 frame viral structural polyprotein and an overlapping +1 frame ORFx. A U–G base pair adjacent to the anticodon-like pseudoknot of the IRES directs +1 frame translation. Here, we show that the U-G base pair is not absolutely required for +1 frame translation. Extensive mutagenesis demonstrates that 0 and +1 frame translation can be uncoupled. Ribonucleic acid (RNA) structural probing analyses reveal that the mutant IRESs adopt distinct conformations. Toeprinting analysis suggests that the reading frame is selected at a step downstream of ribosome assembly. We propose a model whereby the IRES adopts conformations to occlude the 0 frame aminoacyl-tRNA thereby allowing delivery of the +1 frame aminoacyl-tRNA to the A site to initiate translation of ORFx. This study provides a new paradigm for programmed recoding mechanisms that increase the coding capacity of a viral genome. PMID:25038250
Translational control of ribosomal protein S15.
Portier, C; Philippe, C; Dondon, L; Grunberg-Manago, M; Ebel, J P; Ehresmann, B; Ehresmann, C
1990-08-27
The expression of ribosomal protein S15 is shown to be translationally and negatively autocontrolled using a fusion within a reporter gene. Isolation and characterization of several deregulated mutants indicate that the regulatory site (the translational operator site) overlaps the ribosome loading site of the S15 messenger. In this region, three domains, each exhibiting a stem-loop structure, were determined using chemical and enzymatic probes. The most downstream hairpin carries the Shine-Dalgarno sequence and the initiation codon. Genetic and structural data derived from mutants constructed by site-directed mutagenesis show that the operator is a dynamic structure, two domains of which can form a pseudoknot. Binding of S15 to these two domains suggests that the pseudoknot could be stabilized by S15. A model is presented in which two alternative structures would explain the molecular basis of the S15 autocontrol.
Mistranslation: from adaptations to applications.
Hoffman, Kyle S; O'Donoghue, Patrick; Brandl, Christopher J
2017-11-01
The conservation of the genetic code indicates that there was a single origin, but like all genetic material, the cell's interpretation of the code is subject to evolutionary pressure. Single nucleotide variations in tRNA sequences can modulate codon assignments by altering codon-anticodon pairing or tRNA charging. Either can increase translation errors and even change the code. The frozen accident hypothesis argued that changes to the code would destabilize the proteome and reduce fitness. In studies of model organisms, mistranslation often acts as an adaptive response. These studies reveal evolutionary conserved mechanisms to maintain proteostasis even during high rates of mistranslation. This review discusses the evolutionary basis of altered genetic codes, how mistranslation is identified, and how deviations to the genetic code are exploited. We revisit early discoveries of genetic code deviations and provide examples of adaptive mistranslation events in nature. Lastly, we highlight innovations in synthetic biology to expand the genetic code. The genetic code is still evolving. Mistranslation increases proteomic diversity that enables cells to survive stress conditions or suppress a deleterious allele. Genetic code variants have been identified by genome and metagenome sequence analyses, suppressor genetics, and biochemical characterization. Understanding the mechanisms of translation and genetic code deviations enables the design of new codes to produce novel proteins. Engineering the translation machinery and expanding the genetic code to incorporate non-canonical amino acids are valuable tools in synthetic biology that are impacting biomedical research. This article is part of a Special Issue entitled "Biochemistry of Synthetic Biology - Recent Developments" Guest Editor: Dr. Ilka Heinemann and Dr. Patrick O'Donoghue. Copyright © 2017 Elsevier B.V. All rights reserved.
Identification and expression analysis of cDNA encoding insulin-like growth factor 2 in horses
KIKUCHI, Kohta; SASAKI, Keisuke; AKIZAWA, Hiroki; TSUKAHARA, Hayato; BAI, Hanako; TAKAHASHI, Masashi; NAMBO, Yasuo; HATA, Hiroshi; KAWAHARA, Manabu
2017-01-01
Insulin-like growth factor 2 (IGF2) is responsible for a broad range of physiological processes during fetal development and adulthood, but genomic analyses of IGF2 containing the 5ʹ- and 3ʹ-untranslated regions (UTRs) in equines have been limited. In this study, we characterized the IGF2 mRNA containing the UTRs, and determined its expression pattern in the fetal tissues of horses. The complete equine IGF2 mRNA sequence harboring another exon approximately 2.8 kb upstream from the canonical transcription start site was identified as a new transcript variant. As this upstream exon did not contain the start codon, the amino acid sequence was identical to the canonical variant. Analysis of the deduced amino acid sequence revealed that the protein possessed two major domains, IlGF and IGF2_C, and analysis of IGF2 sequence polymorphism in fetal tissues of Hokkaido native horse and Thoroughbreds revealed a single nucleotide polymorphism (T to C transition) at position 398 in Thoroughbreds, which caused an amino acid substitution at position 133 in the IGF2 sequence. Furthermore, the expression pattern of the IGF2 mRNA in the fetal tissues of horses was determined for the first time, and was found to be consistent with those of other species. Taken together, these results suggested that the transcriptional and translational products of the IGF2 gene have conserved functions in the fetal development of mammals, including horses. PMID:29151450
DOE Office of Scientific and Technical Information (OSTI.GOV)
Guss, Adam M.; Rother, Michael; Zhang, Jun Kai
A highly efficient method for chromosomal integration of cloned DNA into Methanosarcina spp. was developed utilizing the site-specific recombination system from the Streptomyces phage φC31. Host strains expressing the φC31 integrase gene and carrying an appropriate recombination site can be transformed with non-replicating plasmids carrying the complementary recombination site at efficiencies similar to those obtained with self-replicating vectors. We have also constructed a series of hybrid promoters that combine the highly expressed M. barkeri P mcrB promoter with binding sites for the tetracycline-responsive, bacterial TetR protein. These promoters are tightly regulated by the presence or absence of tetracycline in strainsmore » that express the tetR gene. The hybrid promoters can be used in genetic experiments to test gene essentiality by placing a gene of interest under their control. Thus, growth of strains with tetR -regulated essential genes becomes tetracycline-dependent. A series of plasmid vectors that utilize the site-specific recombination system for construction of reporter gene fusions and for tetracycline regulated expression of cloned genes are reported. These vectors were used to test the efficiency of translation at a variety of start codons. Fusions using an ATG start site were the most active, whereas those using GTG and TTG were approximately one half or one fourth as active, respectively. The CTG fusion was 95% less active than the ATG fusion.« less
Guss, Adam M.; Rother, Michael; Zhang, Jun Kai; ...
2008-01-01
A highly efficient method for chromosomal integration of cloned DNA into Methanosarcina spp. was developed utilizing the site-specific recombination system from the Streptomyces phage φC31. Host strains expressing the φC31 integrase gene and carrying an appropriate recombination site can be transformed with non-replicating plasmids carrying the complementary recombination site at efficiencies similar to those obtained with self-replicating vectors. We have also constructed a series of hybrid promoters that combine the highly expressed M. barkeri P mcrB promoter with binding sites for the tetracycline-responsive, bacterial TetR protein. These promoters are tightly regulated by the presence or absence of tetracycline in strainsmore » that express the tetR gene. The hybrid promoters can be used in genetic experiments to test gene essentiality by placing a gene of interest under their control. Thus, growth of strains with tetR -regulated essential genes becomes tetracycline-dependent. A series of plasmid vectors that utilize the site-specific recombination system for construction of reporter gene fusions and for tetracycline regulated expression of cloned genes are reported. These vectors were used to test the efficiency of translation at a variety of start codons. Fusions using an ATG start site were the most active, whereas those using GTG and TTG were approximately one half or one fourth as active, respectively. The CTG fusion was 95% less active than the ATG fusion.« less
Garver, Kyle A.; Conway, Carla M.; Kurath, Gael
2006-01-01
A highly efficacious DNA vaccine against a fish rhabdovirus, infectious hematopoietic necrosis virus (IHNV), was mutated to introduce two stop codons to prevent glycoprotein translation while maintaining the plasmid DNA integrity and RNA transcription ability. The mutated plasmid vaccine, denoted pIHNw-G2stop, when injected intramuscularly into fish at high doses, lacked detectable glycoprotein expression in the injection site muscle, and did not provide protection against lethal virus challenge 7 days post-vaccination. These results suggest that the G-protein itself is required to stimulate the early protective antiviral response observed after vaccination with the nonmutated parental DNA vaccine.
Emerging functions of alternative splicing coupled with nonsense-mediated decay.
Hamid, Fursham M; Makeyev, Eugene V
2014-08-01
Higher eukaryotes rely on AS (alternative splicing) of pre-mRNAs (mRNA precursors) to generate more than one protein product from a single gene and to regulate mRNA stability and translational activity. An important example of the latter function involves an interplay between AS and NMD (nonsense-mediated decay), a cytoplasmic quality control mechanism eliminating mRNAs containing PTCs (premature translation termination codons). Although originally identified as an error surveillance process, AS-NMD additionally provides an efficient strategy for deterministic regulation of gene expression outputs. In this review, we discuss recently published examples of AS-NMD and delineate functional contexts where recurrent use of this mechanism orchestrates expression of important genes.
Okumiya, T; Takenaka, T; Ishii, S; Kase, R; Kamei, S; Sakuraba, H
1996-09-01
Four alpha-galactosidase gene mutations were identified in Japanese male patients with Fabry disease who had no detectable alpha-galactosidase activity. Two of them were novel mutations, an 11-bp deletion in exon 2 and a g-1 to t substitution at the 3' end of the splice acceptor site in intron 1. The former caused a frameshift and led to the creation of a new stop codon at codon 118. The latter was predicted to provoke aberrant mRNA splicing followed by accelerated degradation of the mRNA. A nonsense mutation, R301X, and a 2-bp deletion starting at nucleotide position 718, which were reported previously, were also identified in unrelated patients.
Rubinstein, M; Mogil, J S; Japón, M; Chan, E C; Allen, R G; Low, M J
1996-01-01
A physiological role for beta-endorphin in endogenous pain inhibition was investigated by targeted mutagenesis of the proopiomelanocortin gene in mouse embryonic stem cells. The tyrosine codon at position 179 of the proopiomelanocortin gene was converted to a premature translational stop codon. The resulting transgenic mice display no overt developmental or behavioral alterations and have a normally functioning hypothalamic-pituitary-adrenal axis. Homozygous transgenic mice with a selective deficiency of beta-endorphin exhibit normal analgesia in response to morphine, indicating the presence of functional mu-opiate receptors. However, these mice lack the opioid (naloxone reversible) analgesia induced by mild swim stress. Mutant mice also display significantly greater nonopioid analgesia in response to cold water swim stress compared with controls and display paradoxical naloxone-induced analgesia. These changes may reflect compensatory upregulation of alternative pain inhibitory mechanisms. Images Fig. 1 Fig. 2 PMID:8633004
Na, Dokyun; Lee, Doheon
2010-10-15
RBSDesigner predicts the translation efficiency of existing mRNA sequences and designs synthetic ribosome binding sites (RBSs) for a given coding sequence (CDS) to yield a desired level of protein expression. The program implements the mathematical model for translation initiation described in Na et al. (Mathematical modeling of translation initiation for the estimation of its efficiency to computationally design mRNA sequences with a desired expression level in prokaryotes. BMC Syst. Biol., 4, 71). The program additionally incorporates the effect on translation efficiency of the spacer length between a Shine-Dalgarno (SD) sequence and an AUG codon, which is crucial for the incorporation of fMet-tRNA into the ribosome. RBSDesigner provides a graphical user interface (GUI) for the convenient design of synthetic RBSs. RBSDesigner is written in Python and Microsoft Visual Basic 6.0 and is publicly available as precompiled stand-alone software on the web (http://rbs.kaist.ac.kr). dhlee@kaist.ac.kr
Web application for automatic prediction of gene translation elongation efficiency.
Sokolov, Vladimir; Zuraev, Bulat; Lashin, Sergei; Matushkin, Yury
2015-09-03
Expression efficiency is one of the major characteristics describing genes in various modern investigations. Expression efficiency of genes is regulated at various stages: transcription, translation, posttranslational protein modification and others. In this study, a special EloE (Elongation Efficiency) web application is described. The EloE sorts the organism's genes in a descend order on their theoretical rate of the elongation stage of translation based on the analysis of their nucleotide sequences. Obtained theoretical data have a significant correlation with available experimental data of gene expression in various organisms. In addition, the program identifies preferential codons in organism's genes and defines distribution of potential secondary structures energy in 5´ and 3´ regions of mRNA. The EloE can be useful in preliminary estimation of translation elongation efficiency for genes for which experimental data are not available yet. Some results can be used, for instance, in other programs modeling artificial genetic structures in genetically engineered experiments.
SunRiSE - measuring translation elongation at single-cell resolution by means of flow cytometry.
Argüello, Rafael J; Reverendo, Marisa; Mendes, Andreia; Camosseto, Voahirana; Torres, Adrian G; Ribas de Pouplana, Lluis; van de Pavert, Serge A; Gatti, Evelina; Pierre, Philippe
2018-05-31
The rate at which ribosomes translate mRNAs regulates protein expression by controlling co-translational protein folding and mRNA stability. Many factors regulate translation elongation, including tRNA levels, codon usage and phosphorylation of eukaryotic elongation factor 2 (eEF2). Current methods to measure translation elongation lack single-cell resolution, require expression of multiple transgenes and have never been successfully applied ex vivo Here, we show, by using a combination of puromycilation detection and flow cytometry (a method we call 'SunRiSE'), that translation elongation can be measured accurately in primary cells in pure or heterogenous populations isolated from blood or tissues. This method allows for the simultaneous monitoring of multiple parameters, such as mTOR or S6K1/2 signaling activity, the cell cycle stage and phosphorylation of translation factors in single cells, without elaborated, costly and lengthy purification procedures. We took advantage of SunRiSE to demonstrate that, in mouse embryonic fibroblasts, eEF2 phosphorylation by eEF2 kinase (eEF2K) mostly affects translation engagement, but has a surprisingly small effect on elongation, except after proteotoxic stress induction.This article has an associated First Person interview with the first author of the paper. © 2018. Published by The Company of Biologists Ltd.
Liang, Hui; He, Shiming; Yang, Jingyi; Jia, Xinying; Wang, Pan; Chen, Xi; Zhang, Zhong; Zou, Xiajuan; McNutt, Michael A; Shen, Wen Hong; Yin, Yuxin
2014-05-06
PTEN is one of the most frequently mutated genes in human cancer. It is known that PTEN has a wide range of biological functions beyond tumor suppression. Here, we report that PTENα, an N-terminally extended form of PTEN, functions in mitochondrial metabolism. Translation of PTENα is initiated from a CUG codon upstream of and in-frame with the coding region of canonical PTEN. Eukaryotic translation initiation factor 2A (eIF2A) controls PTENα translation, which requires a CUG-centered palindromic motif. We show that PTENα induces cytochrome c oxidase activity and ATP production in mitochondria. TALEN-mediated somatic deletion of PTENα impairs mitochondrial respiratory chain function. PTENα interacts with canonical PTEN to increase PINK1 protein levels and promote energy production. Our studies demonstrate the importance of eIF2A-mediated alternative translation for generation of protein diversity in eukaryotic systems and provide insights into the mechanism by which the PTEN family is involved in multiple cellular processes. Copyright © 2014 Elsevier Inc. All rights reserved.
RNA Editing and Its Molecular Mechanism in Plant Organelles
Ichinose, Mizuho; Sugita, Mamoru
2016-01-01
RNA editing by cytidine (C) to uridine (U) conversions is widespread in plant mitochondria and chloroplasts. In some plant taxa, “reverse” U-to-C editing also occurs. However, to date, no instance of RNA editing has yet been reported in green algae and the complex thalloid liverworts. RNA editing may have evolved in early land plants 450 million years ago. However, in some plant species, including the liverwort, Marchantia polymorpha, editing may have been lost during evolution. Most RNA editing events can restore the evolutionarily conserved amino acid residues in mRNAs or create translation start and stop codons. Therefore, RNA editing is an essential process to maintain genetic information at the RNA level. Individual RNA editing sites are recognized by plant-specific pentatricopeptide repeat (PPR) proteins that are encoded in the nuclear genome. These PPR proteins are characterized by repeat elements that bind specifically to RNA sequences upstream of target editing sites. In flowering plants, non-PPR proteins also participate in multiple RNA editing events as auxiliary factors. C-to-U editing can be explained by cytidine deamination. The proteins discovered to date are important factors for RNA editing but a bona fide RNA editing enzyme has yet to be identified. PMID:28025543
Katzif, Samuel; Danavall, Damien; Bowers, Samera; Balthazar, Jacqueline T.; Shafer, William M.
2003-01-01
A Tn551 insertional library of Staphylococcus aureus strain ISP479 was challenged with an antimicrobial peptide (CG 117-136) derived from human neutrophil cathepsin G (CG). After repeated selection and screening of surviving colonies, a mutant was identified that expressed increased resistance to CG 117-136. Southern hybridization analysis revealed that the Tn551 insert in this mutant (SK1) was carried on a 10.6-kb EcoRI chromosomal DNA fragment. Subsequent physical mapping of this Tn551 insert revealed that it was positioned between a putative promoter sequence and the translational start codon of the cspA gene, which encodes a protein (CspA) highly similar to the major cold shock proteins CspA and CspB of Escherichia coli and Bacillus subtilis, respectively. This Tn551 insertion as well as a separate deletion-insertion mutation in cspA decreased the capacity of S. aureus to respond to the stress of cold shock and increased resistance to CG 117-136. The results indicate for the first time that a physiologic link exists between bacterial susceptibility to an antimicrobial peptide and a stress response system. PMID:12874306
Katzif, Samuel; Danavall, Damien; Bowers, Samera; Balthazar, Jacqueline T; Shafer, William M
2003-08-01
A Tn551 insertional library of Staphylococcus aureus strain ISP479 was challenged with an antimicrobial peptide (CG 117-136) derived from human neutrophil cathepsin G (CG). After repeated selection and screening of surviving colonies, a mutant was identified that expressed increased resistance to CG 117-136. Southern hybridization analysis revealed that the Tn551 insert in this mutant (SK1) was carried on a 10.6-kb EcoRI chromosomal DNA fragment. Subsequent physical mapping of this Tn551 insert revealed that it was positioned between a putative promoter sequence and the translational start codon of the cspA gene, which encodes a protein (CspA) highly similar to the major cold shock proteins CspA and CspB of Escherichia coli and Bacillus subtilis, respectively. This Tn551 insertion as well as a separate deletion-insertion mutation in cspA decreased the capacity of S. aureus to respond to the stress of cold shock and increased resistance to CG 117-136. The results indicate for the first time that a physiologic link exists between bacterial susceptibility to an antimicrobial peptide and a stress response system.
Yeast eIF4A enhances recruitment of mRNAs regardless of their structural complexity
Yourik, Paul; Aitken, Colin Echeverría; Zhou, Fujun; Gupta, Neha
2017-01-01
eIF4A is a DEAD-box RNA-dependent ATPase thought to unwind RNA secondary structure in the 5'-untranslated regions (UTRs) of mRNAs to promote their recruitment to the eukaryotic translation pre-initiation complex (PIC). We show that eIF4A's ATPase activity is markedly stimulated in the presence of the PIC, independently of eIF4E•eIF4G, but dependent on subunits i and g of the heteromeric eIF3 complex. Surprisingly, eIF4A accelerated the rate of recruitment of all mRNAs tested, regardless of their degree of structural complexity. Structures in the 5'-UTR and 3' of the start codon synergistically inhibit mRNA recruitment in a manner relieved by eIF4A, indicating that the factor does not act solely to melt hairpins in 5'-UTRs. Our findings that eIF4A functionally interacts with the PIC and plays important roles beyond unwinding 5'-UTR structure is consistent with a recent proposal that eIF4A modulates the conformation of the 40S ribosomal subunit to promote mRNA recruitment. PMID:29192585
USDA-ARS?s Scientific Manuscript database
Translation of influenza A virus PB1-F2 occurs in a second open reading frame (ORF) of the PB1 gene segment. PB1-F2 has been implicated in regulation of polymerase activity, immunopathology, susceptibility to secondary bacterial infection, and induction of apoptosis. Experimental evidence of PB1-F2 ...
Al-Hadid, Qais; White, Jonelle; Clarke, Steven
2016-02-12
A significant percentage of the methyltransferasome in Saccharomyces cerevisiae and higher eukaryotes is devoted to methylation of the translational machinery. Methylation of the RNA components of the translational machinery has been studied extensively and is important for structure stability, ribosome biogenesis, and translational fidelity. However, the functional effects of ribosomal protein methylation by their cognate methyltransferases are still largely unknown. Previous work has shown that the ribosomal protein Rpl3 methyltransferase, histidine protein methyltransferase 1 (Hpm1), is important for ribosome biogenesis and translation elongation fidelity. In this study, yeast strains deficient in each of the ten ribosomal protein methyltransferases in S. cerevisiae were examined for potential defects in ribosome biogenesis and translation. Like Hpm1-deficient cells, loss of four of the nine other ribosomal protein methyltransferases resulted in defects in ribosomal subunit synthesis. All of the mutant strains exhibited resistance to the ribosome inhibitors anisomycin and/or cycloheximide in plate assays, but not in liquid culture. Translational fidelity assays measuring stop codon readthrough, amino acid misincorporation, and programmed -1 ribosomal frameshifting, revealed that eight of the ten enzymes are important for translation elongation fidelity and the remaining two are necessary for translation termination efficiency. Altogether, these results demonstrate that ribosomal protein methyltransferases in S. cerevisiae play important roles in ribosome biogenesis and translation. Copyright © 2016 Elsevier Inc. All rights reserved.
Zhao, Xing; Liang, Ai-Ping
2016-09-01
The first complete DNA sequence of the mitochondrial genome (mitogenome) of Leptobelus gazelle (Membracoidea: Hemiptera) is determined in this study. The circular molecule is 16,007 bp in its full length, which encodes a set of 37 genes, including 13 proteins, 2 ribosomal RNAs, 22 transfer RNAs, and contains an A + T-rich region (CR). The gene numbers, content, and organization of L. gazelle are similar to other typical metazoan mitogenomes. Twelve of the 13 PCGs are initiated with ATR methionine or ATT isoleucine codons, except the atp8 gene that uses the ATC isoleucine as start signal. Ten of the 13 PCGs have complete termination codons, either TAA (nine genes) or TAG (cytb). The remaining 3 PCGs (cox1, cox2 and nad5) have incomplete termination codons T (AA). All of the 22 tRNAs can be folded in the form of a typical clover-leaf structure. The complete mitogenome sequence data of L. gazelle is useful for the phylogenetic and biogeographic studies of the Membracoidea and Hemiptera.
Niu, Fang-Fang; Zhu, Liang; Wang, Su; Wei, Shu-Jun
2016-07-01
Here, we report the mitochondrial genome sequence of the multicolored Asian lady beetle Harmonia axyridis (Pallas, 1773) (Coleoptera: Coccinellidae) (GenBank accession No. KR108208). This is the first species with sequenced mitochondrial genome from the genus Harmonia. The current length with partitial A + T-rich region of this mitochondrial genome is 16,387 bp. All the typical genes were sequenced except the trnI and trnQ. As in most other sequenced mitochondrial genomes of Coleoptera, there is no re-arrangement in the sequenced region compared with the pupative ancestral arrangement of insects. All protein-coding genes start with ATN codons. Five, five and three protein-coding genes stop with termination codon TAA, TA and T, respectively. Phylogenetic analysis using Bayesian method based on the first and second codon positions of the protein-coding genes supported that the Scirtidae is a basal lineage of Polyphaga. The Harmonia and the Coccinella form a sister lineage. The monophyly of Staphyliniformia, Scarabaeiformia and Cucujiformia was supported. The Buprestidae was found to be a sister group to the Bostrichiformia.
The Complete Mitochondrial Genome of the Rice Moth, Corcyra cephalonica
Wu, Yu-Peng; Li, Jie; Zhao, Jin-Liang; Su, Tian-Juan; Luo, A-Rong; Fan, Ren-Jun; Chen, Ming-Chang; Wu, Chun-Sheng; Zhu, Chao-Dong
2012-01-01
The complete mitochondrial genome (mitogenome) of the rice moth, Corcyra cephalonica Stainton (Lepidoptera: Pyralidae) was determined as a circular molecular of 15,273 bp in size. The mitogenome composition (37 genes) and gene order are the same as the other lepidopterans. Nucleotide composition of the C. cephalonica mitogenome is highly A+T biased (80.43%) like other insects. Twelve protein-coding genes start with a typical ATN codon, with the exception of coxl gene, which uses CGA as the initial codon. Nine protein-coding genes have the common stop codon TAA, and the nad2, cox1, cox2, and nad4 have single T as the incomplete stop codon. 22 tRNA genes demonstrated cloverleaf secondary structure. The mitogenome has several large intergenic spacer regions, the spacer1 between trnQ gene and nad2 gene, which is common in Lepidoptera. The spacer 3 between trnE and trnF includes microsatellite-like repeat regions (AT)18 and (TTAT)3. The spacer 4 (16 bp) between trnS2 gene and nad1 gene has a motif ATACTAT; another species, Sesamia inferens encodes ATCATAT at the same position, while other lepidopteran insects encode a similar ATACTAA motif. The spacer 6 is A+T rich region, include motif ATAGA and a 20-bp poly(T) stretch and two microsatellite (AT)9, (AT)8 elements. PMID:23413968
The complete mitochondrial genome of the rice moth, Corcyra cephalonica.
Wu, Yu-Peng; Li, Jie; Zhao, Jin-Liang; Su, Tian-Juan; Luo, A-Rong; Fan, Ren-Jun; Chen, Ming-Chang; Wu, Chun-Sheng; Zhu, Chao-Dong
2012-01-01
The complete mitochondrial genome (mitogenome) of the rice moth, Corcyra cephalonica Stainton (Lepidoptera: Pyralidae) was determined as a circular molecular of 15,273 bp in size. The mitogenome composition (37 genes) and gene order are the same as the other lepidopterans. Nucleotide composition of the C. cephalonica mitogenome is highly A+T biased (80.43%) like other insects. Twelve protein-coding genes start with a typical ATN codon, with the exception of coxl gene, which uses CGA as the initial codon. Nine protein-coding genes have the common stop codon TAA, and the nad2, cox1, cox2, and nad4 have single T as the incomplete stop codon. 22 tRNA genes demonstrated cloverleaf secondary structure. The mitogenome has several large intergenic spacer regions, the spacer1 between trnQ gene and nad2 gene, which is common in Lepidoptera. The spacer 3 between trnE and trnF includes microsatellite-like repeat regions (AT)18 and (TTAT)(3). The spacer 4 (16 bp) between trnS2 gene and nad1 gene has a motif ATACTAT; another species, Sesamia inferens encodes ATCATAT at the same position, while other lepidopteran insects encode a similar ATACTAA motif. The spacer 6 is A+T rich region, include motif ATAGA and a 20-bp poly(T) stretch and two microsatellite (AT)(9), (AT)(8) elements.
Pelletier, J; Kaplan, G; Racaniello, V R; Sonenberg, N
1988-01-01
Poliovirus mRNA contains a long 5' noncoding region of about 750 nucleotides (the exact number varies among the three virus serotypes), which contains several AUG codons upstream of the major initiator AUG. Unlike most eucaryotic mRNAs, poliovirus does not contain a m7GpppX (where X is any nucleotide) cap structure at its 5' end and is translated by a cap-independent mechanism. To study the manner by which poliovirus mRNA is expressed, we examined the translational efficiencies of a series of deletion mutants within the 5' noncoding region of the mRNA. In this paper we report striking translation system-specific differences in the ability of the altered mRNAs to be translated. The results suggest the existence of an inhibitory cis-acting element(s) within the 5' noncoding region of poliovirus (between nucleotides 70 and 381) which restricts mRNA translation in reticulocyte lysate, wheat germ extract, and Xenopus oocytes, but not in HeLa cell extracts. In addition, we show that HeLa cell extracts contain a trans-acting factor(s) that overcomes this restriction. Images PMID:2836606
Selenium. Role of the Essential Metalloid in Health
Kurokawa, Suguru; Berry, Marla J.
2015-01-01
Selenium is an essential micronutrient in mammals, but is also recognized as toxic in excess. It is a non-metal with properties that are intermediate between the chalcogen elements sulfur and tellurium. Selenium exerts its biological functions through selenoproteins. Selenoproteins contain selenium in the form of the 21st amino acid, selenocysteine (Sec), which is an analog of cysteine with the sulfur-containing side chain replaced by a Se-containing side chain. Sec is encoded by the codon UGA, which is one of three termination codons for mRNA translation in non-selenoprotein genes. Recognition of the UGA codon as a Sec insertion site instead of stop requires a Sec insertion sequence (SECIS) element in selenoprotein mRNAs and a unique selenocysteyl-tRNA, both of which are recognized by specialized protein factors. Unlike the 20 standard amino acids, Sec is biosynthesized from serine on its tRNA. Twenty-five selenoproteins are encoded in the human genome. Most of the selenoprotein genes were discovered by bioinformatics approaches, searching for SECIS elements downstream of in-frame UGA codons. Sec has been described as having stronger nucleophilic and electrophilic properties than cysteine, and Sec is present in the catalytic site of all selenoenzymes. Most selenoproteins, whose functions are known, are involved in redox systems and signaling pathways. However, several selenoproteins are not well characterized in terms of their function. The selenium field has grown dramatically in the last few decades, and research on selenium biology is providing extensive new information regarding its importance for human health. PMID:24470102
Seligmann, Hervé
2013-05-07
GenBank's EST database includes RNAs matching exactly human mitochondrial sequences assuming systematic asymmetric nucleotide exchange-transcription along exchange rules: A→G→C→U/T→A (12 ESTs), A→U/T→C→G→A (4 ESTs), C→G→U/T→C (3 ESTs), and A→C→G→U/T→A (1 EST), no RNAs correspond to other potential asymmetric exchange rules. Hypothetical polypeptides translated from nucleotide-exchanged human mitochondrial protein coding genes align with numerous GenBank proteins, predicted secondary structures resemble their putative GenBank homologue's. Two independent methods designed to detect overlapping genes (one based on nucleotide contents analyses in relation to replicative deamination gradients at third codon positions, and circular code analyses of codon contents based on frame redundancy), confirm nucleotide-exchange-encrypted overlapping genes. Methods converge on which genes are most probably active, and which not, and this for the various exchange rules. Mean EST lengths produced by different nucleotide exchanges are proportional to (a) extents that various bioinformatics analyses confirm the protein coding status of putative overlapping genes; (b) known kinetic chemistry parameters of the corresponding nucleotide substitutions by the human mitochondrial DNA polymerase gamma (nucleotide DNA misinsertion rates); (c) stop codon densities in predicted overlapping genes (stop codon readthrough and exchanging polymerization regulate gene expression by counterbalancing each other). Numerous rarely expressed proteins seem encoded within regular mitochondrial genes through asymmetric nucleotide exchange, avoiding lengthening genomes. Intersecting evidence between several independent approaches confirms the working hypothesis status of gene encryption by systematic nucleotide exchanges. Copyright © 2013 Elsevier Ltd. All rights reserved.
Vasquez, Kevin A; Hatridge, Taylor A; Curtis, Nicholas C; Contreras, Lydia M
2016-02-19
Recent studies have demonstrated that effective protein production requires coordination of multiple cotranslational cellular processes, which are heavily affected by translation timing. Until recently, protein engineering has focused on codon optimization to maximize protein production rates, mostly considering the effect of tRNA abundance. However, as it relates to complex multidomain proteins, it has been hypothesized that strategic translational pauses between domains and between distinct individual structural motifs can prevent interactions between nascent chain fragments that generate kinetically trapped misfolded peptides and thereby enhance protein yields. In this study, we introduce synthetic transient pauses between structural domains in a heterologous model protein based on designed patterns of affinity between the mRNA and the anti-Shine-Dalgarno (aSD) sequence on the ribosome. We demonstrate that optimizing translation attenuation at domain boundaries can predictably affect solubility patterns in bacteria. Exploration of the affinity space showed that modifying less than 1% of the nucleotides (on a small 12 amino acid linker) can vary soluble protein yields up to ∼7-fold without altering the primary sequence of the protein. In the context of longer linkers, where a larger number of distinct structural motifs can fold outside the ribosome, optimal synonymous codon variations resulted in an additional 2.1-fold increase in solubility, relative to that of nonoptimized linkers of the same length. While rational construction of 54 linkers of various affinities showed a significant correlation between protein solubility and predicted affinity, only weaker correlations were observed between tRNA abundance and protein solubility. We also demonstrate that naturally occurring high-affinity clusters are present between structural domains of β-galactosidase, one of Escherichia coli's largest native proteins. Interdomain ribosomal affinity is an important factor that has not previously been explored in the context of protein engineering.
Nucleotide sequence of the gag gene and gag-pol junction of feline leukemia virus.
Laprevotte, I; Hampe, A; Sherr, C J; Galibert, F
1984-01-01
The nucleotide sequence of the gag gene of feline leukemia virus and its flanking sequences were determined and compared with the corresponding sequences of two strains of feline sarcoma virus and with that of the Moloney strain of murine leukemia virus. A high degree of nucleotide sequence homology between the feline leukemia virus and murine leukemia virus gag genes was observed, suggesting that retroviruses of domestic cats and laboratory mice have a common, proximal evolutionary progenitor. The predicted structure of the complete feline leukemia virus gag gene precursor suggests that the translation of nonglycosylated and glycosylated gag gene polypeptides is initiated at two different AUG codons. These initiator codons fall in the same reading frame and are separated by a 222-base-pair segment which encodes an amino terminal signal peptide. The nucleotide sequence predicts the order of amino acids in each of the individual gag-coded proteins (p15, p12, p30, p10), all of which derive from the gag gene precursor. Stable stem-and-loop secondary structures are proposed for two regions of viral RNA. The first falls within sequences at the 5' end of the viral genome, together with adjacent palindromic sequences which may play a role in dimer linkage of RNA subunits. The second includes coding sequences at the gag-pol junction and is proposed to be involved in translation of the pol gene product. Sequence analysis of the latter region shows that the gag and pol genes are translated in different reading frames. Classical consensus splice donor and acceptor sequences could not be localized to regions which would permit synthesis of the expected gag-pol precursor protein. Alternatively, we suggest that the pol gene product (RNA-dependent DNA polymerase) could be translated by a frameshift suppressing mechanism which could involve cleavage modification of stems and loops in a manner similar to that observed in tRNA processing. PMID:6328019
Smirnova, Ekaterina; Firth, Andrew E; Miller, W Allen; Scheidecker, Danièle; Brault, Véronique; Reinbold, Catherine; Rakotondrafara, Aurélie M; Chung, Betty Y-W; Ziegler-Graff, Véronique
2015-05-01
Viruses in the family Luteoviridae have positive-sense RNA genomes of around 5.2 to 6.3 kb, and they are limited to the phloem in infected plants. The Luteovirus and Polerovirus genera include all but one virus in the Luteoviridae. They share a common gene block, which encodes the coat protein (ORF3), a movement protein (ORF4), and a carboxy-terminal extension to the coat protein (ORF5). These three proteins all have been reported to participate in the phloem-specific movement of the virus in plants. All three are translated from one subgenomic RNA, sgRNA1. Here, we report the discovery of a novel short ORF, termed ORF3a, encoded near the 5' end of sgRNA1. Initially, this ORF was predicted by statistical analysis of sequence variation in large sets of aligned viral sequences. ORF3a is positioned upstream of ORF3 and its translation initiates at a non-AUG codon. Functional analysis of the ORF3a protein, P3a, was conducted with Turnip yellows virus (TuYV), a polerovirus, for which translation of ORF3a begins at an ACG codon. ORF3a was translated from a transcript corresponding to sgRNA1 in vitro, and immunodetection assays confirmed expression of P3a in infected protoplasts and in agroinoculated plants. Mutations that prevent expression of P3a, or which overexpress P3a, did not affect TuYV replication in protoplasts or inoculated Arabidopsis thaliana leaves, but prevented virus systemic infection (long-distance movement) in plants. Expression of P3a from a separate viral or plasmid vector complemented movement of a TuYV mutant lacking ORF3a. Subcellular localization studies with fluorescent protein fusions revealed that P3a is targeted to the Golgi apparatus and plasmodesmata, supporting an essential role for P3a in viral movement.
Smirnova, Ekaterina; Firth, Andrew E.; Miller, W. Allen; Scheidecker, Danièle; Brault, Véronique; Reinbold, Catherine; Rakotondrafara, Aurélie M.; Chung, Betty Y.-W.; Ziegler-Graff, Véronique
2015-01-01
Viruses in the family Luteoviridae have positive-sense RNA genomes of around 5.2 to 6.3 kb, and they are limited to the phloem in infected plants. The Luteovirus and Polerovirus genera include all but one virus in the Luteoviridae. They share a common gene block, which encodes the coat protein (ORF3), a movement protein (ORF4), and a carboxy-terminal extension to the coat protein (ORF5). These three proteins all have been reported to participate in the phloem-specific movement of the virus in plants. All three are translated from one subgenomic RNA, sgRNA1. Here, we report the discovery of a novel short ORF, termed ORF3a, encoded near the 5’ end of sgRNA1. Initially, this ORF was predicted by statistical analysis of sequence variation in large sets of aligned viral sequences. ORF3a is positioned upstream of ORF3 and its translation initiates at a non-AUG codon. Functional analysis of the ORF3a protein, P3a, was conducted with Turnip yellows virus (TuYV), a polerovirus, for which translation of ORF3a begins at an ACG codon. ORF3a was translated from a transcript corresponding to sgRNA1 in vitro, and immunodetection assays confirmed expression of P3a in infected protoplasts and in agroinoculated plants. Mutations that prevent expression of P3a, or which overexpress P3a, did not affect TuYV replication in protoplasts or inoculated Arabidopsis thaliana leaves, but prevented virus systemic infection (long-distance movement) in plants. Expression of P3a from a separate viral or plasmid vector complemented movement of a TuYV mutant lacking ORF3a. Subcellular localization studies with fluorescent protein fusions revealed that P3a is targeted to the Golgi apparatus and plasmodesmata, supporting an essential role for P3a in viral movement. PMID:25946037
Ribosome rearrangements at the onset of translational bypassing
Agirrezabala, Xabier; Samatova, Ekaterina; Klimova, Mariia; Zamora, Miguel; Gil-Carton, David; Rodnina, Marina V.; Valle, Mikel
2017-01-01
Bypassing is a recoding event that leads to the translation of two distal open reading frames into a single polypeptide chain. We present the structure of a translating ribosome stalled at the bypassing take-off site of gene 60 of bacteriophage T4. The nascent peptide in the exit tunnel anchors the P-site peptidyl-tRNAGly to the ribosome and locks an inactive conformation of the peptidyl transferase center (PTC). The mRNA forms a short dynamic hairpin in the decoding site. The ribosomal subunits adopt a rolling conformation in which the rotation of the small subunit around its long axis causes the opening of the A-site region. Together, PTC conformation and mRNA structure safeguard against premature termination and read-through of the stop codon and reconfigure the ribosome to a state poised for take-off and sliding along the noncoding mRNA gap. PMID:28630923
Prevost, Luanna B; Smith, Michelle K; Knight, Jennifer K
2016-01-01
Previous work has shown that students have persistent difficulties in understanding how central dogma processes can be affected by a stop codon mutation. To explore these difficulties, we modified two multiple-choice questions from the Genetics Concept Assessment into three open-ended questions that asked students to write about how a stop codon mutation potentially impacts replication, transcription, and translation. We then used computer-assisted lexical analysis combined with human scoring to categorize student responses. The lexical analysis models showed high agreement with human scoring, demonstrating that this approach can be successfully used to analyze large numbers of student written responses. The results of this analysis show that students' ideas about one process in the central dogma can affect their thinking about subsequent and previous processes, leading to mixed models of conceptual understanding. © 2016 L. B. Prevost et al. CBE—Life Sciences Education © 2016 The American Society for Cell Biology. This article is distributed by The American Society for Cell Biology under license from the author(s). It is available to the public under an Attribution–Noncommercial–Share Alike 3.0 Unported Creative Commons License (http://creativecommons.org/licenses/by-nc-sa/3.0).
Bender, Aline; Hajieva, Parvana; Moosmann, Bernd
2008-10-28
Humans and most other animals use 2 different genetic codes to translate their hereditary information: the standard code for nuclear-encoded proteins and a modern variant of this code in mitochondria. Despite the pivotal role of the genetic code for cell biology, the functional significance of the deviant mitochondrial code has remained enigmatic since its first description in 1979. Here, we show that profound and functionally beneficial alterations on the encoded protein level were causative for the AUA codon reassignment from isoleucine to methionine observed in most mitochondrial lineages. We demonstrate that this codon reassignment leads to a massive accumulation of the easily oxidized amino acid methionine in the highly oxidative inner mitochondrial membrane. This apparently paradoxical outcome can yet be smoothly settled if the antioxidant surface chemistry of methionine is taken into account, and we present direct experimental evidence that intramembrane accumulation of methionine exhibits antioxidant and cytoprotective properties in living cells. Our results unveil that methionine is an evolutionarily selected antioxidant building block of respiratory chain complexes. Collective protein alterations can thus constitute the selective advantage behind codon reassignments, which authenticates the "ambiguous decoding" hypothesis of genetic code evolution. Oxidative stress has shaped the mitochondrial genetic code.
Large-scale, multi-genome analysis of alternate open reading frames in bacteria and archaea.
Veloso, Felipe; Riadi, Gonzalo; Aliaga, Daniela; Lieph, Ryan; Holmes, David S
2005-01-01
Analysis of over 300,000 annotated genes in 105 bacterial and archaeal genomes reveals an unexpectedly high frequency of large (>300 nucleotides) alternate open reading frames (ORFs). Especially notable is the very high frequency of alternate ORFs in frames +3 and -1 (where the annotated gene is defined as frame +1). The occurrence of alternate ORFs is correlated with genomic G+C content and is strongly influenced by synonymous codon usage bias. The frequency of alternate ORFs in frame -1 is also influenced by the occurrence of codons encoding leucine and serine in frame +1. Although some alternate ORFs have been shown to encode proteins, many others are probably not expressed because they lack appropriate signals for transcription and translation. These latter can be mis-annotated by automatic gene finding programs leading to errors in public databases. Especially prone to mis-annotation is frame -1, because it exhibits a potential codon usage and theoretical capacity to encode proteins with an amino acid composition most similar to real genes. Some alternate ORFs are conserved across bacterial or archaeal species, and can give rise to misannotated "conserved hypothetical" genes, while others are unique to a genome and are misidentified as "hypothetical orphan" genes, contributing significantly to the orphan gene paradox.
Rationalizing context-dependent performance of dynamic RNA regulatory devices.
Kent, Ross; Halliwell, Samantha; Young, Kate; Swainston, Neil; Dixon, Neil
2018-06-21
The ability of RNA to sense, regulate and store information is an attractive attribute for a variety of functional applications including the development of regulatory control devices for synthetic biology. RNA folding and function is known to be highly context sensitive, which limits the modularity and reuse of RNA regulatory devices to control different heterologous sequences and genes. We explored the cause and effect of sequence context sensitivity for translational ON riboswitches located in the 5' UTR, by constructing and screening a library of N-terminal synonymous codon variants. By altering the N-terminal codon usage we were able to obtain RNA devices with a broad range of functional performance properties (ON, OFF, fold-change). Linear regression and calculated metrics were used to rationalize the major determining features leading to optimal riboswitch performance, and to identify multiple interactions between the explanatory metrics. Finally, partial least squared (PLS) analysis was employed in order to understand the metrics and their respective effect on performance. This PLS model was shown to provide good explanation of our library. This study provides a novel multi-variant analysis framework by which to rationalize the codon context performance of allosteric RNA-devices. The framework will also serve as a platform for future riboswitch context engineering endeavors.
Webel, Rike; Hakki, Morgan; Prichard, Mark N.; Rawlinson, William D.; Marschall, Manfred
2014-01-01
ABSTRACT The human cytomegalovirus (HCMV)-encoded kinase pUL97 is required for efficient viral replication. Previous studies described two isoforms of pUL97, the full-length isoform (M1) and a smaller isoform likely resulting from translation initiation at codon 74 (M74). Here, we report the detection of a third pUL97 isoform during viral infection resulting from translation initiation at codon 157 (isoform M157). The consistent expression of isoform M157 as a minor component of pUL97 during infection with clinical and laboratory-adapted HCMV strains was suppressed when codon 157 was mutagenized. Viral mutants expressing specific isoforms were generated to compare their growth and drug susceptibility phenotypes, as well as pUL97 intracellular localization patterns and kinase activities. The exclusive expression of isoform M157 resulted in substantially reduced viral growth and resistance to the pUL97 inhibitor maribavir while retaining susceptibility to ganciclovir. Confocal imaging demonstrated reduced nuclear import of amino-terminal deletion isoforms compared to isoform M1. Isoform M157 showed reduced efficiency of various substrate protein interactions and autophosphorylation, whereas Rb phosphorylation was preserved. These results reveal differential properties of pUL97 isoforms that affect viral replication, with implications for the antiviral efficacy of maribavir. IMPORTANCE The HCMV UL97 kinase performs important functions in viral replication that are targeted by the antiviral drug maribavir. Here, we describe a naturally occurring short isoform of the kinase that when expressed by itself in a recombinant virus results in altered intracellular localization, impaired growth, and high-level resistance to maribavir compared to those of the predominant full-length counterpart. This is another factor to consider in explaining why maribavir appears to have variable antiviral activity in cell culture and in vivo. PMID:24522923
Deducing the Kinetics of Protein Synthesis In Vivo from the Transition Rates Measured In Vitro
Rudorf, Sophia; Thommen, Michael; Rodnina, Marina V.; Lipowsky, Reinhard
2014-01-01
The molecular machinery of life relies on complex multistep processes that involve numerous individual transitions, such as molecular association and dissociation steps, chemical reactions, and mechanical movements. The corresponding transition rates can be typically measured in vitro but not in vivo. Here, we develop a general method to deduce the in-vivo rates from their in-vitro values. The method has two basic components. First, we introduce the kinetic distance, a new concept by which we can quantitatively compare the kinetics of a multistep process in different environments. The kinetic distance depends logarithmically on the transition rates and can be interpreted in terms of the underlying free energy barriers. Second, we minimize the kinetic distance between the in-vitro and the in-vivo process, imposing the constraint that the deduced rates reproduce a known global property such as the overall in-vivo speed. In order to demonstrate the predictive power of our method, we apply it to protein synthesis by ribosomes, a key process of gene expression. We describe the latter process by a codon-specific Markov model with three reaction pathways, corresponding to the initial binding of cognate, near-cognate, and non-cognate tRNA, for which we determine all individual transition rates in vitro. We then predict the in-vivo rates by the constrained minimization procedure and validate these rates by three independent sets of in-vivo data, obtained for codon-dependent translation speeds, codon-specific translation dynamics, and missense error frequencies. In all cases, we find good agreement between theory and experiment without adjusting any fit parameter. The deduced in-vivo rates lead to smaller error frequencies than the known in-vitro rates, primarily by an improved initial selection of tRNA. The method introduced here is relatively simple from a computational point of view and can be applied to any biomolecular process, for which we have detailed information about the in-vitro kinetics. PMID:25358034
Hulsebos, Theo J M; Kenter, Susan; Verhagen, Wim I M; Baas, Frank; Flucke, Uta; Wesseling, Pieter
2014-09-01
In schwannomatosis, germline SMARCB1 mutations predispose to the development of multiple schwannomas, but not vestibular schwannomas. Many of these are missense or splice-site mutations or in-frame deletions, which are presumed to result in the synthesis of altered SMARCB1 proteins. However, also nonsense and frameshift mutations, which are characteristic for rhabdoid tumors and are predicted to result in the absence of SMARCB1 protein via nonsense-mediated mRNA decay, have been reported in schwannomatosis patients. We investigated the consequences of four of the latter mutations, i.e. c.30delC, c.34C>T, c.38delA, and c.46A>T, all in SMARCB1-exon 1. We could demonstrate for the c.30delC and c.34C>T mutations that the respective mRNAs were still present in the schwannomas of the patients. We hypothesized that these were prevented from degradation by translation reinitiation at the AUG codon encoding methionine at position 27 of the SMARCB1 protein. To test this, we expressed the mutations in MON cells, rhabdoid cells without endogenous SMARCB1 protein, and found that all four resulted in synthesis of the N-terminally truncated protein. Mutation of the reinitiation methionine codon into a valine codon prevented synthesis of the truncated protein, thereby confirming its identity. Immunohistochemistry with a SMARCB1 antibody revealed a mosaic staining pattern in schwannomas of the patients with the c.30delC and c.34C>T mutations. Our findings support the concept that, in contrast to the complete absence of SMARCB1 expression in rhabdoid tumors, altered SMARCB1 proteins with modified activity and reduced (mosaic) expression are formed in the schwannomas of schwannomatosis patients with a germline SMARCB1 mutation.
Hu, Xiao-Pan; Yang, Yi; Ma, Bin-Guang
2015-06-09
Protein translation is a central step in gene expression and affected by many factors such as codon usage bias, mRNA folding energy and tRNA abundance. Despite intensive previous studies, how metabolic amino acid supply correlates with protein translation efficiency remains unknown. In this work, we estimated the amino acid flux from metabolic network for each protein in Escherichia coli and Saccharomyces cerevisiae by using Flux Balance Analysis. Integrated with the mRNA expression level, protein abundance and ribosome profiling data, we provided a detailed description of the role of amino acid supply in protein translation. Our results showed that amino acid supply positively correlates with translation efficiency and ribosome density. Moreover, with the rank-based regression model, we found that metabolic amino acid supply facilitates ribosome utilization. Based on the fact that the ribosome density change of well-amino-acid-supplied genes is smaller than poorly-amino-acid-supply genes under amino acid starvation, we reached the conclusion that amino acid supply may buffer ribosome density change against amino acid starvation and benefit maintaining a relatively stable translation environment. Our work provided new insights into the connection between metabolic amino acid supply and protein translation process by revealing a new regulation strategy that is dependent on resource availability.
Sankar, Sathish; Upadhyay, Mohita; Ramamurthy, Mageshbabu; Vadivel, Kumaran; Sagadevan, Kalaiselvan; Nandagopal, Balaji; Vivekanandan, Perumal; Sridharan, Gopalan
2015-01-01
Hantaviruses are important emerging zoonotic pathogens. The current understanding of hantavirus evolution is complicated by the lack of consensus on co-divergence of hantaviruses with their animal hosts. In addition, hantaviruses have long-term associations with their reservoir hosts. Analyzing the relative abundance of dinucleotides may shed new light on hantavirus evolution. We studied the relative abundance of dinucleotides and the evolutionary pressures shaping different hantavirus segments. A total of 118 sequences were analyzed; this includes 51 sequences of the S segment, 43 sequences of the M segment and 23 sequences of the L segment. The relative abundance of dinucleotides, effective codon number (ENC), codon usage biases were analyzed. Standard methods were used to investigate the relative roles of mutational pressure and translational selection on the three hantavirus segments. All three segments of hantaviruses are CpG depleted. Mutational pressure is the predominant evolutionary force leading to CpG depletion among hantaviruses. Interestingly, the S segment of hantaviruses is GpU depleted and in contrast to CpG depletion, the depletion of GpU dinucleotides from the S segment is driven by translational selection. Our findings also suggest that mutational pressure is the primary evolutionary pressure acting on the S and the M segments of hantaviruses. While translational selection plays a key role in shaping the evolution of the L segment. Our findings highlight how different evolutionary pressures may contribute disproportionally to the evolution of the three hantavirus segments. These findings provide new insights on the current understanding of hantavirus evolution. There is a dichotomy among evolutionary pressures shaping a) the relative abundance of different dinucleotides in hantavirus genomes b) the evolution of the three hantavirus segments.
Expression of Recombinant Phosphoproteins for Signal Transduction Studies.
Barber, Karl W; Rinehart, Jesse
2017-01-01
Complex signaling cascades are difficult to study in vitro without phosphorylated proteins. Here, we describe a technique for the routine production of recombinant phosphoproteins by directly incorporating phosphoserine as a nonstandard amino acid. This protocol utilizes an optimized phosphoserine orthogonal translation system and an engineered strain of E. coli containing no genomic amber codons. This approach has been used to generate a variety of phosphorylated proteins to understand the role of protein phosphorylation in cell signaling.
Bazzini, Ariel A; Johnstone, Timothy G; Christiano, Romain; Mackowiak, Sebastian D; Obermayer, Benedikt; Fleming, Elizabeth S; Vejnar, Charles E; Lee, Miler T; Rajewsky, Nikolaus; Walther, Tobias C; Giraldez, Antonio J
2014-01-01
Identification of the coding elements in the genome is a fundamental step to understanding the building blocks of living systems. Short peptides (< 100 aa) have emerged as important regulators of development and physiology, but their identification has been limited by their size. We have leveraged the periodicity of ribosome movement on the mRNA to define actively translated ORFs by ribosome footprinting. This approach identifies several hundred translated small ORFs in zebrafish and human. Computational prediction of small ORFs from codon conservation patterns corroborates and extends these findings and identifies conserved sequences in zebrafish and human, suggesting functional peptide products (micropeptides). These results identify micropeptide-encoding genes in vertebrates, providing an entry point to define their function in vivo. PMID:24705786
Mearls, Elizabeth B.; Jackter, Jacquelin; Colquhoun, Jennifer M.; Matthews, Allison J.; Fenton, Colleen
2018-01-01
A cascade of alternative sigma factors directs developmental gene expression during spore formation by the bacterium Bacillus subtilis. As the spore develops, a tightly regulated switch occurs in which the early-acting sigma factor σF is replaced by the late-acting sigma factor σG. The gene encoding σG (sigG) is transcribed by σF and by σG itself in an autoregulatory loop; yet σG activity is not detected until σF-dependent gene expression is complete. This separation in σF and σG activities has been suggested to be due at least in part to a poorly understood intercellular checkpoint pathway that delays sigG expression by σF. Here we report the results of a careful examination of sigG expression during sporulation. Unexpectedly, our findings argue against the existence of a regulatory mechanism to delay sigG transcription by σF and instead support a model in which sigG is transcribed by σF with normal timing, but at levels that are very low. This low-level expression of sigG is the consequence of several intrinsic features of the sigG regulatory and coding sequence—promoter spacing, secondary structure potential of the mRNA, and start codon identity—that dampen its transcription and translation. Especially notable is the presence of a conserved hairpin in the 5’ leader sequence of the sigG mRNA that occludes the ribosome-binding site, reducing translation by up to 4-fold. Finally, we demonstrate that misexpression of sigG from regulatory and coding sequences lacking these features triggers premature σG activity in the forespore during sporulation, as well as inappropriate σG activity during vegetative growth. Altogether, these data indicate that transcription and translation of the sigG gene is tuned to prevent vegetative expression of σG and to ensure the precise timing of the switch from σF to σG in the developing spore. PMID:29702640
Huang, Xiaoshuai; Ye, Haihui; Chung, J Sook
2017-08-01
Insulin-like androgenic gland factor (IAG) that is produced by the male androgenic gland (AG), plays a role in sexual differentiation and maintenance of male secondary sex characteristics in decapod crustaceans. With an earlier finding of IAG expression in a female Callinectes sapidus ovary, we aimed to examine a putative role of IAG during the ovarian development of this species. To this end, the full-length cDNA sequence of the ovarian CasIAG (termed CasIAG-ova) has been isolated. The predicted mature peptide sequence of CasIAG-ova is identical to that of the IAG from the AG, except in their signal peptide regions. The CasIAG-ova contains an alternative initiation codon (UUG) as the start codon, which suggests that the translational regulation of CasIAG-ova may differ from that of the IAG from AG. To define the function of CasIAG-ova, the expressions of CasIAG-ova as well as its putative binding protein, insulin-like peptide binding protein (ILPBP), are measured in the ovaries at various developmental stages obtained from different seasons. Season affects both CasIAG and ILPBP expression in the ovary. Overall, summer females at earlier ovarian stages contain high levels of CasIAG and ILPBP than spring or fall females. These findings indicate that CasIAG-ova and CasILPBP may be involved in the ovarian development. When comparing the levels of CasIAG and CasILPBP in the ovary, the latter are much higher (∼10-10000 fold) than the former. Expression patterns of CasILPBP differ from those of CasIAG-ova during ovarian development and by season, suggesting that ILPBP may have an additional role in ovarian development rather than a function of a putative binding protein of IAG. Copyright © 2017 Elsevier Inc. All rights reserved.
Sonobe, Yoshifumi; Ghadge, Ghanashyam; Masaki, Katsuhisa; Sendoel, Ataman; Fuchs, Elaine; Roos, Raymond P
2018-08-01
Expansion of a hexanucleotide repeat (HRE), GGGGCC, in the C9ORF72 gene is recognized as the most common cause of familial amyotrophic lateral sclerosis (FALS), frontotemporal dementia (FTD) and ALS-FTD, as well as 5-10% of sporadic ALS. Despite the location of the HRE in the non-coding region (with respect to the main C9ORF72 gene product), dipeptide repeat proteins (DPRs) that are thought to be toxic are translated from the HRE in all three reading frames from both the sense and antisense transcript. Here, we identified a CUG that has a good Kozak consensus sequence as the translation initiation codon. Mutation of this CTG significantly suppressed polyglycine-alanine (GA) translation. GA was translated when the G 4 C 2 construct was placed as the second cistron in a bicistronic construct. CRISPR/Cas9-induced knockout of a non-canonical translation initiation factor, eIF2A, impaired GA translation. Transfection of G 4 C 2 constructs induced an integrated stress response (ISR), while triggering the ISR led to a continuation of translation of GA with a decline in conventional cap-dependent translation. These in vitro observations were confirmed in chick embryo neural cells. The findings suggest that DPRs translated from an HRE in C9ORF72 aggregate and lead to an ISR that then leads to continuing DPR production and aggregation, thereby creating a continuing pathogenic cycle. Copyright © 2018 Elsevier Inc. All rights reserved.
Au, Hilda H.; Cornilescu, Gabriel; Mouzakis, Kathryn D.; Ren, Qian; Burke, Jordan E.; Lee, Seonghoon; Butcher, Samuel E.; Jan, Eric
2015-01-01
The dicistrovirus intergenic region internal ribosome entry site (IRES) adopts a triple-pseudoknotted RNA structure and occupies the core ribosomal E, P, and A sites to directly recruit the ribosome and initiate translation at a non-AUG codon. A subset of dicistrovirus IRESs directs translation in the 0 and +1 frames to produce the viral structural proteins and a +1 overlapping open reading frame called ORFx, respectively. Here we show that specific mutations of two unpaired adenosines located at the core of the three-helical junction of the honey bee dicistrovirus Israeli acute paralysis virus (IAPV) IRES PKI domain can uncouple 0 and +1 frame translation, suggesting that the structure adopts distinct conformations that contribute to 0 or +1 frame translation. Using a reconstituted translation system, we show that ribosomes assembled on mutant IRESs that direct exclusive 0 or +1 frame translation lack reading frame fidelity. Finally, a nuclear magnetic resonance/small-angle X-ray scattering hybrid approach reveals that the PKI domain of the IAPV IRES adopts an RNA structure that resembles a complete tRNA. The tRNA shape-mimicry enables the viral IRES to gain access to the ribosome tRNA-binding sites and form intermolecular contacts with the ribosome that are necessary for initiating IRES translation in a specific reading frame. PMID:26554019
Genomic structure and chromosomal mapping of the human CD22 gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wilson, G.L.; Kozlow, E.; Kehrl, J.H.
1993-06-01
The human CD22 gene is expressed specifically in B lymphocytes and likely has an important function in cell-cell interactions. A nearly full length human CD22 cDNA clone was used to isolate genomic clones that span the CD22 gene. The CD22 gene is spread over 22 kb of DNA and is composed of 15 exons. The first exon contains the major transcriptional start sites. The translation initiation codon is located in exon 3, which also encodes a portion of the signal peptide. Exons 4 to 10 encode the seven Ig domains of CD22, exon 11 encodes the transmembrane domain, exons 12more » to 15 encode the intracytoplasmic domain of CD22, and exon 15 also contains the 3' untranslated region. A minor form of CD22 mRNA likely results from splicing of exon 5 to exon 8, skipping exons 6 and 7. A 4.6-kb Xbal fragment of the CD22 gene was used to map the chromosomal location of CD22 by fluorescence in situ hybridization. The hybridization locus was identified by combining fluorescent images of the probe with the chromosomal banding pattern generated by an Alu probe. The results demonstrate the CD22 is located within the band region q13.1 of chromosome 19. Two closely clustered major transcription start sites and several minor start sites were mapped by primer extension. Similarly to many other lymphoid-specific genes, the CD22 promoter lacks an obvious TATA box. Approximately 4 kb of DNA 5' of the transcription start sites were sequenced and found to contain multiple Alu elements. Potential binding sites for the transcriptional factors NF-kB, AP-1, and Oct-2 are located within 300 bp 5' of the major transcription start sites. A 400-bp fragment (bp -339 through +71) of the CD22 promoter region was subcloned into a pGEM-chloramphenicol acetyltransferase vector and after transfection into B and T cells was found to be active in both B and T cells. 45 refs., 7 figs., 2 tabs.« less
Feng, Shangguo; He, Refeng; Yang, Sai; Chen, Zhe; Jiang, Mengying; Lu, Jiangjie; Wang, Huizhong
2015-08-10
Two molecular marker systems, start codon targeted (SCoT) and target region amplification polymorphism (TRAP), were used for genetic relationship analysis of 36 Dendrobium species collected from China. Twenty-two selected SCoT primers produced 337 loci, of which 324 (96%) were polymorphic, whereas 13 TRAP primer combinations produced a total of 510 loci, with 500 (97.8%) of them being polymorphic. An average polymorphism information content of 0.953 and 0.983 was detected using the SCoT and TRAP primers, respectively, showing that a high degree of genetic diversity exists among Chinese Dendrobium species. The partition of clusters in the unweighted pair group method with arithmetic mean dendrogram and principal coordinate analysis plot based on the SCoT and TRAP markers was similar and clustered the 36 Dendrobium species into four main groups. Our results will provide useful information for resource protection and will also be useful to improve the current Dendrobium breeding programs. Our results also demonstrate that SCoT and TRAP markers are informative and can be used to evaluate genetic relationships between Dendrobium species. Copyright © 2015 Elsevier B.V. All rights reserved.
Vendeix, Franck A P; Murphy, Frank V; Cantara, William A; Leszczyńska, Grażyna; Gustilo, Estella M; Sproat, Brian; Malkiewicz, Andrzej; Agris, Paul F
2012-03-02
Human tRNA(Lys3)(UUU) (htRNA(Lys3)(UUU)) decodes the lysine codons AAA and AAG during translation and also plays a crucial role as the primer for HIV-1 (human immunodeficiency virus type 1) reverse transcription. The posttranscriptional modifications 5-methoxycarbonylmethyl-2-thiouridine (mcm(5)s(2)U(34)), 2-methylthio-N(6)-threonylcarbamoyladenosine (ms(2)t(6)A(37)), and pseudouridine (Ψ(39)) in the tRNA's anticodon domain are critical for ribosomal binding and HIV-1 reverse transcription. To understand the importance of modified nucleoside contributions, we determined the structure and function of this tRNA's anticodon stem and loop (ASL) domain with these modifications at positions 34, 37, and 39, respectively (hASL(Lys3)(UUU)-mcm(5)s(2)U(34);ms(2)t(6)A(37);Ψ(39)). Ribosome binding assays in vitro revealed that the hASL(Lys3)(UUU)-mcm(5)s(2)U(34);ms(2)t(6)A(37);Ψ(39) bound AAA and AAG codons, whereas binding of the unmodified ASL(Lys3)(UUU) was barely detectable. The UV hyperchromicity, the circular dichroism, and the structural analyses indicated that Ψ(39) enhanced the thermodynamic stability of the ASL through base stacking while ms(2)t(6)A(37) restrained the anticodon to adopt an open loop conformation that is required for ribosomal binding. The NMR-restrained molecular-dynamics-derived solution structure revealed that the modifications provided an open, ordered loop for codon binding. The crystal structures of the hASL(Lys3)(UUU)-mcm(5)s(2)U(34);ms(2)t(6)A(37);Ψ(39) bound to the 30S ribosomal subunit with each codon in the A site showed that the modified nucleotides mcm(5)s(2)U(34) and ms(2)t(6)A(37) participate in the stability of the anticodon-codon interaction. Importantly, the mcm(5)s(2)U(34)·G(3) wobble base pair is in the Watson-Crick geometry, requiring unusual hydrogen bonding to G in which mcm(5)s(2)U(34) must shift from the keto to the enol form. The results unambiguously demonstrate that modifications pre-structure the anticodon as a key prerequisite for efficient and accurate recognition of cognate and wobble codons. Copyright © 2011 Elsevier Ltd. All rights reserved.
Inferring Selection on Amino Acid Preference in Protein Domains
Durbin, Richard
2009-01-01
Models that explicitly account for the effect of selection on new mutations have been proposed to account for “codon bias” or the excess of “preferred” codons that results from selection for translational efficiency and/or accuracy. In principle, such models can be applied to any mutation that results in a preferred allele, but in most cases, the fitness effect of a specific mutation cannot be predicted. Here we show that it is possible to assign preferred and unpreferred states to amino acid changing mutations that occur in protein domains. We propose that mutations that lead to more common amino acids (at a given position in a domain) can be considered “preferred alleles” just as are synonymous mutations leading to codons for more abundant tRNAs. We use genome-scale polymorphism data to show that alleles for preferred amino acids in protein domains occur at higher frequencies in the population, as has been shown for preferred codons. We show that this effect is quantitative, such that there is a correlation between the shift in frequency of preferred alleles and the predicted fitness effect. As expected, we also observe a reduction in the numbers of polymorphisms and substitutions at more important positions in domains, consistent with stronger selection at those positions. We examine the derived allele frequency distribution and polymorphism to divergence ratios of preferred and unpreferred differences and find evidence for both negative and positive selections acting to maintain protein domains in the human population. Finally, we analyze a model for selection on amino acid preferences in protein domains and find that it is consistent with the quantitative effects that we observe. PMID:19095755
Yokoyama, S; Watanabe, T; Murao, K; Ishikura, H; Yamaizumi, Z; Nishimura, S; Miyazawa, T
1985-01-01
Proton NMR analyses have been made to elucidate the conformational characteristics of modified nucleotides as found in the first position of the anticodon of tRNA [derivatives of 5-methyl-2-thiouridine 5'-monophosphate (pxm5s2U) and derivatives of 5-hydroxyuridine 5'-monophosphate (pxo5U)]. In pxm5s2U, the C3'-endo form is extraordinarily more stable than the C2'-endo form for the ribose ring, because of the combined effects of the 2-thiocarbonyl group and the 5-substituent. By contrast, in pxo5U, the C2'-endo form is much more stable than the C3'-endo form, because of the interaction between the 5-substituent and the 5'-phosphate group. The enthalpy differences between the C2'-endo form and the C3'-endo form have been obtained as 1.1, -0.7, and 0.1 kcal/mol (1 cal = 4.184 J) for pxm5s2U, pxo5U, and unmodified uridine 5'-monophosphate, respectively. These findings lead to the conclusion that xm5s2U in the first position of the anticodon exclusively takes the C3'-endo form to recognize adenosine (but not uridine) as the third letter of the codon, whereas xo5U takes the C2'-endo form as well as the C3'-endo form to recognize adenosine, guanosine, and uridine as the third letter of the codon on ribosome. Accordingly, the biological significance of such modifications of uridine to xm5s2U/xo5U is in the regulation of the conformational rigidity/flexibility in the first position of the anticodon so as to guarantee the correct and efficient translation of codons in protein biosynthesis. PMID:3860833
Codon bias and gene ontology in holometabolous and hemimetabolous insects.
Carlini, David B; Makowski, Matthew
2015-12-01
The relationship between preferred codon use (PCU), developmental mode, and gene ontology (GO) was investigated in a sample of nine insect species with sequenced genomes. These species were selected to represent two distinct modes of insect development, holometabolism and hemimetabolism, with an aim toward determining whether the differences in developmental timing concomitant with developmental mode would be mirrored by differences in PCU in their developmental genes. We hypothesized that the developmental genes of holometabolous insects should be under greater selective pressure for efficient translation, manifest as increased PCU, than those of hemimetabolous insects because holometabolism requires abundant protein expression over shorter time intervals than hemimetabolism, where proteins are required more uniformly in time. Preferred codon sets were defined for each species, from which the frequency of PCU for each gene was obtained. Although there were substantial differences in the genomic base composition of holometabolous and hemimetabolous insects, both groups exhibited a general preference for GC-ending codons, with the former group having higher PCU averaged across all genes. For each species, the biological process GO term for each gene was assigned that of its Drosophila homolog(s), and PCU was calculated for each GO term category. The top two GO term categories for PCU enrichment in the holometabolous insects were anatomical structure development and cell differentiation. The increased PCU in the developmental genes of holometabolous insects may reflect a general strategy to maximize the protein production of genes expressed in bursts over short time periods, e.g., heat shock proteins. J. Exp. Zool. (Mol. Dev. Evol.) 324B: 686-698, 2015. © 2015 Wiley Periodicals, Inc. © 2015 Wiley Periodicals, Inc.
Karásková, Martina; Gunišová, Stanislava; Herrmannová, Anna; Wagner, Susan; Munzarová, Vanda; Valášek, Leoš Shivaya
2012-01-01
In eukaryotes, for a protein to be synthesized, the 40 S subunit has to first scan the 5′-UTR of the mRNA until it has encountered the AUG start codon. Several initiation factors that ensure high fidelity of AUG recognition were identified previously, including eIF1A, eIF1, eIF2, and eIF5. In addition, eIF3 was proposed to coordinate their functions in this process as well as to promote their initial binding to 40 S subunits. Here we subjected several previously identified segments of the N-terminal domain (NTD) of the eIF3c/Nip1 subunit, which mediates eIF3 binding to eIF1 and eIF5, to semirandom mutagenesis to investigate the molecular mechanism of eIF3 involvement in these reactions. Three major classes of mutant substitutions or internal deletions were isolated that affect either the assembly of preinitiation complexes (PICs), scanning for AUG, or both. We show that eIF5 binds to the extreme c/Nip1-NTD (residues 1–45) and that impairing this interaction predominantly affects the PIC formation. eIF1 interacts with the region (60–137) that immediately follows, and altering this contact deregulates AUG recognition. Together, our data indicate that binding of eIF1 to the c/Nip1-NTD is equally important for its initial recruitment to PICs and for its proper functioning in selecting the translational start site. PMID:22718758
López-Wilchis, Ricardo; Del Río-Portilla, Miguel Ángel; Guevara-Chumacero, Luis Manuel
2017-02-01
We described the complete mitochondrial genome (mitogenome) of the Wagner's mustached bat, Pteronotus personatus, a species belonging to the family Mormoopidae, and compared it with other published mitogenomes of bats (Chiroptera). The mitogenome of P. personatus was 16,570 bp long and contained a typically conserved structure including 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes, and one control region (D-loop). Most of the genes were encoded on the H-strand, except for eight tRNA and the ND6 genes. The order of protein-coding and rRNA genes was highly conserved in all mitogenomes. All protein-coding genes started with an ATG codon, except for ND2, ND3, and ND5, which initiated with ATA, and terminated with the typical stop codon TAA/TAG or the codon AGA. Phylogenetic trees constructed using Maximum Parsimony, Maximum Likelihood, and Bayesian inference methods showed an identical topology and indicated the monophyly of different families of bats (Mormoopidae, Phyllostomidae, Vespertilionidae, Rhinolophidae, and Pteropopidae) and the existence of two major clades corresponding to the suborders Yangochiroptera and Yinpterochiroptera. The mitogenome sequence provided here will be useful for further phylogenetic analyses and population genetic studies in mormoopid bats.
Song, Sheng-Nan; Chen, Peng-Yan; Wei, Shu-Jun; Chen, Xue-Xin
2016-07-01
The mitochondrial genome sequence of Polistes jokahamae (Radoszkowski, 1887) (Hymenoptera: Vespidae) (GenBank accession no. KR052468) was sequenced. The current length with partial A + T-rich region of this mitochondrial genome is 16,616 bp. All the typical mitochondrial genes were sequenced except for three tRNAs (trnI, trnQ, and trnY) located between the A + T-rich region and nad2. At least three rearrangement events occurred in the sequenced region compared with the pupative ancestral arrangement of insects, corresponding to the shuffling of trnK and trnD, translocation or remote inversion of tnnY and translocation of trnL1. All protein-coding genes start with ATN codons. Eleven, one, and another one protein-coding genes stop with termination codon TAA, TA, and T, respectively. Phylogenetic analysis using the Bayesian method based on all codon positions of the 13 protein-coding genes supports the monophyly of Vespidae and Formicidae. Within the Formicidae, the Myrmicinae and Formicinae form a sister lineage and then sister to the Dolichoderinae, while within the Vespidae, the Eumeninae is sister to the lineage of Vespinae + Polistinae.
A web application for automatic prediction of gene translation elongation efficiency.
Sokolov, Vladimir S; Zuraev, Bulat S; Lashin, Sergei A; Matushkin, Yury G
2015-03-01
Expression efficiency is one of the major characteristics describing genes in various modern investigations. Expression efficiency of genes is regulated at various stages: transcription, translation, posttranslational protein modification and others. In this study, a special EloE (Elongation Efficiency) web application is described. The EloE sorts the organism's genes in a descend order on their theoretical rate of the elongation stage of translation based on the analysis of their nucleotide sequences. Obtained theoretical data have a significant correlation with available experimental data of gene expression in various organisms. In addition, the program identifies preferential codons in organism's genes and defines distribution of potential secondary structures energy in 5´ and 3´ regions of mRNA. The EloE can be useful in preliminary estimation of translation elongation efficiency for genes for which experimental data are not available yet. Some results can be used, for instance, in other programs modeling artificial genetic structures in genetically engineered experiments. The EloE web application is available at http://www-bionet.sscc.ru:7780/EloE.
The control of translational accuracy is a determinant of healthy ageing in yeast
Leadsham, Jane E.; Sauvadet, Aimie; Tarrant, Daniel; Adam, Ilectra S.; Saromi, Kofo; Laun, Peter; Rinnerthaler, Mark; Breitenbach-Koller, Hannelore; Breitenbach, Michael; Tuite, Mick F.; Gourlay, Campbell W.
2017-01-01
Life requires the maintenance of molecular function in the face of stochastic processes that tend to adversely affect macromolecular integrity. This is particularly relevant during ageing, as many cellular functions decline with age, including growth, mitochondrial function and energy metabolism. Protein synthesis must deliver functional proteins at all times, implying that the effects of protein synthesis errors like amino acid misincorporation and stop-codon read-through must be minimized during ageing. Here we show that loss of translational accuracy accelerates the loss of viability in stationary phase yeast. Since reduced translational accuracy also reduces the folding competence of at least some proteins, we hypothesize that negative interactions between translational errors and age-related protein damage together overwhelm the cellular chaperone network. We further show that multiple cellular signalling networks control basal error rates in yeast cells, including a ROS signal controlled by mitochondrial activity, and the Ras pathway. Together, our findings indicate that signalling pathways regulating growth, protein homeostasis and energy metabolism may jointly safeguard accurate protein synthesis during healthy ageing. PMID:28100667
The control of translational accuracy is a determinant of healthy ageing in yeast.
von der Haar, Tobias; Leadsham, Jane E; Sauvadet, Aimie; Tarrant, Daniel; Adam, Ilectra S; Saromi, Kofo; Laun, Peter; Rinnerthaler, Mark; Breitenbach-Koller, Hannelore; Breitenbach, Michael; Tuite, Mick F; Gourlay, Campbell W
2017-01-01
Life requires the maintenance of molecular function in the face of stochastic processes that tend to adversely affect macromolecular integrity. This is particularly relevant during ageing, as many cellular functions decline with age, including growth, mitochondrial function and energy metabolism. Protein synthesis must deliver functional proteins at all times, implying that the effects of protein synthesis errors like amino acid misincorporation and stop-codon read-through must be minimized during ageing. Here we show that loss of translational accuracy accelerates the loss of viability in stationary phase yeast. Since reduced translational accuracy also reduces the folding competence of at least some proteins, we hypothesize that negative interactions between translational errors and age-related protein damage together overwhelm the cellular chaperone network. We further show that multiple cellular signalling networks control basal error rates in yeast cells, including a ROS signal controlled by mitochondrial activity, and the Ras pathway. Together, our findings indicate that signalling pathways regulating growth, protein homeostasis and energy metabolism may jointly safeguard accurate protein synthesis during healthy ageing. © 2017 The Authors.
Zucchelli, Silvia; Patrucco, Laura; Persichetti, Francesca; Gustincich, Stefano; Cotella, Diego
2016-01-01
Mammalian cells are an indispensable tool for the production of recombinant proteins in contexts where function depends on post-translational modifications. Among them, Chinese Hamster Ovary (CHO) cells are the primary factories for the production of therapeutic proteins, including monoclonal antibodies (MAbs). To improve expression and stability, several methodologies have been adopted, including methods based on media formulation, selective pressure and cell- or vector engineering. This review presents current approaches aimed at improving mammalian cell factories that are based on the enhancement of translation. Among well-established techniques (codon optimization and improvement of mRNA secondary structure), we describe SINEUPs, a family of antisense long non-coding RNAs that are able to increase translation of partially overlapping protein-coding mRNAs. By exploiting their modular structure, SINEUP molecules can be designed to target virtually any mRNA of interest, and thus to increase the production of secreted proteins. Thus, synthetic SINEUPs represent a new versatile tool to improve the production of secreted proteins in biomanufacturing processes.
Xie, Ping
2015-10-09
Proteins in the cell are synthesized by a ribosome translating the genetic information encoded on the single-stranded messenger RNA (mRNA). It has been shown that the ribosome can also translate through the duplex region of the mRNA by unwinding the duplex. Here, based on our proposed model of the ribosome translation through the mRNA duplex we study theoretically the distribution of dwell times of the ribosome translation through the mRNA duplex under the effect of a pulling force externally applied to the ends of the mRNA to unzip the duplex. We provide quantitative explanations of the available single molecule experimental data on the distribution of dwell times with both short and long durations, on rescuing of the long paused ribosomes by raising the pulling force to unzip the duplex, on translational arrests induced by the mRNA duplex and Shine-Dalgarno(SD)-like sequence in the mRNA. The functional consequences of the pauses or arrests caused by the mRNA duplex and the SD sequence are discussed and compared with those obtained from other types of pausing, such as those induced by "hungry" codons or interactions of specific sequences in the nascent chain with the ribosomal exit tunnel.
Xie, Ping
2015-01-01
Proteins in the cell are synthesized by a ribosome translating the genetic information encoded on the single-stranded messenger RNA (mRNA). It has been shown that the ribosome can also translate through the duplex region of the mRNA by unwinding the duplex. Here, based on our proposed model of the ribosome translation through the mRNA duplex we study theoretically the distribution of dwell times of the ribosome translation through the mRNA duplex under the effect of a pulling force externally applied to the ends of the mRNA to unzip the duplex. We provide quantitative explanations of the available single molecule experimental data on the distribution of dwell times with both short and long durations, on rescuing of the long paused ribosomes by raising the pulling force to unzip the duplex, on translational arrests induced by the mRNA duplex and Shine-Dalgarno(SD)-like sequence in the mRNA. The functional consequences of the pauses or arrests caused by the mRNA duplex and the SD sequence are discussed and compared with those obtained from other types of pausing, such as those induced by “hungry” codons or interactions of specific sequences in the nascent chain with the ribosomal exit tunnel. PMID:26473825
Translational repression of mouse mu opioid receptor expression via leaky scanning
Song, Kyu Young; Hwang, Cheol Kyu; Kim, Chun Sung; Choi, Hack Sun; Law, Ping-Yee; Wei, Li-Na; Loh, Horace H.
2007-01-01
Mu opioid receptor (MOR) expression is under temporal and spatial controls, but expression levels of the MOR gene are relatively low in vivo. In addition to transcriptional regulations, upstream AUGs (uAUGs) and open reading frames (uORFs) profoundly affect the translation of the primary ORF and thus the protein levels in several genes. The 5′-untranslated region (UTR) of mouse MOR mRNA contains three uORFs preceding the MOR main initiation codon. In MOR-fused EGFP or MOR promoter/luciferase reporter constructs, mutating each uAUG individually or in combinations increased MOR transient heterologous expression in neuroblastoma NMB and HEK293 cells significantly. Translation of such constructs increased up to 3-fold without altering the mRNA levels if either the third uAUG or both the second and third AUGs were mutated. Additionally, these uAUG-mediated translational inhibitions were independent of their peptide as confirmed by internal mutation analyses in each uORF. Translational studies indicated that protein syntheses were initiated at these uAUG initiation sites, with the third uAUG initiating the highest translation level. These results support the hypothesis that uORFs in mouse MOR mRNA act as negative regulators through a ribosome leaky scanning mechanism. Such leaky scanning resulted in the suppression of mouse MOR under normal conditions. PMID:17284463
Increasing protein production rates can decrease the rate at which functional protein is produced
NASA Astrophysics Data System (ADS)
Sharma, Ajeet; O'Brien, Edward
The rate at which soluble, functional protein is produced by the ribosome has recently been found to vary in complex and unexplained ways as various translation-associated rates are altered through synonymous codon substitutions. We combine a well-established ribosome-traffic model with a master-equation model of co-translational domain folding to explore the scenarios that are possible for the protein production rate, J, and the functional-nascent protein production rate, F, as the rates associated with translation are altered. We find that while J monotonically increases as the rates of translation-initiation, -elongation and -termination increase, F can either increase or decrease. F exhibits non-monotonic behavior because increasing these rates can cause a protein to be synthesized more rapidly but provide less time for nascent-protein domains to co-translationally fold thereby producing less functional nascent protein immediately after synthesis. We further demonstrate that these non-monotonic changes in Faffect the post-translational, steady-state levels of functional protein in a similar manner. Our results provide a possible explanation for recent experimental observations that the specific activity of enzymatic proteins can decrease with increased synthesis rates and can in principle be used to rationally-design transcripts to maximize the production of functional nascent protein.
Tissue- and Time-Specific Expression of Otherwise Identical tRNA Genes
Adir, Idan; Dahan, Orna; Broday, Limor; Pilpel, Yitzhak; Rechavi, Oded
2016-01-01
Codon usage bias affects protein translation because tRNAs that recognize synonymous codons differ in their abundance. Although the current dogma states that tRNA expression is exclusively regulated by intrinsic control elements (A- and B-box sequences), we revealed, using a reporter that monitors the levels of individual tRNA genes in Caenorhabditis elegans, that eight tryptophan tRNA genes, 100% identical in sequence, are expressed in different tissues and change their expression dynamically. Furthermore, the expression levels of the sup-7 tRNA gene at day 6 were found to predict the animal’s lifespan. We discovered that the expression of tRNAs that reside within introns of protein-coding genes is affected by the host gene’s promoter. Pairing between specific Pol II genes and the tRNAs that are contained in their introns is most likely adaptive, since a genome-wide analysis revealed that the presence of specific intronic tRNAs within specific orthologous genes is conserved across Caenorhabditis species. PMID:27560950
DOE Office of Scientific and Technical Information (OSTI.GOV)
Vendeix, Franck A.P.; Murphy, IV, Frank V.; Cantara, William A.
Human tRNA Lys3 UUU (htRNA Lys3 UUU) decodes the lysine codons AAA and AAG during translation and also plays a crucial role as the primer for HIV-1 (human immunodeficiency virus type 1) reverse transcription. The posttranscriptional modifications 5-methoxycarbonylmethyl-2-thiouridine (mcm 5s 2U 34), 2-methylthio-N 6-threonylcarbamoyladenosine (ms 2t 6A 37), and pseudouridine (Ψ 39) in the tRNA's anticodon domain are critical for ribosomal binding and HIV-1 reverse transcription. To understand the importance of modified nucleoside contributions, we determined the structure and function of this tRNA's anticodon stem and loop (ASL) domain with these modifications at positions 34, 37, and 39, respectively (hASLmore » Lys3 UUU-mcm 5s 2U 34;ms 2t 6A 37;Ψ 39). Ribosome binding assays in vitro revealed that the hASL Lys3 UUU-mcm 5s 2U 34;ms 2t 6A 37;Ψ 39 bound AAA and AAG codons, whereas binding of the unmodified ASL Lys3 UUU was barely detectable. The UV hyperchromicity, the circular dichroism, and the structural analyses indicated that Ψ 39 enhanced the thermodynamic stability of the ASL through base stacking while ms 2t 6A 37 restrained the anticodon to adopt an open loop conformation that is required for ribosomal binding. The NMR-restrained molecular-dynamics-derived solution structure revealed that the modifications provided an open, ordered loop for codon binding. The crystal structures of the hASL Lys3 UUU-mcm 5s 2U 34;ms 2t 6A 37;Ψ 39 bound to the 30S ribosomal subunit with each codon in the A site showed that the modified nucleotides mcm 5s 2U 34 and ms 2t 6A 37 participate in the stability of the anticodon–codon interaction. Importantly, the mcm 5s 2U 34·G 3 wobble base pair is in the Watson–Crick geometry, requiring unusual hydrogen bonding to G in which mcm 5s 2U 34 must shift from the keto to the enol form. The results unambiguously demonstrate that modifications pre-structure the anticodon as a key prerequisite for efficient and accurate recognition of cognate and wobble codons.« less
Thiamine-responsive megaloblastic anemia: early diagnosis may be effective in preventing deafness.
Onal, Hasan; Bariş, Safa; Ozdil, Mine; Yeşil, Gözde; Altun, Gürkan; Ozyilmaz, Isa; Aydin, Ahmet; Celkan, Tiraje
2009-01-01
Thiamine-responsive megaloblastic anemia syndrome is an autosomal recessive disorder characterized by diabetes mellitus, megaloblastic anemia and sensorineural hearing loss. Mutations in the SLC19A2 gene, encoding a high-affinity thiamine transporter protein, THTR-1, are responsible for the clinical features associated with thiamine-responsive megaloblastic anemia syndrome in which treatment with pharmacological doses of thiamine correct the megaloblastic anemia and diabetes mellitus. The anemia can recur when thiamine is withdrawn. Thiamine may be effective in preventing deafness if started before two months. Our patient was found homozygous for a mutation, 242insA, in the nucleic acid sequence of exon B, with insertion of an adenine introducing a stop codon at codon 52 in the high-affinity thiamine transporter gene, SLC19A2, on chromosome 1q23.3.
Gurvich, Olga L.; Näsvall, S. Joakim; Baranov, Pavel V.; Björk, Glenn R.; Atkins, John F.
2011-01-01
The bacterial pheL gene encodes the leader peptide for the phenylalanine biosynthetic operon. Translation of pheL mRNA controls transcription attenuation and, consequently, expression of the downstream pheA gene. Fifty-three unique pheL genes have been identified in sequenced genomes of the gamma subdivision. There are two groups of pheL genes, both of which are short and contain a run(s) of phenylalanine codons at an internal position. One group is somewhat diverse and features different termination and 5′-flanking codons. The other group, mostly restricted to Enterobacteria and including Escherichia coli pheL, has a conserved nucleotide sequence that ends with UUC_CCC_UGA. When these three codons in E. coli pheL mRNA are in the ribosomal E-, P- and A-sites, there is an unusually high level, 15%, of +1 ribosomal frameshifting due to features of the nascent peptide sequence that include the penultimate phenylalanine. This level increases to 60% with a natural, heterologous, nascent peptide stimulator. Nevertheless, studies with different tRNAPro mutants in Salmonella enterica suggest that frameshifting at the end of pheL does not influence expression of the downstream pheA. This finding of incidental, rather than utilized, frameshifting is cautionary for other studies of programmed frameshifting. PMID:21177642
Crowder, Justin J.; Geigges, Marco; Gibson, Ryan T.; Fults, Eric S.; Buchanan, Bryce W.; Sachs, Nadine; Schink, Andrea; Kreft, Stefan G.; Rubenstein, Eric M.
2015-01-01
Aberrant nonstop proteins arise from translation of mRNA molecules beyond the coding sequence into the 3′-untranslated region. If a stop codon is not encountered, translation continues into the poly(A) tail, resulting in C-terminal appendage of a polylysine tract and a terminally stalled ribosome. In Saccharomyces cerevisiae, the ubiquitin ligase Rkr1/Ltn1 has been implicated in the proteasomal degradation of soluble cytosolic nonstop and translationally stalled proteins. Rkr1 is essential for cellular fitness under conditions associated with increased prevalence of nonstop proteins. Mutation of the mammalian homolog causes significant neurological pathology, suggesting broad physiological significance of ribosome-associated quality control. It is not known whether and how soluble or transmembrane nonstop and translationally stalled proteins targeted to the endoplasmic reticulum (ER) are detected and degraded. We generated and characterized model soluble and transmembrane ER-targeted nonstop and translationally stalled proteins. We found that these proteins are indeed subject to proteasomal degradation. We tested three candidate ubiquitin ligases (Rkr1 and ER-associated Doa10 and Hrd1) for roles in regulating abundance of these proteins. Our results indicate that Rkr1 plays the primary role in targeting the tested model ER-targeted nonstop and translationally stalled proteins for degradation. These data expand the catalog of Rkr1 substrates and highlight a previously unappreciated role for this ubiquitin ligase at the ER membrane. PMID:26055716
Structural basis of suppression of host translation termination by Moloney Murine Leukemia Virus
NASA Astrophysics Data System (ADS)
Tang, Xuhua; Zhu, Yiping; Baker, Stacey L.; Bowler, Matthew W.; Chen, Benjamin Jieming; Chen, Chen; Hogg, J. Robert; Goff, Stephen P.; Song, Haiwei
2016-06-01
Retroviral reverse transcriptase (RT) of Moloney murine leukemia virus (MoMLV) is expressed in the form of a large Gag-Pol precursor protein by suppression of translational termination in which the maximal efficiency of stop codon read-through depends on the interaction between MoMLV RT and peptidyl release factor 1 (eRF1). Here, we report the crystal structure of MoMLV RT in complex with eRF1. The MoMLV RT interacts with the C-terminal domain of eRF1 via its RNase H domain to sterically occlude the binding of peptidyl release factor 3 (eRF3) to eRF1. Promotion of read-through by MoMLV RNase H prevents nonsense-mediated mRNA decay (NMD) of mRNAs. Comparison of our structure with that of HIV RT explains why HIV RT cannot interact with eRF1. Our results provide a mechanistic view of how MoMLV manipulates the host translation termination machinery for the synthesis of its own proteins.
Ribosomal Binding Site Switching: An Effective Strategy for High-Throughput Cloning Constructions
Li, Yunlong; Zhang, Yong; Lu, Pei; Rayner, Simon; Chen, Shiyun
2012-01-01
Direct cloning of PCR fragments by TA cloning or blunt end ligation are two simple methods which would greatly benefit high-throughput (HTP) cloning constructions if the efficiency can be improved. In this study, we have developed a ribosomal binding site (RBS) switching strategy for direct cloning of PCR fragments. RBS is an A/G rich region upstream of the translational start codon and is essential for gene expression. Change from A/G to T/C in the RBS blocks its activity and thereby abolishes gene expression. Based on this property, we introduced an inactive RBS upstream of a selectable marker gene, and designed a fragment insertion site within this inactive RBS. Forward and reverse insertions of specifically tailed fragments will respectively form an active and inactive RBS, thus all background from vector self-ligation and fragment reverse insertions will be eliminated due to the non-expression of the marker gene. The effectiveness of our strategy for TA cloning and blunt end ligation are confirmed. Application of this strategy to gene over-expression, a bacterial two-hybrid system, a bacterial one-hybrid system, and promoter bank construction are also verified. The advantages of this simple procedure, together with its low cost and high efficiency, makes our strategy extremely useful in HTP cloning constructions. PMID:23185557
Klein, Peter; Seidel, Thorsten; Stöcker, Benedikt; Dietz, Karl-Josef
2012-01-01
The stromal ascorbate peroxidase (sAPX) functions as central element of the chloroplast antioxidant defense system. Its expression is under retrograde control of chloroplast signals including redox- and reactive oxygen species-linked cues. The sAPX promoter of Arabidopsis thaliana was dissected in transient reporter assays using mesophyll protoplasts. The study revealed regulatory elements up to –1868 upstream of the start codon. By yeast-one-hybrid screening, the transcription factor ANAC089 was identified to bind to the promoter fragment 2 (–1262 to –1646 bp upstream of translational initiation). Upon mutation of the cis-acting element CACG, binding of ANAC089 was abolished. Expression of a fused fluorescent protein version and comparison with known endomembrane markers localized ANAC089 to the trans-Golgi network and the ER. The transcription factor was released upon treatment with reducing agents and targeted to the nucleus. Transactivation assays using wild type and mutated versions of the promoter showed a partial suppression of reporter expression. The data indicate that ANAC089 functions in a negative retrograde loop, lowering sAPX expression if the cell encounters a highly reducing condition. This conclusion was supported by reciprocal transcript accumulation of ANAC089 and sAPX during acclimation to low, normal, and high light conditions. PMID:23162559
Zurawski, Gerard; Bohnert, Hans J.; Whitfeld, Paul R.; Bottomley, Warwick
1982-01-01
The gene for the so-called Mr 32,000 rapidly labeled photosystem II thylakoid membrane protein (here designated psbA) of spinach (Spinacia oleracea) chloroplasts is located on the chloroplast DNA in the large single-copy region immediately adjacent to one of the inverted repeat sequences. In this paper we show that the size of the mRNA for this protein is ≈ 1.25 kilobases and that the direction of transcription is towards the inverted repeat unit. The nucleotide sequence of the gene and its flanking regions is presented. The only large open reading frame in the sequence codes for a protein of Mr 38,950. The nucleotide sequence of psbA from Nicotiana debneyi also has been determined, and comparison of the sequences from the two species shows them to be highly conserved (>95% homology) throughout the entire reading frame. Conservation of the amino acid sequence is absolute, there being no changes in a total of 353 residues. This leads us to conclude that the primary translation product of psbA must be a protein of Mr 38,950. The protein is characterized by the complete absence of lysine residues and is relatively rich in hydrophobic amino acids, which tend to be clustered. Transcription of spinach psbA starts about 86 base pairs before the first ATG codon. Immediately upstream from this point there is a sequence typical of that found in E. coli promoters. An almost identical sequence occurs in the equivalent region of N. debneyi DNA. Images PMID:16593262
In vivo genome-wide profiling of RNA secondary structure reveals novel regulatory features.
Ding, Yiliang; Tang, Yin; Kwok, Chun Kit; Zhang, Yu; Bevilacqua, Philip C; Assmann, Sarah M
2014-01-30
RNA structure has critical roles in processes ranging from ligand sensing to the regulation of translation, polyadenylation and splicing. However, a lack of genome-wide in vivo RNA structural data has limited our understanding of how RNA structure regulates gene expression in living cells. Here we present a high-throughput, genome-wide in vivo RNA structure probing method, structure-seq, in which dimethyl sulphate methylation of unprotected adenines and cytosines is identified by next-generation sequencing. Application of this method to Arabidopsis thaliana seedlings yielded the first in vivo genome-wide RNA structure map at nucleotide resolution for any organism, with quantitative structural information across more than 10,000 transcripts. Our analysis reveals a three-nucleotide periodic repeat pattern in the structure of coding regions, as well as a less-structured region immediately upstream of the start codon, and shows that these features are strongly correlated with translation efficiency. We also find patterns of strong and weak secondary structure at sites of alternative polyadenylation, as well as strong secondary structure at 5' splice sites that correlates with unspliced events. Notably, in vivo structures of messenger RNAs annotated for stress responses are poorly predicted in silico, whereas mRNA structures of genes related to cell function maintenance are well predicted. Global comparison of several structural features between these two categories shows that the mRNAs associated with stress responses tend to have more single-strandedness, longer maximal loop length and higher free energy per nucleotide, features that may allow these RNAs to undergo conformational changes in response to environmental conditions. Structure-seq allows the RNA structurome and its biological roles to be interrogated on a genome-wide scale and should be applicable to any organism.
NS1 codon usage adaptation to humans in pandemic Zika virus.
Freire, Caio César de Melo; Palmisano, Giuseppe; Braconi, Carla T; Cugola, Fernanda R; Russo, Fabiele B; Beltrão-Braga, Patricia Cb; Iamarino, Atila; Lima Neto, Daniel Ferreira de; Sall, Amadou Alpha; Rosa-Fernandes, Livia; Larsen, Martin R; Zanotto, Paolo Marinho de Andrade
2018-05-10
Zika virus (ZIKV) was recognised as a zoonotic pathogen in Africa and southeastern Asia. Human infections were infrequently reported until 2007, when the first known epidemic occurred in Micronesia. After 2013, the Asian lineage of ZIKV spread along the Pacific Islands and Americas, causing severe outbreaks with millions of human infections. The recent human infections of ZIKV were also associated with severe complications, such as an increase in cases of Guillain-Barre syndrome and the emergence of congenital Zika syndrome. To better understand the recent and rapid expansion of ZIKV, as well as the presentation of novel complications, we compared the genetic differences between the African sylvatic lineage and the Asian epidemic lineage that caused the recent massive outbreaks. The epidemic lineages have significant codon adaptation in NS1 gene to translate these proteins in human and Aedes aegypti mosquito cells compared to the African zoonotic lineage. Accordingly, a Brazilian epidemic isolate (ZBR) produced more NS1 protein than the MR766 African lineage (ZAF) did, as indicated by proteomic data from infections of neuron progenitor cells-derived neurospheres. Although ZBR replicated more efficiently in these cells, the differences observed in the stoichiometry of ZIKV proteins were not exclusively explained by the differences in viral replication between the lineages. Our findings suggest that natural, silent translational selection in the second half of 20th century could have improved the fitness of Asian ZIKV lineage in human and mosquito cells.
Diago-Navarro, Elizabeth; Mora, Liliana; Buckingham, Richard H; Díaz-Orejas, Ramón; Lemonnier, Marc
2009-01-01
Novel mutations in prfA, the gene for the polypeptide release factor RF1 of Escherichia coli, were isolated using a positive genetic screen based on the parD (kis, kid) toxin-antitoxin system. This original approach allowed the direct selection of mutants with altered translational termination efficiency at UAG codons. The isolated prfA mutants displayed a approximately 10-fold decrease in UAG termination efficiency with no significant changes in RF1 stability in vivo. All three mutations, G121S, G301S and R303H, were situated close to the nonsense codon recognition site in RF1:ribosome complexes. The prfA mutants displayed increased sensitivity to the RelE toxin encoded by the relBE system of E. coli, thus providing in vivo support for the functional interaction between RF1 and RelE. The prfA mutants also showed increased sensitivity to the Kid toxin. Since this toxin can cleave RNA in a ribosome-independent manner, this result was not anticipated and provided first evidence for the involvement of RF1 in the pathway of Kid toxicity. The sensitivity of the prfA mutants to RelE and Kid was restored to normal levels upon overproduction of the wild-type RF1 protein. We discuss these results and their utility for the design of novel antibacterial strategies in the light of the recently reported structure of ribosome-bound RF1.
Deficit of tRNA(Lys) modification by Cdkal1 causes the development of type 2 diabetes in mice.
Wei, Fan-Yan; Suzuki, Takeo; Watanabe, Sayaka; Kimura, Satoshi; Kaitsuka, Taku; Fujimura, Atsushi; Matsui, Hideki; Atta, Mohamed; Michiue, Hiroyuki; Fontecave, Marc; Yamagata, Kazuya; Suzuki, Tsutomu; Tomizawa, Kazuhito
2011-09-01
The worldwide prevalence of type 2 diabetes (T2D), which is caused by a combination of environmental and genetic factors, is increasing. With regard to genetic factors, variations in the gene encoding Cdk5 regulatory associated protein 1-like 1 (Cdkal1) have been associated with an impaired insulin response and increased risk of T2D across different ethnic populations, but the molecular function of this protein has not been characterized. Here, we show that Cdkal1 is a mammalian methylthiotransferase that biosynthesizes 2-methylthio-N6-threonylcarbamoyladenosine (ms2t6A) in tRNA(Lys)(UUU) and that it is required for the accurate translation of AAA and AAG codons. Mice with pancreatic β cell-specific KO of Cdkal1 (referred to herein as β cell KO mice) showed pancreatic islet hypertrophy, a decrease in insulin secretion, and impaired blood glucose control. In Cdkal1-deficient β cells, misreading of Lys codon in proinsulin occurred, resulting in a reduction of glucose-stimulated proinsulin synthesis. Moreover, expression of ER stress-related genes was upregulated in these cells, and abnormally structured ER was observed. Further, the β cell KO mice were hypersensitive to high fat diet-induced ER stress. These findings suggest that glucose-stimulated translation of proinsulin may require fully modified tRNA(Lys)(UUU), which could potentially explain the molecular pathogenesis of T2D in patients carrying cdkal1 risk alleles.
Thompson, R C; Dix, D B; Gerson, R B; Karim, A M
1981-01-10
The characteristics of a GTPase reaction between poly(U)-programmed ribosomes, EFTu . GTP, and the near-cognate aminoacyl (aa)-tRNA, Leu-tRNA Leu 2, have been studied to assess the role of this reaction in proofreading of the codon-anticodon interaction. The reaction resembles the GTPase reaction with cognate aa-tRNAs and EFTu . GTP in its substrate requirements, in its involving EFTu . GTP . aa-tRNA ternary complexes, and in its requiring a free ribosomal A-site. The noncognate reaction differs from the cognate one in that aa-tRNA becomes stably bound to the ribosomes only 5% of the time; it therefore seems best characterized as an abortive enzymatic binding reaction. The rate of reaction is a significant fraction (4%) of that of the cognate aa-tRNA, indicating that recognition of ternary complexes by ribosomes involves a level of error greater than that of translation as a whole. The rejection of the noncognate aa-tRNA following GTP hydrolysis is therefore a vital step in the translation process and fulfills the criteria set for a proofreading reaction. Leu-tRNA Leu 2 which escapes rejection through proofreading, forms a stable complex with the ribosomal A-site, so it appears that the Leu-tRNA2 which was rejected never reached the A-site and that proofreading precedes full A-site binding.
Deficit of tRNALys modification by Cdkal1 causes the development of type 2 diabetes in mice
Wei, Fan-Yan; Suzuki, Takeo; Watanabe, Sayaka; Kimura, Satoshi; Kaitsuka, Taku; Fujimura, Atsushi; Matsui, Hideki; Atta, Mohamed; Michiue, Hiroyuki; Fontecave, Marc; Yamagata, Kazuya; Suzuki, Tsutomu; Tomizawa, Kazuhito
2011-01-01
The worldwide prevalence of type 2 diabetes (T2D), which is caused by a combination of environmental and genetic factors, is increasing. With regard to genetic factors, variations in the gene encoding Cdk5 regulatory associated protein 1–like 1 (Cdkal1) have been associated with an impaired insulin response and increased risk of T2D across different ethnic populations, but the molecular function of this protein has not been characterized. Here, we show that Cdkal1 is a mammalian methylthiotransferase that biosynthesizes 2-methylthio-N6-threonylcarbamoyladenosine (ms2t6A) in tRNALys(UUU) and that it is required for the accurate translation of AAA and AAG codons. Mice with pancreatic β cell–specific KO of Cdkal1 (referred to herein as β cell KO mice) showed pancreatic islet hypertrophy, a decrease in insulin secretion, and impaired blood glucose control. In Cdkal1-deficient β cells, misreading of Lys codon in proinsulin occurred, resulting in a reduction of glucose-stimulated proinsulin synthesis. Moreover, expression of ER stress–related genes was upregulated in these cells, and abnormally structured ER was observed. Further, the β cell KO mice were hypersensitive to high fat diet–induced ER stress. These findings suggest that glucose-stimulated translation of proinsulin may require fully modified tRNALys(UUU), which could potentially explain the molecular pathogenesis of T2D in patients carrying cdkal1 risk alleles. PMID:21841312
Core histone genes of Giardia intestinalis: genomic organization, promoter structure, and expression
Yee, Janet; Tang, Anita; Lau, Wei-Ling; Ritter, Heather; Delport, Dewald; Page, Melissa; Adam, Rodney D; Müller, Miklós; Wu, Gang
2007-01-01
Background Giardia intestinalis is a protist found in freshwaters worldwide, and is the most common cause of parasitic diarrhea in humans. The phylogenetic position of this parasite is still much debated. Histones are small, highly conserved proteins that associate tightly with DNA to form chromatin within the nucleus. There are two classes of core histone genes in higher eukaryotes: DNA replication-independent histones and DNA replication-dependent ones. Results We identified two copies each of the core histone H2a, H2b and H3 genes, and three copies of the H4 gene, at separate locations on chromosomes 3, 4 and 5 within the genome of Giardia intestinalis, but no gene encoding a H1 linker histone could be recognized. The copies of each gene share extensive DNA sequence identities throughout their coding and 5' noncoding regions, which suggests these copies have arisen from relatively recent gene duplications or gene conversions. The transcription start sites are at triplet A sequences 1–27 nucleotides upstream of the translation start codon for each gene. We determined that a 50 bp region upstream from the start of the histone H4 coding region is the minimal promoter, and a highly conserved 15 bp sequence called the histone motif (him) is essential for its activity. The Giardia core histone genes are constitutively expressed at approximately equivalent levels and their mRNAs are polyadenylated. Competition gel-shift experiments suggest that a factor within the protein complex that binds him may also be a part of the protein complexes that bind other promoter elements described previously in Giardia. Conclusion In contrast to other eukaryotes, the Giardia genome has only a single class of core histone genes that encode replication-independent histones. Our inability to locate a gene encoding the linker histone H1 leads us to speculate that the H1 protein may not be required for the compaction of Giardia's small and gene-rich genome. PMID:17425802
Peng, Rui; Zeng, Bo; Meng, Xiuxiang; Yue, Bisong; Zhang, Zhihe; Zou, Fangdong
2007-08-01
The complete mitochondrial genome sequence of the giant panda, Ailuropoda melanoleuca, was determined by the long and accurate polymerase chain reaction (LA-PCR) with conserved primers and primer walking sequence methods. The complete mitochondrial DNA is 16,805 nucleotides in length and contains two ribosomal RNA genes, 13 protein-coding genes, 22 transfer RNA genes and one control region. The total length of the 13 protein-coding genes is longer than the American black bear, brown bear and polar bear by 3 amino acids at the end of ND5 gene. The codon usage also followed the typical vertebrate pattern except for an unusual ATT start codon, which initiates the NADH dehydrogenase subunit 5 (ND5) gene. The molecular phylogenetic analysis was performed on the sequences of 12 concatenated heavy-strand encoded protein-coding genes, and suggested that the giant panda is most closely related to bears.
Complete mitochondrial genome of the Freshwater Whipray Himantura dalyensis.
Feutry, Pierre; Kyne, Peter M; Peng, Zaiqing; Pan, Lianghao; Chen, Xiao
2016-05-01
The complete mitochondrial genome of the Freshwater Whipray Himantura dalyensis is presented in this study. It is 17,693 bp in length and contains 37 genes in typical gene order and transcriptional orientation observed in vertebrates. There were a total of 86 bp short intergenic spacers and 22 bp overlaps in the genome. The overall base composition was 31.4% A, 25.5% C, 13.2% G and 29.9% T. Two start codons (GTG and ATG) and two stop codons (TAG and TAA/T) were found in 13 protein-coding genes. The length of 22 tRNA genes ranged from 68 (tRNA-Cys and tRNA-Ser2) to 75 bp (tRNA-Leu1). The origin of L-strand replication (OL) was found between the tRNA-Asn and tRNA-Cys genes. The base composition of the control region (1940 bp) was similar to the whole mitogenome.
ClubSub-P: Cluster-Based Subcellular Localization Prediction for Gram-Negative Bacteria and Archaea
Paramasivam, Nagarajan; Linke, Dirk
2011-01-01
The subcellular localization (SCL) of proteins provides important clues to their function in a cell. In our efforts to predict useful vaccine targets against Gram-negative bacteria, we noticed that misannotated start codons frequently lead to wrongly assigned SCLs. This and other problems in SCL prediction, such as the relatively high false-positive and false-negative rates of some tools, can be avoided by applying multiple prediction tools to groups of homologous proteins. Here we present ClubSub-P, an online database that combines existing SCL prediction tools into a consensus pipeline from more than 600 proteomes of fully sequenced microorganisms. On top of the consensus prediction at the level of single sequences, the tool uses clusters of homologous proteins from Gram-negative bacteria and from Archaea to eliminate false-positive and false-negative predictions. ClubSub-P can assign the SCL of proteins from Gram-negative bacteria and Archaea with high precision. The database is searchable, and can easily be expanded using either new bacterial genomes or new prediction tools as they become available. This will further improve the performance of the SCL prediction, as well as the detection of misannotated start codons and other annotation errors. ClubSub-P is available online at http://toolkit.tuebingen.mpg.de/clubsubp/ PMID:22073040
Determining if an mRNA is a Substrate of Nonsense-Mediated mRNA Decay in Saccharomyces cerevisiae.
Johansson, Marcus J O
2017-01-01
Nonsense-mediated mRNA decay (NMD) is a conserved eukaryotic quality control mechanism which triggers decay of mRNAs harboring premature translation termination codons. In this chapter, I describe methods for monitoring the influence of NMD on mRNA abundance and decay rates in Saccharomyces cerevisiae. The descriptions include detailed methods for growing yeast cells, total RNA isolation, and Northern blotting. Although the chapter focuses on NMD, the methods can be easily adapted to assess the effect of other mRNA decay pathways.
Complete mitochondrial genome sequence of Urechis caupo, a representative of the phylum Echiura
Boore, Jeffrey L
2004-01-01
Background Mitochondria contain small genomes that are physically separate from those of nuclei. Their comparison serves as a model system for understanding the processes of genome evolution. Although hundreds of these genome sequences have been reported, the taxonomic sampling is highly biased toward vertebrates and arthropods, with many whole phyla remaining unstudied. This is the first description of a complete mitochondrial genome sequence of a representative of the phylum Echiura, that of the fat innkeeper worm, Urechis caupo. Results This mtDNA is 15,113 nts in length and 62% A+T. It contains the 37 genes that are typical for animal mtDNAs in an arrangement somewhat similar to that of annelid worms. All genes are encoded by the same DNA strand which is rich in A and C relative to the opposite strand. Codons ending with the dinucleotide GG are more frequent than would be expected from apparent mutational biases. The largest non-coding region is only 282 nts long, is 71% A+T, and has potential for secondary structures. Conclusions Urechis caupo mtDNA shares many features with those of the few studied annelids, including the common usage of ATG start codons, unusual among animal mtDNAs, as well as gene arrangements, tRNA structures, and codon usage biases. PMID:15369601
Lukes, Julius; Paris, Zdenek; Regmi, Sandesh; Breitling, Reinhard; Mureev, Sergey; Kushnir, Susanna; Pyatkov, Konstantin; Jirků, Milan; Alexandrov, Kirill A
2006-08-01
To investigate the influence of sequence context of translation initiation codon on translation efficiency in Kinetoplastida, we constructed a library of expression plasmids randomized in the three nucleotides prefacing ATG of a reporter gene encoding enhanced green fluorescent protein (EGFP). All 64 possible combinations of pre-ATG triplets were individually stably integrated into the rDNA locus of Leishmania tarentolae and the resulting cell lines were assessed for EGFP expression. The expression levels were quantified directly by measuring the fluorescence of EGFP protein in living cells and confirmed by Western blotting. We observed a strong influence of the pre-ATG triplet on the level of protein expression over a 20-fold range. To understand the degree of evolutionary conservation of the observed effect, we transformed Phytomonas serpens, a trypanosomatid parasite of plants, with a subset of the constructs. The pattern of translational efficiency mediated by individual pre-ATG triplets in this species was similar to that observed in L. tarentolae. However, the pattern of translational efficiency of two other proteins (red fluorescent protein and tetracycline repressor) containing selected pre-ATG triplets did not correlate with either EGFP or each other. Thus, we conclude that a conserved mechanism of translation initiation site selection exists in kinetoplastids that is strongly influenced not only by the pre-ATG sequences but also by the coding region of the gene.
Lin, Junyan; Guo, Jiangbo; Finer, John; Dorrance, Anne E.; Redinbaugh, Margaret G.
2014-01-01
ABSTRACT Bean pod mottle virus (BPMV) is a bipartite, positive-sense (+) RNA plant virus in the Secoviridae family. Its RNA1 encodes proteins required for genome replication, whereas RNA2 primarily encodes proteins needed for virion assembly and cell-to-cell movement. However, the function of a 58-kDa protein (P58) encoded by RNA2 has not been resolved. P58 and the movement protein (MP) of BPMV are two largely identical proteins differing only at their N termini, with P58 extending MP upstream by 102 amino acid residues. In this report, we unveil a unique role for P58. We show that BPMV RNA2 accumulation in infected cells was abolished when the start codon of P58 was eliminated. The role of P58 does not require the region shared by MP, as RNA2 accumulation in individual cells remained robust even when most of the MP coding sequence was removed. Importantly, the function of P58 required the P58 protein, rather than its coding RNA, as compensatory mutants could be isolated that restored RNA2 accumulation by acquiring new start codons upstream of the original one. Most strikingly, loss of P58 function could not be complemented by P58 provided in trans, suggesting that P58 functions in cis to selectively promote the accumulation of RNA2 copies that encode a functional P58 protein. Finally, we found that all RNA1-encoded proteins are cis-acting relative to RNA1. Together, our results suggest that P58 probably functions by recruiting the RNA1-encoded polyprotein to RNA2 to enable RNA2 reproduction. IMPORTANCE Bean pod mottle virus (BPMV) is one of the most important pathogens of the crop plant soybean, yet its replication mechanism is not well understood, hindering the development of knowledge-based control measures. The current study examined the replication strategy of BPMV RNA2, one of the two genomic RNA segments of this virus, and established an essential role for P58, one of the RNA2-encoded proteins, in the process of RNA2 replication. Our study demonstrates for the first time that P58 functions preferentially with the very RNA from which it is translated, thus greatly advancing our understanding of the replication mechanisms of this and related viruses. Furthermore, this study is important because it provides a potential target for BPMV-specific control, and hence could help to mitigate soybean production losses caused by this virus. PMID:24390330
Lamping, Erwin; Niimi, Masakazu; Cannon, Richard D
2013-07-29
A large range of genetic tools has been developed for the optimal design and regulation of complex metabolic pathways in bacteria. However, fewer tools exist in yeast that can precisely tune the expression of individual enzymes in novel metabolic pathways suitable for industrial-scale production of non-natural compounds. Tuning expression levels is critical for reducing the metabolic burden of over-expressed proteins, the accumulation of toxic intermediates, and for redirecting metabolic flux from native pathways involving essential enzymes without negatively affecting the viability of the host. We have developed a yeast membrane protein hyper-expression system with critical advantages over conventional, plasmid-based, expression systems. However, expression levels are sometimes so high that they adversely affect protein targeting/folding or the growth and/or phenotype of the host. Here we describe the use of small synthetic mRNA control modules that allowed us to predictably tune protein expression levels to any desired level. Down-regulation of expression was achieved by engineering small GC-rich mRNA stem-loops into the 5' UTR that inhibited translation initiation of the yeast ribosomal 43S preinitiation complex (PIC). Exploiting the fact that the yeast 43S PIC has great difficulty scanning through GC-rich mRNA stem-loops, we created yeast strains containing 17 different RNA stem-loop modules in the 5' UTR that expressed varying amounts of the fungal multidrug efflux pump reporter Cdr1p from Candida albicans. Increasing the length of mRNA stem-loops (that contained only GC-pairs) near the AUG start-codon led to a surprisingly large decrease in Cdr1p expression; ~2.7-fold for every additional GC-pair added to the stem, while the mRNA levels remained largely unaffected. An mRNA stem-loop of seven GC-pairs (∆G = -15.8 kcal/mol) reduced Cdr1p expression levels by >99%, and even the smallest possible stem-loop of only three GC-pairs (∆G = -4.4 kcal/mol) inhibited Cdr1p expression by ~50%. We have developed a simple cloning strategy to fine-tune protein expression levels in yeast that has many potential applications in metabolic engineering and the optimization of protein expression in yeast. This study also highlights the importance of considering the use of multiple cloning-sites carefully to preclude unwanted effects on gene expression.
2013-01-01
Background A large range of genetic tools has been developed for the optimal design and regulation of complex metabolic pathways in bacteria. However, fewer tools exist in yeast that can precisely tune the expression of individual enzymes in novel metabolic pathways suitable for industrial-scale production of non-natural compounds. Tuning expression levels is critical for reducing the metabolic burden of over-expressed proteins, the accumulation of toxic intermediates, and for redirecting metabolic flux from native pathways involving essential enzymes without negatively affecting the viability of the host. We have developed a yeast membrane protein hyper-expression system with critical advantages over conventional, plasmid-based, expression systems. However, expression levels are sometimes so high that they adversely affect protein targeting/folding or the growth and/or phenotype of the host. Here we describe the use of small synthetic mRNA control modules that allowed us to predictably tune protein expression levels to any desired level. Down-regulation of expression was achieved by engineering small GC-rich mRNA stem-loops into the 5′ UTR that inhibited translation initiation of the yeast ribosomal 43S preinitiation complex (PIC). Results Exploiting the fact that the yeast 43S PIC has great difficulty scanning through GC-rich mRNA stem-loops, we created yeast strains containing 17 different RNA stem-loop modules in the 5′ UTR that expressed varying amounts of the fungal multidrug efflux pump reporter Cdr1p from Candida albicans. Increasing the length of mRNA stem-loops (that contained only GC-pairs) near the AUG start-codon led to a surprisingly large decrease in Cdr1p expression; ~2.7-fold for every additional GC-pair added to the stem, while the mRNA levels remained largely unaffected. An mRNA stem-loop of seven GC-pairs (∆G = −15.8 kcal/mol) reduced Cdr1p expression levels by >99%, and even the smallest possible stem-loop of only three GC-pairs (∆G = −4.4 kcal/mol) inhibited Cdr1p expression by ~50%. Conclusion We have developed a simple cloning strategy to fine-tune protein expression levels in yeast that has many potential applications in metabolic engineering and the optimization of protein expression in yeast. This study also highlights the importance of considering the use of multiple cloning-sites carefully to preclude unwanted effects on gene expression. PMID:23895661
Jaag, Hannah Miriam; Kawchuk, Lawrence; Rohde, Wolfgang; Fischer, Rainer; Emans, Neil; Prüfer, Dirk
2003-01-01
Potato leafroll polerovirus (PLRV) genomic RNA acts as a polycistronic mRNA for the production of proteins P0, P1, and P2 translated from the 5′-proximal half of the genome. Within the P1 coding region we identified a 5-kDa replication-associated protein 1 (Rap1) essential for viral multiplication. An internal ribosome entry site (IRES) with unusual structure and location was identified that regulates Rap1 translation. Core structural elements for internal ribosome entry include a conserved AUG codon and a downstream GGAGAGAGAGG motif with inverted symmetry. Reporter gene expression in potato protoplasts confirmed the internal ribosome entry function. Unlike known IRES motifs, the PLRV IRES is located completely within the coding region of Rap1 at the center of the PLRV genome. PMID:12835413
Inhibition of eukaryotic translation elongation by the antitumor natural product Mycalamide B.
Dang, Yongjun; Schneider-Poetsch, Tilman; Eyler, Daniel E; Jewett, John C; Bhat, Shridhar; Rawal, Viresh H; Green, Rachel; Liu, Jun O
2011-08-01
Mycalamide B (MycB) is a marine sponge-derived natural product with potent antitumor activity. Although it has been shown to inhibit protein synthesis, the molecular mechanism of action by MycB remains incompletely understood. We verified the inhibition of translation elongation by in vitro HCV IRES dual luciferase assays, ribosome assembly, and in vivo [(35)S]methinione labeling experiments. Similar to cycloheximide (CHX), MycB inhibits translation elongation through blockade of eEF2-mediated translocation without affecting the eEF1A-mediated loading of tRNA onto the ribosome, AUG recognition, or dipeptide synthesis. Using chemical footprinting, we identified the MycB binding site proximal to the C3993 28S rRNA residue on the large ribosomal subunit. However, there are also subtle, but significant differences in the detailed mechanisms of action of MycB and CHX. First, MycB arrests the ribosome on the mRNA one codon ahead of CHX. Second, MycB specifically blocked tRNA binding to the E-site of the large ribosomal subunit. Moreover, they display different polysome profiles in vivo. Together, these observations shed new light on the mechanism of inhibition of translation elongation by MycB.
uAUG-mediated translational initiations are responsible for human mu opioid receptor gene expression
Song, Kyu Young; Kim, Chun Sung; Hwang, Cheol Kyu; Choi, Hack Sun; Law, Ping-Yee; Wei, Li-Na; Loh, Horace H
2010-01-01
Abstract Mu opioid receptor (MOR) is the main site of interaction for major clinical analgesics, particularly morphine. MOR expression is regulated at the transcriptional and post-transcriptional levels. However, the protein expression of the MOR gene is relatively low and the translational control of MOR gene has not been well studied. The 5′-untranslated region (UTR) of the human MOR (OPRM1) mRNA contains four upstream AUG codons (uAUG) preceding the main translation initiation site. We mutated the four uAUGs individually and in combination. Mutations of the third uAUG, containing the same open reading frame, had the strongest inhibitory effect. The inhibitory effect caused by the third in-frame uAUG was confirmed by in vitro translation and receptor-binding assays. Toeprinting results showed that OPRM1 ribosomes initiated efficiently at the first uAUG, and subsequently re-initiated at the in-frame #3 uAUG and the physiological AUG site. This re-initiation resulted in negative expression of OPRM1 under normal conditions. These results indicate that re-initiation in MOR gene expression could play an important role in OPRM1 regulation. PMID:19438807
Protein degradation and dynamic tRNA thiolation fine-tune translation at elevated temperatures.
Tyagi, Kshitiz; Pedrioli, Patrick G A
2015-05-19
Maintenance of protein quality control has implications in various processes such as neurodegeneration and ageing. To investigate how environmental insults affect this process, we analysed the proteome of yeast continuously exposed to mild heat stress. In agreement with previous transcriptomics studies, amongst the most marked changes, we found up-regulation of cytoprotective factors; a shift from oxidative phosphorylation to fermentation; and down-regulation of translation. Importantly, we also identified a novel, post-translationally controlled, component of the heat shock response. The abundance of Ncs2p and Ncs6p, two members of the URM1 pathway responsible for the thiolation of wobble uridines in cytoplasmic tRNAs tK(UUU), tQ(UUG) and tE(UUC), is down-regulated in a proteasomal dependent fashion. Using random forests we show that this results in differential translation of transcripts with a biased content for the corresponding codons. We propose that the role of this pathway in promoting catabolic and inhibiting anabolic processes, affords cells with additional time and resources needed to attain proper protein folding under periods of stress. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Raymond, Amy; Lovell, Scott; Lorimer, Don
2009-12-01
With the goal of improving yield and success rates of heterologous protein production for structural studies we have developed the database and algorithm software package Gene Composer. This freely available electronic tool facilitates the information-rich design of protein constructs and their engineered synthetic gene sequences, as detailed in the accompanying manuscript. In this report, we compare heterologous protein expression levels from native sequences to that of codon engineered synthetic gene constructs designed by Gene Composer. A test set of proteins including a human kinase (P38{alpha}), viral polymerase (HCV NS5B), and bacterial structural protein (FtsZ) were expressed in both E. colimore » and a cell-free wheat germ translation system. We also compare the protein expression levels in E. coli for a set of 11 different proteins with greatly varied G:C content and codon bias. The results consistently demonstrate that protein yields from codon engineered Gene Composer designs are as good as or better than those achieved from the synonymous native genes. Moreover, structure guided N- and C-terminal deletion constructs designed with the aid of Gene Composer can lead to greater success in gene to structure work as exemplified by the X-ray crystallographic structure determination of FtsZ from Bacillus subtilis. These results validate the Gene Composer algorithms, and suggest that using a combination of synthetic gene and protein construct engineering tools can improve the economics of gene to structure research.« less
Fellner, Lea; Simon, Svenja; Scherling, Christian; Witting, Michael; Schober, Steffen; Polte, Christine; Schmitt-Kopplin, Philippe; Keim, Daniel A; Scherer, Siegfried; Neuhaus, Klaus
2015-12-18
Gene duplication is believed to be the classical way to form novel genes, but overprinting may be an important alternative. Overprinting allows entirely novel proteins to evolve de novo, i.e., formerly non-coding open reading frames within functional genes become expressed. Only three cases have been described for Escherichia coli. Here, a fourth example is presented. RNA sequencing revealed an open reading frame weakly transcribed in cow dung, coding for 101 residues and embedded completely in the -2 reading frame of citC in enterohemorrhagic E. coli. This gene is designated novel overlapping gene, nog1. The promoter region fused to gfp exhibits specific activities and 5' rapid amplification of cDNA ends indicated the transcriptional start 40-bp upstream of the start codon. nog1 was strand-specifically arrested in translation by a nonsense mutation silent in citC. This Nog1-mutant showed a phenotype in competitive growth against wild type in the presence of MgCl2. Small differences in metabolite concentrations were also found. Bioinformatic analyses propose Nog1 to be inner membrane-bound and to possess at least one membrane-spanning domain. A phylogenetic analysis suggests that the orphan gene nog1 arose by overprinting after Escherichia/Shigella separated from the other γ-proteobacteria. Since nog1 is of recent origin, non-essential, short, weakly expressed and only marginally involved in E. coli's central metabolism, we propose that this gene is in an initial stage of evolution. While we present specific experimental evidence for the existence of a fourth overlapping gene in enterohemorrhagic E. coli, we believe that this may be an initial finding only and overlapping genes in bacteria may be more common than is currently assumed by microbiologists.
Omasits, Ulrich; Varadarajan, Adithi R; Schmid, Michael; Goetze, Sandra; Melidis, Damianos; Bourqui, Marc; Nikolayeva, Olga; Québatte, Maxime; Patrignani, Andrea; Dehio, Christoph; Frey, Juerg E; Robinson, Mark D; Wollscheid, Bernd; Ahrens, Christian H
2017-12-01
Accurate annotation of all protein-coding sequences (CDSs) is an essential prerequisite to fully exploit the rapidly growing repertoire of completely sequenced prokaryotic genomes. However, large discrepancies among the number of CDSs annotated by different resources, missed functional short open reading frames (sORFs), and overprediction of spurious ORFs represent serious limitations. Our strategy toward accurate and complete genome annotation consolidates CDSs from multiple reference annotation resources, ab initio gene prediction algorithms and in silico ORFs (a modified six-frame translation considering alternative start codons) in an integrated proteogenomics database (iPtgxDB) that covers the entire protein-coding potential of a prokaryotic genome. By extending the PeptideClassifier concept of unambiguous peptides for prokaryotes, close to 95% of the identifiable peptides imply one distinct protein, largely simplifying downstream analysis. Searching a comprehensive Bartonella henselae proteomics data set against such an iPtgxDB allowed us to unambiguously identify novel ORFs uniquely predicted by each resource, including lipoproteins, differentially expressed and membrane-localized proteins, novel start sites and wrongly annotated pseudogenes. Most novelties were confirmed by targeted, parallel reaction monitoring mass spectrometry, including unique ORFs and single amino acid variations (SAAVs) identified in a re-sequenced laboratory strain that are not present in its reference genome. We demonstrate the general applicability of our strategy for genomes with varying GC content and distinct taxonomic origin. We release iPtgxDBs for B. henselae , Bradyrhizobium diazoefficiens and Escherichia coli and the software to generate both proteogenomics search databases and integrated annotation files that can be viewed in a genome browser for any prokaryote. © 2017 Omasits et al.; Published by Cold Spring Harbor Laboratory Press.
Parra-Unda, Ricardo; Vaca-Paniagua, Felipe; Jiménez, Lucia; Landa, Abraham
2012-01-01
Cytosolic Cu,Zn superoxide dismutase (Cu,Zn-SOD) catalyzes the dismutation of superoxide (O(2)(-)) to oxygen and hydrogen peroxide (H(2)O(2)) and plays an important role in the establishment and survival of helminthes in their hosts. In this work, we describe the Taenia solium Cu,Zn-SOD gene (TsCu,Zn-SOD) and a Taenia crassiceps (TcCu,Zn-SOD) cDNA. TsCu,Zn-SOD gene that spans 2.841 kb, and has three exons and two introns; the splicing junctions follow the GT-AG rule. Analysis in silico of the gene revealed that the 5'-flanking region has three putative TATA and CCAAT boxes, and transcription factor binding sites for NF1 and AP1. The transcription start site was a C, located at 22 nucleotides upstream of the translation start codon (ATG). Southern blot analysis showed that TcCu,Zn-SOD and TsCu,Zn-SOD genes are encoded by a single copy. The deduced amino acid sequences of TsCu,Zn-SOD gene and TcCu,Zn-SOD cDNA reveal 98.47% of identity, and the characteristic motives, including the catalytic site and β-barrel structure of the Cu,Zn-SOD. Proteomic and immunohistochemical analysis indicated that Cu,Zn-SOD does not have isoforms, is distributed throughout the bladder wall and is concentrated in the tegument of T. solium and T. crassiceps cysticerci. Expression analysis revealed that TcCu,Zn-SOD mRNA and protein expression levels do not change in cysticerci, even upon exposure to O(2)(-) (0-3.8 nmol/min) and H(2)O(2) (0-2mM), suggesting that this gene is constitutively expressed in these parasites. Published by Elsevier Inc.
Fitzgerald, Jamie; Holden, Paul; Wright, Hollis; Wilmot, Beth; Hata, Abigail; Steiner, Robert D.; Basel, Don
2016-01-01
Background Osteogenesis imperfecta (OI) type V is a dominantly inherited skeletal dysplasia characterized by fractures and progressive deformity of long bones. In addition, patients often present with radial head dislocation, hyperplastic callus, and calcification of the forearm interosseous membrane. Recently, a specific mutation in the IFITM5 gene was found to be responsible for OI type V. This mutation, a C to T transition 14 nucleotides upstream from the endogenous start codon, creates a new start methionine that appears to be preferentially used by the translational machinery. However, the mechanism by which the lengthened protein results in a dominant type of OI is unknown. Methods and Results We report 7 ethnically diverse (African-American, Caucasian, Hispanic, and African) individuals with OI type V from 2 families and 2 sporadic cases. Exome sequencing failed to identify a causative mutation. Using Sanger sequencing, we found that all affected individuals in our cohort possess the c.−14 IFITM5 variant, further supporting the notion that OI type V is caused by a single, discrete mutation. Our patient cohort demonstrated inter-and intrafamilial phenotypic variability, including a father with classic OI type V whose daughter had a phenotype similar to OI type I. This clinical variability suggests that modifier genes influence the OI type V phenotype. We also confirm that the mutation creates an aberrant IFITM5 protein containing an additional 5 amino acids at the N-terminus. Conclusions The variable clinical signs in these cases illustrate the significant variability of the OI type V phenotype caused by the c.−14 IFITM5 mutation. The affected individuals are more ethnically diverse than previously reported. PMID:28824928
Jorge, João M P; Nguyen, Anh Q D; Pérez-García, Fernando; Kind, Stefanie; Wendisch, Volker F
2017-04-01
Gamma-aminobutyric acid (GABA) is a non-protein amino acid widespread in Nature. Among the various uses of GABA, its lactam form 2-pyrrolidone can be chemically converted to the biodegradable plastic polyamide-4. In metabolism, GABA can be synthesized either by decarboxylation of l-glutamate or by a pathway that starts with the transamination of putrescine. Fermentative production of GABA from glucose by recombinant Corynebacterium glutamicum has been described via both routes. Putrescine-based GABA production was characterized by accumulation of by-products such as N-acetyl-putrescine. Their formation was abolished by deletion of the spermi(di)ne N-acetyl-transferase gene snaA. To improve provision of l-glutamate as precursor 2-oxoglutarate dehydrogenase activity was reduced by changing the translational start codon of the chromosomal gene for 2-oxoglutarate dehydrogenase subunit E1o to the less preferred TTG and by maintaining the inhibitory protein OdhI in its inhibitory form by changing amino acid residue 15 from threonine to alanine. Putrescine-based GABA production by the strains described here led to GABA titers up to 63.2 g L -1 in fed-batch cultivation at maximum volumetric productivities up to 1.34 g L -1 h -1 , the highest volumetric productivity for fermentative GABA production reported to date. Moreover, GABA production from the carbon sources xylose, glucosamine, and N-acetyl-glucosamine that do not have competing uses in the food or feed industries was established. Biotechnol. Bioeng. 2017;114: 862-873. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Dou, Y.; Rutanhira, H.; Chen, X.; Mishra, A.; Wang, C.; Fletcher, H.M.
2018-01-01
Summary In Porphyromonas gingivalis, the protein PG1660, composed of 174 amino acids, is annotated as an extracytoplasmic function (ECF) sigma factor (RpoE homologue-σ24). Because PG1660 can modulate several virulence factors and responds to environmental signals in P. gingivalis, its genetic properties were evaluated. PG1660 is co-transcribed with its downstream gene PG1659, and the transcription start site was identified as adenine residue 54-nucleotides upstream of the ATG translation start codon. In addition to binding its own promoter, using the purified rPG1660 and RNAP core enzyme from Escherichia coli with the PG1660 promoter DNA as template, the function of PG1660 as a sigma factor was demonstrated in an in vitro transcription assay. Transcriptome analyses of a P. gingivalis PG1660-defective isogenic mutant revealed that under oxidative stress conditions 176 genes including genes involved in secondary metabolism were downregulated more than two-fold compared with the parental strain. The rPG1660 protein also showed the ability to bind to the promoters of the highly downregulated genes in the PG1660-deficient mutant. As the ECF sigma factor PG0162 has a 29% identity with PG1660 and can modulate its expression, the cross-talk between their regulatory networks was explored. The expression profile of the PG0162PG1660-deficient mutant (P. gingivalis FLL356) revealed that the type IX secretion system genes and several virulence genes were downregulated under hydrogen peroxide stress conditions. Taken together, we have confirmed that PG1660 can function as a sigma factor, and plays an important regulatory role in the oxidative stress and virulence regulatory network of P. gingivalis. PMID:29059500
Gan, Rui; Perez, Jessica G; Carlson, Erik D; Ntai, Ioanna; Isaacs, Farren J; Kelleher, Neil L; Jewett, Michael C
2017-05-01
The ability to site-specifically incorporate non-canonical amino acids (ncAAs) into proteins has made possible the study of protein structure and function in fundamentally new ways, as well as the bio synthesis of unnatural polymers. However, the task of site-specifically incorporating multiple ncAAs into proteins with high purity and yield continues to present a challenge. At the heart of this challenge lies the lower efficiency of engineered orthogonal translation system components compared to their natural counterparts (e.g., translation elements that specifically use a ncAA and do not interact with the cell's natural translation apparatus). Here, we show that evolving and tuning expression levels of multiple components of an engineered translation system together as a whole enhances ncAA incorporation efficiency. Specifically, we increase protein yield when incorporating multiple p-azido-phenylalanine(pAzF) residues into proteins by (i) evolving the Methanocaldococcus jannaschii p-azido-phenylalanyl-tRNA synthetase anti-codon binding domain, (ii) evolving the elongation factor Tu amino acid-binding pocket, and (iii) tuning the expression of evolved translation machinery components in a single vector. Use of the evolved translation machinery in a genomically recoded organism lacking release factor one enabled enhanced multi-site ncAA incorporation into proteins. We anticipate that our approach to orthogonal translation system development will accelerate and expand our ability to site-specifically incorporate multiple ncAAs into proteins and biopolymers, advancing new horizons for synthetic and chemical biotechnology. Biotechnol. Bioeng. 2017;114: 1074-1086. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Complete mitochondrial genome of the agarophyte red alga Gelidium vagum (Gelidiales).
Yang, Eun Chan; Kim, Kyeong Mi; Boo, Ga Hun; Lee, Jung-Hyun; Boo, Sung Min; Yoon, Hwan Su
2014-08-01
We describe the first complete mitochondrial genome of Gelidium vagum (Gelidiales) (24,901 bp, 30.4% GC content), an agar-producing red alga. The circular mitochondrial genome contains 43 genes, including 23 protein-coding, 18 tRNA and 2 rRNA genes. All the protein-coding genes have a typical ATG start codon. No introns were found. Two genes, secY and rps12, were overlapped by 41 bp.
Mutations in the ADAR1 gene in Chinese families with dyschromatosis symmetrica hereditaria.
Zhang, G L; Shi, H J; Shao, M H; Li, M; Mu, H J; Gu, Y; Du, X F; Xie, P
2013-01-04
We investigated 2 Chinese families with dyschromatosis symmetrica hereditaria (DSH) and search for mutations in the adenosine deaminase acting on RNA1 (ADAR1) gene in these 2 pedigrees. We performed a mutation analysis of the ADAR1 gene in 2 Chinese families with DSH and reviewed all articles published regarding ADAR1 mutations reported since 2003 by using PubMed. By direct sequencing, a 2-nucleotide AG deletion, 2099-2100delAG, was found in family 1, and a C→T mutation was identified at nucleotide 1420 that changed codon 474 from arginine to a translational termination codon in family 2. Two different pathogenic mutations were identified, c.2099-2100delAG and c.1420C>T, the former being a novel mutation, and the latter previously reported in 3 other families with DSH. To date, a total of 110 mutations in the ADAR1 gene have been reported, and 10 of them were recurrent; the mutations R474X, R1083C, R1096X, and R1155W might be the DSH-related hotspots.
Ahmad, N N; McDonald-McGinn, D M; Zackai, E H; Knowlton, R G; LaRossa, D; DiMascio, J; Prockop, D J
1993-01-01
Genetic linkage analyses suggest that mutations in type II collagen may be responsible for Stickler syndrome, or arthro-ophthalmopathy (AO), in many families. In the present study oligonucleotide primers were developed to amplify and directly sequence eight of the first nine exons of the gene for type II procollagen (COL2A1). Analysis of the eight exons in 10 unrelated probands with AO revealed that one had a single-base mutation in one allele that changed the codon of -CGA- for arginine at amino acid position alpha 1-9 in exon 7 to a premature termination signal for translation. The second mutation found to cause AO was, therefore, similar to the first in that both created premature termination signals in the COL2A1 gene. Since mutations producing premature termination signals have not previously been detected in genes for fibrillar collagens, the results raise the possibility that such mutations in the COL2A1 gene are a common cause of AO. Images Figure 2 Figure 3 PMID:8434604