in-frame start codon: Topics by Science.gov

Sample records for in-frame start codon

CCC CGA is a weak translational recoding site in Escherichia coli.

PubMed

Shu, Ping; Dai, Huacheng; Mandecki, Wlodek; Goldman, Emanuel

2004-12-08

Previously published experiments had indicated unexpected expression of a control vector in which a beta-galactosidase reporter was in the +1 reading frame relative to the translation start. This control vector contained the codon pair CCC CGA in the zero reading frame, raising the possibility that ribosomes rephased on this sequence, with peptidyl-tRNA(Pro) pairing with CCC in the +1 frame. This putative rephasing might also be exacerbated by the rare CGA Arg codon in the second position due to increased vacancy of the ribosomal A-site. To test this hypothesis, a series of site-directed mutants was constructed, including mutations in both the first and second codons of this codon pair. The results show that interrupting the continuous run of C residues with synonymous codon changes essentially abolishes the frameshift. Further, changing the rare Arg codon to a common Arg codon also reduces the frequency of the frameshift. These results provide strong support for the hypothesis that CCC CGA in the zero frame is indeed a weak translational frameshift site in Escherichia coli, with a 1-2% efficiency. Because the vector sequence also contains another CCC triplet in the +1 reading frame starting within the next codon after the CGA, our data also support possible contribution to expression of a +7 nucleotide ribosome hop into the same +1 reading frame. We also confirm here a previous report that CCC UGA is a translational frameshift site, in these experiments, with about 5% efficiency.
Translation, modification and cellular distribution of two AC4 variants of African cassava mosaic virus in yeast and their pathogenic potential in plants

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hipp, Katharina, E-mail: katharina.hipp@bio.uni-st

Plant infecting geminiviruses encode a small (A)C4 protein within the open reading frame of the replication-initiator protein. In African cassava mosaic virus, two in-frame start codons may be used for the translation of a longer and a shorter AC4 variant. Both were fused to green fluorescent protein or glutathione-S-transferase genes and expressed in fission yeast. The longer variant accumulated in discrete spots in the cytoplasm, whereas the shorter variant localized to the plasma membrane. A similar expression pattern was found in plants. A myristoylation motif may promote a targeting of the shorter variant to the plasma membrane. Mass spectrometry analysismore » of the yeast-expressed shorter variant detected the corresponding myristoylation. The biological relevance of the second start codon was confirmed using mutated infectious clones. Whereas mutating the first start codon had no effect on the infectivity in Nicotiana benthamiana plants, the second start codon proved to be essential. -- Highlights: •The ACMV AC4 may be translated from one or the other in-frame start codon. •Both AC4 variants are translated in fission yeast. •The long AC4 protein localizes to the cytoplasm, the short to the plasma membrane. •The short variant is myristoylated in yeast and may promote membrane localization. •Only the shorter AC4 variant has an impact on viral infections in plants.« less
PreTIS: A Tool to Predict Non-canonical 5’ UTR Translational Initiation Sites in Human and Mouse

PubMed Central

Reuter, Kerstin; Helms, Volkhard

2016-01-01

Translation of mRNA sequences into proteins typically starts at an AUG triplet. In rare cases, translation may also start at alternative non–AUG codons located in the annotated 5’ UTR which leads to an increased regulatory complexity. Since ribosome profiling detects translational start sites at the nucleotide level, the properties of these start sites can then be used for the statistical evaluation of functional open reading frames. We developed a linear regression approach to predict in–frame and out–of–frame translational start sites within the 5’ UTR from mRNA sequence information together with their translation initiation confidence. Predicted start codons comprise AUG as well as near–cognate codons. The underlying datasets are based on published translational start sites for human HEK293 and mouse embryonic stem cells that were derived by the original authors from ribosome profiling data. The average prediction accuracy of true vs. false start sites for HEK293 cells was 80%. When applied to mouse mRNA sequences, the same model predicted translation initiation sites observed in mouse ES cells with an accuracy of 76%. Moreover, we illustrate the effect of in silico mutations in the flanking sequence context of a start site on the predicted initiation confidence. Our new webservice PreTIS visualizes alternative start sites and their respective ORFs and predicts their ability to initiate translation. Solely, the mRNA sequence is required as input. PreTIS is accessible at http://service.bioinformatik.uni-saarland.de/pretis. PMID:27768687
A Non-Canonical Initiation Site Is Required for Efficient Translation of the Dendritically Localized Shank1 mRNA

PubMed Central

Studtmann, Katrin; Ölschläger-Schütt, Janin; Buck, Friedrich; Richter, Dietmar; Sala, Carlo; Bockmann, Jürgen; Kindler, Stefan; Kreienkamp, Hans-Jürgen

2014-01-01

Local protein synthesis in dendrites enables neurons to selectively change the protein complement of individual postsynaptic sites. Though it is generally assumed that this mechanism requires tight translational control of dendritically transported mRNAs, it is unclear how translation of dendritic mRNAs is regulated. We have analyzed here translational control elements of the dendritically localized mRNA coding for the postsynaptic scaffold protein Shank1. In its 5′ region, the human Shank1 mRNA exhibits two alternative translation initiation sites (AUG+1 and AUG+214), three canonical upstream open reading frames (uORFs1-3) and a high GC content. In reporter assays, fragments of the 5′UTR with high GC content inhibit translation, suggesting a contribution of secondary structures. uORF3 is most relevant to translation control as it overlaps with the first in frame start codon (AUG+1), directing translation initiation to the second in frame start codon (AUG+214). Surprisingly, our analysis points to an additional uORF initiated at a non-canonical ACG start codon. Mutation of this start site leads to an almost complete loss of translation initiation at AUG+1, demonstrating that this unconventional uORF is required for Shank1 synthesis. Our data identify a novel mechanism whereby initiation at a non-canonical site allows for translation of the main Shank1 ORF despite a highly structured 5′UTR. PMID:24533096
Translation of vph mRNA in Streptomyces lividans and Escherichia coli after removal of the 5' untranslated leader.

PubMed

Wu, C J; Janssen, G R

1996-10-01

The Streptomyces vinaceus viomycin phosphotransferase (vph) mRNA contains an untranslated leader with a conventional Shine-Dalgarno homology. The vph leader was removed by ligation of the vph coding sequence to the transcriptional start site of a Streptomyces or an Escherichia coli promoter, such that transcription would initiate at the first position of the vph start codon. Analysis of mRNA demonstrated that transcription initiated primarily at the A of the vph AUG translational start codon in both Streptomyces lividans and E. coli; cells expressing the unleadered vph mRNA were resistant to viomycin indicating that the Shine-Dalgarno sequence, or other features contained within the leader, was not necessary for vph translation. Addition of four nucleotides (5'-AUGC-3') onto the 5' end of the unleadered vph mRNA resulted in translation initiation from the vph start codon and the AUG triplet contained within the added sequence. Translational fusions of vph sequence to a Tn5 neo reporter gene indicated that the first 16 codons of vph coding sequence were sufficient to specify the translational start site and reading frame for expression of neomycin resistance in both E. coli and S. lividans.
Reduce Manual Curation by Combining Gene Predictions from Multiple Annotation Engines, a Case Study of Start Codon Prediction

PubMed Central

Ederveen, Thomas H. A.; Overmars, Lex; van Hijum, Sacha A. F. T.

2013-01-01

Nowadays, prokaryotic genomes are sequenced faster than the capacity to manually curate gene annotations. Automated genome annotation engines provide users a straight-forward and complete solution for predicting ORF coordinates and function. For many labs, the use of AGEs is therefore essential to decrease the time necessary for annotating a given prokaryotic genome. However, it is not uncommon for AGEs to provide different and sometimes conflicting predictions. Combining multiple AGEs might allow for more accurate predictions. Here we analyzed the ab initio open reading frame (ORF) calling performance of different AGEs based on curated genome annotations of eight strains from different bacterial species with GC% ranging from 35–52%. We present a case study which demonstrates a novel way of comparative genome annotation, using combinations of AGEs in a pre-defined order (or path) to predict ORF start codons. The order of AGE combinations is from high to low specificity, where the specificity is based on the eight genome annotations. For each AGE combination we are able to derive a so-called projected confidence value, which is the average specificity of ORF start codon prediction based on the eight genomes. The projected confidence enables estimating likeliness of a correct prediction for a particular ORF start codon by a particular AGE combination, pinpointing ORFs notoriously difficult to predict start codons. We correctly predict start codons for 90.5±4.8% of the genes in a genome (based on the eight genomes) with an accuracy of 81.1±7.6%. Our consensus-path methodology allows a marked improvement over majority voting (9.7±4.4%) and with an optimal path ORF start prediction sensitivity is gained while maintaining a high specificity. PMID:23675487
Consequences of germline variation disrupting the constitutional translational initiation codon start sites of MLH1 and BRCA2: use of potential alternative start sites and implications for predicting variant pathogenicity

PubMed Central

Parsons, Michael T.; Whiley, Phillip J.; Beesley, Jonathan; Drost, Mark; de Wind, Niels; Thompson, Bryony A.; Marquart, Louise; Hopper, John L.; Jenkins, Mark A.; Brown, Melissa A.; Tucker, Kathy; Warwick, Linda; Buchanan, Daniel D.; Spurdle, Amanda B.

2014-01-01

Variants that disrupt the translation initiation sequences in cancer predisposition genes are generally assumed to be deleterious. However few studies have validated these assumptions with functional and clinical data. Two cancer syndrome gene variants likely to affect native translation initiation were identified by clinical genetic testing: MLH1:c.1A>G p.(Met1?) and BRCA2:c.67+3A>G. In vitro GFP-reporter assays were conducted to assess the consequences of translation initiation disruption on alternative downstream initiation codon usage. Analysis of MLH1:c.1A>G p.(Met1?) showed that translation was mostly initiated at an in-frame position 103 nucleotides downstream, but also at two ATG sequences downstream. The protein product encoded by the in-frame transcript initiating from position c.103 showed loss of in vitro mismatch repair activity comparable to known pathogenic mutations. BRCA2:c.67+3A>G was shown by mRNA analysis to result in an aberrantly spliced transcript deleting exon 2 and the consensus ATG site. In the absence of exon 2, translation initiated mostly at an out-of-frame ATG 323 nucleotides downstream, and to a lesser extent at an in-frame ATG 370 nucleotides downstream. Initiation from any of the downstream alternative sites tested in both genes would lead to loss of protein function, but further clinical data is required to confirm if these variants are associated with a high cancer risk. Importantly, our results highlight the need for caution in interpreting the functional and clinical consequences of variation that leads to disruption of the initiation codon, since translation may not necessarily occur from the first downstream alternative start site, or from a single alternative start site. PMID:24302565
New Universal Rules of Eukaryotic Translation Initiation Fidelity

PubMed Central

Zur, Hadas; Tuller, Tamir

2013-01-01

The accepted model of eukaryotic translation initiation begins with the scanning of the transcript by the pre-initiation complex from the 5′end until an ATG codon with a specific nucleotide (nt) context surrounding it is recognized (Kozak rule). According to this model, ATG codons upstream to the beginning of the ORF should affect translation. We perform for the first time, a genome-wide statistical analysis, uncovering a new, more comprehensive and quantitative, set of initiation rules for improving the cost of translation and its efficiency. Analyzing dozens of eukaryotic genomes, we find that in all frames there is a universal trend of selection for low numbers of ATG codons; specifically, 16–27 codons upstream, but also 5–11 codons downstream of the START ATG, include less ATG codons than expected. We further suggest that there is selection for anti optimal ATG contexts in the vicinity of the START ATG. Thus, the efficiency and fidelity of translation initiation is encoded in the 5′UTR as required by the scanning model, but also at the beginning of the ORF. The observed nt patterns suggest that in all the analyzed organisms the pre-initiation complex often misses the START ATG of the ORF, and may start translation from an alternative initiation start-site. Thus, to prevent the translation of undesired proteins, there is selection for nucleotide sequences with low affinity to the pre-initiation complex near the beginning of the ORF. With the new suggested rules we were able to obtain a twice higher correlation with ribosomal density and protein levels in comparison to the Kozak rule alone (e.g. for protein levels r = 0.7 vs. r = 0.31; p<10−12). PMID:23874179
The Effect of an Alternate Start Codon on Heterologous Expression of a PhoA Fusion Protein in Mycoplasma gallisepticum

PubMed Central

Panicker, Indu S.; Browning, Glenn F.; Markham, Philip F.

2015-01-01

While the genomes of many Mycoplasma species have been sequenced, there are no collated data on translational start codon usage, and the effects of alternate start codons on gene expression have not been studied. Analysis of the annotated genomes found that ATG was the most prevalent translational start codon among Mycoplasma spp. However in Mycoplasma gallisepticum a GTG start codon is commonly used in the vlhA multigene family, which encodes a highly abundant, phase variable lipoprotein adhesin. Therefore, the effect of this alternate start codon on expression of a reporter PhoA lipoprotein was examined in M. gallisepticum. Mutation of the start codon from ATG to GTG resulted in a 2.5 fold reduction in the level of transcription of the phoA reporter, but the level of PhoA activity in the transformants containing phoA with a GTG start codon was only 63% of that of the transformants with a phoA with an ATG start codon, suggesting that GTG was a more efficient translational initiation codon. The effect of swapping the translational start codon in phoA reporter gene expression was less in M. gallisepticum than has been seen previously in Escherichia coli or Bacillus subtilis, suggesting the process of translational initiation in mycoplasmas may have some significant differences from those used in other bacteria. This is the first study of translational start codon usage in mycoplasmas and the impact of the use of an alternate start codon on expression in these bacteria. PMID:26010086
The highly conserved codon following the slippery sequence supports -1 frameshift efficiency at the HIV-1 frameshift site.

PubMed

Mathew, Suneeth F; Crowe-McAuliffe, Caillan; Graves, Ryan; Cardno, Tony S; McKinney, Cushla; Poole, Elizabeth S; Tate, Warren P

2015-01-01

HIV-1 utilises -1 programmed ribosomal frameshifting to translate structural and enzymatic domains in a defined proportion required for replication. A slippery sequence, U UUU UUA, and a stem-loop are well-defined RNA features modulating -1 frameshifting in HIV-1. The GGG glycine codon immediately following the slippery sequence (the 'intercodon') contributes structurally to the start of the stem-loop but has no defined role in current models of the frameshift mechanism, as slippage is inferred to occur before the intercodon has reached the ribosomal decoding site. This GGG codon is highly conserved in natural isolates of HIV. When the natural intercodon was replaced with a stop codon two different decoding molecules-eRF1 protein or a cognate suppressor tRNA-were able to access and decode the intercodon prior to -1 frameshifting. This implies significant slippage occurs when the intercodon is in the (perhaps distorted) ribosomal A site. We accommodate the influence of the intercodon in a model of frame maintenance versus frameshifting in HIV-1.
A transducer for microbial sensory rhodopsin that adopts GTG as a start codon is identified in Haloarcula marismortui.

PubMed

Fu, Hsu-Yuan; Lu, Yen-Hsu; Yi, Hsiu-Ping; Yang, Chii-Shen

2013-04-05

Microbial sensory rhodopsins are known to mediate phototaxis, and all of the known sensory rhodopsins execute this function with a specific cognate transducer that has two-transmembrane (2-TM) regions. In the genome of Haloarcula marismortui, a total of six rhodopsin genes were annotated, and we previously showed three of them to be the ion type and suggested the other three as sensory type, even though the candidate transducer gene, htr, for HmSRI was missing the 2-TM region that is found in all of the other known transducers. Here we showed this htr gene featured a preceding 2-TM region when the alternative start codon GTG located 291 nucleotides upstream of the original annotated open reading frame (ORF) was introduced and it is named as htrI in this study. Overexpression of HmHtrI exhibited it existed as a membrane protein and several biophysical assays confirmed it functionally interacted with HmSRI. Together with our previous reverse-transcriptase-PCR results and phototaxis measurements, the new ORF of original predicted soluble htr gene product was a membrane protein with a 2-TM region, HmHtrI; and it serves as the cognate transducer for HmSRI. HmHtrI therefore is the first transducer for the sensory rhodopsin adopted start codon other than ATG. Copyright © 2013 Elsevier B.V. All rights reserved.
High-level tetracycline resistance mediated by efflux pumps Tet(A) and Tet(A)-1 with two start codons.

PubMed

Wang, Weixia; Guo, Qinglan; Xu, Xiaogang; Sheng, Zi-ke; Ye, Xinyu; Wang, Minggui

2014-11-01

Efflux is the most common mechanism of tetracycline resistance. Class A tetracycline efflux pumps, which often have high prevalence in Enterobacteriaceae, are encoded by tet(A) and tet(A)-1 genes. These genes have two potential start codons, GTG and ATG, located upstream of the genes. The purpose of this study was to determine the start codon(s) of the class A tetracycline resistance (tet) determinants tet(A) and tet(A)-1, and the tetracycline resistance level they mediated. Conjugation, transformation and cloning experiments were performed and the genetic environment of tet(A)-1 was analysed. The start codons in class A tet determinants were investigated by site-directed mutagenesis of ATG and GTG, the putative translation initiation codons. High-level tetracycline resistance was transferred from the clinical strain of Klebsiella pneumoniae 10-148 containing tet(A)-1 plasmid pHS27 to Escherichia coli J53 by conjugation. The transformants harbouring recombinant plasmids that carried tet(A) or tet(A)-1 exhibited tetracycline MICs of 256-512 µg ml(-1), with or without tetR(A). Once the ATG was mutated to a non-start codon, the tetracycline MICs were not changed, while the tetracycline MICs decreased from 512 to 64 µg ml(-1) following GTG mutation, and to ≤4 µg ml(-1) following mutation of both GTG and ATG. It was presumed that class A tet determinants had two start codons, which are the primary start codon GTG and secondary start codon ATG. Accordingly, two putative promoters were predicted. In conclusion, class A tet determinants can confer high-level tetracycline resistance and have two start codons. © 2014 The Authors.
Frame-Insensitive Expression Cloning of Fluorescent Protein from Scolionema suvaense.

PubMed

Horiuchi, Yuki; Laskaratou, Danai; Sliwa, Michel; Ruckebusch, Cyril; Hatori, Kuniyuki; Mizuno, Hideaki; Hotta, Jun-Ichi

2018-01-26

Expression cloning from cDNA is an important technique for acquiring genes encoding novel fluorescent proteins. However, the probability of in-frame cDNA insertion following the first start codon of the vector is normally only 1/3, which is a cause of low cloning efficiency. To overcome this issue, we developed a new expression plasmid vector, pRSET-TriEX, in which transcriptional slippage was induced by introducing a DNA sequence of (dT) 14 next to the first start codon of pRSET. The effectiveness of frame-insensitive cloning was validated by inserting the gene encoding eGFP with all three possible frames to the vector. After transformation with one of these plasmids, E. coli cells expressed eGFP with no significant difference in the expression level. The pRSET-TriEX vector was then used for expression cloning of a novel fluorescent protein from Scolionema suvaense . We screened 3658 E. coli colonies transformed with pRSET-TriEX containing Scolionema suvaense cDNA, and found one colony expressing a novel green fluorescent protein, ScSuFP. The highest score in protein sequence similarity was 42% with the chain c of multi-domain green fluorescent protein like protein "ember" from Anthoathecata sp. Variations in the N- and/or C-terminal sequence of ScSuFP compared to other fluorescent proteins indicate that the expression cloning, rather than the sequence similarity-based methods, was crucial for acquiring the gene encoding ScSuFP. The absorption maximum was at 498 nm, with an extinction efficiency of 1.17 × 10⁵ M -1 ·cm -1 . The emission maximum was at 511 nm and the fluorescence quantum yield was determined to be 0.6. Pseudo-native gel electrophoresis showed that the protein forms obligatory homodimers.
Efficient initiation of mammalian mRNA translation at a CUG codon.

PubMed Central

Dasso, M C; Jackson, R J

1989-01-01

Nucleotide substitutions were made at the initiation codon of an influenza virus NS cDNA clone in a vector carrying the bacteriophage T7 promoter. When capped mRNA transcripts of these constructs were translated in the rabbit reticulocyte lysate, a change in the initiation codon from...AUAAUGG...to...AUACUGG...reduced the in vitro translational efficiency by only 50-60%, and resulted in only a small increase in the yield of short products presumed to be initiated at downstream sites. Synthesis of the full-length product was initiated exclusively at the mutated codon, with negligible use either of in-frame upstream CUG or GUG codons, or of an in-frame downstream GUG codon. We conclude that CUG has the potential to function as an efficient initiation codon in mammalian systems, at least in certain contexts. Images PMID:2780285
RNA editing makes mistakes in plant mitochondria: editing loses sense in transcripts of a rps19 pseudogene and in creating stop codons in coxI and rps3 mRNAs of Oenothera.

PubMed Central

Schuster, W; Brennicke, A

1991-01-01

An intact gene for the ribosomal protein S19 (rps19) is absent from Oenothera mitochondria. The conserved rps19 reading frame found in the mitochondrial genome is interrupted by a termination codon. This rps19 pseudogene is cotranscribed with the downstream rps3 gene and is edited on both sides of the translational stop. Editing, however, changes the amino acid sequence at positions that were well conserved before editing. Other strange editings create translational stops in open reading frames coding for functional proteins. In coxI and rps3 mRNAs CGA codons are edited to UGA stop codons only five and three codons, respectively, downstream to the initiation codon. These aberrant editings in essential open reading frames and in the rps19 pseudogene appear to have been shifted to these positions from other editing sites. These observations suggest a requirement for a continuous evolutionary constraint on the editing specificities in plant mitochondria. Images PMID:1762921
An Out-of-frame Overlapping Reading Frame in the Ataxin-1 Coding Sequence Encodes a Novel Ataxin-1 Interacting Protein*

PubMed Central

Bergeron, Danny; Lapointe, Catherine; Bissonnette, Cyntia; Tremblay, Guillaume; Motard, Julie; Roucou, Xavier

2013-01-01

Spinocerebellar ataxia type 1 is an autosomal dominant cerebellar ataxia associated with the expansion of a polyglutamine tract within the ataxin-1 (ATXN1) protein. Recent studies suggest that understanding the normal function of ATXN1 in cellular processes is essential to decipher the pathogenesis mechanisms in spinocerebellar ataxia type 1. We found an alternative translation initiation ATG codon in the +3 reading frame of human ATXN1 starting 30 nucleotides downstream of the initiation codon for ATXN1 and ending at nucleotide 587. This novel overlapping open reading frame (ORF) encodes a 21-kDa polypeptide termed Alt-ATXN1 (Alternative ATXN1) with a completely different amino acid sequence from ATXN1. We introduced a hemagglutinin tag in-frame with Alt-ATXN1 in ATXN1 cDNA and showed in cell culture the co-expression of both ATXN1 and Alt-ATXN1. Remarkably, Alt-ATXN1 colocalized and interacted with ATXN1 in nuclear inclusions. In contrast, in the absence of ATXN1 expression, Alt-ATXN1 displays a homogenous nucleoplasmic distribution. Alt-ATXN1 interacts with poly(A)+ RNA, and its nuclear localization is dependent on RNA transcription. Polyclonal antibodies raised against Alt-ATXN1 confirmed the expression of Alt-ATXN1 in human cerebellum expressing ATXN1. These results demonstrate that human ATXN1 gene is a dual coding sequence and that ATXN1 interacts with and controls the subcellular distribution of Alt-ATXN1. PMID:23760502
The helicase Ded1p controls use of near-cognate translation initiation codons in 5' UTRs.

PubMed

Guenther, Ulf-Peter; Weinberg, David E; Zubradt, Meghan M; Tedeschi, Frank A; Stawicki, Brittany N; Zagore, Leah L; Brar, Gloria A; Licatalosi, Donny D; Bartel, David P; Weissman, Jonathan S; Jankowsky, Eckhard

2018-06-27

The conserved and essential DEAD-box RNA helicase Ded1p from yeast and its mammalian orthologue DDX3 are critical for the initiation of translation 1 . Mutations in DDX3 are linked to tumorigenesis 2-4 and intellectual disability 5 , and the enzyme is targeted by a range of viruses 6 . How Ded1p and its orthologues engage RNAs during the initiation of translation is unknown. Here we show, by integrating transcriptome-wide analyses of translation, RNA structure and Ded1p-RNA binding, that the effects of Ded1p on the initiation of translation are connected to near-cognate initiation codons in 5' untranslated regions. Ded1p associates with the translation pre-initiation complex at the mRNA entry channel and repressing the activity of Ded1p leads to the accumulation of RNA structure in 5' untranslated regions, the initiation of translation from near-cognate start codons immediately upstream of these structures and decreased protein synthesis from the corresponding main open reading frames. The data reveal a program for the regulation of translation that links Ded1p, the activation of near-cognate start codons and mRNA structure. This program has a role in meiosis, in which a marked decrease in the levels of Ded1p is accompanied by the activation of the alternative translation initiation sites that are seen when the activity of Ded1p is repressed. Our observations indicate that Ded1p affects translation initiation by controlling the use of near-cognate initiation codons that are proximal to mRNA structure in 5' untranslated regions.
Circ-ZNF609 Is a Circular RNA that Can Be Translated and Functions in Myogenesis.

PubMed

Legnini, Ivano; Di Timoteo, Gaia; Rossi, Francesca; Morlando, Mariangela; Briganti, Francesca; Sthandier, Olga; Fatica, Alessandro; Santini, Tiziana; Andronache, Adrian; Wade, Mark; Laneve, Pietro; Rajewsky, Nikolaus; Bozzoni, Irene

2017-04-06

Circular RNAs (circRNAs) constitute a family of transcripts with unique structures and still largely unknown functions. Their biogenesis, which proceeds via a back-splicing reaction, is fairly well characterized, whereas their role in the modulation of physiologically relevant processes is still unclear. Here we performed expression profiling of circRNAs during in vitro differentiation of murine and human myoblasts, and we identified conserved species regulated in myogenesis and altered in Duchenne muscular dystrophy. A high-content functional genomic screen allowed the study of their functional role in muscle differentiation. One of them, circ-ZNF609, resulted in specifically controlling myoblast proliferation. Circ-ZNF609 contains an open reading frame spanning from the start codon, in common with the linear transcript, and terminating at an in-frame STOP codon, created upon circularization. Circ-ZNF609 is associated with heavy polysomes, and it is translated into a protein in a splicing-dependent and cap-independent manner, providing an example of a protein-coding circRNA in eukaryotes. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Identification of the initiation site of poliovirus polyprotein synthesis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Dorner, A.J.; Dorner, L.F.; Larsen, G.R.

1982-06-01

The complete nucleotide sequence of poliovirus RNA has a long open reading frame capable of encoding the precursor polyprotein NCVPOO. The first AUG codon in this reading frame is located 743 nucleotides from the 5' end of the RNA and is preceded by eight AUG codons in all three reading frames. Because all proteins that map at the amino terminus of the polyprotein (P1-1a, VPO, and VP4) are blocked at their amino termini and previous studies of ribosome binding have been inconclusive, direct identification of the initiation site of protein synthesis was difficult. We separated and identified all of themore » tryptic peptides of capsid protein VP4 and correlated these peptides with the amino acid sequence predicted to follow the AUG codon at nucleotide 743. Our data indicate that VP4 begins with a blocked glycine that is encoded immediately after the AUG codon at nucleotide 743. An S1 nuclease analysis of poliovirus mRNA failed to reveal a splice in the 5' region. We concluded that synthesis of poliovirus polyprotein is initiated at nucleotide 743, the first AUG codon in the long open reading frame.« less
The positive regulatory function of the 5'-proximal open reading frames in GCN4 mRNA can be mimicked by heterologous, short coding sequences.

PubMed Central

Williams, N P; Mueller, P P; Hinnebusch, A G

1988-01-01

Translational control of GCN4 expression in the yeast Saccharomyces cerevisiae is mediated by multiple AUG codons present in the leader of GCN4 mRNA, each of which initiates a short open reading frame of only two or three codons. Upstream AUG codons 3 and 4 are required to repress GCN4 expression in normal growth conditions; AUG codons 1 and 2 are needed to overcome this repression in amino acid starvation conditions. We show that the regulatory function of AUG codons 1 and 2 can be qualitatively mimicked by the AUG codons of two heterologous upstream open reading frames (URFs) containing the initiation regions of the yeast genes PGK and TRP1. These AUG codons inhibit GCN4 expression when present singly in the mRNA leader; however, they stimulate GCN4 expression in derepressing conditions when inserted upstream from AUG codons 3 and 4. This finding supports the idea that AUG codons 1 and 2 function in the control mechanism as translation initiation sites and further suggests that suppression of the inhibitory effects of AUG codons 3 and 4 is a general consequence of the translation of URF 1 and 2 sequences upstream. Several observations suggest that AUG codons 3 and 4 are efficient initiation sites; however, these sequences do not act as positive regulatory elements when placed upstream from URF 1. This result suggests that efficient translation is only one of the important properties of the 5' proximal URFs in GCN4 mRNA. We propose that a second property is the ability to permit reinitiation following termination of translation and that URF 1 is optimized for this regulatory function. Images PMID:3065626

The Enterococcus faecalis EbpA Pilus Protein: Attenuation of Expression, Biofilm Formation, and Adherence to Fibrinogen Start with the Rare Initiation Codon ATT

PubMed Central

Montealegre, Maria Camila; La Rosa, Sabina Leanti; Roh, Jung Hyeob; Harvey, Barrett R.

2015-01-01

ABSTRACT The endocarditis and biofilm-associated pili (Ebp) are important in Enterococcus faecalis pathogenesis, and the pilus tip, EbpA, has been shown to play a major role in pilus biogenesis, biofilm formation, and experimental infections. Based on in silico analyses, we previously predicted that ATT is the EbpA translational start codon, not the ATG codon, 120 bp downstream of ATT, which is annotated as the translational start. ATT is rarely used to initiate protein synthesis, leading to our hypothesis that this codon participates in translational regulation of Ebp production. To investigate this possibility, site-directed mutagenesis was used to introduce consecutive stop codons in place of two lysines at positions 5 and 6 from the ATT, to replace the ATT codon in situ with ATG, and then to revert this ATG to ATT; translational fusions of ebpA to lacZ were also constructed to investigate the effect of these start codons on translation. Our results showed that the annotated ATG does not start translation of EbpA, implicating ATT as the start codon; moreover, the presence of ATT, compared to the engineered ATG, resulted in significantly decreased EbpA surface display, attenuated biofilm, and reduced adherence to fibrinogen. Corroborating these findings, the translational fusion with the native ATT as the initiation codon showed significantly decreased expression of β-galactosidase compared to the construct with ATG in place of ATT. Thus, these results demonstrate that the rare initiation codon of EbpA negatively regulates EbpA surface display and negatively affects Ebp-associated functions, including biofilm and adherence to fibrinogen. PMID:26015496
Protein expression of preferred human codon-optimized Gaussia luciferase genes with an artificial open-reading frame in mammalian and bacterial cells.

PubMed

Inouye, Satoshi; Suzuki, Takahiro

2016-12-01

The protein expressions of three preferred human codon-optimized Gaussia luciferase genes (pGLuc, EpGLuc, and KpGLuc) were characterized in mammalian and bacterial cells by comparing them with those of wild-type Gaussia luciferase gene (wGLuc) and human codon-optimized Gaussia luciferase gene (hGLuc). Two synthetic genes of EpGLuc and KpGLuc containing the complete preferred human codons have an artificial open-reading frame; however, they had the similar protein expression levels to those of pGLuc and hGLuc in mammalian cells. In bacterial cells, the protein expressions of pGLuc, EpGLuc, and KpGLuc with approximately 65% GC content were the same and showed approximately 60% activities of wGLuc and hGLuc. The artificial open-reading frame in EpGLuc and KpGLuc did not affect the protein expression in mammalian and bacterial cells. Copyright © 2016 Elsevier Inc. All rights reserved.
The primary structure of the Saccharomyces cerevisiae gene for 3-phosphoglycerate kinase.

PubMed Central

Hitzeman, R A; Hagie, F E; Hayflick, J S; Chen, C Y; Seeburg, P H; Derynck, R

1982-01-01

The DNA sequence of the gene for the yeast glycolytic enzyme, 3-phosphoglycerate kinase (PGK), has been obtained by sequencing part of a 3.1 kbp HindIII fragment obtained from the yeast genome. The structural gene sequence corresponds to a reading frame of 1251 bp coding for 416 amino acids with no intervening DNA sequences. The amino acid sequence is approximately 65 percent homologous with human and horse PGK protein sequences and is in general agreement with the published protein sequence for yeast PGK. As for other highly expressed structural genes in yeast, the coding sequence is highly codon biased with 95 percent of the amino acids coded for by a select 25 codons (out of 61 possible). Besides structural DNA sequence, 291 bp of 5'-flanking sequence and 286 bp of 3'-flanking sequence were determined. Transcription starts 36 nucleotides upstream from the translational start and stops 86-93 nucleotides downstream from the translational stop. These results suggest a non-polyadenylated mRNA length of 1373 to 1380 nucleotides, which is consistent with the observed length of 1500 nucleotides for polyadenylated PGK mRNA. A sequence TATATATAAA is found at 145 nucleotides upstream from the translational start. This sequence resembles the TATAAA box that is possibly associated with RNA polymerase II binding. Images PMID:6296791
Large-scale, multi-genome analysis of alternate open reading frames in bacteria and archaea.

PubMed

Veloso, Felipe; Riadi, Gonzalo; Aliaga, Daniela; Lieph, Ryan; Holmes, David S

2005-01-01

Analysis of over 300,000 annotated genes in 105 bacterial and archaeal genomes reveals an unexpectedly high frequency of large (>300 nucleotides) alternate open reading frames (ORFs). Especially notable is the very high frequency of alternate ORFs in frames +3 and -1 (where the annotated gene is defined as frame +1). The occurrence of alternate ORFs is correlated with genomic G+C content and is strongly influenced by synonymous codon usage bias. The frequency of alternate ORFs in frame -1 is also influenced by the occurrence of codons encoding leucine and serine in frame +1. Although some alternate ORFs have been shown to encode proteins, many others are probably not expressed because they lack appropriate signals for transcription and translation. These latter can be mis-annotated by automatic gene finding programs leading to errors in public databases. Especially prone to mis-annotation is frame -1, because it exhibits a potential codon usage and theoretical capacity to encode proteins with an amino acid composition most similar to real genes. Some alternate ORFs are conserved across bacterial or archaeal species, and can give rise to misannotated "conserved hypothetical" genes, while others are unique to a genome and are misidentified as "hypothetical orphan" genes, contributing significantly to the orphan gene paradox.
Extensive frameshift at all AGG and CCC codons in the mitochondrial cytochrome c oxidase subunit 1 gene of Perkinsus marinus (Alveolata; Dinoflagellata).

PubMed

Masuda, Isao; Matsuzaki, Motomichi; Kita, Kiyoshi

2010-10-01

Diverse mitochondrial (mt) genetic systems have evolved independently of the more uniform nuclear system and often employ modified genetic codes. The organization and genetic system of dinoflagellate mt genomes are particularly unusual and remain an evolutionary enigma. We determined the sequence of full-length cytochrome c oxidase subunit 1 (cox1) mRNA of the earliest diverging dinoflagellate Perkinsus and show that this gene resides in the mt genome. Apparently, this mRNA is not translated in a single reading frame with standard codon usage. Our examination of the nucleotide sequence and three-frame translation of the mRNA suggest that the reading frame must be shifted 10 times, at every AGG and CCC codon, to yield a consensus COX1 protein. We suggest two possible mechanisms for these translational frameshifts: a ribosomal frameshift in which stalled ribosomes skip the first bases of these codons or specialized tRNAs recognizing non-triplet codons, AGGY and CCCCU. Regardless of the mechanism, active and efficient machinery would be required to tolerate the frameshifts predicted in Perkinsus mitochondria. To our knowledge, this is the first evidence of translational frameshifts in protist mitochondria and, by far, is the most extensive case in mitochondria.
Nucleotide sequence and transcriptional start site of the Methylobacterium organophilum XX methanol dehydrogenase structural gene

DOE Office of Scientific and Technical Information (OSTI.GOV)

Machlin, S.M.; Hanson, R.S.

The nucleotide sequence of a cloned 2.5-kilobase-pair SmaI fragment containing the methanol dehydrogenase (MDH) structural gene from Methylobacterium organophilum XX was determined. A single open reading frame with a coding capacity of 626 amino acids (molecular weight, 66,000) was identified on one stand, and N-terminal sequencing of purified MDH revealed that 27 of these residues constituted a putative signal peptide. Primer extension mapping of in vivo transcripts indicated that the start of mRNA synthesis was 160 to 170 base pairs upstream of the ATG codon. Northern (RNA) blot analysis further demonstrated that the transcript was 2.1 kilobase pairs in lengthmore » and therefore appeared to encode only MDH.« less
Translation of the first upstream ORF in the hepatitis B virus pregenomic RNA modulates translation at the core and polymerase initiation codons

PubMed Central

Chen, Augustine; Kao, Y. F.; Brown, Chris M.

2005-01-01

The human hepatitis B virus (HBV) has a compact genome encoding four major overlapping coding regions: the core, polymerase, surface and X. The polymerase initiation codon is preceded by the partially overlapping core and four or more upstream initiation codons. There is evidence that several mechanisms are used to enable the synthesis of the polymerase protein, including leaky scanning and ribosome reinitiation. We have examined the first AUG in the pregenomic RNA, it precedes that of the core. It initiates an uncharacterized short upstream open reading frame (uORF), highly conserved in all HBV subtypes, we designated the C0 ORF. This arrangement suggested that expression of the core and polymerase may be affected by this uORF. Initiation at the C0 ORF was confirmed in reporter constructs in transfected cells. The C0 ORF had an inhibitory role in downstream expression from the core initiation site in HepG2 cells and in vitro, but also stimulated reinitiation at the polymerase start when in an optimal context. Our results indicate that the C0 ORF is a determinant in balancing the synthesis of the core and polymerase proteins. PMID:15731337
Purification and characterization of an endoglucanase from Streptomyces lividans 66 and DNA sequence of the gene.

PubMed Central

Théberge, M; Lacaze, P; Shareck, F; Morosoli, R; Kluepfel, D

1992-01-01

The endoglucanase isolated from culture filtrates of Streptomyces lividans IAF74 was shown to have an Mr of 46,000 and a pI of 3.3. The specific enzyme activity of 539 IU/mg, determined by the reducing assay method on carboxymethyl cellulose, is among the highest reported in the literature. The cellulase showed typical endo-type activity when reacting on oligocellodextrins. Optimal enzyme activity was obtained at 50 degrees C and pH 5.5. The kinetic constants for this endoglucanase, determined with carboxymethyl cellulose as the substrate, were a Vmax of 24.9 IU/mg of enzyme and a Km of 4.2 mg/ml. Activity was found against neither methylumbelliferyl- nor p-nitrophenyl-cellobiopyranoside nor with xylan. The DNA sequence contains one possible reading frame validated by the N terminus of the mature purified protein. However, neither ATG nor GTG starting codons were identified near the ribosome-binding site. A putative TTG codon was found as a good candidate for the start codon. Comparison of the primary amino acid sequence of the endoglucanase of S. lividans revealed that the N terminus contains a bacterial cellulose-binding domain. The catalytic domain at the C terminus showed similarity to endoglucanases from a Bacillus sp. Thus, the endoglucanase CelA belongs to family A of cellulases as described before (N. R. Gilkes, B. Henrissat, D. G. Kilburn, R. C. Miller, Jr., and R. A. J. Warren, Microbiol. Rev. 55:303-315, 1991. Images PMID:1575483
Genetic hotels for the standard genetic code: evolutionary analysis based upon novel three-dimensional algebraic models.

PubMed

José, Marco V; Morgado, Eberto R; Govezensky, Tzipe

2011-07-01

Herein, we rigorously develop novel 3-dimensional algebraic models called Genetic Hotels of the Standard Genetic Code (SGC). We start by considering the primeval RNA genetic code which consists of the 16 codons of type RNY (purine-any base-pyrimidine). Using simple algebraic operations, we show how the RNA code could have evolved toward the current SGC via two different intermediate evolutionary stages called Extended RNA code type I and II. By rotations or translations of the subset RNY, we arrive at the SGC via the former (type I) or via the latter (type II), respectively. Biologically, the Extended RNA code type I, consists of all codons of the type RNY plus codons obtained by considering the RNA code but in the second (NYR type) and third (YRN type) reading frames. The Extended RNA code type II, comprises all codons of the type RNY plus codons that arise from transversions of the RNA code in the first (YNY type) and third (RNR) nucleotide bases. Since the dimensions of remarkable subsets of the Genetic Hotels are not necessarily integer numbers, we also introduce the concept of algebraic fractal dimension. A general decoding function which maps each codon to its corresponding amino acid or the stop signals is also derived. The Phenotypic Hotel of amino acids is also illustrated. The proposed evolutionary paths are discussed in terms of the existing theories of the evolution of the SGC. The adoption of 3-dimensional models of the Genetic and Phenotypic Hotels will facilitate the understanding of the biological properties of the SGC.
Negative and Translation Termination-Dependent Positive Control of FLI-1 Protein Synthesis by Conserved Overlapping 5′ Upstream Open Reading Frames in Fli-1 mRNA

PubMed Central

Sarrazin, Sandrine; Starck, Joëlle; Gonnet, Colette; Doubeikovski, Alexandre; Melet, Fabrice; Morle, François

2000-01-01

The proto-oncogene Fli-1 encodes a transcription factor of the ets family whose overexpression is associated with multiple virally induced leukemias in mouse, inhibits murine and avian erythroid cell differentiation, and induces drastic perturbations of early development in Xenopus. This study demonstrates the surprisingly sophisticated regulation of Fli-1 mRNA translation. We establish that two FLI-1 protein isoforms (of 51 and 48 kDa) detected by Western blotting in vivo are synthesized by alternative translation initiation through the use of two highly conserved in-frame initiation codons, AUG +1 and AUG +100. Furthermore, we show that the synthesis of these two FLI-1 isoforms is regulated by two short overlapping 5′ upstream open reading frames (uORF) beginning at two highly conserved upstream initiation codons, AUG −41 and GUG −37, and terminating at two highly conserved stop codons, UGA +35 and UAA +15. The mutational analysis of these two 5′ uORF revealed that each of them negatively regulates FLI-1 protein synthesis by precluding cap-dependent scanning to the 48- and 51-kDa AUG codons. Simultaneously, the translation termination of the two 5′ uORF appears to enhance 48-kDa protein synthesis, by allowing downstream reinitiation at the 48-kDa AUG codon, and 51-kDa protein synthesis, by allowing scanning ribosomes to pile up and consequently allowing upstream initiation at the 51-kDa AUG codon. To our knowledge, this is the first example of a cellular mRNA displaying overlapping 5′ uORF whose translation termination appears to be involved in the positive control of translation initiation at both downstream and upstream initiation codons. PMID:10757781
Bacterial genomes lacking long-range correlations may not be modeled by low-order Markov chains: the role of mixing statistics and frame shift of neighboring genes.

PubMed

Cocho, Germinal; Miramontes, Pedro; Mansilla, Ricardo; Li, Wentian

2014-12-01

We examine the relationship between exponential correlation functions and Markov models in a bacterial genome in detail. Despite the well known fact that Markov models generate sequences with correlation function that decays exponentially, simply constructed Markov models based on nearest-neighbor dimer (first-order), trimer (second-order), up to hexamer (fifth-order), and treating the DNA sequence as being homogeneous all fail to predict the value of exponential decay rate. Even reading-frame-specific Markov models (both first- and fifth-order) could not explain the fact that the exponential decay is very slow. Starting with the in-phase coding-DNA-sequence (CDS), we investigated correlation within a fixed-codon-position subsequence, and in artificially constructed sequences by packing CDSs with out-of-phase spacers, as well as altering CDS length distribution by imposing an upper limit. From these targeted analyses, we conclude that the correlation in the bacterial genomic sequence is mainly due to a mixing of heterogeneous statistics at different codon positions, and the decay of correlation is due to the possible out-of-phase between neighboring CDSs. There are also small contributions to the correlation from bases at the same codon position, as well as by non-coding sequences. These show that the seemingly simple exponential correlation functions in bacterial genome hide a complexity in correlation structure which is not suitable for a modeling by Markov chain in a homogeneous sequence. Other results include: use of the (absolute value) second largest eigenvalue to represent the 16 correlation functions and the prediction of a 10-11 base periodicity from the hexamer frequencies. Copyright © 2014 Elsevier Ltd. All rights reserved.
uORFs with unusual translational start codons autoregulate expression of eukaryotic ornithine decarboxylase homologs

PubMed Central

Ivanov, Ivaylo P.; Loughran, Gary; Atkins, John F.

2008-01-01

In a minority of eukaryotic mRNAs, a small functional upstream ORF (uORF), often performing a regulatory role, precedes the translation start site for the main product(s). Here, conserved uORFs in numerous ornithine decarboxylase homologs are identified from yeast to mammals. Most have noncanonical evolutionarily conserved start codons, the main one being AUU, which has not been known as an initiator for eukaryotic chromosomal genes. The AUG-less uORF present in mouse antizyme inhibitor, one of the ornithine decarboxylase homologs in mammals, mediates polyamine-induced repression of the downstream main ORF. This repression is part of an autoregulatory circuit, and one of its sensors is the AUU codon, which suggests that translation initiation codon identity is likely used for regulation in eukaryotes. PMID:18626014
Codon Optimization of the Human Papillomavirus E7 Oncogene Induces a CD8+ T Cell Response to a Cryptic Epitope Not Harbored by Wild-Type E7

PubMed Central

Lorenz, Felix K. M.; Wilde, Susanne; Voigt, Katrin; Kieback, Elisa; Mosetter, Barbara; Schendel, Dolores J.; Uckert, Wolfgang

2015-01-01

Codon optimization of nucleotide sequences is a widely used method to achieve high levels of transgene expression for basic and clinical research. Until now, immunological side effects have not been described. To trigger T cell responses against human papillomavirus, we incubated T cells with dendritic cells that were pulsed with RNA encoding the codon-optimized E7 oncogene. All T cell receptors isolated from responding T cell clones recognized target cells expressing the codon-optimized E7 gene but not the wild type E7 sequence. Epitope mapping revealed recognition of a cryptic epitope from the +3 alternative reading frame of codon-optimized E7, which is not encoded by the wild type E7 sequence. The introduction of a stop codon into the +3 alternative reading frame protected the transgene product from recognition by T cell receptor gene-modified T cells. This is the first experimental study demonstrating that codon optimization can render a transgene artificially immunogenic through generation of a dominant cryptic epitope. This finding may be of great importance for the clinical field of gene therapy to avoid rejection of gene-corrected cells and for the design of DNA- and RNA-based vaccines, where codon optimization may artificially add a strong immunogenic component to the vaccine. PMID:25799237
Complete mitochondrial genome of Palawan peacock-pheasant Polyplectron napoleonis (Galliformes, Phasianidae).

PubMed

Quach, Tommy; Brooks, Daniel M; Miranda, Hector C

2016-01-01

The complete mitochondrial genome of the Palawan peacock-pheasant Polyplectron napoleonis is 16,710 bp and contains 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes and a control-region. All protein-coding genes use the standard ATG start codon, except for cox1 which has GTG start codon. Seven out of 13 PCGs have TAA stop codons, two have AGG (cox1 and nd6), and three PCGs (nd2, cox2 and nd4) have incomplete stop codon of just T- - nucleotide.
Alterations of the three short open reading frames in the Rous sarcoma virus leader RNA modulate viral replication and gene expression.

PubMed Central

Moustakas, A; Sonstegard, T S; Hackett, P B

1993-01-01

The Rous sarcoma virus (RSV) leader RNA has three short open reading frames (ORF1 to ORF3) which are conserved in all avian sarcoma-leukosis retroviruses. Effects on virus propagation were determined following three types of alterations in the ORFs: (i) replacement of AUG initiation codons in order to prohibit ORF translation, (ii) alterations of the codon context around the AUG initiation codon to enhance translation of the normally silent ORF3, and (iii) elongation of the ORF coding sequences. Mutagenesis of the AUG codons for ORF1 and ORF2 (AUG1 and AUG2) singly or together delayed the onset of viral replication and cell transformation. In contrast, mutagenesis of AUG3 almost completely suppressed these viral activities. Mutagenesis of ORF3 to enhance its translation inhibited viral propagation. When the mutant ORF3 included an additional frameshift mutation which extended the ORF beyond the initiation site for the gag, gag-pol, and env proteins, host cells were initially transformed but died soon thereafter. Elongation of ORF1 from 7 to 62 codons led to the accumulation of transformation-defective virus with a delayed onset of replication. In contrast, viruses with elongation of ORF1 from 7 to 30 codons, ORF2 from 16 to 48 codons, or ORF3 from 9 to 64 codons, without any alterations in the AUG context, exhibited wild-type phenotypes. These results are consistent with a model that translation of the ORFs is necessary to facilitate virus production. Images PMID:7685415
Problem-Solving Test: The Effect of Synonymous Codons on Gene Expression

ERIC Educational Resources Information Center

Szeberenyi, Jozsef

2009-01-01

Terms to be familiar with before you start to solve the test: the genetic code, codon, degenerate codons, protein synthesis, aminoacyl-tRNA, anticodon, antiparallel orientation, wobble, unambiguous codons, ribosomes, initiation, elongation and termination of translation, peptidyl transferase, translocation, degenerate oligonucleotides, green…
Bioinformatic analysis suggests that the Orbivirus VP6 cistron encodes an overlapping gene

PubMed Central

Firth, Andrew E

2008-01-01

Background The genus Orbivirus includes several species that infect livestock – including Bluetongue virus (BTV) and African horse sickness virus (AHSV). These viruses have linear dsRNA genomes divided into ten segments, all of which have previously been assumed to be monocistronic. Results Bioinformatic evidence is presented for a short overlapping coding sequence (CDS) in the Orbivirus genome segment 9, overlapping the VP6 cistron in the +1 reading frame. In BTV, a 77–79 codon AUG-initiated open reading frame (hereafter ORFX) is present in all 48 segment 9 sequences analysed. The pattern of base variations across the 48-sequence alignment indicates that ORFX is subject to functional constraints at the amino acid level (even when the constraints due to coding in the overlapping VP6 reading frame are taken into account; MLOGD software). In fact the translated ORFX shows greater amino acid conservation than the overlapping region of VP6. The ORFX AUG codon has a strong Kozak context in all 48 sequences. Each has only one or two upstream AUG codons, always in the VP6 reading frame, and (with a single exception) always with weak or medium Kozak context. Thus, in BTV, ORFX may be translated via leaky scanning. A long (83–169 codon) ORF is present in a corresponding location and reading frame in all other Orbivirus species analysed except Saint Croix River virus (SCRV; the most divergent). Again, the pattern of base variations across sequence alignments indicates multiple coding in the VP6 and ORFX reading frames. Conclusion At ~9.5 kDa, the putative ORFX product in BTV is too small to appear on most published protein gels. Nonetheless, a review of past literature reveals a number of possible detections. We hope that presentation of this bioinformatic analysis will stimulate an attempt to experimentally verify the expression and functional role of ORFX, and hence lead to a greater understanding of the molecular biology of these important pathogens. PMID:18489030
Influence of certain forces on evolution of synonymous codon usage bias in certain species of three basal orders of aquatic insects.

PubMed

Selva Kumar, C; Nair, Rahul R; Sivaramakrishnan, K G; Ganesh, D; Janarthanan, S; Arunachalam, M; Sivaruban, T

2012-12-01

Forces that influence the evolution of synonymous codon usage bias are analyzed in six species of three basal orders of aquatic insects. The rationale behind choosing six species of aquatic insects (three from Ephemeroptera, one from Plecoptera, and two from Odonata) for the present analysis is based on phylogenetic position at the basal clades of the Order Insecta facilitating the understanding of the evolution of codon bias and of factors shaping codon usage patterns in primitive clades of insect lineages and their subtle differences in some of their ecological and environmental requirements in terms of habitat-microhabitat requirements, altitudinal preferences, temperature tolerance ranges, and consequent responses to climate change impacts. The present analysis focuses on open reading frames of the 13 protein-coding genes in the mitochondrial genome of six carefully chosen insect species to get a comprehensive picture of the evolutionary intricacies of codon bias. In all the six species, A and T contents are observed to be significantly higher than G and C, and are used roughly equally. Since transcription hypothesis on codon usage demands A richness and T poorness, it is quite likely that mutation pressure may be the key factor associated with synonymous codon usage (SCU) variations in these species because the mutation hypothesis predicts AT richness and GC poorness in the mitochondrial DNA. Thus, AT-biased mutation pressure seems to be an important factor in framing the SCU variation in all the selected species of aquatic insects, which in turn explains the predominance of A and T ending codons in these species. This study does not find any association between microhabitats and codon usage variations in the mitochondria of selected aquatic insects. However, this study has identified major forces, such as compositional constraints and mutation pressure, which shape patterns of codon usage in mitochondrial genes in the primitive clades of insect lineages.
The TGA codons are present in the open reading frame of selenoprotein P cDNA

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hill, K.E.; Lloyd, R.S.; Read, R.

1991-03-11

The TGA codon in DNA has been shown to direct incorporation of selenocysteine into protein. Several proteins from bacteria and animals contain selenocysteine in their primary structures. Each of the cDNA clones of these selenoproteins contains one TGA codon in the open reading frame which corresponds to the selenocysteine in the protein. A cDNA clone for selenoprotein P (SeP), obtained from a {gamma}ZAP rat liver library, was sequenced by the dideoxy termination method. The correct reading frame was determined by comparison of the deduced amino acid sequence with the amino acid sequence of several peptides from SeP. Using SeP labelledmore » with {sup 75}Se in vivo, the selenocysteine content of the peptides was verified by the collection of carboxymethylated {sup 77}Se-selenocysteine as it eluted from the amino acid analyzer and determination of the radioactivity contained in the collected samples. Ten TGA codons are present in the open reading frame of the cDNA. Peptide fragmentation studies and the deduced sequence indicate that selenium-rich regions are located close to the carboxy terminus. Nine of the 10 selenocysteines are located in the terminal 26% of the sequence with four in the terminal 15 amino acids. The deduced sequence codes for a protein of 385 amino acids. Cleavage of the signal peptide gives the mature protein with 366 amino acids and a calculated mol wt of 41,052 Da. Searches of PIR and SWISSPROT protein databases revealed no similarity with glutathione peroxidase or other selenoproteins.« less
Non-AUG translation: a new start for protein synthesis in eukaryotes

PubMed Central

Kearse, Michael G.; Wilusz, Jeremy E.

2017-01-01

Although it was long thought that eukaryotic translation almost always initiates at an AUG start codon, recent advancements in ribosome footprint mapping have revealed that non-AUG start codons are used at an astonishing frequency. These non-AUG initiation events are not simply errors but instead are used to generate or regulate proteins with key cellular functions; for example, during development or stress. Misregulation of non-AUG initiation events contributes to multiple human diseases, including cancer and neurodegeneration, and modulation of non-AUG usage may represent a novel therapeutic strategy. It is thus becoming increasingly clear that start codon selection is regulated by many trans-acting initiation factors as well as sequence/structural elements within messenger RNAs and that non-AUG translation has a profound impact on cellular states. PMID:28982758

Mutagenesis of the three bases preceding the start codon of the beta-galactosidase mRNA and its effect on translation in Escherichia coli.

PubMed Central

Hui, A; Hayflick, J; Dinkelspiel, K; de Boer, H A

1984-01-01

The effect on the translation efficiency of various mutations in the three bases (the -1 triplet) that precede the AUG start codon of the beta-galactosidase mRNA in Escherichia coli was studied. Of the 39 mutants examined, the level of expression varies over a 20-fold range. The most favorable combinations of bases in the -1 triplet are UAU and CUU. The expression levels in the mutants with UUC, UCA or AGG as the -1 triplet are 20-fold lower than those with UAU or CUU. In general, a U residue immediately preceding the start codon is more favorable for expression than any other base; furthermore, an A residue at the -2 position enhances the translation efficiency in most instances. In both cases, however, the degree of enhancement depends on its context, i.e. the neighboring bases. Although the rules derived from this study are complex, the results show that mutations in any of the three bases preceding the start codon can strongly affect the translational efficiency of the beta-galactosidase mRNA. PMID:6425057
Effect of the nucleotides surrounding the start codon on the translation of foot-and-mouth disease virus RNA.

PubMed

Ma, X X; Feng, Y P; Gu, Y X; Zhou, J H; Ma, Z R

2016-06-01

As for the alternative AUGs in foot-and-mouth disease virus (FMDV), nucleotide bias of the context flanking the AUG(2nd) could be used as a strong signal to initiate translation. To determine the role of the specific nucleotide context, dicistronic reporter constructs were engineered to contain different versions of nucleotide context linking between internal ribosome entry site (IRES) and downstream gene. The results indicate that under FMDV IRES-dependent mechanism, the nucleotide contexts flanking start codon can influence the translation initiation efficiencies. The most optimal sequences for both start codons have proved to be UUU AUG(1st) AAC and AAG AUG(2nd) GAA.
Ribosomal scanning past the primary initiation codon as a mechanism for expression of CTL epitopes encoded in alternative reading frames

PubMed Central

1996-01-01

An increasing amount of evidence has shown that epitopes restricted to MHC class I molecules and recognized by CTL need not be encoded in a primary open reading frame (ORF). Such epitopes have been demonstrated after stop codons, in alternative reading frames (RF) and within introns. We have used a series of frameshifts (FS) introduced into the Influenza A/PR/8 /34 nucleoprotein (NP) gene to confirm the previous in vitro observations of cryptic epitope expression, and show that they are sufficiently expressed to prime immune responses in vivo. This presentation is not due to sub-dominant epitopes, transcription from cryptic promoters beyond the point of the FS, or internal initiation of translation. By introducing additional mutations to the construct exhibiting the most potent presentation, we have identified initiation codon readthrough (termed scanthrough here, where the scanning ribosome bypasses the conventional initiation codon, initiating translation further downstream) as the likely mechanism of epitope production. Further mutational analysis demonstrated that, while it should operate during the expression of wild-type (WT) protein, scanthrough does not provide a major source of processing substrate in our system. These findings suggest (i) that the full array of self- and pathogen-derived epitopes available during thymic selection and infection has not been fully appreciated and (ii) that cryptic epitope expression should be considered when the specificity of a CTL response cannot be identified or in therapeutic situations when conventional CTL targets are limited, as may be the case with latent viral infections and transformed cells. Finally, initiation codon readthrough provides a plausible explanation for the presentation of exocytic proteins by MHC class I molecules. PMID:8879204
The complete genome sequence of freesia mosaic virus and its relationship to other potyviruses.

PubMed

Choi, H I; Lim, H R; Song, Y S; Kim, M J; Choi, S H; Song, Y S; Bae, S C; Ryu, K H

2010-07-01

We have completed the genomic sequence of a potyvirus, freesia mosaic virus (FreMV), and compared it to those of other known potyviruses. The full-length genome sequence of FreMV consists of 9,489 nucleotides. The large protein contains 3,077 amino acids, with an AUG start codon and UAA stop codon, containing one open reading frame typical of a potyvirus polyprotein. The polyprotein of FreMV-Kr gives rise to eleven proteins (P1, HC-pro, P3, PIPO, 6K1, CI, 6K2, VPg, NIa, NIb and CP), and putative cleavage sites of each protein were identified by sequence comparison to those of other known potyviruses. Phylogenetic analysis of the polyprotein revealed that FreMV-Kr was most closely related to PeMoV and was related to BtMV, BaRMV and PeLMV, which belong to the BCMV subgroup. This is the first information on the complete genome structure of FreMV, and the sequence information clearly supports the status of FreMV as a member of a distinct species in the genus Potyvirus.
Ribosomes slide on lysine-encoding homopolymeric A stretches

PubMed Central

Koutmou, Kristin S; Schuller, Anthony P; Brunelle, Julie L; Radhakrishnan, Aditya; Djuranovic, Sergej; Green, Rachel

2015-01-01

Protein output from synonymous codons is thought to be equivalent if appropriate tRNAs are sufficiently abundant. Here we show that mRNAs encoding iterated lysine codons, AAA or AAG, differentially impact protein synthesis: insertion of iterated AAA codons into an ORF diminishes protein expression more than insertion of synonymous AAG codons. Kinetic studies in E. coli reveal that differential protein production results from pausing on consecutive AAA-lysines followed by ribosome sliding on homopolymeric A sequence. Translation in a cell-free expression system demonstrates that diminished output from AAA-codon-containing reporters results from premature translation termination on out of frame stop codons following ribosome sliding. In eukaryotes, these premature termination events target the mRNAs for Nonsense-Mediated-Decay (NMD). The finding that ribosomes slide on homopolymeric A sequences explains bioinformatic analyses indicating that consecutive AAA codons are under-represented in gene-coding sequences. Ribosome ‘sliding’ represents an unexpected type of ribosome movement possible during translation. DOI: http://dx.doi.org/10.7554/eLife.05534.001 PMID:25695637
Transcriptional mapping of the varicella-zoster virus regulatory genes encoding open reading frames 4 and 63.

PubMed Central

Kinchington, P R; Vergnes, J P; Defechereux, P; Piette, J; Turse, S E

1994-01-01

Four of the 68 varicella-zoster virus (VZV) unique open reading frames (ORFs), i.e., ORFs 4, 61, 62, and 63, encode proteins that influence viral transcription and are considered to be positional homologs of herpes simplex virus type 1 (HSV-1) immediate-early (IE) proteins. In order to identify the elements that regulate transcription of VZV ORFs 4 and 63, the encoded mRNAs were mapped in detail. For ORF 4, a major 1.8-kb and a minor 3.0-kb polyadenylated [poly(A)+] RNA were identified, whereas ORF 63-specific probes recognized 1.3- and 1.9-kb poly(A)+ RNAs. Probes specific for sequences adjacent to the ORFs and mapping of the RNA 3' ends indicated that the ORF 4 RNAs were 3' coterminal, whereas the RNAs for ORF 63 represented two different termination sites. S1 nuclease mapping and primer extension analyses indicated a single transcription initiation site for ORF 4 at 38 bp upstream of the ORF start codon. For ORF 63, multiple transcriptional start sites at 87 to 95, 151 to 153, and (tentatively) 238 to 243 bp upstream of the ORF start codon were identified. TATA box motifs at good positional locations were found upstream of all mapped transcription initiation sites. However, no sequences resembling the TAATGARAT motif, which confers IE regulation upon HSV-1 IE genes, were found. The finding of the absence of this motif was supported through analyses of the regulatory sequences of ORFs 4 and 63 in transient transfection assays alongside those of ORFs 61 and 62. Sequences representing the promoters for ORFs 4, 61, and 63 were all stimulated by VZV infection but failed to be stimulated by coexpression with the HSV-1 transactivator Vmw65. In contrast, the promoter for ORF 62, which contains TAATGARAT motifs, was activated by VZV infection and coexpression with Vmw65. These results extend the transcriptional knowledge for VZV and suggest that ORFs 4 and 63 contain regulatory signals different from those of the ORF 62 and HSV-1 IE genes. Images PMID:8189496
Selenocysteine incorporation: A trump card in the game of mRNA decay

PubMed Central

Shetty, Sumangala P.; Copeland, Paul R.

2015-01-01

The incorporation of the 21st amino acid, selenocysteine (Sec), occurs on mRNAs that harbor in-frame stop codons because the Sec-tRNASec recognizes a UGA codon. This sets up an intriguing interplay between translation elongation, translation termination and the complex machinery that marks mRNAs that contain premature termination codons for degradation, leading to nonsense mediated mRNA decay (NMD). In this review we discuss the intricate and complex relationship between this key quality control mechanism and the process of Sec incorporation in mammals. PMID:25622574
eIF1 Loop 2 interactions with Met-tRNAi control the accuracy of start codon selection by the scanning preinitiation complex.

PubMed

Thakur, Anil; Hinnebusch, Alan G

2018-05-01

The eukaryotic 43S preinitiation complex (PIC), bearing initiator methionyl transfer RNA (Met-tRNA i ) in a ternary complex (TC) with eukaryotic initiation factor 2 (eIF2)-GTP, scans the mRNA leader for an AUG codon in favorable context. AUG recognition evokes rearrangement from an open PIC conformation with TC in a "P OUT " state to a closed conformation with TC more tightly bound in a "P IN " state. eIF1 binds to the 40S subunit and exerts a dual role of enhancing TC binding to the open PIC conformation while antagonizing the P IN state, necessitating eIF1 dissociation for start codon selection. Structures of reconstituted PICs reveal juxtaposition of eIF1 Loop 2 with the Met-tRNA i D loop in the P IN state and predict a distortion of Loop 2 from its conformation in the open complex to avoid a clash with Met-tRNA i We show that Ala substitutions in Loop 2 increase initiation at both near-cognate UUG codons and AUG codons in poor context. Consistently, the D71A-M74A double substitution stabilizes TC binding to 48S PICs reconstituted with mRNA harboring a UUG start codon, without affecting eIF1 affinity for 40S subunits. Relatively stronger effects were conferred by arginine substitutions; and no Loop 2 substitutions perturbed the rate of TC loading on scanning 40S subunits in vivo. Thus, Loop 2-D loop interactions specifically impede Met-tRNA i accommodation in the P IN state without influencing the P OUT mode of TC binding; and Arg substitutions convert the Loop 2-tRNA i clash to an electrostatic attraction that stabilizes P IN and enhances selection of poor start codons in vivo.
Multiple Evolutionary Selections Involved in Synonymous Codon Usages in the Streptococcus agalactiae Genome.

PubMed

Ma, Yan-Ping; Ke, Hao; Liang, Zhi-Ling; Liu, Zhen-Xing; Hao, Le; Ma, Jiang-Yao; Li, Yu-Gu

2016-02-24

Streptococcus agalactiae is an important human and animal pathogen. To better understand the genetic features and evolution of S. agalactiae, multiple factors influencing synonymous codon usage patterns in S. agalactiae were analyzed in this study. A- and U-ending rich codons were used in S. agalactiae function genes through the overall codon usage analysis, indicating that Adenine (A)/Thymine (T) compositional constraints might contribute an important role to the synonymous codon usage pattern. The GC3% against the effective number of codon (ENC) value suggested that translational selection was the important factor for codon bias in the microorganism. Principal component analysis (PCA) showed that (i) mutational pressure was the most important factor in shaping codon usage of all open reading frames (ORFs) in the S. agalactiae genome; (ii) strand specific mutational bias was not capable of influencing the codon usage bias in the leading and lagging strands; and (iii) gene length was not the important factor in synonymous codon usage pattern in this organism. Additionally, the high correlation between tRNA adaptation index (tAI) value and codon adaptation index (CAI), frequency of optimal codons (Fop) value, reinforced the role of natural selection for efficient translation in S. agalactiae. Comparison of synonymous codon usage pattern between S. agalactiae and susceptible hosts (human and tilapia) showed that synonymous codon usage of S. agalactiae was independent of the synonymous codon usage of susceptible hosts. The study of codon usage in S. agalactiae may provide evidence about the molecular evolution of the bacterium and a greater understanding of evolutionary relationships between S. agalactiae and its hosts.
Multiple Evolutionary Selections Involved in Synonymous Codon Usages in the Streptococcus agalactiae Genome

PubMed Central

Ma, Yan-Ping; Ke, Hao; Liang, Zhi-Ling; Liu, Zhen-Xing; Hao, Le; Ma, Jiang-Yao; Li, Yu-Gu

2016-01-01

Streptococcus agalactiae is an important human and animal pathogen. To better understand the genetic features and evolution of S. agalactiae, multiple factors influencing synonymous codon usage patterns in S. agalactiae were analyzed in this study. A- and U-ending rich codons were used in S. agalactiae function genes through the overall codon usage analysis, indicating that Adenine (A)/Thymine (T) compositional constraints might contribute an important role to the synonymous codon usage pattern. The GC3% against the effective number of codon (ENC) value suggested that translational selection was the important factor for codon bias in the microorganism. Principal component analysis (PCA) showed that (i) mutational pressure was the most important factor in shaping codon usage of all open reading frames (ORFs) in the S. agalactiae genome; (ii) strand specific mutational bias was not capable of influencing the codon usage bias in the leading and lagging strands; and (iii) gene length was not the important factor in synonymous codon usage pattern in this organism. Additionally, the high correlation between tRNA adaptation index (tAI) value and codon adaptation index (CAI), frequency of optimal codons (Fop) value, reinforced the role of natural selection for efficient translation in S. agalactiae. Comparison of synonymous codon usage pattern between S. agalactiae and susceptible hosts (human and tilapia) showed that synonymous codon usage of S. agalactiae was independent of the synonymous codon usage of susceptible hosts. The study of codon usage in S. agalactiae may provide evidence about the molecular evolution of the bacterium and a greater understanding of evolutionary relationships between S. agalactiae and its hosts. PMID:26927064
Origin of noncoding DNA sequences: molecular fossils of genome evolution

DOE Office of Scientific and Technical Information (OSTI.GOV)

Naora, H.; Miyahara, K.; Curnow, R.N.

The total amount of noncoding sequences on chromosomes of contemporary organisms varies significantly from species to species. The authors propose a hypothesis for the origin of these noncoding sequences that assumes that (i) an approx. 0.55-kilobase (kb)-long reading frame composed the primordial gene and (ii) a 20-kb-long single-stranded polynucleotide is the longest molecule (as a genome) that was polymerized at random and without a specific template in the primordial soup/cell. The statistical distribution of stop codons allows examination of the probability of generating reading frames of approx. 0.55 kb in this primordial polynucleotide. This analysis reveals that with three stopmore » codons, a run of at least 0.55-kb equivalent length of nonstop codons would occur in 4.6% of 20-kb-long polynucleotide molecules. They attempt to estimate the total amount of noncoding sequences that would be present on the chromosomes of contemporary species assuming that present-day chromosomes retain the prototype primordial genome structure. Theoretical estimates thus obtained for most eukaryotes do not differ significantly from those reported for these specific organisms, with only a few exceptions. Furthermore, analysis of possible stop-codon distributions suggests that life on earth would not exist, at least in its present form, had two or four stop codons been selected early in evolution.« less
Co-expression of the Thermotoga neapolitana aglB gene with an upstream 3'-coding fragment of the malG gene improves enzymatic characteristics of recombinant AglB cyclomaltodextrinase.

PubMed

Lunina, Natalia A; Agafonova, Elena V; Chekanovskaya, Lyudmila A; Dvortsov, Igor A; Berezina, Oksana V; Shedova, Ekaterina N; Kostrov, Sergey V; Velikodvorskaya, Galina A

2007-07-01

A cluster of Thermotoga neapolitana genes participating in starch degradation includes the malG gene of sugar transport protein and the aglB gene of cyclomaltodextrinase. The start and stop codons of these genes share a common overlapping sequence, aTGAtg. Here, we compared properties of expression products of three different constructs with aglB from T. neapolitana. The first expression vector contained the aglB gene linked to an upstream 90-bp 3'-terminal region of the malG gene with the stop codon overlapping with the start codon of aglB. The second construct included the isolated coding sequence of aglB with two tandem potential start codons. The expression product of this construct in Escherichia coli had two tandem Met residues at its N terminus and was characterized by low thermostability and high tendency to aggregate. In contrast, co-expression of aglB and the 3'-terminal region of malG (the first construct) resulted in AglB with only one N-terminal Met residue and a much higher specific activity of cyclomaltodextrinase. Moreover, the enzyme expressed by such a construct was more thermostable and less prone to aggregation. The third construct was the same as the second one except that it contained only one ATG start codon. The product of its expression had kinetic and other properties similar to those of the enzyme with only one N-terminal Met residue.
Complete mitochondrial genome of the Yellownose skate: Zearaja chilensis (Rajiformes, Rajidae).

PubMed

Jeong, Dageum; Lee, Youn-Ho

2016-01-01

The complete sequence of mitochondrial DNA of a Yellownose skate, Zearaja chilensis was determined for the first time. It is 16,909 bp in length covering 2 rRNA, 22 tRNA and 13 protein coding genes with the identical gene order and structure as those of other Rajidae species. The nucleotide of L-strand is composed of low G (14.3%), and slightly high A + T (58.9%) nucleotides. The strong codon usage bias against the use of G (6.0%) is found at the third codon positions. Twelve of the 13 protein coding genes use ATG as the start codon while COX1 starts with GTG. As for the stop codon, only ND4 shows an incomplete stop codon TA. This is the first report of the mitogenome for a species in the genus Zearaja, providing a valuable source of genetic information on the evolution of the family Rajidae and the genus Zearaja as well as for establishment of a sustainble fishery management plan of the species.
[Organization and expression of poliovirus genome].

PubMed

Vevcherenko, S G

1984-01-01

In the present paper on the basis of analysis of literary data it is postulated that along with the AUG codon at N743 there exists a second initiation codon in the poliovirus RNA (the AUG codon at N586). The translation initiated at N586 can be transferred to the phase of the major reading frame by removing the small hairpin N732-N744 formed near the first initiation site, or by removing the small region N739-N745. In the first case at the boundary between the hypothetical leader peptide encoded by the 5'-terminus of the long, open reading frame of the spliced poliovirus RNA and the capsid protein VP4 must be the Gln-Gly proteolytic cleavage signal, and in the second case--the Tyr-Gly signal. In both cases the leader peptide can be chipped off by the virus specific proteinase. It is supposed that the exon-intronic structure of the poliovirus genome is needed for coordination of translation and transcription during the poliovirus reproduction cycle.
Mapping the subgenomic RNA promoter of the Citrus leaf blotch virus coat protein gene by Agrobacterium-mediated inoculation.

PubMed

Renovell, Agueda; Gago, Selma; Ruiz-Ruiz, Susana; Velázquez, Karelia; Navarro, Luis; Moreno, Pedro; Vives, Mari Carmen; Guerri, José

2010-10-25

Citrus leaf blotch virus has a single-stranded positive-sense genomic RNA (gRNA) of 8747 nt organized in three open reading frames (ORFs). The ORF1, encoding a polyprotein involved in replication, is translated directly from the gRNA, whereas ORFs encoding the movement (MP) and coat (CP) proteins are expressed via 3' coterminal subgenomic RNAs (sgRNAs). We characterized the minimal promoter region critical for the CP-sgRNA expression in infected cells by deletion analyses using Agrobacterium-mediated infection of Nicotiana benthamiana plants. The minimal CP-sgRNA promoter was mapped between nucleotides -67 and +50 nt around the transcription start site. Surprisingly, larger deletions in the region between the CP-sgRNA transcription start site and the CP translation initiation codon resulted in increased CP-sgRNA accumulation, suggesting that this sequence could modulate the CP-sgRNA transcription. Site-specific mutational analysis of the transcription start site revealed that the +1 guanylate and the +2 adenylate are important for CP-sgRNA synthesis. Copyright © 2010 Elsevier Inc. All rights reserved.
Analysis of the DNA sequence of a 15,500 bp fragment near the left telomere of chromosome XV from Saccharomyces cerevisiae reveals a putative sugar transporter, a carboxypeptidase homologue and two new open reading frames.

PubMed

Gamo, F J; Lafuente, M J; Casamayor, A; Ariño, J; Aldea, M; Casas, C; Herrero, E; Gancedo, C

1996-06-15

We report the sequence of a 15.5 kb DNA segment located near the left telomere of chromosome XV of Saccharomyces cerevisiae. The sequence contains nine open reading frames (ORFs) longer than 300 bp. Three of them are internal to other ones. One corresponds to the gene LGT3 that encodes a putative sugar transporter. Three adjacent ORFs were separated by two stop codons in frame. These ORFs presented homology with the gene CPS1 that encodes carboxypeptidase S. The stop codons were not found in the same sequence derived from another yeast strain. Two other ORFs without significant homology in databases were also found. One of them, O0420, is very rich in serine and threonine and presents a series of repeated or similar amino acid stretches along the sequence.
Tobacco chloroplast tRNALys(UUU) gene contains a 2.5-kilobase-pair intron: An open reading frame and a conserved boundary sequence in the intron

PubMed Central

Sugita, Mamoru; Shinozaki, Kazuo; Sugiura, Masahiro

1985-01-01

The nucleotide sequence of a tRNALys(UUU) gene on tobacco (Nicotiana tabacum) chloroplast DNA has been determined. This gene is located 215 base pairs upstream from the gene for the 32,000-dalton thylakoid membrane protein on the same DNA strand and has a 2526-base-pair intron in the anticodon loop. The intron boundary sequence does not follow the G-U/A-G rule but is similar to those of tobacco chloroplast split genes for tRNAGly(UCC) and ribosomal proteins L2 and S12. The intron contains one major open reading frame of 509 codons. The codon usage in the open reading frame resembles those observed in the genes for tobacco chloroplast proteins so far analyzed. The primary transcript of this tRNA gene is 2.7 kilobases long. Images PMID:16593561
Tobacco chloroplast tRNA(UUU) gene contains a 2.5-kilobase-pair intron: An open reading frame and a conserved boundary sequence in the intron.

PubMed

Sugita, M; Shinozaki, K; Sugiura, M

1985-06-01

The nucleotide sequence of a tRNA(Lys)(UUU) gene on tobacco (Nicotiana tabacum) chloroplast DNA has been determined. This gene is located 215 base pairs upstream from the gene for the 32,000-dalton thylakoid membrane protein on the same DNA strand and has a 2526-base-pair intron in the anticodon loop. The intron boundary sequence does not follow the G-U/A-G rule but is similar to those of tobacco chloroplast split genes for tRNA(Gly)(UCC) and ribosomal proteins L2 and S12. The intron contains one major open reading frame of 509 codons. The codon usage in the open reading frame resembles those observed in the genes for tobacco chloroplast proteins so far analyzed. The primary transcript of this tRNA gene is 2.7 kilobases long.
Termination and read-through proteins encoded by genome segment 9 of Colorado tick fever virus.

PubMed

Mohd Jaafar, Fauziah; Attoui, Houssam; De Micco, Philippe; De Lamballerie, Xavier

2004-08-01

Genome segment 9 (Seg-9) of Colorado tick fever virus (CTFV) is 1884 bp long and contains a large open reading frame (ORF; 1845 nt in length overall), although a single in-frame stop codon (at nt 1052-1054) reduces the ORF coding capacity by approximately 40 %. However, analyses of highly conserved RNA sequences in the vicinity of the stop codon indicate that it belongs to a class of 'leaky terminators'. The third nucleotide positions in codons situated both before and after the stop codon, shows the highest variability, suggesting that both regions are translated during virus replication. This also suggests that the stop signal is functionally leaky, allowing read-through translation to occur. Indeed, both the truncated 'termination' protein and the full-length 'read-through' protein (VP9 and VP9', respectively) were detected in CTFV-infected cells, in cells transfected with a plasmid expressing only Seg-9 protein products, and in the in vitro translation products from undenatured Seg-9 ssRNA. The ratios of full-length and truncated proteins generated suggest that read-through may be down-regulated by other viral proteins. Western blot analysis of infected cells and purified CTFV showed that VP9 is a structural component of the virion, while VP9' is a non-structural protein.
Enhanced expression of codon optimized Mycobacterium avium subsp. paratuberculosis antigens in Lactobacillus salivarius.

PubMed

Johnston, Christopher D; Bannantine, John P; Govender, Rodney; Endersen, Lorraine; Pletzer, Daniel; Weingart, Helge; Coffey, Aidan; O'Mahony, Jim; Sleator, Roy D

2014-01-01

It is well documented that open reading frames containing high GC content show poor expression in A+T rich hosts. Specifically, G+C-rich codon usage is a limiting factor in heterologous expression of Mycobacterium avium subsp. paratuberculosis (MAP) proteins using Lactobacillus salivarius. However, re-engineering opening reading frames through synonymous substitutions can offset codon bias and greatly enhance MAP protein production in this host. In this report, we demonstrate that codon-usage manipulation of MAP2121c can enhance the heterologous expression of the major membrane protein (MMP), analogous to the form in which it is produced natively by MAP bacilli. When heterologously over-expressed, antigenic determinants were preserved in synthetic MMP proteins as shown by monoclonal antibody mediated ELISA. Moreover, MMP is a membrane protein in MAP, which is also targeted to the cellular surface of recombinant L. salivarius at levels comparable to MAP. Additionally, we previously engineered MAP3733c (encoding MptD) and show herein that MptD displays the tendency to associate with the cytoplasmic membrane boundary under confocal microscopy and the intracellularly accumulated protein selectively adheres to the MptD-specific bacteriophage fMptD. This work demonstrates there is potential for L. salivarius as a viable antigen delivery vehicle for MAP, which may provide an effective mucosal vaccine against Johne's disease.

Two alternative ways of start site selection in human norovirus reinitiation of translation.

PubMed

Luttermann, Christine; Meyers, Gregor

2014-04-25

The calicivirus minor capsid protein VP2 is expressed via termination/reinitiation. This process depends on an upstream sequence element denoted termination upstream ribosomal binding site (TURBS). We have shown for feline calicivirus and rabbit hemorrhagic disease virus that the TURBS contains three sequence motifs essential for reinitiation. Motif 1 is conserved among caliciviruses and is complementary to a sequence in the 18 S rRNA leading to the model that hybridization between motif 1 and 18 S rRNA tethers the post-termination ribosome to the mRNA. Motif 2 and motif 2* are proposed to establish a secondary structure positioning the ribosome relative to the start site of the terminal ORF. Here, we analyzed human norovirus (huNV) sequences for the presence and importance of these motifs. The three motifs were identified by sequence analyses in the region upstream of the VP2 start site, and we showed that these motifs are essential for reinitiation of huNV VP2 translation. More detailed analyses revealed that the site of reinitiation is not fixed to a single codon and does not need to be an AUG, even though this codon is clearly preferred. Interestingly, we were able to show that reinitiation can occur at AUG codons downstream of the canonical start/stop site in huNV and feline calicivirus but not in rabbit hemorrhagic disease virus. Although reinitiation at the original start site is independent of the Kozak context, downstream initiation exhibits requirements for start site sequence context known for linear scanning. These analyses on start codon recognition give a more detailed insight into this fascinating mechanism of gene expression.
Functional Versatility of AGY Serine Codons in Immunoglobulin Variable Region Genes

PubMed Central

Detanico, Thiago; Phillips, Matthew; Wysocki, Lawrence J.

2016-01-01

In systemic autoimmunity, autoantibodies directed against nuclear antigens (Ags) often arise by somatic hypermutation (SHM) that converts AGT and AGC (AGY) Ser codons into Arg codons. This can occur by three different single-base changes. Curiously, AGY Ser codons are far more abundant in complementarity-determining regions (CDRs) of IgV-region genes than expected for random codon use or from species-specific codon frequency data. CDR AGY codons are also more abundant than TCN Ser codons. We show that these trends hold even in cartilaginous fishes. Because AGC is a preferred target for SHM by activation-induced cytidine deaminase, we asked whether the AGY abundance was solely due to a selection pressure to conserve high mutability in CDRs regardless of codon context but found that this was not the case. Instead, AGY triplets were selectively enriched in the Ser codon reading frame. Motivated by reports implicating a functional role for poly/autoreactive specificities in antiviral antibodies, we also analyzed mutations at AGY in antibodies directed against a number of different viruses and found that mutations producing Arg codons in antiviral antibodies were indeed frequent. Unexpectedly, however, we also found that AGY codons mutated often to encode nearly all of the amino acids that are reported to provide the most frequent contacts with Ag. In many cases, mutations producing codons for these alternative amino acids in antiviral antibodies were more frequent than those producing Arg codons. Mutations producing each of these key amino acids required only single-base changes in AGY. AGY is the only codon group in which two-thirds of random mutations generate codons for these key residues. Finally, by directly analyzing X-ray structures of immune complexes from the RCSB protein database, we found that Ag-contact residues generated via SHM occurred more often at AGY than at any other codon group. Thus, preservation of AGY codons in antibody genes appears to have been driven by their exceptional functional versatility, despite potential autoreactive consequences. PMID:27920779
Translational Control of the SigR-Directed Oxidative Stress Response in Streptomyces via IF3-Mediated Repression of a Noncanonical GTC Start Codon

PubMed Central

Feeney, Morgan A.; Chandra, Govind; Findlay, Kim C.; Paget, Mark S. B.

2017-01-01

ABSTRACT The major oxidative stress response in Streptomyces is controlled by the sigma factor SigR and its cognate antisigma factor RsrA, and SigR activity is tightly controlled through multiple mechanisms at both the transcriptional and posttranslational levels. Here we show that sigR has a highly unusual GTC start codon and that this leads to another level of SigR regulation, in which SigR translation is repressed by translation initiation factor 3 (IF3). Changing the GTC to a canonical start codon causes SigR to be overproduced relative to RsrA, resulting in unregulated and constitutive expression of the SigR regulon. Similarly, introducing IF3* mutations that impair its ability to repress SigR translation has the same effect. Thus, the noncanonical GTC sigR start codon and its repression by IF3 are critical for the correct and proper functioning of the oxidative stress regulatory system. sigR and rsrA are cotranscribed and translationally coupled, and it had therefore been assumed that SigR and RsrA are produced in stoichiometric amounts. Here we show that RsrA can be transcribed and translated independently of SigR, present evidence that RsrA is normally produced in excess of SigR, and describe the factors that determine SigR-RsrA stoichiometry. PMID:28611250
Cloning and sequencing of the pheP gene, which encodes the phenylalanine-specific transport system of Escherichia coli.

PubMed Central

Pi, J; Wookey, P J; Pittard, A J

1991-01-01

The phenylalanine-specific permease gene (pheP) of Escherichia coli has been cloned and sequenced. The gene was isolated on a 6-kb Sau3AI fragment from a chromosomal library, and its presence was verified by complementation of a mutant lacking the functional phenylalanine-specific permease. Subcloning from this fragment localized the pheP gene on a 2.7-kb HindIII-HindII fragment. The nucleotide sequence of this 2.7-kb region was determined. An open reading frame was identified which extends from a putative start point of translation (GTG at position 636) to a termination signal (TAA at position 2010). The assignment of the GTG as the initiation codon was verified by site-directed mutagenesis of the initiation codon and by introducing a chain termination mutation into the pheP-lacZ fusion construct. A single initiation site of transcription 30 bp upstream of the start point of translation was identified by the primer extension analysis. The pheP structural gene consists of 1,374 nucleotides specifying a protein of 458 amino acid residues. The PheP protein is very hydrophobic (71% nonpolar residues). A topological model predicted from the sequence analysis defines 12 transmembrane segments. This protein is highly homologous with the AroP (general aromatic transport) system of E. coli (59.6% identity) and to a lesser extent with the yeast permeases CAN1 (arginine), PUT4 (proline), and HIP1 (histidine) of Saccharomyces cerevisiae. Images PMID:1711024
Transcription and Regulation of the Bidirectional Hydrogenase in the Cyanobacterium Nostoc sp. Strain PCC 7120▿

PubMed Central

Sjöholm, Johannes; Oliveira, Paulo; Lindblad, Peter

2007-01-01

The filamentous, heterocystous cyanobacterium Nostoc sp. strain PCC 7120 (Anabaena sp. strain PCC 7120) possesses an uptake hydrogenase and a bidirectional enzyme, the latter being capable of catalyzing both H2 production and evolution. The completely sequenced genome of Nostoc sp. strain PCC 7120 reveals that the five structural genes encoding the bidirectional hydrogenase (hoxEFUYH) are separated in two clusters at a distance of approximately 8.8 kb. The transcription of the hox genes was examined under nitrogen-fixing conditions, and the results demonstrate that the cluster containing hoxE and hoxF can be transcribed as one polycistronic unit together with the open reading frame alr0750. The second cluster, containing hoxU, hoxY, and hoxH, is transcribed together with alr0763 and alr0765, located between the hox genes. Moreover, alr0760 and alr0761 form an additional larger operon. Nevertheless, Northern blot hybridizations revealed a rather complex transcription pattern in which the different hox genes are expressed differently. Transcriptional start points (TSPs) were identified 66 and 57 bp upstream from the start codon of alr0750 and hoxU, respectively. The transcriptions of the two clusters containing the hox genes are both induced under anaerobic conditions concomitantly with the induction of a higher level of hydrogenase activity. An additional TSP, within the annotated alr0760, 244 bp downstream from the suggested translation start codon, was identified. Electrophoretic mobility shift assays with purified LexA from Nostoc sp. strain PCC 7120 demonstrated specific interactions between the transcriptional regulator and both hox promoter regions. However, when LexA from Synechocystis sp. strain PCC 6803 was used, the purified protein interacted only with the promoter region of the alr0750-hoxE-hoxF operon. A search of the whole Nostoc sp. strain PCC 7120 genome demonstrated the presence of 216 putative LexA binding sites in total, including recA and recF. This indicates that, in addition to the bidirectional hydrogenase gene, a number of other genes, including open reading frames connected to DNA replication, recombination, and repair, may be part of the LexA regulatory network in Nostoc sp. strain PCC 7120. PMID:17630298
Cytochrome oxidase subunit II gene in mitochondria of Oenothera has no intron

PubMed Central

Hiesel, Rudolf; Brennicke, Axel

1983-01-01

The cytochrome oxidase subunit II gene has been localized in the mitochondrial genome of Oenothera berteriana and the nucleotide sequence has been determined. The coding sequence contains 777 bp and, unlike the corresponding gene in Zea mays, is not interrupted by an intron. No TGA codon is found within the open reading frame. The codon CGG, as in the maize gene, is used in place of tryptophan codons of corresponding genes in other organisms. At position 742 in the Oenothera sequence the TGG of maize is changed into a CGG codon, where Trp is conserved as the amino acid in other organisms. Homologous sequences occur more than once in the mitochondrial genome as several mitochondrial DNA species hybridize with DNA probes of the cytochrome oxidase subunit II gene. ImagesFig. 5. PMID:16453484
Cloning and Expression Analysis of Genes Encoding Lytic Endopeptidases L1 and L5 from Lysobacter sp. Strain XL1

PubMed Central

Lapteva, Y. S.; Zolova, O. E.; Shlyapnikov, M. G.; Tsfasman, I. M.; Muranova, T. A.; Stepnaya, O. A.; Kulaev, I. S.

2012-01-01

Lytic enzymes are the group of hydrolases that break down structural polymers of the cell walls of various microorganisms. In this work, we determined the nucleotide sequences of the Lysobacter sp. strain XL1 alpA and alpB genes, which code for, respectively, secreted lytic endopeptidases L1 (AlpA) and L5 (AlpB). In silico analysis of their amino acid sequences showed these endopeptidases to be homologous proteins synthesized as precursors similar in structural organization: the mature enzyme sequence is preceded by an N-terminal signal peptide and a pro region. On the basis of phylogenetic analysis, endopeptidases AlpA and AlpB were assigned to the S1E family [clan PA(S)] of serine peptidases. Expression of the alpA and alpB open reading frames (ORFs) in Escherichia coli confirmed that they code for functionally active lytic enzymes. Each ORF was predicted to have the Shine-Dalgarno sequence located at a canonical distance from the start codon and a potential Rho-independent transcription terminator immediately after the stop codon. The alpA and alpB mRNAs were experimentally found to be monocistronic; transcription start points were determined for both mRNAs. The synthesis of the alpA and alpB mRNAs was shown to occur predominantly in the late logarithmic growth phase. The amount of alpA mRNA in cells of Lysobacter sp. strain XL1 was much higher, which correlates with greater production of endopeptidase L1 than of L5. PMID:22865082
Cloning and expression analysis of genes encoding lytic endopeptidases L1 and L5 from Lysobacter sp. strain XL1.

PubMed

Lapteva, Y S; Zolova, O E; Shlyapnikov, M G; Tsfasman, I M; Muranova, T A; Stepnaya, O A; Kulaev, I S; Granovsky, I E

2012-10-01

Lytic enzymes are the group of hydrolases that break down structural polymers of the cell walls of various microorganisms. In this work, we determined the nucleotide sequences of the Lysobacter sp. strain XL1 alpA and alpB genes, which code for, respectively, secreted lytic endopeptidases L1 (AlpA) and L5 (AlpB). In silico analysis of their amino acid sequences showed these endopeptidases to be homologous proteins synthesized as precursors similar in structural organization: the mature enzyme sequence is preceded by an N-terminal signal peptide and a pro region. On the basis of phylogenetic analysis, endopeptidases AlpA and AlpB were assigned to the S1E family [clan PA(S)] of serine peptidases. Expression of the alpA and alpB open reading frames (ORFs) in Escherichia coli confirmed that they code for functionally active lytic enzymes. Each ORF was predicted to have the Shine-Dalgarno sequence located at a canonical distance from the start codon and a potential Rho-independent transcription terminator immediately after the stop codon. The alpA and alpB mRNAs were experimentally found to be monocistronic; transcription start points were determined for both mRNAs. The synthesis of the alpA and alpB mRNAs was shown to occur predominantly in the late logarithmic growth phase. The amount of alpA mRNA in cells of Lysobacter sp. strain XL1 was much higher, which correlates with greater production of endopeptidase L1 than of L5.
The complete mitochondrial genome of the Longnose skate: Raja rhina (Rajiformes, Rajidae).

PubMed

Jeong, Dageum; Lee, Youn-Ho

2015-02-01

The complete sequence of mitochondrial DNA of a longnose skate, Raja rhina was determined for the first time. It is 16,910 bp in length containing 2 rRNA, 22 tRNA and 13 protein coding genes with the same gene order and structure as those of other Rajidae species. The nucleotide of L-strand is composed of 30.1% A, 27.2% C, 28.5% T and 14.2% G, showing a slight A + T bias. The G is the least used base and markedly lower at the third codon position (5.4%). Twelve of the 13 protein coding genes use ATG as their start codon while the COX1 starts with GTG. As for stop codon, only ND4 shows incomplete stop codon TA. This mitogenome is the first report for a species of the genus Raja, and providing a valuable resource of genetic information for understanding the phylogenetic relationship and the evolution of the genus Raja as well as the family, Rajidae.
The complete mitochondrial genome of the longhorn beetle Xylotrechus grayii (Coleoptera: Cerambycidae).

PubMed

Guo, Kun; Chen, Jun; Xu, Chang-Qing; Qiao, Hai-Li; Xu, Rong; Zhao, Xiang-Jian

2016-05-01

We sequenced the complete mitochondrial genome of the longhorn beetle, Xylotrechus grayii. The total length of the X. grayii mitogenome was 15,540 bp with an A + T content of 75.29%, consisting of 13 protein-coding genes (PCGs), 22 tRNA genes, 2 rRNA genes and an A + T-rich region. All the genes were arranged in the same order as that of the ancestral insect. All PCGs started with a typical ATN codon except for cox1 and nad1, which used TTG as start codon. Ten out of 13 PCGs terminated with incomplete codons (TA or T). The A + T-rich region was 893 bp in length with an A + T content of 85.89 %.
Lost in Translation: Bioinformatic Analysis of Variations Affecting the Translation Initiation Codon in the Human Genome.

PubMed

Abad, Francisco; de la Morena-Barrio, María Eugenia; Fernández-Breis, Jesualdo Tomás; Corral, Javier

2018-06-01

Translation is a key biological process controlled in eukaryotes by the initiation AUG codon. Variations affecting this codon may have pathological consequences by disturbing the correct initiation of translation. Unfortunately, there is no systematic study describing these variations in the human genome. Moreover, we aimed to develop new tools for in silico prediction of the pathogenicity of gene variations affecting AUG codons, because to date, these gene defects have been wrongly classified as missense. Whole-exome analysis revealed the mean of 12 gene variations per person affecting initiation codons, mostly with high (> 0:01) minor allele frequency (MAF). Moreover, analysis of Ensembl data (December 2017) revealed 11,261 genetic variations affecting the initiation AUG codon of 7,205 genes. Most of these variations (99.5%) have low or unknown MAF, probably reflecting deleterious consequences. Only 62 variations had high MAF. Genetic variations with high MAF had closer alternative AUG downstream codons than did those with low MAF. Besides, the high-MAF group better maintained both the signal peptide and reading frame. These differentiating elements could help to determine the pathogenicity of this kind of variation. Data and scripts in Perl and R are freely available at https://github.com/fanavarro/hemodonacion. jfernand@um.es. Supplementary data are available at Bioinformatics online.
Position-dependent termination and widespread obligatory frameshifting in Euplotes translation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lobanov, Alexei V.; Heaphy, Stephen M.; Turanov, Anton A.

2016-11-21

The ribosome can change its reading frame during translation in a process known as programmed ribosomal frameshifting. These rare events are supported by complex mRNA signals. However, we found that the ciliates Euplotes crassus and Euplotes focardii exhibit widespread frameshifting at stop codons. 47 different codons preceding stop signals resulted in either +1 or +2 frameshifts, and +1 frameshifting at AAA was the most frequent. The frameshifts showed unusual plasticity and rapid evolution, and had little influence on translation rates. The proximity of a stop codon to the 3' mRNA end, rather than its occurrence or sequence context, appeared tomore » designate termination. Thus, a ‘stop codon’ is not a sufficient signal for translation termination, and the default function of stop codons in Euplotes is frameshifting, whereas termination is specific to certain mRNA positions and probably requires additional factors.« less
Identification of Bombyx mori bidensovirus VD1-ORF4 reveals a novel protein associated with viral structural component.

PubMed

Li, Guohui; Hu, Zhaoyang; Guo, Xuli; Li, Guangtian; Tang, Qi; Wang, Peng; Chen, Keping; Yao, Qin

2013-06-01

Bombyx mori bidensovirus (BmBDV) VD1-ORF4 (open reading frame 4, ORF4) consists of 3,318 nucleotides, which codes for a predicted 1,105-amino acid protein containing a conserved DNA polymerase motif. However, its functions in viral propagation remain unknown. In the current study, the transcription of VD1-ORF4 was examined from 6 to 96 h postinfection (p.i.) by RT-PCR, 5'-RACE revealed the transcription initiation site of BmBDV ORF4 to be -16 nucleotides upstream from the start codon, and 3'-RACE revealed the transcription termination site of VD1-ORF4 to be +7 nucleotides downstream from termination codon. Three different proteins were examined in the extracts of BmBDV-infected silkworms midguts by Western blot using raised antibodies against VD1-ORF4 deduced amino acid, and a specific protein band about 53 kDa was further detected in purified virions using the same antibodies. Taken together, BmBDV VD1-ORF4 codes for three or more proteins during the viral life cycle, one of which is a 53 kDa protein and confirmed to be a component of BmBDV virion.
Minigene-like inhibition of protein synthesis mediated by hungry codons near the start codon

PubMed Central

Jacinto-Loeza, Eva; Vivanco-Domínguez, Serafín; Guarneros, Gabriel; Hernández-Sánchez, Javier

2008-01-01

Rare AGA or AGG codons close to the initiation codon inhibit protein synthesis by a tRNA-sequestering mechanism as toxic minigenes do. To further understand this mechanism, a parallel analysis of protein synthesis and peptidyl-tRNA accumulation was performed using both a set of lacZ constructs where AGAAGA codons were moved codon by codon from +2, +3 up to +7, +8 positions and a series of 3–8 codon minigenes containing AGAAGA codons before the stop codon. β-Galactosidase synthesis from the AGAAGA lacZ constructs (in a Pth defective in vitro system without exogenous tRNA) diminished as the AGAAGA codons were closer to AUG codon. Likewise, β-galactosidase expression from the reporter +7 AGA lacZ gene (plus tRNA, 0.25 μg/μl) waned as the AGAAGAUAA minigene shortened. Pth counteracted both the length-dependent minigene effect on the expression of β-galactosidase from the +7 AGA lacZ reporter gene and the positional effect from the AGAAGA lacZ constructs. The +2, +3 AGAAGA lacZ construct and the shortest +2, +3 AGAAGAUAA minigene accumulated the highest percentage of peptidyl-tRNAArg4. These observations lead us to propose that hungry codons at early positions, albeit with less strength, inhibit protein synthesis by a minigene-like mechanism involving accumulation of peptidyl-tRNA. PMID:18583364
A Stem-Loop Structure in Potato Leafroll Virus Open Reading Frame 5 (ORF5) Is Essential for Readthrough Translation of the Coat Protein ORF Stop Codon 700 Bases Upstream.

PubMed

Xu, Yi; Ju, Ho-Jong; DeBlasio, Stacy; Carino, Elizabeth J; Johnson, Richard; MacCoss, Michael J; Heck, Michelle; Miller, W Allen; Gray, Stewart M

2018-06-01

Translational readthrough of the stop codon of the capsid protein (CP) open reading frame (ORF) is used by members of the Luteoviridae to produce their minor capsid protein as a readthrough protein (RTP). The elements regulating RTP expression are not well understood, but they involve long-distance interactions between RNA domains. Using high-resolution mass spectrometry, glutamine and tyrosine were identified as the primary amino acids inserted at the stop codon of Potato leafroll virus (PLRV) CP ORF. We characterized the contributions of a cytidine-rich domain immediately downstream and a branched stem-loop structure 600 to 700 nucleotides downstream of the CP stop codon. Mutations predicted to disrupt and restore the base of the distal stem-loop structure prevented and restored stop codon readthrough. Motifs in the downstream readthrough element (DRTE) are predicted to base pair to a site within 27 nucleotides (nt) of the CP ORF stop codon. Consistent with a requirement for this base pairing, the DRTE of Cereal yellow dwarf virus was not compatible with the stop codon-proximal element of PLRV in facilitating readthrough. Moreover, deletion of the complementary tract of bases from the stop codon-proximal region or the DRTE of PLRV prevented readthrough. In contrast, the distance and sequence composition between the two domains was flexible. Mutants deficient in RTP translation moved long distances in plants, but fewer infection foci developed in systemically infected leaves. Selective 2'-hydroxyl acylation and primer extension (SHAPE) probing to determine the secondary structure of the mutant DRTEs revealed that the functional mutants were more likely to have bases accessible for long-distance base pairing than the nonfunctional mutants. This study reveals a heretofore unknown combination of RNA structure and sequence that reduces stop codon efficiency, allowing translation of a key viral protein. IMPORTANCE Programmed stop codon readthrough is used by many animal and plant viruses to produce key viral proteins. Moreover, such "leaky" stop codons are used in host mRNAs or can arise from mutations that cause genetic disease. Thus, it is important to understand the mechanism(s) of stop codon readthrough. Here, we shed light on the mechanism of readthrough of the stop codon of the coat protein ORFs of viruses in the Luteoviridae by identifying the amino acids inserted at the stop codon and RNA structures that facilitate this "leakiness" of the stop codon. Members of the Luteoviridae encode a C-terminal extension to the capsid protein known as the readthrough protein (RTP). We characterized two RNA domains in Potato leafroll virus (PLRV), located 600 to 700 nucleotides apart, that are essential for efficient RTP translation. We further determined that the PLRV readthrough process involves both local structures and long-range RNA-RNA interactions. Genetic manipulation of the RNA structure altered the ability of PLRV to translate RTP and systemically infect the plant. This demonstrates that plant virus RNA contains multiple layers of information beyond the primary sequence and extends our understanding of stop codon readthrough. Strategic targets that can be exploited to disrupt the virus life cycle and reduce its ability to move within and between plant hosts were revealed. Copyright © 2018 American Society for Microbiology.
Evidence for the recent origin of a bacterial protein-coding, overlapping orphan gene by evolutionary overprinting.

PubMed

Fellner, Lea; Simon, Svenja; Scherling, Christian; Witting, Michael; Schober, Steffen; Polte, Christine; Schmitt-Kopplin, Philippe; Keim, Daniel A; Scherer, Siegfried; Neuhaus, Klaus

2015-12-18

Gene duplication is believed to be the classical way to form novel genes, but overprinting may be an important alternative. Overprinting allows entirely novel proteins to evolve de novo, i.e., formerly non-coding open reading frames within functional genes become expressed. Only three cases have been described for Escherichia coli. Here, a fourth example is presented. RNA sequencing revealed an open reading frame weakly transcribed in cow dung, coding for 101 residues and embedded completely in the -2 reading frame of citC in enterohemorrhagic E. coli. This gene is designated novel overlapping gene, nog1. The promoter region fused to gfp exhibits specific activities and 5' rapid amplification of cDNA ends indicated the transcriptional start 40-bp upstream of the start codon. nog1 was strand-specifically arrested in translation by a nonsense mutation silent in citC. This Nog1-mutant showed a phenotype in competitive growth against wild type in the presence of MgCl2. Small differences in metabolite concentrations were also found. Bioinformatic analyses propose Nog1 to be inner membrane-bound and to possess at least one membrane-spanning domain. A phylogenetic analysis suggests that the orphan gene nog1 arose by overprinting after Escherichia/Shigella separated from the other γ-proteobacteria. Since nog1 is of recent origin, non-essential, short, weakly expressed and only marginally involved in E. coli's central metabolism, we propose that this gene is in an initial stage of evolution. While we present specific experimental evidence for the existence of a fourth overlapping gene in enterohemorrhagic E. coli, we believe that this may be an initial finding only and overlapping genes in bacteria may be more common than is currently assumed by microbiologists.
An MSC2 Promoter-lacZ Fusion Gene Reveals Zinc-Responsive Changes in Sites of Transcription Initiation That Occur across the Yeast Genome

PubMed Central

Wu, Yi-Hsuan; Taggart, Janet; Song, Pamela Xiyao; MacDiarmid, Colin; Eide, David J.

2016-01-01

The Msc2 and Zrg17 proteins of Saccharomyces cerevisiae form a complex to transport zinc into the endoplasmic reticulum. ZRG17 is transcriptionally induced in zinc-limited cells by the Zap1 transcription factor. In this report, we show that MSC2 mRNA also increases (~1.5 fold) in zinc-limited cells. The MSC2 gene has two in-frame ATG codons at its 5’ end, ATG1 and ATG2; ATG2 is the predicted initiation codon. When the MSC2 promoter was fused at ATG2 to the lacZ gene, we found that unlike the chromosomal gene this reporter showed a 4-fold decrease in lacZ mRNA in zinc-limited cells. Surprisingly, β-galactosidase activity generated by this fusion gene increased ~7 fold during zinc deficiency suggesting the influence of post-transcriptional factors. Transcription of MSC2ATG2-lacZ was found to start upstream of ATG1 in zinc-replete cells. In zinc-limited cells, transcription initiation shifted to sites just upstream of ATG2. From the results of mutational and polysome profile analyses, we propose the following explanation for these effects. In zinc-replete cells, MSC2ATG2-lacZ mRNA with long 5’ UTRs fold into secondary structures that inhibit translation. In zinc-limited cells, transcripts with shorter unstructured 5’ UTRs are generated that are more efficiently translated. Surprisingly, chromosomal MSC2 did not show start site shifts in response to zinc status and only shorter 5’ UTRs were observed. However, the shifts that occur in the MSC2ATG2-lacZ construct led us to identify significant transcription start site changes affecting the expression of ~3% of all genes. Therefore, zinc status can profoundly alter transcription initiation across the yeast genome. PMID:27657924
The complete mitochondrial genome of the Korean skate: Hongeo koreana (Rajiformes, Rajidae).

PubMed

Jeong, Dageum; Kim, Sung; Kim, Choong-Gon; Lee, Youn-Ho

2014-12-01

The complete mitochondrial genome of the Korean skate, Hongeo koreana, the sole member of its genus, is investigated for the first time. The genome consists of 16,906 bp in length including 2 rRNA, 22 tRNA and 13 protein coding genes with the same gene order and structure of the genome as those of other Rajidae species. The overall nucleotide composition of the L-strand is A = 29.8%, C = 27.9%, T = 27.9% and G = 14.3%, showing a high A + T bias. The anti-G bias (6.0%) is more significant in the third codon position. Twelve of the 13 protein-coding genes use ATG as their start codon while the COX1 gene starts with GTG. For stop codon, ND3 and ND4 genes show incomplete stop codon T. The mitogenome sequence of H. koreana will provide important information on the evolution and the phylogenetic relation of the genus Hongeo in relation to the other genera of the family Rajidae.
Analysis of the complete genome of peach chlorotic mottle virus: identification of non-AUG start codons, in vitro coat protein expression, and elucidation of serological cross-reactions.

PubMed

James, D; Varga, A; Croft, H

2007-01-01

The entire genome of peach chlorotic mottle virus (PCMV), originally identified as Prunus persica cv. Agua virus (4N6), was sequenced and analysed. PCMV cross-reacts with antisera to diverse viruses, such as plum pox virus (PPV), genus Potyvirus, family Potyviridae; and apple stem pitting virus (ASPV), genus Foveavirus, family Flexiviridae. The PCMV genome consists of 9005 nucleotides (nts), excluding a poly(A) tail at the 3' end of the genome. Five open reading frames (ORFs) were identified with four untranslated regions (UTR) including a 5', a 3', and two intergenic UTRs. The genome organisation of PCMV is similar to that of ASPV and the two genomes share a nucleotide (nt) sequence identity of 58%. PCMV ORF1 encodes the replication-associated protein complex (Mr 241,503), ORF2-ORF4 code for the triple gene block proteins (TGBp; Mr 24,802, 12,370, and 7320, respectively), and ORF5 encodes the coat protein (CP) (Mr 42,505). Two non-AUG start codons participate in the initiation of translation: 35AUC and 7676AUA initiate translation of ORF1 and ORF5. In vitro expression with subsequent Western blot analysis confirmed ORF5 as the CP-encoding gene and confirmed that the codon AUA is able to initiate translation of the CP. Expression of a truncated CP fragment (Mr 39, 689) was demonstrated, and both proteins are expressed in vivo, since both were observed in Western blot analysis of PCMV-infected peach and Nicotiana occidentalis. The expressed proteins cross-reacted with an antiserum against ASPV. The amino acid sequences of the CPs of PCMV and ASPV CP share only 37% identity, but there are 11 shared peptides 4-8 aa residues long. These may constitute linear epitopes responsible for ASPV antiserum cross reactions. No significant common linear epitopes were associated with PPV. Extensive phylogenetic analysis indicates that PCMV is closely related to ASPV and is a new and distinct member of the genus Foveavirus.
Emergence of Highly Pathogenic Avian Influenza A(H5N1) Virus PB1-F2 Variants and Their Virulence in BALB/c Mice

PubMed Central

Kamal, Ram P.; Kumar, Amrita; Davis, Charles T.; Tzeng, Wen-Pin; Nguyen, Tung; Donis, Ruben O.; Katz, Jacqueline M.

2015-01-01

ABSTRACT Influenza A viruses (IAVs) express the PB1-F2 protein from an alternate reading frame within the PB1 gene segment. The roles of PB1-F2 are not well understood but appear to involve modulation of host cell responses. As shown in previous studies, we find that PB1-F2 proteins of mammalian IAVs frequently have premature stop codons that are expected to cause truncations of the protein, whereas avian IAVs usually express a full-length 90-amino-acid PB1-F2. However, in contrast to other avian IAVs, recent isolates of highly pathogenic H5N1 influenza viruses had a high proportion of PB1-F2 truncations (15% since 2010; 61% of isolates in 2013) due to several independent mutations that have persisted and expanded in circulating viruses. One natural H5N1 IAV containing a mutated PB1-F2 start codon (i.e., lacking ATG) was 1,000-fold more virulent for BALB/c mice than a closely related H5N1 containing intact PB1-F2. In vitro, we detected expression of an in-frame protein (C-terminal PB1-F2) from downstream ATGs in PB1-F2 plasmids lacking the well-conserved ATG start codon. Transient expression of full-length PB1-F2, truncated (24-amino-acid) PB1-F2, and PB1-F2 lacking the initiating ATG in mammalian and avian cells had no effect on cell apoptosis or interferon expression in human lung epithelial cells. Full-length and C-terminal PB1-F2 mutants colocalized with mitochondria in A549 cells. Close monitoring of alterations of PB1-F2 and their frequency in contemporary avian H5N1 viruses should continue, as such changes may be markers for mammalian virulence. IMPORTANCE Although most avian influenza viruses are harmless for humans, some (such as highly pathogenic H5N1 avian influenza viruses) are capable of infecting humans and causing severe disease with a high mortality rate. A number of risk factors potentially associated with adaptation to mammalian infection have been noted. Here we demonstrate that the protein PB1-F2 is frequently truncated in recent isolates of highly pathogenic H5N1 viruses. Truncation of PB1-F2 has been proposed to act as an adaptation to mammalian infection. We show that some forms of truncation of PB1-F2 may be associated with increased virulence in mammals. Our data support the assessment of PB1-F2 truncations for genomic surveillance of influenza viruses. PMID:25787281

Molecular Structure and Transformation of the Glucose Dehydrogenase Gene in Drosophila Melanogaster

PubMed Central

Whetten, R.; Organ, E.; Krasney, P.; Cox-Foster, D.; Cavener, D.

1988-01-01

We have precisely mapped and sequenced the three 5' exons of the Drosophila melanogaster Gld gene and have identified the start sites for transcription and translation. The first exon is composed of 335 nucleotides and does not contain any putative translation start codons. The second exon is separated from the first exon by 8 kb and contains the Gld translation start codon. The inferred amino acid sequence of the amino terminus contains two unusual features: three tandem repeats of serine-alanine, and a relatively high density of cysteine residues. P element-mediated transformation experiments demonstrated that a 17.5-kb genomic fragment contains the functional and regulatory components of the Gld gene. PMID:3143620
Euglena gracilis chloroplast DNA: analysis of a 1.6 kb intron of the psb C gene containing an open reading frame of 458 codons.

PubMed

Montandon, P E; Vasserot, A; Stutz, E

1986-01-01

We retrieved a 1.6 kbp intron separating two exons of the psb C gene which codes for the 44 kDa reaction center protein of photosystem II. This intron is 3 to 4 times the size of all previously sequenced Euglena gracilis chloroplast introns. It contains an open reading frame of 458 codons potentially coding for a basic protein of 54 kDa of yet unknown function. The intron boundaries follow consensus sequences established for chloroplast introns related to class II and nuclear pre-mRNA introns. Its 3'-terminal segment has structural features similar to class II mitochondrial introns with an invariant base A as possible branch point for lariat formation.
The complete mitochondrial genome of Hydra vulgaris (Hydroida: Hydridae).

PubMed

Pan, Hong-Chun; Fang, Hong-Yan; Li, Shi-Wei; Liu, Jun-Hong; Wang, Ying; Wang, An-Tai

2014-12-01

The complete mitochondrial genome of Hydra vulgaris (Hydroida: Hydridae) is composed of two linear DNA molecules. The mitochondrial DNA (mtDNA) molecule 1 is 8010 bp long and contains six protein-coding genes, large subunit rRNA, methionine and tryptophan tRNAs, two pseudogenes consisting respectively of a partial copy of COI, and terminal sequences at two ends of the linear mtDNA, while the mtDNA molecule 2 is 7576 bp long and contains seven protein-coding genes, small subunit rRNA, methionine tRNA, a pseudogene consisting of a partial copy of COI and terminal sequences at two ends of the linear mtDNA. COI gene begins with GTG as start codon, whereas other 12 protein-coding genes start with a typical ATG initiation codon. In addition, all protein-coding genes are terminated with TAA as stop codon.
Constructing high complexity synthetic libraries of long ORFs using in vitro selection

NASA Technical Reports Server (NTRS)

Cho, G.; Keefe, A. D.; Liu, R.; Wilson, D. S.; Szostak, J. W.

2000-01-01

We present a method that can significantly increase the complexity of protein libraries used for in vitro or in vivo protein selection experiments. Protein libraries are often encoded by chemically synthesized DNA, in which part of the open reading frame is randomized. There are, however, major obstacles associated with the chemical synthesis of long open reading frames, especially those containing random segments. Insertions and deletions that occur during chemical synthesis cause frameshifts, and stop codons in the random region will cause premature termination. These problems can together greatly reduce the number of full-length synthetic genes in the library. We describe a strategy in which smaller segments of the synthetic open reading frame are selected in vitro using mRNA display for the absence of frameshifts and stop codons. These smaller segments are then ligated together to form combinatorial libraries of long uninterrupted open reading frames. This process can increase the number of full-length open reading frames in libraries by up to two orders of magnitude, resulting in protein libraries with complexities of greater than 10(13). We have used this methodology to generate three types of displayed protein library: a completely random sequence library, a library of concatemerized oligopeptide cassettes with a propensity for forming amphipathic alpha-helical or beta-strand structures, and a library based on one of the most common enzymatic scaffolds, the alpha/beta (TIM) barrel. Copyright 2000 Academic Press.
Evaluation of vector-primed cDNA library production from microgram quantities of total RNA.

PubMed

Kuo, Jonathan; Inman, Jason; Brownstein, Michael; Usdin, Ted B

2004-12-15

cDNA sequences are important for defining the coding region of genes, and full-length cDNA clones have proven to be useful for investigation of the function of gene products. We produced cDNA libraries containing 3.5-5 x 10(5) primary transformants, starting with 5 mug of total RNA prepared from mouse pituitary, adrenal, thymus, and pineal tissue, using a vector-primed cDNA synthesis method. Of approximately 1000 clones sequenced, approximately 20% contained the full open reading frames (ORFs) of known transcripts, based on the presence of the initiating methionine residue codon. The libraries were complex, with 94, 91, 83 and 55% of the clones from the thymus, adrenal, pineal and pituitary libraries, respectively, represented only once. Twenty-five full-length clones, not yet represented in the Mammalian Gene Collection, were identified. Thus, we have produced useful cDNA libraries for the isolation of full-length cDNA clones that are not yet available in the public domain, and demonstrated the utility of a simple method for making high-quality libraries from small amounts of starting material.
Numerical classification of coding sequences

NASA Technical Reports Server (NTRS)

Collins, D. W.; Liu, C. C.; Jukes, T. H.

1992-01-01

DNA sequences coding for protein may be represented by counts of nucleotides or codons. A complete reading frame may be abbreviated by its base count, e.g. A76C158G121T74, or with the corresponding codon table, e.g. (AAA)0(AAC)1(AAG)9 ... (TTT)0. We propose that these numerical designations be used to augment current methods of sequence annotation. Because base counts and codon tables do not require revision as knowledge of function evolves, they are well-suited to act as cross-references, for example to identify redundant GenBank entries. These descriptors may be compared, in place of DNA sequences, to extract homologous genes from large databases. This approach permits rapid searching with good selectivity.
Methylation of class I translation termination factors: structural and functional aspects.

PubMed

Graille, Marc; Figaro, Sabine; Kervestin, Stéphanie; Buckingham, Richard H; Liger, Dominique; Heurgué-Hamard, Valérie

2012-07-01

During protein synthesis, release of polypeptide from the ribosome occurs when an in frame termination codon is encountered. Contrary to sense codons, which are decoded by tRNAs, stop codons present in the A-site are recognized by proteins named class I release factors, leading to the release of newly synthesized proteins. Structures of these factors bound to termination ribosomal complexes have recently been obtained, and lead to a better understanding of stop codon recognition and its coordination with peptidyl-tRNA hydrolysis in bacteria. Release factors contain a universally conserved GGQ motif which interacts with the peptidyl-transferase centre to allow peptide release. The Gln side chain from this motif is methylated, a feature conserved from bacteria to man, suggesting an important biological role. However, methylation is catalysed by completely unrelated enzymes. The function of this motif and its post-translational modification will be discussed in the context of recent structural and functional studies. Copyright © 2012 Elsevier Masson SAS. All rights reserved.
A method for multi-codon scanning mutagenesis of proteins based on asymmetric transposons.

PubMed

Liu, Jia; Cropp, T Ashton

2012-02-01

Random mutagenesis followed by selection or screening is a commonly used strategy to improve protein function. Despite many available methods for random mutagenesis, nearly all generate mutations at the nucleotide level. An ideal mutagenesis method would allow for the generation of 'codon mutations' to change protein sequence with defined or mixed amino acids of choice. Herein we report a method that allows for mutations of one, two or three consecutive codons. Key to this method is the development of a Mu transposon variant with asymmetric terminal sequences. As a demonstration of the method, we performed multi-codon scanning on the gene encoding superfolder GFP (sfGFP). Characterization of 50 randomly chosen clones from each library showed that more than 40% of the mutants in these three libraries contained seamless, in-frame mutations with low site preference. By screening only 500 colonies from each library, we successfully identified several spectra-shift mutations, including a S205D variant that was found to bear a single excitation peak in the UV region.
The complete mitochondrial genome of Chinese green hydra, Hydra sinensis (Hydroida: Hydridae).

PubMed

Pan, Hong-Chun; Qian, Xiao-Cheng; Li, Ping; Li, Xiao-Fei; Wang, An-Tai

2014-02-01

The complete mitochondrial genome of Chinese green hydra, Hydra sinensis (Hydroida: Hydridae) is a linear molecule of 16,189 bp in length, containing 13 protein-coding genes, small and large subunit ribosomal RNAs, methionine and tryptophan transfer RNAs, a pseudogene consisting of a partial copy of COI and terminal sequences at two ends of the linear mitochondrial DNA. The A + T content of the overall base composition of H-strand is 77.2% (T: 41.7%; C: 10.9%; A: 35.5%; and G: 11.9%). COI and ND1 genes begin with GTG as start codon, while other 11 protein-coding genes start with a typical ATG initiation codon. COII, ATP8, ATP6, COIII, ND5, ND6, ND3, ND1, ND4 and COI genes are terminated with TAA as stop codon, ND4L ends with TAG, ND2 ends with TA and Cyt b ends with T.
A 20-basepair duplication in the human thyroid peroxidase gene results in a total iodide organification defect and congenital hypothyroidism

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bikker, H.; Hartog, M.T. den; Gons, M.H.

1994-07-01

In this study, the authors present the molecular basis of a total iodide organification defect causing severe congenital hypothyroidism. In the thyroid gland of the patient, thyroid peroxidase (TPO) activity and the iodination degree of thyroglobulin were below detection limits, and no TPO messenger ribonucleic acid was detectable by Northern blot analysis. Denaturing gradient gel electrophoretic analysis of the TPO gene of the patient revealed a homozygous mutation in exon 2. Sequence analysis showed the presence of a 20-basepair duplication, 47 basepairs down-stream of the ATG start codon. This duplication generates a frame shift, resulting in a termination signal inmore » exon 3, compatible with the complete absence of TPO. Both parents of the patient are heterozygous for the same duplication, confirming the recessive mode of inheritance of the mutation. 32 refs., 4 figs.« less
DNATagger, colors for codons.

PubMed

Scherer, N M; Basso, D M

2008-09-16

DNATagger is a web-based tool for coloring and editing DNA, RNA and protein sequences and alignments. It is dedicated to the visualization of protein coding sequences and also protein sequence alignments to facilitate the comprehension of evolutionary processes in sequence analysis. The distinctive feature of DNATagger is the use of codons as informative units for coloring DNA and RNA sequences. The codons are colored according to their corresponding amino acids. It is the first program that colors codons in DNA sequences without being affected by "out-of-frame" gaps of alignments. It can handle single gaps and gaps inside the triplets. The program also provides the possibility to edit the alignments and change color patterns and translation tables. DNATagger is a JavaScript application, following the W3C guidelines, designed to work on standards-compliant web browsers. It therefore requires no installation and is platform independent. The web-based DNATagger is available as free and open source software at http://www.inf.ufrgs.br/~dmbasso/dnatagger/.
Evolution of Nucleotide Punctuation Marks: From Structural to Linear Signals.

PubMed

El Houmami, Nawal; Seligmann, Hervé

2017-01-01

We present an evolutionary hypothesis assuming that signals marking nucleotide synthesis (DNA replication and RNA transcription) evolved from multi- to unidimensional structures, and were carried over from transcription to translation. This evolutionary scenario presumes that signals combining secondary and primary nucleotide structures are evolutionary transitions. Mitochondrial replication initiation fits this scenario. Some observations reported in the literature corroborate that several signals for nucleotide synthesis function in translation, and vice versa. (a) Polymerase-induced frameshift mutations occur preferentially at translational termination signals (nucleotide deletion is interpreted as termination of nucleotide polymerization, paralleling the role of stop codons in translation). (b) Stem-loop hairpin presence/absence modulates codon-amino acid assignments, showing that translational signals sometimes combine primary and secondary nucleotide structures (here codon and stem-loop). (c) Homopolymer nucleotide triplets (AAA, CCC, GGG, TTT) cause transcriptional and ribosomal frameshifts. Here we find in recently described human mitochondrial RNAs that systematically lack mono-, dinucleotides after each trinucleotide (delRNAs) that delRNA triplets include 2x more homopolymers than mitogenome regions not covered by delRNA. Further analyses of delRNAs show that the natural circular code X (a little-known group of 20 translational signals enabling ribosomal frame retrieval consisting of 20 codons {AAC, AAT, ACC, ATC, ATT, CAG, CTC, CTG, GAA, GAC, GAG, GAT, GCC, GGC, GGT, GTA, GTC, GTT, TAC, TTC} universally overrepresented in coding versus other frames of gene sequences), regulates frameshift in transcription and translation. This dual transcription and translation role confirms for X the hypothesis that translational signals were carried over from transcriptional signals.
Foamy virus reverse transcriptase is expressed independently from the Gag protein.

PubMed Central

Enssle, J; Jordan, I; Mauer, B; Rethwilm, A

1996-01-01

In the foamy virus (FV) subgroup of retroviruses the pol genes are located in the +1 reading frame relative to the gag genes and possess potential ATG initiation codons in their 5' regions. This genome organization suggests either a + 1 ribosomal frameshift to generate a Gag-Pol fusion protein, similar to all other retroviruses studied so far, or new initiation of Pol translation, as used by pararetroviruses, to express the Pol protein. By using a genetic approach we have ruled out the former possibility and provide evidence for the latter. Two down-mutations (M53 and M54) of the pol ATG codon were found to abolish replication and Pol protein expression of the human FV isolate. The introduction of a new ATG in mutation M55, 3' to the down-mutated ATG of mutation M53, restored replication competence, indicating that the pol ATG functions as a translational initiation codon. Two nonsense mutants (M56 and M57), which functionally separated gag and pol with respect to potential frame-shifting sites, were also replication-competent, providing further genetic evidence that FVs express the Pol protein independently from Gag. Our results show that during a particular step of the replication cycle, FVs differ fundamentally from all other retroviruses. Images Fig. 3 PMID:8633029
Xylanase II from an alkaliphilic thermophilic Bacillus with a distinctly different structure from other xylanases: evolutionary relationship to alkaliphilic xylanases.

PubMed

Kulkarni, N; Lakshmikumaran, M; Rao, M

1999-10-05

A 1.0 kilobase gene fragment from the genomic DNA of an alkaliphilic thermophilic Bacillus was found to code for a functional xylanase (XynII). The complete nucleotide sequence including the structural gene and the 5' and 3' flanking sequences of the xylanase gene have been determined. An open reading frame starting from ATG initiator codon comprising 402 nucleotides gave a preprotein of 133 amino acids of calculated molecular mass 14.090 kDa. The occurrence of three potential N-glycosylation sites in XynII gene is a unique feature for a gene of bacterial origin. The stop codon was followed by hairpin loop structures indicating the presence of transcription termination signals. The secondary structure analysis of XynII predicted that the polypeptide was primarily formed of beta-sheets. XynII appeared to be a member of family G/11 of xylanases based on its molecular weight and basic pI (8.0). However, sequence homology revealed similar identity with families 10 and 11 of xylanases. The conserved triad (Val-Val-Xaa, where Xaa is Asn or Asp) was identified only in the xylanases from alkaliphilic organisms. Our results implicate for the first time the concept of convergent evolution for XynII and provide a basis for research in evolutionary relationship among the xylanases from alkaliphilic and neutrophilic organisms. Copyright 1999 Academic Press.
Codon optimisation to improve expression of a Mycobacterium avium ssp. paratuberculosis-specific membrane-associated antigen by Lactobacillus salivarius.

PubMed

Johnston, Christopher; Douarre, Pierre E; Soulimane, Tewfik; Pletzer, Daniel; Weingart, Helge; MacSharry, John; Coffey, Aidan; Sleator, Roy D; O'Mahony, Jim

2013-06-01

Subunit and DNA-based vaccines against Mycobacterium avium ssp. paratuberculosis (MAP) attempt to overcome inherent issues associated with whole-cell formulations. However, these vaccines can be hampered by poor expression of recombinant antigens from a number of disparate hosts. The high G+C content of MAP invariably leads to a codon bias throughout gene expression. To investigate if the codon bias affects recombinant MAP antigen expression, the open reading frame of a MAP-specific antigen MptD (MAP3733c) was codon optimised for expression against a Lactobacillus salivarius host. Of the total 209 codons which constitute MAP3733c, 172 were modified resulting in a reduced G+C content from 61% for the native gene to 32.7% for the modified form. Both genes were placed under the transcriptional control of the PnisA promoter; allowing controlled heterologous expression in L. salivarius. Expression was monitored using fluorescence microscopy and microplate fluorometry via GFP tags translationally fused to the C-termini of the two MptD genes. A > 37-fold increase in expression was observed for the codon-optimised MAP3733synth variant over the native gene. Due to the low cost and improved expression achieved, codon optimisation significantly improves the potential of L. salivarius as an oral vaccine stratagem against Johne's disease. © 2013 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.
Cell illustrator 4.0: a computational platform for systems biology.

PubMed

Nagasaki, Masao; Saito, Ayumu; Jeong, Euna; Li, Chen; Kojima, Kaname; Ikeda, Emi; Miyano, Satoru

2011-01-01

Cell Illustrator is a software platform for Systems Biology that uses the concept of Petri net for modeling and simulating biopathways. It is intended for biological scientists working at bench. The latest version of Cell Illustrator 4.0 uses Java Web Start technology and is enhanced with new capabilities, including: automatic graph grid layout algorithms using ontology information; tools using Cell System Markup Language (CSML) 3.0 and Cell System Ontology 3.0; parameter search module; high-performance simulation module; CSML database management system; conversion from CSML model to programming languages (FORTRAN, C, C++, Java, Python and Perl); import from SBML, CellML, and BioPAX; and, export to SVG and HTML. Cell Illustrator employs an extension of hybrid Petri net in an object-oriented style so that biopathway models can include objects such as DNA sequence, molecular density, 3D localization information, transcription with frame-shift, translation with codon table, as well as biochemical reactions.
Cell Illustrator 4.0: a computational platform for systems biology.

PubMed

Nagasaki, Masao; Saito, Ayumu; Jeong, Euna; Li, Chen; Kojima, Kaname; Ikeda, Emi; Miyano, Satoru

2010-01-01

Cell Illustrator is a software platform for Systems Biology that uses the concept of Petri net for modeling and simulating biopathways. It is intended for biological scientists working at bench. The latest version of Cell Illustrator 4.0 uses Java Web Start technology and is enhanced with new capabilities, including: automatic graph grid layout algorithms using ontology information; tools using Cell System Markup Language (CSML) 3.0 and Cell System Ontology 3.0; parameter search module; high-performance simulation module; CSML database management system; conversion from CSML model to programming languages (FORTRAN, C, C++, Java, Python and Perl); import from SBML, CellML, and BioPAX; and, export to SVG and HTML. Cell Illustrator employs an extension of hybrid Petri net in an object-oriented style so that biopathway models can include objects such as DNA sequence, molecular density, 3D localization information, transcription with frame-shift, translation with codon table, as well as biochemical reactions.
A mechanism for exon skipping caused by nonsense or missense mutations in BRCA1 and other genes.

PubMed

Liu, H X; Cartegni, L; Zhang, M Q; Krainer, A R

2001-01-01

Point mutations can generate defective and sometimes harmful proteins. The nonsense-mediated mRNA decay (NMD) pathway minimizes the potential damage caused by nonsense mutations. In-frame nonsense codons located at a minimum distance upstream of the last exon-exon junction are recognized as premature termination codons (PTCs), targeting the mRNA for degradation. Some nonsense mutations cause skipping of one or more exons, presumably during pre-mRNA splicing in the nucleus; this phenomenon is termed nonsense-mediated altered splicing (NAS), and its underlying mechanism is unclear. By analyzing NAS in BRCA1, we show here that inappropriate exon skipping can be reproduced in vitro, and results from disruption of a splicing enhancer in the coding sequence. Enhancers can be disrupted by single nonsense, missense and translationally silent point mutations, without recognition of an open reading frame as such. These results argue against a nuclear reading-frame scanning mechanism for NAS. Coding-region single-nucleotide polymorphisms (cSNPs) within exonic splicing enhancers or silencers may affect the patterns or efficiency of mRNA splicing, which may in turn cause phenotypic variability and variable penetrance of mutations elsewhere in a gene.
Defragged Binary I Ching Genetic Code Chromosomes Compared to Nirenberg’s and Transformed into Rotating 2D Circles and Squares and into a 3D 100% Symmetrical Tetrahedron Coupled to a Functional One to Discern Start From Non-Start Methionines through a Stella Octangula

PubMed Central

Castro-Chavez, Fernando

2012-01-01

Background Three binary representations of the genetic code according to the ancient I Ching of Fu-Xi will be presented, depending on their defragging capabilities by pairing based on three biochemical properties of the nucleic acids: H-bonds, Purine/Pyrimidine rings, and the Keto-enol/Amino-imino tautomerism, yielding the last pair a 32/32 single-strand self-annealed genetic code and I Ching tables. Methods Our working tool is the ancient binary I Ching's resulting genetic code chromosomes defragged by vertical and by horizontal pairing, reverse engineered into non-binaries of 2D rotating 4×4×4 circles and 8×8 squares and into one 3D 100% symmetrical 16×4 tetrahedron coupled to a functional tetrahedron with apical signaling and central hydrophobicity (codon formula: 4[1(1)+1(3)+1(4)+4(2)]; 5:5, 6:6 in man) forming a stella octangula, and compared to Nirenberg's 16×4 codon table (1965) pairing the first two nucleotides of the 64 codons in axis y. Results One horizontal and one vertical defragging had the start Met at the center. Two, both horizontal and vertical pairings produced two pairs of 2×8×4 genetic code chromosomes naturally arranged (M and I), rearranged by semi-introversion of central purines or pyrimidines (M' and I') and by clustering hydrophobic amino acids; their quasi-identity was disrupted by amino acids with odd codons (Met and Tyr pairing to Ile and TGA Stop); in all instances, the 64-grid 90° rotational ability was restored. Conclusions We defragged three I Ching representations of the genetic code while emphasizing Nirenberg's historical finding. The synthetic genetic code chromosomes obtained reflect the protective strategy of enzymes with a similar function, having both humans and mammals a biased G-C dominance of three H-bonds in the third nucleotide of their most used codons per amino acid, as seen in one chromosome of the i, M and M' genetic codes, while a two H-bond A-T dominance was found in their complementary chromosome, as seen in invertebrates and plants. The reverse engineering of chromosome I' into 2D rotating circles and squares was undertaken, yielding a 100% symmetrical 3D geometry which was coupled to a previously obtained genetic code tetrahedron in order to differentiate the start methionine from the methionine that is acting as a codifying non-start codon. PMID:23431415
The Acheta domesticus Densovirus, Isolated from the European House Cricket, Has Evolved an Expression Strategy Unique among Parvoviruses▿†

PubMed Central

Liu, Kaiyu; Li, Yi; Jousset, Françoise-Xavière; Zadori, Zoltan; Szelei, Jozsef; Yu, Qian; Pham, Hanh Thi; Lépine, François; Bergoin, Max; Tijssen, Peter

2011-01-01

The Acheta domesticus densovirus (AdDNV), isolated from crickets, has been endemic in Europe for at least 35 years. Severe epizootics have also been observed in American commercial rearings since 2009 and 2010. The AdDNV genome was cloned and sequenced for this study. The transcription map showed that splicing occurred in both the nonstructural (NS) and capsid protein (VP) multicistronic RNAs. The splicing pattern of NS mRNA predicted 3 nonstructural proteins (NS1 [576 codons], NS2 [286 codons], and NS3 [213 codons]). The VP gene cassette contained two VP open reading frames (ORFs), of 597 (ORF-A) and 268 (ORF-B) codons. The VP2 sequence was shown by N-terminal Edman degradation and mass spectrometry to correspond with ORF-A. Mass spectrometry, sequencing, and Western blotting of baculovirus-expressed VPs versus native structural proteins demonstrated that the VP1 structural protein was generated by joining ORF-A and -B via splicing (splice II), eliminating the N terminus of VP2. This splice resulted in a nested set of VP1 (816 codons), VP3 (467 codons), and VP4 (429 codons) structural proteins. In contrast, the two splices within ORF-B (Ia and Ib) removed the donor site of intron II and resulted in VP2, VP3, and VP4 expression. ORF-B may also code for several nonstructural proteins, of 268, 233, and 158 codons. The small ORF-B contains the coding sequence for a phospholipase A2 motif found in VP1, which was shown previously to be critical for cellular uptake of the virus. These splicing features are unique among parvoviruses and define a new genus of ambisense densoviruses. PMID:21775445

Identification of a novel selD homolog from Eukaryotes, Bacteria, and Archaea: Is there an autoregulatory mechanism in selenocysteine metabolism?

PubMed Central

Guimarães, M. Jorge; Peterson, David; Vicari, Alain; Cocks, Benjamin G.; Copeland, Neal G.; Gilbert, Debra J.; Jenkins, Nancy A.; Ferrick, David A.; Kastelein, Robert A.; Bazan, J. Fernando; Zlotnik, Albert

1996-01-01

Escherichia coli selenophosphate synthetase (SPS, the selD gene product) catalyzes the production of monoselenophosphate, the selenium donor compound required for synthesis of selenocysteine (Sec) and seleno-tRNAs. We report the molecular cloning of human and mouse homologs of the selD gene, designated Sps2, which contains an in-frame TGA codon at a site corresponding to the enzyme’s putative active site. These sequences allow the identification of selD gene homologs in the genomes of the bacterium Haemophilus influenzae and the archaeon Methanococcus jannaschii, which had been previously misinterpreted due to their in-frame TGA codon. Sps2 mRNA levels are elevated in organs previously implicated in the synthesis of selenoproteins and in active sites of blood cell development. In addition, we show that Sps2 mRNA is up-regulated upon activation of T lymphocytes and have mapped the Sps2 gene to mouse chromosome 7. Using the mouse gene isolated from the hematopoietic cell line FDCPmixA4, we devised a construct for protein expression that results in the insertion of a FLAG tag sequence at the N terminus of the SPS2 protein. This strategy allowed us to document the readthrough of the in-frame TGA codon and the incorporation of 75Se into SPS2. These results suggest the existence of an autoregulatory mechanism involving the incorporation of Sec into SPS2 that might be relevant to blood cell biology. This mechanism is likely to have been present in ancient life forms and conserved in a variety of living organisms from all domains of life. PMID:8986768
The Jigsaw Puzzle of mRNA Translation Initiation in Eukaryotes: A Decade of Structures Unraveling the Mechanics of the Process.

PubMed

Hashem, Yaser; Frank, Joachim

2018-03-01

Translation initiation in eukaryotes is a highly regulated and rate-limiting process. It results in the assembly and disassembly of numerous transient and intermediate complexes involving over a dozen eukaryotic initiation factors (eIFs). This process culminates in the accommodation of a start codon marking the beginning of an open reading frame at the appropriate ribosomal site. Although this process has been extensively studied by hundreds of groups for nearly half a century, it has been only recently, especially during the last decade, that we have gained deeper insight into the mechanics of the eukaryotic translation initiation process. This advance in knowledge is due in part to the contributions of structural biology, which have shed light on the molecular mechanics underlying the different functions of various eukaryotic initiation factors. In this review, we focus exclusively on the contribution of structural biology to the understanding of the eukaryotic initiation process, a long-standing jigsaw puzzle that is just starting to yield the bigger picture. Expected final online publication date for the Annual Review of Biophysics Volume 47 is May 20, 2018. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
Ribosomal protein S14 transcripts are edited in Oenothera mitochondria.

PubMed Central

Schuster, W; Unseld, M; Wissinger, B; Brennicke, A

1990-01-01

The gene encoding ribosomal protein S14 (rps14) in Oenothera mitochondria is located upstream of the cytochrome b gene (cob). Sequence analysis of independently derived cDNA clones covering the entire rps14 coding region shows two nucleotides edited from the genomic DNA to the mRNA derived sequences by C to U modifications. A third editing event occurs four nucleotides upstream of the AUG initiation codon and improves a potential ribosome binding site. A CGG codon specifying arginine in a position conserved in evolution between chloroplasts and E. coli as a UGG tryptophan codon is not edited in any of the cDNAs analysed. An inverted repeat 3' of an unidentified open reading frame is located upstream of the rps14 gene. The inverted repeat sequence is highly conserved at analogous regions in other Oenothera mitochondrial loci. Images PMID:2326162
HCV IRES domain IIb affects the configuration of coding RNA in the 40S subunit's decoding groove

PubMed Central

Filbin, Megan E.; Kieft, Jeffrey S.

2011-01-01

Hepatitis C virus (HCV) uses a structured internal ribosome entry site (IRES) RNA to recruit the translation machinery to the viral RNA and begin protein synthesis without the ribosomal scanning process required for canonical translation initiation. Different IRES structural domains are used in this process, which begins with direct binding of the 40S ribosomal subunit to the IRES RNA and involves specific manipulation of the translational machinery. We have found that upon initial 40S subunit binding, the stem–loop domain of the IRES that contains the start codon unwinds and adopts a stable configuration within the subunit's decoding groove. This configuration depends on the sequence and structure of a different stem–loop domain (domain IIb) located far from the start codon in sequence, but spatially proximal in the IRES•40S complex. Mutation of domain IIb results in misconfiguration of the HCV RNA in the decoding groove that includes changes in the placement of the AUG start codon, and a substantial decrease in the ability of the IRES to initiate translation. Our results show that two distal regions of the IRES are structurally communicating at the initial step of 40S subunit binding and suggest that this is an important step in driving protein synthesis. PMID:21606179
HCV IRES domain IIb affects the configuration of coding RNA in the 40S subunit's decoding groove.

PubMed

Filbin, Megan E; Kieft, Jeffrey S

2011-07-01

Hepatitis C virus (HCV) uses a structured internal ribosome entry site (IRES) RNA to recruit the translation machinery to the viral RNA and begin protein synthesis without the ribosomal scanning process required for canonical translation initiation. Different IRES structural domains are used in this process, which begins with direct binding of the 40S ribosomal subunit to the IRES RNA and involves specific manipulation of the translational machinery. We have found that upon initial 40S subunit binding, the stem-loop domain of the IRES that contains the start codon unwinds and adopts a stable configuration within the subunit's decoding groove. This configuration depends on the sequence and structure of a different stem-loop domain (domain IIb) located far from the start codon in sequence, but spatially proximal in the IRES•40S complex. Mutation of domain IIb results in misconfiguration of the HCV RNA in the decoding groove that includes changes in the placement of the AUG start codon, and a substantial decrease in the ability of the IRES to initiate translation. Our results show that two distal regions of the IRES are structurally communicating at the initial step of 40S subunit binding and suggest that this is an important step in driving protein synthesis.
DsrA regulatory RNA represses both hns and rbsD mRNAs through distinct mechanisms in Escherichia coli.

PubMed

Lalaouna, David; Morissette, Audrey; Carrier, Marie-Claude; Massé, Eric

2015-10-01

The 87 nucleotide long DsrA sRNA has been mostly studied for its translational activation of the transcriptional regulator RpoS. However, it also represses hns mRNA, which encodes H-NS, a major regulator that affects expression of nearly 5% of Escherichia coli genes. A speculative model previously suggested that DsrA would block hns mRNA translation by binding simultaneously to start and stop codon regions of hns mRNA (coaxial model). Here, we show that DsrA efficiently blocked translation of hns mRNA by base-pairing immediately downstream of the start codon. In addition, DsrA induced hns mRNA degradation by actively recruiting the RNA degradosome complex. Data presented here led to a model of DsrA action on hns mRNA, which supports a canonical mechanism of sRNA-induced mRNA degradation by binding to the translation initiation region. Furthermore, using MS2-affinity purification coupled with RNA sequencing technology (MAPS), we also demonstrated that DsrA targets rbsD mRNA, involved in ribose utilization. Surprisingly, DsrA base pairs far downstream of rbsD start codon and induces rapid degradation of the transcript. Thus, our study enables us to draw an extended DsrA targetome. © 2015 John Wiley & Sons Ltd.
Molecular Genetic Analysis and Evolution of Segment 7 in Rice Black-Streaked Dwarf Virus in China

PubMed Central

Chen, Yanping; Wu, Jirong; Meng, Qingchang; Han, Xiaohua; Hao, Zhuanfang; Li, Mingshun; Yong, Hongjun; Zhang, Degui; Zhang, Shihuang; Li, Xinhai

2015-01-01

Rice black-streaked dwarf virus (RBSDV) causes maize rough dwarf disease or rice black-streaked dwarf disease and can lead to severe yield losses in maize and rice. To analyse RBSDV evolution, codon usage bias and genetic structure were investigated in 111 maize and rice RBSDV isolates from eight geographic locations in 2013 and 2014. The linear dsRNA S7 is A+U rich, with overall codon usage biased toward codons ending with A (A3s, S7-1: 32.64%, S7-2: 29.95%) or U (U3s, S7-1: 44.18%, S7-2: 46.06%). Effective number of codons (Nc) values of 45.63 in S7-1 (the first open reading frame of S7) and 39.96 in S7-2 (the second open reading frame of S7) indicate low degrees of RBSDV-S7 codon usage bias, likely driven by mutational bias regardless of year, host, or geographical origin. Twelve optimal codons were detected in S7. The nucleotide diversity (π) of S7 sequences in 2013 isolates (0.0307) was significantly higher than in 2014 isolates (0.0244, P = 0.0226). The nucleotide diversity (π) of S7 sequences in isolates from Jinan (0.0391) was higher than that from the other seven locations (P < 0.01). Only one S7 recombinant was detected in Baoding. RBSDV isolates could be phylogenetically classified into two groups according to S7 sequences, and further classified into two subgroups. S7-1 and S7-2 were under negative and purifying selection, with respective Ka/Ks ratios of 0.0179 and 0.0537. These RBSDV populations were expanding (P < 0.01) as indicated by negative values for Tajima's D, Fu and Li's D, and Fu and Li's F. Genetic differentiation was detected in six RBSDV subpopulations (P < 0.05). Absolute Fst (0.0790) and Nm (65.12) between 2013 and 2014, absolute Fst (0.1720) and Nm (38.49) between maize and rice, and absolute Fst values of 0.0085-0.3069 and Nm values of 0.56-29.61 among these eight geographic locations revealed frequent gene flow between subpopulations. Gene flow between 2013 and 2014 was the most frequent. PMID:26121638
Novel coding, translation, and gene expression of a replicating covalently closed circular RNA of 220 nt.

PubMed

AbouHaidar, Mounir Georges; Venkataraman, Srividhya; Golshani, Ashkan; Liu, Bolin; Ahmad, Tauqeer

2014-10-07

The highly structured (64% GC) covalently closed circular (CCC) RNA (220 nt) of the virusoid associated with rice yellow mottle virus codes for a 16-kDa highly basic protein using novel modalities for coding, translation, and gene expression. This CCC RNA is the smallest among all known viroids and virusoids and the only one that codes proteins. Its sequence possesses an internal ribosome entry site and is directly translated through two (or three) completely overlapping ORFs (shifting to a new reading frame at the end of each round). The initiation and termination codons overlap UGAUGA (underline highlights the initiation codon AUG within the combined initiation-termination sequence). Termination codons can be ignored to obtain larger read-through proteins. This circular RNA with no noncoding sequences is a unique natural supercompact "nanogenome."
Infection of capilloviruses requires subgenomic RNAs whose transcription is controlled by promoter-like sequences conserved among flexiviruses.

PubMed

Komatsu, Ken; Hirata, Hisae; Fukagawa, Takako; Yamaji, Yasuyuki; Okano, Yukari; Ishikawa, Kazuya; Adachi, Tatsushi; Maejima, Kensaku; Hashimoto, Masayoshi; Namba, Shigetou

2012-07-01

The first open-reading frame (ORF) of apple stem grooving virus (ASGV), of the genus Capillovirus, encodes an apparently chimeric polyprotein containing conserved regions for replicase (Rep) and coat protein (CP). However, our previous study revealed that ASGV mutants with distinct and discontinuous Rep- and CP-coding regions successfully infect plants, indicating that CP expressed via a subgenomic RNA (sgRNA) is sufficient for viability of the virus. Here we identified a transcription start site of the CP sgRNA and revealed that CP translated from the sgRNA is essential for ASGV infection. We mapped the transcription start sites of both the CP and the movement protein (MP) sgRNAs of ASGV and found a hexanucleotide motif, UUAGGU, conserved upstream from both sgRNA transcription start sites. Mutational analysis of the putative CP initiation codon and of the UUAGGU sequence upstream from the transcription start site of CP sgRNA demonstrated their importance for ASGV accumulation. Our results also demonstrated that potato virus T (PVT), an unassigned species closely related to ASGV, produces two sgRNAs putatively deployed for the CP and MP expression and that the same hexanucleotide motif as found in ASGV is located upstream from the transcription start sites of both sgRNAs. This motif, which constituted putative core elements of the sgRNA promoter, is broadly conserved among viruses in the families Alphaflexiviridae and Betaflexiviridae, suggesting that the gene expression strategy of the viruses in both families has been conserved throughout evolution. Copyright © 2012 Elsevier B.V. All rights reserved.
Regulation of translation by upstream translation initiation codons of surfactant protein A1 splice variants

PubMed Central

Tsotakos, Nikolaos; Silveyra, Patricia; Lin, Zhenwu; Thomas, Neal; Vaid, Mudit

2014-01-01

Surfactant protein A (SP-A), a molecule with roles in lung innate immunity and surfactant-related functions, is encoded by two genes in humans: SFTPA1 (SP-A1) and SFTPA2 (SP-A2). The mRNAs from these genes differ in their 5′-untranslated regions (5′-UTR) due to differential splicing. The 5′-UTR variant ACD′ is exclusively found in transcripts of SP-A1, but not in those of SP-A2. Its unique exon C contains two upstream AUG codons (uAUGs) that may affect SP-A1 translation efficiency. The first uAUG (u1) is in frame with the primary start codon (p), but the second one (u2) is not. The purpose of this study was to assess the impact of uAUGs on SP-A1 expression. We employed RT-qPCR to determine the presence of exon C-containing SP-A1 transcripts in human RNA samples. We also used in vitro techniques including mutagenesis, reporter assays, and toeprinting analysis, as well as in silico analyses to determine the role of uAUGs. Exon C-containing mRNA is present in most human lung tissue samples and its expression can, under certain conditions, be regulated by factors such as dexamethasone or endotoxin. Mutating uAUGs resulted in increased luciferase activity. The mature protein size was not affected by the uAUGs, as shown by a combination of toeprint and in silico analysis for Kozak sequence, secondary structure, and signal peptide and in vitro translation in the presence of microsomes. In conclusion, alternative splicing may introduce uAUGs in SP-A1 transcripts, which in turn negatively affect SP-A1 translation, possibly affecting SP-A1/SP-A2 ratio, with potential for clinical implication. PMID:25326576
Functional analysis of the promoter of the molt-inhibiting hormone (mih) gene in mud crab Scylla paramamosain.

PubMed

Zhang, Xin; Huang, Danping; Jia, Xiwei; Zou, Zhihua; Wang, Yilei; Zhang, Ziping

2018-04-01

In this study, the 5'-flanking region of molt-inhibiting hormone (MIH) gene was cloned by Tail-PCR. It is 2024 bp starting from the translation initiation site, and 1818 bp starting from the predicted transcription start site. Forecast analysis results by the bioinformatics software showed that the transcription start site is located at 207 bp upstream of the start codon ATG, and TATA box is located at 240 bp upstream of the start codon ATG. Potential transcription factor binding sites include Sp1, NF-1, Oct-1, Sox-2, RAP1, and so on. There are two CpG islands, located at -25- +183 bp and -1451- -1316 bp respectively. The transfection results of luciferase reporter constructs showed that the core promoter region was located in the fragment -308 bp to -26 bp. NF-kappaB and RAP1 were essential for mih basal transcriptional activity. There are three kinds of polymorphism CA in the 5'-flanking sequence, and they can influence mih promoter activity. These findings provide a genetic foundation of the further research of mih transcription regulation. Copyright © 2017 Elsevier Inc. All rights reserved.
Analysis of the cbhE' plasmid gene from acute disease-causing isolates of Coxiella burnetii.

PubMed

Minnick, M F; Small, C L; Frazier, M E; Mallavia, L P

1991-07-15

A gene termed cbhE' was cloned from the QpH1 plasmid of Coxiella burnetii. Expression of recombinants containing cbhE' in vitro and in Escherichia coli maxicells, produced an insert-encoded polypeptide of approx. 42 kDa. The CbhE protein was not cleaved when intact maxicells were treated with trypsin. Hybridizations of total DNA isolated from the six strains of C. burnetii indicate that this gene is unique to C. burnetii strains associated with acute disease, i.e., Hamilton[I], Vacca[II], and Rasche[III]. The cbhE' gene was not detected in strains associated with chronic disease (Biotzere[IV] and Corazon[V]) or the Dod[VI] strain. The cbhE' open reading frame (ORF) is 1022 bp in length and is preceded by a predicted promoter/Shine-Dalgarno (SD) region of TCAACT(-35)-N16-TAAAAT(-10)-N14-AGAAGGA (SD) located 10 nucleotides (nt) before the presumed AUG start codon. The ORF ends with a single UAA stop codon and has no apparent Rho-factor-independent terminator following it. The cbhE' gene codes for the CbhE protein of 341 amino acid (aa) residues with a deduced Mr of 39,442. CbhE is predominantly hydrophilic with a predicted pI of 4.43. The function of CbhE is unknown. No nt or aa sequences with homology to cbhE' or CbhE, respectively, were found in searches of a number of data bases.
Possibilities for the evolution of the genetic code from a preceding form

NASA Technical Reports Server (NTRS)

Jukes, T. H.

1973-01-01

Analysis of the interaction between mRNA codons and tRNA anticodons suggests a model for the evolution of the genetic code. Modification of the nucleic acid following the anticodon is at present essential in both eukaryotes and prokaryotes to ensure fidelity of translation of codons starting with A, and the amino acids which could be coded for before the evolution of the modifying enzymes can be deduced.
Upstream open reading frames regulate the expression of the nuclear Wnt13 isoforms

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tang Tao; Rector, Kyle; Barnett, Corey D.

2008-02-22

Wnt proteins control cell survival and cell fate during development. Although Wnt expression is tightly regulated in a spatio-temporal manner, the mechanisms involved both at the transcriptional and translational levels are poorly defined. We have identified a downstream translation initiation codon, AUG(+74), in Wnt13B and Wnt13C mRNAs responsible for the expression of Wnt13 nuclear forms. In this report, we demonstrate that the expression of the nuclear Wnt13C form is translationally regulated in response to stress and apoptosis. Though the 5'-leaders of both Wnt13C and Wnt13B mRNAs have an inhibitory effect on translation, they did not display an internal ribosome entrymore » site activity as demonstrated by dicistronic reporter assays. However, mutations or deletions of the upstream AUG(-99) and AUG(+1) initiation codons abrogate these translation inhibitory effects, demonstrating that Wnt13C expression is controlled by upstream open reading frames. Since long 5'-untranslated region with short upstream open reading frames characterize other Wnt transcripts, our present data on the translational control of Wnt13 expression open the way to further studies on the translation control of Wnt expression as a modulator of their subcellular localization and activity.« less
A decamer duplication in the 3′ region of the BRI gene originates an amyloid peptide that is associated with dementia in a Danish kindred

PubMed Central

Vidal, Ruben; Révész, Tamas; Rostagno, Agueda; Kim, Eugene; Holton, Janice L.; Bek, Toke; Bojsen-Møller, Marie; Braendgaard, Hans; Plant, Gordon; Ghiso, Jorge; Frangione, Blas

2000-01-01

Familial Danish dementia (FDD), also known as heredopathia ophthalmo-oto-encephalica, is an autosomal dominant disorder characterized by cataracts, deafness, progressive ataxia, and dementia. Neuropathological findings include severe widespread cerebral amyloid angiopathy, hippocampal plaques, and neurofibrillary tangles, similar to Alzheimer's disease. N-terminal sequence analysis of isolated leptomeningeal amyloid fibrils revealed homology to ABri, the peptide originated by a point mutation at the stop codon of gene BRI in familial British dementia. Molecular genetic analysis of the BRI gene in the Danish kindred showed a different defect, namely the presence of a 10-nt duplication (795–796insTTTAATTTGT) between codons 265 and 266, one codon before the normal stop codon 267. The decamer duplication mutation produces a frame-shift in the BRI sequence generating a larger-than-normal precursor protein, of which the amyloid subunit (designated ADan) comprises the last 34 C-terminal amino acids. This de novo-created amyloidogenic peptide, associated with a genetic defect in the Danish kindred, stresses the importance of amyloid formation as a causative factor in neurodegeneration and dementia. PMID:10781099
5’-Terminal AUGs in Escherichia coli mRNAs with Shine-Dalgarno Sequences: Identification and Analysis of Their Roles in Non-Canonical Translation Initiation

PubMed Central

Beck, Heather J.; Fleming, Ian M. C.

2016-01-01

Analysis of the Escherichia coli transcriptome identified a unique subset of messenger RNAs (mRNAs) that contain a conventional untranslated leader and Shine-Dalgarno (SD) sequence upstream of the gene’s start codon while also containing an AUG triplet at the mRNA’s 5’- terminus (5’-uAUG). Fusion of the coding sequence specified by the 5’-terminal putative AUG start codon to a lacZ reporter gene, as well as primer extension inhibition assays, reveal that the majority of the 5’-terminal upstream open reading frames (5’-uORFs) tested support some level of lacZ translation, indicating that these mRNAs can function both as leaderless and canonical SD-leadered mRNAs. Although some of the uORFs were expressed at low levels, others were expressed at levels close to that of the respective downstream genes and as high as the naturally leaderless cI mRNA of bacteriophage λ. These 5’-terminal uORFs potentially encode peptides of varying lengths, but their functions, if any, are unknown. In an effort to determine whether expression from the 5’-terminal uORFs impact expression of the immediately downstream cistron, we examined expression from the downstream coding sequence after mutations were introduced that inhibit efficient 5’-uORF translation. These mutations were found to affect expression from the downstream cistrons to varying degrees, suggesting that some 5’-uORFs may play roles in downstream regulation. Since the 5’-uAUGs found on these conventionally leadered mRNAs can function to bind ribosomes and initiate translation, this indicates that canonical mRNAs containing 5’-uAUGs should be examined for their potential to function also as leaderless mRNAs. PMID:27467758
On the Evolution of the Standard Genetic Code: Vestiges of Critical Scale Invariance from the RNA World in Current Prokaryote Genomes

PubMed Central

José, Marco V.; Govezensky, Tzipe; García, José A.; Bobadilla, Juan R.

2009-01-01

Herein two genetic codes from which the primeval RNA code could have originated the standard genetic code (SGC) are derived. One of them, called extended RNA code type I, consists of all codons of the type RNY (purine-any base-pyrimidine) plus codons obtained by considering the RNA code but in the second (NYR type) and third (YRN type) reading frames. The extended RNA code type II, comprises all codons of the type RNY plus codons that arise from transversions of the RNA code in the first (YNY type) and third (RNR) nucleotide bases. In order to test if putative nucleotide sequences in the RNA World and in both extended RNA codes, share the same scaling and statistical properties to those encountered in current prokaryotes, we used the genomes of four Eubacteria and three Archaeas. For each prokaryote, we obtained their respective genomes obeying the RNA code or the extended RNA codes types I and II. In each case, we estimated the scaling properties of triplet sequences via a renormalization group approach, and we calculated the frequency distributions of distances for each codon. Remarkably, the scaling properties of the distance series of some codons from the RNA code and most codons from both extended RNA codes turned out to be identical or very close to the scaling properties of codons of the SGC. To test for the robustness of these results, we show, via computer simulation experiments, that random mutations of current genomes, at the rates of 10−10 per site per year during three billions of years, were not enough for destroying the observed patterns. Therefore, we conclude that most current prokaryotes may still contain relics of the primeval RNA World and that both extended RNA codes may well represent two plausible evolutionary paths between the RNA code and the current SGC. PMID:19183813
Protein structure and the sequential structure of mRNA: alpha-helix and beta-sheet signals at the nucleotide level.

PubMed

Brunak, S; Engelbrecht, J

1996-06-01

A direct comparison of experimentally determined protein structures and their corresponding protein coding mRNA sequences has been performed. We examine whether real world data support the hypothesis that clusters of rare codons correlate with the location of structural units in the resulting protein. The degeneracy of the genetic code allows for a biased selection of codons which may control the translational rate of the ribosome, and may thus in vivo have a catalyzing effect on the folding of the polypeptide chain. A complete search for GenBank nucleotide sequences coding for structural entries in the Brookhaven Protein Data Bank produced 719 protein chains with matching mRNA sequence, amino acid sequence, and secondary structure assignment. By neural network analysis, we found strong signals in mRNA sequence regions surrounding helices and sheets. These signals do not originate from the clustering of rare codons, but from the similarity of codons coding for very abundant amino acid residues at the N- and C-termini of helices and sheets. No correlation between the positioning of rare codons and the location of structural units was found. The mRNA signals were also compared with conserved nucleotide features of 16S-like ribosomal RNA sequences and related to mechanisms for maintaining the correct reading frame by the ribosome.
Novel coding, translation, and gene expression of a replicating covalently closed circular RNA of 220 nt

PubMed Central

AbouHaidar, Mounir Georges; Venkataraman, Srividhya; Golshani, Ashkan; Liu, Bolin; Ahmad, Tauqeer

2014-01-01

The highly structured (64% GC) covalently closed circular (CCC) RNA (220 nt) of the virusoid associated with rice yellow mottle virus codes for a 16-kDa highly basic protein using novel modalities for coding, translation, and gene expression. This CCC RNA is the smallest among all known viroids and virusoids and the only one that codes proteins. Its sequence possesses an internal ribosome entry site and is directly translated through two (or three) completely overlapping ORFs (shifting to a new reading frame at the end of each round). The initiation and termination codons overlap UGAUGA (underline highlights the initiation codon AUG within the combined initiation-termination sequence). Termination codons can be ignored to obtain larger read-through proteins. This circular RNA with no noncoding sequences is a unique natural supercompact “nanogenome.” PMID:25253891
Alignment-based and alignment-free methods converge with experimental data on amino acids coded by stop codons at split between nuclear and mitochondrial genetic codes.

PubMed

Seligmann, Hervé

2018-05-01

Genetic codes mainly evolve by reassigning punctuation codons, starts and stops. Previous analyses assuming that undefined amino acids translate stops showed greater divergence between nuclear and mitochondrial genetic codes. Here, three independent methods converge on which amino acids translated stops at split between nuclear and mitochondrial genetic codes: (a) alignment-free genetic code comparisons inserting different amino acids at stops; (b) alignment-based blast analyses of hypothetical peptides translated from non-coding mitochondrial sequences, inserting different amino acids at stops; (c) biases in amino acid insertions at stops in proteomic data. Hence short-term protein evolution models reconstruct long-term genetic code evolution. Mitochondria reassign stops to amino acids otherwise inserted at stops by codon-anticodon mismatches (near-cognate tRNAs). Hence dual function (translation termination and translation by codon-anticodon mismatch) precedes mitochondrial reassignments of stops to amino acids. Stop ambiguity increases coded information, compensates endocellular mitogenome reduction. Mitochondrial codon reassignments might prevent viral infections. Copyright © 2018 Elsevier B.V. All rights reserved.

Similarity of Escherichia coli propanediol oxidoreductase (fucO product) and an unusual alcohol dehydrogenase from Zymomonas mobilis and Saccharomyces cerevisiae

DOE Office of Scientific and Technical Information (OSTI.GOV)

Conway, T.; Ingram, L.O.

1989-07-01

The gene that encodes 1,2-propanediol oxidoreductase (fucO) from Escherichia coli was sequenced. The reading frame specified a protein of 383 amino acids (including the N-terminal methionine), with an aggregate molecular weight of 40,642. The induction of fucO transcription, which occurred in the presence of fucose, was confirmed by Northern blot analysis. In E. coli, the primary fucO transcript was approximately 2.1 kilobases in length. The 5{prime} end of the transcript began more than 0.7 kilobase upstream of the fucO start codon within or beyond the fucA gene. Propanediol oxidoreductase exhibited 41.7% identity with the iron-containing alcohol dehydrogenase II from Zymomonasmore » mobilis and 39.5% identity with ADH4 from Saccharomyces cerevisiae. These three proteins did not share homology with either short-chain or long-chain zinc-containing alcohol dehydrogenase enzymes. We propose that these three unusual alcohol dehydrogenases define a new family of enzymes.« less
Tissue specific expression of the retinoic acid receptor-beta 2: regulation by short open reading frames in the 5'-noncoding region

PubMed Central

1994-01-01

The 40-S subunit of eukaryotic ribosomes binds to the capped 5'-end of mRNA and scans for the first AUG in a favorable sequence context to initiate translation. Most eukaryotic mRNAs therefore have a short 5'- untranslated region (5'-UTR) and no AUGs upstream of the translational start site; features that seem to assure efficient translation. However, approximately 5-10% of all eukaryotic mRNAs, particularly those encoding for regulatory proteins, have complex leader sequences that seem to compromise translational initiation. The retinoic-acid- receptor-beta 2 (RAR beta 2) mRNA is such a transcript with a long (461 nucleotides) 5'-UTR that contains five, partially overlapping, upstream open reading frames (uORFs) that precede the major ORF. We have begun to investigate the function of this complex 5'-UTR in transgenic mice, by introducing mutations in the start/stop codons of the uORFs in RAR beta 2-lacZ reporter constructs. When we compared the expression patterns of mutant and wild-type constructs we found that these mutations affected expression of the downstream RAR beta 2-ORF, resulting in an altered regulation of RAR beta 2-lacZ expression in heart and brain. Other tissues were unaffected. RNA analysis of adult tissues demonstrated that the uORFs act at the level of translation; adult brains and hearts of transgenic mice carrying a construct with either the wild-type or a mutant UTR, had the same levels of mRNA, but only the mutant produced protein. Our study outlines an unexpected role for uORFs: control of tissue-specific and developmentally regulated gene expression. PMID:7962071
The complete mitochondrial genome of Setaria digitata (Nematoda: Filarioidea): Mitochondrial gene content, arrangement and composition compared with other nematodes.

PubMed

Yatawara, Lalani; Wickramasinghe, Susiji; Rajapakse, R P V J; Agatsuma, Takeshi

2010-09-01

In the present study, we determined the complete mitochondrial (mt) genome sequence (13,839bp) of parasitic nematode Setaria digitata and its structure and organization compared with Onchocerca volvulus, Dirofilaria immitis and Brugia malayi. The mt genome of S. digitata is slightly larger than the mt genomes of other filarial nematodes. S. digitata mt genome contains 36 genes (12 protein-coding genes, 22 transfer RNAs and 2 ribosomal RNAs) that are typically found in metazoans. This genome contains a high A+T (75.1%) content and low G+C content (24.9%). The mt gene order for S. digitata is the same as those for O. volvulus, D. immitis and B. malayi but it is distinctly different from other nematodes compared. The start codons inferred in the mt genome of S. digitata are TTT, ATT, TTG, ATG, GTT and ATA. Interestingly, the initiation codon TTT is unique to S. digitata mt genome and four protein-coding genes use this codon as a translation initiation codon. Five protein-coding genes use TAG as a stop codon whereas three genes use TAA and four genes use T as a termination codon. Out of 64 possible codons, only 57 are used for mitochondrial protein-coding genes of S. digitata. T-rich codons such as TTT (18.9%), GTT (7.9%), TTG (7.8%), TAT (7%), ATT (5.7%), TCT (4.8%) and TTA (4.1%) are used more frequently. This pattern of codon usage reflects the strong bias for T in the mt genome of S. digitata. In conclusion, the present investigation provides new molecular data for future studies of the comparative mitochondrial genomics and systematic of parasitic nematodes of socio-economic importance. 2010 Elsevier B.V. All rights reserved.
Complete mitochondrial genome of Germain's Peacock-Pheasant Polyplectron germaini (Aves, Galliformes, Phasianidae).

PubMed

Omeire, Destiny; Abdin, Shaunte; Brooks, Daniel M; Miranda, Hector C

2015-04-01

The Germain's Peacock-Pheasant Polyplectron germaini (Aves, Galliformes, Phasianidae) is classified as Near Threatened on the IUCN Red List. The complete mitochondrial genome of P. germaini is 16,699 bp, consisting of 13 protein-coding genes, 2 rRNA, 22 tRNA genes and 1 control region. All of the 13 protein-coding genes have ATG as start codon. Eight of the 13 protein-coding genes have TAA as stop codon.
An integrative strategy to identify the entire protein coding potential of prokaryotic genomes by proteogenomics.

PubMed

Omasits, Ulrich; Varadarajan, Adithi R; Schmid, Michael; Goetze, Sandra; Melidis, Damianos; Bourqui, Marc; Nikolayeva, Olga; Québatte, Maxime; Patrignani, Andrea; Dehio, Christoph; Frey, Juerg E; Robinson, Mark D; Wollscheid, Bernd; Ahrens, Christian H

2017-12-01

Accurate annotation of all protein-coding sequences (CDSs) is an essential prerequisite to fully exploit the rapidly growing repertoire of completely sequenced prokaryotic genomes. However, large discrepancies among the number of CDSs annotated by different resources, missed functional short open reading frames (sORFs), and overprediction of spurious ORFs represent serious limitations. Our strategy toward accurate and complete genome annotation consolidates CDSs from multiple reference annotation resources, ab initio gene prediction algorithms and in silico ORFs (a modified six-frame translation considering alternative start codons) in an integrated proteogenomics database (iPtgxDB) that covers the entire protein-coding potential of a prokaryotic genome. By extending the PeptideClassifier concept of unambiguous peptides for prokaryotes, close to 95% of the identifiable peptides imply one distinct protein, largely simplifying downstream analysis. Searching a comprehensive Bartonella henselae proteomics data set against such an iPtgxDB allowed us to unambiguously identify novel ORFs uniquely predicted by each resource, including lipoproteins, differentially expressed and membrane-localized proteins, novel start sites and wrongly annotated pseudogenes. Most novelties were confirmed by targeted, parallel reaction monitoring mass spectrometry, including unique ORFs and single amino acid variations (SAAVs) identified in a re-sequenced laboratory strain that are not present in its reference genome. We demonstrate the general applicability of our strategy for genomes with varying GC content and distinct taxonomic origin. We release iPtgxDBs for B. henselae , Bradyrhizobium diazoefficiens and Escherichia coli and the software to generate both proteogenomics search databases and integrated annotation files that can be viewed in a genome browser for any prokaryote. © 2017 Omasits et al.; Published by Cold Spring Harbor Laboratory Press.
Analyses of frameshifting at UUU-pyrimidine sites.

PubMed

Schwartz, R; Curran, J F

1997-05-15

Others have recently shown that the UUU phenylalanine codon is highly frameshift-prone in the 3'(rightward) direction at pyrimidine 3'contexts. Here, several approaches are used to analyze frameshifting at such sites. The four permutations of the UUU/C (phenylalanine) and CGG/U (arginine) codon pairs were examined because they vary greatly in their expected frameshifting tendencies. Furthermore, these synonymous sites allow direct tests of the idea that codon usage can control frameshifting. Frameshifting was measured for these dicodons embedded within each of two broader contexts: the Escherichia coli prfB (RF2 gene) programmed frameshift site and a 'normal' message site. The principal difference between these contexts is that the programmed frameshift contains a purine-rich sequence upstream of the slippery site that can base pair with the 3'end of 16 S rRNA (the anti-Shine-Dalgarno) to enhance frameshifting. In both contexts frameshift frequencies are highest if the slippery tRNAPhe is capable of stable base pairing in the shifted reading frame. This requirement is less stringent in the RF2 context, as if the Shine-Dalgarno interaction can help stabilize a quasi-stable rephased tRNA:message complex. It was previously shown that frameshifting in RF2 occurs more frequently if the codon 3'to the slippery site is read by a rare tRNA. Consistent with that earlier work, in the RF2 context frameshifting occurs substantially more frequently if the arginine codon is CGG, which is read by a rare tRNA. In contrast, in the 'normal' context frameshifting is only slightly greater at CGG than at CGU. It is suggested that the Shine-Dalgarno-like interaction elevates frameshifting specifically during the pause prior to translation of the second codon, which makes frameshifting exquisitely sensitive to the rate of translation of that codon. In both contexts frameshifting increases in a mutant strain that fails to modify tRNA base A37, which is 3'of the anticodon. Thus, those base modifications may limit frameshifting at UUU codons. Finally, statistical analyses show that UUU Ynn dicodons are extremely rare in E.coli genes that have highly biased codon usage.
Analyses of frameshifting at UUU-pyrimidine sites.

PubMed Central

Schwartz, R; Curran, J F

1997-01-01

Others have recently shown that the UUU phenylalanine codon is highly frameshift-prone in the 3'(rightward) direction at pyrimidine 3'contexts. Here, several approaches are used to analyze frameshifting at such sites. The four permutations of the UUU/C (phenylalanine) and CGG/U (arginine) codon pairs were examined because they vary greatly in their expected frameshifting tendencies. Furthermore, these synonymous sites allow direct tests of the idea that codon usage can control frameshifting. Frameshifting was measured for these dicodons embedded within each of two broader contexts: the Escherichia coli prfB (RF2 gene) programmed frameshift site and a 'normal' message site. The principal difference between these contexts is that the programmed frameshift contains a purine-rich sequence upstream of the slippery site that can base pair with the 3'end of 16 S rRNA (the anti-Shine-Dalgarno) to enhance frameshifting. In both contexts frameshift frequencies are highest if the slippery tRNAPhe is capable of stable base pairing in the shifted reading frame. This requirement is less stringent in the RF2 context, as if the Shine-Dalgarno interaction can help stabilize a quasi-stable rephased tRNA:message complex. It was previously shown that frameshifting in RF2 occurs more frequently if the codon 3'to the slippery site is read by a rare tRNA. Consistent with that earlier work, in the RF2 context frameshifting occurs substantially more frequently if the arginine codon is CGG, which is read by a rare tRNA. In contrast, in the 'normal' context frameshifting is only slightly greater at CGG than at CGU. It is suggested that the Shine-Dalgarno-like interaction elevates frameshifting specifically during the pause prior to translation of the second codon, which makes frameshifting exquisitely sensitive to the rate of translation of that codon. In both contexts frameshifting increases in a mutant strain that fails to modify tRNA base A37, which is 3'of the anticodon. Thus, those base modifications may limit frameshifting at UUU codons. Finally, statistical analyses show that UUU Ynn dicodons are extremely rare in E.coli genes that have highly biased codon usage. PMID:9115369
Cloning, characterization and sequence comparison of the gene coding for IMP dehydrogenase from Pyrococcus furiosus.

PubMed

Collart, F R; Osipiuk, J; Trent, J; Olsen, G J; Huberman, E

1996-10-03

We have cloned and characterized the gene encoding inosine monophosphate dehydrogenase (IMPDH) from Pyrococcus furiosus (Pf), a hyperthermophillic archeon. Sequence analysis of the Pf gene indicated an open reading frame specifying a protein of 485 amino acids (aa) with a calculated M(r) of 52900. Canonical Archaea promoter elements, Box A and Box B, are located -49 and -17 nucleotides (nt), respectively, upstream of the putative start codon. The sequence of the putative active-site region conforms to the IMPDH signature motif and contains a putative active-site cysteine. Phylogenetic relationships derived by using all available IMPDH sequences are consistent with trees developed for other molecules; they do not precisely resolve the history of Pf IMPDH but indicate a close similarity to bacterial IMPDH proteins. The phylogenetic analysis indicates that a gene duplication occurred prior to the division between rodents and humans, accounting for the Type I and II isoforms identified in mice and humans.
Neutropenia-associated ELANE mutations disrupting translation initiation produce novel neutrophil elastase isoforms

PubMed Central

Tidwell, Timothy; Wechsler, Jeremy; Nayak, Ramesh C.; Trump, Lisa; Salipante, Stephen J.; Cheng, Jerry C.; Donadieu, Jean; Glaubach, Taly; Corey, Seth J.; Grimes, H. Leighton; Lutzko, Carolyn; Cancelas, Jose A.

2014-01-01

Hereditary neutropenia is usually caused by heterozygous germline mutations in the ELANE gene encoding neutrophil elastase (NE). How mutations cause disease remains uncertain, but two hypotheses have been proposed. In one, ELANE mutations lead to mislocalization of NE. In the other, ELANE mutations disturb protein folding, inducing an unfolded protein response in the endoplasmic reticulum (ER). In this study, we describe new types of mutations that disrupt the translational start site. At first glance, they should block translation and are incompatible with either the mislocalization or misfolding hypotheses, which require mutant protein for pathogenicity. We find that start-site mutations, instead, force translation from downstream in-frame initiation codons, yielding amino-terminally truncated isoforms lacking ER-localizing (pre) and zymogen-maintaining (pro) sequences, yet retain essential catalytic residues. Patient-derived induced pluripotent stem cells recapitulate hematopoietic and molecular phenotypes. Expression of the amino-terminally deleted isoforms in vitro reduces myeloid cell clonogenic capacity. We define an internal ribosome entry site (IRES) within ELANE and demonstrate that adjacent mutations modulate IRES activity, independently of protein-coding sequence alterations. Some ELANE mutations, therefore, appear to cause neutropenia via the production of amino-terminally deleted NE isoforms rather than by altering the coding sequence of the full-length protein. PMID:24184683
Ribosome stalling and peptidyl-tRNA drop-off during translational delay at AGA codons

PubMed Central

Cruz-Vera, Luis Rogelio; Magos-Castro, Marco Antonio; Zamora-Romo, Efraín; Guarneros, Gabriel

2004-01-01

Minigenes encoding the peptide Met–Arg–Arg have been used to study the mechanism of toxicity of AGA codons proximal to the start codon or prior to the termination codon in bacteria. The codon sequences of the ‘mini-ORFs’ employed were initiator, combinations of AGA and CGA, and terminator. Both, AGA and CGA are low-usage Arg codons in ORFs of Escherichia coli but, whilst AGA is translated by the scarce tRNAArg4, CGA is recognized by the abundant tRNAArg2. Overexpression of minigenes harbouring AGA in the third position, next to a termination codon, was deleterious to the cell and led to the accumulation of peptidyl-tRNAArg4 and of the peptidyl-tRNA cognate to the preceding CGA or AGA Arg triplet. The minigenes carrying CGA in the third position were not toxic. Minigene-mediated toxicity and peptidyl-tRNA accumulation were suppressed by overproduction of tRNAArg4 but not by overproduction of peptidyl-tRNA hydrolase, an enzyme that is only active on substrates that have been released from the ribosome. Consistent with these findings, peptidyl-tRNAArg4 was identified to be mainly associated with ribosomes in a stand-by complex. These and previous results support the hypothesis that the primary mechanism of inhibition of protein synthesis by AGA triplets in pth+ cells involves sequestration of tRNAs as peptidyl-tRNA on the stalled ribosome. PMID:15317870
PRIMARY STRUCTURE OF THE P450 LANOSTEROL DEMETHYLASE GENE FROM SACCHAROMYCES CEREVISIAE

EPA Science Inventory

We have sequenced the structural gene and flanking regions for lanosterol 14 alpha-demethylase (14DM) from Saccharomyces cerevisiae. An open reading frame of 530 codons encodes a 60.7-kDa protein. When this gene is disrupted by integrative transformation, the resulting strain req...
Abolition of Peroxiredoxin-5 Mitochondrial Targeting during Canid Evolution

PubMed Central

Van der Eecken, Valérie; Clippe, André; Dekoninck, Sophie; Goemaere, Julie; Walbrecq, Geoffroy; Van Veldhoven, Paul P.; Knoops, Bernard

2013-01-01

In human, the subcellular targeting of peroxiredoxin-5 (PRDX5), a thioredoxin peroxidase, is dependent on the use of multiple alternative transcription start sites and two alternative in-frame translation initiation sites, which determine whether or not the region encoding a mitochondrial targeting sequence (MTS) is translated. In the present study, the abolition of PRDX5 mitochondrial targeting in dog is highlighted and the molecular mechanism underlying the loss of mitochondrial PRDX5 during evolution is examined. Here, we show that the absence of mitochondrial PRDX5 is generalized among the extant canids and that the first events leading to PRDX5 MTS abolition in canids involve a mutation in the more 5′ translation initiation codon as well as the appearance of a STOP codon. Furthermore, we found that PRDX5 MTS functionality is maintained in giant panda and northern elephant seal, which are phylogenetically closely related to canids. Also, the functional consequences of the restoration of mitochondrial PRDX5 in dog Madin-Darby canine kidney (MDCK) cells were investigated. The restoration of PRDX5 mitochondrial targeting in MDCK cells, instead of protecting, provokes deleterious effects following peroxide exposure independently of its peroxidase activity, indicating that mitochondrial PRDX5 gains cytotoxic properties under acute oxidative stress in MDCK cells. Altogether our results show that, although mitochondrial PRDX5 cytoprotective function against oxidative stress has been clearly demonstrated in human and rodents, PRDX5 targeting to mitochondria has been evolutionary lost in canids. Moreover, restoration of mitochondrial PRDX5 in dog MDCK cells, instead of conferring protection against peroxide exposure, makes them more vulnerable. PMID:24023783
Translational Redefinition of UGA Codons Is Regulated by Selenium Availability*

PubMed Central

Howard, Michael T.; Carlson, Bradley A.; Anderson, Christine B.; Hatfield, Dolph L.

2013-01-01

Incorporation of selenium into ∼25 mammalian selenoproteins occurs by translational recoding whereby in-frame UGA codons are redefined to encode the selenium containing amino acid, selenocysteine (Sec). Here we applied ribosome profiling to examine the effect of dietary selenium levels on the translational mechanisms controlling selenoprotein synthesis in mouse liver. Dietary selenium levels were shown to control gene-specific selenoprotein expression primarily at the translation level by differential regulation of UGA redefinition and Sec incorporation efficiency, although effects on translation initiation and mRNA abundance were also observed. Direct evidence is presented that increasing dietary selenium causes a vast increase in ribosome density downstream of UGA-Sec codons for a subset of selenoprotein mRNAs and that the selenium-dependent effects on Sec incorporation efficiency are mediated in part by the degree of Sec-tRNA[Ser]Sec Um34 methylation. Furthermore, we find evidence for translation in the 5′-UTRs for a subset of selenoproteins and for ribosome pausing near the UGA-Sec codon in those mRNAs encoding the selenoproteins most affected by selenium availability. These data illustrate how dietary levels of the trace element selenium can alter the readout of the genetic code to affect the expression of an entire class of proteins. PMID:23696641
Structural determinants of an internal ribosome entry site that direct translational reading frame selection

PubMed Central

Ren, Qian; Au, Hilda H.T.; Wang, Qing S.; Lee, Seonghoon; Jan, Eric

2014-01-01

The dicistrovirus intergenic internal ribosome entry site (IGR IRES) directly recruits the ribosome and initiates translation using a non-AUG codon. A subset of IGR IRESs initiates translation in either of two overlapping open reading frames (ORFs), resulting in expression of the 0 frame viral structural polyprotein and an overlapping +1 frame ORFx. A U–G base pair adjacent to the anticodon-like pseudoknot of the IRES directs +1 frame translation. Here, we show that the U-G base pair is not absolutely required for +1 frame translation. Extensive mutagenesis demonstrates that 0 and +1 frame translation can be uncoupled. Ribonucleic acid (RNA) structural probing analyses reveal that the mutant IRESs adopt distinct conformations. Toeprinting analysis suggests that the reading frame is selected at a step downstream of ribosome assembly. We propose a model whereby the IRES adopts conformations to occlude the 0 frame aminoacyl-tRNA thereby allowing delivery of the +1 frame aminoacyl-tRNA to the A site to initiate translation of ORFx. This study provides a new paradigm for programmed recoding mechanisms that increase the coding capacity of a viral genome. PMID:25038250
The N-terminal region of the Plantago asiatica mosaic virus coat protein is required for cell-to-cell movement but is dispensable for virion assembly.

PubMed

Ozeki, Johji; Hashimoto, Masayoshi; Komatsu, Ken; Maejima, Kensaku; Himeno, Misako; Senshu, Hiroko; Kawanishi, Takeshi; Kagiwada, Satoshi; Yamaji, Yasuyuki; Namba, Shigetou

2009-06-01

Potexvirus cell-to-cell movement requires coat protein (CP) and movement proteins. In this study, mutations in two conserved in-frame AUG codons in the 5' region of the CP open reading frame of Plantago asiatica mosaic virus (PlAMV) were introduced, and virus accumulation of these mutants was analyzed in inoculated and upper noninoculated leaves. When CP was translated only from the second AUG codon, virus accumulation in inoculated leaves was lower than that of wild-type PlAMV, and the viral spread was impaired. Trans-complementation analysis showed that the leucine residue at the third position (Leu-3) of CP is important for cell-to-cell movement of PlAMV. The 14-amino-acid N-terminal region of CP was dispensable for virion formation. Immunoprecipitation assays conducted with an anti-TGBp1 antibody indicated that PlAMV CP interacts with TGBp1 in vivo and that this interaction is not affected by alanine substitution at Leu-3. These results support the concept that the N-terminal region of potexvirus CP can be separated into two distinct functional domains.
The complete mitochondrial genome of Gryllotalpa unispina Saussure, 1874 (Orthoptera: Gryllotalpoidea: Gryllotalpidae).

PubMed

Zhang, Yulong; Shao, Dandan; Cai, Miao; Yin, Hong; Zhang, Daochuan

2016-01-01

The complete mitochondrial genome of Gryllotalpa unispina was 15,513 bp in length and contained 70.9% AT. All G. unispina protein-coding sequences except for the nad2 started with a typical ATN codon. The usual termination codons (TAA) and incomplete stop codons (T) were found from 13 protein-coding genes. All tRNA genes were folded into the typical cloverleaf secondary structure, except trnS(AGN) lacking the dihydrouridine arm. The sizes of the large and small ribosomal RNA genes were 1245 and 725 bp, respectively. The A + T-rich region was 917 bp in length with 76.8%. The orientation and gene order of the G. unispina mitogenome were identical to the G. orientalis and G. pluvialis, there was no phenomenon of "DK rearrangement" which has been widely reported in Caelifera.
alpha-Tubulin of Histriculus cavicola (Ciliophora; Hypotrichea).

PubMed

Pérez-Romero, P; Villalobo, E; Díaz-Ramos, C; Calvo, P; Santos-Rosa, F; Torres, A

1997-03-01

An alpha-tubulin gene fragment amplified by PCR from the hypotrichous ciliate Histriculus cavicola has been sequenced. This fragment, 1,182 bp long, contains an in-frame "stop" codon (UAA), which in other hypotrichous species codes for a glutamine residue. The comparison of the alpha-tubulin genes from several ciliates classes have revealed amino acid positions which could serve to distinguish these taxonomic groups.
The complete mitochondrial genome of the diamondback moth, Plutella xylostella (Lepidoptera: Plutellidae).

PubMed

Dai, Li-Shang; Zhu, Bao-Jian; Qian, Cen; Zhang, Cong-Fen; Li, Jun; Wang, Lei; Wei, Guo-Qing; Liu, Chao-Liang

2016-01-01

The complete mitochondrial genome (mitogenome) of Plutella xylostella (Lepidoptera: Plutellidae) was determined (GenBank accession No. KM023645). The length of this mitogenome is 16,014 bp with 13 protein-coding genes (PCGs), 2 rRNA genes, 22 tRNA genes and an A + T-rich region. It presents the typical gene organization and order for completely sequenced lepidopteran mitogenomes. The nucleotide composition of the genome is highly A + T biased, accounting for 81.48%, with a slightly positive AT skewness (0.005). All PCGs are initiated by typical ATN codons, except for the gene cox1, which uses CGA as its start codon. Some PCGs harbor TA (nad5) or incomplete termination codon T (cox1, cox2, nad2 and nad4), while others use TAA as their termination codons. The A + T-rich region is located between rrnS and trnM with a length of 888 bp.
A CGMMV genome-replicon vector with partial sequences of coat protein gene efficiently expresses GFP in Nicotiana benthamiana.

PubMed

Jailani, A Abdul Kader; Solanki, Vikas; Roy, Anirban; Sivasudha, T; Mandal, Bikash

2017-04-02

A highly infectious clone of Cucumber green mottle mosaic virus (CGMMV), a cucurbit-infecting tobamovirus was utilized for designing of gene expression vectors. Two versions of vector were examined for their efficacy in expressing the green fluorescent protein (GFP) in Nicotiana benthamiana. When the GFP gene was inserted at the stop codon of coat protein (CP) gene of the CGMMV genome without any read-through codon, systemic expression of GFP, as well as virion formation and systemic symptoms expression were obtained in N. benthamiana. The qRT-PCR analysis showed 23 fold increase of GFP over actin at 10days post inoculation (dpi), which increased to 45 fold at 14dpi and thereafter the GFP expression was significantly declined. Further, we show that when the most of the CP sequence is deleted retaining only the first 105 nucleotides, the shortened vector containing GFP in frame of original CP open reading frame (ORF) resulted in 234 fold increase of GFP expression over actin at 5dpi in N. benthamiana without the formation of virions and disease symptoms. Our study demonstrated that a simple manipulation of CP gene in the CGMMV genome while preserving the translational frame of CP resulted in developing a virus-free, rapid and efficient foreign protein expression system in the plant. The CGMMV based vectors developed in this study may be potentially useful for the production of edible vaccines in cucurbits. Copyright © 2017 Elsevier B.V. All rights reserved.
Multiple copies of a bile acid-inducible gene in Eubacterium sp. strain VPI 12708.

PubMed Central

Gopal-Srivastava, R; Mallonee, D H; White, W B; Hylemon, P B

1990-01-01

Eubacterium sp. strain VPI 12708 is an anaerobic intestinal bacterium which possesses inducible bile acid 7-dehydroxylation activity. Several new polypeptides are produced in this strain following induction with cholic acid. Genes coding for two copies of a bile acid-inducible 27,000-dalton polypeptide (baiA1 and baiA2) have been previously cloned and sequenced. We now report on a gene coding for a third copy of this 27,000-dalton polypeptide (baiA3). The baiA3 gene has been cloned in lambda DASH on an 11.2-kilobase DNA fragment from a partial Sau3A digest of the Eubacterium DNA. DNA sequence analysis of the baiA3 gene revealed 100% homology with the baiA1 gene within the coding region of the 27,000-dalton polypeptides. The baiA2 gene shares 81% sequence identity with the other two genes at the nucleotide level. The flanking nucleotide sequences associated with the baiA1 and baiA3 genes are identical for 930 bases in the 5' direction from the initiation codon and for at least 325 bases in the 3' direction from the stop codon, including the putative promoter regions for the genes. An additional open reading frame (occupying from 621 to 648 bases, depending on the correct start codon) was found in the identical 5' regions associated with the baiA1 and baiA3 clones. The 5' sequence 930 bases upstream from the baiA1 and baiA3 genes was totally divergent. The baiA2 gene, which is part of a large bile acid-inducible operon, showed no homology with the other two genes either in the 5' or 3' direction from the polypeptide coding region, except for a 15-base-pair presumed ribosome-binding site in the 5' region. These studies strongly suggest that a gene duplication (baiA1 and baiA3) has occurred and is stably maintained in this bacterium. Images PMID:2376563

Homology between DNA polymerases of poxviruses, herpesviruses, and adenoviruses: nucleotide sequence of the vaccinia virus DNA polymerase gene.

PubMed Central

Earl, P L; Jones, E V; Moss, B

1986-01-01

A 5400-base-pair segment of the vaccinia virus genome was sequenced and an open reading frame of 938 codons was found precisely where the DNA polymerase had been mapped by transfer of a phosphonoacetate-resistance marker. A single nucleotide substitution changing glycine at position 347 to aspartic acid accounts for the drug resistance of the mutant vaccinia virus. The 5' end of the DNA polymerase mRNA was located 80 base pairs before the methionine codon initiating the open reading frame. Correspondence between the predicted Mr 108,577 polypeptide and the 110,000 purified enzyme indicates that little or no proteolytic processing occurs. Extensive homology, extending over 435 amino acids, was found upon comparing the DNA polymerase of vaccinia virus and DNA polymerase of Epstein-Barr virus. A highly conserved sequence of 14 amino acids in the carboxyl-terminal regions of the above DNA polymerases is also present at a similar location in adenovirus DNA polymerase. This structure, which is predicted to form a turn flanked by beta-pleated sheets, may form part of an essential binding or catalytic site that accounts for its presence in DNA polymerases of poxviruses, herpesviruses, and adenoviruses. Images PMID:3012524
Expression of Human Hemojuvelin (HJV) Is Tightly Regulated by Two Upstream Open Reading Frames in HJV mRNA That Respond to Iron Overload in Hepatic Cells

PubMed Central

Onofre, Cláudia; Tomé, Filipa; Barbosa, Cristina; Silva, Ana Luísa

2015-01-01

The gene encoding human hemojuvelin (HJV) is one of the genes that, when mutated, can cause juvenile hemochromatosis, an early-onset inherited disorder associated with iron overload. The 5′ untranslated region of the human HJV mRNA has two upstream open reading frames (uORFs), with 28 and 19 codons formed by two upstream AUGs (uAUGs) sharing the same in-frame stop codon. Here we show that these uORFs decrease the translational efficiency of the downstream main ORF in HeLa and HepG2 cells. Indeed, ribosomal access to the main AUG is conditioned by the strong uAUG context, which results in the first uORF being translated most frequently. The reach of the main ORF is then achieved by ribosomes that resume scanning after uORF translation. Furthermore, the amino acid sequences of the uORF-encoded peptides also reinforce the translational repression of the main ORF. Interestingly, when iron levels increase, translational repression is relieved specifically in hepatic cells. The upregulation of protein levels occurs along with phosphorylation of the eukaryotic initiation factor 2α. Nevertheless, our results support a model in which the increasing recognition of the main AUG is mediated by a tissue-specific factor that promotes uORF bypass. These results support a tight HJV translational regulation involved in iron homeostasis. PMID:25666510
Translation initiation at an upstream CUG codon regulates the expression of Hibiscus chlorotic ringspot virus coat protein.

PubMed

Koh, Dora Chin-Yen; Wang, Xiaoxing; Wong, Sek-Man; Liu, D X

2006-12-01

Viruses depend heavily on host cells for replication and exploit the host translation machinery for its gene expression using various unorthodox translation mechanisms. According to the conventional scanning model, only the 5'-proximal gene in the viral RNA is accessible to the ribosomes whereas other genes are silent. In this study, we use a model plant RNA virus, Hibiscus chlorotic ringspot virus (HCRSV), to investigate various translation mechanisms involved in regulation of the expression of internal genes. The 3'-end 1.2kb region of HCRSV genomic and subgenomic RNAs were shown to encode four polypeptides of 38, 27, 25 and 22.5kDa. Mutagenesis studies revealed that a CUG codon ((2570)CUG) is the initiation codon for p27, the longest of the three co-C-terminal products (p27, p25 and p22.5), and translation of p25 and p22.5 was initiated at (2603)AUG and (2666)AUG, respectively. Translation initiation of the p27 expression at the (2570)CUG codon regulates the expression of p38, the viral coat protein through a leaky scanning mechanism and mutational analysis of an upstream open reading frame (ORF) demonstrated that initiation of the p27 expression at this CUG codon (instead of an AUG) may play a role in maintaining the ratio of p27 and p38. In addition, a previously identified internal ribosome entry site was shown to control the expression of p27 and p38 in the subgenomic RNA 2.
Single nucleotide polymorphisms of Helicobacter pylori dupA that lead to premature stop codons.

PubMed

Moura, Sílvia B; Costa, Rafaella F A; Anacleto, Charles; Rocha, Gifone A; Rocha, Andreia M C; Queiroz, Dulciene M M

2012-06-01

The detection of the putative disease-specific Helicobacter pylori marker duodenal ulcer promoting gene A (dupA) is currently based on PCR detection of jhp0917 and jhp0918 that form the gene. However, mutations that lead to premature stop codons that split off the dupA leading to truncated products cannot be evaluated by PCR. We directly sequence the complete dupA of 75 dupA-positive strains of H. pylori isolated from patients with gastritis (n = 26), duodenal ulcer (n = 29), and gastric carcinoma (n = 20), to search for frame-shifting mutations that lead to stop codon. Thirty-four strains had single nucleotide mutations in dupA that lead to premature stop codon creating smaller products than the predicted 1839 bp product and, for this reason, were considered as dupA-negative. Intact dupA was more frequently observed in strains isolated from duodenal ulcer patients (65.5%) than in patients with gastritis only (46.2%) or with gastric carcinoma (50%). In logistic analysis, the presence of the intact dupA independently associated with duodenal ulcer (OR = 5.06; 95% CI = 1.22-20.96, p = .02). We propose the primer walking methodology as a simple technique to sequence the gene. When we considered as dupA-positive only those strains that carry dupA gene without premature stop codons, the gene was associated with duodenal ulcer and, therefore, can be used as a marker for this disease in our population. © 2012 Blackwell Publishing Ltd.
Expression of different functional isoforms in haematopoiesis.

PubMed

Grech, Godfrey; Pollacco, Joel; Portelli, Mark; Sacco, Keith; Baldacchino, Shawn; Grixti, Justine; Saliba, Christian

2014-01-01

Haematopoiesis is a complex process regulated at various levels facilitating rapid responses to external factors including stress, modulation of lineage commitment and terminal differentiation of progenitors. Although the transcription program determines the RNA pool of a cell, various mRNA strands can be obtained from the same template, giving rise to multiple protein isoforms. The majority of variants and isoforms co-occur in normal haematopoietic cells or are differentially expressed at various maturity stages of progenitor maturation and cellular differentiation within the same lineage or across lineages. Genetic aberrations or specific cellular states result in the predominant expression of abnormal isoforms leading to deregulation and disease. The presence of upstream open reading frames (uORF) in 5' untranslated regions (UTRs) of a transcript, couples the utilization of start codons with the cellular status and availability of translation initiation factors (eIFs). In addition, tissue-specific and cell lineage-specific alternative promoter use, regulates several transcription factors producing transcript variants with variable 5' exons. In this review, we propose to give a detailed account of the differential isoform formation, causing haematological malignancies.
Nucleotide sequence of the phosphoglycerate kinase gene from the extreme thermophile Thermus thermophilus. Comparison of the deduced amino acid sequence with that of the mesophilic yeast phosphoglycerate kinase.

PubMed Central

Bowen, D; Littlechild, J A; Fothergill, J E; Watson, H C; Hall, L

1988-01-01

Using oligonucleotide probes derived from amino acid sequencing information, the structural gene for phosphoglycerate kinase from the extreme thermophile, Thermus thermophilus, was cloned in Escherichia coli and its complete nucleotide sequence determined. The gene consists of an open reading frame corresponding to a protein of 390 amino acid residues (calculated Mr 41,791) with an extreme bias for G or C (93.1%) in the codon third base position. Comparison of the deduced amino acid sequence with that of the corresponding mesophilic yeast enzyme indicated a number of significant differences. These are discussed in terms of the unusual codon bias and their possible role in enhanced protein thermal stability. Images Fig. 1. PMID:3052437
Rhesus Monkey Rhadinovirus ORF57 Induces gH and gL Glycoprotein Expression through Posttranscriptional Accumulation of Target mRNAs ▿

PubMed Central

Shin, Young C.; Desrosiers, Ronald C.

2011-01-01

Open reading frame 57 (ORF57) of gamma-2 herpesviruses is a key regulator of viral gene expression. It has been reported to enhance the expression of viral genes by transcriptional, posttranscriptional, or translational activation mechanisms. Previously we have shown that the expression of gH and gL of rhesus monkey rhadinovirus (RRV), a close relative of the human Kaposi's sarcoma-associated herpesvirus (KSHV), could be dramatically rescued by codon optimization as well as by ORF57 coexpression (J. P. Bilello, J. S. Morgan, and R. C. Desrosiers, J. Virol. 82:7231–7237, 2008). We show here that ORF57 coexpression and codon optimization had similar effects, except that the rescue of expression by codon optimization was temporally delayed relative to that of ORF57 coexpression. The transfection of gL mRNA directly into cells with or without ORF57 coexpression and with or without codon optimization recapitulated the effects of these modes of induction on transfected DNA. These findings suggested an important role for the enhancement of mRNA stability and/or the translation of mRNA for these very different modes of induced expression. This conclusion was confirmed by several different measures of gH and gL mRNA stability and accumulation with or without ORF57 coexpression and with or without codon optimization. Our results indicate that RRV gH and gL expression is severely limited by the stability of the mRNA and that ORF57 coexpression and codon optimization independently induce gH and gL expression principally by allowing accumulation and translation of these mRNAs. PMID:21613403
Structure and evolution of the mitochondrial genome of Exorista sorbillans: the Tachinidae (Diptera: Calyptratae) perspective.

PubMed

Shao, Yuan-jun; Hu, Xian-qiong; Peng, Guang-da; Wang, Rui-xian; Gao, Rui-na; Lin, Chao; Shen, Wei-de; Li, Rui; Li, Bing

2012-12-01

The first complete mitochondrial genome (mitogenome) of Tachinidae Exorista sorbillans (Diptera) is sequenced by PCR-based approach. The circular mitogenome is 14,960 bp long and has the representative mitochondrial gene (mt gene) organization and order of Diptera. All protein-coding sequences are initiated with ATN codon; however, the only exception is Cox I gene, which has a 4-bp ATCG putative start codon. Ten of the thirteen protein-coding genes have a complete termination codon (TAA), but the rest are seated on the H strand with incomplete codons. The mitogenome of E. sorbillans is biased toward A+T content at 78.4 %, and the strand-specific bias is in reflection of the third codon positions of mt genes, and their T/C ratios as strand indictor are higher on the H strand more than those on the L strand pointing at any strain of seven Diptera flies. The length of the A+T-rich region of E. sorbillans is 106 bp, including a tandem triple copies of a13-bp fragment. Compared to Haematobia irritans, E. sorbillans holds distant relationship with Drosophila. Phylogenetic topologies based on the amino acid sequences, supporting that E. sorbillans (Tachinidae) is clustered with strains of Calliphoridae and Oestridae, and superfamily Oestroidea are polyphyletic groups with Muscidae in a clade.
PRIMARY STRUCTURE OF THE CYTOCHROME P450 LANOSTEROL 14A-DEMETHYLASE GENE FROM CANDIDA TROPICALIS

EPA Science Inventory

We report the nucleotide sequence of the gene and flanking DNA for the cytochrome P450 lanosterol 14 alpha-demethylase (14DM) from the yeast Candida tropicalis ATCC750. An open reading frame (ORF) of 528 codons encoding a 60.9-kD protein is identified. This ORF includes a charact...
Functional Genomics Using the Saccharomyces cerevisiae Yeast Deletion Collections.

PubMed

Nislow, Corey; Wong, Lai Hong; Lee, Amy Huei-Yi; Giaever, Guri

2016-09-01

Constructed by a consortium of 16 laboratories, the Saccharomyces genome-wide deletion collections have, for the past decade, provided a powerful, rapid, and inexpensive approach for functional profiling of the yeast genome. Loss-of-function deletion mutants were systematically created using a polymerase chain reaction (PCR)-based gene deletion strategy to generate a start-to-stop codon replacement of each open reading frame by homologous recombination. Each strain carries two molecular barcodes that serve as unique strain identifiers, enabling their growth to be analyzed in parallel and the fitness contribution of each gene to be quantitatively assessed by hybridization to high-density oligonucleotide arrays or through the use of next-generation sequencing technologies. Functional profiling of the deletion collections, using either strain-by-strain or parallel assays, provides an unbiased approach to systematically survey the yeast genome. The Saccharomyces yeast deletion collections have proved immensely powerful in contributing to the understanding of gene function, including functional relationships between genes and genetic pathways in response to diverse genetic and environmental perturbations. © 2016 Cold Spring Harbor Laboratory Press.
Fatal mitochondrial encephalopathy caused by fumarase deficiency: A molecular-genetic study

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gellera, C.; Cavadini, P.; Baratta, S.

Fumarase deficiency is a rare autosomal recessive disorder of the citric acid cycle resulting in severe organic aciduria and encephalopathy. Mammalian cells contain two fumarase isoenzymes, one mitochondrial and one cytosolic. In rat, the two proteins are encoded by the same gene and are synthesized by alternative initiation of translation at two in-phase AUG codons. One single fumarase gene locus has been identified on human chromosome 1. In most of the patients so far described, the activities of both isozymes are severely affected, suggesting that mutations within a single gene may underlie the disease. Here, we report the molecular studymore » of fumarase deficiency in a patient exhibiting compound heterozygosity for two different allelic mutations affecting the amino acid composition of both isoforms. The proband, an Italian boy of nonconsanguineous parents, died at 7 months of age of a progressive encephalopathy. Immunoblot demonstrated absence of cross-reacting material in both cytosolic and mitochondrial fraction of all tissues examined. Molecular analysis of the patient`s fumarase cDNA amplified by RT-PCR showed the presence of two mutations affecting the amino acid composition of both isoforms, a missense mutation resulting in the nonconservative amino acid substitution at codon 190 (Arg190Cys) and an amino acid in-frame insertion at codon 434 (Lys434ins). SSCP analysis of genomic PCR fragments encompassing the mutations demonstrated that the patient was heterozygous for both mutations, having inherited the Arg-to-Cys substitution from the father and the in-frame insertion from the mother. Finally, the effects of the mutations on enzyme function were investigated by expressing both normal and mutated fumarase cDNAs in a fumarase-deficient ({delta}FUM1) S. cerevisiae strain.« less
Unit-length line-1 transcripts in human teratocarcinoma cells.

PubMed Central

Skowronski, J; Fanning, T G; Singer, M F

1988-01-01

We have characterized the approximately 6.5-kilobase cytoplasmic poly(A)+ Line-1 (L1) RNA present in a human teratocarcinoma cell line, NTera2D1, by primer extension and by analysis of cloned cDNAs. The bulk of the RNA begins (5' end) at the residue previously identified as the 5' terminus of the longest known primate genomic L1 elements, presumed to represent "unit" length. Several of the cDNA clones are close to 6 kilobase pairs, that is, close to full length. The partial sequences of 18 cDNA clones and full sequence of one (5,975 base pairs) indicate that many different genomic L1 elements contribute transcripts to the 6.5-kilobase cytoplasmic poly(A)+ RNA in NTera2D1 cells because no 2 of the 19 cDNAs analyzed had identical sequences. The transcribed elements appear to represent a subset of the total genomic L1s, a subset that has a characteristic consensus sequence in the 3' noncoding region and a high degree of sequence conservation throughout. Two open reading frames (ORFs) of 1,122 (ORF1) and 3,852 (ORF2) bases, flanked by about 800 and 200 bases of sequence at the 5' and 3' ends, respectively, can be identified in the cDNAs. Both ORFs are in the same frame, and they are separated by 33 bases bracketed by two conserved in-frame stop codons. ORF 2 is interrupted by at least one randomly positioned stop codon in the majority of the cDNAs. The data support proposals suggesting that the human L1 family includes one or more functional genes as well as an extraordinarily large number of pseudogenes whose ORFs are broken by stop codons. The cDNA structures suggest that both genes and pseudogenes are transcribed. At least one of the cDNAs (cD11), which was sequenced in its entirety, could, in principle, represent an mRNA for production of the ORF1 polypeptide. The similarity of mammalian L1s to several recently described invertebrate movable elements defines a new widely distributed class of elements which we term class II retrotransposons. Images PMID:2454389
Construction and Validation of the Rhodobacter sphaeroides 2.4.1 DNA Microarray: Transcriptome Flexibility at Diverse Growth Modes

PubMed Central

Pappas, Christopher T.; Sram, Jakub; Moskvin, Oleg V.; Ivanov, Pavel S.; Mackenzie, R. Christopher; Choudhary, Madhusudan; Land, Miriam L.; Larimer, Frank W.; Kaplan, Samuel; Gomelsky, Mark

2004-01-01

A high-density oligonucleotide DNA microarray, a genechip, representing the 4.6-Mb genome of the facultative phototrophic proteobacterium, Rhodobacter sphaeroides 2.4.1, was custom-designed and manufactured by Affymetrix, Santa Clara, Calif. The genechip contains probe sets for 4,292 open reading frames (ORFs), 47 rRNA and tRNA genes, and 394 intergenic regions. The probe set sequences were derived from the genome annotation generated by Oak Ridge National Laboratory after extensive revision, which was based primarily upon codon usage characteristic of this GC-rich bacterium. As a result of the revision, numerous missing ORFs were uncovered, nonexistent ORFs were deleted, and misidentified start codons were corrected. To evaluate R. sphaeroides transcriptome flexibility, expression profiles for three diverse growth modes—aerobic respiration, anaerobic respiration in the dark, and anaerobic photosynthesis—were generated. Expression levels of one-fifth to one-third of the R. sphaeroides ORFs were significantly different in cells under any two growth modes. Pathways involved in energy generation and redox balance maintenance under three growth modes were reconstructed. Expression patterns of genes involved in these pathways mirrored known functional changes, suggesting that massive changes in gene expression are the major means used by R. sphaeroides in adaptation to diverse conditions. Differential expression was observed for genes encoding putative new participants in these pathways (additional photosystem genes, duplicate NADH dehydrogenase, ATP synthases), whose functionality has yet to be investigated. The DNA microarray data correlated well with data derived from quantitative reverse transcription-PCR, as well as with data from the literature, thus validating the R. sphaeroides genechip as a powerful and reliable tool for studying unprecedented metabolic versatility of this bacterium. PMID:15231807
Analysis of the regulatory region of the protease III (ptr) gene of Escherichia coli K-12.

PubMed

Claverie-Martin, F; Diaz-Torres, M R; Kushner, S R

1987-01-01

The ptr gene of Escherichia coli encodes protease III (Mr 110,000) and a 50-kDa polypeptide, both of which are found in the periplasmic space. The gene is physically located between the recC and recB loci on the E. coli chromosome. The nucleotide sequence of a 1167-bp EcoRV-ClaI fragment of chromosomal DNA containing the promoter region and 885 bp of the ptr coding sequence has been determined. S1 nuclease mapping analysis showed that the major 5' end of the ptr mRNA was localized 127 bp upstream from the ATG start codon. The open reading frame (ORF), preceded by a Shine-Dalgarno sequence, extends to the end of the sequenced DNA. Downstream from the -35 and -10 regions is a sequence that strongly fits the consensus sequence of known nitrogen-regulated promoters. A signal peptide of 23 amino acids residues is present at the N terminus of the derived amino acid sequence. The cleavage site as well as the ORF were confirmed by sequencing the N terminus of mature protease III.
The P1N-PISPO trans-Frame Gene of Sweet Potato Feathery Mottle Potyvirus Is Produced during Virus Infection and Functions as an RNA Silencing Suppressor

PubMed Central

Mingot, Ares; Valli, Adrián; Rodamilans, Bernardo; San León, David; Baulcombe, David C.; García, Juan Antonio

2016-01-01

ABSTRACT The positive-sense RNA genome of Sweet potato feathery mottle virus (SPFMV) (genus Potyvirus, family Potyviridae) contains a large open reading frame (ORF) of 3,494 codons translatable as a polyprotein and two embedded shorter ORFs in the −1 frame: PISPO, of 230 codons, and PIPO, of 66 codons, located in the P1 and P3 regions, respectively. PISPO is specific to some sweet potato-infecting potyviruses, while PIPO is present in all potyvirids. In SPFMV these two extra ORFs are preceded by conserved G2A6 motifs. We have shown recently that a polymerase slippage mechanism at these sites could produce transcripts bringing these ORFs in frame with the upstream polyprotein, thus leading to P1N-PISPO and P3N-PIPO products (B. Rodamilans, A. Valli, A. Mingot, D. San Leon, D. B. Baulcombe, J. J. Lopez-Moya, and J.A. Garcia, J Virol 89:6965–6967, 2015, doi:10.1128/JVI.00337-15). Here, we demonstrate by liquid chromatography coupled to mass spectrometry that both P1 and P1N-PISPO are produced during viral infection and coexist in SPFMV-infected Ipomoea batatas plants. Interestingly, transient expression of SPFMV gene products coagroinfiltrated with a reporter gene in Nicotiana benthamiana revealed that P1N-PISPO acts as an RNA silencing suppressor, a role normally associated with HCPro in other potyviruses. Moreover, mutation of WG/GW motifs present in P1N-PISPO abolished its silencing suppression activity, suggesting that the function might require interaction with Argonaute components of the silencing machinery, as was shown for other viral suppressors. Altogether, our results reveal a further layer of complexity of the RNA silencing suppression activity within the Potyviridae family. IMPORTANCE Gene products of potyviruses include P1, HCPro, P3, 6K1, CI, 6K2, VPg/NIaPro, NIb, and CP, all derived from the proteolytic processing of a large polyprotein, and an additional P3N-PIPO product, with the PIPO segment encoded in a different frame within the P3 cistron. In sweet potato feathery mottle virus (SPFMV), another out-of-frame element (PISPO) was predicted within the P1 region. We have shown recently that a polymerase slippage mechanism can generate the transcript variants with extra nucleotides that could be translated into P1N-PISPO and P3N-PIPO. Now, we demonstrate by mass spectrometry analysis that P1N-PISPO is indeed produced in SPFMV-infected plants, in addition to P1. Interestingly, while in other potyviruses the suppressor of RNA silencing is HCPro, we show here that P1N-PISPO exhibited this activity in SPFMV, revealing how the complexity of the gene content could contribute to supply this essential function in members of the Potyviridae family. PMID:26792740
Identification of seven haplotypes of the caprine PrP gene at codons 127, 142, 154, 211, 222 and 240 in French Alpine and Saanen breeds and their association with classical scrapie.

PubMed

Barillet, F; Mariat, D; Amigues, Y; Faugeras, R; Caillat, H; Moazami-Goudarzi, K; Rupp, R; Babilliot, J M; Lacroux, C; Lugan, S; Schelcher, F; Chartier, C; Corbière, F; Andréoletti, O; Perrin-Chauvineau, C

2009-03-01

In sheep, susceptibility to scrapie is mainly influenced by polymorphisms of the PrP gene. In goats, there are to date few data related to scrapie susceptibility association with PrP gene polymorphisms. In this study, we first investigated PrP gene polymorphisms of the French Alpine and Saanen breeds. Based on PrP gene open reading frame sequencing of artificial insemination bucks (n=404), six encoding mutations were identified at codons 127, 142, 154, 211, 222 and 240. However, only seven haplotypes could be detected: four (GIH(154)RQS, GIRQ(211)QS, GIRRK(222)S and GIRRQP(240)) derived from the wild-type allele (G(127)I(142)R(154)R(211)Q(222)S(240)) by a single-codon mutation, and two (S(127)IRRQP(240) and GM(142)RRQP(240)) by a double-codon mutation. A case-control study was then implemented in a highly affected Alpine and Saanen breed herd (90 cases/164 controls). Mutations at codon 142 (I/M), 154 (R/H), 211 (R/Q) and 222 (Q/K) were found to induce a significant degree of protection towards natural scrapie infection. Compared with the baseline homozygote wild-type genotype I(142)R(154)R(211)Q(222)/IRRQ goats, the odds of scrapie cases in IRQ(211)Q/IRRQ and IRRK(222)/IRRQ heterozygous animals were significantly lower [odds ratio (OR)=0.133, P<0.0001; and OR=0.048, P<0.0001, respectively]. The heterozygote M(142)RRQ/IRRQ genotype was only protective (OR=0.243, P=0.0186) in goats also PP(240) homozygous at codon 240. However, mutated allele frequencies in French Alpine and Saanen breeds were low (0.5-18.5 %), which prevent us from assessing the influence of all the possible genotypes in natural exposure conditions.
Sequencing, Analysis, and Annotation of Expressed Sequence Tags for Camelus dromedarius

PubMed Central

Al-Swailem, Abdulaziz M.; Shehata, Maher M.; Abu-Duhier, Faisel M.; Al-Yamani, Essam J.; Al-Busadah, Khalid A.; Al-Arawi, Mohammed S.; Al-Khider, Ali Y.; Al-Muhaimeed, Abdullah N.; Al-Qahtani, Fahad H.; Manee, Manee M.; Al-Shomrani, Badr M.; Al-Qhtani, Saad M.; Al-Harthi, Amer S.; Akdemir, Kadir C.; Otu, Hasan H.

2010-01-01

Despite its economical, cultural, and biological importance, there has not been a large scale sequencing project to date for Camelus dromedarius. With the goal of sequencing complete DNA of the organism, we first established and sequenced camel EST libraries, generating 70,272 reads. Following trimming, chimera check, repeat masking, cluster and assembly, we obtained 23,602 putative gene sequences, out of which over 4,500 potentially novel or fast evolving gene sequences do not carry any homology to other available genomes. Functional annotation of sequences with similarities in nucleotide and protein databases has been obtained using Gene Ontology classification. Comparison to available full length cDNA sequences and Open Reading Frame (ORF) analysis of camel sequences that exhibit homology to known genes show more than 80% of the contigs with an ORF>300 bp and ∼40% hits extending to the start codons of full length cDNAs suggesting successful characterization of camel genes. Similarity analyses are done separately for different organisms including human, mouse, bovine, and rat. Accompanying web portal, CAGBASE (http://camel.kacst.edu.sa/), hosts a relational database containing annotated EST sequences and analysis tools with possibility to add sequences from public domain. We anticipate our results to provide a home base for genomic studies of camel and other comparative studies enabling a starting point for whole genome sequencing of the organism. PMID:20502665
The effector gene xopAE of Xanthomonas euvesicatoria 85-10 is part of an operon and encodes an E3 ubiquitin ligase.

PubMed

Popov, Georgy; Majhi, Bharat Bhusan; Sessa, Guido

2018-05-21

The type III effector XopAE from the Xanthomonas euvesicatoria strain 85-10 ( Xe 85-10) was previously shown to inhibit plant immunity and enhance pathogen-induced disease symptoms. Evolutionary analysis of 60 xopAE alleles ( AEal ) revealed that the xopAE locus is conserved in multiple Xanthomonas species. The majority of xopAE alleles (55 out of 60) encodes a single ORF ( xopAE ), while in 5 alleles, including AEal 37 of the Xe 85-10 strain, a frame-shift splits the locus into two ORFs ( hpaF and a truncated xopAE ). To test whether the second ORF of AEal 37 ( xopAE 85-10 ) is translated, we examined expression of YFP fused downstream to truncated or mutant forms of the locus in Xanthomonas bacteria. YFP fluorescence was detected at maximal levels when the reporter was in proximity of an internal ribosome-binding site upstream to a rare ATT start codon in the xopAE 85-10 ORF, but severely reduced when these elements were abolished. In agreement with the notion that xopAE 85- 10 is a functional gene, its protein product was translocated into plant cells by the type III secretion system and translocation was dependent on its upstream ORF hpaF. Homology modeling predicted that XopAE 85-10 contains an E3 ligase XL-box domain at the C-terminus, and in vitro assays demonstrated that this domain displays mono-ubiquitination activity. Remarkably, the XL-box was essential for XopAE 85-10 to inhibit PAMP-induced gene expression in Arabidopsis protoplasts. Together, these results indicate that the xopAE 85-10 gene resides in a functional operon, which utilizes the alternative start codon ATT, and encodes a novel XL-box E3 ligase. Importance Xanthomonas bacteria utilize a type III secretion system to cause disease in many crops. This study provides insights into evolution, translocation and biochemical function of the XopAE type III secreted effector contributing to the understanding of Xanthomonas-host interactions. We establish XopAE as core effector of seven Xanthomonas species and elucidate evolution of the Xanthomonas euvesicatoria xopAE locus, which contains an operon encoding a truncated effector. Our findings indicate that this operon evolved from the split of a multi-domains gene into two ORFs that conserved the original domain function. Analysis of xopAE 85-10 translation provides the first evidence for translation initiation from an ATT codon in Xanthomonas Our data demonstrate that XopAE 85-10 is an XL-box E3 ubiquitin ligase and provide insights into structure and function of this effector family. Copyright © 2018 American Society for Microbiology.
Cellular Selenoprotein mRNA Tethering via Antisense Interactions with Ebola and HIV-1 mRNAs May Impact Host Selenium Biochemistry.

PubMed

Taylor, Ethan Will; Ruzicka, Jan A; Premadasa, Lakmini; Zhao, Lijun

2016-01-01

Regulation of protein expression by non-coding RNAs typically involves effects on mRNA degradation and/or ribosomal translation. The possibility of virus-host mRNA-mRNA antisense tethering interactions (ATI) as a gain-of-function strategy, via the capture of functional RNA motifs, has not been hitherto considered. We present evidence that ATIs may be exploited by certain RNA viruses in order to tether the mRNAs of host selenoproteins, potentially exploiting the proximity of a captured host selenocysteine insertion sequence (SECIS) element to enable the expression of virally-encoded selenoprotein modules, via translation of in-frame UGA stop codons as selenocysteine. Computational analysis predicts thermodynamically stable ATIs between several widely expressed mammalian selenoprotein mRNAs (e.g., isoforms of thioredoxin reductase) and specific Ebola virus mRNAs, and HIV-1 mRNA, which we demonstrate via DNA gel shift assays. The probable functional significance of these ATIs is further supported by the observation that, in both viruses, they are located in close proximity to highly conserved in-frame UGA stop codons at the 3' end of open reading frames that encode essential viral proteins (the HIV-1 nef protein and the Ebola nucleoprotein). Significantly, in HIV/AIDS patients, an inverse correlation between serum selenium and mortality has been repeatedly documented, and clinical benefits of selenium in the context of multi-micronutrient supplementation have been demonstrated in several well-controlled clinical trials. Hence, in the light of our findings, the possibility of a similar role for selenium in Ebola pathogenesis and treatment merits serious investigation.
The complete mitochondrial genome of Plodia interpunctella (Lepidoptera: Pyralidae) and comparison with other Pyraloidea insects.

PubMed

Liu, Qiu-Ning; Chai, Xin-Yue; Bian, Dan-Dan; Zhou, Chun-Lin; Tang, Bo-Ping

2016-01-01

The mitochondrial (mt) genome can provide important information for the understanding of phylogenetic relationships. The complete mt genome of Plodia interpunctella (Lepidoptera: Pyralidae) has been sequenced. The circular genome is 15 287 bp in size, encoding 13 protein-coding genes (PCGs), 2 rRNA genes, 22 tRNA genes, and a control region. The AT skew of this mt genome is slightly negative, and the nucleotide composition is biased toward A+T nucleotides (80.15%). All PCGs start with the typical ATN (ATA, ATC, ATG, and ATT) codons, except for the cox1 gene which may start with the CGA codon. Four of the 13 PCGs harbor the incomplete termination codon T or TA. All the tRNA genes are folded into the typical clover-leaf structure of mitochondrial tRNA, except for trnS1 (AGN) in which the DHU arm fails to form a stable stem-loop structure. The overlapping sequences are 35 bp in total and are found in seven different locations. A total of 240 bp of intergenic spacers are scattered in 16 regions. The control region of the mt genome is 327 bp in length and consisted of several features common to the sequenced lepidopteran insects. Phylogenetic analysis based on 13 PCGs using the Maximum Likelihood method shows that the placement of P. interpunctella was within the Pyralidae.

The complete mitochondrial genome of the American black flour beetle Tribolium audax (Coleoptera: Tenebrionidae).

PubMed

Ou, Jing; Liu, Jin-Bo; Yao, Fu-Jiao; Wang, Xin-Guo; Wei, Zhao-Ming

2016-01-01

Flour beetles of the genus Tribolium are all pests of stored products and cause severe economic losses every year. The American black flour beetle Tribolium audax is one of the important pest species of flour beetle, and it is also an important quarantine insect. Here we sequenced and characterized the complete mitochondrial genome of T. audax, which was intercepted by Huangpu Custom in maize from America. The complete circular mitochondrial genome (mitogenome) of T. audax was 15,924 bp in length, containing 37 typical coding genes and one non-coding AT-rich region. The mitogenome of T. audax exhibits a gene arrangement and content identical to the most common type in insects. All protein coding genes (PCGs) are start with a typical ATN initiation codon, except for the cox1, which use AAC as its start codon instead of ATN. Eleven genes use standard complete termination codon (nine TAA, two TAG), whereas the nad4 and nad5 genes end with single T. Except for trnS1 (AGN), all tRNA genes display typical secondary cloverleaf structures as those of other insects. The sizes of the large and small ribosomal RNA genes are 1288 and 780 bp, respectively. The AT content of the AT-rich region is 81.36%. The 5 bp conserved motif TACTA was found in the intergenic region between trnS2 (UCN) and nad1.
Essential and Dispensable Virus-Encoded Replication Elements Revealed by Efforts To Develop Hypoviruses as Gene Expression Vectors

PubMed Central

Suzuki, Nobuhiro; Geletka, Lynn M.; Nuss, Donald L.

2000-01-01

We have investigated whether hypoviruses, viral agents responsible for virulence attenuation (hypovirulence) of the chestnut blight fungus Cryphonectria parasitica, could serve as gene expression vectors. The infectious cDNA clone of the prototypic hypovirus CHV1-EP713 was modified to generate 20 different vector candidates. Although transient expression was achieved for a subset of vectors that contained the green fluorescent protein gene from Aequorea victoria, long-term expression (past day 8) was not observed for any vector construct. Analysis of viral RNAs recovered from transfected fungal colonies revealed that the foreign genes were readily deleted from the replicating virus, although small portions of foreign sequences were retained by some vectors after months of replication. However, the results of vector viability and progeny characterization provided unexpected new insights into essential and dispensable elements of hypovirus replication. The N-terminal portion (codons 1 to 24) of the 5′-proximal open reading frame (ORF), ORF A, was found to be required for virus replication, while the remaining 598 codons of this ORF were completely dispensable. Substantial alterations were tolerated in the pentanucleotide UAAUG that contains the ORF A termination codon and the overlapping putative initiation codon of the second of the two hypovirus ORFs, ORF B. Replication competence was maintained following either a frameshift mutation that caused a two-codon extension of ORF A or a modification that produced a single-ORF genomic organization. These results are discussed in terms of determinants of hypovirus replication, the potential utility of hypoviruses as gene expression vectors, and possible mechanisms by which hypoviruses recognize and delete foreign sequences. PMID:10906211
The mitochondnal genome of Aspergillus nidulans contains reading frames homologous to the human URFs 1 and 4.

PubMed Central

Brown, T A; Davies, R W; Ray, J A; Waring, R B; Scazzocchio, C

1983-01-01

A 2830-bp segment of the mitochondrial genome of the fungus Aspergillus nidulans was sequenced and shown to contain two unidentified reading frames (URFs). These reading frames are 352 and 488 codons in length, and would specify unmodified proteins of mol. wts. 39,000 and 54,000, respectively. The derived amino acid sequences indicate that these genes are equivalent to the human mitochondrial URFs 1 and 4, with 39% amino acid homology for URF1 and 26% for URF4. Both URFs were shown by secondary structure predictions to code for predominantly beta-sheeted proteins with strong structural conservation between the fungal and human homologues. Counterparts of mammalian URFs have not previously been identified in non-mammalian genomes, and the discovery that A. nidulans possesses reading frames so closely homologous with URF1 and URF4 shows that these genes are of general functional importance in the mitochondria of diverse species. PMID:11894959
Structure of a human cap-dependent 48S translation pre-initiation complex

PubMed Central

Eliseev, Boris; Yeramala, Lahari; Leitner, Alexander; Karuppasamy, Manikandan; Raimondeau, Etienne; Huard, Karine; Alkalaeva, Elena; Aebersold, Ruedi

2018-01-01

Abstract Eukaryotic translation initiation is tightly regulated, requiring a set of conserved initiation factors (eIFs). Translation of a capped mRNA depends on the trimeric eIF4F complex and eIF4B to load the mRNA onto the 43S pre-initiation complex comprising 40S and initiation factors 1, 1A, 2, 3 and 5 as well as initiator-tRNA. Binding of the mRNA is followed by mRNA scanning in the 48S pre-initiation complex, until a start codon is recognised. Here, we use a reconstituted system to prepare human 48S complexes assembled on capped mRNA in the presence of eIF4B and eIF4F. The highly purified h-48S complexes are used for cross-linking/mass spectrometry, revealing the protein interaction network in this complex. We report the electron cryo-microscopy structure of the h-48S complex at 6.3 Å resolution. While the majority of eIF4B and eIF4F appear to be flexible with respect to the ribosome, additional density is detected at the entrance of the 40S mRNA channel which we attribute to the RNA-recognition motif of eIF4B. The eight core subunits of eIF3 are bound at the 40S solvent-exposed side, as well as the subunits eIF3d, eIF3b and eIF3i. elF2 and initiator-tRNA bound to the start codon are present at the 40S intersubunit side. This cryo-EM structure represents a molecular snap-shot revealing the h-48S complex following start codon recognition. PMID:29401259
Complete mitochondrial genome of yellow meal worm(Tenebrio molitor)

PubMed Central

LIU, Li-Na; WANG, Cheng-Ye

2014-01-01

The yellow meal worm(Tenebrio molitor L.) is an important resource insect typically used as animal feed additive. It is also widely used for biological research. The first complete mitochondrial genome of T. molitor was determined for the first time by long PCR and conserved primer walking approaches. The results showed that the entire mitogenome of T. molitor was 15 785 bp long, with 72.35% A+T content [deposited in GenBank with accession number KF418153]. The gene order and orientation were the same as the most common type suggested as ancestral for insects. Two protein-coding genes used atypical start codons(CTA in ND2 and AAT in COX1), and the remaining 11 protein-coding genes started with a typical insect initiation codon ATN. All tRNAs showed standard clover-leaf structure, except for tRNASer(AGN), which lacked a dihydrouridine(DHU) arm. The newly added T. molitor mitogenome could provide information for future studies on yellow meal worm. PMID:25465087
Complete mitochondrial genome of yellow meal worm (Tenebrio molitor).

PubMed

Liu, Li-Na; Wang, Cheng-Ye

2014-11-18

The yellow meal worm (Tenebrio molitor L.) is an important resource insect typically used as animal feed additive. It is also widely used for biological research. The first complete mitochondrial genome of T. molitor was determined for the first time by long PCR and conserved primer walking approaches. The results showed that the entire mitogenome of T. molitor was 15 785 bp long, with 72.35% A+T content [deposited in GenBank with accession number KF418153]. The gene order and orientation were the same as the most common type suggested as ancestral for insects. Two protein-coding genes used atypical start codons (CTA in ND2 and AAT in COX1), and the remaining 11 protein-coding genes started with a typical insect initiation codon ATN. All tRNAs showed standard clover-leaf structure, except for tRNA(Ser) (AGN), which lacked a dihydrouridine (DHU) arm. The newly added T. molitor mitogenome could provide information for future studies on yellow meal worm.
Self-organizing approach for meta-genomes.

PubMed

Zhu, Jianfeng; Zheng, Wei-Mou

2014-12-01

We extend the self-organizing approach for annotation of a bacterial genome to analyze the raw sequencing data of the human gut metagenome without sequence assembling. The original approach divides the genomic sequence of a bacterium into non-overlapping segments of equal length and assigns to each segment one of seven 'phases', among which one is for the noncoding regions, three for the direct coding regions to indicate the three possible codon positions of the segment starting site, and three for the reverse coding regions. The noncoding phase and the six coding phases are described by two frequency tables of the 64 triplet types or 'codon usages'. A set of codon usages can be used to update the phase assignment and vice versa. An iteration after an initialization leads to a convergent phase assignment to give an annotation of the genome. In the extension of the approach to a metagenome, we consider a mixture model of a number of categories described by different codon usages. The Illumina Genome Analyzer sequencing data of the total DNA from faecal samples are then examined to understand the diversity of the human gut microbiome. Copyright © 2014 Elsevier Ltd. All rights reserved.
Balancing Selection of a Frame-Shift Mutation in the MRC2 Gene Accounts for the Outbreak of the Crooked Tail Syndrome in Belgian Blue Cattle

PubMed Central

Li, Wanbo; Dive, Marc; Tamma, Nico; Michaux, Charles; Druet, Tom; Huijbers, Ivo J.; Isacke, Clare M.; Coppieters, Wouter; Georges, Michel; Charlier, Carole

2009-01-01

We herein describe the positional identification of a 2-bp deletion in the open reading frame of the MRC2 receptor causing the recessive Crooked Tail Syndrome in cattle. The resulting frame-shift reveals a premature stop codon that causes nonsense-mediated decay of the mutant messenger RNA, and the virtual absence of functional Endo180 protein in affected animals. Cases exhibit skeletal anomalies thought to result from impaired extracellular matrix remodeling during ossification, and as of yet unexplained muscular symptoms. We demonstrate that carrier status is very significantly associated with desired characteristics in the general population, including enhanced muscular development, and that the resulting heterozygote advantage caused a selective sweep which explains the unexpectedly high frequency (25%) of carriers in the Belgian Blue Cattle Breed. PMID:19779552
Systematic screening for mutations in the human serotonin 1F receptor gene in patients with bipolar affective disorder and schizophrenia

DOE Office of Scientific and Technical Information (OSTI.GOV)

Shimron-Abarbanell, D.; Harms, H.; Erdmann, J.

1996-04-09

Using single strand conformational analysis we screened the complete coding sequence of the serotonin 1F (5-HT{sub 1F}) receptor gene for the presence of DNA sequence variation in a sample of 137 unrelated individuals including 45 schizophrenic patients, 46 bipolar patients, as well as 46 healthy controls. We detected only three rare sequence variants which are characterized by single base pair substitutions, namely a silent T{r_arrow}A transversion in the third position of codon 261 (encoding isoleucine), a silent C{r_arrow}T transition in the third position of codon 176 (encoding histidine), and a C{r_arrow}T transition in position -78 upstream from the start codon.more » The lack of significant mutations in patients suffering from schizophrenia and bipolar affective disorder indicates that the 5-HT{sub 1F} receptor is not commonly involved in the etiology of these diseases. 12 refs., 1 fig., 2 tabs.« less
A species-specific nucleosomal signature defines a periodic distribution of amino acids in proteins.

PubMed

Quintales, Luis; Soriano, Ignacio; Vázquez, Enrique; Segurado, Mónica; Antequera, Francisco

2015-04-01

Nucleosomes are the basic structural units of chromatin. Most of the yeast genome is organized in a pattern of positioned nucleosomes that is stably maintained under a wide range of physiological conditions. In this work, we have searched for sequence determinants associated with positioned nucleosomes in four species of fission and budding yeasts. We show that mononucleosomal DNA follows a highly structured base composition pattern, which differs among species despite the high degree of histone conservation. These nucleosomal signatures are present in transcribed and non-transcribed regions across the genome. In the case of open reading frames, they correctly predict the relative distribution of codons on mononucleosomal DNA, and they also determine a periodicity in the average distribution of amino acids along the proteins. These results establish a direct and species-specific connection between the position of each codon around the histone octamer and protein composition.
Defect in the GTPase activating protein (GAP) function of eIF5 causes repression of GCN4 translation.

PubMed

Antony A, Charles; Alone, Pankaj V

2017-05-13

In eukaryotes, the eIF5 protein plays an important role in translation start site selection by providing the GAP (GTPase activating protein) function. However, in yeast translation initiation fidelity defective eIF5 G31R mutant causes preferential utilization of UUG as initiation codon and is termed as Suppressor of initiation codon (Sui - ) phenotype due to its hyper GTPase activity. The eIF5 G31R mutant dominantly represses GCN4 expression and confers sensitivity to 3-Amino-1,2,4-Trizole (3AT) induced starvation. The down-regulation of the GCN4 expression (Gcn - phenotype) in the eIF5 G31R mutant was not because of leaky scanning defects; rather was due to the utilization of upUUG initiation codons at the 5' regulatory region present between uORF1 and the main GCN4 ORF. Copyright © 2017 Elsevier Inc. All rights reserved.
Influenza A virus PB1-F2 protein expression is regulated in a strain-specific manner by sequences located downstream of the PB1-F2 initiation codon

USDA-ARS?s Scientific Manuscript database

Translation of influenza A virus PB1-F2 occurs in a second open reading frame (ORF) of the PB1 gene segment. PB1-F2 has been implicated in regulation of polymerase activity, immunopathology, susceptibility to secondary bacterial infection, and induction of apoptosis. Experimental evidence of PB1-F2 ...
Selection of functional 2A sequences within foot-and-mouth disease virus; requirements for the NPGP motif with a distinct codon bias.

PubMed

Kjær, Jonas; Belsham, Graham J

2018-01-01

Foot-and-mouth disease virus (FMDV) has a positive-sense ssRNA genome including a single, large, open reading frame. Splitting of the encoded polyprotein at the 2A/2B junction is mediated by the 2A peptide (18 residues long), which induces a nonproteolytic, cotranslational "cleavage" at its own C terminus. A conserved feature among variants of 2A is the C-terminal motif N 16 P 17 G 18 /P 19 , where P 19 is the first residue of 2B. It has been shown previously that certain amino acid substitutions can be tolerated at residues E 14 , S 15 , and N 16 within the 2A sequence of infectious FMDVs, but no variants at residues P 17 , G 18 , or P 19 have been identified. In this study, using highly degenerate primers, we analyzed if any other residues can be present at each position of the NPG/P motif within infectious FMDV. No alternative forms of this motif were found to be encoded by rescued FMDVs after two, three, or four passages. However, surprisingly, a clear codon preference for the wt nucleotide sequence encoding the NPGP motif within these viruses was observed. Indeed, the codons selected to code for P 17 and P 19 within this motif were distinct; thus the synonymous codons are not equivalent. © 2018 Kjær and Belsham; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Beyond the Triplet Code: Context Cues Transform Translation.

PubMed

Brar, Gloria A

2016-12-15

The elucidation of the genetic code remains among the most influential discoveries in biology. While innumerable studies have validated the general universality of the code and its value in predicting and analyzing protein coding sequences, established and emerging work has also suggested that full genome decryption may benefit from a greater consideration of a codon's neighborhood within an mRNA than has been broadly applied. This Review examines the evidence for context cues in translation, with a focus on several recent studies that reveal broad roles for mRNA context in programming translation start sites, the rate of translation elongation, and stop codon identity. Copyright © 2016 Elsevier Inc. All rights reserved.
Sequence Analysis of Mitochondrial Genome of Toxascaris leonina from a South China Tiger.

PubMed

Li, Kangxin; Yang, Fang; Abdullahi, A Y; Song, Meiran; Shi, Xianli; Wang, Minwei; Fu, Yeqi; Pan, Weida; Shan, Fang; Chen, Wu; Li, Guoqing

2016-12-01

Toxascaris leonina is a common parasitic nematode of wild mammals and has significant impacts on the protection of rare wild animals. To analyze population genetic characteristics of T. leonina from South China tiger, its mitochondrial (mt) genome was sequenced. Its complete circular mt genome was 14,277 bp in length, including 12 protein-coding genes, 22 tRNA genes, 2 rRNA genes, and 2 non-coding regions. The nucleotide composition was biased toward A and T. The most common start codon and stop codon were TTG and TAG, and 4 genes ended with an incomplete stop codon. There were 13 intergenic regions ranging 1 to 10 bp in size. Phylogenetically, T. leonina from a South China tiger was close to canine T. leonina . This study reports for the first time a complete mt genome sequence of T. leonina from the South China tiger, and provides a scientific basis for studying the genetic diversity of nematodes between different hosts.
Insights into factorless translational initiation by the tRNA-like pseudoknot domain of a viral IRES.

PubMed

Au, Hilda H T; Jan, Eric

2012-01-01

The intergenic region internal ribosome entry site (IGR IRES) of the Dicistroviridae family adopts an overlapping triple pseudoknot structure to directly recruit the 80S ribosome in the absence of initiation factors. The pseudoknot I (PKI) domain of the IRES mimics a tRNA-like codon:anticodon interaction in the ribosomal P site to direct translation initiation from a non-AUG initiation codon in the A site. In this study, we have performed a comprehensive mutational analysis of this region to delineate the molecular parameters that drive IRES translation. We demonstrate that IRES-mediated translation can initiate at an alternate adjacent and overlapping start site, provided that basepairing interactions within PKI remain intact. Consistent with this, IGR IRES translation tolerates increases in the variable loop region that connects the anticodon- and codon-like elements within the PKI domain, as IRES activity remains relatively robust up to a 4-nucleotide insertion in this region. Finally, elements from an authentic tRNA anticodon stem-loop can functionally supplant corresponding regions within PKI. These results verify the importance of the codon:anticodon interaction of the PKI domain and further define the specific elements within the tRNA-like domain that contribute to optimal initiator Met-tRNA(i)-independent IRES translation.
Insights into Factorless Translational Initiation by the tRNA-Like Pseudoknot Domain of a Viral IRES

PubMed Central

Au, Hilda H. T.; Jan, Eric

2012-01-01

The intergenic region internal ribosome entry site (IGR IRES) of the Dicistroviridae family adopts an overlapping triple pseudoknot structure to directly recruit the 80S ribosome in the absence of initiation factors. The pseudoknot I (PKI) domain of the IRES mimics a tRNA-like codon:anticodon interaction in the ribosomal P site to direct translation initiation from a non-AUG initiation codon in the A site. In this study, we have performed a comprehensive mutational analysis of this region to delineate the molecular parameters that drive IRES translation. We demonstrate that IRES-mediated translation can initiate at an alternate adjacent and overlapping start site, provided that basepairing interactions within PKI remain intact. Consistent with this, IGR IRES translation tolerates increases in the variable loop region that connects the anticodon- and codon-like elements within the PKI domain, as IRES activity remains relatively robust up to a 4-nucleotide insertion in this region. Finally, elements from an authentic tRNA anticodon stem-loop can functionally supplant corresponding regions within PKI. These results verify the importance of the codon:anticodon interaction of the PKI domain and further define the specific elements within the tRNA-like domain that contribute to optimal initiator Met-tRNAi-independent IRES translation. PMID:23236506
Cellular Selenoprotein mRNA Tethering via Antisense Interactions with Ebola and HIV-1 mRNAs May Impact Host Selenium Biochemistry

PubMed Central

Taylor, Ethan Will; Ruzicka, Jan A.; Premadasa, Lakmini; Zhao, Lijun

2016-01-01

Regulation of protein expression by non-coding RNAs typically involves effects on mRNA degradation and/or ribosomal translation. The possibility of virus-host mRNA-mRNA antisense tethering interactions (ATI) as a gain-of-function strategy, via the capture of functional RNA motifs, has not been hitherto considered. We present evidence that ATIs may be exploited by certain RNA viruses in order to tether the mRNAs of host selenoproteins, potentially exploiting the proximity of a captured host selenocysteine insertion sequence (SECIS) element to enable the expression of virally-encoded selenoprotein modules, via translation of in-frame UGA stop codons as selenocysteine. Computational analysis predicts thermodynamically stable ATIs between several widely expressed mammalian selenoprotein mRNAs (e.g., isoforms of thioredoxin reductase) and specific Ebola virus mRNAs, and HIV-1 mRNA, which we demonstrate via DNA gel shift assays. The probable functional significance of these ATIs is further supported by the observation that, in both viruses, they are located in close proximity to highly conserved in-frame UGA stop codons at the 3′ end of open reading frames that encode essential viral proteins (the HIV-1 nef protein and the Ebola nucleoprotein). Significantly, in HIV/AIDS patients, an inverse correlation between serum selenium and mortality has been repeatedly documented, and clinical benefits of selenium in the context of multi-micronutrient supplementation have been demonstrated in several well-controlled clinical trials. Hence, in the light of our findings, the possibility of a similar role for selenium in Ebola pathogenesis and treatment merits serious investigation. PMID:26369818
Therapy for Duchenne muscular dystrophy: renewed optimism from genetic approaches.

PubMed

Fairclough, Rebecca J; Wood, Matthew J; Davies, Kay E

2013-06-01

Duchenne muscular dystrophy (DMD) is a devastating progressive disease for which there is currently no effective treatment except palliative therapy. There are several promising genetic approaches, including viral delivery of the missing dystrophin gene, read-through of translation stop codons, exon skipping to restore the reading frame and increased expression of the compensatory utrophin gene. The lessons learned from these approaches will be applicable to many other disorders.
Mucopolysaccharidosis type I: Identification and characterization of mutations affecting alpha-L-iduronidase activity.

PubMed

Lee-Chen, Guey-Jen; Lin, Shuan-Pei; Chen, I-Shen; Chang, Jui-Hung; Yang, Chyau-Wen; Chin, Yi-Wen

2002-06-01

Mucopolysaccharidosis type I (MPS I) is caused by a deficiency of the lysosomal enzyme alpha-L-iduronidase (IDUA). MPS I covers a broad spectrum of clinical severity ranging from severe Hurler syndrome through intermediate Hurler/Scheie syndrome to mild Scheie syndrome. Mutation screening was performed in two unrelated Taiwanese MPS I patients. A Hurler/Scheie patient had A79V (C to T transition in codon 79) in exon 2 and R619G (C to G transversion in codon 619) in exon 14. R619G has been shown to cause disease. Expression of A79V in COS-7 cells showed trace amounts of IDUA activity, demonstrating the deleterious nature of the mutation. A79V mutation did not cause a reduction in IDUA mRNA levels. The reduced level of IDUA protein suggests increased degradation of the mutant enzyme. A Hurler patient had 134del12 (in-frame deletion of codons 16-19 in signal peptide) in exon 1 and Q584X (C to T transition in codon 584) in exon 13. Transfection of COS-7 cells with Q584X did not yield active enzyme. Q584X mutation caused an apparent reduction in the IDUA mRNA level and no IDUA protein was detected. Conversely, 134del12 showed 124.6% of normal activity in transfected cells and a 77-kDa precursor protein was observed on Western blot, suggesting biologic activity of precursor IDUA without posttranslational cleavage. These findings provide further evidence of the molecular heterogeneity in mutations in MPS I.

Comparative Analysis of Syntenic Genes in Grass Genomes Reveals Accelerated Rates of Gene Structure and Coding Sequence Evolution in Polyploid Wheat1[W][OA

PubMed Central

Akhunov, Eduard D.; Sehgal, Sunish; Liang, Hanquan; Wang, Shichen; Akhunova, Alina R.; Kaur, Gaganpreet; Li, Wanlong; Forrest, Kerrie L.; See, Deven; Šimková, Hana; Ma, Yaqin; Hayden, Matthew J.; Luo, Mingcheng; Faris, Justin D.; Doležel, Jaroslav; Gill, Bikram S.

2013-01-01

Cycles of whole-genome duplication (WGD) and diploidization are hallmarks of eukaryotic genome evolution and speciation. Polyploid wheat (Triticum aestivum) has had a massive increase in genome size largely due to recent WGDs. How these processes may impact the dynamics of gene evolution was studied by comparing the patterns of gene structure changes, alternative splicing (AS), and codon substitution rates among wheat and model grass genomes. In orthologous gene sets, significantly more acquired and lost exonic sequences were detected in wheat than in model grasses. In wheat, 35% of these gene structure rearrangements resulted in frame-shift mutations and premature termination codons. An increased codon mutation rate in the wheat lineage compared with Brachypodium distachyon was found for 17% of orthologs. The discovery of premature termination codons in 38% of expressed genes was consistent with ongoing pseudogenization of the wheat genome. The rates of AS within the individual wheat subgenomes (21%–25%) were similar to diploid plants. However, we uncovered a high level of AS pattern divergence between the duplicated homeologous copies of genes. Our results are consistent with the accelerated accumulation of AS isoforms, nonsynonymous mutations, and gene structure rearrangements in the wheat lineage, likely due to genetic redundancy created by WGDs. Whereas these processes mostly contribute to the degeneration of a duplicated genome and its diploidization, they have the potential to facilitate the origin of new functional variations, which, upon selection in the evolutionary lineage, may play an important role in the origin of novel traits. PMID:23124323
Selenium. Role of the Essential Metalloid in Health

PubMed Central

Kurokawa, Suguru; Berry, Marla J.

2015-01-01

Selenium is an essential micronutrient in mammals, but is also recognized as toxic in excess. It is a non-metal with properties that are intermediate between the chalcogen elements sulfur and tellurium. Selenium exerts its biological functions through selenoproteins. Selenoproteins contain selenium in the form of the 21st amino acid, selenocysteine (Sec), which is an analog of cysteine with the sulfur-containing side chain replaced by a Se-containing side chain. Sec is encoded by the codon UGA, which is one of three termination codons for mRNA translation in non-selenoprotein genes. Recognition of the UGA codon as a Sec insertion site instead of stop requires a Sec insertion sequence (SECIS) element in selenoprotein mRNAs and a unique selenocysteyl-tRNA, both of which are recognized by specialized protein factors. Unlike the 20 standard amino acids, Sec is biosynthesized from serine on its tRNA. Twenty-five selenoproteins are encoded in the human genome. Most of the selenoprotein genes were discovered by bioinformatics approaches, searching for SECIS elements downstream of in-frame UGA codons. Sec has been described as having stronger nucleophilic and electrophilic properties than cysteine, and Sec is present in the catalytic site of all selenoenzymes. Most selenoproteins, whose functions are known, are involved in redox systems and signaling pathways. However, several selenoproteins are not well characterized in terms of their function. The selenium field has grown dramatically in the last few decades, and research on selenium biology is providing extensive new information regarding its importance for human health. PMID:24470102
Complete Mitochondrial Genome of Suwallia teleckojensis (Plecoptera: Chloroperlidae) and Implications for the Higher Phylogeny of Stoneflies

PubMed Central

Cao, Jin-Jun; Li, Wei-Hai

2018-01-01

Stoneflies comprise an ancient group of insects, but the phylogenetic position of Plecoptera and phylogenetic relations within Plecoptera have long been controversial, and more molecular data is required to reconstruct precise phylogeny. Herein, we present the complete mitogenome of a stonefly, Suwallia teleckojensis, which is 16146 bp in length and consists of 13 protein-coding genes (PCGs), 2 ribosomal RNAs (rRNAs), 22 transfer RNAs (tRNAs) and a control region (CR). Most PCGs initiate with the standard start codon ATN. However, ND5 and ND1 started with GTG and TTG. Typical termination codons TAA and TAG were found in eleven PCGs, and the remaining two PCGs (COII and ND5) have incomplete termination codons. All transfer RNA genes (tRNAs) have the classic cloverleaf secondary structures, with the exception of tRNASer(AGN), which lacks the dihydrouridine (DHU) arm. Secondary structures of the two ribosomal RNAs were shown referring to previous models. A large tandem repeat region, two potential stem-loop (SL) structures, Poly N structure (2 poly-A, 1 poly-T and 1 poly-C), and four conserved sequence blocks (CSBs) were detected in the control region. Finally, both maximum likelihood (ML) and Bayesian inference (BI) analyses suggested that the Capniidae was monophyletic, and the other five stonefly families form a monophyletic group. In this study, S. teleckojensis was closely related to Sweltsa longistyla, and Chloroperlidae and Perlidae were herein supported to be a sister group. PMID:29495588
Complete Mitochondrial Genome of Suwallia teleckojensis (Plecoptera: Chloroperlidae) and Implications for the Higher Phylogeny of Stoneflies.

PubMed

Wang, Ying; Cao, Jin-Jun; Li, Wei-Hai

2018-02-28

Stoneflies comprise an ancient group of insects, but the phylogenetic position of Plecoptera and phylogenetic relations within Plecoptera have long been controversial, and more molecular data is required to reconstruct precise phylogeny. Herein, we present the complete mitogenome of a stonefly, Suwallia teleckojensis , which is 16146 bp in length and consists of 13 protein-coding genes (PCGs), 2 ribosomal RNAs (rRNAs), 22 transfer RNAs (tRNAs) and a control region (CR). Most PCGs initiate with the standard start codon ATN. However, ND5 and ND1 started with GTG and TTG. Typical termination codons TAA and TAG were found in eleven PCGs, and the remaining two PCGs ( COII and ND5 ) have incomplete termination codons. All transfer RNA genes (tRNAs) have the classic cloverleaf secondary structures, with the exception of tRNA Ser(AGN) , which lacks the dihydrouridine (DHU) arm. Secondary structures of the two ribosomal RNAs were shown referring to previous models. A large tandem repeat region, two potential stem-loop (SL) structures, Poly N structure (2 poly-A, 1 poly-T and 1 poly-C), and four conserved sequence blocks (CSBs) were detected in the control region. Finally, both maximum likelihood (ML) and Bayesian inference (BI) analyses suggested that the Capniidae was monophyletic, and the other five stonefly families form a monophyletic group. In this study, S. teleckojensis was closely related to Sweltsa longistyla , and Chloroperlidae and Perlidae were herein supported to be a sister group.
The complete mitogenome sequence of the Japanese oak silkmoth, Antheraea yamamai (Lepidoptera: Saturniidae).

PubMed

Kim, Seong Ryeol; Kim, Man Il; Hong, Mee Yeon; Kim, Kee Young; Kang, Pil Don; Hwang, Jae Sam; Han, Yeon Soo; Jin, Byung Rae; Kim, Iksoo

2009-09-01

The 15,338-bp long complete mitochondrial genome (mitogenome) of the Japanese oak silkmoth, Antheraea yamamai (Lepidoptera: Saturniidae) was determined. This genome has a gene arrangement identical to those of all other sequenced lepidopteran insects, but differs from the most common type, as the result of the movement of tRNA(Met) to a position 5'-upstream of tRNA(Ile). No typical start codon of the A. yamamai COI gene is available. Instead, a tetranucleotide, TTAG, which is found at the beginning context of all sequenced lepidopteran insects was tentatively designated as the start codon for A. yamamai COI gene. Three of the 13 protein-coding genes (PCGs) harbor the incomplete termination codon, T or TA. All tRNAs formed stable stem-and-loop structures, with the exception of tRNA(Ser)(AGN), the DHU arm of which formed a simple loop as has been observed in many other metazoan mt tRNA(Ser)(AGN). The 334-bp long A + T-rich region is noteworthy in that it harbors tRNA-like structures, as has also been seen in the A + T-rich regions of other insect mitogenomes. Phylogenetic analyses of the available species of Bombycoidea, Pyraloidea, and Tortricidea bolstered the current morphology-based hypothesis that Bombycoidea and Pyraloidea are monophyletic (Obtectomera). As has been previously suggested, Bombycidae (Bombyx mori and B. mandarina) and Saturniidae (A. yamamai and Caligula boisduvalii) formed a reciprocal monophyletic group.
The CUG-initiated larger form coat protein of Chinese wheat mosaic virus binds to the cysteine-rich RNA silencing suppressor.

PubMed

Sun, Liying; Andika, Ida Bagus; Shen, Jiangfeng; Yang, Di; Ratti, Claudio; Chen, Jianping

2013-10-01

Some viruses use alternative translation initiation at non-AUG codons as a strategy to produce multiple proteins during gene expression. Here we show that, using this strategy, Chinese wheat mosaic virus (CWMV; Furovirus) expresses a larger form of coat protein (N-ext/CP) in infected plants. Site-directed mutagenesis and transient expression analysis confirmed that CWMV N-ext/CP is initiated at an upstream in-frame CUG codon at nucleotide position 207-209 of RNA 2, which adds a 39 amino acid (aa) N-terminal extension to the major CP. Interestingly, in planta and in vitro analyses indicated that CWMV N-ext/CP but not CP interacts with the CWMV cysteine-rich protein (CRP), an RNA silencing suppressor. We further determined that the N-terminal 39 aa extension, particularly the 10 aa region immediately upstream of the major CP coding region is responsible for the interaction of N-ext/CP with CRP. In an Agrobacterium co-infiltration assay, co-expression with N-ext/CP did not affect CRP silencing suppression activity. Thus the alternative translation initiation at a CUG codon provides the CWMV N-ext/CP with the ability to bind to the viral silencing suppressor. Copyright © 2013 Elsevier B.V. All rights reserved.
Global shape mimicry of tRNA within a viral internal ribosome entry site mediates translational reading frame selection

PubMed Central

Au, Hilda H.; Cornilescu, Gabriel; Mouzakis, Kathryn D.; Ren, Qian; Burke, Jordan E.; Lee, Seonghoon; Butcher, Samuel E.; Jan, Eric

2015-01-01

The dicistrovirus intergenic region internal ribosome entry site (IRES) adopts a triple-pseudoknotted RNA structure and occupies the core ribosomal E, P, and A sites to directly recruit the ribosome and initiate translation at a non-AUG codon. A subset of dicistrovirus IRESs directs translation in the 0 and +1 frames to produce the viral structural proteins and a +1 overlapping open reading frame called ORFx, respectively. Here we show that specific mutations of two unpaired adenosines located at the core of the three-helical junction of the honey bee dicistrovirus Israeli acute paralysis virus (IAPV) IRES PKI domain can uncouple 0 and +1 frame translation, suggesting that the structure adopts distinct conformations that contribute to 0 or +1 frame translation. Using a reconstituted translation system, we show that ribosomes assembled on mutant IRESs that direct exclusive 0 or +1 frame translation lack reading frame fidelity. Finally, a nuclear magnetic resonance/small-angle X-ray scattering hybrid approach reveals that the PKI domain of the IAPV IRES adopts an RNA structure that resembles a complete tRNA. The tRNA shape-mimicry enables the viral IRES to gain access to the ribosome tRNA-binding sites and form intermolecular contacts with the ribosome that are necessary for initiating IRES translation in a specific reading frame. PMID:26554019
The genetic basis of asymptomatic codon 8 frame-shift (HBB:c25_26delAA) β(0) -thalassaemia homozygotes.

PubMed

Jiang, Zhihua; Luo, Hong-Yuan; Huang, Shengwen; Farrell, John J; Davis, Lance; Théberge, Roger; Benson, Katherine A; Riolueang, Suchada; Viprakasit, Vip; Al-Allawi, Nasir A S; Ünal, Sule; Gümrük, Fatma; Akar, Nejat; Başak, A Nazli; Osorio, Leonor; Badens, Catherine; Pissard, Serge; Joly, Philippe; Campbell, Andrew D; Gallagher, Patrick G; Steinberg, Martin H; Forget, Bernard G; Chui, David H K

2016-03-01

Two 21-year old dizygotic twin men of Iraqi descent were homozygous for HBB codon 8, deletion of two nucleotides (-AA) frame-shift β(0) -thalassaemia mutation (FSC8; HBB:c25_26delAA). Both were clinically well, had splenomegaly, and were never transfused. They had mild microcytic anaemia (Hb 120-130 g/l) and 98% of their haemoglobin was fetal haemoglobin (HbF). Both were carriers of Hph α-thalassaemia mutation. On the three major HbF quantitative trait loci (QTL), the twins were homozygous for G>A HBG2 Xmn1 site at single nucleotide polymorphism (SNP) rs7482144, homozygous for 3-bp deletion HBS1L-MYB intergenic polymorphism (HMIP) at rs66650371, and heterozygous for the A>C BCL11A intron 2 polymorphism at rs766432. These findings were compared with those found in 22 other FSC8 homozygote patients: four presented with thalassaemia intermedia phenotype, and 18 were transfusion dependent. The inheritance of homozygosity for HMIP 3-bp deletion at rs66650371 and heterozygosity for Hph α-thalassaemia mutation was found in the twins and not found in any of the other 22 patients. Further studies are needed to uncover likely additional genetic variants that could contribute to the exceptionally high HbF levels and mild phenotype in these twins. © 2016 John Wiley & Sons Ltd.
Developmental rearrangement of cyanobacterial nif genes: nucleotide sequence, open reading frames, and cytochrome P-450 homology of the Anabaena sp. strain PCC 7120 nifD element.

PubMed Central

Lammers, P J; McLaughlin, S; Papin, S; Trujillo-Provencio, C; Ryncarz, A J

1990-01-01

An 11-kbp DNA element of unknown function interrupts the nifD gene in vegetative cells of Anabaena sp. strain PCC 7120. In developing heterocysts the nifD element excises from the chromosome via site-specific recombination between short repeat sequences that flank the element. The nucleotide sequence of the nifH-proximal half of the element was determined to elucidate the genetic potential of the element. Four open reading frames with the same relative orientation as the nifD element-encoded xisA gene were identified in the sequenced region. Each of the open reading frames was preceded by a reasonable ribosome-binding site and had biased codon utilization preferences consistent with low levels of expression. Open reading frame 3 was highly homologous with three cytochrome P-450 omega-hydroxylase proteins and showed regional homology to functionally significant domains common to the cytochrome P-450 superfamily. The sequence encoding open reading frame 2 was the most highly conserved portion of the sequenced region based on heterologous hybridization experiments with three genera of heterocystous cyanobacteria. Images PMID:2123860
Systematic asymmetric nucleotide exchanges produce human mitochondrial RNAs cryptically encoding for overlapping protein coding genes.

PubMed

Seligmann, Hervé

2013-05-07

GenBank's EST database includes RNAs matching exactly human mitochondrial sequences assuming systematic asymmetric nucleotide exchange-transcription along exchange rules: A→G→C→U/T→A (12 ESTs), A→U/T→C→G→A (4 ESTs), C→G→U/T→C (3 ESTs), and A→C→G→U/T→A (1 EST), no RNAs correspond to other potential asymmetric exchange rules. Hypothetical polypeptides translated from nucleotide-exchanged human mitochondrial protein coding genes align with numerous GenBank proteins, predicted secondary structures resemble their putative GenBank homologue's. Two independent methods designed to detect overlapping genes (one based on nucleotide contents analyses in relation to replicative deamination gradients at third codon positions, and circular code analyses of codon contents based on frame redundancy), confirm nucleotide-exchange-encrypted overlapping genes. Methods converge on which genes are most probably active, and which not, and this for the various exchange rules. Mean EST lengths produced by different nucleotide exchanges are proportional to (a) extents that various bioinformatics analyses confirm the protein coding status of putative overlapping genes; (b) known kinetic chemistry parameters of the corresponding nucleotide substitutions by the human mitochondrial DNA polymerase gamma (nucleotide DNA misinsertion rates); (c) stop codon densities in predicted overlapping genes (stop codon readthrough and exchanging polymerization regulate gene expression by counterbalancing each other). Numerous rarely expressed proteins seem encoded within regular mitochondrial genes through asymmetric nucleotide exchange, avoiding lengthening genomes. Intersecting evidence between several independent approaches confirms the working hypothesis status of gene encryption by systematic nucleotide exchanges. Copyright © 2013 Elsevier Ltd. All rights reserved.
Two novel mutations in the alpha-galactosidase gene in Japanese classical hemizygotes with Fabry disease.

PubMed

Okumiya, T; Takenaka, T; Ishii, S; Kase, R; Kamei, S; Sakuraba, H

1996-09-01

Four alpha-galactosidase gene mutations were identified in Japanese male patients with Fabry disease who had no detectable alpha-galactosidase activity. Two of them were novel mutations, an 11-bp deletion in exon 2 and a g-1 to t substitution at the 3' end of the splice acceptor site in intron 1. The former caused a frameshift and led to the creation of a new stop codon at codon 118. The latter was predicted to provoke aberrant mRNA splicing followed by accelerated degradation of the mRNA. A nonsense mutation, R301X, and a 2-bp deletion starting at nucleotide position 718, which were reported previously, were also identified in unrelated patients.
Conformational Differences between Open and Closed States of the Eukaryotic Translation Initiation Complex

PubMed Central

Llácer, Jose L.; Hussain, Tanweer; Marler, Laura; Aitken, Colin Echeverría; Thakur, Anil; Lorsch, Jon R.; Hinnebusch, Alan G.; Ramakrishnan, V.

2015-01-01

Summary Translation initiation in eukaryotes begins with the formation of a pre-initiation complex (PIC) containing the 40S ribosomal subunit, eIF1, eIF1A, eIF3, ternary complex (eIF2-GTP-Met-tRNAi), and eIF5. The PIC, in an open conformation, attaches to the 5′ end of the mRNA and scans to locate the start codon, whereupon it closes to arrest scanning. We present single particle cryo-electron microscopy (cryo-EM) reconstructions of 48S PICs from yeast in these open and closed states, at 6.0 Å and 4.9 Å, respectively. These reconstructions show eIF2β as well as a configuration of eIF3 that appears to encircle the 40S, occupying part of the subunit interface. Comparison of the complexes reveals a large conformational change in the 40S head from an open mRNA latch conformation to a closed one that constricts the mRNA entry channel and narrows the P site to enclose tRNAi, thus elucidating key events in start codon recognition. PMID:26212456
Inability of Prevotella bryantii to Form a Functional Shine-Dalgarno Interaction Reflects Unique Evolution of Ribosome Binding Sites in Bacteroidetes

PubMed Central

Accetto, Tomaž; Avguštin, Gorazd

2011-01-01

The Shine-Dalgarno (SD) sequence is a key element directing the translation to initiate at the authentic start codons and also enabling translation initiation to proceed in 5′ untranslated mRNA regions (5′-UTRs) containing moderately strong secondary structures. Bioinformatic analysis of almost forty genomes from the major bacterial phylum Bacteroidetes revealed, however, a general absence of SD sequence, drop in GC content and consequently reduced tendency to form secondary structures in 5′-UTRs. The experiments using the Prevotella bryantii TC1-1 expression system were in agreement with these findings: neither addition nor omission of SD sequence in the unstructured 5′-UTR affected the level of the reporter protein, non-specific nuclease NucB. Further, NucB level in P. bryantii TC1-1, contrary to hMGFP level in Escherichia coli, was five times lower when SD sequence formed part of the secondary structure with a folding energy -5,2 kcal/mol. Also, the extended SD sequences did not affect protein levels as in E. coli. It seems therefore that a functional SD interaction does not take place during the translation initiation in P. bryanttii TC1-1 and possibly other members of phylum Bacteroidetes although the anti SD sequence is present in 16S rRNA genes of their genomes. We thus propose that in the absence of the SD sequence interaction, the selection of genuine start codons in Bacteroidetes is accomplished by binding of ribosomal protein S1 to unstructured 5′-UTR as opposed to coding region which is inaccessible due to mRNA secondary structure. Additionally, we found that sequence logos of region preceding the start codons may be used as taxonomical markers. Depending on whether complete sequence logo or only part of it, such as information content and base proportion at specific positions, is used, bacterial genera or families and in some cases even bacterial phyla can be distinguished. PMID:21857964
GTG mutation in the start codon of the androgen receptor gene in a family of horses with 64,XY disorder of sex development.

PubMed

Révay, T; Villagómez, D A F; Brewer, D; Chenier, T; King, W A

2012-01-01

Genetic sex in mammals is determined by the sex chromosomal composition of the zygote. The X and Y chromosomes are responsible for numerous factors that must work in close concert for the proper development of a healthy sexual phenotype. The role of androgens in case of XY chromosomal constitution is crucial for normal male sex differentiation. The intracellular androgenic action is mediated by the androgen receptor (AR), and its impaired function leads to a myriad of syndromes with severe clinical consequences, most notably androgen insensitivity syndrome and prostate cancer. In this paper, we investigated the possibility that an alteration of the equine AR gene explains a recently described familial XY, SRY + disorder of sex development. We uncovered a transition in the first nucleotide of the AR start codon (c.1A>G). To our knowledge, this represents the first causative AR mutation described in domestic animals. It is also a rarely observed mutation in eukaryotes and is unique among the >750 entries of the human androgen receptor mutation database. In addition, we found another quiet missense mutation in exon 1 (c.322C>T). Transcription of AR was confirmed by RT-PCR amplification of several exons. Translation of the full-length AR protein from the initiating GTG start codon was confirmed by Western blot using N- and C-terminal-specific antibodies. Two smaller peptides (25 and 14 amino acids long) were identified from the middle of exon 1 and across exons 5 and 6 by mass spectrometry. Based upon our experimental data and the supporting literature, it appears that the AR is expressed as a full-length protein and in a functional form, and the observed phenotype is the result of reduced AR protein expression levels. Copyright © 2011 S. Karger AG, Basel.
Role of the Integrin-Linked Kinase, ILK, in Mammary Carcinogensis

DTIC Science & Technology

2000-08-01

have been implicated in environmental stress clonei 6-10 responses in yeasts, plants and mammals, as well as regulating abscisic acid signal transduction...phosphatase 2C involved in abscisic acid signal transduction in higher plants. Proc. Natl Acad. Sci. USA, 95, 975-980. Strovel,E.T., Wu,D. and Sussman,D.J...contain a 450bp open reading frame, coding for 149 amino acids and a poly A tail 245bp downstream of the stop codon, although no polyadenylation site
Translational control of Nrf2 within the open reading frame

DOE Office of Scientific and Technical Information (OSTI.GOV)

Perez-Leal, Oscar, E-mail: operez@temple.edu; Barrero, Carlos A.; Merali, Salim, E-mail: smerali@temple.edu

2013-07-19

Highlights: •Identification of a novel Nrf2 translational repression mechanism. •The repressor is within the 3′ portion of the Nrf2 ORF. •The translation of Nrf2 or eGFP is reduced by the regulatory element. •The translational repression can be reversed with synonymous codon substitutions. •The molecular mechanism requires the mRNA sequence, but not the encoded amino acids. -- Abstract: Nuclear Factor Erythroid 2-Related Factor 2 (Nrf2) is a transcription factor that is essential for the regulation of an effective antioxidant and detoxifying response. The regulation of its activity can occur at transcription, translation and post-translational levels. Evidence suggests that under environmental stressmore » conditions, new synthesis of Nrf2 is required – a process that is regulated by translational control and is not fully understood. Here we described the identification of a novel molecular process that under basal conditions strongly represses the translation of Nrf2 within the open reading frame (ORF). This mechanism is dependent on the mRNA sequence within the 3′ portion of the ORF of Nrf2 but not in the encoded amino acid sequence. The Nrf2 translational repression can be reversed with the use of synonymous codon substitutions. This discovery suggests an additional layer of control to explain the reason for the low Nrf2 concentration under quiescent state.« less
Functional analysis of a frame-shift mutant of the dihydropyridine receptor pore subunit (α1S) expressing two complementary protein fragments

PubMed Central

Ahern, Chris A; Vallejo, Paola; Mortenson, Lindsay; Coronado, Roberto

2001-01-01

Background The L-type Ca2+ channel formed by the dihydropyridine receptor (DHPR) of skeletal muscle senses the membrane voltage and opens the ryanodine receptor (RyR1). This channel-to-channel coupling is essential for Ca2+ signaling but poorly understood. We characterized a single-base frame-shift mutant of α1S, the pore subunit of the DHPR, that has the unusual ability to function voltage sensor for excitation-contraction (EC) coupling by virtue of expressing two complementary hemi-Ca2+ channel fragments. Results Functional analysis of cDNA transfected dysgenic myotubes lacking α1S were carried out using voltage-clamp, confocal Ca2+ indicator fluoresence, epitope immunofluorescence and immunoblots of expressed proteins. The frame-shift mutant (fs-α1S) expressed the N-terminal half of α1S (M1 to L670) and the C-terminal half starting at M701 separately. The C-terminal fragment was generated by an unexpected restart of translation of the fs-α1S message at M701 and was eliminated by a M701I mutation. Protein-protein complementation between the two fragments produced recovery of skeletal-type EC coupling but not L-type Ca2+ current. Discussion A premature stop codon in the II-III loop may not necessarily cause a loss of DHPR function due to a restart of translation within the II-III loop, presumably by a mechanism involving leaky ribosomal scanning. In these cases, function is recovered by expression of complementary protein fragments from the same cDNA. DHPR-RyR1 interactions can be achieved via protein-protein complementation between hemi-Ca2+ channel proteins, hence an intact II-III loop is not essential for coupling the DHPR voltage sensor to the opening of RyR1 channel. PMID:11806762
Problem-Based Test: An "In Vitro" Experiment to Analyze the Genetic Code

ERIC Educational Resources Information Center

Szeberenyi, Jozsef

2010-01-01

Terms to be familiar with before you start to solve the test: genetic code, translation, synthetic polynucleotide, leucine, serine, filter precipitation, radioactivity measurement, template, mRNA, tRNA, rRNA, aminoacyl-tRNA synthesis, ribosomes, degeneration of the code, wobble, initiation, and elongation of protein synthesis, initiation codon.…
Complete DNA sequence of the mitochondrial genome of the treehopper Leptobelus gazella (Membracoidea: Hemiptera).

PubMed

Zhao, Xing; Liang, Ai-Ping

2016-09-01

The first complete DNA sequence of the mitochondrial genome (mitogenome) of Leptobelus gazelle (Membracoidea: Hemiptera) is determined in this study. The circular molecule is 16,007 bp in its full length, which encodes a set of 37 genes, including 13 proteins, 2 ribosomal RNAs, 22 transfer RNAs, and contains an A + T-rich region (CR). The gene numbers, content, and organization of L. gazelle are similar to other typical metazoan mitogenomes. Twelve of the 13 PCGs are initiated with ATR methionine or ATT isoleucine codons, except the atp8 gene that uses the ATC isoleucine as start signal. Ten of the 13 PCGs have complete termination codons, either TAA (nine genes) or TAG (cytb). The remaining 3 PCGs (cox1, cox2 and nad5) have incomplete termination codons T (AA). All of the 22 tRNAs can be folded in the form of a typical clover-leaf structure. The complete mitogenome sequence data of L. gazelle is useful for the phylogenetic and biogeographic studies of the Membracoidea and Hemiptera.
The mitochondrial genome of the multicolored Asian lady beetle Harmonia axyridis (Pallas) and a phylogenetic analysis of the Polyphaga (Insecta: Coleoptera).

PubMed

Niu, Fang-Fang; Zhu, Liang; Wang, Su; Wei, Shu-Jun

2016-07-01

Here, we report the mitochondrial genome sequence of the multicolored Asian lady beetle Harmonia axyridis (Pallas, 1773) (Coleoptera: Coccinellidae) (GenBank accession No. KR108208). This is the first species with sequenced mitochondrial genome from the genus Harmonia. The current length with partitial A + T-rich region of this mitochondrial genome is 16,387 bp. All the typical genes were sequenced except the trnI and trnQ. As in most other sequenced mitochondrial genomes of Coleoptera, there is no re-arrangement in the sequenced region compared with the pupative ancestral arrangement of insects. All protein-coding genes start with ATN codons. Five, five and three protein-coding genes stop with termination codon TAA, TA and T, respectively. Phylogenetic analysis using Bayesian method based on the first and second codon positions of the protein-coding genes supported that the Scirtidae is a basal lineage of Polyphaga. The Harmonia and the Coccinella form a sister lineage. The monophyly of Staphyliniformia, Scarabaeiformia and Cucujiformia was supported. The Buprestidae was found to be a sister group to the Bostrichiformia.

A novel mutation causing complete thyroxine-binding globulin deficiency (TBG-CD-Negev) among the Bedouins in southern Israel.

PubMed

Miura, Y; Hershkovitz, E; Inagaki, A; Parvari, R; Oiso, Y; Phillip, M

2000-10-01

T4-binding globulin (TBG) is the major thyroid hormone transport protein in human serum. Inherited TBG abnormalities do not usually alter the metabolic status and are transmitted in X-linked inheritance. A high prevalence of complete TBG deficiency (TBG-CD) has been reported among the Bedouin population in the Negev (southern Israel). In this study we report a novel single mutation causing complete TBG deficiency due to a deletion of the last base of codon 38 (exon 1), which led to a frame shift resulting in a premature stop at codon 51 and a presumed truncated peptide of 50 residues. This new variant of TBG (TBG-CD-Negev) was found among all of the patients studied. We conclude that a single mutation may account for TBG deficiency among the Bedouins in the Negev. This report is the first to describe a mutation in a population with an unusually high prevalence of TBG-CD.
Hda Monomerization by ADP Binding Promotes Replicase Clamp-mediated DnaA-ATP Hydrolysis*S⃞

PubMed Central

Su'etsugu, Masayuki; Nakamura, Kenta; Keyamura, Kenji; Kudo, Yuka; Katayama, Tsutomu

2008-01-01

ATP-DnaA is the initiator of chromosomal replication in Escherichia coli, and the activity of DnaA is regulated by the regulatory inactivation of the DnaA (RIDA) system. In this system, the Hda protein promotes DnaA-ATP hydrolysis to produce inactive ADP-DnaA in a mechanism that is mediated by the DNA-loaded form of the replicase sliding clamp. In this study, we first revealed that hda translation uses an unusual initiation codon, CUG, located downstream of the annotated initiation codon. The CUG initiation codon could be used for restricting the Hda level, as this initiation codon has a low translation efficiency, and the cellular Hda level is only ∼100 molecules per cell. Hda translated using the correct reading frame was purified and found to have a high RIDA activity in vitro. Moreover, we found that Hda has a high affinity for ADP but not for other nucleotides, including ATP. ADP-Hda was active in the RIDA system in vitro and stable in a monomeric state, whereas apo-Hda formed inactive homomultimers. Both ADP-Hda and apo-Hda could form complexes with the DNA-loaded clamp; however, only ADP-Hda-DNA-clamp complexes were highly functional in the following interaction with DnaA. Formation of ADP-Hda was also observed in vivo, and mutant analysis suggested that ADP binding is crucial for cellular Hda activity. Thus, we propose that ADP is a crucial Hda ligand that promotes the activated conformation of the protein. ADP-dependent monomerization might enable the arginine finger of the Hda AAA+ domain to be accessible to ATP bound to the DnaA AAA+ domain. PMID:18977760
Hda monomerization by ADP binding promotes replicase clamp-mediated DnaA-ATP hydrolysis.

PubMed

Su'etsugu, Masayuki; Nakamura, Kenta; Keyamura, Kenji; Kudo, Yuka; Katayama, Tsutomu

2008-12-26

ATP-DnaA is the initiator of chromosomal replication in Escherichia coli, and the activity of DnaA is regulated by the regulatory inactivation of the DnaA (RIDA) system. In this system, the Hda protein promotes DnaA-ATP hydrolysis to produce inactive ADP-DnaA in a mechanism that is mediated by the DNA-loaded form of the replicase sliding clamp. In this study, we first revealed that hda translation uses an unusual initiation codon, CUG, located downstream of the annotated initiation codon. The CUG initiation codon could be used for restricting the Hda level, as this initiation codon has a low translation efficiency, and the cellular Hda level is only approximately 100 molecules per cell. Hda translated using the correct reading frame was purified and found to have a high RIDA activity in vitro. Moreover, we found that Hda has a high affinity for ADP but not for other nucleotides, including ATP. ADP-Hda was active in the RIDA system in vitro and stable in a monomeric state, whereas apo-Hda formed inactive homomultimers. Both ADP-Hda and apo-Hda could form complexes with the DNA-loaded clamp; however, only ADP-Hda-DNA-clamp complexes were highly functional in the following interaction with DnaA. Formation of ADP-Hda was also observed in vivo, and mutant analysis suggested that ADP binding is crucial for cellular Hda activity. Thus, we propose that ADP is a crucial Hda ligand that promotes the activated conformation of the protein. ADP-dependent monomerization might enable the arginine finger of the Hda AAA+ domain to be accessible to ATP bound to the DnaA AAA+ domain.
Polymorphism of prion protein gene in Arctic fox (Vulpes lagopus).

PubMed

Wan, Jiayu; Bai, Xue; Liu, Wensen; Xu, Jing; Xu, Ming; Gao, Hongwei

2009-07-01

Prion diseases are fatal neurodegenerative disorders of humans and certain other mammals. Prion protein gene (Prnp) is associated with susceptibility and species barrier to prion diseases. No natural and experimental prion diseases have been documented to date in Arctic fox. In the present study, coding region of Prnp from 135 Arctic foxes were cloned and screened for polymorphisms. Our results indicated that the Arctic fox Prnp open reading frame (ORF) contains 771 nucleotides encoding 257 amino acids. Four single nucleotide polymorphisms (SNPs) (G312C, A337G, C541T, and A723G) were identified. SNPs G312C and A723G produced silent mutations, but SNPs A337G and C541T resulted in a M-V change at codon 113 and R-C at codon 181, respectively. The Arctic fox Prnp amino acid sequence was similar to that of the dog (XM 542906). In short, this study provides preliminary information about genotypes of Prnp in Arctic fox.
The Complete Mitochondrial Genome of the Rice Moth, Corcyra cephalonica

PubMed Central

Wu, Yu-Peng; Li, Jie; Zhao, Jin-Liang; Su, Tian-Juan; Luo, A-Rong; Fan, Ren-Jun; Chen, Ming-Chang; Wu, Chun-Sheng; Zhu, Chao-Dong

2012-01-01

The complete mitochondrial genome (mitogenome) of the rice moth, Corcyra cephalonica Stainton (Lepidoptera: Pyralidae) was determined as a circular molecular of 15,273 bp in size. The mitogenome composition (37 genes) and gene order are the same as the other lepidopterans. Nucleotide composition of the C. cephalonica mitogenome is highly A+T biased (80.43%) like other insects. Twelve protein-coding genes start with a typical ATN codon, with the exception of coxl gene, which uses CGA as the initial codon. Nine protein-coding genes have the common stop codon TAA, and the nad2, cox1, cox2, and nad4 have single T as the incomplete stop codon. 22 tRNA genes demonstrated cloverleaf secondary structure. The mitogenome has several large intergenic spacer regions, the spacer1 between trnQ gene and nad2 gene, which is common in Lepidoptera. The spacer 3 between trnE and trnF includes microsatellite-like repeat regions (AT)18 and (TTAT)3. The spacer 4 (16 bp) between trnS2 gene and nad1 gene has a motif ATACTAT; another species, Sesamia inferens encodes ATCATAT at the same position, while other lepidopteran insects encode a similar ATACTAA motif. The spacer 6 is A+T rich region, include motif ATAGA and a 20-bp poly(T) stretch and two microsatellite (AT)9, (AT)8 elements. PMID:23413968
The complete mitochondrial genome of the rice moth, Corcyra cephalonica.

PubMed

Wu, Yu-Peng; Li, Jie; Zhao, Jin-Liang; Su, Tian-Juan; Luo, A-Rong; Fan, Ren-Jun; Chen, Ming-Chang; Wu, Chun-Sheng; Zhu, Chao-Dong

2012-01-01

The complete mitochondrial genome (mitogenome) of the rice moth, Corcyra cephalonica Stainton (Lepidoptera: Pyralidae) was determined as a circular molecular of 15,273 bp in size. The mitogenome composition (37 genes) and gene order are the same as the other lepidopterans. Nucleotide composition of the C. cephalonica mitogenome is highly A+T biased (80.43%) like other insects. Twelve protein-coding genes start with a typical ATN codon, with the exception of coxl gene, which uses CGA as the initial codon. Nine protein-coding genes have the common stop codon TAA, and the nad2, cox1, cox2, and nad4 have single T as the incomplete stop codon. 22 tRNA genes demonstrated cloverleaf secondary structure. The mitogenome has several large intergenic spacer regions, the spacer1 between trnQ gene and nad2 gene, which is common in Lepidoptera. The spacer 3 between trnE and trnF includes microsatellite-like repeat regions (AT)18 and (TTAT)(3). The spacer 4 (16 bp) between trnS2 gene and nad1 gene has a motif ATACTAT; another species, Sesamia inferens encodes ATCATAT at the same position, while other lepidopteran insects encode a similar ATACTAA motif. The spacer 6 is A+T rich region, include motif ATAGA and a 20-bp poly(T) stretch and two microsatellite (AT)(9), (AT)(8) elements.
Isolation and characterization of the gene coding for Escherichia coli arginyl-tRNA synthetase.

PubMed Central

Eriani, G; Dirheimer, G; Gangloff, J

1989-01-01

The gene coding for Escherichia coli arginyl-tRNA synthetase (argS) was isolated as a fragment of 2.4 kb after analysis and subcloning of recombinant plasmids from the Clarke and Carbon library. The clone bearing the gene overproduces arginyl-tRNA synthetase by a factor 100. This means that the enzyme represents more than 20% of the cellular total protein content. Sequencing revealed that the fragment contains a unique open reading frame of 1734 bp flanked at its 5' and 3' ends respectively by 247 bp and 397 bp. The length of the corresponding protein (577 aa) is well consistent with earlier Mr determination (about 70 kd). Primer extension analysis of the ArgRS mRNA by reverse transcriptase, located its 5' end respectively at 8 and 30 nucleotides downstream of a TATA and a TTGAC like element (CTGAC) and 60 nucleotides upstream of the unusual translation initiation codon GUG; nuclease S1 analysis located the 3'-end at 48 bp downstream of the translation termination codon. argS has a codon usage pattern typical for highly expressed E. coli genes. With the exception of the presence of a HVGH sequence similar to the HIGH consensus element, ArgRS has no relevant sequence homologies with other aminoacyl-tRNA synthetases. Images PMID:2668891
Novel SIL1 nonstop mutation in a Chinese consanguineous family with Marinesco-Sjögren syndrome and Dandy-Walker syndrome.

PubMed

Gai, Nan; Jiang, Chen; Zou, Yong-Yi; Zheng, Yu; Liang, De-Sheng; Wu, Ling-Qian

2016-07-01

Marinesco-Sjögren syndrome (MSS) is a rare autosomal recessive disorder, which is characterized by congenital cataracts, cerebellar ataxia, progressive muscle weakness, and delayed psychomotor development. SIL1, which is located at 5q31.2, is the only gene known to cause MSS. Dandy-Walker syndrome (DWS) is defined by hypoplasia, upward rotation of the cerebellar vermis, and cystic dilation of the fourth ventricle; however, its genetic pathogeny remains unclear. Here, we report a Chinese consanguineous family with MSS and DWS. Whole exome sequencing identified a novel nonstop mutation in SIL1. Sanger sequencing revealed that the mutation was segregated in this family according to a recessive mode of inheritance. We found that the mutation changed a stop codon (TGA) to an arginine codon (CGA), and no in-frame termination codon in the 3' untranslated region (UTR) of SIL1 could be found. The mRNA levels of SIL1 were decreased by 56.6% and 37.5% in immortalized lymphoblasts of the patients respectively; the protein levels of SIL1 were substantially decreased. This case study is the first report on Chinese MSS patients, MSS complicated by DWS, and a nonstop mutation in SIL1. Our findings imply the pathogenetic association between DWS and MSS. Copyright © 2016 Elsevier B.V. All rights reserved.
Nucleotide sequence of the gene for the Mr 32,000 thylakoid membrane protein from Spinacia oleracea and Nicotiana debneyi predicts a totally conserved primary translation product of Mr 38,950

PubMed Central

Zurawski, Gerard; Bohnert, Hans J.; Whitfeld, Paul R.; Bottomley, Warwick

1982-01-01

The gene for the so-called Mr 32,000 rapidly labeled photosystem II thylakoid membrane protein (here designated psbA) of spinach (Spinacia oleracea) chloroplasts is located on the chloroplast DNA in the large single-copy region immediately adjacent to one of the inverted repeat sequences. In this paper we show that the size of the mRNA for this protein is ≈ 1.25 kilobases and that the direction of transcription is towards the inverted repeat unit. The nucleotide sequence of the gene and its flanking regions is presented. The only large open reading frame in the sequence codes for a protein of Mr 38,950. The nucleotide sequence of psbA from Nicotiana debneyi also has been determined, and comparison of the sequences from the two species shows them to be highly conserved (>95% homology) throughout the entire reading frame. Conservation of the amino acid sequence is absolute, there being no changes in a total of 353 residues. This leads us to conclude that the primary translation product of psbA must be a protein of Mr 38,950. The protein is characterized by the complete absence of lysine residues and is relatively rich in hydrophobic amino acids, which tend to be clustered. Transcription of spinach psbA starts about 86 base pairs before the first ATG codon. Immediately upstream from this point there is a sequence typical of that found in E. coli promoters. An almost identical sequence occurs in the equivalent region of N. debneyi DNA. Images PMID:16593262
Purification and DNA binding properties of the blaI gene product, repressor for the beta-lactamase gene, blaP, of Bacillus licheniformis.

PubMed Central

Grossman, M J; Lampen, J O

1987-01-01

The location of the repressor gene, blaI, for the beta-lactamase gene blaP of Bacillus licheniformis 749, on the 5' side of blaP, was confirmed by sequencing the bla region of the constitutive mutant 749/C. An amber stop codon, likely to result in a nonfunctional truncated repressor, was found at codon 32 of the 128 codon blaI open reading frame (ORF) located 5' to blaP. In order to study the DNA binding activity of the repressor, the structural gene for blaI, from strain 749, with its ribosome binding site was expressed using a two plasmid T7 RNA polymerase/promotor system (S. Tabor and C. C. Richardson. Proc. Natl. Acad. Sci. 82, 1074-1078 (1985). Heat induction of this system in Escherichia coli K38 resulted in the production of BlaI as 5-10% of the soluble cell protein. Repressor protein was then purified by ammonium sulfate fractionation and cation exchange chromatography. The sequence of the N-terminal 28 amino acid residues was determined and was as predicted from the DNA. Binding of BlaI to DNA was detected by the slower migration of protein DNA complexes during polyacrylamide gel electrophoresis. BlaI was shown to selectively bind DNA fragments carrying the promoter regions of blaI and blaP. Images PMID:3498148
Overlapping reading frames at the LYS5 locus in the yeast Yarrowia lipolytica.

PubMed Central

Xuan, J W; Fournier, P; Declerck, N; Chasles, M; Gaillardin, C

1990-01-01

Mutants affected at the LYS5 locus of Yarrowia lipolytica lack detectable dehydrogenase (SDH) activity. The LYS5 gene has previously been cloned, and we present here the sequence of the 2.5-kilobase-pair (kb) DNA fragment complementing the lys5 mutation. Two large antiparallel open reading frames (ORF1 and ORF2) were observed, flanked by potential transcription signals. Both ORFs appear to be transcribed, but several lines of evidence suggest that only ORF2 is translated and encodes SDH. (i) The global amino acid compositions of Saccharomyces cerevisiae SDH and of the putative ORF2 product are similar and that of ORF1 is dissimilar. (ii) An in-frame translational fusion of ORF2 with the Escherichia coli lacZ gene was introduced into yeast cells and resulted in a beta-galactosidase activity regulated similarly to SDH; no beta-galactosidase activity was obtained with an in-frame fusion of ORF1 with lacZ. (iii) The introduction of a stop codon at the beginning of ORF2 prevented SDH expression in yeast cells, whereas no phenotypic effect was observed when ORF1 translation was blocked. Images PMID:2388625
Identification of the subgenomic promoter of the coat protein gene of cucumber fruit mottle mosaic virus and development of a heterologous expression vector.

PubMed

Rhee, Sun-Ju; Jang, Yoon Jeong; Lee, Gung Pyo

2016-06-01

Heterologous gene expression using plant virus vectors enables research on host-virus interactions and the production of useful proteins, but the host range of plant viruses limits the practical applications of such vectors. Here, we aimed to develop a viral vector based on cucumber fruit mottle mosaic virus (CFMMV), a member of the genus Tobamovirus, whose members infect cucurbits. The subgenomic promoter (SGP) in the coat protein (CP) gene, which was used to drive heterologous expression, was mapped by analyzing deletion mutants from a CaMV 35S promoter-driven infectious CFMMV clone. The region from nucleotides (nt) -55 to +160 relative to the start codon of the open reading frame (ORF) of CP was found to be a fully active promoter, and the region from nt -55 to +100 was identified as the active core promoter. Based on these SGPs, we constructed a cloning site in the CFMMV vector and successfully expressed enhanced green fluorescent protein (EGFP) in Nicotiana benthamiana and watermelon (Citrullus lanatus). Co-inoculation with the P19 suppressor increased EGFP expression and viral replication by blocking degradation of the viral genome. Our CFMMV vector will be useful as an expression vector in cucurbits.
Cation-induced transcriptional regulation of the dlt operon of Staphylococcus aureus.

PubMed

Koprivnjak, Tomaz; Mlakar, Vid; Swanson, Lindsey; Fournier, Benedicte; Peschel, Andreas; Weiss, Jerrold P

2006-05-01

Lipoteichoic and wall teichoic acids (TA) are highly anionic cell envelope-associated polymers containing repeating polyglycerol/ribitol phosphate moieties. Substitution of TA with D-alanine is important for modulation of many cell envelope-dependent processes, such as activity of autolytic enzymes, binding of divalent cations, and susceptibility to innate host defenses. D-Alanylation of TA is diminished when bacteria are grown in medium containing increased NaCl concentrations, but the effects of increased salt concentration on expression of the dlt operon encoding proteins mediating D-alanylation of TA are unknown. We demonstrate that Staphylococcus aureus transcriptionally represses dlt expression in response to high concentrations of Na(+) and moderate concentrations of Mg(2+) and Ca(2+) but not sucrose. Changes in dlt mRNA are induced within 15 min and sustained for several generations of growth. Mg(2+)-induced dlt repression depends on the ArlSR two-component system. Northern blotting, reverse transcription-PCR, and SMART-RACE analyses suggest that the dlt transcript begins 250 bp upstream of the dltA start codon and includes an open reading frame immediately upstream of dltA. Chloramphenicol transacetylase transcriptional fusions indicate that a region encompassing the 171 to 325 bp upstream of dltA is required for expression and Mg(2+)-induced repression of the dlt operon in S. aureus.
Cation-Induced Transcriptional Regulation of the dlt Operon of Staphylococcus aureus

PubMed Central

Koprivnjak, Tomaz; Mlakar, Vid; Swanson, Lindsey; Fournier, Benedicte; Peschel, Andreas; Weiss, Jerrold P.

2006-01-01

Lipoteichoic and wall teichoic acids (TA) are highly anionic cell envelope-associated polymers containing repeating polyglycerol/ribitol phosphate moieties. Substitution of TA with d-alanine is important for modulation of many cell envelope-dependent processes, such as activity of autolytic enzymes, binding of divalent cations, and susceptibility to innate host defenses. d-Alanylation of TA is diminished when bacteria are grown in medium containing increased NaCl concentrations, but the effects of increased salt concentration on expression of the dlt operon encoding proteins mediating d-alanylation of TA are unknown. We demonstrate that Staphylococcus aureus transcriptionally represses dlt expression in response to high concentrations of Na+ and moderate concentrations of Mg2+ and Ca2+ but not sucrose. Changes in dlt mRNA are induced within 15 min and sustained for several generations of growth. Mg2+-induced dlt repression depends on the ArlSR two-component system. Northern blotting, reverse transcription-PCR, and SMART-RACE analyses suggest that the dlt transcript begins 250 bp upstream of the dltA start codon and includes an open reading frame immediately upstream of dltA. Chloramphenicol transacetylase transcriptional fusions indicate that a region encompassing the 171 to 325 bp upstream of dltA is required for expression and Mg2+-induced repression of the dlt operon in S. aureus. PMID:16672616
β-Glucuronidase as a Sensitive and Versatile Reporter in Actinomycetes ▿

PubMed Central

Myronovskyi, Maksym; Welle, Elisabeth; Fedorenko, Viktor; Luzhetskyy, Andriy

2011-01-01

Here we describe a versatile and sensitive reporter system for actinomycetes that is based on gusA, which encodes the β-glucuronidase enzyme. A series of gusA-containing transcriptional and translational fusion vectors were constructed and utilized to study the regulatory cascade of the phenalinolactone biosynthetic gene cluster. Furthermore, these vectors were used to study the efficiency of translation initiation at the ATG, GTG, TTG, and CTG start codons. Surprisingly, constructs using a TTG start codon showed the best activity, whereas those using ATG or GTG were approximately one-half or one-third as active, respectively. The CTG fusion showed only 5% of the activity of the TTG fusion. A suicide vector, pKGLP2, carrying gusA in its backbone was used to visually detect merodiploid formation and resolution, making gene targeting in actinomycetes much faster and easier. Three regulatory genes, plaR1, plaR2, and plaR3, involved in phenalinolactone biosynthesis were efficiently replaced with an apramycin resistance marker using this system. Finally, we expanded the genetic code of actinomycetes by introducing the nonproteinogenic amino acid N-epsilon-cyclopentyloxycarbonyl-l-lysine with the GusA protein as a reporter. PMID:21685164
Conserved small mRNA with an unique, extended Shine-Dalgarno sequence

PubMed Central

Hahn, Julia; Migur, Anzhela; von Boeselager, Raphael Freiherr; Kubatova, Nina; Kubareva, Elena; Schwalbe, Harald

2017-01-01

ABSTRACT Up to now, very small protein-coding genes have remained unrecognized in sequenced genomes. We identified an mRNA of 165 nucleotides (nt), which is conserved in Bradyrhizobiaceae and encodes a polypeptide with 14 amino acid residues (aa). The small mRNA harboring a unique Shine-Dalgarno sequence (SD) with a length of 17 nt was localized predominantly in the ribosome-containing P100 fraction of Bradyrhizobium japonicum USDA 110. Strong interaction between the mRNA and 30S ribosomal subunits was demonstrated by their co-sedimentation in sucrose density gradient. Using translational fusions with egfp, we detected weak translation and found that it is impeded by both the extended SD and the GTG start codon (instead of ATG). Biophysical characterization (CD- and NMR-spectroscopy) showed that synthesized polypeptide remained unstructured in physiological puffer. Replacement of the start codon by a stop codon increased the stability of the transcript, strongly suggesting additional posttranscriptional regulation at the ribosome. Therefore, the small gene was named rreB (ribosome-regulated expression in Bradyrhizobiaceae). Assuming that the unique ribosome binding site (RBS) is a hallmark of rreB homologs or similarly regulated genes, we looked for similar putative RBS in bacterial genomes and detected regions with at least 16 nt complementarity to the 3′-end of 16S rRNA upstream of sORFs in Caulobacterales, Rhizobiales, Rhodobacterales and Rhodospirillales. In the Rhodobacter/Roseobacter lineage of α-proteobacteria the corresponding gene (rreR) is conserved and encodes an 18 aa protein. This shows how specific RBS features can be used to identify new genes with presumably similar control of expression at the RNA level. PMID:27834614
Two Isoforms of Geobacter sulfurreducens PilA Have Distinct Roles in Pilus Biogenesis, Cytochrome Localization, Extracellular Electron Transfer, and Biofilm Formation

PubMed Central

Richter, Lubna V.; Sandler, Steven J.

2012-01-01

Type IV pili of Geobacter sulfurreducens are composed of PilA monomers and are essential for long-range extracellular electron transfer to insoluble Fe(III) oxides and graphite anodes. A previous analysis of pilA expression indicated that transcription was initiated at two positions, with two predicted ribosome-binding sites and translation start codons, potentially producing two PilA preprotein isoforms. The present study supports the existence of two functional translation start codons for pilA and identifies two isoforms (short and long) of the PilA preprotein. The short PilA isoform is found predominantly in an intracellular fraction. It seems to stabilize the long isoform and to influence the secretion of several outer-surface c-type cytochromes. The long PilA isoform is required for secretion of PilA to the outer cell surface, a process that requires coexpression of pilA with nine downstream genes. The long isoform was determined to be essential for biofilm formation on certain surfaces, for optimum current production in microbial fuel cells, and for growth on insoluble Fe(III) oxides. PMID:22408162
Design, Construction and Cloning of Truncated ORF2 and tPAsp-PADRE-Truncated ORF2 Gene Cassette From Hepatitis E Virus in the pVAX1 Expression Vector

PubMed Central

Farshadpour, Fatemeh; Makvandi, Manoochehr; Taherkhani, Reza

2015-01-01

Background: Hepatitis E Virus (HEV) is the causative agent of enterically transmitted acute hepatitis and has high mortality rate of up to 30% among pregnant women. Therefore, development of a novel vaccine is a desirable goal. Objectives: The aim of this study was to construct tPAsp-PADRE-truncated open reading frame 2 (ORF2) and truncated ORF2 DNA plasmid, which can assist future studies with the preparation of an effective vaccine against Hepatitis E Virus. Materials and Methods: A synthetic codon-optimized gene cassette encoding tPAsp-PADRE-truncated ORF2 protein was designed, constructed and analyzed by some bioinformatics software. Furthermore, a codon-optimized truncated ORF2 gene was amplified by the polymerase chain reaction (PCR), with a specific primer from the previous construct. The constructs were sub-cloned in the pVAX1 expression vector and finally expressed in eukaryotic cells. Results: Sequence analysis and bioinformatics studies of the codon-optimized gene cassette revealed that codon adaptation index (CAI), GC content, and frequency of optimal codon usage (Fop) value were improved, and performance of the secretory signal was confirmed. Cloning and sub-cloning of the tPAsp-PADRE-truncated ORF2 gene cassette and truncated ORF2 gene were confirmed by colony PCR, restriction enzymes digestion and DNA sequencing of the recombinant plasmids pVAX-tPAsp-PADRE-truncated ORF2 (aa 112-660) and pVAX-truncated ORF2 (aa 112-660). The expression of truncated ORF2 protein in eukaryotic cells was approved by an Immunoﬂuorescence assay (IFA) and the reverse transcriptase polymerase chain reaction (RT-PCR) method. Conclusions: The results of this study demonstrated that the tPAsp-PADRE-truncated ORF2 gene cassette and the truncated ORF2 gene in recombinant plasmids are successfully expressed in eukaryotic cells. The immunogenicity of the two recombinant plasmids with different formulations will be evaluated as a novel DNA vaccine in future investigations. PMID:26865938
Thiamine-responsive megaloblastic anemia: early diagnosis may be effective in preventing deafness.

PubMed

Onal, Hasan; Bariş, Safa; Ozdil, Mine; Yeşil, Gözde; Altun, Gürkan; Ozyilmaz, Isa; Aydin, Ahmet; Celkan, Tiraje

2009-01-01

Thiamine-responsive megaloblastic anemia syndrome is an autosomal recessive disorder characterized by diabetes mellitus, megaloblastic anemia and sensorineural hearing loss. Mutations in the SLC19A2 gene, encoding a high-affinity thiamine transporter protein, THTR-1, are responsible for the clinical features associated with thiamine-responsive megaloblastic anemia syndrome in which treatment with pharmacological doses of thiamine correct the megaloblastic anemia and diabetes mellitus. The anemia can recur when thiamine is withdrawn. Thiamine may be effective in preventing deafness if started before two months. Our patient was found homozygous for a mutation, 242insA, in the nucleic acid sequence of exon B, with insertion of an adenine introducing a stop codon at codon 52 in the high-affinity thiamine transporter gene, SLC19A2, on chromosome 1q23.3.
Mitochondrial genome of Pteronotus personatus (Chiroptera: Mormoopidae): comparison with selected bats and phylogenetic considerations.

PubMed

López-Wilchis, Ricardo; Del Río-Portilla, Miguel Ángel; Guevara-Chumacero, Luis Manuel

2017-02-01

We described the complete mitochondrial genome (mitogenome) of the Wagner's mustached bat, Pteronotus personatus, a species belonging to the family Mormoopidae, and compared it with other published mitogenomes of bats (Chiroptera). The mitogenome of P. personatus was 16,570 bp long and contained a typically conserved structure including 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes, and one control region (D-loop). Most of the genes were encoded on the H-strand, except for eight tRNA and the ND6 genes. The order of protein-coding and rRNA genes was highly conserved in all mitogenomes. All protein-coding genes started with an ATG codon, except for ND2, ND3, and ND5, which initiated with ATA, and terminated with the typical stop codon TAA/TAG or the codon AGA. Phylogenetic trees constructed using Maximum Parsimony, Maximum Likelihood, and Bayesian inference methods showed an identical topology and indicated the monophyly of different families of bats (Mormoopidae, Phyllostomidae, Vespertilionidae, Rhinolophidae, and Pteropopidae) and the existence of two major clades corresponding to the suborders Yangochiroptera and Yinpterochiroptera. The mitogenome sequence provided here will be useful for further phylogenetic analyses and population genetic studies in mormoopid bats.

Start codon targeted (SCoT) and target region amplification polymorphism (TRAP) for evaluating the genetic relationship of Dendrobium species.

PubMed

Feng, Shangguo; He, Refeng; Yang, Sai; Chen, Zhe; Jiang, Mengying; Lu, Jiangjie; Wang, Huizhong

2015-08-10

Two molecular marker systems, start codon targeted (SCoT) and target region amplification polymorphism (TRAP), were used for genetic relationship analysis of 36 Dendrobium species collected from China. Twenty-two selected SCoT primers produced 337 loci, of which 324 (96%) were polymorphic, whereas 13 TRAP primer combinations produced a total of 510 loci, with 500 (97.8%) of them being polymorphic. An average polymorphism information content of 0.953 and 0.983 was detected using the SCoT and TRAP primers, respectively, showing that a high degree of genetic diversity exists among Chinese Dendrobium species. The partition of clusters in the unweighted pair group method with arithmetic mean dendrogram and principal coordinate analysis plot based on the SCoT and TRAP markers was similar and clustered the 36 Dendrobium species into four main groups. Our results will provide useful information for resource protection and will also be useful to improve the current Dendrobium breeding programs. Our results also demonstrate that SCoT and TRAP markers are informative and can be used to evaluate genetic relationships between Dendrobium species. Copyright © 2015 Elsevier B.V. All rights reserved.
Novel transcripts of the estrogen receptor α gene in channel catfish

USGS Publications Warehouse

Patino, Reynaldo; Xia, Zhenfang; Gale, William L.; Wu, Chunfa; Maule, Alec G.; Chang, Xiaotian

2000-01-01

Complementary DNA libraries from liver and ovary of an immature female channel catfish were screened with a homologous ERα cDNA probe. The hepatic library yielded two new channel catfish ER cDNAs that encode N-terminal ERα variants of different sizes. Relative to the catfish ERα (medium size; 581 residues) previously reported, these new cDNAs encode Long-ERα (36 residues longer) and Short-ERα (389 residues shorter). The 5′-end of Long-ERα cDNA is identical to that of Medium-ERα but has an additional 503-bp segment with an upstream, in-frame translation-start codon. Recombinant Long-ERα binds estrogen with high affinity (Kd = 3.4 nM), similar to that previously reported for Medium-ERα but lower than reported for catfish ERβ. Short-ERα cDNA encodes a protein that lacks most of the receptor protein and does not bind estrogen. Northern hybridization confirmed the existence of multiple hepatic ERα RNAs that include the size range of the ERα cDNAs obtained from the libraries as well as additional sizes. Using primers for RT-PCR that target locations internal to the protein-coding sequence, we also established the presence of several ERα cDNA variants with in-frame insertions in the ligand-binding and DNA-binding domains and in-frame or out-of-frame deletions in the ligand-binding domain. These internal variants showed patterns of expression that differed between the ovary and liver. Further, the ovarian library yielded a full-length, ERα antisense cDNA containing a poly(A) signal and tail. A limited survey of histological preparations from juvenile catfish by in situ hybridization using directionally synthesized cRNA probes also suggested the expression of ERα antisense RNA in a tissue-specific manner. In conclusion, channel catfish seemingly have three broad classes of ERα mRNA variants: those encoding N-terminal truncated variants, those encoding internal variants (including C-terminal truncated variants), and antisense mRNA. The sense variants may encode functional ERα or related proteins that modulate ERα or ERβ activity. The existence of ER antisense mRNA is reported in this study for the first time. Its role may be to participate in the regulation of ER gene expression.
Thermostable proteins bioprocesses: The activity of restriction endonuclease-methyltransferase from Thermus thermophilus (RM.TthHB27I) cloned in Escherichia coli is critically affected by the codon composition of the synthetic gene.

PubMed

Krefft, Daria; Papkov, Aliaksei; Zylicz-Stachula, Agnieszka; Skowron, Piotr M

2017-01-01

Obtaining thermostable enzymes (thermozymes) is an important aspect of biotechnology. As thermophiles have adapted their genomes to high temperatures, their cloned genes' expression in mesophiles is problematic. This is mainly due to their high GC content, which leads to the formation of unfavorable secondary mRNA structures and codon usage in Escherichia coli (E. coli). RM.TthHB27I is a member of a family of bifunctional thermozymes, containing a restriction endonuclease (REase) and a methyltransferase (MTase) in a single polypeptide. Thermus thermophilus HB27 (T. thermophilus) produces low amounts of RM.TthHB27I with a unique DNA cleavage specificity. We have previously cloned the wild type (wt) gene into E. coli, which increased the production of RM.TthHB27I over 100-fold. However, its enzymatic activities were extremely low for an ORF expressed under a T7 promoter. We have designed and cloned a fully synthetic tthHB27IRM gene, using a modified 'codon randomization' strategy. Codons with a high GC content and of low occurrence in E. coli were eliminated. We incorporated a stem-loop circuit, devised to negatively control the expression of this highly toxic gene by partially hiding the ribosome-binding site (RBS) and START codon in mRNA secondary structures. Despite having optimized 59% of codons, the amount of produced RM.TthHB27I protein was similar for both recombinant tthHB27IRM gene variants. Moreover, the recombinant wt RM.TthHB27I is very unstable, while the RM.TthHB27I resulting from the expression of the synthetic gene exhibited enzymatic activities and stability equal to the native thermozyme isolated from T. thermophilus. Thus, we have developed an efficient purification protocol using the synthetic tthHB27IRM gene variant only. This suggests the effect of co-translational folding kinetics, possibly affected by the frequency of translational errors. The availability of active RM.TthHB27I is of practical importance in molecular biotechnology, extending the palette of available REase specificities.
A Frameshift Mutation in the Cubilin Gene (CUBN) in Border Collies with Imerslund-Gräsbeck Syndrome (Selective Cobalamin Malabsorption)

PubMed Central

Owczarek-Lipska, Marta; Jagannathan, Vidhya; Drögemüller, Cord; Lutz, Sabina; Glanemann, Barbara

2013-01-01

Imerslund-Gräsbeck syndrome (IGS) or selective cobalamin malabsorption has been described in humans and dogs. IGS occurs in Border Collies and is inherited as a monogenic autosomal recessive trait in this breed. Using 7 IGS cases and 7 non-affected controls we mapped the causative mutation by genome-wide association and homozygosity mapping to a 3.53 Mb interval on chromosome 2. We re-sequenced the genome of one affected dog at ∼10× coverage and detected 17 non-synonymous variants in the critical interval. Two of these non-synonymous variants were in the cubilin gene (CUBN), which is known to play an essential role in cobalamin uptake from the ileum. We tested these two CUBN variants for association with IGS in larger cohorts of dogs and found that only one of them was perfectly associated with the phenotype. This variant, a single base pair deletion (c.8392delC), is predicted to cause a frameshift and premature stop codon in the CUBN gene. The resulting mutant open reading frame is 821 codons shorter than the wildtype open reading frame (p.Q2798Rfs*3). Interestingly, we observed an additional nonsense mutation in the MRC1 gene encoding the mannose receptor, C type 1, which was in perfect linkage disequilibrium with the CUBN frameshift mutation. Based on our genetic data and the known role of CUBN for cobalamin uptake we conclude that the identified CUBN frameshift mutation is most likely causative for IGS in Border Collies. PMID:23613799
An upstream open reading frame represses expression of Lc, a member of the R/B family of maize transcriptional activators

DOE Office of Scientific and Technical Information (OSTI.GOV)

Damiani, R.D. Jr.; Wessler, S.R.

1993-09-01

The R/B genes of maize encode a family of basic helix-loop-helix proteins that determine where and when the anthocyanin-pigment pathway will be expressed in the plant. Previous studies showed that allelic diversity among family members reflects differences in gene expression, specifically in transcription initiation. The authors present evidence that the R gene Lc is under translational control. They demonstrate that the 235-nt transcript leader of Lc represses expression 25- to 30-fold in an in vivo assay. Repression is mediated by the presence in cis of a 38-codon upstream open reading frame. Furthermore, the coding capacity of the upstream open readingmore » frame influences the magnitude of repression. It is proposed that translational control does not contribute to tissue specificity but prevents overexpression of the Lc protein. The diversity of promoter and 5' untranslated leader sequences among the R/B genes provides an opportunity to study the coevolution of transcriptional and translational mechanisms of gene regulation. 36 refs., 5 figs.« less
The mitochondrial genome of Polistes jokahamae and a phylogenetic analysis of the Vespoidea (Insecta: Hymenoptera).

PubMed

Song, Sheng-Nan; Chen, Peng-Yan; Wei, Shu-Jun; Chen, Xue-Xin

2016-07-01

The mitochondrial genome sequence of Polistes jokahamae (Radoszkowski, 1887) (Hymenoptera: Vespidae) (GenBank accession no. KR052468) was sequenced. The current length with partial A + T-rich region of this mitochondrial genome is 16,616 bp. All the typical mitochondrial genes were sequenced except for three tRNAs (trnI, trnQ, and trnY) located between the A + T-rich region and nad2. At least three rearrangement events occurred in the sequenced region compared with the pupative ancestral arrangement of insects, corresponding to the shuffling of trnK and trnD, translocation or remote inversion of tnnY and translocation of trnL1. All protein-coding genes start with ATN codons. Eleven, one, and another one protein-coding genes stop with termination codon TAA, TA, and T, respectively. Phylogenetic analysis using the Bayesian method based on all codon positions of the 13 protein-coding genes supports the monophyly of Vespidae and Formicidae. Within the Formicidae, the Myrmicinae and Formicinae form a sister lineage and then sister to the Dolichoderinae, while within the Vespidae, the Eumeninae is sister to the lineage of Vespinae + Polistinae.
Substitution rate and natural selection in parvovirus B19

PubMed Central

Stamenković, Gorana G.; Ćirković, Valentina S.; Šiljić, Marina M.; Blagojević, Jelena V.; Knežević, Aleksandra M.; Joksić, Ivana D.; Stanojević, Maja P.

2016-01-01

The aim of this study was to estimate substitution rate and imprints of natural selection on parvovirus B19 genotype 1. Studied datasets included 137 near complete coding B19 genomes (positions 665 to 4851) for phylogenetic and substitution rate analysis and 146 and 214 partial genomes for selection analyses in open reading frames ORF1 and ORF2, respectively, collected 1973–2012 and including 9 newly sequenced isolates from Serbia. Phylogenetic clustering assigned majority of studied isolates to G1A. Nucleotide substitution rate for total coding DNA was 1.03 (0.6–1.27) x 10−4 substitutions/site/year, with higher values for analyzed genome partitions. In spite of the highest evolutionary rate, VP2 codons were found to be under purifying selection with rare episodic positive selection, whereas codons under diversifying selection were found in the unique part of VP1, known to contain B19 immune epitopes important in persistent infection. Analyses of overlapping gene regions identified nucleotide positions under opposite selective pressure in different ORFs, suggesting complex evolutionary mechanisms of nucleotide changes in B19 viral genomes. PMID:27775080
The Oenococcus oeni clpX Homologue Is a Heat Shock Gene Preferentially Expressed in Exponential Growth Phase

PubMed Central

Jobin, Michel-Philippe; Garmyn, Dominique; Diviès, Charles; Guzzo, Jean

1999-01-01

Using degenerated primers from conserved regions of previously studied clpX gene products, we cloned the clpX gene of the malolactic bacterium Oenococcus oeni. The clpX gene was sequenced, and the deduced protein of 413 amino acids (predicted molecular mass of 45,650 Da) was highly similar to previously analyzed clpX gene products from other organisms. An open reading frame located upstream of the clpX gene was identified as the tig gene by similarity of its predicted product to other bacterial trigger factors. ClpX was purified by using a maltose binding protein fusion system and was shown to possess an ATPase activity. Northern analyses indicated the presence of two independent 1.6-kb monocistronic clpX and tig mRNAs and also showed an increase in clpX mRNA amount after a temperature shift from 30 to 42°C. The clpX transcript is abundant in the early exponential growth phase and progressively declines to undetectable levels in the stationary phase. Thus, unlike hsp18, the gene encoding one of the major small heat shock proteins of Oenococcus oeni, clpX expression is related to the exponential growth phase and requires de novo protein synthesis. Primer extension analysis identified the 5′ end of clpX mRNA which is located 408 nucleotides upstream of a putative AUA start codon. The putative transcription start site allowed identification of a predicted promoter sequence with a high similarity to the consensus sequence found in the housekeeping gene promoter of gram-positive bacteria as well as Escherichia coli. PMID:10542163
Phylogenetic tree construction using trinucleotide usage profile (TUP).

PubMed

Chen, Si; Deng, Lih-Yuan; Bowman, Dale; Shiau, Jyh-Jen Horng; Wong, Tit-Yee; Madahian, Behrouz; Lu, Henry Horng-Shing

2016-10-06

It has been a challenging task to build a genome-wide phylogenetic tree for a large group of species containing a large number of genes with long nucleotides sequences. The most popular method, called feature frequency profile (FFP-k), finds the frequency distribution for all words of certain length k over the whole genome sequence using (overlapping) windows of the same length. For a satisfactory result, the recommended word length (k) ranges from 6 to 15 and it may not be a multiple of 3 (codon length). The total number of possible words needed for FFP-k can range from 4 6 =4096 to 4 15 . We propose a simple improvement over the popular FFP method using only a typical word length of 3. A new method, called Trinucleotide Usage Profile (TUP), is proposed based only on the (relative) frequency distribution using non-overlapping windows of length 3. The total number of possible words needed for TUP is 4 3 =64, which is much less than the total count for the recommended optimal "resolution" for FFP. To build a phylogenetic tree, we propose first representing each of the species by a TUP vector and then using an appropriate distance measure between pairs of the TUP vectors for the tree construction. In particular, we propose summarizing a DNA sequence by a matrix of three rows corresponding to three reading frames, recording the frequency distribution of the non-overlapping words of length 3 in each of the reading frame. We also provide a numerical measure for comparing trees constructed with various methods. Compared to the FFP method, our empirical study showed that the proposed TUP method is more capable of building phylogenetic trees with a stronger biological support. We further provide some justifications on this from the information theory viewpoint. Unlike the FFP method, the TUP method takes the advantage that the starting of the first reading frame is (usually) known. Without this information, the FFP method could only rely on the frequency distribution of overlapping words, which is the average (or mixture) of the frequency distributions of three possible reading frames. Consequently, we show (from the entropy viewpoint) that the FFP procedure could dilute important gene information and therefore provides less accurate classification.
Positions of Trp Codons in the Leader Peptide-Coding Region of the at Operon Influence Anti-Trap Synthesis and trp Operon Expression in Bacillus licheniformis▿

PubMed Central

Levitin, Anastasia; Yanofsky, Charles

2010-01-01

Tryptophan, phenylalanine, tyrosine, and several other metabolites are all synthesized from a common precursor, chorismic acid. Since tryptophan is a product of an energetically expensive biosynthetic pathway, bacteria have developed sensing mechanisms to downregulate synthesis of the enzymes of tryptophan formation when synthesis of the amino acid is not needed. In Bacillus subtilis and some other Gram-positive bacteria, trp operon expression is regulated by two proteins, TRAP (the tryptophan-activated RNA binding protein) and AT (the anti-TRAP protein). TRAP is activated by bound tryptophan, and AT synthesis is increased upon accumulation of uncharged tRNATrp. Tryptophan-activated TRAP binds to trp operon leader RNA, generating a terminator structure that promotes transcription termination. AT binds to tryptophan-activated TRAP, inhibiting its RNA binding ability. In B. subtilis, AT synthesis is upregulated both transcriptionally and translationally in response to the accumulation of uncharged tRNATrp. In this paper, we focus on explaining the differences in organization and regulatory functions of the at operon's leader peptide-coding region, rtpLP, of B. subtilis and Bacillus licheniformis. Our objective was to correlate the greater growth sensitivity of B. licheniformis to tryptophan starvation with the spacing of the three Trp codons in its at operon leader peptide-coding region. Our findings suggest that the Trp codon location in rtpLP of B. licheniformis is designed to allow a mild charged-tRNATrp deficiency to expose the Shine-Dalgarno sequence and start codon for the AT protein, leading to increased AT synthesis. PMID:20061467
How the Sequence of a Gene Specifies Structural Symmetry in Proteins

PubMed Central

Shen, Xiaojuan; Huang, Tongcheng; Wang, Guanyu; Li, Guanglin

2015-01-01

Internal symmetry is commonly observed in the majority of fundamental protein folds. Meanwhile, sufficient evidence suggests that nascent polypeptide chains of proteins have the potential to start the co-translational folding process and this process allows mRNA to contain additional information on protein structure. In this paper, we study the relationship between gene sequences and protein structures from the viewpoint of symmetry to explore how gene sequences code for structural symmetry in proteins. We found that, for a set of two-fold symmetric proteins from left-handed beta-helix fold, intragenic symmetry always exists in their corresponding gene sequences. Meanwhile, codon usage bias and local mRNA structure might be involved in modulating translation speed for the formation of structural symmetry: a major decrease of local codon usage bias in the middle of the codon sequence can be identified as a common feature; and major or consecutive decreases in local mRNA folding energy near the boundaries of the symmetric substructures can also be observed. The results suggest that gene duplication and fusion may be an evolutionarily conserved process for this protein fold. In addition, the usage of rare codons and the formation of higher order of secondary structure near the boundaries of symmetric substructures might have coevolved as conserved mechanisms to slow down translation elongation and to facilitate effective folding of symmetric substructures. These findings provide valuable insights into our understanding of the mechanisms of translation and its evolution, as well as the design of proteins via symmetric modules. PMID:26641668
Hypothesis Formation and Qualitative Reasoning in Molecular Biology

DTIC Science & Technology

1989-06-01

presents studies of the trp operon in the bacterium S . Marcescens . In vitro transcription studies showed that transcription termination does occur in...observed was that there are two 4.4. ANNOTATED CHRONOLOGY OF THE RESEARCH 135 translation-start codons in the S . marcescens leader region. The authors...of leader-region mRNA secondary structures in attenuation in the S . marcescens trp operon. A different bac- terium was used because it included
Complete mitochondrial genome of the Freshwater Whipray Himantura dalyensis.

PubMed

Feutry, Pierre; Kyne, Peter M; Peng, Zaiqing; Pan, Lianghao; Chen, Xiao

2016-05-01

The complete mitochondrial genome of the Freshwater Whipray Himantura dalyensis is presented in this study. It is 17,693 bp in length and contains 37 genes in typical gene order and transcriptional orientation observed in vertebrates. There were a total of 86 bp short intergenic spacers and 22 bp overlaps in the genome. The overall base composition was 31.4% A, 25.5% C, 13.2% G and 29.9% T. Two start codons (GTG and ATG) and two stop codons (TAG and TAA/T) were found in 13 protein-coding genes. The length of 22 tRNA genes ranged from 68 (tRNA-Cys and tRNA-Ser2) to 75 bp (tRNA-Leu1). The origin of L-strand replication (OL) was found between the tRNA-Asn and tRNA-Cys genes. The base composition of the control region (1940 bp) was similar to the whole mitogenome.
Molecular spectrum of c-KIT and PDGFRA gene mutations in gastro intestinal stromal tumor: determination of frequency, distribution pattern and identification of novel mutations in Indian patients.

PubMed

Ahmad, Firoz; Lad, Purnima; Bhatia, Simi; Das, Bibhu Ranjan

2015-01-01

KIT and PDGFRA gene mutations are the major genetic alterations seen in gastrointestinal stromal tumors (GISTs) and are being used clinically for predicting response to imatinib therapy. In the current study, we set out to explore the frequency and distribution pattern of c-KIT (exons 9, 11 and 13) and PDGFRA (exons 12 and 18) by direct sequencing in a series of 70 Indian GIST cases. Overall, 27 (38.5 %) and 4 (5.7 %) of the cases had c-KIT and PDGFRA mutations, respectively. Majority of KIT mutations involved exon 11 (85.7 %), followed by exon 9 (14.3 %), while none showed exon 13 mutation. Most exon 9 mutations showed Ala503-Tyr504 duplication, while one had novel point mutation at codon 476 (S476G). In contrast to exon 9 mutations, most exon 11 mutations were in-frame deletions (79 %, 19/24), predominantly at codons 550-560, while remaining exon 11 mutant cases were point mutations at codons 559, 560, 568, 573 and 575. Interestingly, P573T, Q556_V560delinsH, Q575H and Q575_P577 were novel variations observed in exon 11. The PDGFRA mutations were seen mostly in exon 18, which showed point mutation at codon 842 (D842V), while exon 12 showed a novel indel variation (V561_H570delinsT). No significant correlation between c-KIT/PDGFRA mutations and clinicopathological data was observed. In conclusion, this study highlights the frequency and distribution pattern of c-KIT/PDGFRA mutation in Indian cohort. The current study identified novel variations that added new insights into the genetic heterogeneity of GIST patients. Furthermore, this is the first study to report the presence of PDGFRA mutation from Indian subcontinent.
Global translational impacts of the loss of the tRNA modification t6A in yeast.

PubMed

Thiaville, Patrick C; Legendre, Rachel; Rojas-Benítez, Diego; Baudin-Baillieu, Agnès; Hatin, Isabelle; Chalancon, Guilhem; Glavic, Alvaro; Namy, Olivier; de Crécy-Lagard, Valérie

2016-01-01

The universal tRNA modification t 6 A is found at position 37 of nearly all tRNAs decoding ANN codons. The absence of t 6 A 37 leads to severe growth defects in baker's yeast, phenotypes similar to those caused by defects in mcm 5 s 2 U 34 synthesis. Mutants in mcm 5 s 2 U 34 can be suppressed by overexpression of tRNA Lys UUU , but we show t 6 A phenotypes could not be suppressed by expressing any individual ANN decoding tRNA, and t 6 A and mcm 5 s 2 U are not determinants for each other's formation. Our results suggest that t 6 A deficiency, like mcm 5 s 2 U deficiency, leads to protein folding defects, and show that the absence of t 6 A led to stress sensitivities (heat, ethanol, salt) and sensitivity to TOR pathway inhibitors. Additionally, L-homoserine suppressed the slow growth phenotype seen in t 6 A-deficient strains, and proteins aggregates and Advanced Glycation End-products (AGEs) were increased in the mutants. The global consequences on translation caused by t 6 A absence were examined by ribosome profiling. Interestingly, the absence of t 6 A did not lead to global translation defects, but did increase translation initiation at upstream non-AUG codons and increased frame-shifting in specific genes. Analysis of codon occupancy rates suggests that one of the major roles of t 6 A is to homogenize the process of elongation by slowing the elongation rate at codons decoded by high abundance tRNAs and I 34 :C 3 pairs while increasing the elongation rate of rare tRNAs and G 34 :U 3 pairs. This work reveals that the consequences of t 6 A absence are complex and multilayered and has set the stage to elucidate the molecular basis of the observed phenotypes.
A high-level prokaryotic expression system: synthesis of human interleukin 1 alpha and its receptor antagonist.

PubMed

Birikh, K R; Lebedenko, E N; Boni, I V; Berlin, Y A

1995-10-27

Synthetic intronless genes, coding for human interleukin 1 alpha (IL 1 alpha) and interleukin 1 receptor antagonist (IL1ra), have been expressed efficiently in a specially designed prokaryotic vector, pGMCE (a pGEM1 derivative), where the target gene forms the second part of a two-cistron system. The first part of the system is a translation enhancer-containing mini-cistron, whose termination codon overlaps the start codon of the target gene. In the case of the IL1 alpha gene, the high expression level is largely due to the direct efficient translation initiation at the second cistron, whereas with the IL1ra gene in the same system, the proximal translation initiation region (TIR) provides a high level of coupled expression of the target gene. Thus, pGMCE is a potentially versatile vector for direct prokaryotic expression.
ClubSub-P: Cluster-Based Subcellular Localization Prediction for Gram-Negative Bacteria and Archaea

PubMed Central

Paramasivam, Nagarajan; Linke, Dirk

2011-01-01

The subcellular localization (SCL) of proteins provides important clues to their function in a cell. In our efforts to predict useful vaccine targets against Gram-negative bacteria, we noticed that misannotated start codons frequently lead to wrongly assigned SCLs. This and other problems in SCL prediction, such as the relatively high false-positive and false-negative rates of some tools, can be avoided by applying multiple prediction tools to groups of homologous proteins. Here we present ClubSub-P, an online database that combines existing SCL prediction tools into a consensus pipeline from more than 600 proteomes of fully sequenced microorganisms. On top of the consensus prediction at the level of single sequences, the tool uses clusters of homologous proteins from Gram-negative bacteria and from Archaea to eliminate false-positive and false-negative predictions. ClubSub-P can assign the SCL of proteins from Gram-negative bacteria and Archaea with high precision. The database is searchable, and can easily be expanded using either new bacterial genomes or new prediction tools as they become available. This will further improve the performance of the SCL prediction, as well as the detection of misannotated start codons and other annotation errors. ClubSub-P is available online at http://toolkit.tuebingen.mpg.de/clubsubp/ PMID:22073040
Identification and characterization of a catechol-o-methyltransferase cDNA in the catfish Heteropneustes fossilis: Tissue, sex and seasonal variations, and effects of gonadotropin and 2-hydroxyestradiol-17β on mRNA expression.

PubMed

Chaube, R; Rawat, A; Inbaraj, R M; Bobe, J; Guiguen, Y; Fostier, A; Joy, K P

2017-05-15

Catechol-O-methyltransferase (COMT) is involved in the methylation and inactivation of endogenous and xenobiotic catechol compounds, and serves as a common biochemical link in the catecholamine and catecholestrogen metabolism. Studies on cloning, sequencing and function characterization comt gene in lower vertebrates like fish are fewer. In the present study, a full-length comt cDNA of 1442bp with an open-reading frame (ORF) of 792bp, and start codon (ATG) at nucleotide 162 and stop codon (TAG) at nucleotide 953 was isolated and characterized in the stinging catfish Heteropneustes fossilis (accession No. KT597925). The ORF codes for a protein of 263 amino acid residues, which is also validated by the catfish transcriptome data analysis. The catfish Comt shared conserved putative structural regions important for S-adenosyl methionine (AdoMet)- and catechol-binding, transmembrane regions, two glycosylation sites (N-65 and N-91) at the N-terminus and two phosphorylation sites (Ser-235 and Thr-240) at the C-terminus. The gene was expressed in all tissues examined and the expression showed significant sex dimorphic distribution with high levels in females. The transcript was abundant in the liver, brain and gonads and low in muscles. The transcripts showed significant seasonal variations in the brain and ovary, increased progressively to the peak levels in spawning phase and then declined. The brain and ovarian comt mRNA levels showed periovulatory changes after in vivo and in vitro human chorionic gonadotropin (hCG) treatments with high fold increases at 16 and 24h in the brain and at 16h in the ovary. The catecholestrogen 2-hydroxyE 2 up regulated ovarian comt expression in vitro with the highest fold increase at 16h. The mRNA and protein was localized in the follicular layer of the vitellogenic follicles and in the cytoplasm of primary follicles. The data were discussed in relation to catecholamine and catecholestrogen-mediated functions in the brain and ovary of the stinging catfish. Copyright © 2016 Elsevier Inc. All rights reserved.
Assessment of possible allergenicity of hypothetical ORFs in common food crops using current bioinformatic guidelines and its implications for the safety assessment of GM crops.

PubMed

Young, Gregory J; Zhang, Shiping; Mirsky, Henry P; Cressman, Robert F; Cong, Bin; Ladics, Gregory S; Zhong, Cathy X

2012-10-01

Before a genetically modified (GM) crop can be commercialized it must pass through a rigorous regulatory process to verify that it is safe for human and animal consumption, and to the environment. One particular area of focus is the potential introduction of a known or cross-reactive allergen not previously present within the crop. The assessment of possible allergenicity uses the guidelines outlined by the Food and Agriculture Organization (FAO) and World Health Organization's (WHO) Codex Alimentarius Commission (Codex) to evaluate all newly expressed proteins. Some regulatory authorities have broadened the scope of the assessment to include all DNA reading frames between stop codons across the insert and spanning the insert/genomic DNA junctions. To investigate the utility of this bioinformatic assessment, all naturally occurring stop-to-stop frames in the non-transgenic genomes of maize, rice, and soybean, as well as the human genome, were compared against the AllergenOnline (www.allergenonline.org) database using the Codex criteria. We discovered thousands of frames that exceeded the Codex defined threshold for potential cross-reactivity suggesting that evaluating hypothetical ORFs (stop-to-stop frames) has questionable value for making decisions on the safety of GM crops. Copyright © 2012 Elsevier Ltd. All rights reserved.
Inhibition of Non-ATG Translational Events in Cells via Covalent Small Molecules Targeting RNA.

PubMed

Yang, Wang-Yong; Wilson, Henry D; Velagapudi, Sai Pradeep; Disney, Matthew D

2015-04-29

One major class of disease-causing RNAs is expanded repeating transcripts. These RNAs cause diseases via multiple mechanisms, including: (i) gain-of-function, in which repeating RNAs bind and sequester proteins involved in RNA biogenesis and (ii) repeat associated non-ATG (RAN) translation, in which repeating transcripts are translated into toxic proteins without use of a canonical, AUG, start codon. Herein, we develop and study chemical probes that bind and react with an expanded r(CGG) repeat (r(CGG)(exp)) present in a 5' untranslated region that causes fragile X-associated tremor/ataxia syndrome (FXTAS). Reactive compounds bind to r(CGG)(exp) in cellulo as shown with Chem-CLIP-Map, an approach to map small molecule binding sites within RNAs in cells. Compounds also potently improve FXTAS-associated pre-mRNA splicing and RAN translational defects, while not affecting translation of the downstream open reading frame. In contrast, oligonucleotides affect both RAN and canonical translation when they bind to r(CGG)(exp), which is mechanistically traced to a decrease in polysome loading. Thus, designer small molecules that react with RNA targets can be used to profile the RNAs to which they bind in cells, including identification of binding sites, and can modulate several aspects of RNA-mediated disease pathology in a manner that may be more beneficial than oligonucleotides.

Complete mitochondrial genome sequence of Urechis caupo, a representative of the phylum Echiura

PubMed Central

Boore, Jeffrey L

2004-01-01

Background Mitochondria contain small genomes that are physically separate from those of nuclei. Their comparison serves as a model system for understanding the processes of genome evolution. Although hundreds of these genome sequences have been reported, the taxonomic sampling is highly biased toward vertebrates and arthropods, with many whole phyla remaining unstudied. This is the first description of a complete mitochondrial genome sequence of a representative of the phylum Echiura, that of the fat innkeeper worm, Urechis caupo. Results This mtDNA is 15,113 nts in length and 62% A+T. It contains the 37 genes that are typical for animal mtDNAs in an arrangement somewhat similar to that of annelid worms. All genes are encoded by the same DNA strand which is rich in A and C relative to the opposite strand. Codons ending with the dinucleotide GG are more frequent than would be expected from apparent mutational biases. The largest non-coding region is only 282 nts long, is 71% A+T, and has potential for secondary structures. Conclusions Urechis caupo mtDNA shares many features with those of the few studied annelids, including the common usage of ATG start codons, unusual among animal mtDNAs, as well as gene arrangements, tRNA structures, and codon usage biases. PMID:15369601
The complete mitochondrial genome and phylogenetic analysis of the giant panda (Ailuropoda melanoleuca).

PubMed

Peng, Rui; Zeng, Bo; Meng, Xiuxiang; Yue, Bisong; Zhang, Zhihe; Zou, Fangdong

2007-08-01

The complete mitochondrial genome sequence of the giant panda, Ailuropoda melanoleuca, was determined by the long and accurate polymerase chain reaction (LA-PCR) with conserved primers and primer walking sequence methods. The complete mitochondrial DNA is 16,805 nucleotides in length and contains two ribosomal RNA genes, 13 protein-coding genes, 22 transfer RNA genes and one control region. The total length of the 13 protein-coding genes is longer than the American black bear, brown bear and polar bear by 3 amino acids at the end of ND5 gene. The codon usage also followed the typical vertebrate pattern except for an unusual ATT start codon, which initiates the NADH dehydrogenase subunit 5 (ND5) gene. The molecular phylogenetic analysis was performed on the sequences of 12 concatenated heavy-strand encoded protein-coding genes, and suggested that the giant panda is most closely related to bears.
Identification of a second flagellin gene and functional characterization of a sigma70-like promoter upstream of a Leptospira borgpetersenii flaB gene.

PubMed

Lin, Min; Dan, Hanhong; Li, Yijing

2004-02-01

Leptospira borgpetersenii, one of the causative agents of leptospirosis in both animals and humans, is a bacterial pathogen with characteristic motility that is mediated by the rotation of two periplasmic flagella (PF). The flaB gene coding for a core polypeptide subunit of PF was previously characterized by sequence analysis of its open reading frame (ORF) (M. Lin, J Biochem Mol Biol Biophys 2:181-187, 1999). The present study was undertaken to isolate and clone the uncharacterized sequence upstream of the flaB gene by using a PCR-based genome walking procedure. This has resulted in a 1470-bp genomic DNA sequence in which an 846-bp ORF coding for a 281-amino acid polypeptide (31.3 kDa) is identified 455 bp upstream from the flaB start codon. The encoded protein exhibits 72% amino acid identity to the deduced FlaB protein sequence of L. borgpetersenii and a high degree of sequence homology to the FlaB proteins of other spirochaetes. This has demonstrated for the first time that a second flaB gene homolog is present in a Leptospira species. The newly identified gene is designated flaB1, and the previously cloned flaB renamed flaB2. Within the intergenic sequence between flaB1 and flaB2, a potential stem-loop structure (12-bp inverted repeats) was identified 25 bp downstream of the flaB1 stop codon; this could serve as a transcription terminator for the flaB1 mRNA. Three E. coli-like promoter regions (I, II, and III) for binding Esigma(70), a regulatory sequence uncommonly found in flagellar genes, were predicted upstream of the flaB2 ORF. Only promoter region II contains a promoter that is functional in E. coli, as revealed at phenotypic and transcriptional levels by its capability of directing the expression of the chloramphenicol acetyltransferase (CAT) gene in the promoter probe vector pKK232-8. These observations may suggest that flaB1 and flaB2 are transcribed separately and do not form a transcriptional operon controlled by a single promoter.
The Rift Valley fever accessory proteins NSm and P78/NSm-GN are distinct determinants of virus propagation in vertebrate and invertebrate hosts

PubMed Central

Kreher, Felix; Tamietti, Carole; Gommet, Céline; Guillemot, Laurent; Ermonval, Myriam; Failloux, Anna-Bella; Panthier, Jean-Jacques; Bouloy, Michèle; Flamand, Marie

2014-01-01

Rift Valley fever virus (RVFV) is an enzootic virus circulating in Africa that is transmitted to its vertebrate host by a mosquito vector and causes severe clinical manifestations in humans and ruminants. RVFV has a tripartite genome of negative or ambisense polarity. The M segment contains five in-frame AUG codons that are alternatively used for the synthesis of two major structural glycoproteins, GN and GC, and at least two accessory proteins, NSm, a 14-kDa cytosolic protein, and P78/NSm-GN, a 78-kDa glycoprotein. To determine the relative contribution of P78 and NSm to RVFV infectivity, AUG codons were knocked out to generate mutant viruses expressing various sets of the M-encoded proteins. We found that, in the absence of the second AUG codon used to express NSm, a 13-kDa protein corresponding to an N-terminally truncated form of NSm, named NSm′, was synthesized from AUG 3. None of the individual accessory proteins had any significant impact on RVFV virulence in mice. However, a mutant virus lacking both NSm and NSm′ was strongly attenuated in mice and grew to reduced titers in murine macrophages, a major target cell type of RVFV. In contrast, P78 was not associated with reduced viral virulence in mice, yet it appeared as a major determinant of virus dissemination in mosquitoes. This study demonstrates how related accessory proteins differentially contribute to RVFV propagation in mammalian and arthropod hosts. PMID:26038497
Dysfunctional growth hormone receptor in a strain of sex-linked dwarf chicken: evidence for a mutation in the intracellular domain.

PubMed

Agarwal, S K; Cogburn, L A; Burnside, J

1994-09-01

The sex-linked dwarf (dwdw) chicken represents a valuable animal model for studying GH insensitivity and the consequence of mutations in the GH receptor (GHR) gene. We have recently reported undetectable hepatic GH-binding activity and an aberrantly sized transcript in a strain of dwdw chickens obtained from Arbor Acre Farms, Inc. (Glastonbury, CT, USA). Southern blot analysis of the chicken GHR (cGHR) gene revealed a restriction-fragment length polymorphism in HindIII and EcoRI digests of genomic DNA in this strain of dwdw chicken. In order to localize the molecular mutation, we analysed the gene structure and determined the complete sequence of the 3' untranslated region (3' UTR) of the normal cGHR. With the use of this information, we located a large deletion in the 3' end of the cGHR gene of the Connecticut (CT) strain of dwdw chicken. This deletion (1773 bp) contained 27 highly conserved amino acids of the 3' end of the coding region, the in-frame stop codon, a less frequently used poly(A) signal that is normally found 445 bp downstream of the stop codon, and a large portion of the 3' UTR. Because of this deletion, 27 novel amino acids were substituted and the open reading frame was extended for an additional 26 amino acids before reaching the transcriptional termination site. The predicted amino acid sequence of the novel carboxyl-terminus of the dwdw cGHR is largely hydrophobic with a polylysine tail, whereas the carboxyl-terminus of the wild-type (DwDw) cGHR is composed of hydrophilic amino acids.(ABSTRACT TRUNCATED AT 250 WORDS)
Broad genomic and transcriptional analysis reveals a highly derived genome in dinoflagellate mitochondria

PubMed Central

Jackson, Christopher J; Norman, John E; Schnare, Murray N; Gray, Michael W; Keeling, Patrick J; Waller, Ross F

2007-01-01

Background Dinoflagellates comprise an ecologically significant and diverse eukaryotic phylum that is sister to the phylum containing apicomplexan endoparasites. The mitochondrial genome of apicomplexans is uniquely reduced in gene content and size, encoding only three proteins and two ribosomal RNAs (rRNAs) within a highly compacted 6 kb DNA. Dinoflagellate mitochondrial genomes have been comparatively poorly studied: limited available data suggest some similarities with apicomplexan mitochondrial genomes but an even more radical type of genomic organization. Here, we investigate structure, content and expression of dinoflagellate mitochondrial genomes. Results From two dinoflagellates, Crypthecodinium cohnii and Karlodinium micrum, we generated over 42 kb of mitochondrial genomic data that indicate a reduced gene content paralleling that of mitochondrial genomes in apicomplexans, i.e., only three protein-encoding genes and at least eight conserved components of the highly fragmented large and small subunit rRNAs. Unlike in apicomplexans, dinoflagellate mitochondrial genes occur in multiple copies, often as gene fragments, and in numerous genomic contexts. Analysis of cDNAs suggests several novel aspects of dinoflagellate mitochondrial gene expression. Polycistronic transcripts were found, standard start codons are absent, and oligoadenylation occurs upstream of stop codons, resulting in the absence of termination codons. Transcripts of at least one gene, cox3, are apparently trans-spliced to generate full-length mRNAs. RNA substitutional editing, a process previously identified for mRNAs in dinoflagellate mitochondria, is also implicated in rRNA expression. Conclusion The dinoflagellate mitochondrial genome shares the same gene complement and fragmentation of rRNA genes with its apicomplexan counterpart. However, it also exhibits several unique characteristics. Most notable are the expansion of gene copy numbers and their arrangements within the genome, RNA editing, loss of stop codons, and use of trans-splicing. PMID:17897476
Electronic clinical predictive thermometer using logarithm for temperature prediction

NASA Technical Reports Server (NTRS)

Cambridge, Vivien J. (Inventor); Koger, Thomas L. (Inventor); Nail, William L. (Inventor); Diaz, Patrick (Inventor)

1998-01-01

A thermometer that rapidly predicts body temperature based on the temperature signals received from a temperature sensing probe when it comes into contact with the body. The logarithms of the differences between the temperature signals in a selected time frame are determined. A line is fit through the logarithms and the slope of the line is used as a system time constant in predicting the final temperature of the body. The time constant in conjunction with predetermined additional constants are used to compute the predicted temperature. Data quality in the time frame is monitored and if unacceptable, a different time frame of temperature signals is selected for use in prediction. The processor switches to a monitor mode if data quality over a limited number of time frames is unacceptable. Determining the start time on which the measurement time frame for prediction is based is performed by summing the second derivatives of temperature signals over time frames. When the sum of second derivatives in a particular time frame exceeds a threshold, the start time is established.
uAUG-mediated translational initiations are responsible for human mu opioid receptor gene expression

PubMed Central

Song, Kyu Young; Kim, Chun Sung; Hwang, Cheol Kyu; Choi, Hack Sun; Law, Ping-Yee; Wei, Li-Na; Loh, Horace H

2010-01-01

Abstract Mu opioid receptor (MOR) is the main site of interaction for major clinical analgesics, particularly morphine. MOR expression is regulated at the transcriptional and post-transcriptional levels. However, the protein expression of the MOR gene is relatively low and the translational control of MOR gene has not been well studied. The 5′-untranslated region (UTR) of the human MOR (OPRM1) mRNA contains four upstream AUG codons (uAUG) preceding the main translation initiation site. We mutated the four uAUGs individually and in combination. Mutations of the third uAUG, containing the same open reading frame, had the strongest inhibitory effect. The inhibitory effect caused by the third in-frame uAUG was confirmed by in vitro translation and receptor-binding assays. Toeprinting results showed that OPRM1 ribosomes initiated efficiently at the first uAUG, and subsequently re-initiated at the in-frame #3 uAUG and the physiological AUG site. This re-initiation resulted in negative expression of OPRM1 under normal conditions. These results indicate that re-initiation in MOR gene expression could play an important role in OPRM1 regulation. PMID:19438807
Analysis of synonymous codon usage patterns in the genus Rhizobium.

PubMed

Wang, Xinxin; Wu, Liang; Zhou, Ping; Zhu, Shengfeng; An, Wei; Chen, Yu; Zhao, Lin

2013-11-01

The codon usage patterns of rhizobia have received increasing attention. However, little information is available regarding the conserved features of the codon usage patterns in a typical rhizobial genus. The codon usage patterns of six completely sequenced strains belonging to the genus Rhizobium were analysed as model rhizobia in the present study. The relative neutrality plot showed that selection pressure played a role in codon usage in the genus Rhizobium. Spearman's rank correlation analysis combined with correspondence analysis (COA) showed that the codon adaptation index and the effective number of codons (ENC) had strong correlation with the first axis of the COA, which indicated the important role of gene expression level and the ENC in the codon usage patterns in this genus. The relative synonymous codon usage of Cys codons had the strongest correlation with the second axis of the COA. Accordingly, the usage of Cys codons was another important factor that shaped the codon usage patterns in Rhizobium genomes and was a conserved feature of the genus. Moreover, the comparison of codon usage between highly and lowly expressed genes showed that 20 unique preferred codons were shared among Rhizobium genomes, revealing another conserved feature of the genus. This is the first report of the codon usage patterns in the genus Rhizobium.
Genome-wide comparative analysis of codon usage bias and codon context patterns among cyanobacterial genomes.

PubMed

Prabha, Ratna; Singh, Dhananjaya P; Sinha, Swati; Ahmad, Khurshid; Rai, Anil

2017-04-01

With the increasing accumulation of genomic sequence information of prokaryotes, the study of codon usage bias has gained renewed attention. The purpose of this study was to examine codon selection pattern within and across cyanobacterial species belonging to diverse taxonomic orders and habitats. We performed detailed comparative analysis of cyanobacterial genomes with respect to codon bias. Our analysis reflects that in cyanobacterial genomes, A- and/or T-ending codons were used predominantly in the genes whereas G- and/or C-ending codons were largely avoided. Variation in the codon context usage of cyanobacterial genes corresponded to the clustering of cyanobacteria as per their GC content. Analysis of codon adaptation index (CAI) and synonymous codon usage order (SCUO) revealed that majority of genes are associated with low codon bias. Codon selection pattern in cyanobacterial genomes reflected compositional constraints as major influencing factor. It is also identified that although, mutational constraint may play some role in affecting codon usage bias in cyanobacteria, compositional constraint in terms of genomic GC composition coupled with environmental factors affected codon selection pattern in cyanobacterial genomes. Copyright © 2016 Elsevier B.V. All rights reserved.
MUC1-ARF-A Novel MUC1 Protein That Resides in the Nucleus and Is Expressed by Alternate Reading Frame Translation of MUC1 mRNA.

PubMed

Chalick, Michael; Jacobi, Oded; Pichinuk, Edward; Garbar, Christian; Bensussan, Armand; Meeker, Alan; Ziv, Ravit; Zehavi, Tania; Smorodinsky, Nechama I; Hilkens, John; Hanisch, Franz-Georg; Rubinstein, Daniel B; Wreschner, Daniel H

2016-01-01

Translation of mRNA in alternate reading frames (ARF) is a naturally occurring process heretofore underappreciated as a generator of protein diversity. The MUC1 gene encodes MUC1-TM, a signal-transducing trans-membrane protein highly expressed in human malignancies. Here we show that an AUG codon downstream to the MUC1-TM initiation codon initiates an alternate reading frame thereby generating a novel protein, MUC1-ARF. MUC1-ARF, like its MUC1-TM 'parent' protein, contains a tandem repeat (VNTR) domain. However, the amino acid sequence of the MUC1-ARF tandem repeat as well as N- and C- sequences flanking it differ entirely from those of MUC1-TM. In vitro protein synthesis assays and extensive immunohistochemical as well as western blot analyses with MUC1-ARF specific monoclonal antibodies confirmed MUC1-ARF expression. Rather than being expressed at the cell membrane like MUC1-TM, immunostaining showed that MUC1-ARF protein localizes mainly in the nucleus: Immunohistochemical analyses of MUC1-expressing tissues demonstrated MUC1-ARF expression in the nuclei of secretory luminal epithelial cells. MUC1-ARF expression varies in different malignancies. While the malignant epithelial cells of pancreatic cancer show limited expression, in breast cancer tissue MUC1-ARF demonstrates strong nuclear expression. Proinflammatory cytokines upregulate expression of MUC1-ARF protein and co-immunoprecipitation analyses demonstrate association of MUC1-ARF with SH3 domain-containing proteins. Mass spectrometry performed on proteins coprecipitating with MUC1-ARF demonstrated Glucose-6-phosphate 1-dehydrogenase (G6PD) and Dynamin 2 (DNM2). These studies not only reveal that the MUC1 gene generates a previously unidentified MUC1-ARF protein, they also show that just like its 'parent' MUC1-TM protein, MUC1-ARF is apparently linked to signaling and malignancy, yet a definitive link to these processes and the roles it plays awaits a precise identification of its molecular functions. Comprising at least 524 amino acids, MUC1-ARF is, furthermore, the longest ARF protein heretofore described.
MUC1-ARF—A Novel MUC1 Protein That Resides in the Nucleus and Is Expressed by Alternate Reading Frame Translation of MUC1 mRNA

PubMed Central

Pichinuk, Edward; Garbar, Christian; Bensussan, Armand; Meeker, Alan; Ziv, Ravit; Zehavi, Tania; Smorodinsky, Nechama I.; Hilkens, John; Hanisch, Franz-Georg; Rubinstein, Daniel B.; Wreschner, Daniel H.

2016-01-01

Translation of mRNA in alternate reading frames (ARF) is a naturally occurring process heretofore underappreciated as a generator of protein diversity. The MUC1 gene encodes MUC1-TM, a signal-transducing trans-membrane protein highly expressed in human malignancies. Here we show that an AUG codon downstream to the MUC1-TM initiation codon initiates an alternate reading frame thereby generating a novel protein, MUC1-ARF. MUC1-ARF, like its MUC1-TM 'parent’ protein, contains a tandem repeat (VNTR) domain. However, the amino acid sequence of the MUC1-ARF tandem repeat as well as N- and C- sequences flanking it differ entirely from those of MUC1-TM. In vitro protein synthesis assays and extensive immunohistochemical as well as western blot analyses with MUC1-ARF specific monoclonal antibodies confirmed MUC1-ARF expression. Rather than being expressed at the cell membrane like MUC1-TM, immunostaining showed that MUC1-ARF protein localizes mainly in the nucleus: Immunohistochemical analyses of MUC1-expressing tissues demonstrated MUC1-ARF expression in the nuclei of secretory luminal epithelial cells. MUC1-ARF expression varies in different malignancies. While the malignant epithelial cells of pancreatic cancer show limited expression, in breast cancer tissue MUC1-ARF demonstrates strong nuclear expression. Proinflammatory cytokines upregulate expression of MUC1-ARF protein and co-immunoprecipitation analyses demonstrate association of MUC1-ARF with SH3 domain-containing proteins. Mass spectrometry performed on proteins coprecipitating with MUC1-ARF demonstrated Glucose-6-phosphate 1-dehydrogenase (G6PD) and Dynamin 2 (DNM2). These studies not only reveal that the MUC1 gene generates a previously unidentified MUC1-ARF protein, they also show that just like its ‘parent’ MUC1-TM protein, MUC1-ARF is apparently linked to signaling and malignancy, yet a definitive link to these processes and the roles it plays awaits a precise identification of its molecular functions. Comprising at least 524 amino acids, MUC1-ARF is, furthermore, the longest ARF protein heretofore described. PMID:27768738
Circuitry linking the global Csr and σE-dependent cell envelope stress response systems.

PubMed

Yakhnin, Helen; Aichele, Robert; Ades, Sarah E; Romeo, Tony; Babitzke, Paul

2017-09-18

CsrA of Escherichia coli is an RNA-binding protein that globally regulates a wide variety of cellular processes and behaviors including carbon metabolism, motility, biofilm formation, and the stringent response. CsrB and CsrC are sRNAs that sequester CsrA, thereby preventing CsrA-mRNA interaction. RpoE (σ E ) is the extracytoplasmic stress response sigma factor of E. coli Previous RNA-seq studies identified rpoE mRNA as a CsrA target. Here we explored the regulation of rpoE by CsrA and found that CsrA represses rpoE translation. Gel mobility shift, footprint and toeprint studies identified three CsrA binding sites in the rpoE leader transcript, one of which overlaps the rpoE Shine-Dalgarno (SD) sequence, while another overlaps the rpoE translation initiation codon. Coupled in vitro transcription-translation experiments showed that CsrA represses rpoE translation by binding to these sites. We further demonstrate that σ E indirectly activates transcription of csrB and csrC , leading to increased sequestration of CsrA such that repression of rpoE by CsrA is reduced. We propose that the Csr system fine-tunes the σ E -dependent cell envelope stress response. We also identified a 51 amino acid coding sequence whose stop codon overlaps the rpoE start codon, and demonstrate that rpoE is translationally coupled with this upstream open reading frame (ORF51). Loss of coupling reduces rpoE translation by more than 50%. Identification of a translationally coupled ORF upstream of rpoE suggests that this previously unannotated protein may participate in the cell envelope stress response. In keeping with existing nomenclature, we name ORF51 as rseD , resulting in an operon arrangement of rseD-rpoE-rseA-rseB-rseC IMPORTANCE CsrA posttranscriptionally represses genes required for bacterial stress responses, including the stringent response, catabolite repression, and the RpoS (σ S )-mediated general stress response. We show that CsrA represses translation of rpoE , encoding the extracytoplasmic stress response sigma factor and that σ E indirectly activates transcription of csrB and csrC , resulting in reciprocal regulation of these two global regulatory systems. These findings suggest that extracytoplasmic stress leads to derepression of rpoE translation by CsrA, and CsrA-mediated repression helps to reset RpoE abundance to pre-stress levels once envelope damage is repaired. The discovery of an ORF, RseD, translationally coupled with rpoE adds further complexity to translational control of rpoE . Copyright © 2017 American Society for Microbiology.
ORF4-protein deficient PCV2 mutants enhance virus-induced apoptosis and show differential expression of mRNAs in vitro.

PubMed

Gao, Zhangzhao; Dong, Qinfang; Jiang, Yonghou; Opriessnig, Tanja; Wang, Jingxiu; Quan, Yanping; Yang, Zongqi

2014-04-01

Porcine circovirus type 2 (PCV2) is the essential infectious agent of PCV associated disease (PCVAD). During previous in vitro studies, 11 RNAs and four viral proteins have been detected in PCV2-infected cells. Open reading frame (ORF) 4 is 180bp in length and has been identified at the transcription and the translation level. It overlaps completely with ORF3, which has a role in virus-induced apoptosis. In this study, start codon mutations (M1-PCV2) or in-frame termination mutations (M2-PCV2) were utilized to construct two ORF4-protein deficient viruses aiming to investigate its role in viral infection. The abilities of M1-PCV2 and M2-PCV2 to replicate, transcribe, express viral proteins, and to cause cellular apoptosis were evaluated. Viral DNA replication curves supported that the ORF4 protein is not essential for viral replication, but inhibits viral replication in the early stage of infection. Comparison of the expression level of ORF3 mRNA among wild-type and ORF4-deficient viruses in infected PK-15 cell demonstrated enhanced ORF3 transcription of both ORF4 mutants suggesting that the ORF4 protein may play an important role by restricting ORF3 transcription thereby preventing virus-induced apoptosis. This is further confirmed by the significantly higher caspase 3 and 8 activities in M1-PCV2 and M2-PCV2 compared to wild-type PCV2. Furthermore, the role of ORF4 in cell apoptosis and a possible interaction with the ORF1 associated Rep protein could perhaps explain the rapid viral growth in the early stage of infection and the higher expression level of ORF1 mRNA in ORF4 protein deficient PCV2 mutants. Copyright © 2014 Elsevier B.V. All rights reserved.
Molecular characterization of phosphorylcholine expression on the lipooligosaccharide of Histophilus somni.

PubMed

Elswaifi, Shaadi F; St Michael, Frank; Sreenivas, Avula; Cox, Andrew; Carman, George M; Inzana, Thomas J

2009-10-01

Histophilus somni (Haemophilus somnus) is an important pathogen of cattle that is responsible for respiratory disease, septicemia, and systemic diseases such as thrombotic meningoencephalitis, myocarditis, and abortion. A variety of virulence factors have been identified in H. somni, including compositional and antigenic variation of the lipooligosaccharide (LOS). Phosphorylcholine (ChoP) has been identified as one of the components of H. somni LOS that undergoes antigenic variation. In this study, five genes (lic1ABCD(Hs) and glpQ) with homology to genes responsible for ChoP expression in Haemophilus influenzae LOS were identified in the H. somni genome. An H. somni open reading frame (ORF) with homology to H. influenzae lic1A (lic1A(Hi)) contained a variable number of tandem repeats (VNTR). However, whereas the tetranucleotide repeat 5'-CAAT-3' is present in lic1A(Hi), the VNTR in H. somni lic1A (lic1A(Hs)) consisted of 5'-AACC-3'. Due to the propensity of VNTR to vary during replication and cause the ORF to shift in and out of frame with the upstream start codon, the VNTR were deleted from lic1A(Hs) to maintain the gene constitutively on. This construct was cloned into Escherichia coli, and functional enzyme assays confirmed that lic1A(Hs) encoded a choline kinase, and that the VNTR were not required for expression of a functional gene product. Variation in the number of VNTR in lic1A(Hs) correlated with antigenic variation of ChoP expression in H. somni strain 124P. However, antigenic variation of ChoP expression in strain 738 predominately occurred through variable extension/truncation of the LOS outer core. These results indicated that the lic1(Hs) genes controlled expression of ChoP on the LOS, but that in H. somni there are two potential mechanisms that account for antigenic variation of ChoP.
Molecular characterization of phosphorylcholine expression on the lipooligosaccharide of Histophilus somni

PubMed Central

Elswaifi, Shaadi F.; St. Michael, Frank; Sreenivas, Avula; Cox, Andrew; Carman, George M.; Inzana, Thomas J.

2013-01-01

Histophilus somni (Haemophilus somnus) is an important pathogen of cattle that is responsible for respiratory disease, septicemia, and systemic diseases such as thrombotic meningoencephalitis, myocarditis, and abortion. A variety of virulence factors have been identified in H. somni, including compositional and antigenic variation of the lipooligosaccharide (LOS). Phosphorylcholine (ChoP) has been identified as one of the components of H. somni LOS that undergoes antigenic variation. In this study, five genes (lic1ABCDHs and glpQ) with homology to genes responsible for ChoP expression in Haemophilus influenzae LOS were identified in the H. somni genome. An H. somni open reading frame (ORF) with homology to H. influenzae lic1A (lic1AHi) contained a variable number of tandem repeats (VNTR). However, whereas the tetranucleotide repeat 5′-CAAT-3′ is present in lic1AHi, the VNTR in H. somni lic1A (lic1AHs) consisted of 5′-AACC-3′. Due to the propensity of VNTR to vary during replication and cause the ORF to shift in and out of frame with the upstream start codon, the VNTR were deleted from lic1AHs to maintain the gene constitutively on. This construct was cloned into Escherichia coli, and functional enzyme assays confirmed that lic1AHs encoded a choline kinase, and that the VNTR were not required for expression of a functional gene product. Variation in the number of VNTR in lic1AHs correlated with antigenic variation of ChoP expression in H. somni strain 124P. However, antigenic variation of ChoP expression in strain 738 predominately occurred through variable extension/truncation of the LOS outer core. These results indicated that the lic1Hs genes controlled expression of ChoP on the LOS, but that in H. somni there are two potential mechanisms that account for antigenic variation of ChoP. PMID:19682567
Emerging genetic therapies to treat Duchenne muscular dystrophy

PubMed Central

Nelson, Stanley F.; Crosbie, Rachelle H.; Miceli, M. Carrie; Spencer, Melissa J.

2010-01-01

Purpose of review Duchenne muscular dystrophy is a progressive muscle degenerative disease caused by dystrophin mutations. The purpose of this review is to highlight two emerging therapies designed to repair the primary genetic defect, called `exon skipping' and `nonsense codon suppression'. Recent findings A drug, PTC124, was identified that suppresses nonsense codon translation termination. PTC124 can lead to restoration of some dystrophin expression in human Duchenne muscular dystrophy muscles with mutations resulting in premature stops. Two drugs developed for exon skipping, PRO051 and AVI-4658, result in the exclusion of exon 51 from mature mRNA. They can restore the translational reading frame to dystrophin transcripts from patients with a particular subset of dystrophin gene deletions and lead to some restoration of dystrophin expression in affected boys' muscle in vivo. Both approaches have concluded phase I trials with no serious adverse events. Summary These novel therapies that act to correct the primary genetic defect of dystrophin deficiency are among the first generation of therapies tailored to correct specific mutations in humans. Thus, they represent paradigm forming approaches to personalized medicine with the potential to lead to life changing treatment for those affected by Duchenne muscular dystrophy. PMID:19745732
Codon usage bias in prokaryotic pyrimidine-ending codons is associated with the degeneracy of the encoded amino acids

PubMed Central

Wald, Naama; Alroy, Maya; Botzman, Maya; Margalit, Hanah

2012-01-01

Synonymous codons are unevenly distributed among genes, a phenomenon termed codon usage bias. Understanding the patterns of codon bias and the forces shaping them is a major step towards elucidating the adaptive advantage codon choice can confer at the level of individual genes and organisms. Here, we perform a large-scale analysis to assess codon usage bias pattern of pyrimidine-ending codons in highly expressed genes in prokaryotes. We find a bias pattern linked to the degeneracy of the encoded amino acid. Specifically, we show that codon-pairs that encode two- and three-fold degenerate amino acids are biased towards the C-ending codon while codons encoding four-fold degenerate amino acids are biased towards the U-ending codon. This codon usage pattern is widespread in prokaryotes, and its strength is correlated with translational selection both within and between organisms. We show that this bias is associated with an improved correspondence with the tRNA pool, avoidance of mis-incorporation errors during translation and moderate stability of codon–anticodon interaction, all consistent with more efficient translation. PMID:22581775
Positive and negative feedback regulatory loops of thiol-oxidative stress response mediated by an unstable isoform of sigmaR in actinomycetes.

PubMed

Kim, Min-Sik; Hahn, Mi-Young; Cho, Yoobok; Cho, Sang-Nae; Roe, Jung-Hye

2009-09-01

Alternate sigma factors provide an effective way of diversifying bacterial gene expression in response to environmental changes. In Streptomyces coelicolor where more than 65 sigma factors are predicted, sigma(R) is the major regulator for response to thiol-oxidative stresses. sigma(R) becomes available when its bound anti-sigma factor RsrA is oxidized at sensitive cysteine thiols to form disulphide bonds. sigma(R) regulon includes genes for itself and multiple thiol-reducing systems, which constitute positive and negative feedback loops respectively. We found that the positive amplification loop involves an isoform of sigma(R) (sigma(R')) with an N-terminal extension of 55 amino acids, produced from an upstream start codon. A major difference between constitutive sigma(R) and inducible sigma(R') is that the latter is markedly unstable (t(1/2) approximately 10 min) compared with the former (> 70 min). The rapid turnover of sigma(R') is partly due to induced ClpP1/P2 proteases from the sigma(R) regulon. This represents a novel way of elaborating positive and negative feedback loops in a control circuit. Similar phenomenon may occur in other actinomycetes that harbour multiple start codons in the sigR homologous gene. We observed that sigH gene, the sigR orthologue in Mycobacterium smegmatis, produces an unstable larger isoform of sigma(H) upon induction by thiol-oxidative stress.
Multiple Transcript Properties Related to Translation Affect mRNA Degradation Rates in Saccharomyces cerevisiae

PubMed Central

Neymotin, Benjamin; Ettorre, Victoria; Gresham, David

2016-01-01

Degradation of mRNA contributes to variation in transcript abundance. Studies of individual mRNAs have shown that both cis and trans factors affect mRNA degradation rates. However, the factors underlying transcriptome-wide variation in mRNA degradation rates are poorly understood. We investigated the contribution of different transcript properties to transcriptome-wide degradation rate variation in the budding yeast, Saccharomyces cerevisiae, using multiple regression analysis. We find that multiple transcript properties are significantly associated with variation in mRNA degradation rates, and that a model incorporating these properties explains ∼50% of the genome-wide variance. Predictors of mRNA degradation rates include transcript length, ribosome density, biased codon usage, and GC content of the third position in codons. To experimentally validate these factors, we studied individual transcripts expressed from identical promoters. We find that decreasing ribosome density by mutating the first translational start site of a transcript increases its degradation rate. Using coding sequence variants of green fluorescent protein (GFP) that differ only at synonymous sites, we show that increased GC content of the third position of codons results in decreased rates of mRNA degradation. Thus, in steady-state conditions, a large fraction of genome-wide variation in mRNA degradation rates is determined by inherent properties of transcripts, many of which are related to translation, rather than specific regulatory mechanisms. PMID:27633789

Genomic position affects the expression of tobacco mosaic virus movement and coat protein genes.

PubMed Central

Culver, J N; Lehto, K; Close, S M; Hilf, M E; Dawson, W O

1993-01-01

Alterations in the genomic position of the tobacco mosaic virus (TMV) genes encoding the 30-kDa cell-to-cell movement protein or the coat protein greatly affected their expression. Higher production of 30-kDa protein was correlated with increased proximity of the gene to the viral 3' terminus. A mutant placing the 30-kDa open reading frame 207 nucleotides nearer the 3' terminus produced at least 4 times the wild-type TMV 30-kDa protein level, while a mutant placing the 30-kDa open reading frame 470 nucleotides closer to the 3' terminus produced at least 8 times the wild-type TMV 30-kDa protein level. Increases in 30-kDa protein production were not correlated with the subgenomic mRNA promoter (SGP) controlling the 30-kDa gene, since mutants with either the native 30-kDa SGP or the coat protein SGP in front of the 30-kDa gene produced similar levels of 30-kDa protein. Lack of coat protein did not affect 30-kDa protein expression, since a mutant with the coat protein start codon removed did not produce increased amounts of 30-kDa protein. Effects of gene positioning on coat protein expression were examined by using a mutant containing two different tandemly positioned tobamovirus (TMV and Odontoglossum ringspot virus) coat protein genes. Only coat protein expressed from the gene positioned nearest the 3' viral terminus was detected. Analysis of 30-kDa and coat protein subgenomic mRNAs revealed no proportional increase in the levels of mRNA relative to the observed levels of 30-kDa and coat proteins. This suggests that a translational mechanism is primarily responsible for the observed effect of genomic position on expression of 30-kDa movement and coat protein genes. Images Fig. 2 Fig. 3 Fig. 4 PMID:8446627
Translation efficiencies of synonymous codons are not always correlated with codon usage in tobacco chloroplasts.

PubMed

Nakamura, Masayuki; Sugiura, Masahiro

2007-01-01

Codon usage in chloroplasts is different from that in prokaryotic and eukaryotic nuclear genomes. However, no experimental approach has been made to analyse the translation efficiency of individual codons in chloroplasts. We devised an in vitro assay for translation efficiencies using synthetic mRNAs, and measured the translation efficiencies of five synonymous codon groups in tobacco chloroplasts. Among four alanine codons (GCN, where N is U, C, A or G), GCU was the most efficient for translation, whereas the chloroplast genome lacks tRNA genes corresponding to GCU. Phenylalanine and tyrosine are each encoded by two codons (UUU/C and UAU/C, respectively). Phenylalanine UUC and tyrosine UAC were translated more than twice as efficiently than UUU and UAU, respectively, contrary to their codon usage, whereas translation efficiencies of synonymous codons for alanine, aspartic acid and asparagine were parallel to their codon usage. These observations indicate that translation efficiencies of individual codons are not always correlated with codon usage in vitro in chloroplasts. This raises an important issue for foreign gene expression in chloroplasts.
Integrated analysis of individual codon contribution to protein biosynthesis reveals a new approach to improving the basis of rational gene design

PubMed Central

Villada, Juan C.; Brustolini, Otávio José Bernardes

2017-01-01

Abstract Gene codon optimization may be impaired by the misinterpretation of frequency and optimality of codons. Although recent studies have revealed the effects of codon usage bias (CUB) on protein biosynthesis, an integrated perspective of the biological role of individual codons remains unknown. Unlike other previous studies, we show, through an integrated framework that attributes of codons such as frequency, optimality and positional dependency should be combined to unveil individual codon contribution for protein biosynthesis. We designed a codon quantification method for assessing CUB as a function of position within genes with a novel constraint: the relativity of position-dependent codon usage shaped by coding sequence length. Thus, we propose a new way of identifying the enrichment, depletion and non-uniform positional distribution of codons in different regions of yeast genes. We clustered codons that shared attributes of frequency and optimality. The cluster of non-optimal codons with rare occurrence displayed two remarkable characteristics: higher codon decoding time than frequent–non-optimal cluster and enrichment at the 5′-end region, where optimal codons with the highest frequency are depleted. Interestingly, frequent codons with non-optimal adaptation to tRNAs are uniformly distributed in the Saccharomyces cerevisiae genes, suggesting their determinant role as a speed regulator in protein elongation. PMID:28449100
Integrated analysis of individual codon contribution to protein biosynthesis reveals a new approach to improving the basis of rational gene design.

PubMed

Villada, Juan C; Brustolini, Otávio José Bernardes; Batista da Silveira, Wendel

2017-08-01

Gene codon optimization may be impaired by the misinterpretation of frequency and optimality of codons. Although recent studies have revealed the effects of codon usage bias (CUB) on protein biosynthesis, an integrated perspective of the biological role of individual codons remains unknown. Unlike other previous studies, we show, through an integrated framework that attributes of codons such as frequency, optimality and positional dependency should be combined to unveil individual codon contribution for protein biosynthesis. We designed a codon quantification method for assessing CUB as a function of position within genes with a novel constraint: the relativity of position-dependent codon usage shaped by coding sequence length. Thus, we propose a new way of identifying the enrichment, depletion and non-uniform positional distribution of codons in different regions of yeast genes. We clustered codons that shared attributes of frequency and optimality. The cluster of non-optimal codons with rare occurrence displayed two remarkable characteristics: higher codon decoding time than frequent-non-optimal cluster and enrichment at the 5'-end region, where optimal codons with the highest frequency are depleted. Interestingly, frequent codons with non-optimal adaptation to tRNAs are uniformly distributed in the Saccharomyces cerevisiae genes, suggesting their determinant role as a speed regulator in protein elongation. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Complete mitochondrial genome of the Yellow-spotted skate Okamejei hollandi (Rajiformes: Rajidae).

PubMed

Li, Weidong; Chen, Xiao; Liu, Wenai; Sun, Renjie; Zhou, Haolang

2016-07-01

The complete mitochondrial genome of the Yellow-spotted skate Okamejei hollandi was determined in this study. It is 16,974 bp in length and contains 13 protein-coding genes, two rRNA genes, 22 tRNA genes, and one putative control region. The overall base composition is 30.5% A, 27.8% C, 14.0% G, and 27.8% T. There are 28 bp short intergenic spaces located in 12 gene junctions and 31 bp overlaps located in nine gene junctions in the whole mitogenome. Two start codons (ATG and GTG) and two stop codons (TAG and TAA/T) were used in the protein-coding genes. The lengths of 22 tRNA genes range from 68 (tRNA-Ser2) to 75 (tRNA-Leu1) bp. The origin of L-strand replication (OL) sequence (37 bp) was identified between the tRNA-Asn and tRNA-Cys genes. The control region is 1311 bp in length with high A + T and poor G content.
Transcriptome Analysis of Core Dinoflagellates Reveals a Universal Bias towards "GC" Rich Codons.

PubMed

Williams, Ernest; Place, Allen; Bachvaroff, Tsvetan

2017-04-27

Although dinoflagellates are a potential source of pharmaceuticals and natural products, the mechanisms for regulating and producing these compounds are largely unknown because of extensive post-transcriptional control of gene expression. One well-documented mechanism for controlling gene expression during translation is codon bias, whereby specific codons slow or even terminate protein synthesis. Approximately 10,000 annotatable genes from fifteen "core" dinoflagellate transcriptomes along a range of overall guanine and cytosine (GC) content were used for codonW analysis to determine the relative synonymous codon usage (RSCU) and the GC content at each codon position. GC bias in the analyzed dataset and at the third codon position varied from 51% and 54% to 66% and 88%, respectively. Codons poor in GC were observed to be universally absent, but bias was most pronounced for codons ending in uracil followed by adenine (UA). GC bias at the third codon position was able to explain low abundance codons as well as the low effective number of codons. Thus, we propose that a bias towards codons rich in GC bases is a universal feature of core dinoflagellates, possibly relating to their unique chromosome structure, and not likely a major mechanism for controlling gene expression.
Promoter analysis of the membrane protein gp64 gene of the cellular slime mold Polysphondylium pallidum.

PubMed

Takaoka, N; Fukuzawa, M; Saito, T; Sakaitani, T; Ochiai, H

1999-10-28

We cloned a genomic fragment of the membrane protein gp64 gene of the cellular slime mold Polysphondylium pallidum by inverse PCR. Primer extension analysis identified a major transcription start site 65 bp upstream of the translation start codon. The promoter region of the gp64 gene contains sequences homologous to a TATA box at position -47 to -37 and to an initiator (Inr, PyPyCAPyPyPyPy) at position -3 to +5 from the transcription start site. Successively truncated segments of the promoter were tested for their ability to drive expression of the beta-galactosidase reporter gene in transformed cells; also the difference in activity between growth conditions was compared. The results indicated that there are two positive vegetative regulatory elements extending between -187 and -62 bp from the transcription start site of the gp64 promoter; also their activity was two to three times higher in the cells grown with bacteria in shaken suspension than in the cells grown in an axenic medium.
Leaderless mRNAs are circularized in Chlamydomonas reinhardtii mitochondria.

PubMed

Cahoon, A Bruce; Qureshi, Ali A

2018-06-01

The mitochondrial genome of Chlamydomonas reinhardtii encodes eight protein coding genes transcribed on two polycistronic primary transcripts. The mRNAs are endonucleolytically cleaved from these transcripts directly upstream of their AUG start codons, creating leaderless mRNAs with 3' untranslated regions (UTR) comprised of most or all of their downstream intergenic regions. In this report, we provide evidence that these processed linear mRNAs are circularized, which places the 3' UTR upstream of the 5' start codon, creating a leader sequence ex post facto. The circular mRNAs were found to be ribosome associate by polysome profiling experiments suggesting they are translated. Sequencing of the 3'-5' junctions of the circularized mRNAs found the intra-molecular ligations occurred between fully processed 5' ends (the start AUG) and a variable 3' terminus. For five genes (cob, cox, nd2, nd4, and nd6), some of the 3' ends maintained an oligonucleotide addition during ligation, and for two of them, cob and nd6, these 3' termini were the most commonly recovered sequence. Previous reports have shown that after cleavage, three untemplated oligonucleotide additions may occur on the 3' termini of these mRNAs-adenylation, uridylylation, or cytidylation. These results suggest oligo(U) and oligo(C) additions may be part of the maturation process since they are maintained in the circular mRNAs. Circular RNAs occur in organisms across the biological spectrum, but their purpose in some systems, such as organelles (mitochondria and chloroplasts) is unclear. We hypothesize, that in C. reinhardtii mitochondria it may create a leader sequence to facilitate translation initiation, which may negate the need for an alternative translation initiation mechanism in this system, as previously speculated. In addition, circularization may play a protective role against exonucleases, and/or increase translational productivity.
Molecular cloning and sequence analysis of the gene coding for the 57kDa soluble antigen of the salmonid fish pathogen Renibacterium salmoninarum

USGS Publications Warehouse

Chien, Maw-Sheng; Gilbert , Teresa L.; Huang, Chienjin; Landolt, Marsha L.; O'Hara, Patrick J.; Winton, James R.

1992-01-01

The complete sequence coding for the 57-kDa major soluble antigen of the salmonid fish pathogen, Renibacterium salmoninarum, was determined. The gene contained an opening reading frame of 1671 nucleotides coding for a protein of 557 amino acids with a calculated Mr value of 57190. The first 26 amino acids constituted a signal peptide. The deduced sequence for amino acid residues 27–61 was in agreement with the 35 N-terminal amino acid residues determined by microsequencing, suggesting the protein in synthesized as a 557-amino acid precursor and processed to produce a mature protein of Mr 54505. Two regions of the protein contained imperfect direct repeats. The first region contained two copies of an 81-residue repeat, the second contained five copies of an unrelated 25-residue repeat. Also, a perfect inverted repeat (including three in-frame UAA stop codons) was observed at the carboxyl-terminus of the gene.
Chloroplast DNA codon use: evidence for selection at the psb A locus based on tRNA availability.

PubMed

Morton, B R

1993-09-01

Codon use in the three sequenced chloroplast genomes (Marchantia, Oryza, and Nicotiana) is examined. The chloroplast has a bias in that codons NNA and NNT are favored over synonymous NNC and NNG codons. This appears to be a consequence of an overall high A + T content of the genome. This pattern of codon use is not followed by the psb A gene of all three genomes and other psb A sequences examined. In this gene, the codon use favors NNC over NNT for twofold degenerate amino acids. In each case the only tRNA coded by the genome is complementary to the NNC codon. This codon use is similar to the codon use by chloroplast genes examined from Chlamydomonas reinhardtii. Since psb A is the major translation product of the chloroplast, this suggests that selection is acting on the codon use of this gene to adapt codons to tRNA availability, as previously suggested for unicellular organisms.
File Compression and Expansion of the Genetic Code by the use of the Yin/Yang Directions to find its Sphered Cube

PubMed Central

Castro-Chavez, Fernando

2014-01-01

Objective The objective of this article is to demonstrate that the genetic code can be studied and represented in a 3-D Sphered Cube for bioinformatics and for education by using the graphical help of the ancient “Book of Changes” or I Ching for the comparison, pair by pair, of the three basic characteristics of nucleotides: H-bonds, molecular structure, and their tautomerism. Methods The source of natural biodiversity is the high plasticity of the genetic code, analyzable with a reverse engineering of its 2-D and 3-D representations (here illustrated), but also through the classical 64-hexagrams of the ancient I Ching, as if they were the 64-codons or words of the genetic code. Results In this article, the four elements of the Yin/Yang were found by correlating the 3×2=6 sets of Cartesian comparisons of the mentioned properties of nucleic acids, to the directionality of their resulting blocks of codons grouped according to their resulting amino acids and/or functions, integrating a 384-codon Sphered Cube whose function is illustrated by comparing six brain peptides and a promoter of osteoblasts from Humans versus Neanderthal, as well as to Negadi’s work on the importance of the number 384 within the genetic code. Conclusions Starting with the codon/anticodon correlation of Nirenberg, published in full here for the first time, and by studying the genetic code and its 3-D display, the buffers of reiteration within codons codifying for the same amino acid, displayed the two long (binary number one) and older Yin/Yang arrows that travel in opposite directions, mimicking the parental DNA strands, while annealing to the two younger and broken (binary number zero) Yin/Yang arrows, mimicking the new DNA strands; the graphic analysis of the of the genetic code and its plasticity was helpful to compare compatible sequences (human compatible to human versus neanderthal compatible to neanderthal), while further exploring the wondrous biodiversity of nature for educational purposes. PMID:25340175
An initiator codon mutation in SDE2 causes recessive embryonic lethality in Holstein cattle.

PubMed

Fritz, Sébastien; Hoze, Chris; Rebours, Emmanuelle; Barbat, Anne; Bizard, Méline; Chamberlain, Amanda; Escouflaire, Clémentine; Vander Jagt, Christy; Boussaha, Mekki; Grohs, Cécile; Allais-Bonnet, Aurélie; Philippe, Maëlle; Vallée, Amélie; Amigues, Yves; Hayes, Benjamin J; Boichard, Didier; Capitan, Aurélien

2018-04-18

Researching depletions in homozygous genotypes for specific haplotypes among the large cohorts of animals genotyped for genomic selection is a very efficient strategy to map recessive lethal mutations. In this study, by analyzing real or imputed Illumina BovineSNP50 (Illumina Inc., San Diego, CA) genotypes from more than 250,000 Holstein animals, we identified a new locus called HH6 showing significant negative effects on conception rate and nonreturn rate at 56 d in at-risk versus control mating. We fine-mapped this locus in a 1.1-Mb interval and analyzed genome sequence data from 12 carrier and 284 noncarrier Holstein bulls. We report the identification of a strong candidate mutation in the gene encoding SDE2 telomere maintenance homolog (SDE2), a protein essential for genomic stability in eukaryotes. This A-to-G transition changes the initiator ATG (methionine) codon to ACG because the gene is transcribed on the reverse strand. Using RNA sequencing and quantitative reverse-transcription PCR, we demonstrated that this mutation does not significantly affect SDE2 splicing and expression level in heterozygous carriers compared with control animals. Initiation of translation at the closest in-frame methionine codon would truncate the SDE2 precursor by 83 amino acids, including the cleavage site necessary for its activation. Finally, no homozygote for the G allele was observed in a large population of nearly 29,000 individuals genotyped for the mutation. The low frequency (1.3%) of the derived allele in the French population and the availability of a diagnostic test on the Illumina EuroG10K SNP chip routinely used for genomic evaluation will enable rapid and efficient selection against this deleterious mutation. Copyright © 2018 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Molecular Characterization of the Complete Genome of Three Basal-BR Isolates of Turnip mosaic virus Infecting Raphanus sativus in China.

PubMed

Zhu, Fuxiang; Sun, Ying; Wang, Yan; Pan, Hongyu; Wang, Fengting; Zhang, Xianghui; Zhang, Yanhua; Liu, Jinliang

2016-06-04

Turnip mosaic virus (TuMV) infects crops of plant species in the family Brassicaceae worldwide. TuMV isolates were clustered to five lineages corresponding to basal-B, basal-BR, Asian-BR, world-B and OMs. Here, we determined the complete genome sequences of three TuMV basal-BR isolates infecting radish from Shandong and Jilin Provinces in China. Their genomes were all composed of 9833 nucleotides, excluding the 3'-terminal poly(A) tail. They contained two open reading frames (ORFs), with the large one encoding a polyprotein of 3164 amino acids and the small overlapping ORF encoding a PIPO protein of 61 amino acids, which contained the typically conserved motifs found in members of the genus Potyvirus. In pairwise comparison with 30 other TuMV genome sequences, these three isolates shared their highest identities with isolates from Eurasian countries (Germany, Italy, Turkey and China). Recombination analysis showed that the three isolates in this study had no "clear" recombination. The analyses of conserved amino acids changed between groups showed that the codons in the TuMV out group (OGp) and OMs group were the same at three codon sites (852, 1006, 1548), and the other TuMV groups (basal-B, basal-BR, Asian-BR, world-B) were different. This pattern suggests that the codon in the OMs progenitor did not change but that in the other TuMV groups the progenitor sequence did change at divergence. Genetic diversity analyses indicate that the PIPO gene was under the highest selection pressure and the selection pressure on P3N-PIPO and P3 was almost the same. It suggests that most of the selection pressure on P3 was probably imposed through P3N-PIPO.
Transcriptome Analysis of Core Dinoflagellates Reveals a Universal Bias towards “GC” Rich Codons

PubMed Central

Williams, Ernest; Place, Allen; Bachvaroff, Tsvetan

2017-01-01

Although dinoflagellates are a potential source of pharmaceuticals and natural products, the mechanisms for regulating and producing these compounds are largely unknown because of extensive post-transcriptional control of gene expression. One well-documented mechanism for controlling gene expression during translation is codon bias, whereby specific codons slow or even terminate protein synthesis. Approximately 10,000 annotatable genes from fifteen “core” dinoflagellate transcriptomes along a range of overall guanine and cytosine (GC) content were used for codonW analysis to determine the relative synonymous codon usage (RSCU) and the GC content at each codon position. GC bias in the analyzed dataset and at the third codon position varied from 51% and 54% to 66% and 88%, respectively. Codons poor in GC were observed to be universally absent, but bias was most pronounced for codons ending in uracil followed by adenine (UA). GC bias at the third codon position was able to explain low abundance codons as well as the low effective number of codons. Thus, we propose that a bias towards codons rich in GC bases is a universal feature of core dinoflagellates, possibly relating to their unique chromosome structure, and not likely a major mechanism for controlling gene expression. PMID:28448468
The complete mitochondrial genome of Pomacea canaliculata (Gastropoda: Ampullariidae).

PubMed

Zhou, Xuming; Chen, Yu; Zhu, Shanliang; Xu, Haigen; Liu, Yan; Chen, Lian

2016-01-01

The mitochondrial genome of Pomacea canaliculata (Gastropoda: Ampullariidae) is the first complete mtDNA sequence reported in the genus Pomacea. The total length of mtDNA is 15,707 bp, which containing 13 protein-coding genes, 2 ribosomal RNAs, 22 transfer RNAs, and a 359 bp non-coding region. The A + T content of the overall base composition of H-strand is 71.7% (T: 41%, C: 12.7%, A: 30.7%, G: 15.6%). ATP6, ATP8, CO1, CO2, ND1-3, ND5, ND6, ND4L and Cyt b genes begin with ATG as start codon, CO3 and ND4 begin with ATA. ATP8, CO2-3, ND4L, ND2-6 and Cyt b genes are terminated with TAA as stop codon, ATP6, ND1, and CO1 end with TAG. A long non-coding region is found and a 23 bp repeat unit repeat 11 times in this region.
Systematic bacterialization of yeast genes identifies a near-universally swappable pathway

PubMed Central

Kachroo, Aashiq H; Laurent, Jon M; Akhmetov, Azat; Szilagyi-Jones, Madelyn; McWhite, Claire D; Zhao, Alice; Marcotte, Edward M

2017-01-01

Eukaryotes and prokaryotes last shared a common ancestor ~2 billion years ago, and while many present-day genes in these lineages predate this divergence, the extent to which these genes still perform their ancestral functions is largely unknown. To test principles governing retention of ancient function, we asked if prokaryotic genes could replace their essential eukaryotic orthologs. We systematically replaced essential genes in yeast by their 1:1 orthologs from Escherichia coli. After accounting for mitochondrial localization and alternative start codons, 31 out of 51 bacterial genes tested (61%) could complement a lethal growth defect and replace their yeast orthologs with minimal effects on growth rate. Replaceability was determined on a pathway-by-pathway basis; codon usage, abundance, and sequence similarity contributed predictive power. The heme biosynthesis pathway was particularly amenable to inter-kingdom exchange, with each yeast enzyme replaceable by its bacterial, human, or plant ortholog, suggesting it as a near-universally swappable pathway. DOI: http://dx.doi.org/10.7554/eLife.25093.001 PMID:28661399
Complete mitochondrial genome of the agarophyte red alga Gelidium vagum (Gelidiales).

PubMed

Yang, Eun Chan; Kim, Kyeong Mi; Boo, Ga Hun; Lee, Jung-Hyun; Boo, Sung Min; Yoon, Hwan Su

2014-08-01

We describe the first complete mitochondrial genome of Gelidium vagum (Gelidiales) (24,901 bp, 30.4% GC content), an agar-producing red alga. The circular mitochondrial genome contains 43 genes, including 23 protein-coding, 18 tRNA and 2 rRNA genes. All the protein-coding genes have a typical ATG start codon. No introns were found. Two genes, secY and rps12, were overlapped by 41 bp.
Effect of Estrogen on Mutagenesis in Human Mammary Epithelial Cells

DTIC Science & Technology

2005-06-01

instability remains undefined in most human cancers, it appears to arise from subtle, intragenic mutations of the genes , whose products play a key role in...cells and is less labor-intensive. A G-G or T-G mismatch was introduced into ATG start codon of the enhanced green fluorescent protein (EGFP) gene ...Repair of the G-G or T-G mismatch to G-C or T-A, respectively in the heteroduplex plasmid generates a functional EGFP gene expression. The heteroduplex
Genome-wide analysis of codon usage bias in Ebolavirus.

PubMed

Cristina, Juan; Moreno, Pilar; Moratorio, Gonzalo; Musto, Héctor

2015-01-22

Ebola virus (EBOV) is a member of the family Filoviridae and its genome consists of a 19-kb, single-stranded, negative sense RNA. EBOV is subdivided into five distinct species with different pathogenicities, being Zaire ebolavirus (ZEBOV) the most lethal species. The interplay of codon usage among viruses and their hosts is expected to affect overall viral survival, fitness, evasion from host's immune system and evolution. In the present study, we performed comprehensive analyses of codon usage and composition of ZEBOV. Effective number of codons (ENC) indicates that the overall codon usage among ZEBOV strains is slightly biased. Different codon preferences in ZEBOV genes in relation to codon usage of human genes were found. Highly preferred codons are all A-ending triplets, which strongly suggests that mutational bias is a main force shaping codon usage in ZEBOV. Dinucleotide composition also plays a role in the overall pattern of ZEBOV codon usage. ZEBOV does not seem to use the most abundant tRNAs present in the human cells for most of their preferred codons. Copyright © 2014 Elsevier B.V. All rights reserved.
Evolution of Synonymous Codon Usage in Neurospora tetrasperma and Neurospora discreta

PubMed Central

Whittle, C. A.; Sun, Y.; Johannesson, H.

2011-01-01

Neurospora comprises a primary model system for the study of fungal genetics and biology. In spite of this, little is known about genome evolution in Neurospora. For example, the evolution of synonymous codon usage is largely unknown in this genus. In the present investigation, we conducted a comprehensive analysis of synonymous codon usage and its relationship to gene expression and gene length (GL) in Neurospora tetrasperma and Neurospora discreta. For our analysis, we examined codon usage among 2,079 genes per organism and assessed gene expression using large-scale expressed sequenced tag (EST) data sets (279,323 and 453,559 ESTs for N. tetrasperma and N. discreta, respectively). Data on relative synonymous codon usage revealed 24 codons (and two putative codons) that are more frequently used in genes with high than with low expression and thus were defined as optimal codons. Although codon-usage bias was highly correlated with gene expression, it was independent of selectively neutral base composition (introns); thus demonstrating that translational selection drives synonymous codon usage in these genomes. We also report that GL (coding sequences [CDS]) was inversely associated with optimal codon usage at each gene expression level, with highly expressed short genes having the greatest frequency of optimal codons. Optimal codon frequency was moderately higher in N. tetrasperma than in N. discreta, which might be due to variation in selective pressures and/or mating systems. PMID:21402862

Genomic analysis of codon usage shows influence of mutation pressure, natural selection, and host features on Marburg virus evolution.

PubMed

Nasrullah, Izza; Butt, Azeem M; Tahir, Shifa; Idrees, Muhammad; Tong, Yigang

2015-08-26

The Marburg virus (MARV) has a negative-sense single-stranded RNA genome, belongs to the family Filoviridae, and is responsible for several outbreaks of highly fatal hemorrhagic fever. Codon usage patterns of viruses reflect a series of evolutionary changes that enable viruses to shape their survival rates and fitness toward the external environment and, most importantly, their hosts. To understand the evolution of MARV at the codon level, we report a comprehensive analysis of synonymous codon usage patterns in MARV genomes. Multiple codon analysis approaches and statistical methods were performed to determine overall codon usage patterns, biases in codon usage, and influence of various factors, including mutation pressure, natural selection, and its two hosts, Homo sapiens and Rousettus aegyptiacus. Nucleotide composition and relative synonymous codon usage (RSCU) analysis revealed that MARV shows mutation bias and prefers U- and A-ended codons to code amino acids. Effective number of codons analysis indicated that overall codon usage among MARV genomes is slightly biased. The Parity Rule 2 plot analysis showed that GC and AU nucleotides were not used proportionally which accounts for the presence of natural selection. Codon usage patterns of MARV were also found to be influenced by its hosts. This indicates that MARV have evolved codon usage patterns that are specific to both of its hosts. Moreover, selection pressure from R. aegyptiacus on the MARV RSCU patterns was found to be dominant compared with that from H. sapiens. Overall, mutation pressure was found to be the most important and dominant force that shapes codon usage patterns in MARV. To our knowledge, this is the first detailed codon usage analysis of MARV and extends our understanding of the mechanisms that contribute to codon usage and evolution of MARV.
Analysis of transcriptome data reveals multifactor constraint on codon usage in Taenia multiceps.

PubMed

Huang, Xing; Xu, Jing; Chen, Lin; Wang, Yu; Gu, Xiaobin; Peng, Xuerong; Yang, Guangyou

2017-04-20

Codon usage bias (CUB) is an important evolutionary feature in genomes that has been widely observed in many organisms. However, the synonymous codon usage pattern in the genome of T. multiceps remains to be clarified. In this study, we analyzed the codon usage of T. multiceps based on the transcriptome data to reveal the constraint factors and to gain an improved understanding of the mechanisms that shape synonymous CUB. Analysis of a total of 8,620 annotated mRNA sequences from T. multiceps indicated only a weak codon bias, with mean GC and GC3 content values of 49.29% and 51.43%, respectively. Our analysis indicated that nucleotide composition, mutational pressure, natural selection, gene expression level, amino acids with grand average of hydropathicity (GRAVY) and aromaticity (Aromo) and the effective selection of amino-acids all contributed to the codon usage in T. multiceps. Among these factors, natural selection was implicated as the major factor affecting the codon usage variation in T. multiceps. The codon usage of ribosome genes was affected mainly by mutations, while the essential genes were affected mainly by selection. In addition, 21codons were identified as "optimal codons". Overall, the optimal codons were GC-rich (GC:AU, 41:22), and ended with G or C (except CGU). Furthermore, different degrees of variation in codon usage were found between T. multiceps and Escherichia coli, yeast, Homo sapiens. However, little difference was found between T. multiceps and Taenia pisiformis. In this study, the codon usage pattern of T. multiceps was analyzed systematically and factors affected CUB were also identified. This is the first study of codon biology in T. multiceps. Understanding the codon usage pattern in T. multiceps can be helpful for the discovery of new genes, molecular genetic engineering and evolutionary studies.
Characterization of the porcine epidemic diarrhea virus codon usage bias.

PubMed

Chen, Ye; Shi, Yuzhen; Deng, Hongjuan; Gu, Ting; Xu, Jian; Ou, Jinxin; Jiang, Zhiguo; Jiao, Yiren; Zou, Tan; Wang, Chong

2014-12-01

Porcine epidemic diarrhea virus (PEDV) has been responsible for several recent outbreaks of porcine epidemic diarrhea (PED) and has caused great economic loss in the swine-raising industry. Considering the significance of PEDV, a systemic analysis was performed to study its codon usage patterns. The relative synonymous codon usage value of each codon revealed that codon usage bias exists and that PEDV tends to use codons that end in T. The mean ENC value of 47.91 indicates that the codon usage bias is low. However, we still wanted to identify the cause of this codon usage bias. A correlation analysis between the codon compositions (A3s, T3s, G3s, C3s, and GC3s), the ENC values, and the nucleotide contents (A%, T%, G%, C%, and GC%) indicated that mutational bias plays role in shaping the PEDV codon usage bias. This was further confirmed by a principal component analysis between the codon compositions and the axis values. Using the Gravy, Aroma, and CAI values, a role of natural selection in the PEDV codon usage pattern was also identified. Neutral analysis indicated that natural selection pressure plays a more important role than mutational bias in codon usage bias. Natural selection also plays an increasingly significant role during PEDV evolution. Additionally, gene function and geographic distribution also influence the codon usage bias to a degree. Copyright © 2014 Elsevier B.V. All rights reserved.
Synonymous codon usage of genes in polymerase complex of Newcastle disease virus.

PubMed

Kumar, Chandra Shekhar; Kumar, Sachin

2017-06-01

Newcastle disease virus (NDV) is pathogenic to both avian and non-avian species but extensively finds poultry as its primary host and causes heavy economic losses in the poultry industry. In this study, a total of 186 polymerase complex comprising of nucleoprotein (N), phosphoprotein (P), and large polymerase (L) genes of NDV was analyzed for synonymous codon usage. The relative synonymous codon usage and effective number of codons (ENC) values were used to estimate codon usage variation in each gene. Correspondence analysis (COA) was used to study the major trend in codon usage variation. Analyzing the ENC plot values against GC3s (at synonymous third codon position) we concluded that mutational pressure was the main factor determining codon usage bias than translational selection in NDV N, P, and L genes. Moreover, correlation analysis indicated, that aromaticity of N, P, and L genes also influenced the codon usage variation. The varied distribution of pathotypes for N, P, and L gene clearly suggests that change in codon usage for NDV is pathotype specific. The codon usage preference similarity in N, P, and L gene might be detrimental for polymerase complex functioning. The study represents a comprehensive analysis to date of N, P, and L genes codon usage pattern of NDV and provides a basic understanding of the mechanisms for codon usage bias. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Codon usage bias in phylum Actinobacteria: relevance to environmental adaptation and host pathogenicity.

PubMed

Lal, Devi; Verma, Mansi; Behura, Susanta K; Lal, Rup

2016-10-01

Actinobacteria are Gram-positive bacteria commonly found in soil, freshwater and marine ecosystems. In this investigation, bias in codon usages of ninety actinobacterial genomes was analyzed by estimating different indices of codon bias such as Nc (effective number of codons), SCUO (synonymous codon usage order), RSCU (relative synonymous codon usage), as well as sequence patterns of codon contexts. The results revealed several characteristic features of codon usage in Actinobacteria, as follows: 1) C- or G-ending codons are used frequently in comparison with A- and U ending codons; 2) there is a direct relationship of GC content with use of specific amino acids such as alanine, proline and glycine; 3) there is an inverse relationship between GC content and Nc estimates, 4) there is low SCUO value (<0.5) for most genes; and 5) GCC-GCC, GCC-GGC, GCC-GAG and CUC-GAC are the frequent context sequences among codons. This study highlights the fact that: 1) in Actinobacteria, extreme GC content and codon bias are driven by mutation rather than natural selection; (2) traits like aerobicity are associated with effective natural selection and therefore low GC content and low codon bias, demonstrating the role of both mutational bias and translational selection in shaping the habitat and phenotype of actinobacterial species. Copyright © 2016 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
Codon usage patterns in Nematoda: analysis based on over 25 million codons in thirty-two species

PubMed Central

2006-01-01

Background Codon usage has direct utility in molecular characterization of species and is also a marker for molecular evolution. To understand codon usage within the diverse phylum Nematoda, we analyzed a total of 265,494 expressed sequence tags (ESTs) from 30 nematode species. The full genomes of Caenorhabditis elegans and C. briggsae were also examined. A total of 25,871,325 codons were analyzed and a comprehensive codon usage table for all species was generated. This is the first codon usage table available for 24 of these organisms. Results Codon usage similarity in Nematoda usually persists over the breadth of a genus but then rapidly diminishes even within each clade. Globodera, Meloidogyne, Pristionchus, and Strongyloides have the most highly derived patterns of codon usage. The major factor affecting differences in codon usage between species is the coding sequence GC content, which varies in nematodes from 32% to 51%. Coding GC content (measured as GC3) also explains much of the observed variation in the effective number of codons (R = 0.70), which is a measure of codon bias, and it even accounts for differences in amino acid frequency. Codon usage is also affected by neighboring nucleotides (N1 context). Coding GC content correlates strongly with estimated noncoding genomic GC content (R = 0.92). On examining abundant clusters in five species, candidate optimal codons were identified that may be preferred in highly expressed transcripts. Conclusion Evolutionary models indicate that total genomic GC content, probably the product of directional mutation pressure, drives codon usage rather than the converse, a conclusion that is supported by examination of nematode genomes. PMID:26271136
Switches in Genomic GC Content Drive Shifts of Optimal Codons under Sustained Selection on Synonymous Sites

PubMed Central

Sun, Yu; Tamarit, Daniel

2017-01-01

Abstract The major codon preference model suggests that codons read by tRNAs in high concentrations are preferentially utilized in highly expressed genes. However, the identity of the optimal codons differs between species although the forces driving such changes are poorly understood. We suggest that these questions can be tackled by placing codon usage studies in a phylogenetic framework and that bacterial genomes with extreme nucleotide composition biases provide informative model systems. Switches in the background substitution biases from GC to AT have occurred in Gardnerella vaginalis (GC = 32%), and from AT to GC in Lactobacillus delbrueckii (GC = 62%) and Lactobacillus fermentum (GC = 63%). We show that despite the large effects on codon usage patterns by these switches, all three species evolve under selection on synonymous sites. In G. vaginalis, the dramatic codon frequency changes coincide with shifts of optimal codons. In contrast, the optimal codons have not shifted in the two Lactobacillus genomes despite an increased fraction of GC-ending codons. We suggest that all three species are in different phases of an on-going shift of optimal codons, and attribute the difference to a stronger background substitution bias and/or longer time since the switch in G. vaginalis. We show that comparative and correlative methods for optimal codon identification yield conflicting results for genomes in flux and discuss possible reasons for the mispredictions. We conclude that switches in the direction of the background substitution biases can drive major shifts in codon preference patterns even under sustained selection on synonymous codon sites. PMID:27540085
The complete mitochondrial genome of the Aluterus monoceros.

PubMed

Li, Wenshen; Zhang, Guoqing; Wen, Xin; Wang, Qian; Chen, Guohua

2016-07-01

The complete mitochondrial genome of Aluterus monoceros (A. monoceros) has been sequenced. The mitochondrial genome of A. monoceros is 16,429 bp in length, consisting of 22 tRNA genes, 2 rRNA genes, 13 protein-coding genes and a D-loop region (Gen Bank accession number KP637022). The base A + T of the mitochondrial genome is 63.25%, including 33.16% of A, 30.09% of T and 20.74% of C. Twelve protein-coding genes start with a standard ATG as the initiation codon, expect for the COXI, which begins with GTG. Some of the termination codons are incomplete T or TA, except for the ND1, COXI, ATP8, ND4L1, ND5 and ND6, which stop with TAA. Construction of phylogenetic trees based on the entire mitochondrial genome sequence of 14 Tetrodontiformes species constructed has suggested that A. monoceros has closer relationship with Acreichthys tomentosus and Monacanthus chinensis, and they constitute a sister group.
Complete mitochondrial genome of the mottled skate: Raja pulchra (Rajiformes, Rajidae).

PubMed

Jeong, Dageum; Kim, Sung; Kim, Choong-Gon; Myoung, Jung-Goo; Lee, Youn-Ho

2016-05-01

The complete sequence of mitochondrial DNA of a mottled skate, Raja pulchra was sequenced as being circular molecules of 16,907 bp including 2 rRNA, 22 tRNA, 13 protein-coding genes (PCGs), and an AT-rich control region. The organization of the PCGs is the same as those found in other Rajidae species. The nucleotide of L-strand is composed of 29.8% A, 28.0% C, 27.9% T, and 14.3% G with a bias toward A + T slightly. Twelve of 13 PCGs are initiated by the ATG codon while COX1 starts with GTG. Only ND4 harbors the incomplete termination codon, TA. All tRNA genes have a typical clover-leaf structure of mitochondrial tRNA with the exception of [Formula: see text] which has a reduced DHU arm. This mitogenome will provide essential information for better phylogenetic resolution and precision of the family Rajidae and the genus Raja as well as for establishment of a fish stock recovery plan of the species.
A detailed analysis of codon usage patterns and influencing factors in Zika virus.

PubMed

Singh, Niraj K; Tyagi, Anuj

2017-07-01

Recent outbreaks of Zika virus (ZIKV) in Africa, Latin America, Europe, and Southeast Asia have resulted in serious health concerns. To understand more about evolution and transmission of ZIKV, detailed codon usage analysis was performed for all available strains. A high effective number of codons (ENC) value indicated the presence of low codon usage bias in ZIKV. The effect of mutational pressure on codon usage bias was confirmed by significant correlations between nucleotide compositions at third codon positions and ENCs. Correlation analysis between Gravy values, Aroma values and nucleotide compositions at third codon positions also indicated some influence of natural selection. However, the low codon adaptation index (CAI) value of ZIKV with reference to human and mosquito indicated poor adaptation of ZIKV codon usage towards its hosts, signifying that natural selection has a weaker influence than mutational pressure. Additionally, relative dinucleotide frequencies, geographical distribution, and evolutionary processes also influenced the codon usage pattern to some extent.
YrdC exhibits properties expected of a subunit for a tRNA threonylcarbamoyl transferase.

PubMed

Harris, Kimberly A; Jones, Victoria; Bilbille, Yann; Swairjo, Manal A; Agris, Paul F

2011-09-01

The post-transcriptional nucleoside modifications of tRNA's anticodon domain form the loop structure and dynamics required for effective and accurate recognition of synonymous codons. The N(6)-threonylcarbamoyladenosine modification at position 37 (t(6)A(37)), 3'-adjacent to the anticodon, of many tRNA species in all organisms ensures the accurate recognition of ANN codons by increasing codon affinity, enhancing ribosome binding, and maintaining the reading frame. However, biosynthesis of this complex modification is only partially understood. The synthesis requires ATP, free threonine, a single carbon source for the carbamoyl, and an enzyme yet to be identified. Recently, the universal protein family Sua5/YciO/YrdC was associated with t(6)A(37) biosynthesis. To further investigate the role of YrdC in t(6)A(37) biosynthesis, the interaction of the Escherichia coli YrdC with a heptadecamer anticodon stem and loop of lysine tRNA (ASL(Lys)(UUU)) was examined. YrdC bound the unmodified ASL(Lys)(UUU) with high affinity compared with the t(6)A(37)-modified ASL(Lys)(UUU) (K(d) = 0.27 ± 0.20 μM and 1.36 ± 0.39 μM, respectively). YrdC also demonstrated specificity toward the unmodified versus modified anticodon pentamer UUUUA and toward threonine and ATP. The protein did not significantly alter the ASL architecture, nor was it able to base flip A(37), as determined by NMR, circular dichroism, and fluorescence of 2-aminopuine at position 37. Thus, current data support the hypothesis that YrdC, with many of the properties of a putative threonylcarbamoyl transferase, most likely functions as a component of a heteromultimeric protein complex for t(6)A(37) biosynthesis.
Identification of a novel species of papillomavirus in giraffe lesions using nanopore sequencing.

PubMed

Vanmechelen, Bert; Bertelsen, Mads Frost; Rector, Annabel; Van den Oord, Joost J; Laenen, Lies; Vergote, Valentijn; Maes, Piet

2017-03-01

Papillomaviridae form a large family of viruses that are known to infect a variety of vertebrates, including mammals, reptiles, birds and fish. Infections usually give rise to minor skin lesions but can in some cases lead to the development of malignant neoplasia. In this study, we identified a novel species of papillomavirus (PV), isolated from warts of four giraffes (Giraffa camelopardalis). The sequence of the L1 gene was determined and found to be identical for all isolates. Using nanopore sequencing, the full sequence of the PV genome could be determined. The coding region of the genome was found to contain seven open reading frames (ORF), encoding the early proteins E1, E2 and E5-E7 as well as the late proteins L1 and L2. In addition to these ORFs, a region located within the E2 gene is thought, based on sequence similarities to other papillomaviruses, to encode an E4 protein, although no start codon could be identified. Based on the sequence of the L1 gene, this novel PV was found to be most similar to Capreolus capreolus papillomavirus 1 (CcaPV1), with 67.96% nucleotide identity. We therefore suggest that the virus identified here is given the name Giraffa camelopardalis papillomavirus 1 (GcPV1) and is classified as a novel species within the genus Deltapapillomavirus, in line with the current guidelines for the nomenclature and classification of PVs. Copyright © 2017 Elsevier B.V. All rights reserved.
CodonLogo: a sequence logo-based viewer for codon patterns.

PubMed

Sharma, Virag; Murphy, David P; Provan, Gregory; Baranov, Pavel V

2012-07-15

Conserved patterns across a multiple sequence alignment can be visualized by generating sequence logos. Sequence logos show each column in the alignment as stacks of symbol(s) where the height of a stack is proportional to its informational content, whereas the height of each symbol within the stack is proportional to its frequency in the column. Sequence logos use symbols of either nucleotide or amino acid alphabets. However, certain regulatory signals in messenger RNA (mRNA) act as combinations of codons. Yet no tool is available for visualization of conserved codon patterns. We present the first application which allows visualization of conserved regions in a multiple sequence alignment in the context of codons. CodonLogo is based on WebLogo3 and uses the same heuristics but treats codons as inseparable units of a 64-letter alphabet. CodonLogo can discriminate patterns of codon conservation from patterns of nucleotide conservation that appear indistinguishable in standard sequence logos. The CodonLogo source code and its implementation (in a local version of the Galaxy Browser) are available at http://recode.ucc.ie/CodonLogo and through the Galaxy Tool Shed at http://toolshed.g2.bx.psu.edu/.
An Upstream Open Reading Frame Is Essential for Feedback Regulation of Ascorbate Biosynthesis in Arabidopsis

PubMed Central

Laing, William A.; Martínez-Sánchez, Marcela; Wright, Michele A.; Bulley, Sean M.; Brewster, Di; Dare, Andrew P.; Rassam, Maysoon; Wang, Daisy; Storey, Roy; Macknight, Richard C.; Hellens, Roger P.

2015-01-01

Ascorbate (vitamin C) is an essential antioxidant and enzyme cofactor in both plants and animals. Ascorbate concentration is tightly regulated in plants, partly to respond to stress. Here, we demonstrate that ascorbate concentrations are determined via the posttranscriptional repression of GDP-l-galactose phosphorylase (GGP), a major control enzyme in the ascorbate biosynthesis pathway. This regulation requires a cis-acting upstream open reading frame (uORF) that represses the translation of the downstream GGP open reading frame under high ascorbate concentration. Disruption of this uORF stops the ascorbate feedback regulation of translation and results in increased ascorbate concentrations in leaves. The uORF is predicted to initiate at a noncanonical codon (ACG rather than AUG) and encode a 60- to 65-residue peptide. Analysis of ribosome protection data from Arabidopsis thaliana showed colocation of high levels of ribosomes with both the uORF and the main coding sequence of GGP. Together, our data indicate that the noncanonical uORF is translated and encodes a peptide that functions in the ascorbate inhibition of translation. This posttranslational regulation of ascorbate is likely an ancient mechanism of control as the uORF is conserved in GGP genes from mosses to angiosperms. PMID:25724639
An upstream open reading frame is essential for feedback regulation of ascorbate biosynthesis in Arabidopsis.

PubMed

Laing, William A; Martínez-Sánchez, Marcela; Wright, Michele A; Bulley, Sean M; Brewster, Di; Dare, Andrew P; Rassam, Maysoon; Wang, Daisy; Storey, Roy; Macknight, Richard C; Hellens, Roger P

2015-03-01

Ascorbate (vitamin C) is an essential antioxidant and enzyme cofactor in both plants and animals. Ascorbate concentration is tightly regulated in plants, partly to respond to stress. Here, we demonstrate that ascorbate concentrations are determined via the posttranscriptional repression of GDP-l-galactose phosphorylase (GGP), a major control enzyme in the ascorbate biosynthesis pathway. This regulation requires a cis-acting upstream open reading frame (uORF) that represses the translation of the downstream GGP open reading frame under high ascorbate concentration. Disruption of this uORF stops the ascorbate feedback regulation of translation and results in increased ascorbate concentrations in leaves. The uORF is predicted to initiate at a noncanonical codon (ACG rather than AUG) and encode a 60- to 65-residue peptide. Analysis of ribosome protection data from Arabidopsis thaliana showed colocation of high levels of ribosomes with both the uORF and the main coding sequence of GGP. Together, our data indicate that the noncanonical uORF is translated and encodes a peptide that functions in the ascorbate inhibition of translation. This posttranslational regulation of ascorbate is likely an ancient mechanism of control as the uORF is conserved in GGP genes from mosses to angiosperms. © 2015 American Society of Plant Biologists. All rights reserved.
Novel Immune Modulating Cellular Vaccine for Prostate Cancer

DTIC Science & Technology

2014-10-01

restriction sites. Murine PSMA : The cDNA encoding mPSMA was purchased from Sino Biologicals and was cloned into the HindIII and BamHI sites of pSP73-Sph/A64...sequence) and reverse primer 5’-TATATAGAGCTCTCAGATGTTCCGATACACATCTC-3’ Murine PSMA no signal sequence (mPSMA-SS): Murine PSMA minus the signal sequence...contains a HindIII site for cloning and utilizes an ATG that lies downstream of the signal sequence as the start codon in PSMA -SS ( PSMA without signal
Parallel processing spacecraft communication system

NASA Technical Reports Server (NTRS)

Bolotin, Gary S. (Inventor); Donaldson, James A. (Inventor); Luong, Huy H. (Inventor); Wood, Steven H. (Inventor)

1998-01-01

An uplink controlling assembly speeds data processing using a special parallel codeblock technique. A correct start sequence initiates processing of a frame. Two possible start sequences can be used; and the one which is used determines whether data polarity is inverted or non-inverted. Processing continues until uncorrectable errors are found. The frame ends by intentionally sending a block with an uncorrectable error. Each of the codeblocks in the frame has a channel ID. Each channel ID can be separately processed in parallel. This obviates the problem of waiting for error correction processing. If that channel number is zero, however, it indicates that the frame of data represents a critical command only. That data is handled in a special way, independent of the software. Otherwise, the processed data further handled using special double buffering techniques to avoid problems from overrun. When overrun does occur, the system takes action to lose only the oldest data.
Genome-wide analysis of codon usage bias in four sequenced cotton species.

PubMed

Wang, Liyuan; Xing, Huixian; Yuan, Yanchao; Wang, Xianlin; Saeed, Muhammad; Tao, Jincai; Feng, Wei; Zhang, Guihua; Song, Xianliang; Sun, Xuezhen

2018-01-01

Codon usage bias (CUB) is an important evolutionary feature in a genome which provides important information for studying organism evolution, gene function and exogenous gene expression. The CUB and its shaping factors in the nuclear genomes of four sequenced cotton species, G. arboreum (A2), G. raimondii (D5), G. hirsutum (AD1) and G. barbadense (AD2) were analyzed in the present study. The effective number of codons (ENC) analysis showed the CUB was weak in these four species and the four subgenomes of the two tetraploids. Codon composition analysis revealed these four species preferred to use pyrimidine-rich codons more frequently than purine-rich codons. Correlation analysis indicated that the base content at the third position of codons affect the degree of codon preference. PR2-bias plot and ENC-plot analyses revealed that the CUB patterns in these genomes and subgenomes were influenced by combined effects of translational selection, directional mutation and other factors. The translational selection (P2) analysis results, together with the non-significant correlation between GC12 and GC3, further revealed that translational selection played the dominant role over mutation pressure in the codon usage bias. Through relative synonymous codon usage (RSCU) analysis, we detected 25 high frequency codons preferred to end with T or A, and 31 low frequency codons inclined to end with C or G in these four species and four subgenomes. Finally, 19 to 26 optimal codons with 19 common ones were determined for each species and subgenomes, which preferred to end with A or T. We concluded that the codon usage bias was weak and the translation selection was the main shaping factor in nuclear genes of these four cotton genomes and four subgenomes.
A Frame-Reflective Discourse Analysis of Serious Games

ERIC Educational Resources Information Center

Mayer, Igor; Warmelink, Harald; Zhou, Qiqi

2016-01-01

The authors explore how framing theory and the method of frame-reflective discourse analysis provide foundations for the emerging discipline of serious games (SGs) research. Starting with Wittgenstein's language game and Berger and Luckmann's social constructivist view on science, the authors demonstrate why a definitional or taxonomic approach to…
Codon 219 polymorphism of PRNP in healthy caucasians and Creutzfeldt-Jakob disease patients

DOE Office of Scientific and Technical Information (OSTI.GOV)

Petraroli, R.; Pocchiari, M.

1996-04-01

A number of point and insert mutations of the PrP gene (PRNP) have been linked to familial Creutzfeldt-Jakob disease (CJD) and Gerstmann-Straussler-Scheinker disease (GSS). Moreover, the methionine/valine homozygosity at the polymorphic codon 129 of PRNP may cause a predisposition to sporadic and iatrogenic CJD or may control the age at onset of familial cases carrying either the 144-bp insertion or codon 178, codon 198, and codon 210 pathogenic mutations in PRNP. In addition, the association of methionine or valine at codon 129 and the point mutation at codon 178 on the same allele seem to play an important role inmore » determining either fatal familial insomnia or CJD. However, it is noteworthy that a relationship between codon 129 polymorphism and accelerated pathogenesis (early age at onset or shorter duration of the disease) has not been seen in familial CJD patients with codon 200 mutation or in GSS patients with codon 102 mutation, arguing that other, as yet unidentified, gene products or environmental factors, or both, may influence the clinical expression of these diseases. 17 refs.« less

Improved Prefusion Stability, Optimized Codon Usage, and Augmented Virion Packaging Enhance the Immunogenicity of Respiratory Syncytial Virus Fusion Protein in a Vectored-Vaccine Candidate

PubMed Central

Liang, Bo; Ngwuta, Joan O.; Surman, Sonja; Kabatova, Barbora; Liu, Xiang; Lingemann, Matthias; Liu, Xueqiao; Yang, Lijuan; Herbert, Richard; Swerczek, Joanna; Chen, Man; Moin, Syed M.; Kumar, Azad; McLellan, Jason S.; Kwong, Peter D.; Graham, Barney S.; Collins, Peter L.

2017-01-01

ABSTRACT Respiratory syncytial virus (RSV) is the most important viral agent of severe pediatric respiratory tract disease worldwide, but it lacks a licensed vaccine or suitable antiviral drug. A live attenuated chimeric bovine/human parainfluenza virus type 3 (rB/HPIV3) was developed previously as a vector expressing RSV fusion (F) protein to confer bivalent protection against RSV and HPIV3. In a previous clinical trial in virus-naive children, rB/HPIV3 was well tolerated but the immunogenicity of wild-type RSV F was unsatisfactory. We previously modified RSV F with a designed disulfide bond (DS) to increase stability in the prefusion (pre-F) conformation and to be efficiently packaged in the vector virion. Here, we further stabilized pre-F by adding both disulfide and cavity-filling mutations (DS-Cav1), and we also modified RSV F codon usage to have a lower CpG content and a higher level of expression. This RSV F open reading frame was evaluated in rB/HPIV3 in three forms: (i) pre-F without vector-packaging signal, (ii) pre-F with vector-packaging signal, and (iii) secreted pre-F ectodomain trimer. Despite being efficiently expressed, the secreted pre-F was poorly immunogenic. DS-Cav1 stabilized pre-F, with or without packaging, induced higher titers of pre-F specific antibodies in hamsters, and improved the quality of RSV-neutralizing serum antibodies. Codon-optimized RSV F containing fewer CpG dinucleotides had higher F expression, replicated more efficiently in vivo, and was more immunogenic. The combination of DS-Cav1 pre-F stabilization, optimized codon usage, reduced CpG content, and vector packaging significantly improved vector immunogenicity and protective efficacy against RSV. This provides an improved vectored RSV vaccine candidate suitable for pediatric clinical evaluation. IMPORTANCE RSV and HPIV3 are the first and second leading viral causes of severe pediatric respiratory disease worldwide. Licensed vaccines or suitable antiviral drugs are not available. We are developing a chimeric rB/HPIV3 vector expressing RSV F as a bivalent RSV/HPIV3 vaccine and have been evaluating means to increase RSV F immunogenicity. In this study, we evaluated the effects of improved stabilization of F in the pre-F conformation and of codon optimization resulting in reduced CpG content and greater pre-F expression. Reduced CpG content dampened the interferon response to infection, promoting higher replication and increased F expression. We demonstrate that improved pre-F stabilization and strategic manipulation of codon usage, together with efficient pre-F packaging into vector virions, significantly increased F immunogenicity in the bivalent RSV/HPIV3 vaccine. The improved immunogenicity included induction of increased titers of high-quality complement-independent antibodies with greater pre-F site Ø binding and greater protection against RSV challenge. PMID:28539444
The mitochondrial genome of the quiet-calling katydids, Xizicus fascipes (Orthoptera: Tettigoniidae: Meconematinae).

PubMed

Yang, Ming Ru; Zhou, Zhi Jun; Chang, Yan Lin; Zhao, Le Hong

2012-08-01

To help determine whether the typical arthropod arrangement was a synapomorphy for the whole Tettigoniidae, we sequenced the mitochondrial genome (mitogenome) of the quiet-calling katydids, Xizicus fascipes (Orthoptera: Tettigoniidae: Meconematinae). The 16,166-bp nucleotide sequences of X. fascipes mitogenome contains the typical gene content, gene order, base composition, and codon usage found in arthropod mitogenomes. As a whole, the X. fascipes mitogenome contains a lower A+T content (70.2%) found in the complete orthopteran mitogenomes determined to date. All protein-coding genes started with a typical ATN codon. Ten of the 13 protein-coding genes have a complete termination codon, but the remaining three genes (COIII, ND5 and ND4) terminate with incomplete T. All tRNAs have the typical clover-leaf structure of mitogenome tRNA, except for tRNA(Ser(AGN)), in which lengthened anticodon stem (9 bp) with a bulged nuleotide in the middle, an unusual T-stem (6 bp in constrast to the normal 5 bp), a mini DHU arm (2 bp) and no connector nucleotides. In the A+T-rich region, two (TA)n conserved blocks that were previously described in Ensifera and two 150-bp tandem repeats plus a partial copy of the composed at 61 bp of the beginning were present. Phylogenetic analysis found: i) the monophyly of Conocephalinae was interrupted by Elimaea cheni from Phaneropterinae; and ii) Meconematinae was the most basal group among these five subfamilies.
Anopheles gambiae genome reannotation through synthesis of ab initio and comparative gene prediction algorithms

PubMed Central

Li, Jun; Riehle, Michelle M; Zhang, Yan; Xu, Jiannong; Oduol, Frederick; Gomez, Shawn M; Eiglmeier, Karin; Ueberheide, Beatrix M; Shabanowitz, Jeffrey; Hunt, Donald F; Ribeiro, José MC; Vernick, Kenneth D

2006-01-01

Background Complete genome annotation is a necessary tool as Anopheles gambiae researchers probe the biology of this potent malaria vector. Results We reannotate the A. gambiae genome by synthesizing comparative and ab initio sets of predicted coding sequences (CDSs) into a single set using an exon-gene-union algorithm followed by an open-reading-frame-selection algorithm. The reannotation predicts 20,970 CDSs supported by at least two lines of evidence, and it lowers the proportion of CDSs lacking start and/or stop codons to only approximately 4%. The reannotated CDS set includes a set of 4,681 novel CDSs not represented in the Ensembl annotation but with EST support, and another set of 4,031 Ensembl-supported genes that undergo major structural and, therefore, probably functional changes in the reannotated set. The quality and accuracy of the reannotation was assessed by comparison with end sequences from 20,249 full-length cDNA clones, and evaluation of mass spectrometry peptide hit rates from an A. gambiae shotgun proteomic dataset confirms that the reannotated CDSs offer a high quality protein database for proteomics. We provide a functional proteomics annotation, ReAnoXcel, obtained by analysis of the new CDSs through the AnoXcel pipeline, which allows functional comparisons of the CDS sets within the same bioinformatic platform. CDS data are available for download. Conclusion Comprehensive A. gambiae genome reannotation is achieved through a combination of comparative and ab initio gene prediction algorithms. PMID:16569258
Partial attenuation of Marek's disease virus by manipulation of Di-codon bias

USDA-ARS?s Scientific Manuscript database

All species studied to date demonstrate a preference for certain codons over other synonymous codons (codon bias), a preference which is also observed for pairs of codons (di-codon bias). Previous studies using poliovirus and influenza virus as models have demonstrated the ability to cause attenuat...
Exploring codon context bias for synthetic gene design of a thermostable invertase in Escherichia coli.

PubMed

Pek, Han Bin; Klement, Maximilian; Ang, Kok Siong; Chung, Bevan Kai-Sheng; Ow, Dave Siak-Wei; Lee, Dong-Yup

2015-01-01

Various isoforms of invertases from prokaryotes, fungi, and higher plants has been expressed in Escherichia coli, and codon optimisation is a widely-adopted strategy for improvement of heterologous enzyme expression. Successful synthetic gene design for recombinant protein expression can be done by matching its translational elongation rate against heterologous host organisms via codon optimization. Amongst the various design parameters considered for the gene synthesis, codon context bias has been relatively overlooked compared to individual codon usage which is commonly adopted in most of codon optimization tools. In addition, matching the rates of transcription and translation based on secondary structure may lead to enhanced protein folding. In this study, we evaluated codon context fitness as design criterion for improving the expression of thermostable invertase from Thermotoga maritima in Escherichia coli and explored the relevance of secondary structure regions for folding and expression. We designed three coding sequences by using (1) a commercial vendor optimized gene algorithm, (2) codon context for the whole gene, and (3) codon context based on the secondary structure regions. Then, the codon optimized sequences were transformed and expressed in E. coli. From the resultant enzyme activities and protein yield data, codon context fitness proved to have the highest activity as compared to the wild-type control and other criteria while secondary structure-based strategy is comparable to the control. Codon context bias was shown to be a relevant parameter for enhancing enzyme production in Escherichia coli by codon optimization. Thus, we can effectively design synthetic genes within heterologous host organisms using this criterion. Copyright © 2015 Elsevier Inc. All rights reserved.
Potential of Start Codon Targeted (SCoT) markers for DNA fingerprinting of newly synthesized tritordeums and their respective parents.

PubMed

Cabo, Sandra; Ferreira, Luciana; Carvalho, Ana; Martins-Lopes, Paula; Martín, António; Lima-Brito, José Eduardo

2014-08-01

Hexaploid tritordeum (H(ch)H(ch)AABB; 2n = 42) results from the cross between Hordeum chilense (H(ch)H(ch); 2n = 14) and cultivated durum wheat (Triticum turgidum ssp. durum (AABB; 2n = 28). Morphologically, tritordeum resembles the wheat parent, showing promise for agriculture and wheat breeding. Start Codon Targeted (SCoT) polymorphism is a recently developed technique that generates gene-targeted markers. Thus, we considered it interesting to evaluate its potential for the DNA fingerprinting of newly synthesized hexaploid tritordeums and their respective parents. In this study, 60 SCoT primers were tested, and 18 and 19 of them revealed SCoT polymorphisms in the newly synthesized tritordeum lines HT27 and HT22, respectively, and their parents. An analysis of the presence/absence of bands among tritordeums and their parents revealed three types of polymorphic markers: (i) shared by tritordeums and one of their parents, (ii) exclusively amplified in tritordeums, and (iii) exclusively amplified in the parents. No polymorphism was detected among individuals of each parental species. Three SCoT markers were exclusively amplified in tritordeums of lines HT22 and HT27, being considered as polyploidization-induced rearrangements. About 70% of the SCoT markers of H. chilense origin were not transmitted to the allopolyploids of both lines, and most of the SCoTs scored in the newly synthesized allopolyploids originated from wheat, reinforcing the potential use of tritordeum as an alternative crop.
Forced Ambiguity of the Leucine Codons for Multiple-Site-Specific Incorporation of a Noncanonical Amino Acid

PubMed Central

Kwon, Inchan; Choi, Eun Sil

2016-01-01

Multiple-site-specific incorporation of a noncanonical amino acid into a recombinant protein would be a very useful technique to generate multiple chemical handles for bioconjugation and multivalent binding sites for the enhanced interaction. Previously combination of a mutant yeast phenylalanyl-tRNA synthetase variant and the yeast phenylalanyl-tRNA containing the AAA anticodon was used to incorporate a noncanonical amino acid into multiple UUU phenylalanine (Phe) codons in a site-specific manner. However, due to the less selective codon recognition of the AAA anticodon, there was significant misincorporation of a noncanonical amino acid into unwanted UUC Phe codons. To enhance codon selectivity, we explored degenerate leucine (Leu) codons instead of Phe degenerate codons. Combined use of the mutant yeast phenylalanyl-tRNA containing the CAA anticodon and the yPheRS_naph variant allowed incorporation of a phenylalanine analog, 2-naphthylalanine, into murine dihydrofolate reductase in response to multiple UUG Leu codons, but not to other Leu codon sites. Despite the moderate UUG codon occupancy by 2-naphthylalaine, these results successfully demonstrated that the concept of forced ambiguity of the genetic code can be achieved for the Leu codons, available for multiple-site-specific incorporation. PMID:27028506
Forced Ambiguity of the Leucine Codons for Multiple-Site-Specific Incorporation of a Noncanonical Amino Acid.

PubMed

Kwon, Inchan; Choi, Eun Sil

2016-01-01

Multiple-site-specific incorporation of a noncanonical amino acid into a recombinant protein would be a very useful technique to generate multiple chemical handles for bioconjugation and multivalent binding sites for the enhanced interaction. Previously combination of a mutant yeast phenylalanyl-tRNA synthetase variant and the yeast phenylalanyl-tRNA containing the AAA anticodon was used to incorporate a noncanonical amino acid into multiple UUU phenylalanine (Phe) codons in a site-specific manner. However, due to the less selective codon recognition of the AAA anticodon, there was significant misincorporation of a noncanonical amino acid into unwanted UUC Phe codons. To enhance codon selectivity, we explored degenerate leucine (Leu) codons instead of Phe degenerate codons. Combined use of the mutant yeast phenylalanyl-tRNA containing the CAA anticodon and the yPheRS_naph variant allowed incorporation of a phenylalanine analog, 2-naphthylalanine, into murine dihydrofolate reductase in response to multiple UUG Leu codons, but not to other Leu codon sites. Despite the moderate UUG codon occupancy by 2-naphthylalaine, these results successfully demonstrated that the concept of forced ambiguity of the genetic code can be achieved for the Leu codons, available for multiple-site-specific incorporation.
Massive programmed translational jumping in mitochondria

PubMed Central

Lang, B. Franz; Jakubkova, Michaela; Hegedusova, Eva; Daoud, Rachid; Forget, Lise; Brejova, Brona; Vinar, Tomas; Kosa, Peter; Fricova, Dominika; Nebohacova, Martina; Griac, Peter; Tomaska, Lubomir; Burger, Gertraud; Nosek, Jozef

2014-01-01

Programmed translational bypassing is a process whereby ribosomes “ignore” a substantial interval of mRNA sequence. Although discovered 25 y ago, the only experimentally confirmed example of this puzzling phenomenon is expression of the bacteriophage T4 gene 60. Bypassing requires translational blockage at a “takeoff codon” immediately upstream of a stop codon followed by a hairpin, which causes peptidyl-tRNA dissociation and reassociation with a matching “landing triplet” 50 nt downstream, where translation resumes. Here, we report 81 translational bypassing elements (byps) in mitochondria of the yeast Magnusiomyces capitatus and demonstrate in three cases, by transcript analysis and proteomics, that byps are retained in mitochondrial mRNAs but not translated. Although mitochondrial byps resemble the bypass sequence in the T4 gene 60, they utilize unused codons instead of stops for translational blockage and have relaxed matching rules for takeoff/landing sites. We detected byp-like sequences also in mtDNAs of several Saccharomycetales, indicating that byps are mobile genetic elements. These byp-like sequences lack bypassing activity and are tolerated when inserted in-frame in variable protein regions. We hypothesize that byp-like elements have the potential to contribute to evolutionary diversification of proteins by adding new domains that allow exploration of new structures and functions. PMID:24711422
Origin, antigenicity, and function of a secreted form of ORF2 in hepatitis E virus infection.

PubMed

Yin, Xin; Ying, Dong; Lhomme, Sébastien; Tang, Zimin; Walker, Christopher M; Xia, Ningshao; Zheng, Zizheng; Feng, Zongdi

2018-05-01

The enterically transmitted hepatitis E virus (HEV) adopts a unique strategy to exit cells by cloaking its capsid (encoded by the viral ORF2 gene) and circulating in the blood as "quasi-enveloped" particles. However, recent evidence suggests that the majority of the ORF2 protein present in the patient serum and supernatants of HEV-infected cell culture exists in a free form and is not associated with virus particles. The origin and biological functions of this secreted form of ORF2 (ORF2 S ) are unknown. Here we show that production of ORF2 S results from translation initiated at the previously presumed AUG start codon for the capsid protein, whereas translation of the actual capsid protein (ORF2 C ) is initiated at a previously unrecognized internal AUG codon (15 codons downstream of the first AUG). The addition of 15 amino acids to the N terminus of the capsid protein creates a signal sequence that drives ORF2 S secretion via the secretory pathway. Unlike ORF2 C , ORF2 S is glycosylated and exists as a dimer. Nonetheless, ORF2 S exhibits substantial antigenic overlap with the capsid, but the epitopes predicted to bind the putative cell receptor are lost. Consistent with this, ORF2 S does not block HEV cell entry but inhibits antibody-mediated neutralization. These results reveal a previously unrecognized aspect in HEV biology and shed new light on the immune evasion mechanisms and pathogenesis of this virus.
Targeting Nonsense Mutations in Diseases with Translational Read-Through-Inducing Drugs (TRIDs).

PubMed

Nagel-Wolfrum, Kerstin; Möller, Fabian; Penner, Inessa; Baasov, Timor; Wolfrum, Uwe

2016-04-01

In recent years, remarkable advances in the ability to diagnose genetic disorders have been made. The identification of disease-causing genes allows the development of gene-specific therapies with the ultimate goal to develop personalized medicines for each patient according to their own specific genetic defect. In-depth genotyping of many different genes has revealed that ~12% of inherited genetic disorders are caused by in-frame nonsense mutations. Nonsense (non-coding) mutations are caused by point mutations, which generate premature termination codons (PTCs) that cause premature translational termination of the mRNA, and subsequently inhibit normal full-length protein expression. Recently, a gene-based therapeutic approach for genetic diseases caused by nonsense mutations has emerged, namely the so-called translational read-through (TR) therapy. Read-through therapy is based on the discovery that small molecules, known as TR-inducing drugs (TRIDs), allow the translation machinery to suppress a nonsense codon, elongate the nascent peptide chain, and consequently result in the synthesis of full-length protein. Several TRIDs are currently under investigation and research has been performed on several genetic disorders caused by nonsense mutations over the years. These findings have raised hope for the usage of TR therapy as a gene-based pharmacogenetic therapy for nonsense mutations in various genes responsible for a variety of genetic diseases.
Tracking of Engineered Bacteria In Vivo Using Nonstandard Amino Acid Incorporation.

PubMed

Praveschotinunt, Pichet; Dorval Courchesne, Noémie-Manuelle; den Hartog, Ilona; Lu, Chaochen; Kim, Jessica J; Nguyen, Peter Q; Joshi, Neel S

2018-06-15

The rapidly growing field of microbiome research presents a need for better methods of monitoring gut microbes in vivo with high spatial and temporal resolution. We report a method of tracking microbes in vivo within the gastrointestinal tract by programming them to incorporate nonstandard amino acids (NSAA) and labeling them via click chemistry. Using established machinery constituting an orthogonal translation system (OTS), we engineered Escherichia coli to incorporate p-azido-l-phenylalanine (pAzF) in place of the UAG (amber) stop codon. We also introduced a mutant gene encoding for a cell surface protein (CsgA) that was altered to contain an in-frame UAG codon. After pAzF incorporation and extracellular display, the engineered strains could be covalently labeled via copper-free click reaction with a Cy5 dye conjugated to the dibenzocyclooctyl (DBCO) group. We confirmed the functionality of the labeling strategy in vivo using a murine model. Labeling of the engineered strain could be observed using oral administration of the dye to mice several days after colonization of the gastrointestinal tract. This work sets the foundation for the development of in vivo tracking microbial strategies that may be compatible with noninvasive imaging modalities and are capable of longitudinal spatiotemporal monitoring of specific microbial populations.
Generate Optimized Genetic Rhythm for Enzyme Expression in Non-native systems

DOE Office of Scientific and Technical Information (OSTI.GOV)

2016-11-03

Most amino acids are represented by more than one codon, resulting in redundancy in the genetic code. Silent codon substitutions that do not alter the amino acid sequence still have an effect on protein expression. We have developed an algorithm, GoGREEN, to enhance the expression of foreign proteins in a host organism. GoGREEN selects codons according to frequency patterns seen in the gene of interest using the codon usage table from the host organism. GoGREEN is also designed to accommodate gaps in the sequence.This software takes for input (1) the aligned protein sequences for genes the user wishes to express,more » (2) the codon usage table for the host organism, (3) and the DNA sequence for the target protein found in the host organism. The program will select codons based on codon usage patterns for the target DNA sequence. The program will also select codons for “gaps” found in the aligned protein sequences using the codon usage table from the host organism.« less
Properties of an intergenic terminator and start site switch that regulate IMD2 transcription in yeast.

PubMed

Jenks, M Harley; O'Rourke, Thomas W; Reines, Daniel

2008-06-01

The IMD2 gene in Saccharomyces cerevisiae is regulated by intracellular guanine nucleotides. Regulation is exerted through the choice of alternative transcription start sites that results in synthesis of either an unstable short transcript terminating upstream of the start codon or a full-length productive IMD2 mRNA. Start site selection is dictated by the intracellular guanine nucleotide levels. Here we have mapped the polyadenylation sites of the upstream, unstable short transcripts that form a heterogeneous family of RNAs of approximately 200 nucleotides. The switch from the upstream to downstream start sites required the Rpb9 subunit of RNA polymerase II. The enzyme's ability to locate the downstream initiation site decreased exponentially as the start was moved downstream from the TATA box. This suggests that RNA polymerase II's pincer grip is important as it slides on DNA in search of a start site. Exosome degradation of the upstream transcripts was highly dependent upon the distance between the terminator and promoter. Similarly, termination was dependent upon the Sen1 helicase when close to the promoter. These findings extend the emerging concept that distinct modes of termination by RNA polymerase II exist and that the distance of the terminator from the promoter, as well as its sequence, is important for the pathway chosen.
Rhizobium meliloti anthranilate synthase gene: cloning, sequence, and expression in Escherichia coli.

PubMed Central

Bae, Y M; Holmgren, E; Crawford, I P

1989-01-01

We determined the DNA sequence of the Rhizobium meliloti gene encoding anthranilate synthase, the first enzyme of the tryptophan pathway. Sequences similar to those seen for the two subunits of the enzyme as found in all other procaryotic species studied are present in a single open reading frame of 729 codons. This apparent gene fusion joins the C terminus of the large subunit (TrpE) to the N terminus of the small subunit (TrpG) through a short connecting segment. We designate the fused gene trpE(G). The gene is flanked by a typical rho-independent terminator at the 3' end and a complex regulatory region at the 5' end resembling those of operons under transcriptional attenuation control. The location of the promoter was determined by S1 nuclease protection, using Rhizobium mRNA. Although this promoter was inactive in Escherichia coli, mutations eliciting activity were easily obtained. One of these was a C----T change at position -9 in the -10 region. The +1 position of the mRNA is the first base of the initiation codon of the leader peptide, implying that unlike trpE(G), which has a normal Shine-Dalgarno sequence, the leader peptide gene lacks a ribosome-binding site. Images PMID:2656657
Complete secretion of activable bovine prochymosin by genetically engineered L forms of Proteus mirabilis.

PubMed Central

Klessen, C; Schmidt, K H; Gumpert, J; Grosse, H H; Malke, H

1989-01-01

To circumvent problems encountered in the synthesis of active chymosin in a number of bacteria and fungi, a recombinant DNA L-form expression system that directed the complete secretion of fully activable prochymosin into the extracellular culture medium was developed. The expression plasmid constructions involved the in-frame fusion of prochymosin cDNA minus codons 1 to 4 to streptococcal pyrogenic exotoxin type A gene (speA') sequences, including the speA promoter, ribosomal binding site, and signal sequence and five codons of mature SpeA. Secretion of fusion prochymosin enzymatically and immunologically indistinguishable from bovine prochymosin was achieved after transformation of two stable protoplast type L-form strains derived from Proteus mirabilis. The secreted proenzyme was converted by autocatalytic processing to chymosin showing milk-clotting activity. In controlled laboratory fermentation processes, a maximum specific rate of activable prochymosin synthesis of 0.57 x 10(-3)/h was determined from the time courses of biomass dry weight and product formation. Yields as high as 40 +/- 10 micrograms/ml were obtained in the cell-free culture fluid of strain L99 carrying a naturally altered expression plasmid of increased segregational stability. The expression-secretion system described may be generally useful for production of recombinant mammalian proteins synthesized intracellularly as aberrantly folded insoluble aggregates. Images PMID:2499253
Emergent rules for codon choice elucidated by editing rare arginine codons in Escherichia coli

PubMed Central

Napolitano, Michael G.; Landon, Matthieu; Gregg, Christopher J.; Lajoie, Marc J.; Govindarajan, Lakshmi; Mosberg, Joshua A.; Kuznetsov, Gleb; Goodman, Daniel B.; Vargas-Rodriguez, Oscar; Isaacs, Farren J.; Söll, Dieter; Church, George M.

2016-01-01

The degeneracy of the genetic code allows nucleic acids to encode amino acid identity as well as noncoding information for gene regulation and genome maintenance. The rare arginine codons AGA and AGG (AGR) present a case study in codon choice, with AGRs encoding important transcriptional and translational properties distinct from the other synonymous alternatives (CGN). We created a strain of Escherichia coli with all 123 instances of AGR codons removed from all essential genes. We readily replaced 110 AGR codons with the synonymous CGU codons, but the remaining 13 “recalcitrant” AGRs required diversification to identify viable alternatives. Successful replacement codons tended to conserve local ribosomal binding site-like motifs and local mRNA secondary structure, sometimes at the expense of amino acid identity. Based on these observations, we empirically defined metrics for a multidimensional “safe replacement zone” (SRZ) within which alternative codons are more likely to be viable. To evaluate synonymous and nonsynonymous alternatives to essential AGRs further, we implemented a CRISPR/Cas9-based method to deplete a diversified population of a wild-type allele, allowing us to evaluate exhaustively the fitness impact of all 64 codon alternatives. Using this method, we confirmed the relevance of the SRZ by tracking codon fitness over time in 14 different genes, finding that codons that fall outside the SRZ are rapidly depleted from a growing population. Our unbiased and systematic strategy for identifying unpredicted design flaws in synthetic genomes and for elucidating rules governing codon choice will be crucial for designing genomes exhibiting radically altered genetic codes. PMID:27601680
Oligo kernels for datamining on biological sequences: a case study on prokaryotic translation initiation sites

PubMed Central

Meinicke, Peter; Tech, Maike; Morgenstern, Burkhard; Merkl, Rainer

2004-01-01

Background Kernel-based learning algorithms are among the most advanced machine learning methods and have been successfully applied to a variety of sequence classification tasks within the field of bioinformatics. Conventional kernels utilized so far do not provide an easy interpretation of the learnt representations in terms of positional and compositional variability of the underlying biological signals. Results We propose a kernel-based approach to datamining on biological sequences. With our method it is possible to model and analyze positional variability of oligomers of any length in a natural way. On one hand this is achieved by mapping the sequences to an intuitive but high-dimensional feature space, well-suited for interpretation of the learnt models. On the other hand, by means of the kernel trick we can provide a general learning algorithm for that high-dimensional representation because all required statistics can be computed without performing an explicit feature space mapping of the sequences. By introducing a kernel parameter that controls the degree of position-dependency, our feature space representation can be tailored to the characteristics of the biological problem at hand. A regularized learning scheme enables application even to biological problems for which only small sets of example sequences are available. Our approach includes a visualization method for transparent representation of characteristic sequence features. Thereby importance of features can be measured in terms of discriminative strength with respect to classification of the underlying sequences. To demonstrate and validate our concept on a biochemically well-defined case, we analyze E. coli translation initiation sites in order to show that we can find biologically relevant signals. For that case, our results clearly show that the Shine-Dalgarno sequence is the most important signal upstream a start codon. The variability in position and composition we found for that signal is in accordance with previous biological knowledge. We also find evidence for signals downstream of the start codon, previously introduced as transcriptional enhancers. These signals are mainly characterized by occurrences of adenine in a region of about 4 nucleotides next to the start codon. Conclusions We showed that the oligo kernel can provide a valuable tool for the analysis of relevant signals in biological sequences. In the case of translation initiation sites we could clearly deduce the most discriminative motifs and their positional variation from example sequences. Attractive features of our approach are its flexibility with respect to oligomer length and position conservation. By means of these two parameters oligo kernels can easily be adapted to different biological problems. PMID:15511290
Amino acid repeats avert mRNA folding through conservative substitutions and synonymous codons, regardless of codon bias.

PubMed

Barik, Sailen

2017-12-01

A significant number of proteins in all living species contains amino acid repeats (AARs) of various lengths and compositions, many of which play important roles in protein structure and function. Here, I have surveyed select homopolymeric single [(A)n] and double [(AB)n] AARs in the human proteome. A close examination of their codon pattern and analysis of RNA structure propensity led to the following set of empirical rules: (1) One class of amino acid repeats (Class I) uses a mixture of synonymous codons, some of which approximate the codon bias ratio in the overall human proteome; (2) The second class (Class II) disregards the codon bias ratio, and appears to have originated by simple repetition of the same codon (or just a few codons); and finally, (3) In all AARs (including Class I, Class II, and the in-betweens), the codons are chosen in a manner that precludes the formation of RNA secondary structure. It appears that the AAR genes have evolved by orchestrating a balance between codon usage and mRNA secondary structure. The insights gained here should provide a better understanding of AAR evolution and may assist in designing synthetic genes.
Complex codon usage pattern and compositional features of retroviruses.

PubMed

RoyChoudhury, Sourav; Mukherjee, Debaprasad

2013-01-01

Retroviruses infect a wide range of organisms including humans. Among them, HIV-1, which causes AIDS, has now become a major threat for world health. Some of these viruses are also potential gene transfer vectors. In this study, the patterns of synonymous codon usage in retroviruses have been studied through multivariate statistical methods on ORFs sequences from the available 56 retroviruses. The principal determinant for evolution of the codon usage pattern in retroviruses seemed to be the compositional constraints, while selection for translation of the viral genes plays a secondary role. This was further supported by multivariate analysis on relative synonymous codon usage. Thus, it seems that mutational bias might have dominated role over translational selection in shaping the codon usage of retroviruses. Codon adaptation index was used to identify translationally optimal codons among genes from retroviruses. The comparative analysis of the preferred and optimal codons among different retroviral groups revealed that four codons GAA, AAA, AGA, and GGA were significantly more frequent in most of the retroviral genes inspite of some differences. Cluster analysis also revealed that phylogenetically related groups of retroviruses have probably evolved their codon usage in a concerted manner under the influence of their nucleotide composition.

Synonymous codon usage patterns in different parasitic platyhelminth mitochondrial genomes.

PubMed

Chen, L; Yang, D Y; Liu, T F; Nong, X; Huang, X; Xie, Y; Fu, Y; Zheng, W P; Zhang, R H; Wu, X H; Gu, X B; Wang, S X; Peng, X R; Yang, G Y

2013-02-27

We analyzed synonymous codon usage patterns of the mitochondrial genomes of 43 parasitic platyhelminth species. The relative synonymous codon usage, the effective number of codons (NC) and the frequency of G+C at the third synonymously variable coding position were calculated. Correspondence analysis was used to determine the major variation trends shaping the codon usage patterns. Among the mitochondrial genomes of 19 trematode species, the GC content of third codon positions varied from 0.151 to 0.592, with a mean of 0.295 ± 0.116. In cestodes, the mean GC content of third codon positions was 0.254 ± 0.044. A comparison of the nucleotide composition at 4-fold synonymous sites revealed that, on average, there was a greater abundance of codons ending on U (51.9%) or A (22.7%) than on C (6.3%) or G (19.14%). Twenty-two codons, including UUU, UUA and UUG, were frequently used. In the NC-plot, most of points were distributed well below or around the expected NC curve. In addition to compositional constraints, the degree of hydrophobicity and the aromatic amino acids also influenced codon usage in the mitochondrial genomes of these 43 parasitic platyhelminth species.
Gene mutations and increased levels of p53 protein in human squamous cell carcinomas and their cell lines.

PubMed Central

Burns, J. E.; Baird, M. C.; Clark, L. J.; Burns, P. A.; Edington, K.; Chapman, C.; Mitchell, R.; Robertson, G.; Soutar, D.; Parkinson, E. K.

1993-01-01

Using immunocytochemical and Western blotting techniques we have demonstrated the presence of abnormally high levels of p53 protein in 8/24 (33%) of human squamous cell carcinomas (SCC) and 9/18 (50%) of SCC cell lines. There was a correlation between the immunocytochemical results obtained with eight SCC samples and their corresponding cell lines. Direct sequencing of PCR-amplified, reverse transcribed, p53 mRNA confirmed the expression of point mutations in six of the positive cell lines and detected in-frame deletions in two others. We also detected two stop mutations and three out-of-frame deletions in five lines which did not express elevated levels of p53 protein. Several of the mutations found in SCC of the tongue (3/7) were in a region (codons 144-166) previously identified as being a p53 mutational hot spot in non-small cell lung tumours (Mitsudomi et al., 1992). In 11/13 cases only the mutant alleles were expressed suggesting loss or reduced expression of the wild type alleles in these cases. Six of the mutations were also detected in the SCCs from which the lines were derived, strongly suggesting that the mutations occurred, and were selected, in vivo. The 12th mutation GTG-->GGG (valine-->glycine) at codon 216 was expressed in line SCC-12 clone B along with an apparently normal p53 allele and is to our knowledge a novel mutation. Line BICR-19 also expressed a normal p53 allele in addition to one where exon 10 was deleted. Additionally 15 of the SCC lines (including all of those which did not show elevated p53 protein levels) were screened for the presence of human papillomavirus types 16 and 18 and were found to be negative. These results are discussed in relation to the pathogenesis of SCC and the immortalisation of human keratinocytes in vitro. Images Figure 1 Figure 2 Figure 3 Figure 4 Figure 5 PMID:8390283
Nuclear sequestration of COL1A1 mRNA transcript associated with type I osteogenesis imperfecta (OI)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Primorac, D.; Stover, M.L.; McKinstry, M.B.

Previously we identified an OI type I patient with a splice donor mutation that resulted in intron 26 retention instead of exon skipping and sequestration of normal levels of the mutant transcript in the nuclear compartment. Intron retention was consistent with the exon definition hypothesis for splice site selection since the size of the exon-intron-exon unit was less than 300 bp. Furthermore, the retained intron contained in-frame stop codons which is thought to cause the mutant RNA to remain within the nucleus rather than appearing in the cytoplasm. To test these hypotheses, genomic fragments containing the normal sequence or themore » donor mutation were cloned into a collagen minigene and expressed in stably tansfected NIH 3T3 cells. None of the modifications to the normal intron altered the level of RNA that accumulated in the cytoplasm, as expected. However none of the modifications to the mutant intron allowed accumulation of normal levels of mRNA in the cytoplasm. Moreover, in contrast to our findings in the patient`s cells only low levels of mutant transcript were found in the nucleus; a fraction of the transcript did appear in the cytoplasm which had spliced the mutant donor site correctly. Nuclear run-on experiments demonstrated equal levels of transcription from each transgene. Expression of another donor mutation known to cause in-frame exon skipping in OI type IV was accurately reproduced in the minigene in transfected 3T3 cells. Our experience suggests that either mechanism can lead to formation of a null allele possibly related to the type of splicing events surrounding the potential stop codons. Understanding the rules governing inactivation of a collagen RNA transcript may be important in designing a strategy to inactivate a dominate negative mutation associated with the more severe forms of OI.« less
Selective forces and mutational biases drive stop codon usage in the human genome: a comparison with sense codon usage.

PubMed

Trotta, Edoardo

2016-05-17

The three stop codons UAA, UAG, and UGA signal the termination of mRNA translation. As a result of a mechanism that is not adequately understood, they are normally used with unequal frequencies. In this work, we showed that selective forces and mutational biases drive stop codon usage in the human genome. We found that, in respect to sense codons, stop codon usage was affected by stronger selective forces but was less influenced by neutral mutational biases. UGA is the most frequent termination codon in human genome. However, UAA was the preferred stop codon in genes with high breadth of expression, high level of expression, AT-rich coding sequences, housekeeping functions, and in gene ontology categories with the largest deviation from expected stop codon usage. Selective forces associated with the breadth and the level of expression favoured AT-rich sequences in the mRNA region including the stop site and its proximal 3'-UTR, but acted with scarce effects on sense codons, generating two regions, upstream and downstream of the stop codon, with strongly different base composition. By favouring low levels of GC-content, selection promoted labile local secondary structures at the stop site and its proximal 3'-UTR. The compositional and structural context favoured by selection was surprisingly emphasized in the class of ribosomal proteins and was consistent with sequence elements that increase the efficiency of translational termination. Stop codons were also heterogeneously distributed among chromosomes by a mechanism that was strongly correlated with the GC-content of coding sequences. In human genome, the nucleotide composition and the thermodynamic stability of stop codon site and its proximal 3'-UTR are correlated with the GC-content of coding sequences and with the breadth and the level of gene expression. In highly expressed genes stop codon usage is compositionally and structurally consistent with highly efficient translation termination signals.
Evolution of a pseudogene: exclusive survival of a functional mitochondrial nad7 gene supports Haplomitrium as the earliest liverwort lineage and proposes a secondary loss of RNA editing in Marchantiidae.

PubMed

Groth-Malonek, Milena; Wahrmund, Ute; Polsakiewicz, Monika; Knoop, Volker

2007-04-01

Gene transfer from the mitochondrion into the nucleus is a corollary of the endosymbiont hypothesis. The frequent and independent transfer of genes for mitochondrial ribosomal proteins is well documented with many examples in angiosperms, whereas transfer of genes for components of the respiratory chain is a rarity. A notable exception is the nad7 gene, encoding subunit 7 of complex I, in the liverwort Marchantia polymorpha, which resides as a full-length, intron-carrying and transcribed, but nonspliced pseudogene in the chondriome, whereas its functional counterpart is nuclear encoded. To elucidate the patterns of pseudogene degeneration, we have investigated the mitochondrial nad7 locus in 12 other liverworts of broad phylogenetic distribution. We find that the mitochondrial nad7 gene is nonfunctional in 11 of them. However, the modes of pseudogene degeneration vary: whereas point mutations, accompanied by single-nucleotide indels, predominantly introduce stop codons into the reading frame in marchantiid liverworts, larger indels introduce frameshifts in the simple thalloid and leafy jungermanniid taxa. Most notably, however, the mitochondrial nad7 reading frame appears to be intact in the isolated liverwort genus Haplomitrium. Its functional expression is shown by cDNA analysis identifying typical RNA-editing events to reconstitute conserved codon identities and also confirming functional splicing of the 2 liverwort-specific group II introns. We interpret our results 1) to indicate the presence of a functional mitochondrial nad7 gene in the earliest land plants and strongly supporting a basal placement of Haplomitrium among the liverworts, 2) to indicate different modes of pseudogene degeneration and chondriome evolution in the later branching liverwort clades, 3) to suggest a surprisingly long maintenance of a nonfunctional gene in the presumed oldest group of land plants, and 4) to support the model of a secondary loss of RNA-editing activity in marchantiid liverworts.
Pandemic influenza A virus codon usage revisited: biases, adaptation and implications for vaccine strain development

PubMed Central

2012-01-01

Background Influenza A virus (IAV) is a member of the family Orthomyxoviridae and contains eight segments of a single-stranded RNA genome with negative polarity. The first influenza pandemic of this century was declared in April of 2009, with the emergence of a novel H1N1 IAV strain (H1N1pdm) in Mexico and USA. Understanding the extent and causes of biases in codon usage is essential to the understanding of viral evolution. A comprehensive study to investigate the effect of selection pressure imposed by the human host on the codon usage of an emerging, pandemic IAV strain and the trends in viral codon usage involved over the pandemic time period is much needed. Results We performed a comprehensive codon usage analysis of 310 IAV strains from the pandemic of 2009. Highly biased codon usage for Ala, Arg, Pro, Thr and Ser were found. Codon usage is strongly influenced by underlying biases in base composition. When correspondence analysis (COA) on relative synonymous codon usage (RSCU) is applied, the distribution of IAV ORFs in the plane defined by the first two major dimensional factors showed that different strains are located at different places, suggesting that IAV codon usage also reflects an evolutionary process. Conclusions A general association between codon usage bias, base composition and poor adaptation of the virus to the respective host tRNA pool, suggests that mutational pressure is the main force shaping H1N1 pdm IAV codon usage. A dynamic process is observed in the variation of codon usage of the strains enrolled in these studies. These results suggest a balance of mutational bias and natural selection, which allow the virus to explore and re-adapt its codon usage to different environments. Recoding of IAV taking into account codon bias, base composition and adaptation to host tRNA may provide important clues to develop new and appropriate vaccines. PMID:23134595
Evaluating Sense Codon Reassignment with a Simple Fluorescence Screen.

PubMed

Biddle, Wil; Schmitt, Margaret A; Fisk, John D

2015-12-22

Understanding the interactions that drive the fidelity of the genetic code and the limits to which modifications can be made without breaking the translational system has practical implications for understanding the molecular mechanisms of evolution as well as expanding the set of encodable amino acids, particularly those with chemistries not provided by Nature. Because 61 sense codons encode 20 amino acids, reassigning the meaning of sense codons provides an avenue for biosynthetic modification of proteins, furthering both fundamental and applied biochemical research. We developed a simple screen that exploits the absolute requirement for fluorescence of an active site tyrosine in green fluorescent protein (GFP) to probe the pliability of the degeneracy of the genetic code. Our screen monitors the restoration of the fluorophore of GFP by incorporation of a tyrosine in response to a sense codon typically assigned another meaning in the genetic code. We evaluated sense codon reassignment at four of the 21 sense codons read through wobble interactions in Escherichia coli using the Methanocaldococcus jannaschii orthogonal tRNA/aminoacyl tRNA synthetase pair originally developed and commonly used for amber stop codon suppression. By changing only the anticodon of the orthogonal tRNA, we achieved sense codon reassignment efficiencies between 1% (Phe UUU) and 6% (Lys AAG). Each of the orthogonal tRNAs preferentially decoded the codon traditionally read via a wobble interaction in E. coli with the exception of the orthogonal tRNA with an AUG anticodon, which incorporated tyrosine in response to both the His CAU and His CAC codons with approximately equal frequencies. We applied our screen in a high-throughput manner to evaluate a 10(9)-member combined tRNA/aminoacyl tRNA synthetase library to identify improved sense codon reassigning variants for the Lys AAG codon. A single rapid screen with the ability to broadly evaluate reassignable codons will facilitate identification and improvement of the combinations of sense codons and orthogonal pairs that display efficient reassignment.
Codon usage affects the structure and function of the Drosophila circadian clock protein PERIOD.

PubMed

Fu, Jingjing; Murphy, Katherine A; Zhou, Mian; Li, Ying H; Lam, Vu H; Tabuloc, Christine A; Chiu, Joanna C; Liu, Yi

2016-08-01

Codon usage bias is a universal feature of all genomes, but its in vivo biological functions in animal systems are not clear. To investigate the in vivo role of codon usage in animals, we took advantage of the sensitivity and robustness of the Drosophila circadian system. By codon-optimizing parts of Drosophila period (dper), a core clock gene that encodes a critical component of the circadian oscillator, we showed that dper codon usage is important for circadian clock function. Codon optimization of dper resulted in conformational changes of the dPER protein, altered dPER phosphorylation profile and stability, and impaired dPER function in the circadian negative feedback loop, which manifests into changes in molecular rhythmicity and abnormal circadian behavioral output. This study provides an in vivo example that demonstrates the role of codon usage in determining protein structure and function in an animal system. These results suggest a universal mechanism in eukaryotes that uses a codon usage "code" within genetic codons to regulate cotranslational protein folding. © 2016 Fu et al.; Published by Cold Spring Harbor Laboratory Press.
Revelation of Influencing Factors in Overall Codon Usage Bias of Equine Influenza Viruses

PubMed Central

Bhatia, Sandeep; Sood, Richa; Selvaraj, Pavulraj

2016-01-01

Equine influenza viruses (EIVs) of H3N8 subtype are culprits of severe acute respiratory infections in horses, and are still responsible for significant outbreaks worldwide. Adaptability of influenza viruses to a particular host is significantly influenced by their codon usage preference, due to an absolute dependence on the host cellular machinery for their replication. In the present study, we analyzed genome-wide codon usage patterns in 92 EIV strains, including both H3N8 and H7N7 subtypes by computing several codon usage indices and applying multivariate statistical methods. Relative synonymous codon usage (RSCU) analysis disclosed bias of preferred synonymous codons towards A/U-ended codons. The overall codon usage bias in EIVs was slightly lower, and mainly affected by the nucleotide compositional constraints as inferred from the RSCU and effective number of codon (ENc) analysis. Our data suggested that codon usage pattern in EIVs is governed by the interplay of mutation pressure, natural selection from its hosts and undefined factors. The H7N7 subtype was found less fit to its host (horse) in comparison to H3N8, by possessing higher codon bias, lower mutation pressure and much less adaptation to tRNA pool of equine cells. To the best of our knowledge, this is the first report describing the codon usage analysis of the complete genomes of EIVs. The outcome of our study is likely to enhance our understanding of factors involved in viral adaptation, evolution, and fitness towards their hosts. PMID:27119730
Nucleotide sequence of the gag gene and gag-pol junction of feline leukemia virus.

PubMed Central

Laprevotte, I; Hampe, A; Sherr, C J; Galibert, F

1984-01-01

The nucleotide sequence of the gag gene of feline leukemia virus and its flanking sequences were determined and compared with the corresponding sequences of two strains of feline sarcoma virus and with that of the Moloney strain of murine leukemia virus. A high degree of nucleotide sequence homology between the feline leukemia virus and murine leukemia virus gag genes was observed, suggesting that retroviruses of domestic cats and laboratory mice have a common, proximal evolutionary progenitor. The predicted structure of the complete feline leukemia virus gag gene precursor suggests that the translation of nonglycosylated and glycosylated gag gene polypeptides is initiated at two different AUG codons. These initiator codons fall in the same reading frame and are separated by a 222-base-pair segment which encodes an amino terminal signal peptide. The nucleotide sequence predicts the order of amino acids in each of the individual gag-coded proteins (p15, p12, p30, p10), all of which derive from the gag gene precursor. Stable stem-and-loop secondary structures are proposed for two regions of viral RNA. The first falls within sequences at the 5' end of the viral genome, together with adjacent palindromic sequences which may play a role in dimer linkage of RNA subunits. The second includes coding sequences at the gag-pol junction and is proposed to be involved in translation of the pol gene product. Sequence analysis of the latter region shows that the gag and pol genes are translated in different reading frames. Classical consensus splice donor and acceptor sequences could not be localized to regions which would permit synthesis of the expected gag-pol precursor protein. Alternatively, we suggest that the pol gene product (RNA-dependent DNA polymerase) could be translated by a frameshift suppressing mechanism which could involve cleavage modification of stems and loops in a manner similar to that observed in tRNA processing. PMID:6328019
The Complete Mitochondrial DNA Sequence of Scenedesmus obliquus Reflects an Intermediate Stage in the Evolution of the Green Algal Mitochondrial Genome

PubMed Central

Nedelcu, Aurora M.; Lee, Robert W.; Lemieux, Claude; Gray, Michael W.; Burger, Gertraud

2000-01-01

Two distinct mitochondrial genome types have been described among the green algal lineages investigated to date: a reduced–derived, Chlamydomonas-like type and an ancestral, Prototheca-like type. To determine if this unexpected dichotomy is real or is due to insufficient or biased sampling and to define trends in the evolution of the green algal mitochondrial genome, we sequenced and analyzed the mitochondrial DNA (mtDNA) of Scenedesmus obliquus. This genome is 42,919 bp in size and encodes 42 conserved genes (i.e., large and small subunit rRNA genes, 27 tRNA and 13 respiratory protein-coding genes), four additional free-standing open reading frames with no known homologs, and an intronic reading frame with endonuclease/maturase similarity. No 5S rRNA or ribosomal protein-coding genes have been identified in Scenedesmus mtDNA. The standard protein-coding genes feature a deviant genetic code characterized by the use of UAG (normally a stop codon) to specify leucine, and the unprecedented use of UCA (normally a serine codon) as a signal for termination of translation. The mitochondrial genome of Scenedesmus combines features of both green algal mitochondrial genome types: the presence of a more complex set of protein-coding and tRNA genes is shared with the ancestral type, whereas the lack of 5S rRNA and ribosomal protein-coding genes as well as the presence of fragmented and scrambled rRNA genes are shared with the reduced–derived type of mitochondrial genome organization. Furthermore, the gene content and the fragmentation pattern of the rRNA genes suggest that this genome represents an intermediate stage in the evolutionary process of mitochondrial genome streamlining in green algae. [The sequence data described in this paper have been submitted to the GenBank data library under accession no. AF204057.] PMID:10854413
The high-level expression of human tissue plasminogen activator in the milk of transgenic mice with hybrid gene locus strategy.

PubMed

Zhou, Yanrong; Lin, Yanli; Wu, Xiaojie; Xiong, Fuyin; Lv, Yuemeng; Zheng, Tao; Huang, Peitang; Chen, Hongxing

2012-02-01

Transgene expression for the mammary gland bioreactor aimed at producing recombinant proteins requires optimized expression vector construction. Previously we presented a hybrid gene locus strategy, which was originally tested with human lactoferrin (hLF) as target transgene, and an extremely high-level expression of rhLF ever been achieved as to 29.8 g/l in mice milk. Here to demonstrate the broad application of this strategy, another 38.4 kb mWAP-htPA hybrid gene locus was constructed, in which the 3-kb genomic coding sequence in the 24-kb mouse whey acidic protein (mWAP) gene locus was substituted by the 17.4-kb genomic coding sequence of human tissue plasminogen activator (htPA), exactly from the start codon to the end codon. Corresponding five transgenic mice lines were generated and the highest expression level of rhtPA in the milk attained as to 3.3 g/l. Our strategy will provide a universal way for the large-scale production of pharmaceutical proteins in the mammary gland of transgenic animals.
Analyses of clinicopathological, molecular, and prognostic associations of KRAS codon 61 and codon 146 mutations in colorectal cancer: cohort study and literature review

PubMed Central

2014-01-01

Background KRAS mutations in codons 12 and 13 are established predictive biomarkers for anti-EGFR therapy in colorectal cancer. Previous studies suggest that KRAS codon 61 and 146 mutations may also predict resistance to anti-EGFR therapy in colorectal cancer. However, clinicopathological, molecular, and prognostic features of colorectal carcinoma with KRAS codon 61 or 146 mutation remain unclear. Methods We utilized a molecular pathological epidemiology database of 1267 colon and rectal cancers in the Nurse’s Health Study and the Health Professionals Follow-up Study. We examined KRAS mutations in codons 12, 13, 61 and 146 (assessed by pyrosequencing), in relation to clinicopathological features, and tumor molecular markers, including BRAF and PIK3CA mutations, CpG island methylator phenotype (CIMP), LINE-1 methylation, and microsatellite instability (MSI). Survival analyses were performed in 1067 BRAF-wild-type cancers to avoid confounding by BRAF mutation. Cox proportional hazards models were used to compute mortality hazard ratio, adjusting for potential confounders, including disease stage, PIK3CA mutation, CIMP, LINE-1 hypomethylation, and MSI. Results KRAS codon 61 mutations were detected in 19 cases (1.5%), and codon 146 mutations in 40 cases (3.2%). Overall KRAS mutation prevalence in colorectal cancers was 40% (=505/1267). Of interest, compared to KRAS-wild-type, overall, KRAS-mutated cancers more frequently exhibited cecal location (24% vs. 12% in KRAS-wild-type; P < 0.0001), CIMP-low (49% vs. 32% in KRAS-wild-type; P < 0.0001), and PIK3CA mutations (24% vs. 11% in KRAS-wild-type; P < 0.0001). These trends were evident irrespective of mutated codon, though statistical power was limited for codon 61 mutants. Neither KRAS codon 61 nor codon 146 mutation was significantly associated with clinical outcome or prognosis in univariate or multivariate analysis [colorectal cancer-specific mortality hazard ratio (HR) = 0.81, 95% confidence interval (CI) = 0.29-2.26 for codon 61 mutation; colorectal cancer-specific mortality HR = 0.86, 95% CI = 0.42-1.78 for codon 146 mutation]. Conclusions Tumors with KRAS mutations in codons 61 and 146 account for an appreciable proportion (approximately 5%) of colorectal cancers, and their clinicopathological and molecular features appear generally similar to KRAS codon 12 or 13 mutated cancers. To further assess clinical utility of KRAS codon 61 and 146 testing, large-scale trials are warranted. PMID:24885062
Codon adaptation and synonymous substitution rate in diatom plastid genes.

PubMed

Morton, Brian R; Sorhannus, Ulf; Fox, Martin

2002-07-01

Diatom plastid genes are examined with respect to codon adaptation and rates of silent substitution (Ks). It is shown that diatom genes follow the same pattern of codon usage as other plastid genes studied previously. Highly expressed diatom genes display codon adaptation, or a bias toward specific major codons, and these major codons are the same as those in red algae, green algae, and land plants. It is also found that there is a strong correlation between Ks and variation in codon adaptation across diatom genes, providing the first evidence for such a relationship in the algae. It is argued that this finding supports the notion that the correlation arises from selective constraints, not from variation in mutation rate among genes. Finally, the diatom genes are examined with respect to variation in Ks among different synonymous groups. Diatom genes with strong codon adaptation do not show the same variation in synonymous substitution rate among codon groups as the flowering plant psbA gene which, previous studies have shown, has strong codon adaptation but unusually high rates of silent change in certain synonymous groups. The lack of a similar finding in diatoms supports the suggestion that the feature is unique to the flowering plant psbA due to recent relaxations in selective pressure in that lineage.
Evidence for ribosomal frameshifting and a novel overlapping gene in the genomes of insect-specific flaviviruses

DOE Office of Scientific and Technical Information (OSTI.GOV)

Firth, Andrew E., E-mail: a.firth@ucc.i; Blitvich, Bradley J., E-mail: blitvich@iastate.ed; Wills, Norma M., E-mail: nwills@genetics.utah.ed

2010-03-30

Flaviviruses have a positive-sense, single-stranded RNA genome of approx11 kb, encoding a large polyprotein that is cleaved to produce approx10 mature proteins. Cell fusing agent virus, Kamiti River virus, Culex flavivirus and several recently discovered flaviviruses have no known vertebrate host and apparently infect only insects. We present compelling bioinformatic evidence for a 253-295 codon overlapping gene (designated fifo) conserved throughout these insect-specific flaviviruses and immunofluorescent detection of its product. Fifo overlaps the NS2A/NS2B coding sequence in the - 1/+ 2 reading frame and is most likely expressed as a trans-frame fusion protein via ribosomal frameshifting at a conserved GGAUUUYmore » slippery heptanucleotide with 3'-adjacent RNA secondary structure (which stimulates efficient frameshifting in vitro). The discovery bears striking parallels to the recently discovered ribosomal frameshifting site in the NS2A coding sequence of the Japanese encephalitis serogroup of flaviviruses and suggests that programmed ribosomal frameshifting may be more widespread in flaviviruses than currently realized.« less
Differential regulation of hepatitis B virus core protein expression and genome replication by a small upstream open reading frame and naturally occurring mutations in the precore region.

PubMed

Zong, Li; Qin, Yanli; Jia, Haodi; Ye, Lei; Wang, Yongxiang; Zhang, Jiming; Wands, Jack R; Tong, Shuping; Li, Jisu

2017-05-01

Hepatitis B virus (HBV) transcribes two subsets of 3.5-kb RNAs: precore RNA for hepatitis B e antigen (HBeAg) expression, and pregenomic RNA for core and P protein translation as well as genome replication. HBeAg expression could be prevented by mutations in the precore region, while an upstream open reading frame (uORF) has been proposed as a negative regulator of core protein translation. We employed replication competent HBV DNA constructs and transient transfection experiments in Huh7 cells to verify the uORF effect and to explore the alternative function of precore RNA. Optimized Kozak sequence for the uORF or extra ATG codons as present in some HBV genotypes reduced core protein expression. G1896A nonsense mutation promoted more efficient core protein expression than mutated precore ATG, while a +1 frameshift mutation was ineffective. In conclusion, various HBeAg-negative precore mutations and mutations affecting uORF differentially regulate core protein expression and genome replication. Copyright © 2017 Elsevier Inc. All rights reserved.
Burkholderia Mallei tssM Encodes a Secreted Deubiquitinase that is Expressed Inside Infected RAW 264.7 Murine Macrophages

DTIC Science & Technology

2008-10-13

Furthermore, the encoded protein of this gene is only 30 kDa. A potential GTG start codon at position 625 also encodes a protein that is too small...horizontal bar and putative alternate translation initiation sites (ATG, GTG , and TTG) are indicated. The sizes and locations of the proteins encoded... gray line with rounded rectangles showing sequence features and motifs, including the Ala- and Pro-rich N-terminal region and the C-terminal Cys and
Molecular identification of Mango, Mangifera indica L.var. totupura

PubMed Central

Jagarlamudi, Sankar; G, Rosaiah; Kurapati, Ravi Kumar; Pinnamaneni, Rajasekhar

2011-01-01

Mango (>Mangifera indica) belonging to Anacardiaceae family is a fruit that grows in tropical regions. It is considered as the King of fruits. The present work was taken up to identify a tool in identifying the mango species at the molecular level. The chloroplast trnL-F region was amplified from extracted total genomic DNA using the polymerase chain reaction (PCR) and sequenced. Sequence of the dominant DGGE band revealed that Mangifera indica in tested leaves was Mangifera indica (100% similarity to the ITS sequences of Mangifera indica). This sequence was deposited in NCBI with the accession no. GQ927757. Abbreviations AFLP - Amplified fragment length polymorphism , cpDNA - Chloroplast DNA, DDGE - Denaturing gradient gel electrophoresis, DNA - Deoxyribo nucleic acid, EDTA - Ethylenediamine tetraacetic acid, HCl - Hydrochloric acid, ISSR - Inter simple sequence repeats, ITS - Internal transcribed spacer, MATAB - Methyl Ammonium Bromide, Na2SO3 - Sodium sulphite, NaCl - Sodium chloride, NCBI - National Centre for Biotechnology Information, PCR - Polymerase chain reaction, PEG - Polyethylene glycol, RAPD - Randomly amplified polymorphic DNA, trnL-F - Transfer RNA genes start codon- termination codon. PMID:21423885
Identification of a novel mutation in a patient with pseudohypoparathyroidism type Ia

PubMed Central

Lee, Ye Seung; Kim, Hui Kwon; Kim, Hye Rim; Lee, Jong Yoon; Choi, Joong Wan; Bae, Eun Ju; Oh, Phil Soo; Park, Won Il; Ki, Chang Seok

2014-01-01

Pseudohypoparathyroidism type Ia (PHP Ia) is a disorder characterized by multiform hormonal resistance including parathyroid hormone (PTH) resistance and Albright hereditary osteodystrophy (AHO). It is caused by heterozygous inactivating mutations within the Gs alpha-encoding GNAS exons. A 9-year-old boy presented with clinical and laboratory abnormalities including hypocalcemia, hyperphosphatemia, PTH resistance, multihormone resistance and AHO (round face, short stature, obesity, brachydactyly and osteoma cutis) which were typical of PHP Ia. He had a history of repeated convulsive episodes that started from the age of 2 months. A cranial computed tomography scan showed bilateral calcifications in the basal ganglia and his intelligence quotient testing indicated mild mental retardation. Family history revealed that the patient's maternal relatives, including his grandmother and 2 of his mother's siblings, had features suggestive of AHO. Sequencing of the GNAS gene of the patient identified a heterozygous nonsense mutation within exon 11 (c.637 C>T). The C>T transversion results in an amino acid substitution from Gln to stop codon at codon 213 (p.Gln213*). To our knowledge, this is a novel mutation in GNAS. PMID:25045367
Start Codon Targeted (SCoT) marker reveals genetic diversity of Dendrobium nobile Lindl., an endangered medicinal orchid species.

PubMed

Bhattacharyya, Paromik; Kumaria, Suman; Kumar, Shrawan; Tandon, Pramod

2013-10-15

Genetic variability in the wild genotypes of Dendrobium nobile Lindl. collected from different parts of Northeast India, was analyzed using a Start Codon Targeted (SCoT) marker system. A total of sixty individuals comprising of six natural populations were investigated for the existing natural genetic diversity. One hundred and thirty two (132) amplicons were produced by SCoT marker generating 96.21% polymorphism. The PIC value of the SCoT marker system was 0.78 and the Rp values of the primers ranged between 4.43 and 7.50. The percentage of polymorphic loci (Pp) ranging from 25% to 56.82%, Nei's gene diversity (h) from 0.08 to 0.15 with mean Nei's gene diversity of 0.28, and Shannon's information index (I) values ranging from 0.13 to 0.24 with an average value of 0.43 were recorded. The gene flow value (0.37) and the diversity among populations (0.57) demonstrated higher genetic variation among the populations. Analysis of molecular variance (AMOVA) showed 43.37% of variation within the populations, whereas 56.63% variation was recorded among the populations. Cluster analysis also reveals high genetic variation among the genotypes. Present investigation suggests the effectiveness of SCoT marker system to estimate the genetic diversity of D. nobile and that it can be seen as a preliminary point for future research on the population and evolutionary genetics of this endangered orchid species of medicinal importance. © 2013.

The prediction of human exons by oligonucleotide composition and discriminant analysis of spliceable open reading frames

DOE Office of Scientific and Technical Information (OSTI.GOV)

Solovyev, V.V.; Salamov, A.A.; Lawrence, C.B.

1994-12-31

Discriminant analysis is applied to the problem of recognition 5`-, internal and 3`-exons in human DNA sequences. Specific recognition functions were developed for revealing exons of particular types. The method based on a splice site prediction algorithm that uses the linear Fisher discriminant to combine the information about significant triplet frequencies of various functional parts of splice site regions and preferences of oligonucleotide in protein coding and nation regions. The accuracy of our splice site recognition function is about 97%. A discriminant function for 5`-exon prediction includes hexanucleotide composition of upstream region, triplet composition around the ATG codon, ORF codingmore » potential, donor splice site potential and composition of downstream introit region. For internal exon prediction, we combine in a discriminant function the characteristics describing the 5`- intron region, donor splice site, coding region, acceptor splice site and Y-intron region for each open reading frame flanked by GT and AG base pairs. The accuracy of precise internal exon recognition on a test set of 451 exon and 246693 pseudoexon sequences is 77% with a specificity of 79% and a level of pseudoexon ORF prediction of 99.96%. The recognition quality computed at the level of individual nucleotides is 89%, for exon sequences and 98% for intron sequences. A discriminant function for 3`-exon prediction includes octanucleolide composition of upstream nation region, triplet composition around the stop codon, ORF coding potential, acceptor splice site potential and hexanucleotide composition of downstream region. We unite these three discriminant functions in exon predicting program FEX (find exons). FEX exactly predicts 70% of 1016 exons from the test of 181 complete genes with specificity 73%, and 89% exons are exactly or partially predicted. On the average, 85% of nucleotides were predicted accurately with specificity 91%.« less
Codon usage bias: causative factors, quantification methods and genome-wide patterns: with emphasis on insect genomes.

PubMed

Behura, Susanta K; Severson, David W

2013-02-01

Codon usage bias refers to the phenomenon where specific codons are used more often than other synonymous codons during translation of genes, the extent of which varies within and among species. Molecular evolutionary investigations suggest that codon bias is manifested as a result of balance between mutational and translational selection of such genes and that this phenomenon is widespread across species and may contribute to genome evolution in a significant manner. With the advent of whole-genome sequencing of numerous species, both prokaryotes and eukaryotes, genome-wide patterns of codon bias are emerging in different organisms. Various factors such as expression level, GC content, recombination rates, RNA stability, codon position, gene length and others (including environmental stress and population size) can influence codon usage bias within and among species. Moreover, there has been a continuous quest towards developing new concepts and tools to measure the extent of codon usage bias of genes. In this review, we outline the fundamental concepts of evolution of the genetic code, discuss various factors that may influence biased usage of synonymous codons and then outline different principles and methods of measurement of codon usage bias. Finally, we discuss selected studies performed using whole-genome sequences of different insect species to show how codon bias patterns vary within and among genomes. We conclude with generalized remarks on specific emerging aspects of codon bias studies and highlight the recent explosion of genome-sequencing efforts on arthropods (such as twelve Drosophila species, species of ants, honeybee, Nasonia and Anopheles mosquitoes as well as the recent launch of a genome-sequencing project involving 5000 insects and other arthropods) that may help us to understand better the evolution of codon bias and its biological significance. © 2012 The Authors. Biological Reviews © 2012 Cambridge Philosophical Society.
Pseudo-polyprotein translated from the full-length ORF1 of capillovirus is important for pathogenicity, but a truncated ORF1 protein without variable and CP regions is sufficient for replication.

PubMed

Hirata, Hisae; Yamaji, Yasuyuki; Komatsu, Ken; Kagiwada, Satoshi; Oshima, Kenro; Okano, Yukari; Takahashi, Shuichiro; Ugaki, Masashi; Namba, Shigetou

2010-09-01

The first open-reading frame (ORF) of the genus Capillovirus encodes an apparently chimeric polyprotein containing conserved regions for replicase (Rep) and coat protein (CP), while other viruses in the family Flexiviridae have separate ORFs encoding these proteins. To investigate the role of the full-length ORF1 polyprotein of capillovirus, we generated truncation mutants of ORF1 of apple stem grooving virus by inserting a termination codon into the variable region located between the putative Rep- and CP-coding regions. These mutants were capable of systemic infection, although their pathogenicity was attenuated. In vitro translation of ORF1 produced both the full-length polyprotein and the smaller Rep protein. The results of in vivo reporter assays suggested that the mechanism of this early termination is a ribosomal -1 frame-shift occurring downstream from the conserved Rep domains. The mechanism of capillovirus gene expression and the very close evolutionary relationship between the genera Capillovirus and Trichovirus are discussed. Copyright (c) 2010. Published by Elsevier B.V.
Development of a codon optimization strategy using the efor RED reporter gene as a test case

NASA Astrophysics Data System (ADS)

Yip, Chee-Hoo; Yarkoni, Orr; Ajioka, James; Wan, Kiew-Lian; Nathan, Sheila

2018-04-01

Synthetic biology is a platform that enables high-level synthesis of useful products such as pharmaceutically related drugs, bioplastics and green fuels from synthetic DNA constructs. Large-scale expression of these products can be achieved in an industrial compliant host such as Escherichia coli. To maximise the production of recombinant proteins in a heterologous host, the genes of interest are usually codon optimized based on the codon usage of the host. However, the bioinformatics freeware available for standard codon optimization might not be ideal in determining the best sequence for the synthesis of synthetic DNA. Synthesis of incorrect sequences can prove to be a costly error and to avoid this, a codon optimization strategy was developed based on the E. coli codon usage using the efor RED reporter gene as a test case. This strategy replaces codons encoding for serine, leucine, proline and threonine with the most frequently used codons in E. coli. Furthermore, codons encoding for valine and glycine are substituted with the second highly used codons in E. coli. Both the optimized and original efor RED genes were ligated to the pJS209 plasmid backbone using Gibson Assembly and the recombinant DNAs were transformed into E. coli E. cloni 10G strain. The fluorescence intensity per cell density of the optimized sequence was improved by 20% compared to the original sequence. Hence, the developed codon optimization strategy is proposed when designing an optimal sequence for heterologous protein production in E. coli.
Codon usage regulates protein structure and function by affecting translation elongation speed in Drosophila cells.

PubMed

Zhao, Fangzhou; Yu, Chien-Hung; Liu, Yi

2017-08-21

Codon usage biases are found in all eukaryotic and prokaryotic genomes and have been proposed to regulate different aspects of translation process. Codon optimality has been shown to regulate translation elongation speed in fungal systems, but its effect on translation elongation speed in animal systems is not clear. In this study, we used a Drosophila cell-free translation system to directly compare the velocity of mRNA translation elongation. Our results demonstrate that optimal synonymous codons speed up translation elongation while non-optimal codons slow down translation. In addition, codon usage regulates ribosome movement and stalling on mRNA during translation. Finally, we show that codon usage affects protein structure and function in vitro and in Drosophila cells. Together, these results suggest that the effect of codon usage on translation elongation speed is a conserved mechanism from fungi to animals that can affect protein folding in eukaryotic organisms. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Analysis of Synonymous Codon Usage Bias of Zika Virus and Its Adaption to the Hosts

PubMed Central

Wang, Hongju; Liu, Siqing; Zhang, Bo

2016-01-01

Zika virus (ZIKV) is a mosquito-borne virus (arbovirus) in the family Flaviviridae, and the symptoms caused by ZIKV infection in humans include rash, fever, arthralgia, myalgia, asthenia and conjunctivitis. Codon usage bias analysis can reveal much about the molecular evolution and host adaption of ZIKV. To gain insight into the evolutionary characteristics of ZIKV, we performed a comprehensive analysis on the codon usage pattern in 46 ZIKV strains by calculating the effective number of codons (ENc), codon adaptation index (CAI), relative synonymous codon usage (RSCU), and other indicators. The results indicate that the codon usage bias of ZIKV is relatively low. Several lines of evidence support the hypothesis that translational selection plays a role in shaping the codon usage pattern of ZIKV. The results from a correspondence analysis (CA) indicate that other factors, such as base composition, aromaticity, and hydrophobicity may also be involved in shaping the codon usage pattern of ZIKV. Additionally, the results from a comparative analysis of RSCU between ZIKV and its hosts suggest that ZIKV tends to evolve codon usage patterns that are comparable to those of its hosts. Moreover, selection pressure from Homo sapiens on the ZIKV RSCU patterns was found to be dominant compared with that from Aedes aegypti and Aedes albopictus. Taken together, both natural translational selection and mutation pressure are important for shaping the codon usage pattern of ZIKV. Our findings contribute to understanding the evolution of ZIKV and its adaption to its hosts. PMID:27893824
Codon Optimization to Enhance Expression Yields Insights into Chloroplast Translation1[OPEN

PubMed Central

Chan, Hui-Ting; Williams-Carrier, Rosalind; Barkan, Alice

2016-01-01

Codon optimization based on psbA genes from 133 plant species eliminated 105 (human clotting factor VIII heavy chain [FVIII HC]) and 59 (polio VIRAL CAPSID PROTEIN1 [VP1]) rare codons; replacement with only the most highly preferred codons decreased transgene expression (77- to 111-fold) when compared with the codon usage hierarchy of the psbA genes. Targeted proteomic quantification by parallel reaction monitoring analysis showed 4.9- to 7.1-fold or 22.5- to 28.1-fold increase in FVIII or VP1 codon-optimized genes when normalized with stable isotope-labeled standard peptides (or housekeeping protein peptides), but quantitation using western blots showed 6.3- to 8-fold or 91- to 125-fold increase of transgene expression from the same batch of materials, due to limitations in quantitative protein transfer, denaturation, solubility, or stability. Parallel reaction monitoring, to our knowledge validated here for the first time for in planta quantitation of biopharmaceuticals, is especially useful for insoluble or multimeric proteins required for oral drug delivery. Northern blots confirmed that the increase of codon-optimized protein synthesis is at the translational level rather than any impact on transcript abundance. Ribosome footprints did not increase proportionately with VP1 translation or even decreased after FVIII codon optimization but is useful in diagnosing additional rate-limiting steps. A major ribosome pause at CTC leucine codons in the native gene of FVIII HC was eliminated upon codon optimization. Ribosome stalls observed at clusters of serine codons in the codon-optimized VP1 gene provide an opportunity for further optimization. In addition to increasing our understanding of chloroplast translation, these new tools should help to advance this concept toward human clinical studies. PMID:27465114
The Relation of Codon Bias to Tissue-Specific Gene Expression in Arabidopsis thaliana

PubMed Central

Camiolo, Salvatore; Farina, Lorenzo; Porceddu, Andrea

2012-01-01

The codon composition of coding sequences plays an important role in the regulation of gene expression. Herein, we report systematic differences in the usage of synonymous codons among Arabidopsis thaliana genes that are expressed specifically in distinct tissues. Although we observed that both regionally and transcriptionally associated mutational biases were associated significantly with codon bias, they could not explain the observed differences fully. Similarly, given that transcript abundances did not account for the differences in codon usage, it is unlikely that selection for translational efficiency can account exclusively for the observed codon bias. Thus, we considered the possible evolution of codon bias as an adaptive response to the different abundances of tRNAs in different tissues. Our analysis demonstrated that in some cases, codon usage in genes that were expressed in a broad range of tissues was influenced primarily by the tissue in which the gene was expressed maximally. On the basis of this finding we propose that genes that are expressed in certain tissues might show a tissue-specific compositional signature in relation to codon usage. These findings might have implications for the design of transgenes in relation to optimizing their expression. PMID:22865738
SENCA: A Multilayered Codon Model to Study the Origins and Dynamics of Codon Usage

PubMed Central

Pouyet, Fanny; Bailly-Bechet, Marc; Mouchiroud, Dominique; Guéguen, Laurent

2016-01-01

Gene sequences are the target of evolution operating at different levels, including the nucleotide, codon, and amino acid levels. Disentangling the impact of those different levels on gene sequences requires developing a probabilistic model with three layers. Here we present SENCA (site evolution of nucleotides, codons, and amino acids), a codon substitution model that separately describes 1) nucleotide processes which apply on all sites of a sequence such as the mutational bias, 2) preferences between synonymous codons, and 3) preferences among amino acids. We argue that most synonymous substitutions are not neutral and that SENCA provides more accurate estimates of selection compared with more classical codon sequence models. We study the forces that drive the genomic content evolution, intraspecifically in the core genome of 21 prokaryotes and interspecifically for five Enterobacteria. We retrieve the existence of a universal mutational bias toward AT, and that taking into account selection on synonymous codon usage has consequences on the measurement of selection on nonsynonymous substitutions. We also confirm that codon usage bias is mostly driven by selection on preferred codons. We propose new summary statistics to measure the relative importance of the different evolutionary processes acting on sequences. PMID:27401173
Most Used Codons per Amino Acid and per Genome in the Code of Man Compared to Other Organisms According to the Rotating Circular Genetic Code

PubMed Central

Castro-Chavez, Fernando

2011-01-01

My previous theoretical research shows that the rotating circular genetic code is a viable tool to make easier to distinguish the rules of variation applied to the amino acid exchange; it presents a precise and positional bio-mathematical balance of codons, according to the amino acids they codify. Here, I demonstrate that when using the conventional or classic circular genetic code, a clearer pattern for the human codon usage per amino acid and per genome emerges. The most used human codons per amino acid were the ones ending with the three hydrogen bond nucleotides: C for 12 amino acids and G for the remaining 8, plus one codon for arginine ending in A that was used approximately with the same frequency than the one ending in G for this same amino acid (plus *). The most used codons in man fall almost all the time at the rightmost position, clockwise, ending either in C or in G within the circular genetic code. The human codon usage per genome is compared to other organisms such as fruit flies (Drosophila melanogaster), squid (Loligo pealei), and many others. The biosemiotic codon usage of each genomic population or ‘Theme’ is equated to a ‘molecular language’. The C/U choice or difference, and the G/A difference in the third nucleotide of the most used codons per amino acid are illustrated by comparing the most used codons per genome in humans and squids. The human distribution in the third position of most used codons is a 12-8-2, C-G-A, nucleotide ending signature, while the squid distribution in the third position of most used codons was an odd, or uneven, distribution in the third position of its most used codons: 13-6-3, U-A-G, as its nucleotide ending signature. These findings may help to design computational tools to compare human genomes, to determine the exchangeability between compatible codons and amino acids, and for the early detection of incompatible changes leading to hereditary diseases. PMID:22997484
Differences in codon bias cannot explain differences in translational power among microbes.

PubMed

Dethlefsen, Les; Schmidt, Thomas M

2005-01-06

Translational power is the cellular rate of protein synthesis normalized to the biomass invested in translational machinery. Published data suggest a previously unrecognized pattern: translational power is higher among rapidly growing microbes, and lower among slowly growing microbes. One factor known to affect translational power is biased use of synonymous codons. The correlation within an organism between expression level and degree of codon bias among genes of Escherichia coli and other bacteria capable of rapid growth is commonly attributed to selection for high translational power. Conversely, the absence of such a correlation in some slowly growing microbes has been interpreted as the absence of selection for translational power. Because codon bias caused by translational selection varies between rapidly growing and slowly growing microbes, we investigated whether observed differences in translational power among microbes could be explained entirely by differences in the degree of codon bias. Although the data are not available to estimate the effect of codon bias in other species, we developed an empirically-based mathematical model to compare the translation rate of E. coli to the translation rate of a hypothetical strain which differs from E. coli only by lacking codon bias. Our reanalysis of data from the scientific literature suggests that translational power can differ by a factor of 5 or more between E. coli and slowly growing microbial species. Using empirical codon-specific in vivo translation rates for 29 codons, and several scenarios for extrapolating from these data to estimates over all codons, we find that codon bias cannot account for more than a doubling of the translation rate in E. coli, even with unrealistic simplifying assumptions that exaggerate the effect of codon bias. With more realistic assumptions, our best estimate is that codon bias accelerates translation in E. coli by no more than 60% in comparison to microbes with very little codon bias. While codon bias confers a substantial benefit of faster translation and hence greater translational power, the magnitude of this effect is insufficient to explain observed differences in translational power among bacterial and archaeal species, particularly the differences between slowly growing and rapidly growing species. Hence, large differences in translational power suggest that the translational apparatus itself differs among microbes in ways that influence translational performance.
Vertebrate codon bias indicates a highly GC-rich ancestral genome.

PubMed

Nabiyouni, Maryam; Prakash, Ashwin; Fedorov, Alexei

2013-04-25

Two factors are thought to have contributed to the origin of codon usage bias in eukaryotes: 1) genome-wide mutational forces that shape overall GC-content and create context-dependent nucleotide bias, and 2) positive selection for codons that maximize efficient and accurate translation. Particularly in vertebrates, these two explanations contradict each other and cloud the origin of codon bias in the taxon. On the one hand, mutational forces fail to explain GC-richness (~60%) of third codon positions, given the GC-poor overall genomic composition among vertebrates (~40%). On the other hand, positive selection cannot easily explain strict regularities in codon preferences. Large-scale bioinformatic assessment, of nucleotide composition of coding and non-coding sequences in vertebrates and other taxa, suggests a simple possible resolution for this contradiction. Specifically, we propose that the last common vertebrate ancestor had a GC-rich genome (~65% GC). The data suggest that whole-genome mutational bias is the major driving force for generating codon bias. As the bias becomes prominent, it begins to affect translation and can result in positive selection for optimal codons. The positive selection can, in turn, significantly modulate codon preferences. Copyright © 2013 Elsevier B.V. All rights reserved.
TTA codons in some genes prevent their expression in a class of developmental, antibiotic-negative, Streptomyces mutants.

PubMed Central

Leskiw, B K; Lawlor, E J; Fernandez-Abalos, J M; Chater, K F

1991-01-01

In Streptomyces coelicolor A3(2) and the related species Streptomyces lividans 66, aerial mycelium formation and antibiotic production are blocked by mutations in bldA, which specifies a tRNA(Leu)-like gene product which would recognize the UUA codon. Here we show that phenotypic expression of three disparate genes (carB, lacZ, and ampC) containing TTA codons depends strongly on bldA. Site-directed mutagenesis of carB, changing its two TTA codons to CTC (leucine) codons, resulted in bldA-independent expression; hence the bldA product is the principal tRNA for the UUA codon. Two other genes (hyg and aad) containing TTA codons show a medium-dependent reduction in phenotypic expression (hygromycin resistance and spectinomycin resistance, respectively) in bldA mutants. For hyg, evidence is presented that the UUA codon is probably being translated by a tRNA with an imperfectly matched anticodon, giving very low levels of gene product but relatively high resistance to hygromycin. It is proposed that TTA codons may be generally absent from genes expressed during vegetative growth and from the structural genes for differentiation and antibiotic production but present in some regulatory and resistance genes associated with the latter processes. The codon may therefore play a role in developmental regulation. Images PMID:1826053
Genetic Code Optimization for Cotranslational Protein Folding: Codon Directional Asymmetry Correlates with Antiparallel Betasheets, tRNA Synthetase Classes.

PubMed

Seligmann, Hervé; Warthi, Ganesh

2017-01-01

A new codon property, codon directional asymmetry in nucleotide content (CDA), reveals a biologically meaningful genetic code dimension: palindromic codons (first and last nucleotides identical, codon structure XZX) are symmetric (CDA = 0), codons with structures ZXX/XXZ are 5'/3' asymmetric (CDA = - 1/1; CDA = - 0.5/0.5 if Z and X are both purines or both pyrimidines, assigning negative/positive (-/+) signs is an arbitrary convention). Negative/positive CDAs associate with (a) Fujimoto's tetrahedral codon stereo-table; (b) tRNA synthetase class I/II (aminoacylate the 2'/3' hydroxyl group of the tRNA's last ribose, respectively); and (c) high/low antiparallel (not parallel) betasheet conformation parameters. Preliminary results suggest CDA-whole organism associations (body temperature, developmental stability, lifespan). Presumably, CDA impacts spatial kinetics of codon-anticodon interactions, affecting cotranslational protein folding. Some synonymous codons have opposite CDA sign (alanine, leucine, serine, and valine), putatively explaining how synonymous mutations sometimes affect protein function. Correlations between CDA and tRNA synthetase classes are weaker than between CDA and antiparallel betasheet conformation parameters. This effect is stronger for mitochondrial genetic codes, and potentially drives mitochondrial codon-amino acid reassignments. CDA reveals information ruling nucleotide-protein relations embedded in reversed (not reverse-complement) sequences (5'-ZXX-3'/5'-XXZ-3').
Analysis of the synonymous codon usage bias in recently emerged enterovirus D68 strains.

PubMed

Karniychuk, Uladzimir U

2016-09-02

Understanding the codon usage pattern of a pathogen and relationship between pathogen and host's codon usage patterns has fundamental and applied interests. Enterovirus D68 (EV-D68) is an emerging pathogen with a potentially high public health significance. In the present study, the synonymous codon usage bias of 27 recently emerged, and historical EV-D68 strains was analyzed. In contrast to previously studied enteroviruses (enterovirus 71 and poliovirus), EV-D68 and human host have a high discrepancy between favored codons. Analysis of viral synonymous codon usage bias metrics, viral nucleotide/dinucleotide compositional parameters, and viral protein properties showed that mutational pressure is more involved in shaping the synonymous codon usage bias of EV-D68 than translation selection. Computation of codon adaptation indices allowed to estimate expression potential of the EV-D68 genome in several commonly used laboratory animals. This approach requires experimental validation and may provide an auxiliary tool for the rational selection of laboratory animals to model emerging viral diseases. Enterovirus D68 genome compositional and codon usage data can be useful for further pathogenesis, animal model, and vaccine design studies. Copyright © 2016 Elsevier B.V. All rights reserved.
Compound heterozygous HAX1 mutations in a Swedish patient with severe congenital neutropenia and no neurodevelopmental abnormalities.

PubMed

Carlsson, Göran; Elinder, Göran; Malmgren, Helena; Trebinska, Alicja; Grzybowska, Ewa; Dahl, Niklas; Nordenskjöld, Magnus; Fadeel, Bengt

2009-12-01

Kostmann disease or severe congenital neutropenia (SCN) is an autosomal recessive disorder of neutrophil production. Homozygous HAX1 mutations were recently identified in SCN patients belonging to the original family in northern Sweden described by Kostmann. Moreover, recent studies have suggested an association between neurological dysfunction and HAX1 deficiency. Here we describe a patient with a compound heterozygous HAX1 mutation consisting of a nonsense mutation (c.568C > T, p.Glu190X) and a frame-shift mutation (c.91delG, p.Glu31LysfsX54) resulting in a premature stop codon. The patient has a history of neutropenia and a propensity for infections, but has shown no signs of neurodevelopmental abnormalities.
Synonymous codon choices in the extremely GC-poor genome of Plasmodium falciparum: compositional constraints and translational selection.

PubMed

Musto, H; Romero, H; Zavala, A; Jabbari, K; Bernardi, G

1999-07-01

We have analyzed the patterns of synonymous codon preferences of the nuclear genes of Plasmodium falciparum, a unicellular parasite characterized by an extremely GC-poor genome. When all genes are considered, codon usage is strongly biased toward A and T in third codon positions, as expected, but multivariate statistical analysis detects a major trend among genes. At one end genes display codon choices determined mainly by the extreme genome composition of this parasite, and very probably their expression level is low. At the other end a few genes exhibit an increased relative usage of a particular subset of codons, many of which are C-ending. Since the majority of these few genes is putatively highly expressed, we postulate that the increased C-ending codons are translationally optimal. In conclusion, while codon usage of the majority of P. falciparum genes is determined mainly by compositional constraints, a small number of genes exhibit translational selection.
Codon usage analysis of photolyase encoding genes of cyanobacteria inhabiting diverse habitats.

PubMed

Rajneesh; Pathak, Jainendra; Kannaujiya, Vinod K; Singh, Shailendra P; Sinha, Rajeshwar P

2017-07-01

Nucleotide and amino acid compositions were studied to determine the genomic and structural relationship of photolyase gene in freshwater, marine and hot spring cyanobacteria. Among three habitats, photolyase encoding genes from hot spring cyanobacteria were found to have highest GC content. The genomic GC content was found to influence the codon usage and amino acid variability in photolyases. The third position of codon was found to have more effect on amino acid variability in photolyases than the first and second positions of codon. The variation of amino acids Ala, Asp, Glu, Gly, His, Leu, Pro, Gln, Arg and Val in photolyases of three different habitats was found to be controlled by first position of codon (G1C1). However, second position (G2C2) of codon regulates variation of Ala, Cys, Gly, Pro, Arg, Ser, Thr and Tyr contents in photolyases. Third position (G3C3) of codon controls incorporation of amino acids such as Ala, Phe, Gly, Leu, Gln, Pro, Arg, Ser, Thr and Tyr in photolyases from three habitats. Photolyase encoding genes of hot spring cyanobacteria have 85% codons with G or C at third position, whereas marine and freshwater cyanobacteria showed 82 and 60% codons, respectively, with G or C at third position. Principal component analysis (PCA) showed that GC content has a profound effect in separating the genes along the first major axis according to their RSCU (relative synonymous codon usage) values, and neutrality analysis indicated that mutational pressure has resulted in codon bias in photolyase genes of cyanobacteria.
Analysis of codon usage bias of envelope glycoprotein genes in nuclear polyhedrosis virus (NPV) and its relation to evolution.

PubMed

Zhao, Yongchao; Zheng, Hao; Xu, Anying; Yan, Donghua; Jiang, Zijian; Qi, Qi; Sun, Jingchen

2016-08-24

Analysis of codon usage bias is an extremely versatile method using in furthering understanding of the genetic and evolutionary paths of species. Codon usage bias of envelope glycoprotein genes in nuclear polyhedrosis virus (NPV) has remained largely unexplored at present. Hence, the codon usage bias of NPV envelope glycoprotein was analyzed here to reveal the genetic and evolutionary relationships between different viral species in baculovirus genus. A total of 9236 codons from 18 different species of NPV of the baculovirus genera were used to perform this analysis. Glycoprotein of NPV exhibits weaker codon usage bias. Neutrality plot analysis and correlation analysis of effective number of codons (ENC) values indicate that natural selection is the main factor influencing codon usage bias, and that the impact of mutation pressure is relatively smaller. Another cluster analysis shows that the kinship or evolutionary relationships of these viral species can be divided into two broad categories despite all of these 18 species are from the same baculovirus genus. There are many elements that can affect codon bias, such as the composition of amino acids, mutation pressure, natural selection, gene expression level, and etc. In the meantime, cluster analysis also illustrates that codon usage bias of virus envelope glycoprotein can serve as an effective means of evolutionary classification in baculovirus genus.
Complete mitochondrial genome of the Kwangtung skate: Dipturus kwangtungensis (Rajiformes, Rajidae).

PubMed

Jeong, Dageum; Kim, Sung; Kim, Choong-Gon; Lee, Youn-Ho

2015-01-01

The complete sequence of mitochondrial DNA of a Kwangtung skate, Dipturus kwangtungensis, was determined as being circular molecules of 16,912 bp including 2 rRNA, 22 tRNA, 13 protein coding genes (PCGs) and a control region. The arrangement of the PCGs is the same as that found in other Rajidae species. The nucleotide of L-strand which encodes most of the proteins is composed of 30.2% A, 27.4% C, 28.2% T and 14.2% G with a bias toward A+T slightly. Twelve of 13 PCGs are initiated by the ATG codon while COX1 starts with GTG. Only ND4 harbors the incomplete termination codon, TA. All tRNA genes have a typical clover-leaf structure of mitochondrial tRNA with the exception of tRNA(Ser)AGY, which has a reduced DHU arm. This mitogenome is the first report for a species of the genus Dipturus, which will become an important source of information on the phylogenetic relationship and the evolution of the genus Dipturus within the family Rajidae.

A Mutation in the Start Codon of γ-Crystallin D Leads to Nuclear Cataracts in the Dahl SS/Jr-Ctr Strain

PubMed Central

Johnson, Ashley C.; Lee, Jonathan W.; Harmon, Ashlyn C.; Morris, Zaliya; Wang, Xuexiang; Fratkin, Jonathan; Rapp, John P.; Gomez-Sanchez, Elise; Garrett, Michael R.

2013-01-01

Cataracts are a major cause of blindness. The most common forms of cataracts are age and UV related and develops mostly in the elderly, while congenital cataracts appear at birth or in early childhood. The Dahl salt-sensitive (SS/Jr) rat is an extensively used model of salt-sensitive hypertension that exhibits concomitant renal disease. In the mid 1980’s, cataracts appeared in a few animals in the Dahl S colony, presumably the result of a spontaneous mutation. The mutation was fixed and bred to establish the SS/Jr-Ctr substrain. The SS/Jr-Ctr substrain has been exclusively used by a single investigator to study the role of steroids and hypertension. Using a classical positional cloning approach, we localized the cataract gene with high-resolution to a less than 1 Mbp region on chromosome 9 using an F1 (SS/Jr-Ctr X SHR) X SHR backcross population. The 1 Mbp region contained only 13 genes, including 4 genes from the γ-crystallins (Cryg) gene family which are known to play a role in cataract formation. All of the γ-crystallins were sequenced and a novel point mutation in the start codon (ATG → GTG) of the Crygd gene was identified which led to the complete absence of CRYGD protein in the eyes of the SS/Jr-Ctr strain. In summary, the identification of the genetic cause in this novel cataract model may provide an opportunity to better understand the development of cataracts, particularly in the context of hypertension. PMID:23404175
High-Efficiency "-1" and "-2" Ribosomal Frameshiftings Revealed by Force Spectroscopy.

PubMed

Tsai, Te-Wei; Yang, Haopeng; Yin, Heng; Xu, Shoujun; Wang, Yuhong

2017-06-16

Ribosomal frameshifting is a rare but ubiquitous process that is being studied extensively. Meanwhile, frameshifting motifs without any secondary mRNA structures were identified but rarely studied experimentally. We report unambiguous observation of highly efficient "-1" and "-2" frameshiftings on a GA 7 G slippery mRNA without the downstream secondary structure, using force-induced remnant magnetization spectroscopy combined with unique probing schemes. The result represents the first experimental evidence of multiple frameshifting steps. It is also one of the rare reports of the "-2" frameshifting. Our assay removed the ambiguity of transcriptional slippage involvement in other frameshifting assays. Two significant insights for the frameshifting mechanism were revealed. First, EF-G·GTP is indispensable to frameshifting. Although EFG·GDPCP has been shown to prompt translocation before, we found that it could not induce frameshifting. This implies that the GTP hydrolysis is responsible for the codon-anticodon re-pairing in frameshifting, which corroborates our previous mechanical force measurement of EF-G·GTP. Second, translation in all three reading frames of the slippery sequence can be induced by the corresponding in-frame aminoacyl tRNAs. Although A-site tRNA is known to affect the partition between "0" and "-1" frameshifting, it has not been reported that all three reading frames can be translated by their corresponding tRNAs. The in vitro results were confirmed by toe-printing assay and protein sequencing.
Molecular identification and transcriptional regulation of porcine IFIT2 gene.

PubMed

Yang, Xiuqin; Jing, Xiaoyan; Song, Yanfang; Zhang, Caixia; Liu, Di

2018-04-06

IFN-induced protein with tetratricopeptide repeats 2 (IFIT2) plays important roles in host defense against viral infection as revealed by studies in humans and mice. However, little is known on porcine IFIT2 (pIFIT2). Here, we performed molecular cloning, expression profile, and transcriptional regulation analysis of pIFIT2. pIFIT2 gene, located on chromosome 14, is composed of two exons and have a complete coding sequence of 1407 bp. The encoded polypeptide, 468 aa in length, has three tetratricopeptide repeat motifs. pIFIT2 gene was unevenly distributed in all eleven tissues studied with the most abundance in spleen. Poly(I:C) treatment notably strongly upregulated the mRNA level and promoter activity of pIFIT2 gene. Upstream sequence of 1759 bp from the start codon which was assigned +1 here has promoter activity, and deltaEF1 acts as transcription repressor through binding to sequences at position - 1774 to - 1764. Minimal promoter region exists within nucleotide position - 162 and - 126. Two adjacent interferon-stimulated response elements (ISREs) and two nuclear factor (NF)-κB binding sites were identified within position - 310 and - 126. The ISRE elements act alone and in synergy with the one closer to start codon having more strength, so do the NF-κB binding sites. Synergistic effect was also found between the ISRE and NF-κB binding sites. Additionally, a third ISRE element was identified within position - 1661 to - 1579. These findings will contribute to clarifying the antiviral effect and underlying mechanisms of pIFIT2.
Codon usage bias and tRNA over-expression in Buchnera aphidicola after aromatic amino acid nutritional stress on its host Acyrthosiphon pisum.

PubMed

Charles, Hubert; Calevro, Federica; Vinuelas, José; Fayard, Jean-Michel; Rahbe, Yvan

2006-01-01

Codon usage bias and relative abundances of tRNA isoacceptors were analysed in the obligate intracellular symbiotic bacterium, Buchnera aphidicola from the aphid Acyrthosiphon pisum, using a dedicated 35mer oligonucleotide microarray. Buchnera is archetypal of organisms living with minimal metabolic requirements and presents a reduced genome with high-evolutionary rate. Codonusage in Buchnera has been overcome by the high mutational bias towards AT bases. However, several lines of evidence for codon usage selection are given here. A significant correlation was found between tRNA relative abundances and codon composition of Buchnera genes. A significant codon usage bias was found for the choice of rare codons in Buchnera: C-ending codons are preferred in highly expressed genes, whereas G-ending codons are avoided. This bias is not explained by GC skew in the bacteria and might correspond to a selection for perfect matching between codon-anticodon pairs for some essential amino acids in Buchnera proteins. Nutritional stress applied to the aphid host induced a significant overexpression of most of the tRNA isoacceptors in bacteria. Although, molecular regulation of the tRNA operons in Buchnera was not investigated, a correlation between relative expression levels and organization in transcription unit was found in the genome of Buchnera.
The first report of prion-related protein gene (PRNT) polymorphisms in goat.

PubMed

Kim, Yong-Chan; Jeong, Byung-Hoon

2017-06-01

Prion protein is encoded by the prion protein gene (PRNP). Polymorphisms of several members of the prion gene family have shown association with prion diseases in several species. Recent studies on a novel member of the prion gene family in rams have shown that prion-related protein gene (PRNT) has a linkage with codon 26 of prion-like protein (PRND). In a previous study, codon 26 polymorphism of PRND has shown connection with PRNP haplotype which is strongly associated with scrapie vulnerability. In addition, the genotype of a single nucleotide polymorphism (SNP) at codon 26 of PRND is related to fertilisation capacity. These findings necessitate studies on the SNP of PRNT gene which is connected with PRND. In goat, several polymorphism studies have been performed for PRNP, PRND, and shadow of prion protein gene (SPRN). However, polymorphism on PRNT has not been reported. Hence, the objective of this study was to determine the genotype and allelic distribution of SNPs of PRNT in 238 Korean native goats and compare PRNT DNA sequences between Korean native goats and several ruminant species. A total of five SNPs, including PRNT c.-114G > T, PRNT c.-58A > G in the upstream of PRNT gene, PRNT c.71C > T (p.Ala24Val) and PRNT c.102G > A in the open reading frame (ORF) and c.321C > T in the downstream of PRNT gene, were found in this study. All five SNPs of caprine PRNT gene in Korean native goat are in complete linkage disequilibrium (LD) with a D' value of 1.0. Interestingly, comparative sequence analysis of the PRNT gene revealed five mismatches between DNA sequences of Korean native goats and those of goats deposited in the GenBank. Korean native black goats also showed 5 mismatches in PRNT ORF with cattle. To the best of our knowledge, this is the first genetic research of the PRNT gene in goat.
Premature termination of SMARCB1 translation may be followed by reinitiation in schwannomatosis-associated schwannomas, but results in absence of SMARCB1 expression in rhabdoid tumors.

PubMed

Hulsebos, Theo J M; Kenter, Susan; Verhagen, Wim I M; Baas, Frank; Flucke, Uta; Wesseling, Pieter

2014-09-01

In schwannomatosis, germline SMARCB1 mutations predispose to the development of multiple schwannomas, but not vestibular schwannomas. Many of these are missense or splice-site mutations or in-frame deletions, which are presumed to result in the synthesis of altered SMARCB1 proteins. However, also nonsense and frameshift mutations, which are characteristic for rhabdoid tumors and are predicted to result in the absence of SMARCB1 protein via nonsense-mediated mRNA decay, have been reported in schwannomatosis patients. We investigated the consequences of four of the latter mutations, i.e. c.30delC, c.34C>T, c.38delA, and c.46A>T, all in SMARCB1-exon 1. We could demonstrate for the c.30delC and c.34C>T mutations that the respective mRNAs were still present in the schwannomas of the patients. We hypothesized that these were prevented from degradation by translation reinitiation at the AUG codon encoding methionine at position 27 of the SMARCB1 protein. To test this, we expressed the mutations in MON cells, rhabdoid cells without endogenous SMARCB1 protein, and found that all four resulted in synthesis of the N-terminally truncated protein. Mutation of the reinitiation methionine codon into a valine codon prevented synthesis of the truncated protein, thereby confirming its identity. Immunohistochemistry with a SMARCB1 antibody revealed a mosaic staining pattern in schwannomas of the patients with the c.30delC and c.34C>T mutations. Our findings support the concept that, in contrast to the complete absence of SMARCB1 expression in rhabdoid tumors, altered SMARCB1 proteins with modified activity and reduced (mosaic) expression are formed in the schwannomas of schwannomatosis patients with a germline SMARCB1 mutation.
Balanced Codon Usage Optimizes Eukaryotic Translational Efficiency

PubMed Central

Qian, Wenfeng; Yang, Jian-Rong; Pearson, Nathaniel M.; Maclean, Calum; Zhang, Jianzhi

2012-01-01

Cellular efficiency in protein translation is an important fitness determinant in rapidly growing organisms. It is widely believed that synonymous codons are translated with unequal speeds and that translational efficiency is maximized by the exclusive use of rapidly translated codons. Here we estimate the in vivo translational speeds of all sense codons from the budding yeast Saccharomyces cerevisiae. Surprisingly, preferentially used codons are not translated faster than unpreferred ones. We hypothesize that this phenomenon is a result of codon usage in proportion to cognate tRNA concentrations, the optimal strategy in enhancing translational efficiency under tRNA shortage. Our predicted codon–tRNA balance is indeed observed from all model eukaryotes examined, and its impact on translational efficiency is further validated experimentally. Our study reveals a previously unsuspected mechanism by which unequal codon usage increases translational efficiency, demonstrates widespread natural selection for translational efficiency, and offers new strategies to improve synthetic biology. PMID:22479199
Molecular characterization of Banana streak virus isolate from Musa Acuminata in China.

PubMed

Zhuang, Jun; Wang, Jian-Hua; Zhang, Xin; Liu, Zhi-Xin

2011-12-01

Banana streak virus (BSV), a member of genus Badnavirus, is a causal agent of banana streak disease throughout the world. The genetic diversity of BSVs from different regions of banana plantations has previously been investigated, but there are relatively few reports of the genetic characteristic of episomal (non-integrated) BSV genomes isolated from China. Here, the complete genome, a total of 7722bp (GenBank accession number DQ092436), of an isolate of Banana streak virus (BSV) on cultivar Cavendish (BSAcYNV) in Yunnan, China was determined. The genome organises in the typical manner of badnaviruses. The intergenic region of genomic DNA contains a large stem-loop, which may contribute to the ribosome shift into the following open reading frames (ORFs). The coding region of BSAcYNV consists of three overlapping ORFs, ORF1 with a non-AUG start codon and ORF2 encoding two small proteins are individually involved in viral movement and ORF3 encodes a polyprotein. Besides the complete genome, a defective genome lacking the whole RNA leader region and a majority of ORF1 and which encompasses 6525bp was also isolated and sequenced from this BSV DNA reservoir in infected banana plants. Sequence analyses showed that BSAcYNV has closest similarity in terms of genome organization and the coding assignments with an BSV isolate from Vietnam (BSAcVNV). The corresponding coding regions shared identities of 88% and -95% at nucleotide and amino acid levels, respectively. Phylogenetic analysis also indicated BSAcYNV shared the closest geographical evolutionary relationship to BSAcVNV among sequenced banana streak badnaviruses.
Codon optimization of the adenoviral fiber negatively impacts structural protein expression and viral fitness

NASA Astrophysics Data System (ADS)

Villanueva, Eneko; Martí-Solano, Maria; Fillat, Cristina

2016-06-01

Codon usage adaptation of lytic viruses to their hosts is determinant for viral fitness. In this work, we analyzed the codon usage of adenoviral proteins by principal component analysis and assessed their codon adaptation to the host. We observed a general clustering of adenoviral proteins according to their function. However, there was a significant variation in the codon preference between the host-interacting fiber protein and the rest of structural late phase proteins, with a non-optimal codon usage of the fiber. To understand the impact of codon bias in the fiber, we optimized the Adenovirus-5 fiber to the codon usage of the hexon structural protein. The optimized fiber displayed increased expression in a non-viral context. However, infection with adenoviruses containing the optimized fiber resulted in decreased expression of the fiber and of wild-type structural proteins. Consequently, this led to a drastic reduction in viral release. The insertion of an exogenous optimized protein as a late gene in the adenovirus with the optimized fiber further interfered with viral fitness. These results highlight the importance of balancing codon usage in viral proteins to adequately exploit cellular resources for efficient infection and open new opportunities to regulate viral fitness for virotherapy and vaccine development.
Production of 2-ketoisocaproate with Corynebacterium glutamicum strains devoid of plasmids and heterologous genes.

PubMed

Vogt, Michael; Haas, Sabine; Polen, Tino; van Ooyen, Jan; Bott, Michael

2015-03-01

2-Ketoisocaproate (KIC), the last intermediate in l-leucine biosynthesis, has various medical and industrial applications. After deletion of the ilvE gene for transaminase B in l-leucine production strains of Corynebacterium glutamicum, KIC became the major product, however, the strains were auxotrophic for l-isoleucine. To avoid auxotrophy, reduction of IlvE activity by exchanging the ATG start codon of ilvE by GTG was tested instead of an ilvE deletion. The resulting strains were indeed able to grow in glucose minimal medium without amino acid supplementation, but at the cost of lowered growth rates and KIC production parameters. The best production performance was obtained with strain MV-KICF1, which carried besides the ilvE start codon exchange three copies of a gene for a feedback-resistant 2-isopropylmalate synthase, one copy of a gene for a feedback-resistant acetohydroxyacid synthase and deletions of ltbR and iolR encoding transcriptional regulators. In the presence of 1 mM l-isoleucine, MV-KICF1 accumulated 47 mM KIC (6.1 g l(-1)) with a yield of 0.20 mol/mol glucose and a volumetric productivity of 1.41 mmol KIC l(-1) h(-1). Since MV-KICF1 is plasmid free and lacks heterologous genes, it is an interesting strain for industrial application and as platform for the production of KIC-derived compounds, such as 3-methyl-1-butanol. © 2014 The Authors. Microbial Biotechnology published by John Wiley & Sons Ltd and Society for Applied Microbiology.
Tail-extension following the termination codon is critical for release of the nascent chain from membrane-bound ribosomes in a reticulocyte lysate cell-free system.

PubMed

Takahara, Michiyo; Sakaue, Haruka; Onishi, Yukiko; Yamagishi, Marifu; Kida, Yuichiro; Sakaguchi, Masao

2013-01-11

Nascent chain release from membrane-bound ribosomes by the termination codon was investigated using a cell-free translation system from rabbit supplemented with rough microsomal membrane vesicles. Chain release was extremely slow when mRNA ended with only the termination codon. Tail extension after the termination codon enhanced the release of the nascent chain. Release reached plateau levels with tail extension of 10 bases. This requirement was observed with all termination codons: TAA, TGA and TAG. Rapid release was also achieved by puromycin even in the absence of the extension. Efficient translation termination cannot be achieved in the presence of only a termination codon on the mRNA. Tail extension might be required for correct positioning of the termination codon in the ribosome and/or efficient recognition by release factors. Copyright © 2012. Published by Elsevier Inc.
A common periodic table of codons and amino acids.

PubMed

Biro, J C; Benyó, B; Sansom, C; Szlávecz, A; Fördös, G; Micsik, T; Benyó, Z

2003-06-27

A periodic table of codons has been designed where the codons are in regular locations. The table has four fields (16 places in each) one with each of the four nucleotides (A, U, G, C) in the central codon position. Thus, AAA (lysine), UUU (phenylalanine), GGG (glycine), and CCC (proline) were placed into the corners of the fields as the main codons (and amino acids) of the fields. They were connected to each other by six axes. The resulting nucleic acid periodic table showed perfect axial symmetry for codons. The corresponding amino acid table also displaced periodicity regarding the biochemical properties (charge and hydropathy) of the 20 amino acids and the position of the stop signals. The table emphasizes the importance of the central nucleotide in the codons and predicts that purines control the charge while pyrimidines determine the polarity of the amino acids. This prediction was experimentally tested.
Codes in the codons: construction of a codon/amino acid periodic table and a study of the nature of specific nucleic acid-protein interactions.

PubMed

Benyo, B; Biro, J C; Benyo, Z

2004-01-01

The theory of "codon-amino acid coevolution" was first proposed by Woese in 1967. It suggests that there is a stereochemical matching - that is, affinity - between amino acids and certain of the base triplet sequences that code for those amino acids. We have constructed a common periodic table of codons and amino acids, where the nucleic acid table showed perfect axial symmetry for codons and the corresponding amino acid table also displayed periodicity regarding the biochemical properties (charge and hydrophobicity) of the 20 amino acids and the position of the stop signals. The table indicates that the middle (2/sup nd/) amino acid in the codon has a prominent role in determining some of the structural features of the amino acids. The possibility that physical contact between codons and amino acids might exist was tested on restriction enzymes. Many recognition site-like sequences were found in the coding sequences of these enzymes and as many as 73 examples of codon-amino acid co-location were observed in the 7 known 3D structures (December 2003) of endonuclease-nucleic acid complexes. These results indicate that the smallest possible units of specific nucleic acid-protein interaction are indeed the stereochemically compatible codons and amino acids.
A novel nuclear genetic code alteration in yeasts and the evolution of codon reassignment in eukaryotes

PubMed Central

Mühlhausen, Stefanie; Findeisen, Peggy; Plessmann, Uwe; Urlaub, Henning; Kollmar, Martin

2016-01-01

The genetic code is the cellular translation table for the conversion of nucleotide sequences into amino acid sequences. Changes to the meaning of sense codons would introduce errors into almost every translated message and are expected to be highly detrimental. However, reassignment of single or multiple codons in mitochondria and nuclear genomes, although extremely rare, demonstrates that the code can evolve. Several models for the mechanism of alteration of nuclear genetic codes have been proposed (including “codon capture,” “genome streamlining,” and “ambiguous intermediate” theories), but with little resolution. Here, we report a novel sense codon reassignment in Pachysolen tannophilus, a yeast related to the Pichiaceae. By generating proteomics data and using tRNA sequence comparisons, we show that Pachysolen translates CUG codons as alanine and not as the more usual leucine. The Pachysolen tRNACAG is an anticodon-mutated tRNAAla containing all major alanine tRNA recognition sites. The polyphyly of the CUG-decoding tRNAs in yeasts is best explained by a tRNA loss driven codon reassignment mechanism. Loss of the CUG-tRNA in the ancient yeast is followed by gradual decrease of respective codons and subsequent codon capture by tRNAs whose anticodon is not part of the aminoacyl-tRNA synthetase recognition region. Our hypothesis applies to all nuclear genetic code alterations and provides several testable predictions. We anticipate more codon reassignments to be uncovered in existing and upcoming genome projects. PMID:27197221
ChloroMitoCU: Codon patterns across organelle genomes for functional genomics and evolutionary applications.

PubMed

Sablok, Gaurav; Chen, Ting-Wen; Lee, Chi-Ching; Yang, Chi; Gan, Ruei-Chi; Wegrzyn, Jill L; Porta, Nicola L; Nayak, Kinshuk C; Huang, Po-Jung; Varotto, Claudio; Tang, Petrus

2017-06-01

Organelle genomes are widely thought to have arisen from reduction events involving cyanobacterial and archaeal genomes, in the case of chloroplasts, or α-proteobacterial genomes, in the case of mitochondria. Heterogeneity in base composition and codon preference has long been the subject of investigation of topics ranging from phylogenetic distortion to the design of overexpression cassettes for transgenic expression. From the overexpression point of view, it is critical to systematically analyze the codon usage patterns of the organelle genomes. In light of the importance of codon usage patterns in the development of hyper-expression organelle transgenics, we present ChloroMitoCU, the first-ever curated, web-based reference catalog of the codon usage patterns in organelle genomes. ChloroMitoCU contains the pre-compiled codon usage patterns of 328 chloroplast genomes (29,960 CDS) and 3,502 mitochondrial genomes (49,066 CDS), enabling genome-wide exploration and comparative analysis of codon usage patterns across species. ChloroMitoCU allows the phylogenetic comparison of codon usage patterns across organelle genomes, the prediction of codon usage patterns based on user-submitted transcripts or assembled organelle genes, and comparative analysis with the pre-compiled patterns across species of interest. ChloroMitoCU can increase our understanding of the biased patterns of codon usage in organelle genomes across multiple clades. ChloroMitoCU can be accessed at: http://chloromitocu.cgu.edu.tw/. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
KRAS exon 2 codon 13 mutation is associated with a better prognosis than codon 12 mutation following lung metastasectomy in colorectal cancer

PubMed Central

Renaud, Stéphane; Guerrera, Francesco; Seitlinger, Joseph; Costardi, Lorena; Schaeffer, Mickaël; Romain, Benoit; Mossetti, Claudio; Claire-Voegeli, Anne; Filosso, Pier Luigi; Legrain, Michèle; Ruffini, Enrico; Falcoz, Pierre-Emmanuel; Oliaro, Alberto; Massard, Gilbert

2017-01-01

Introduction The utilization of molecular markers as routinely used biomarkers is steadily increasing. We aimed to evaluate the potential different prognostic values of KRAS exon 2 codons 12 and 13 after lung metastasectomy in colorectal cancer (CRC). Results KRAS codon 12 mutations were observed in 116 patients (77%), whereas codon 13 mutations were observed in 34 patients (23%). KRAS codon 13 mutations were associated with both longer time to pulmonary recurrence (TTPR) (median TTPR: 78 months (95% CI: 50.61–82.56) vs 56 months (95% CI: 68.71–127.51), P = 0.008) and improved overall survival (OS) (median OS: 82 months vs 54 months (95% CI: 48.93–59.07), P = 0.009). Multivariate analysis confirmed that codon 13 mutations were associated with better outcomes (TTPR: HR: 0.40 (95% CI: 0.17–0.93), P = 0.033); OS: HR: 0.39 (95% CI: 0.14–1.07), P = 0.07). Otherwise, no significant difference in OS (P = 0.78) or TTPR (P = 0.72) based on the type of amino-acid substitutions was observed among KRAS codon 12 mutations. Materials and Methods We retrospectively reviewed data from 525 patients who underwent a lung metastasectomy for CRC in two departments of thoracic surgery from 1998 to 2015 and focused on 150 patients that had KRAS exon 2 codon 12/13 mutations. Conclusions KRAS exon 2 codon 13 mutations, compared to codon 12 mutations, seem to be associated with better outcomes following lung metastasectomy in CRC. Prospective multicenter studies are necessary to fully understand the prognostic value of KRAS mutations in the lung metastases of CRC. PMID:27911859
Transcripts of the NADH-dehydrogenase subunit 3 gene are differentially edited in Oenothera mitochondria.

PubMed Central

Schuster, W; Wissinger, B; Unseld, M; Brennicke, A

1990-01-01

A number of cytosines are altered to be recognized as uridines in transcripts of the nad3 locus in mitochondria of the higher plant Oenothera. Such nucleotide modifications can be found at 16 different sites within the nad3 coding region. Most of these alterations in the mRNA sequence change codon identities to specify amino acids better conserved in evolution. Individual cDNA clones differ in their degree of editing at five nucleotide positions, three of which are silent, while two lead to codon alterations specifying different amino acids. None of the cDNA clones analysed is maximally edited at all possible sites, suggesting slow processing or lowered stringency of editing at these nucleotides. Differentially edited transcripts could be editing intermediates or could code for differing polypeptides. Two edited nucleotides in an open reading frame located upstream of nad3 change two amino acids in the deduced polypeptide. Part of the well-conserved ribosomal protein gene rps12 also encoded downstream of nad3 in other plants, is lost in Oenothera mitochondria by recombination events. The functional rps12 protein must be imported from the cytoplasm since the deleted sequences of this gene are not found in the Oenothera mitochondrial genome. The pseudogene sequence is not edited at any nucleotide position. Images Fig. 3. Fig. 4. Fig. 7. PMID:1688531
The Selenocysteine-Specific Elongation Factor Contains Unique Sequences That Are Required for Both Nuclear Export and Selenocysteine Incorporation.

PubMed

Dubey, Aditi; Copeland, Paul R

2016-01-01

Selenocysteine (Sec) is a critical residue in at least 25 human proteins that are essential for antioxidant defense and redox signaling in cells. Sec is inserted into proteins cotranslationally by the recoding of an in-frame UGA termination codon to a Sec codon. In eukaryotes, this recoding event requires several specialized factors, including a dedicated, Sec-specific elongation factor called eEFSec, which binds Sec-tRNASec with high specificity and delivers it to the ribosome for selenoprotein production. Unlike most translation factors, including the canonical elongation factor eEF1A, eEFSec readily localizes to the nucleus of mammalian cells and shuttles between the cytoplasmic and nuclear compartments. The functional significance of eEFSec's nuclear localization has remained unclear. In this study, we have examined the subcellular localization of eEFSec in the context of altered Sec incorporation to demonstrate that reduced selenoprotein production does not correlate with changes in the nuclear localization of eEFSec. In addition, we identify several novel sequences of the protein that are essential for localization as well as Sec insertion activity, and show that eEFSec utilizes CRM1-mediated nuclear export pathway. Our findings argue for two distinct pools of eEFSec in the cell, where the cytoplasmic pool participates in Sec incorporation and the nuclear pool may be involved in an as yet unknown function.
Tetrahymena thermophila acidic ribosomal protein L37 contains an archaebacterial type of C-terminus.

PubMed

Hansen, T S; Andreasen, P H; Dreisig, H; Højrup, P; Nielsen, H; Engberg, J; Kristiansen, K

1991-09-15

We have cloned and characterized a Tetrahymena thermophila macronuclear gene (L37) encoding the acidic ribosomal protein (A-protein) L37. The gene contains a single intron located in the 3'-part of the coding region. Two major and three minor transcription start points (tsp) were mapped 39 to 63 nucleotides upstream from the translational start codon. The uppermost tsp mapped to the first T in a putative T. thermophila RNA polymerase II initiator element, TATAA. The coding region of L37 predicts a protein of 109 amino acid (aa) residues. A substantial part of the deduced aa sequence was verified by protein sequencing. The T. thermophila L37 clearly belongs to the P1-type family of eukaryotic A-proteins, but the C-terminal region has the hallmarks of archaebacterial A-proteins.
Large-Scale Genomic Analysis of Codon Usage in Dengue Virus and Evaluation of Its Phylogenetic Dependence

PubMed Central

Lara-Ramírez, Edgar E.; Salazar, Ma Isabel; López-López, María de Jesús; Salas-Benito, Juan Santiago; Sánchez-Varela, Alejandro

2014-01-01

The increasing number of dengue virus (DENV) genome sequences available allows identifying the contributing factors to DENV evolution. In the present study, the codon usage in serotypes 1–4 (DENV1–4) has been explored for 3047 sequenced genomes using different statistics methods. The correlation analysis of total GC content (GC) with GC content at the three nucleotide positions of codons (GC1, GC2, and GC3) as well as the effective number of codons (ENC, ENCp) versus GC3 plots revealed mutational bias and purifying selection pressures as the major forces influencing the codon usage, but with distinct pressure on specific nucleotide position in the codon. The correspondence analysis (CA) and clustering analysis on relative synonymous codon usage (RSCU) within each serotype showed similar clustering patterns to the phylogenetic analysis of nucleotide sequences for DENV1–4. These clustering patterns are strongly related to the virus geographic origin. The phylogenetic dependence analysis also suggests that stabilizing selection acts on the codon usage bias. Our analysis of a large scale reveals new feature on DENV genomic evolution. PMID:25136631

Bicluster Pattern of Codon Context Usages between Flavivirus and Vector Mosquito Aedes aegypti: Relevance to Infection and Transcriptional Response of Mosquito Genes

PubMed Central

Behura, Susanta K.; Severson, David W.

2014-01-01

The mosquito Aedes aegypti is the primary vector of dengue virus (DENV) infection in most of the subtropical and tropical countries. Besides DENV, yellow fever virus (YFV) is also transmitted by A. aegypti. Susceptibility of A. aegypti to West Nile virus (WNV) has also been confirmed. Although studies have indicated correlation of codon bias between flaviviridae and their animal/insect hosts, it is not clear if codon sequences have any relation to susceptibility of A. aegypti to DENV, YFV and WNV. In the current study, usages of codon context sequences (codon pairs for neighboring amino acids) of the vector (A. aegypti) genome as well as the flaviviral genomes are investigated. We used bioinformatics methods to quantify codon context bias in a genome-wide manner of A. aegypti as well as DENV, WNV and YFV sequences. Mutual information statistics was applied to perform bicluster analysis of codon context bias between vector and flaviviral sequences. Functional relevance of the bicluster pattern was inferred from published microarray data. Our study shows that codon context bias of DENV, WNV and YFV sequences varies in a bicluster manner with that of specific sets of genes of A. aegypti. Many of these mosquito genes are known to be differentially expressed in response to flaviviral infection suggesting that codon context sequences of A. aegypti and the flaviviruses may play a role in the susceptible interaction between flaviviruses and this mosquito. The bias inusages of codon context sequences likely has a functional association with susceptibility of A. aegypti to flaviviral infection. The results from this study will allow us to conduct hypothesis driven tests to examine the role of codon contexts bias in evolution of vector-virus interactions at the molecular level. PMID:24838953
Codon usage and amino acid usage influence genes expression level.

PubMed

Paul, Prosenjit; Malakar, Arup Kumar; Chakraborty, Supriyo

2018-02-01

Highly expressed genes in any species differ in the usage frequency of synonymous codons. The relative recurrence of an event of the favored codon pair (amino acid pairs) varies between gene and genomes due to varying gene expression and different base composition. Here we propose a new measure for predicting the gene expression level, i.e., codon plus amino bias index (CABI). Our approach is based on the relative bias of the favored codon pair inclination among the genes, illustrated by analyzing the CABI score of the Medicago truncatula genes. CABI showed strong correlation with all other widely used measures (CAI, RCBS, SCUO) for gene expression analysis. Surprisingly, CABI outperforms all other measures by showing better correlation with the wet-lab data. This emphasizes the importance of the neighboring codons of the favored codon in a synonymous group while estimating the expression level of a gene.
Functional Analyses of c.2268dup in Thyroid Peroxidase Gene Associated with Goitrous Congenital Hypothyroidism

PubMed Central

Harun, Fatimah; Jalaludin, Muhammad Yazid; Lim, Chor Yin; Ng, Khoon Leong

2014-01-01

The c.2268dup mutation in thyroid peroxidase (TPO) gene was reported to be a founder mutation in Taiwanese patients with dyshormonogenetic congenital hypothyroidism (CH). The functional impact of the mutation is not well documented. In this study, homozygous c.2268dup mutation was detected in two Malaysian-Chinese sisters with goitrous CH. Normal and alternatively spliced TPO mRNA transcripts were present in thyroid tissues of the two sisters. The abnormal transcript contained 34 nucleotides originating from intron 12. The c.2268dup is predicted to generate a premature termination codon (PTC) at position 757 (p.Glu757X). Instead of restoring the normal reading frame, the alternatively spliced transcript has led to another stop codon at position 740 (p.Asp739ValfsX740). The two PTCs are located at 116 and 201 nucleotides upstream of the exons 13/14 junction fulfilling the requirement for a nonsense-mediated mRNA decay (NMD). Quantitative RT-PCR revealed an abundance of unidentified transcripts believed to be associated with the NMD. TPO enzyme activity was not detected in both patients, even though a faint TPO band of about 80 kD was present. In conclusion, the c.2268dup mutation leads to the formation of normal and alternatively spliced TPO mRNA transcripts with a consequential loss of TPO enzymatic activity in Malaysian-Chinese patients with goitrous CH. PMID:24745015
Nonstructural proteins nsP3 and nsP4 of Ross River and O'Nyong-nyong viruses: sequence and comparison with those of other alphaviruses.

PubMed

Strauss, E G; Levinson, R; Rice, C M; Dalrymple, J; Strauss, J H

1988-05-01

We have sequenced the nsP3 and nsP4 region of two alphaviruses, Ross River virus and O'Nyong-nyong virus, in order to examine these viruses for the presence or absence of an opal termination codon present between nsP3 and nsP4 in many alphaviruses. We found that Ross River virus possesses an in-phase opal termination codon between nsP3 and nsP4, whereas in O'Nyong-nyong virus this termination codon is replaced by an arginine codon. Previous studies have shown that two other alphaviruses, Sindbis virus and Middelburg virus, possess an opal termination codon separating nsP3 and nsP4 [E.G. Strauss, C.M. Rice, and J.H. Strauss (1983), Proc. Natl. Acad. Sci. USA 80, 5271-5275], whereas Semliki Forest virus possesses an arginine codon in lieu of the opal codon [K. Takkinen (1986), Nucleic Acids Res. 14, 5667-5682]. Thus, of the five alphaviruses examined to date, three possess the opal codon and two do not. Production of nsP4 requires readthrough of the opal codon in those alphaviruses that possess this termination codon and the function of the termination codon may be to regulate the amount of nsP4 produced. It is an open question then as to whether alphaviruses with no termination codon use other mechanisms to regulate the activity of this gene. The nsP4s of these five alphaviruses are highly conserved, sharing 71-76% amino acid sequence similarity, and all five contain the Gly-Asp-Asp motif found in many RNA virus replicases. The nsP3s are somewhat less conserved, sharing 52-73% amino acid sequence similarity throughout most of the protein, but each possesses a nonconserved C-terminal domain of 134 to 246 amino acids of unknown function.
Efficient Reassignment of a Frequent Serine Codon in Wild-Type Escherichia coli.

PubMed

Ho, Joanne M; Reynolds, Noah M; Rivera, Keith; Connolly, Morgan; Guo, Li-Tao; Ling, Jiqiang; Pappin, Darryl J; Church, George M; Söll, Dieter

2016-02-19

Expansion of the genetic code through engineering the translation machinery has greatly increased the chemical repertoire of the proteome. This has been accomplished mainly by read-through of UAG or UGA stop codons by the noncanonical aminoacyl-tRNA of choice. While stop codon read-through involves competition with the translation release factors, sense codon reassignment entails competition with a large pool of endogenous tRNAs. We used an engineered pyrrolysyl-tRNA synthetase to incorporate 3-iodo-l-phenylalanine (3-I-Phe) at a number of different serine and leucine codons in wild-type Escherichia coli. Quantitative LC-MS/MS measurements of amino acid incorporation yields carried out in a selected reaction monitoring experiment revealed that the 3-I-Phe abundance at the Ser208AGU codon in superfolder GFP was 65 ± 17%. This method also allowed quantification of other amino acids (serine, 33 ± 17%; phenylalanine, 1 ± 1%; threonine, 1 ± 1%) that compete with 3-I-Phe at both the aminoacylation and decoding steps of translation for incorporation at the same codon position. Reassignments of different serine (AGU, AGC, UCG) and leucine (CUG) codons with the matching tRNA(Pyl) anticodon variants were met with varying success, and our findings provide a guideline for the choice of sense codons to be reassigned. Our results indicate that the 3-iodo-l-phenylalanyl-tRNA synthetase (IFRS)/tRNA(Pyl) pair can efficiently outcompete the cellular machinery to reassign select sense codons in wild-type E. coli.
Lateral Displacement and Shear Lag Effect of Combination of Diagrid-Frame

NASA Astrophysics Data System (ADS)

Abd. Samat, Roslida; Chua, Fong Teng; Mustakim, Nur Akmal Hayati Mohd; Saad, Sariffuddin; Abu Bakar, Suhaimi

2018-03-01

Diagrid system, which is the portmanteau of diagonal grid member, is an exterior lateral load resisting system for tall building that has gained a wide acceptance in the design of tall buildings. There is abundance of researches that studied the efficiency of diagrid systems, which are constructed from the ground level to the top of the buildings in resisting the lateral load. Nevertheless, no study had been performed on the effectiveness of the diagrid that is constructed above other tall building systems despite the existence of a few buildings in the world that employ such system. The objective of this research is to understand the behavior of the lateral displacement and shear lag effect due to wind load when the diagrid structure is constructed above a frame. Models of 60-story buildings with a footprint of 36m x 36m were analyzed by using Staad.Pro software. The level where the diagrid members started was altered. The lateral displacement was reduced to 60.6 percent and 41 percent of the lateral displacement of a building with full frame system when the combination of frame-diagrid that had the diagrid started at Level 1 and Level 45, respectively were employed. Furthermore, the shear lag ratio was reduced from 1.7 to 1.3 when the level where the diagrid started was increased from Level 1 to Level 45.
Non-uniqueness of factors constraint on the codon usage in Bombyx mori.

PubMed

Jia, Xian; Liu, Shuyu; Zheng, Hao; Li, Bo; Qi, Qi; Wei, Lei; Zhao, Taiyi; He, Jian; Sun, Jingchen

2015-05-06

The analysis of codon usage is a good way to understand the genetic and evolutionary characteristics of an organism. However, there are only a few reports related with the codon usage of the domesticated silkworm, Bombyx mori (B. mori). Hence, the codon usage of B. mori was analyzed here to reveal the constraint factors and it could be helpful to improve the bioreactor based on B. mori. A total of 1,097 annotated mRNA sequences from B. mori were analyzed, revealing there is only a weak codon bias. It also shows that the gene expression level is related to the GC content, and the amino acids with higher general average hydropathicity (GRAVY) and aromaticity (Aromo). And the genes on the primary axis are strongly positively correlated with the GC content, and GC3s. Meanwhile, the effective number of codons (ENc) is strongly correlated with codon adaptation index (CAI), gene length, and Aromo values. However, the ENc values are correlated with the second axis, which indicates that the codon usage in B. mori is affected by not only mutation pressure and natural selection, but also nucleotide composition and the gene expression level. It is also associated with Aromo values, and gene length. Additionally, B. mori has a greater relative discrepancy in codon preferences with Drosophila melanogaster (D. melanogaster) or Saccharomyces cerevisiae (S. cerevisiae) than with Arabidopsis thaliana (A. thaliana), Escherichia coli (E. coli), or Caenorhabditis elegans (C. elegans). The codon usage bias in B. mori is relatively weak, and many influence factors are found here, such as nucleotide composition, mutation pressure, natural selection, and expression level. Additionally, it is also associated with Aromo values, and gene length. Among them, natural selection might play a major role. Moreover, the "optimal codons" of B. mori are all encoded by G and C, which provides useful information for enhancing the gene expression in B. mori through codon optimization.
Factors influencing readthrough therapy for frequent cystic fibrosis premature termination codons

PubMed Central

Pranke, Iwona; Bidou, Laure; Martin, Natacha; Blanchet, Sandra; Hatton, Aurélie; Karri, Sabrina; Cornu, David; Costes, Bruno; Chevalier, Benoit; Tondelier, Danielle; Coupet, Matthieu; Edelman, Aleksander; Fanen, Pascale; Namy, Olivier; Sermet-Gaudelus, Isabelle

2018-01-01

Premature termination codons (PTCs) are generally associated with severe forms of genetic diseases. Readthrough of in-frame PTCs using small molecules is a promising therapeutic approach. Nonetheless, the outcome of preclinical studies has been low and variable. Treatment efficacy depends on: 1) the level of drug-induced readthrough, 2) the amount of target transcripts, and 3) the activity of the recoded protein. The aim of the present study was to identify, in the cystic fibrosis transmembrane conductance regulator (CFTR) model, recoded channels from readthrough therapy that may be enhanced using CFTR modulators. First, drug-induced readthrough of 15 PTCs was measured using a dual reporter system under basal conditions and in response to gentamicin and negamycin. Secondly, exon skipping associated with these PTCs was evaluated with a minigene system. Finally, incorporated amino acids were identified by mass spectrometry and the function of the predicted recoded CFTR channels corresponding to these 15 PTCs was measured. Nonfunctional channels were subjected to CFTR-directed ivacaftor-lumacaftor treatments. The results demonstrated that CFTR modulators increased activity of recoded channels, which could also be confirmed in cells derived from a patient. In conclusion, this work will provide a framework to adapt treatments to the patient's genotype by identifying the most efficient molecule for each PTC and the recoded channels needing co-therapies to rescue channel function. PMID:29497617
Stringent Nucleotide Recognition by the Ribosome at the Middle Codon Position.

PubMed

Liu, Wei; Shin, Dongwon; Ng, Martin; Sanbonmatsu, Karissa Y; Tor, Yitzhak; Cooperman, Barry S

2017-08-29

Accurate translation of the genetic code depends on mRNA:tRNA codon:anticodon base pairing. Here we exploit an emissive, isosteric adenosine surrogate that allows direct measurement of the kinetics of codon:anticodon University of California base formation during protein synthesis. Our results suggest that codon:anticodon base pairing is subject to tighter constraints at the middle position than at the 5'- and 3'-positions, and further suggest a sequential mechanism of formation of the three base pairs in the codon:anticodon helix.
General Solution for Theoretical Packet Data Loss Rate

NASA Technical Reports Server (NTRS)

Lansdowne, Chatwin; Schlesinger, Adam

2006-01-01

Communications systems which transfer blocks ("frames") of data must use a marker ("frame synchronization pattern") for identifying where a block begins. A technique ("frame synchronization strategy") is used to locate the start of each frame and maintain synchronization as additional blocks are processed. A device which strips out the frame synchronization pattern [FSP] and provides an "end of frame" pulse is called a frame synchronizer. As clock and data errors are introduced into the system, the start-of-block marker becomes displaced and/or corrupted. The capability of the frame synchronizer to stay locked to the pattern under these conditions is a figure of merit for the frame synchronization strategy. It is important to select a strategy which will stay locked nearly all the time at bit error rates where the data is usable. ("Bit error rate" [BER] is the fraction of binary bits which are inverted by passage through a communication system.) The fraction of frames that are discarded because the frame synchronizer is not locked is called "Percent Data Loss" or "Packet Data Loss rate" [PDL]. A general approach for accurately predicting PDL given BER was developed in Theoretical Percent Data Loss Calculation and Measurement Accuracy, T. P. Kelly, LESC-30554, December 1992. Kelly gave a solution in terms of matrix equations, and only addressed "level" channel encoding. This paper goes on to give a closed-form polynomial solution for the most common class of frame synchronizer strategies, and will also address "mark" and "space" (differential) channel encoding, and burst error environments. The paper is divided into four sections and follows a logically ordered presentation, with results developed before they are evaluated. However, most readers will derive the greatest benefit from this paper by treating the results as reference material. The result developed for differential encoding can be extended to other applications (like block codes) where the probability is needed that a block contains only a certain number of errors.
Di-codon Usage for Gene Classification

NASA Astrophysics Data System (ADS)

Nguyen, Minh N.; Ma, Jianmin; Fogel, Gary B.; Rajapakse, Jagath C.

Classification of genes into biologically related groups facilitates inference of their functions. Codon usage bias has been described previously as a potential feature for gene classification. In this paper, we demonstrate that di-codon usage can further improve classification of genes. By using both codon and di-codon features, we achieve near perfect accuracies for the classification of HLA molecules into major classes and sub-classes. The method is illustrated on 1,841 HLA sequences which are classified into two major classes, HLA-I and HLA-II. Major classes are further classified into sub-groups. A binary SVM using di-codon usage patterns achieved 99.95% accuracy in the classification of HLA genes into major HLA classes; and multi-class SVM achieved accuracy rates of 99.82% and 99.03% for sub-class classification of HLA-I and HLA-II genes, respectively. Furthermore, by combining codon and di-codon usages, the prediction accuracies reached 100%, 99.82%, and 99.84% for HLA major class classification, and for sub-class classification of HLA-I and HLA-II genes, respectively.
Comparative evolutionary genomics of Corynebacterium with special reference to codon and amino acid usage diversities.

PubMed

Pal, Shilpee; Sarkar, Indrani; Roy, Ayan; Mohapatra, Pradeep K Das; Mondal, Keshab C; Sen, Arnab

2018-02-01

The present study has been aimed to the comparative analysis of high GC composition containing Corynebacterium genomes and their evolutionary study by exploring codon and amino acid usage patterns. Phylogenetic study by MLSA approach, indel analysis and BLAST matrix differentiated Corynebacterium species in pathogenic and non-pathogenic clusters. Correspondence analysis on synonymous codon usage reveals that, gene length, optimal codon frequencies and tRNA abundance affect the gene expression of Corynebacterium. Most of the optimal codons as well as translationally optimal codons are C ending i.e. RNY (R-purine, N-any nucleotide base, and Y-pyrimidine) and reveal translational selection pressure on codon bias of Corynebacterium. Amino acid usage is affected by hydrophobicity, aromaticity, protein energy cost, etc. Highly expressed genes followed the cost minimization hypothesis and are less diverged at their synonymous positions of codons. Functional analysis of core genes shows significant difference in pathogenic and non-pathogenic Corynebacterium. The study reveals close relationship between non-pathogenic and opportunistic pathogenic Corynebaterium as well as between molecular evolution and survival niches of the organism.
START II Frame Work

DTIC Science & Technology

1992-04-01

Colonel Richard J. Barringer , USAF The Industrial College of the Armed Forces National Defense University Fort McNair, Washington, D.C. 20319-6000 93...Executive Research Project A94 START II Frame Work Lieutenant Colonel Donald E. Belche U.S. Air Force Faculty Research Advisor Colonel Richard J. Barringer ...1,659 - 3,456 0.48 Bombers 0.11 x 4,208 = 463 - 1,100 = 0.42 Total 9,064 2,248 - 5,956 = 0.38 charts 12 and 13 28 CHART 14 U.§ IA~aI ULA QC TU R AFTER 5
Representations of voluntary childlessness in the UK press, 1990-2008.

PubMed

Giles, David; Shaw, Rachel L; Morgan, William

2009-11-01

Representations of voluntary childlessness--the declaration by an individual that he or she does not wish to bear or raise children--were studied in 116 articles published in British national newspapers in the period 1990-2008. Media framing analysis was used to examine broad patterns of framing of the topic, identifying four frames: voluntary childlessness as an individual rights issue, as a form of resistance, as a social trend, and as a personal decision. These frames, it is argued, may act as potential 'scripts' for newspaper readers who are debating the decision to start a family.
Distance between RBS and AUG plays an important role in overexpression of recombinant proteins.

PubMed

Berwal, Sunil K; Sreejith, R K; Pal, Jayanta K

2010-10-15

The spacing between ribosome binding site (RBS) and AUG is crucial for efficient overexpression of genes when cloned in prokaryotic expression vectors. We undertook a brief study on the overexpression of genes cloned in Escherichia coli expression vectors, wherein the spacing between the RBS and the start codon was varied. SDS-PAGE and Western blot analysis indicated a high level of protein expression only in constructs where the spacing between RBS and AUG was approximately 40 nucleotides or more, despite the synthesis of the transcripts in the representative cases investigated. Copyright 2010 Elsevier Inc. All rights reserved.
Two novel mutations in the Norrie disease gene associated with the classical ocular phenotype.

PubMed

Caballero, M; Veske, A; Rodriguez, J J; Lugo, N; Schroeder, B; Hesse, L; Gal, A

1996-12-01

Norrie disease (ND) is a rare X-linked recessive disorder characterized by congenital blindness due to a degenerative and proliferative dysplasia of the neuroretina and, occasionally, by deafness and mental handicap. Here, we report two novel mutations detected in patients with the classical eye features of ND. Both the one-base pair insertion in exon II (544/545 insA) and the two-base pair deletion in the start codon (418delTG) of the ND gene predict a functional 'null allele', i.e. the complete absence of the corresponding gene product.
Cloning of Russian sturgeon (Acipenser gueldenstaedtii) growth hormone and insulin-like growth factor I and their expression in male and female fish during the first period of growth.

PubMed

Yom Din, S; Hurvitz, A; Goldberg, D; Jackson, K; Levavi-Sivan, B; Degani, G

2008-03-01

In this study, the GH and IGF-I of the Russian sturgeon (rs), Acipenser gueldenstaedtii, were cloned and sequenced, and their mRNA gene expression determined. In addition, to improve our understanding of the GH function, the expression of this hormone was assessed in young males and females. Moreover, IGF-I expression was quantified in young males and compared to that in older ones. The nucleotide sequence of the rsGH cDNA was 980 bp long and had an open reading frame of 642 bp, beginning with the first ATG codon at position 39 and ending with the stop codon at position 683. A putative polyadenylation signal, AATAAA, was recognized 42 bp upstream of the poly (A) tail. The position of the signal- peptide cleavage site was predicted to be at position 111, yielding a signal peptide of 24 amino-acids (aa) and a mature peptide of 190 aa. When the rsGH aa sequence was compared with other species, the highest degree of identity was found to be with mammalians (66-70% identity), followed by anguilliformes and amphibia (61%) and other fish (39-47%). The level of rsGH mRNA was discovered to be similar in pituitaries of females and males of 5 age groups (1, 2, 3, 4, and 5- yr-old). In females and males, the levels did not change dramatically during the first 5 yr of growth. The partial nucleotide sequence of the rsIGF-I was 445 bp long and had an open reading frame of 396 bp, beginning with the ATG codon at position 50. The position of the signal-peptide cleavage site was predicted to be at position 187, yielding a signal peptide of 44 aa. The highest level of IGF-I mRNA expression was recorded in the kidney of adult sturgeons. The IGF-I mRNA expression levels in the intestine, pituitary gland, and liver were not significantly different. Low levels of expression were found in the brain, heart, and muscle. In most tissues, there was no significant difference between mRNA levels of one and 5-yr-old fish. In conclusion, based on the GH-sequence analysis, A. gueldenstaedtii is genetically distant from other teleosts. The expression of the GH mRNA was similar in males and females, and its level remained constant during the first 5 yr of growth. While the IGF-I mRNA expression differed amongst various tissues, the level in each tissue was similar in 1 and 5-yr-old fish.
tRNA-mediated codon-biased translation in mycobacterial hypoxic persistence

NASA Astrophysics Data System (ADS)

Chionh, Yok Hian; McBee, Megan; Babu, I. Ramesh; Hia, Fabian; Lin, Wenwei; Zhao, Wei; Cao, Jianshu; Dziergowska, Agnieszka; Malkiewicz, Andrzej; Begley, Thomas J.; Alonso, Sylvie; Dedon, Peter C.

2016-11-01

Microbial pathogens adapt to the stress of infection by regulating transcription, translation and protein modification. We report that changes in gene expression in hypoxia-induced non-replicating persistence in mycobacteria--which models tuberculous granulomas--are partly determined by a mechanism of tRNA reprogramming and codon-biased translation. Mycobacterium bovis BCG responded to each stage of hypoxia and aerobic resuscitation by uniquely reprogramming 40 modified ribonucleosides in tRNA, which correlate with selective translation of mRNAs from families of codon-biased persistence genes. For example, early hypoxia increases wobble cmo5U in tRNAThr(UGU), which parallels translation of transcripts enriched in its cognate codon, ACG, including the DosR master regulator of hypoxic bacteriostasis. Codon re-engineering of dosR exaggerates hypoxia-induced changes in codon-biased DosR translation, with altered dosR expression revealing unanticipated effects on bacterial survival during hypoxia. These results reveal a coordinated system of tRNA modifications and translation of codon-biased transcripts that enhance expression of stress response proteins in mycobacteria.
tRNA-mediated codon-biased translation in mycobacterial hypoxic persistence

PubMed Central

Chionh, Yok Hian; McBee, Megan; Babu, I. Ramesh; Hia, Fabian; Lin, Wenwei; Zhao, Wei; Cao, Jianshu; Dziergowska, Agnieszka; Malkiewicz, Andrzej; Begley, Thomas J.; Alonso, Sylvie; Dedon, Peter C.

2016-01-01

Microbial pathogens adapt to the stress of infection by regulating transcription, translation and protein modification. We report that changes in gene expression in hypoxia-induced non-replicating persistence in mycobacteria—which models tuberculous granulomas—are partly determined by a mechanism of tRNA reprogramming and codon-biased translation. Mycobacterium bovis BCG responded to each stage of hypoxia and aerobic resuscitation by uniquely reprogramming 40 modified ribonucleosides in tRNA, which correlate with selective translation of mRNAs from families of codon-biased persistence genes. For example, early hypoxia increases wobble cmo5U in tRNAThr(UGU), which parallels translation of transcripts enriched in its cognate codon, ACG, including the DosR master regulator of hypoxic bacteriostasis. Codon re-engineering of dosR exaggerates hypoxia-induced changes in codon-biased DosR translation, with altered dosR expression revealing unanticipated effects on bacterial survival during hypoxia. These results reveal a coordinated system of tRNA modifications and translation of codon-biased transcripts that enhance expression of stress response proteins in mycobacteria. PMID:27834374
Synonymous codon changes in the oncogenes of the cottontail rabbit papillomavirus lead to increased oncogenicity and immunogenicity of the virus

PubMed Central

Cladel, Nancy M.; Budgeon, Lynn R.; Hu, Jiafen; Balogh, Karla K.; Christensen, Neil D.

2013-01-01

Papillomaviruses use rare codons with respect to the host. The reasons for this are incompletely understood but among the hypotheses is the concept that rare codons result in low protein production and this allows the virus to escape immune surveillance. We changed rare codons in the oncogenes E6 and E7 of the cottontail rabbit papillomavirus to make them more mammalian-like and tested the mutant genomes in our in vivo animal model. While the amino acid sequences of the proteins remained unchanged, the oncogenic potential of some of the altered genomes increased dramatically. In addition, increased immunogenicity, as measured by spontaneous regression, was observed as the numbers of codon changes increased. This work suggests that codon usage may modify protein production in ways that influence disease outcome and that evaluation of synonymous codons should be included in the analysis of genetic variants of infectious agents and their association with disease. PMID:23433866

A novel nuclear genetic code alteration in yeasts and the evolution of codon reassignment in eukaryotes.

PubMed

Mühlhausen, Stefanie; Findeisen, Peggy; Plessmann, Uwe; Urlaub, Henning; Kollmar, Martin

2016-07-01

The genetic code is the cellular translation table for the conversion of nucleotide sequences into amino acid sequences. Changes to the meaning of sense codons would introduce errors into almost every translated message and are expected to be highly detrimental. However, reassignment of single or multiple codons in mitochondria and nuclear genomes, although extremely rare, demonstrates that the code can evolve. Several models for the mechanism of alteration of nuclear genetic codes have been proposed (including "codon capture," "genome streamlining," and "ambiguous intermediate" theories), but with little resolution. Here, we report a novel sense codon reassignment in Pachysolen tannophilus, a yeast related to the Pichiaceae. By generating proteomics data and using tRNA sequence comparisons, we show that Pachysolen translates CUG codons as alanine and not as the more usual leucine. The Pachysolen tRNACAG is an anticodon-mutated tRNA(Ala) containing all major alanine tRNA recognition sites. The polyphyly of the CUG-decoding tRNAs in yeasts is best explained by a tRNA loss driven codon reassignment mechanism. Loss of the CUG-tRNA in the ancient yeast is followed by gradual decrease of respective codons and subsequent codon capture by tRNAs whose anticodon is not part of the aminoacyl-tRNA synthetase recognition region. Our hypothesis applies to all nuclear genetic code alterations and provides several testable predictions. We anticipate more codon reassignments to be uncovered in existing and upcoming genome projects. © 2016 Mühlhausen et al.; Published by Cold Spring Harbor Laboratory Press.
A Major Controversy in Codon-Anticodon Adaptation Resolved by a New Codon Usage Index

PubMed Central

Xia, Xuhua

2015-01-01

Two alternative hypotheses attribute different benefits to codon-anticodon adaptation. The first assumes that protein production is rate limited by both initiation and elongation and that codon-anticodon adaptation would result in higher elongation efficiency and more efficient and accurate protein production, especially for highly expressed genes. The second claims that protein production is rate limited only by initiation efficiency but that improved codon adaptation and, consequently, increased elongation efficiency have the benefit of increasing ribosomal availability for global translation. To test these hypotheses, a recent study engineered a synthetic library of 154 genes, all encoding the same protein but differing in degrees of codon adaptation, to quantify the effect of differential codon adaptation on protein production in Escherichia coli. The surprising conclusion that “codon bias did not correlate with gene expression” and that “translation initiation, not elongation, is rate-limiting for gene expression” contradicts the conclusion reached by many other empirical studies. In this paper, I resolve the contradiction by reanalyzing the data from the 154 sequences. I demonstrate that translation elongation accounts for about 17% of total variation in protein production and that the previous conclusion is due to the use of a codon adaptation index (CAI) that does not account for the mutation bias in characterizing codon adaptation. The effect of translation elongation becomes undetectable only when translation initiation is unrealistically slow. A new index of translation elongation ITE is formulated to facilitate studies on the efficiency and evolution of the translation machinery. PMID:25480780
Exploring synonymous codon usage preferences of disulfide-bonded and non-disulfide bonded cysteines in the E. coli genome.

PubMed

Song, Jiangning; Wang, Minglei; Burrage, Kevin

2006-07-21

High-quality data about protein structures and their gene sequences are essential to the understanding of the relationship between protein folding and protein coding sequences. Firstly we constructed the EcoPDB database, which is a high-quality database of Escherichia coli genes and their corresponding PDB structures. Based on EcoPDB, we presented a novel approach based on information theory to investigate the correlation between cysteine synonymous codon usages and local amino acids flanking cysteines, the correlation between cysteine synonymous codon usages and synonymous codon usages of local amino acids flanking cysteines, as well as the correlation between cysteine synonymous codon usages and the disulfide bonding states of cysteines in the E. coli genome. The results indicate that the nearest neighboring residues and their synonymous codons of the C-terminus have the greatest influence on the usages of the synonymous codons of cysteines and the usage of the synonymous codons has a specific correlation with the disulfide bond formation of cysteines in proteins. The correlations may result from the regulation mechanism of protein structures at gene sequence level and reflect the biological function restriction that cysteines pair to form disulfide bonds. The results may also be helpful in identifying residues that are important for synonymous codon selection of cysteines to introduce disulfide bridges in protein engineering and molecular biology. The approach presented in this paper can also be utilized as a complementary computational method and be applicable to analyse the synonymous codon usages in other model organisms.
Codon optimization underpins generalist parasitism in fungi

PubMed Central

Badet, Thomas; Peyraud, Remi; Mbengue, Malick; Navaud, Olivier; Derbyshire, Mark; Oliver, Richard P; Barbacci, Adelin; Raffaele, Sylvain

2017-01-01

The range of hosts that parasites can infect is a key determinant of the emergence and spread of disease. Yet, the impact of host range variation on the evolution of parasite genomes remains unknown. Here, we show that codon optimization underlies genome adaptation in broad host range parasites. We found that the longer proteins encoded by broad host range fungi likely increase natural selection on codon optimization in these species. Accordingly, codon optimization correlates with host range across the fungal kingdom. At the species level, biased patterns of synonymous substitutions underpin increased codon optimization in a generalist but not a specialist fungal pathogen. Virulence genes were consistently enriched in highly codon-optimized genes of generalist but not specialist species. We conclude that codon optimization is related to the capacity of parasites to colonize multiple hosts. Our results link genome evolution and translational regulation to the long-term persistence of generalist parasitism. DOI: http://dx.doi.org/10.7554/eLife.22472.001 PMID:28157073
Developmental stage related patterns of codon usage and genomic GC content: searching for evolutionary fingerprints with models of stem cell differentiation

PubMed Central

2007-01-01

Background The usage of synonymous codons shows considerable variation among mammalian genes. How and why this usage is non-random are fundamental biological questions and remain controversial. It is also important to explore whether mammalian genes that are selectively expressed at different developmental stages bear different molecular features. Results In two models of mouse stem cell differentiation, we established correlations between codon usage and the patterns of gene expression. We found that the optimal codons exhibited variation (AT- or GC-ending codons) in different cell types within the developmental hierarchy. We also found that genes that were enriched (developmental-pivotal genes) or specifically expressed (developmental-specific genes) at different developmental stages had different patterns of codon usage and local genomic GC (GCg) content. Moreover, at the same developmental stage, developmental-specific genes generally used more GC-ending codons and had higher GCg content compared with developmental-pivotal genes. Further analyses suggest that the model of translational selection might be consistent with the developmental stage-related patterns of codon usage, especially for the AT-ending optimal codons. In addition, our data show that after human-mouse divergence, the influence of selective constraints is still detectable. Conclusion Our findings suggest that developmental stage-related patterns of gene expression are correlated with codon usage (GC3) and GCg content in stem cell hierarchies. Moreover, this paper provides evidence for the influence of natural selection at synonymous sites in the mouse genome and novel clues for linking the molecular features of genes to their patterns of expression during mammalian ontogenesis. PMID:17349061
DOE Office of Scientific and Technical Information (OSTI.GOV)

Ohshima, Kazusato, E-mail: ohshimak@cc.saga-u.ac.jp; The United Graduate School of Agricultural Sciences, Kagoshima University, Kagoshima; Matsumoto, Kosuke

Cucumber mosaic virus (CMV) is a damaging pathogen of over 200 mono- and dicotyledonous crop species worldwide. It has the broadest known host range of any virus, but the timescale of its evolution is unknown. To investigate the evolutionary history of this virus, we obtained the genomic sequences of 40 CMV isolates from brassicas sampled in Iran, Turkey and Japan, and combined them with published sequences. Our synonymous ('silent') site analyses revealed that the present CMV population is the progeny of a single ancestor existing 1550–2600 years ago, but that the population mostly radiated 295–545 years ago. We found thatmore » the major CMV lineages are not phylogeographically confined, but that recombination and reassortment is restricted to local populations and that no reassortant lineage is more than 251 years old. Our results highlight the different evolutionary patterns seen among viral pathogens of brassica crops across the world. - Highlights: • Present-day CMV lineages had a most recent common ancestor 1550–2600 years ago. • The CMV population mostly radiated less than 295–545 years ago. • No reassortant found in the present populations is more than 251 years old. • The open-reading frames evolve at around 2.3–4.7×10{sup −4} substitutions/site/year. • Synonymous codons of CMV seem to have a more precise temporal signal than all codons.« less
tRNA1Ser(G34) with the anticodon GGA can recognize not only UCC and UCU codons but also UCA and UCG codons.

PubMed

Yamada, Yuko; Matsugi, Jitsuhiro; Ishikura, Hisayuki

2003-04-15

The tRNA1Ser (anticodon VGA, V=uridin-5-oxyacetic acid) is essential for translation of the UCA codon in Escherichia coli. Here, we studied the translational abilities of serine tRNA derivatives, which have different bases from wild type at the first positions of their anticodons, using synthetic mRNAs containing the UCN (N=A, G, C, or U) codon. The tRNA1Ser(G34) having the anticodon GGA was able to read not only UCC and UCU codons but also UCA and UCG codons. This means that the formation of G-A or G-G pair allowed at the wobble position and these base pairs are noncanonical. The translational efficiency of the tRNA1Ser(G34) for UCA or UCG codon depends on the 2'-O-methylation of the C32 (Cm). The 2'-O-methylation of C32 may give rise to the space necessary for G-A or G-G base pair formation between the first position of anticodon and the third position of codon.
Characterization of codon usage pattern and influencing factors in Japanese encephalitis virus.

PubMed

Singh, Niraj K; Tyagi, Anuj; Kaur, Rajinder; Verma, Ramneek; Gupta, Praveen K

2016-08-02

Recently, several outbreaks of Japanese encephalitis (JE), caused by Japanese encephalitis virus (JEV), have been reported and it has become cause of concern across the world. In this study, detailed analysis of JEV codon usage pattern was performed. The relative synonymous codon usage (RSCU) values along with mean effective number of codons (ENC) value of 55.30 indicated the presence of low codon usages bias in JEV. The effect of mutational pressure on codon usage bias was confirmed by significant correlations of A3s, U3s, G3s, C3s, GC3s, ENC values, with overall nucleotide contents (A%, U%, G%, C%, and GC%). The correlation analysis of A3s, U3s, G3s, C3s, GC3s, with axis values of correspondence analysis (CoA) further confirmed the role of mutational pressure. However, the correlation analysis of Gravy values and Aroma values with A3s, U3s, G3s, C3s, and GC3s, indicated the presence of natural selection on codon usage bias in addition to mutational pressure. The natural selection was further confirmed by codon adaptation index (CAI) analysis. Additionally, relative dinucleotide frequencies, geographical distribution, and evolutionary processes also influenced the codon usage pattern to some extent. Copyright © 2016 Elsevier B.V. All rights reserved.
Analysis of polyglutamine-coding repeats in the TATA-binding protein in different human populations and in patients with schizophrenia an bipolar affective disorder

DOE Office of Scientific and Technical Information (OSTI.GOV)

Rubinsztein, D.C.; Leggo, J.; Crow, T.J.

A new class of disease (including Huntington disease, Kennedy disease, and spinocerebellar ataxias types 1 and 3) results from abnormal expansions of CAG trinucleotides in the coding regions of genes. In all of these diseases the CAG repeats are thought to be translated into polyglutamine tracts. There is accumulating evidence arguing for CAG trinucleotide expansions as one of the causative disease mutations in schizophrenia and bipolar affective disorder. We and others believe that the TATA-binding protein (TBP) is an important candidate to investigate in these diseases as it contains a highly polymorphic stretch of glutamine codons, which are close tomore » the threshold length where the polyglutamine tracts start to be associated with disease. Thus, we examined the lengths of this polyglutamine repeat in normal unrelated East Anglians, South African Blacks, sub-Saharan Africans mainly from Nigeria, and Asian Indians. We also examined 43 bipolar affective disorder patients and 65 schizophrenic patients. The range of polyglutamine tract-lengths that we found in humans was from 26-42 codons. No patients with bipolar affective disorder and schizophrenia had abnormal expansions at this locus. 22 refs., 1 tab.« less
The expression of full length Gp91-phox protein is associated with reduced amphotropic retroviral production.

PubMed

Bellantuono, I; Lashford, L S; Rafferty, J A; Fairbairn, L J

2000-05-01

As a single gene defect in mature bone marrow cells, chronic granulomatous disease (X-CGD) represents a disorder which may be amenable to gene therapy by the transfer of the missing subunit into hemopoietic stem cells. In the majority of cases lack of Gp91-phox causes the disease. So far, studies involving transfer of Gp91-phox cDNA, including a phase I clinical trial, have yielded disappointing results. Most often, low titers of virus have been reported. In the present study we investigated the possible reasons for low titer amphotropic viral production. To investigate the effect of Gp91 cDNA on the efficiency of retroviral production from the packaging cell line, GP+envAm12, we constructed vectors containing either the native cDNA, truncated versions of the cDNA or a mutated form (LATG) in which the natural translational start codon was changed to a stop codon. Following derivation of clonal packaging cell lines, these were assessed for viral titer by RNA slot blot and analyzed by non-parametrical statistical analysis (Whitney-Mann U-test). An improvement in viral titer of just over two-fold was found in packaging cells containing the start-codon mutant of Gp91 and no evidence of truncated viral RNA was seen in these cells. Further analysis revealed the presence of rearranged forms of the provirus in Gp91-expressing cells, and the production of truncated, unpackaged viral RNA. Protein analysis revealed that LATG-transduced cells did not express full-length Gp91-phox, whereas those containing the wild-type cDNA did. However, a truncated protein was seen in ATG-transduced cells which was also present in wild type cells. No evidence for the presence of a negative transcriptional regulatory element was found from studies with the deletion mutants. A statistically significant effect of protein production on the production of virus from Gp91-expressing cells was found. Our data point to a need to restrict expression of the Gp91-phox protein and its derivatives in order to enhance retroviral production and suggest that improvements in current vectors for CGD gene therapy may need to include controlled, directed expression only in mature neutrophils.
Complete Mitochondrial Genome of the Red Fox (Vuples vuples) and Phylogenetic Analysis with Other Canid Species.

PubMed

Zhong, Hua-Ming; Zhang, Hong-Hai; Sha, Wei-Lai; Zhang, Cheng-De; Chen, Yu-Cai

2010-04-01

The whole mitochondrial genome sequence of red fox (Vuples vuples) was determined. It had a total length of 16 723 bp. As in most mammal mitochondrial genome, it contained 13 protein coding genes, two ribosome RNA genes, 22 transfer RNA genes and one control region. The base composition was 31.3% A, 26.1% C, 14.8% G and 27.8% T, respectively. The codon usage of red fox, arctic fox, gray wolf, domestic dog and coyote followed the same pattern except for an unusual ATT start codon, which initiates the NADH dehydrogenase subunit 3 gene in the red fox. A long tandem repeat rich in AC was found between conserved sequence block 1 and 2 in the control region. In order to confirm the phylogenetic relationships of red fox to other canids, phylogenetic trees were reconstructed by neighbor-joining and maximum parsimony methods using 12 concatenated heavy-strand protein-coding genes. The result indicated that arctic fox was the sister group of red fox and they both belong to the red fox-like clade in family Canidae, while gray wolf, domestic dog and coyote belong to wolf-like clade. The result was in accordance with existing phylogenetic results.
Stringent Nucleotide Recognition by the Ribosome at the Middle Codon Position

PubMed Central

Liu, Wei; Shin, Dongwon; Ng, Martin; Sanbonmatsu, Karissa Y.; Tor, Yitzhak; Cooperman, Barry S.

2017-01-01

Accurate translation of the genetic code depends on mRNA:tRNA codon:anticodon base pairing. Here we exploit an emissive, isosteric adenosine surrogate that allows direct measurement of the kinetics of codon:anticodon base formation during protein synthesis. Our results suggest that codon:anticodon base pairing is subject to tighter constraints at the middle position than at the 5′- and 3′-positions, and further suggest a sequential mechanism of formation of the three base pairs in the codon:anticodon helix. PMID:28850078
A single U/C nucleotide substitution changing alanine to valine in the beet necrotic yellow vein virus P25 protein promotes increased virus accumulation in roots of mechanically inoculated, partially resistant sugar beet seedlings.

PubMed

Koenig, R; Loss, S; Specht, J; Varrelmann, M; Lüddecke, P; Deml, G

2009-03-01

Beet necrotic yellow vein virus (BNYVV) A type isolates E12 and S8, originating from areas where resistance-breaking had or had not been observed, respectively, served as starting material for studying the influence of sequence variations in BNYVV RNA 3 on virus accumulation in partially resistant sugar beet varieties. Sub-isolates containing only RNAs 1 and 2 were obtained by serial local lesion passages; biologically active cDNA clones were prepared for RNAs 3 which differed in their coding sequences for P25 aa 67, 68 and 129. Sugar beet seedlings were mechanically inoculated with RNA 1+2/RNA 3 pseudorecombinants. The origin of RNAs 1+2 had little influence on virus accumulation in rootlets. E12 RNA 3 coding for V(67)C(68)Y(129) P25, however, enabled a much higher virus accumulation than S8 RNA 3 coding for A(67)H(68)H(129) P25. Mutants revealed that this was due only to the V(67) 'GUU' codon as opposed to the A(67) 'GCU' codon.
Transcription of the cottontail rabbit papillomavirus early region and identification of two E6 polypeptides in COS-7 cells.

PubMed Central

Barbosa, M S; Wettstein, F O

1987-01-01

Cottontail rabbit papillomavirus (CRPV) early proteins are present at very low levels in virus-induced tumors and cannot be detected by immunological methods. Furthermore, cells in culture are not readily transformed by the virus. To overcome these difficulties in identifying and characterizing the putative transforming protein(s) coded by the E6 open reading frame, the early cottontail rabbit papillomavirus region was expressed under the control of the late simian virus 40 promoter. Mapping of the transcripts in transiently transfected COS-7 cells indicated that transcription was initiated in the late region of simian virus 40. Two E6-coded polypeptides were identified, representing translation products initiated at the first and second AUG codons. Images PMID:3039182
Complete mitochondrial genome of Taharana fasciana (Insecta, Hemiptera: Cicadellidae) and comparison with other Cicadellidae insects.

PubMed

Wang, Jiajia; Li, Hu; Dai, Renhuai

2017-12-01

Here, we describe the first complete mitochondrial genome (mitogenome) sequence of the leafhopper Taharana fasciana (Coelidiinae). The mitogenome sequence contains 15,161 bp with an A + T content of 77.9%. It includes 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes, and one non-coding (A + T-rich) region; in addition, a repeat region is also present (GenBank accession no. KY886913). These genes/regions are in the same order as in the inferred insect ancestral mitogenome. All protein-coding genes have ATN as the start codon, and TAA or single T as the stop codons, except the gene ND3, which ends with TAG. Furthermore, we predicted the secondary structures of the rRNAs in T. fasciana. Six domains (domain III is absent in arthropods) and 41 helices were predicted for 16S rRNA, and 12S rRNA comprised three structural domains and 24 helices. Phylogenetic tree analysis confirmed that T. fasciana and other members of the Cicadellidae are clustered into a clade, and it identified the relationships among the subfamilies Deltocephalinae, Coelidiinae, Idiocerinae, Cicadellinae, and Typhlocybinae.
Preferences of AAA/AAG codon recognition by modified nucleosides, τm5s2U34 and t6A37 present in tRNALys.

PubMed

Sonawane, Kailas D; Kamble, Asmita S; Fandilolu, Prayagraj M

2017-12-27

Deficiency of 5-taurinomethyl-2-thiouridine, τm 5 s 2 U at the 34th 'wobble' position in tRNA Lys causes MERRF (Myoclonic Epilepsy with Ragged Red Fibers), a neuromuscular disease. This modified nucleoside of mt tRNA Lys , recognizes AAA/AAG codons during protein biosynthesis process. Its preference to identify cognate codons has not been studied at the atomic level. Hence, multiple MD simulations of various molecular models of anticodon stem loop (ASL) of mt tRNA Lys in presence and absence of τm 5 s 2 U 34 and N 6 -threonylcarbamoyl adenosine (t 6 A 37 ) along with AAA and AAG codons have been accomplished. Additional four MD simulations of multiple ASL mt tRNA Lys models in the context of ribosomal A-site residues have also been performed to investigate the role of A-site in recognition of AAA/AAG codons. MD simulation results show that, ASL models in presence of τm 5 s 2 U 34 and t 6 A 37 with codons AAA/AAG are more stable than the ASL lacking these modified bases. MD trajectories suggest that τm 5 s 2 U recognizes the codons initially by 'wobble' hydrogen bonding interactions, and then tRNA Lys might leave the explicit codon by a novel 'single' hydrogen bonding interaction in order to run the protein biosynthesis process smoothly. We propose this model as the 'Foot-Step Model' for codon recognition, in which the single hydrogen bond plays a crucial role. MD simulation results suggest that, tRNA Lys with τm 5 s 2 U and t 6 A recognizes AAA codon more preferably than AAG. Thus, these results reveal the consequences of τm 5 s 2 U and t 6 A in recognition of AAA/AAG codons in mitochondrial disease, MERRF.
Compositional pressure and translational selection determine codon usage in the extremely GC-poor unicellular eukaryote Entamoeba histolytica.

PubMed

Romero, H; Zavala, A; Musto, H

2000-01-25

It is widely accepted that the compositional pressure is the only factor shaping codon usage in unicellular species displaying extremely biased genomic compositions. This seems to be the case in the prokaryotes Mycoplasma capricolum, Rickettsia prowasekii and Borrelia burgdorferi (GC-poor), and in Micrococcus luteus (GC-rich). However, in the GC-poor unicellular eukaryotes Dictyostelium discoideum and Plasmodium falciparum, there is evidence that selection, acting at the level of translation, influences codon choices. This is a twofold intriguing finding, since (1) the genomic GC levels of the above mentioned eukaryotes are lower than the GC% of any studied bacteria, and (2) bacteria usually have larger effective population sizes than eukaryotes, and hence natural selection is expected to overcome more efficiently the randomizing effects of genetic drift among prokaryotes than among eukaryotes. In order to gain a new insight about this problem, we analysed the patterns of codon preferences of the nuclear genes of Entamoeba histolytica, a unicellular eukaryote characterised by an extremely AT-rich genome (GC = 25%). The overall codon usage is strongly biased towards A and T in the third codon positions, and among the presumed highly expressed sequences, there is an increased relative usage of a subset of codons, many of which are C-ending. Since an increase in C in third codon positions is 'against' the compositional bias, we conclude that codon usage in E. histolytica, as happens in D. discoideum and P. falciparum, is the result of an equilibrium between compositional pressure and selection. These findings raise the question of why strongly compositionally biased eukaryotic cells may be more sensitive to the (presumed) slight differences among synonymous codons than compositionally biased bacteria.
The cytochrome oxidase subunit I and subunit III genes in Oenothera mitochondria are transcribed from identical promoter sequences

PubMed Central

Hiesel, Rudolf; Schobel, Werner; Schuster, Wolfgang; Brennicke, Axel

1987-01-01

Two loci encoding subunit III of the cytochrome oxidase (COX) in Oenothera mitochondria have been identified from a cDNA library of mitochondrial transcripts. A 657-bp sequence block upstream from the open reading frame is also present in the two copies of the COX subunit I gene and is presumably involved in homologous sequence rearrangement. The proximal points of sequence rearrangements are located 3 bp upstream from the COX I and 1139 bp upstream from the COX III initiation codons. The 5'-termini of both COX I and COX III mRNAs have been mapped in this common sequence confining the promoter region for the Oenothera mitochondrial COX I and COX III genes to the homologous sequence block. ImagesFig. 5. PMID:15981332
Novel insertion in exon 5 of the TCOF1 gene in twin sisters with Treacher Collins syndrome.

PubMed

Marszałek-Kruk, Bożena Anna; Wójcicki, Piotr; Smigiel, Robert; Trzeciak, Wiesław H

2012-08-01

Treacher Collins syndrome (TCS) is associated with an abnormal differentiation of the first and second pharyngeal arches during fetal development. This causes mostly craniofacial deformities, which require numerous corrective surgeries. TCS is an autosomal dominant disorder and it occurs in the general population at a frequency of 1 in 50,000 live births. The syndrome is caused by mutations in the TCOF1 gene, which encodes the serine/alanine-rich protein named Treacle. Over 120 mutations of the TCOF1 gene responsible for TCS have been described. About 70% of recognized mutations are deletions, which lead to a frame shift, formation of a termination codon, and shortening of the protein product of the gene. Herewith, a new heterozygotic insertion, c.484_668ins185bp, was described in two monozygotic twin sisters suffering from TCS. This mutation was absent in their father, brother, and uncle, indicating a de novo origin. The insertion causes a shift in the reading frame and premature termination of translation at 167 aa. The novel insertion is the longest ever found in the TCOF1 gene and the only one found among monozygotic twin sisters.
Numeral series hidden in the distribution of atomic mass of amino acids to codon domains in the genetic code.

PubMed

Wohlin, Åsa

2015-03-21

The distribution of codons in the nearly universal genetic code is a long discussed issue. At the atomic level, the numeral series 2x(2) (x=5-0) lies behind electron shells and orbitals. Numeral series appear in formulas for spectral lines of hydrogen. The question here was if some similar scheme could be found in the genetic code. A table of 24 codons was constructed (synonyms counted as one) for 20 amino acids, four of which have two different codons. An atomic mass analysis was performed, built on common isotopes. It was found that a numeral series 5 to 0 with exponent 2/3 times 10(2) revealed detailed congruency with codon-grouped amino acid side-chains, simultaneously with the division on atom kinds, further with main 3rd base groups, backbone chains and with codon-grouped amino acids in relation to their origin from glycolysis or the citrate cycle. Hence, it is proposed that this series in a dynamic way may have guided the selection of amino acids into codon domains. Series with simpler exponents also showed noteworthy correlations with the atomic mass distribution on main codon domains; especially the 2x(2)-series times a factor 16 appeared as a conceivable underlying level, both for the atomic mass and charge distribution. Furthermore, it was found that atomic mass transformations between numeral systems, possibly interpretable as dimension degree steps, connected the atomic mass of codon bases with codon-grouped amino acids and with the exponent 2/3-series in several astonishing ways. Thus, it is suggested that they may be part of a deeper reference system. Copyright © 2015 The Author. Published by Elsevier Ltd.. All rights reserved.

Species Based Synonymous Codon Usage in Fusion Protein Gene of Newcastle Disease Virus

PubMed Central

Kumar, Chandra Shekhar; Kumar, Sachin

2014-01-01

Newcastle disease is highly pathogenic to poultry and many other avian species. However, the Newcastle disease virus (NDV) has also been reported from many non-avian species. The NDV fusion protein (F) is a major determinant of its pathogenicity and virulence. The functionalities of F gene have been explored for the development of vaccine and diagnostics against NDV. Although the F protein is well studied but the codon usage and its nucleotide composition from NDV isolated from different species have not yet been explored. In present study, we have analyzed the factors responsible for the determination of codon usage in NDV isolated from four major avian host species. The F gene of NDV is analyzed for its base composition and its correlation with the bias in codon usage. Our result showed that random mutational pressure is responsible for codon usage bias in F protein of NDV isolates. Aromaticity, GC3s, and aliphatic index were not found responsible for species based synonymous codon usage bias in F gene of NDV. Moreover, the low amount of codon usage bias and expression level was further confirmed by a low CAI value. The phylogenetic analysis of isolates was found in corroboration with the relatedness of species based on codon usage bias. The relationship between the host species and the NDV isolates from the host does not represent a significant correlation in our study. The present study provides a basic understanding of the mechanism involved in codon usage among species. PMID:25479071
Codon usage bias and tRNA over-expression in Buchnera aphidicola after aromatic amino acid nutritional stress on its host Acyrthosiphon pisum

PubMed Central

Charles, Hubert; Calevro, Federica; Vinuelas, José; Fayard, Jean-Michel; Rahbe, Yvan

2006-01-01

Codon usage bias and relative abundances of tRNA isoacceptors were analysed in the obligate intracellular symbiotic bacterium, Buchnera aphidicola from the aphid Acyrthosiphon pisum, using a dedicated 35mer oligonucleotide microarray. Buchnera is archetypal of organisms living with minimal metabolic requirements and presents a reduced genome with high-evolutionary rate. Codonusage in Buchnera has been overcome by the high mutational bias towards AT bases. However, several lines of evidence for codon usage selection are given here. A significant correlation was found between tRNA relative abundances and codon composition of Buchnera genes. A significant codon usage bias was found for the choice of rare codons in Buchnera: C-ending codons are preferred in highly expressed genes, whereas G-ending codons are avoided. This bias is not explained by GC skew in the bacteria and might correspond to a selection for perfect matching between codon–anticodon pairs for some essential amino acids in Buchnera proteins. Nutritional stress applied to the aphid host induced a significant overexpression of most of the tRNA isoacceptors in bacteria. Although, molecular regulation of the tRNA operons in Buchnera was not investigated, a correlation between relative expression levels and organization in transcription unit was found in the genome of Buchnera. PMID:16963497
The mitochondrial genome of Cethosia biblis (Drury) (Lepidoptera: Nymphalidae).

PubMed

Xin, Tianrong; Li, Lei; Yao, Chengyi; Wang, Yayu; Zou, Zhiwen; Wang, Jing; Xia, Bin

2016-07-01

We present the complete mitogenome of Cethosia biblis (Drury) (Lepidoptera: Nymphalidae) in this article. The mitogenome was a circle molecular consisting of 15,286 nucleotides, 37 genes, and an A + T-rich region. The order of 37 genes was typical of insect mitochondrial DNA sequences described to date. The overall base composition of the genome is A (37.41%), T (42.80%), C (11.87%), and G (7.91%) with an A + T-rich hallmark as that of other invertebrate mitochondrial genomes. The start codon was mainly ATA in most of the mitochondrial protein-coding genes such as ND2, COI, ATP8, ND3, ND5, ND4, ND6, and ND1, but COII, ATP6, COIII, ND4L, and Cob genes employing ATG. The stop codon was TAA in all the protein-coding genes. The A + T region is located between 12S rRNA and tRNA(M)(et). The phylogenetic relationships of Lepidoptera species were constructed based on the nucleotides sequences of 13 PCGs of mitogenomes using the neighbor-joining method. The molecular-based phylogeny supported the traditional morphological classification on relationships within Lepidoptera species.
Charges and Fields in a Current-Carrying Wire

ERIC Educational Resources Information Center

Redzic, Dragan V.

2012-01-01

Charges and fields in a straight, infinite, cylindrical wire carrying a steady current are determined in the rest frames of ions and electrons, starting from the standard assumption that the net charge per unit length is zero in the lattice frame and taking into account a self-induced pinch effect. The analysis presented illustrates the mutual…
Gene expression regulation by upstream open reading frames and human disease.

PubMed

Barbosa, Cristina; Peixeiro, Isabel; Romão, Luísa

2013-01-01

Upstream open reading frames (uORFs) are major gene expression regulatory elements. In many eukaryotic mRNAs, one or more uORFs precede the initiation codon of the main coding region. Indeed, several studies have revealed that almost half of human transcripts present uORFs. Very interesting examples have shown that these uORFs can impact gene expression of the downstream main ORF by triggering mRNA decay or by regulating translation. Also, evidence from recent genetic and bioinformatic studies implicates disturbed uORF-mediated translational control in the etiology of many human diseases, including malignancies, metabolic or neurologic disorders, and inherited syndromes. In this review, we will briefly present the mechanisms through which uORFs regulate gene expression and how they can impact on the organism's response to different cell stress conditions. Then, we will emphasize the importance of these structures by illustrating, with specific examples, how disturbed uORF-mediated translational control can be involved in the etiology of human diseases, giving special importance to genotype-phenotype correlations. Identifying and studying more cases of uORF-altering mutations will help us to understand and establish genotype-phenotype associations, leading to advancements in diagnosis, prognosis, and treatment of many human disorders.
Deletion of a Single-Copy Trna Affects Microtubule Function in Saccharomyces Cerevisiae

PubMed Central

Reijo, R. A.; Cho, D. S.; Huffaker, T. C.

1993-01-01

rts1-1 was identified as an extragenic suppressor of tub2-104, a cold-sensitive allele of the sole gene encoding β-tubulin in the yeast, Saccharomyces cerevisiae. In addition, rts1-1 cells are heat sensitive and resistant to the microtubule-destabilizing drug, benomyl. The rts1-1 mutation is a deletion of approximately 5 kb of genomic DNA on chromosome X that includes one open reading frame and three tRNA genes. Dissection of this region shows that heat sensitivity is due to deletion of the open reading frame (HIT1). Suppression and benomyl resistance are caused by deletion of the gene encoding a tRNA(AGG)(Arg) (HSX1). Northern analysis of rts1-1 cells indicates that HSX1 is the only gene encoding this tRNA. Deletion of HSX1 does not suppress the tub2-104 mutation by misreading at the AGG codons in TUB2. It also does not suppress by interfering with the protein arginylation that targets certain proteins for degradation. These results leave open the prospect that this tRNA(AGG)(Arg) plays a novel role in the cell. PMID:8307335
Efficient mutagenesis by Cas9 protein-mediated oligonucleotide insertion and large-scale assessment of single-guide RNAs.

PubMed

Gagnon, James A; Valen, Eivind; Thyme, Summer B; Huang, Peng; Akhmetova, Laila; Ahkmetova, Laila; Pauli, Andrea; Montague, Tessa G; Zimmerman, Steven; Richter, Constance; Schier, Alexander F

2014-01-01

The CRISPR/Cas9 system has been implemented in a variety of model organisms to mediate site-directed mutagenesis. A wide range of mutation rates has been reported, but at a limited number of genomic target sites. To uncover the rules that govern effective Cas9-mediated mutagenesis in zebrafish, we targeted over a hundred genomic loci for mutagenesis using a streamlined and cloning-free method. We generated mutations in 85% of target genes with mutation rates varying across several orders of magnitude, and identified sequence composition rules that influence mutagenesis. We increased rates of mutagenesis by implementing several novel approaches. The activities of poor or unsuccessful single-guide RNAs (sgRNAs) initiating with a 5' adenine were improved by rescuing 5' end homogeneity of the sgRNA. In some cases, direct injection of Cas9 protein/sgRNA complex further increased mutagenic activity. We also observed that low diversity of mutant alleles led to repeated failure to obtain frame-shift mutations. This limitation was overcome by knock-in of a stop codon cassette that ensured coding frame truncation. Our improved methods and detailed protocols make Cas9-mediated mutagenesis an attractive approach for labs of all sizes.
Molecular analysis of beta-globin gene mutations among Thai beta-thalassemia children: results from a single center study

PubMed Central

Boonyawat, Boonchai; Monsereenusorn, Chalinee; Traivaree, Chanchai

2014-01-01

Background Beta-thalassemia is one of the most common genetic disorders in Thailand. Clinical phenotype ranges from silent carrier to clinically manifested conditions including severe beta-thalassemia major and mild beta-thalassemia intermedia. Objective This study aimed to characterize the spectrum of beta-globin gene mutations in pediatric patients who were followed-up in Phramongkutklao Hospital. Patients and methods Eighty unrelated beta-thalassemia patients were enrolled in this study including 57 with beta-thalassemia/hemoglobin E, eight with homozygous beta-thalassemia, and 15 with heterozygous beta-thalassemia. Mutation analysis was performed by multiplex amplification refractory mutation system (M-ARMS), direct DNA sequencing of beta-globin gene, and gap polymerase chain reaction for 3.4 kb deletion detection, respectively. Results A total of 13 different beta-thalassemia mutations were identified among 88 alleles. The most common mutation was codon 41/42 (-TCTT) (37.5%), followed by codon 17 (A>T) (26.1%), IVS-I-5 (G>C) (8%), IVS-II-654 (C>T) (6.8%), IVS-I-1 (G>T) (4.5%), and codon 71/72 (+A) (2.3%), and all these six common mutations (85.2%) were detected by M-ARMS. Six uncommon mutations (10.2%) were identified by DNA sequencing including 4.5% for codon 35 (C>A) and 1.1% initiation codon mutation (ATG>AGG), codon 15 (G>A), codon 19 (A>G), codon 27/28 (+C), and codon 123/124/125 (-ACCCCACC), respectively. The 3.4 kb deletion was detected at 4.5%. The most common genotype of beta-thalassemia major patients was codon 41/42 (-TCTT)/codon 26 (G>A) or betaE accounting for 40%. Conclusion All of the beta-thalassemia alleles have been characterized by a combination of techniques including M-ARMS, DNA sequencing, and gap polymerase chain reaction for 3.4 kb deletion detection. Thirteen mutations account for 100% of the beta-thalassemia genes among the pediatric patients in our study. PMID:25525381
Evolutionary characterization of Tembusu virus infection through identification of codon usage patterns.

PubMed

Zhou, Hao; Yan, Bing; Chen, Shun; Wang, Mingshu; Jia, Renyong; Cheng, Anchun

2015-10-01

Tembusu virus (TMUV) is a single-stranded, positive-sense RNA virus. As reported, TMUV infection has resulted in significant poultry losses, and the virus may also pose a threat to public health. To characterize TMUV evolutionarily and to understand the factors accounting for codon usage properties, we performed, for the first time, a comprehensive analysis of codon usage bias for the genomes of 60 TMUV strains. The most recently published TMUV strains were found to be widely distributed in coastal cities of southeastern China. Codon preference among TMUV genomes exhibits a low bias (effective number of codons (ENC)=53.287) and is maintained at a stable level. ENC-GC3 plots and the high correlation between composition constraints and principal component factor analysis of codon usage demonstrated that mutation pressure dominates over natural selection pressure in shaping the TMUV coding sequence composition. The high correlation between the major components of the codon usage pattern and hydrophobicity (Gravy) or aromaticity (Aromo) was obvious, indicating that properties of viral proteins also account for the observed variation in TMUV codon usage. Principal component analysis (PCA) showed that CQW1 isolated from Chongqing may have evolved from GX2013H or GX2013G isolated from Guangxi, thus indicating that TMUV likely disseminated from southeastern China to the mainland. Moreover, the preferred codons encoding eight amino acids were consistent with the optimal codons for human cells, indicating that TMUV may pose a threat to public health due to possible cross-species transmission (birds to birds or birds to humans). The results of this study not only have theoretical value for uncovering the characteristics of synonymous codon usage patterns in TMUV genomes but also have significant meaning with regard to the molecular evolutionary tendencies of TMUV. Copyright © 2015 Elsevier B.V. All rights reserved.
Three stages during the evolution of the genetic code. [Abstract only

NASA Technical Reports Server (NTRS)

Baumann, U.; Oro, J.

1994-01-01

A diversification of the genetic code based on the number of codons available for the proteinous amino acids is established. Three groups of amino acids during evolution of the code are distinguished. On the basis of their chemical complexity and a small codon number those amino acids emerging later in a translation process are derived. Both criteria indicate that His, Phe, Tyr, Cys and either Lys or Asn were introduced in the second stage, whereas the number of codons alone gives evidence that Trp and Met were introduced in the third stage. The amino acids of stage one use purines rich codons, thus purines have been retained in their third codon position. All the amino acids introduced in the second stage, in contrast, use pyrimidines in this codon position. A low abundance of pyrimidines during early translation is derived. This assumption is supported by experiments on non enzymatic replication and interactions of DNA hairpin loops with a complementary strand. A back extrapolation concludes a high purine content of the first nucleic acids which gradually decreased during their evolution. Amino acids independently available form prebiotic synthesis were thus correlated to purine rich codons. Conclusions on prebiotic replication are discussed also in the light of recent codon usage data.
endAFS, a novel family E endoglucanase gene from Fibrobacter succinogenes AR1.

PubMed Central

Cavicchioli, R; East, P D; Watson, K

1991-01-01

The complete nucleotide sequence of endAFS, an endoglucanase gene isolated from the ruminal anaerobe Fibrobacter succinogenes AR1, was determined. endAFS encodes two overlapping open reading frames (ORF1 and ORF2), and it was proposed that a -1 ribosomal frameshift was required to allow contiguous synthesis of a 453-amino-acid endoglucanase. A proline- and threonine-rich region at the C terminus of ORF1 and rare codons for arginine and threonine were coincident with the proposed frameshift site. ENDAFS is proposed to be a member of subgroup 1 of family E endoglucanases, of which endoglucanases from Thermomonospora fusca and Persea americana (avocado) are also members. Endoglucanases from Clostridium thermocellum and Pseudomonas fluorescens form subgroup 2. Images PMID:1708767
Relative codon adaptation: a generic codon bias index for prediction of gene expression.

PubMed

Fox, Jesse M; Erill, Ivan

2010-06-01

The development of codon bias indices (CBIs) remains an active field of research due to their myriad applications in computational biology. Recently, the relative codon usage bias (RCBS) was introduced as a novel CBI able to estimate codon bias without using a reference set. The results of this new index when applied to Escherichia coli and Saccharomyces cerevisiae led the authors of the original publications to conclude that natural selection favours higher expression and enhanced codon usage optimization in short genes. Here, we show that this conclusion was flawed and based on the systematic oversight of an intrinsic bias for short sequences in the RCBS index and of biases in the small data sets used for validation in E. coli. Furthermore, we reveal that how the RCBS can be corrected to produce useful results and how its underlying principle, which we here term relative codon adaptation (RCA), can be made into a powerful reference-set-based index that directly takes into account the genomic base composition. Finally, we show that RCA outperforms the codon adaptation index (CAI) as a predictor of gene expression when operating on the CAI reference set and that this improvement is significantly larger when analysing genomes with high mutational bias.
Cryptic out-of-frame translational initiation of TBCE rescues tubulin formation in compound heterozygous HRD.

PubMed

Tian, Guoling; Huang, Melissa C; Parvari, Ruti; Diaz, George A; Cowan, Nicholas J

2006-09-05

Microtubules are indispensable dynamic structures that contribute to many essential biological functions. Assembly of the native alpha/beta tubulin heterodimer, the subunit that polymerizes to form microtubules, requires the participation of several molecular chaperones, namely prefoldin, the cytosolic chaperonin CCT, and a series of five tubulin-specific chaperones termed cofactors A-E (TBCA-E). Among these, TBCC, TBCD, and TBCE are essential in higher eukaryotes; they function together as a multimolecular machine that assembles quasinative CCT-generated alpha- and beta-tubulin polypeptides into new heterodimers. Deletion and truncation mutations in the gene encoding TBCE have been shown to cause the rare autosomal recessive syndrome known as HRD, a devastating disorder characterized by congenital hypoparathyroidism, mental retardation, facial dysmorphism, and extreme growth failure. Here we identify cryptic translational initiation at each of three out-of-frame AUG codons upstream of the genetic lesion as a unique mechanism that rescues a mutant HRD allele by producing a functional TBCE protein. Our data explain how afflicted individuals, who would otherwise lack the capacity to make functional TBCE, can survive and point to a limiting capacity to fold tubulin heterodimers de novo as a contributing factor to disease pathogenesis.
Cryptic out-of-frame translational initiation of TBCE rescues tubulin formation in compound heterozygous HRD

PubMed Central

Tian, Guoling; Huang, Melissa C.; Parvari, Ruti; Diaz, George A.; Cowan, Nicholas J.

2006-01-01

Microtubules are indispensable dynamic structures that contribute to many essential biological functions. Assembly of the native α/β tubulin heterodimer, the subunit that polymerizes to form microtubules, requires the participation of several molecular chaperones, namely prefoldin, the cytosolic chaperonin CCT, and a series of five tubulin-specific chaperones termed cofactors A–E (TBCA–E). Among these, TBCC, TBCD, and TBCE are essential in higher eukaryotes; they function together as a multimolecular machine that assembles quasinative CCT-generated α- and β-tubulin polypeptides into new heterodimers. Deletion and truncation mutations in the gene encoding TBCE have been shown to cause the rare autosomal recessive syndrome known as HRD, a devastating disorder characterized by congenital hypoparathyroidism, mental retardation, facial dysmorphism, and extreme growth failure. Here we identify cryptic translational initiation at each of three out-of-frame AUG codons upstream of the genetic lesion as a unique mechanism that rescues a mutant HRD allele by producing a functional TBCE protein. Our data explain how afflicted individuals, who would otherwise lack the capacity to make functional TBCE, can survive and point to a limiting capacity to fold tubulin heterodimers de novo as a contributing factor to disease pathogenesis. PMID:16938882
Diverse expression levels of two codon-optimized genes that encode human papilloma virus type 16 major protein L1 in Hansenula polymorpha.

PubMed

Liu, Cunbao; Yang, Xu; Yao, Yufeng; Huang, Weiwei; Sun, Wenjia; Ma, Yanbing

2014-05-01

Two versions of an optimized gene that encodes human papilloma virus type 16 major protein L1 were designed according to the codon usage frequency of Pichia pastoris. Y16 was highly expressed in both P. pastoris and Hansenula polymorpha. M16 expression was as efficient as that of Y16 in P. pastoris, but merely detectable in H. polymorpha even though transcription levels of M16 and Y16 were similar. H. polymorpha had a unique codon usage frequency that contains many more rare codons than Saccharomyces cerevisiae or P. pastoris. These findings indicate that even codon-optimized genes that are expressed well in S. cerevisiae and P. pastoris may be inefficiently expressed in H. polymorpha; thus rare codons must be avoided when universal optimized gene versions are designed to facilitate expression in a variety of yeast expression systems, especially H. polymorpha is involved.
A common class of transcripts with 5'-intron depletion, distinct early coding sequence features, and N1-methyladenosine modification.

PubMed

Cenik, Can; Chua, Hon Nian; Singh, Guramrit; Akef, Abdalla; Snyder, Michael P; Palazzo, Alexander F; Moore, Melissa J; Roth, Frederick P

2017-03-01

Introns are found in 5' untranslated regions (5'UTRs) for 35% of all human transcripts. These 5'UTR introns are not randomly distributed: Genes that encode secreted, membrane-bound and mitochondrial proteins are less likely to have them. Curiously, transcripts lacking 5'UTR introns tend to harbor specific RNA sequence elements in their early coding regions. To model and understand the connection between coding-region sequence and 5'UTR intron status, we developed a classifier that can predict 5'UTR intron status with >80% accuracy using only sequence features in the early coding region. Thus, the classifier identifies transcripts with 5 ' proximal- i ntron- m inus-like-coding regions ("5IM" transcripts). Unexpectedly, we found that the early coding sequence features defining 5IM transcripts are widespread, appearing in 21% of all human RefSeq transcripts. The 5IM class of transcripts is enriched for non-AUG start codons, more extensive secondary structure both preceding the start codon and near the 5' cap, greater dependence on eIF4E for translation, and association with ER-proximal ribosomes. 5IM transcripts are bound by the exon junction complex (EJC) at noncanonical 5' proximal positions. Finally, N 1 -methyladenosines are specifically enriched in the early coding regions of 5IM transcripts. Taken together, our analyses point to the existence of a distinct 5IM class comprising ∼20% of human transcripts. This class is defined by depletion of 5' proximal introns, presence of specific RNA sequence features associated with low translation efficiency, N 1 -methyladenosines in the early coding region, and enrichment for noncanonical binding by the EJC. © 2017 Cenik et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Association between mismatch repair gene MSH3 codons 1036 and 222 polymorphisms and sporadic prostate cancer in the Iranian population.

PubMed

Jafary, Fariba; Salehi, Mansoor; Sedghi, Maryam; Nouri, Nayereh; Jafary, Farzaneh; Sadeghi, Farzaneh; Motamedi, Shima; Talebi, Maede

2012-01-01

The mismatch repair system (MMR) is a post-replicative DNA repair mechanism whose defects can lead to cancer. The MSH3 protein is an essential component of the system. We postulated that MSH3 gene polymorphisms might therefore be associated with prostate cancer (PC). We studied MSH3 codon 222 and MSH3 codon 1036 polymorphisms in a group of Iranian sporadic PC patients. A total of 60 controls and 18 patients were assessed using the polymerase chain reaction and single strand conformational polymorphism. For comparing the genotype frequencies of patients and controls the chi-square test was applied. The obtained result indicated that there was significantly association between G/A genotype of MSH3 codon 222 and G/G genotype of MSH3 codon 1036 with an increased PC risk (P=0.012 and P=0.02 respectively). Our results demonstrated that MSH3 codon 222 and MSH3 codon 1036 polymorphisms may be risk factors for sporadic prostate cancer in the Iranian population.
Codon usage bias and phylogenetic analysis of mitochondrial ND1 gene in pisces, aves, and mammals.

PubMed

Uddin, Arif; Choudhury, Monisha Nath; Chakraborty, Supriyo

2018-01-01

The mitochondrially encoded NADH:ubiquinone oxidoreductase core subunit 1 (MT-ND1) gene is a subunit of the respiratory chain complex I and involved in the first step of the electron transport chain of oxidative phosphorylation (OXPHOS). To understand the pattern of compositional properties, codon usage and expression level of mitochondrial ND1 genes in pisces, aves, and mammals, we used bioinformatic approaches as no work was reported earlier. In this study, a perl script was used for calculating nucleotide contents and different codon usage bias parameters. The codon usage bias of MT-ND1 was low but the expression level was high as revealed from high ENC and CAI value. Correspondence analysis (COA) suggests that the pattern of codon usage for MT-ND1 gene is not same across species and that compositional constraint played an important role in codon usage pattern of this gene among pisces, aves, and mammals. From the regression equation of GC12 on GC3, it can be inferred that the natural selection might have played a dominant role while mutation pressure played a minor role in influencing the codon usage patterns. Further, ND1 gene has a discrepancy with cytochrome B (CYB) gene in preference of codons as evident from COA. The codon usage bias was low. It is influenced by nucleotide composition, natural selection, mutation pressure, length (number) of amino acids, and relative dinucleotide composition. This study helps in understanding the molecular biology, genetics, evolution of MT-ND1 gene, and also for designing a synthetic gene.
An integrated, structure- and energy-based view of the genetic code.

PubMed

Grosjean, Henri; Westhof, Eric

2016-09-30

The principles of mRNA decoding are conserved among all extant life forms. We present an integrative view of all the interaction networks between mRNA, tRNA and rRNA: the intrinsic stability of codon-anticodon duplex, the conformation of the anticodon hairpin, the presence of modified nucleotides, the occurrence of non-Watson-Crick pairs in the codon-anticodon helix and the interactions with bases of rRNA at the A-site decoding site. We derive a more information-rich, alternative representation of the genetic code, that is circular with an unsymmetrical distribution of codons leading to a clear segregation between GC-rich 4-codon boxes and AU-rich 2:2-codon and 3:1-codon boxes. All tRNA sequence variations can be visualized, within an internal structural and energy framework, for each organism, and each anticodon of the sense codons. The multiplicity and complexity of nucleotide modifications at positions 34 and 37 of the anticodon loop segregate meaningfully, and correlate well with the necessity to stabilize AU-rich codon-anticodon pairs and to avoid miscoding in split codon boxes. The evolution and expansion of the genetic code is viewed as being originally based on GC content with progressive introduction of A/U together with tRNA modifications. The representation we present should help the engineering of the genetic code to include non-natural amino acids. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Evolutionary conservation of codon optimality reveals hidden signatures of cotranslational folding.

PubMed

Pechmann, Sebastian; Frydman, Judith

2013-02-01

The choice of codons can influence local translation kinetics during protein synthesis. Whether codon preference is linked to cotranslational regulation of polypeptide folding remains unclear. Here, we derive a revised translational efficiency scale that incorporates the competition between tRNA supply and demand. Applying this scale to ten closely related yeast species, we uncover the evolutionary conservation of codon optimality in eukaryotes. This analysis reveals universal patterns of conserved optimal and nonoptimal codons, often in clusters, which associate with the secondary structure of the translated polypeptides independent of the levels of expression. Our analysis suggests an evolved function for codon optimality in regulating the rhythm of elongation to facilitate cotranslational polypeptide folding, beyond its previously proposed role of adapting to the cost of expression. These findings establish how mRNA sequences are generally under selection to optimize the cotranslational folding of corresponding polypeptides.

Overcoming codon bias: a method for high-level overexpression of Plasmodium and other AT-rich parasite genes in Escherichia coli.

PubMed

Baca, A M; Hol, W G

2000-02-01

Parasite genes often use codons which are rarely used in the highly expressed genes of Escherichia coli, possibly resulting in translational stalling and lower yields of recombinant protein. We have constructed the "RIG" plasmid to overcome the potential codon-bias problem seen in Plasmodium genes. RIG contains the genes that encode three tRNAs (Arg, Ile, Gly), which recognise rare codons found in parasite genes. When co-transformed into E. coli along with expression plasmids containing parasite genes, RIG can greatly increase levels of overexpressed protein. Codon frequency analysis suggests that RIG may be applied to a variety of protozoan and helminth genes.
Crossing Borders in Educational Innovation: Framing Foreign Examples in Discussing Comprehensive Education in the Netherlands, 1969-1979

ERIC Educational Resources Information Center

Greveling, Linda; Amsing, Hilda T. A.; Dekker, Jeroen J. H.

2014-01-01

In the Netherlands, crossing borders to study comprehensive schools was an important strategy in the 1970s, a decisive period for the start and the end of the innovation. According to policy-borrowing theory, actors that engage in debating educational issues are framing foreign examples of comprehensive schooling to convince their audiences.…
A Novel Frameshift Mutation at Codons 138/139 (HBB: c.417_418insT) on the β-Globin Gene Leads to β-Thalassemia.

PubMed

Jiang, Fan; Huang, Lv-Yin; Chen, Gui-Lan; Zhou, Jian-Ying; Xie, Xing-Mei; Li, Dong-Zhi

2017-01-01

We describe a new β-thalassemic mutation in a Chinese subject. This allele develops by insertion of one nucleotide (+T) between codons 138 and 139 in the third exon of the β-globin gene. The mutation causes a frameshift that leads to a termination codon at codon 139. In the heterozygote, this allele has the phenotype of classical β-thalassemia (β-thal) minor.
Codon usage and expression level of human mitochondrial 13 protein coding genes across six continents.

PubMed

Chakraborty, Supriyo; Uddin, Arif; Mazumder, Tarikul Huda; Choudhury, Monisha Nath; Malakar, Arup Kumar; Paul, Prosenjit; Halder, Binata; Deka, Himangshu; Mazumder, Gulshana Akthar; Barbhuiya, Riazul Ahmed; Barbhuiya, Masuk Ahmed; Devi, Warepam Jesmi

2017-12-02

The study of codon usage coupled with phylogenetic analysis is an important tool to understand the genetic and evolutionary relationship of a gene. The 13 protein coding genes of human mitochondria are involved in electron transport chain for the generation of energy currency (ATP). However, no work has yet been reported on the codon usage of the mitochondrial protein coding genes across six continents. To understand the patterns of codon usage in mitochondrial genes across six different continents, we used bioinformatic analyses to analyze the protein coding genes. The codon usage bias was low as revealed from high ENC value. Correlation between codon usage and GC3 suggested that all the codons ending with G/C were positively correlated with GC3 but vice versa for A/T ending codons with the exception of ND4L and ND5 genes. Neutrality plot revealed that for the genes ATP6, COI, COIII, CYB, ND4 and ND4L, natural selection might have played a major role while mutation pressure might have played a dominant role in the codon usage bias of ATP8, COII, ND1, ND2, ND3, ND5 and ND6 genes. Phylogenetic analysis indicated that evolutionary relationships in each of 13 protein coding genes of human mitochondria were different across six continents and further suggested that geographical distance was an important factor for the origin and evolution of 13 protein coding genes of human mitochondria. Copyright © 2017 Elsevier B.V. and Mitochondria Research Society. All rights reserved.
[Genetic diversity and genetic structure of endangered wild Sinopodophyllum emodi by start codon targeted polymorphism].

PubMed

Chen, Da-Xia; Zhao, Ji-Feng; Liu, Xiang; Wang, Chang-Hua; Zhang, Zhi-Wei; Qin, Song-Yun; Zhong, Guo-Yue

2013-01-01

Revealed the genetic diversity level and genetic structure characteristics in Sinopodophyllum emodi, a rare and endangered species in China. We detected the genetic polymorphism within and among six wild populations (45 individuals) by the approach of Start Codon Targeted (SCoT) Polymorphism. The associated genetic parameters were calculated by POP-GENE1.31 and the relationship was constructed based on UPGMA method. A total of 350 bands were scored by 27 primers and 284 bands of them were polymorphic. The average polymorphic bands of each primer were 10.52. At species level, there was a high level of genetic diversity among six populations (PPB = 79.27%, N(e) = 1.332 7, H = 0.210 9 and H(sp) = 0.328 6). At population level, the genetic diversity level was low (PPB = 10.48% (4.00% -23.71%), N(e) = 1.048 7 (1.020 7-1.103 7), H = 0.029 7 (0.012 9-0.063 1), H(pop) = 0.046 2 (0.019 9-0.098 6). The Nei's coefficient of genetic differentiation was 0.841 1, which was consistent with the Shannon's coefficient of genetic differentiation (0.849 4). Two calculated methods all showed that most of the genetic variation existed among populations. The gene flow (N(m) = 0.094 4) was less among populations, indicating that the degree of genetic differentiation was higher. Genetic similarity coefficient were changed from 0.570 8 to 0.978 7. By clustering analysis, the tested populations were divided into two classes and had a tendency that the same geographical origin or material of similar habitats clustered into one group. The genetic diversity of samples of S. emodi is high,which laid a certain foundation for effective protection and improvement of germplasm resources.
Naturally Occurring Mutations in Large Surface Genes Related to Occult Infection of Hepatitis B Virus Genotype C

PubMed Central

Kim, Hong; Lee, Seoung-Ae; Kim, Dong-Won; Lee, Sueng-Hyun; Kim, Bum-Joon

2013-01-01

Molecular mechanisms related to occult hepatitis B virus (HBV) infection, particularly those based on genotype C infection, have rarely been determined thus far in the ongoing efforts to determine infection mechanisms. Therefore, we aim to elucidate the mutation patterns in the surface open reading frame (S ORF) underlying occult infections of HBV genotype C in the present study. Nested PCRs were applied to 624 HBV surface antigen (HBsAg) negative Korean subjects. Cloning and sequencing of the S ORF gene was applied to 41 occult cases and 40 control chronic carriers. Forty-one (6.6%) of the 624 Korean adults with HBsAg-negative serostatus were found to be positive for DNA according to nested PCR tests. Mutation frequencies in the three regions labeled here as preS1, preS2, and S were significantly higher in the occult subjects compared to the carriers in all cases. A total of two types of deletions, preS1 deletions in the start codon and preS2 deletions as well as nine types of point mutations were significantly implicated in the occult infection cases. Mutations within the “a” determinant region in HBsAg were found more frequently in the occult subjects than in the carriers. Mutations leading to premature termination of S ORF were found in 16 occult subjects (39.0%) but only in one subject from among the carriers (2.5%). In conclusion, our data suggest that preS deletions, the premature termination of S ORF, and “a” determinant mutations are associated with occult infections of HBV genotype C among a HBsAg-negative population. The novel mutation patterns related to occult infection introduced in the present study can help to broaden our understanding of HBV occult infections. PMID:23349904
Three stages in the evolution of the genetic code

NASA Technical Reports Server (NTRS)

Baumann, U.; Oro, J.

1993-01-01

A diversification of the genetic code based on the number of codons available for the proteinous amino acids is established. Three groups of amino acids during evolution of the code are distinguished. On the basis of their chemical complexity those amino acids emerging later in a translation process are derived. Codon number and chemical complexity indicate that His, Phe, Tyr, Cys and either Lys or Asn were introduced in the second stage, whereas the number of codons alone gives evidence that Trp and Met were introduced in the third stage. The amino acids of stage 1 use purine-rich codons, while all the amino acids introduced in the second stage, in contrast, use pyrimidines in the third position of their codons. A low abundance of pyrimidines during early translation is derived. This assumption is supported by experiments on non-enzymatic replication and interactions of hairpin loops with a complementary strand. A back extrapolation concludes a high purine content of the first nucleic acids, which gradually decreased during their evolution. Amino acids independently available from prebiotic synthesis were thus correlated to purine-rich codons. Implications on the prebiotic replication are discussed also in the light of recent codon usage data.
Carbon source-dependent expansion of the genetic code in bacteria

PubMed Central

Prat, Laure; Heinemann, Ilka U.; Aerni, Hans R.; Rinehart, Jesse; O’Donoghue, Patrick; Söll, Dieter

2012-01-01

Despite the fact that the genetic code is known to vary between organisms in rare cases, it is believed that in the lifetime of a single cell the code is stable. We found Acetohalobium arabaticum cells grown on pyruvate genetically encode 20 amino acids, but in the presence of trimethylamine (TMA), A. arabaticum dynamically expands its genetic code to 21 amino acids including pyrrolysine (Pyl). A. arabaticum is the only known organism that modulates the size of its genetic code in response to its environment and energy source. The gene cassette pylTSBCD, required to biosynthesize and genetically encode UAG codons as Pyl, is present in the genomes of 24 anaerobic archaea and bacteria. Unlike archaeal Pyl-decoding organisms that constitutively encode Pyl, we observed that A. arabaticum controls Pyl encoding by down-regulating transcription of the entire Pyl operon under growth conditions lacking TMA, to the point where no detectable Pyl-tRNAPyl is made in vivo. Pyl-decoding archaea adapted to an expanded genetic code by minimizing TAG codon frequency to typically ∼5% of ORFs, whereas Pyl-decoding bacteria (∼20% of ORFs contain in-frame TAGs) regulate Pyl-tRNAPyl formation and translation of UAG by transcriptional deactivation of genes in the Pyl operon. We further demonstrate that Pyl encoding occurs in a bacterium that naturally encodes the Pyl operon, and identified Pyl residues by mass spectrometry in A. arabaticum proteins including two methylamine methyltransferases. PMID:23185002
WES homozygosity mapping in a recessive form of Charcot-Marie-Tooth neuropathy reveals intronic GDAP1 variant leading to a premature stop codon.

PubMed

Masingue, Marion; Perrot, Jimmy; Carlier, Robert-Yves; Piguet-Lacroix, Guenaelle; Latour, Philippe; Stojkovic, Tanya

2018-05-01

Charcot-Marie-Tooth disease (CMT) refers to a group of clinically and genetically heterogeneous inherited neuropathies. Ganglioside-induced differentiation-associated protein 1 GDAP1-related CMT has been reported in an autosomal dominant or recessive form in patients presenting either axonal or demyelinating neuropathy. We report two Sri Lankan sisters born to consanguineous parents and presenting with a severe axonal sensorimotor neuropathy. The early onset of the disease, the distal and proximal weakness and atrophy leading to major disability, along with areflexia, and, most notably, vocal cord and diaphragm paralysis were highly evocative of a GDAP1-related CMT. However, sequencing of the coding regions of the gene was normal. Whole-exome sequencing (WES) was performed and revealed that the largest region of homozygosity was around GDAP1 with several variants, mostly in non-coding regions. In view of the high clinical suspicion of GDAP1 gene involvement, we examined the variants in this gene and this, along with functional studies, allowed us to identify an alternative splicing site revealing a cryptic in-frame stop codon in intron 4 responsible for a severe loss of wild-type GDAP1. This work is the first to describe a deleterious mutation in GDAP1 gene outside of coding sequences or intronic junctions and emphasizes the importance of interpreting molecular analysis, and in particular WES results, in light of the clinical and electrophysiological phenotype.
Congenital nephrogenic diabetes insipidus with a novel mutation in the aquaporin 2 gene.

PubMed

Park, Youn Jong; Baik, Haing Woon; Cheong, Hae Il; Kang, Ju Hyung

2014-07-01

Congenital nephrogenic diabetes insipidus (CNDI) is a rare disorder caused by mutations of the arginine vasopressin (AVP) V2 receptor or aquaporin 2 ( AQP2 ) genes. The current study presented the case of CNDI in a 1-month-old male with a novel mutation in the AQP2 gene. The patient was referred due to the occurrence of hypernatremia and mild-intermittent fever since birth. An AVP stimulation test was compatible with CNDI as there was no significant response to desmopressin. Molecular genetic analysis demonstrated two mutations in exon 1 of the AQP2 gene: C to T transition, which resulted in a missense mutation of 108 Thr (ACG) to Met (ATG); and a 127, 128 delCA, which resulted in a deletion mutation of glutamine in position 43 at codon CAG as the first affected amino acid, with the new reading frame endign in a termination codon at position 62. The molecular genetic analysis of the parents showed that the missense mutation was inherited maternally and the deletion mutation was inherited paternally. The parents showed no signs or symptoms of CNDI, indicating autosomal recessive inheritance. The 108 Thr (ACG) to Met (ATG) mutation was confirmed as a novel mutation. Therefore, the molecular identification of the AQP2 gene has clinical significance, as early recognition of CNDI in infants that show only non-specific symptoms, can be facilitated. Thus, repeated episodes of dehydration, which may cause physical and mental retardation can be avoided.
Congenital nephrogenic diabetes insipidus with a novel mutation in the aquaporin 2 gene

PubMed Central

PARK, YOUN JONG; BAIK, HAING WOON; CHEONG, HAE IL; KANG, JU HYUNG

2014-01-01

Congenital nephrogenic diabetes insipidus (CNDI) is a rare disorder caused by mutations of the arginine vasopressin (AVP) V2 receptor or aquaporin 2 (AQP2) genes. The current study presented the case of CNDI in a 1-month-old male with a novel mutation in the AQP2 gene. The patient was referred due to the occurrence of hypernatremia and mild-intermittent fever since birth. An AVP stimulation test was compatible with CNDI as there was no significant response to desmopressin. Molecular genetic analysis demonstrated two mutations in exon 1 of the AQP2 gene: C to T transition, which resulted in a missense mutation of 108Thr (ACG) to Met (ATG); and a 127, 128 delCA, which resulted in a deletion mutation of glutamine in position 43 at codon CAG as the first affected amino acid, with the new reading frame endign in a termination codon at position 62. The molecular genetic analysis of the parents showed that the missense mutation was inherited maternally and the deletion mutation was inherited paternally. The parents showed no signs or symptoms of CNDI, indicating autosomal recessive inheritance. The 108Thr (ACG) to Met (ATG) mutation was confirmed as a novel mutation. Therefore, the molecular identification of the AQP2 gene has clinical significance, as early recognition of CNDI in infants that show only non-specific symptoms, can be facilitated. Thus, repeated episodes of dehydration, which may cause physical and mental retardation can be avoided. PMID:24944815
Cardiomyopathy in epidermolysis bullosa simplex patients with mutations in the KLHL24 gene.

PubMed

Yenamandra, V K; van den Akker, P C; Lemmink, H H; Jan, S Z; Diercks, G F H; Vermeer, M; van den Berg, M P; van der Meer, P; Pasmooij, A M G; Sinke, R J; Jonkman, M F; Bolling, M C

2018-05-19

Dominant mutations in the KLHL24 gene, encoding for kelch-like protein 24, have been implicated in the pathogenesis of epidermolysis bullosa simplex (EBS). So far, 26 patients from different ethnicities have been reported and all of them harboured a heterozygous KLHL24 start-codon mutation, with c.1A>G;p.Met1? being the most prevalent. 1-3 Through this report, we aimed to expand the phenotypic spectrum by incorporating additional findings, in particular, dilated cardiomyopathy, seen in a Dutch family. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Comparative analysis of the full genome sequence of European bat lyssavirus type 1 and type 2 with other lyssaviruses and evidence for a conserved transcription termination and polyadenylation motif in the G-L 3' non-translated region.

PubMed

Marston, D A; McElhinney, L M; Johnson, N; Müller, T; Conzelmann, K K; Tordo, N; Fooks, A R

2007-04-01

We report the first full-length genomic sequences for European bat lyssavirus type-1 (EBLV-1) and type-2 (EBLV-2). The EBLV-1 genomic sequence was derived from a virus isolated from a serotine bat in Hamburg, Germany, in 1968 and the EBLV-2 sequence was derived from a virus isolate from a human case of rabies that occurred in Scotland in 2002. A long-distance PCR strategy was used to amplify the open reading frames (ORFs), followed by standard and modified RACE (rapid amplification of cDNA ends) techniques to amplify the 3' and 5' ends. The lengths of each complete viral genome for EBLV-1 and EBLV-2 were 11 966 and 11 930 base pairs, respectively, and follow the standard rhabdovirus genome organization of five viral proteins. Comparison with other lyssavirus sequences demonstrates variation in degrees of homology, with the genomic termini showing a high degree of complementarity. The nucleoprotein was the most conserved, both intra- and intergenotypically, followed by the polymerase (L), matrix and glyco- proteins, with the phosphoprotein being the most variable. In addition, we have shown that the two EBLVs utilize a conserved transcription termination and polyadenylation (TTP) motif, approximately 50 nt upstream of the L gene start codon. All available lyssavirus sequences to date, with the exception of Pasteur virus (PV) and PV-derived isolates, use the second TTP site. This observation may explain differences in pathogenicity between lyssavirus strains, dependent on the length of the untranslated region, which might affect transcriptional activity and RNA stability.
Identification and codon reading properties of 5-cyanomethyl uridine, a new modified nucleoside found in the anticodon wobble position of mutant haloarchaeal isoleucine tRNAs

PubMed Central

Mandal, Debabrata; Köhrer, Caroline; Su, Dan; Babu, I. Ramesh; Chan, Clement T.Y.; Liu, Yuchen; Söll, Dieter; Blum, Paul; Kuwahara, Masayasu; Dedon, Peter C.; RajBhandary, Uttam L.

2014-01-01

Most archaea and bacteria use a modified C in the anticodon wobble position of isoleucine tRNA to base pair with A but not with G of the mRNA. This allows the tRNA to read the isoleucine codon AUA without also reading the methionine codon AUG. To understand why a modified C, and not U or modified U, is used to base pair with A, we mutated the C34 in the anticodon of Haloarcula marismortui isoleucine tRNA (tRNA2Ile) to U, expressed the mutant tRNA in Haloferax volcanii, and purified and analyzed the tRNA. Ribosome binding experiments show that although the wild-type tRNA2Ile binds exclusively to the isoleucine codon AUA, the mutant tRNA binds not only to AUA but also to AUU, another isoleucine codon, and to AUG, a methionine codon. The G34 to U mutant in the anticodon of another H. marismortui isoleucine tRNA species showed similar codon binding properties. Binding of the mutant tRNA to AUG could lead to misreading of the AUG codon and insertion of isoleucine in place of methionine. This result would explain why most archaea and bacteria do not normally use U or a modified U in the anticodon wobble position of isoleucine tRNA for reading the codon AUA. Biochemical and mass spectrometric analyses of the mutant tRNAs have led to the discovery of a new modified nucleoside, 5-cyanomethyl U in the anticodon wobble position of the mutant tRNAs. 5-Cyanomethyl U is present in total tRNAs from euryarchaea but not in crenarchaea, eubacteria, or eukaryotes. PMID:24344322
New mutations of DAX-1 genes in two Japanese patients with X-linked congenital adrenal hypoplasia and hypogonadotropic hypogonadism

DOE Office of Scientific and Technical Information (OSTI.GOV)

Yanase, Toshihiko; Takayanagi, Ryoichi; Oba, Koichi

Congenital adrenal hypoplasia, an X-linked disorder, is characterized by primary adrenal insufficiency and frequent association with hypogonadotropic hypogonadism. The X-chromosome gene DAX-1 has been most recently identified and shown to be responsible for this disorder. We analyzed the DAX-1 genes of two unrelated Japanese patients with congenital adrenal hypoplasia and hypogonadotropic hypogonadism by using PCR amplification of genomic DNA and its complete exonic sequencing. In a family containing several affected individuals, the proband male patient had a stop codon (TGA) in place of tryptophan (TGG) at amino acid position 171. As expected, his mother was a heterozygous carrier for themore » mutation, whereas his father and unaffected brother did not carry this mutation. In another male patient with noncontributory family history, sequencing revealed a 1-bp (T) deletion at amino acid position 280, leading to a frame shift and, subsequently a premature stop codon at amino acid position 371. The presence of this mutation in the patients` genome was further confirmed by digestion of genomic PCR product with MspI created by this mutation. Family studies using MspI digestion of genomic PCR products revealed that neither parent of this individual carried the mutation. These results clearly indicate that congenital adrenal hypoplasia and hypogonadotropic hypogonadism result from not only inherited but also de novo mutation in the DAX-1 gene. 31 refs., 4 figs., 2 tabs.« less
Expanded subgenomic mRNA transcriptome and coding capacity of a nidovirus

PubMed Central

Di, Han; Madden, Joseph C.; Morantz, Esther K.; Tang, Hsin-Yao; Graham, Rachel L.; Baric, Ralph S.

2017-01-01

Members of the order Nidovirales express their structural protein ORFs from a nested set of 3′ subgenomic mRNAs (sg mRNAs), and for most of these ORFs, a single genomic transcription regulatory sequence (TRS) was identified. Nine TRSs were previously reported for the arterivirus Simian hemorrhagic fever virus (SHFV). In the present study, which was facilitated by next-generation sequencing, 96 SHFV body TRSs were identified that were functional in both infected MA104 cells and macaque macrophages. The abundance of sg mRNAs produced from individual TRSs was consistent over time in the two different cell types. Most of the TRSs are located in the genomic 3′ region, but some are in the 5′ ORF1a/1b region and provide alternative sources of nonstructural proteins. Multiple functional TRSs were identified for the majority of the SHFV 3′ ORFs, and four previously identified TRSs were found not to be the predominant ones used. A third of the TRSs generated sg mRNAs with variant leader–body junction sequences. Sg mRNAs encoding E′, GP2, or ORF5a as their 5′ ORF as well as sg mRNAs encoding six previously unreported alternative frame ORFs or 14 previously unreported C-terminal ORFs of known proteins were also identified. Mutation of the start codon of two C-terminal ORFs in an infectious clone reduced virus yield. Mass spectrometry detected one previously unreported protein and suggested translation of some of the C-terminal ORFs. The results reveal the complexity of the transcriptional regulatory mechanism and expanded coding capacity for SHFV, which may also be characteristic of other nidoviruses. PMID:29073030
Expression of codon-optmized phosphoenolpyruvate carboxylase gene from Glaciecola sp. HTCC2999 in Escherichia coli and its application for C4 chemical production.

PubMed

Park, Soohyun; Pack, Seung Pil; Lee, Jinwon

2012-08-01

We examined the expression of the phosphoenolpyruvate carboxylase (PEPC) gene from marine bacteria in Escherichia coli using codon optimization. The codon-optimized PEPC gene was expressed in the E. coli K-12 strain W3110. SDS-PAGE analysis revealed that the codon-optimized PEPC gene was only expressed in E. coli, and measurement of enzyme activity indicated the highest PEPC activity in the E. coli SGJS112 strain that contained the codon-optimized PEPC gene. In fermentation assays, the E. coli SGJS112 produced the highest yield of oxaloacetate using glucose as the source and produced a 20-times increase in the yield of malate compared to the control. We concluded that the codon optimization enabled E. coli to express the PEPC gene derived from the Glaciecola sp. HTCC2999. Also, the expressed protein exhibited an enzymatic activity similar to that of E. coli PEPC and increased the yield of oxaloacetate and malate in an E. coli system.
Antidepressants at environmentally relevant concentrations affect predator avoidance behavior of larval fathead minnows (Pimephales promelas).

USGS Publications Warehouse

Furlong, Edward T.; Barber, Larry B.; Meghan R. McGee,; Megan A. Buerkley,; Matthew L. Julius,; Vajda, Alan M.; Heiko L. Schoenfuss,; Schultz, Melissa M.; Norris, David O.

2009-01-01

The effects of embryonic and larval exposure to environmentally relevant (ng/L) concentrations of common antidepressants, fluoxetine, sertraline, venlafaxine, and bupropion (singularly and in mixture) on C-start escape behavior were evaluated in fathead minnows (Pimephales promelas). Embryos (postfertilization until hatching) were exposed for 5 d and, after hatching, were allowed to grow in control well water until 12 d old. Similarly, posthatch fathead minnows were exposed for 12 d to these compounds. High-speed (1,000 frames/s) video recordings of escape behavior were collected and transferred to National Institutes of Health Image for frame-by- frame analysis of latency periods, escape velocities, and total escape response (combination of latency period and escape velocity). When tested 12 d posthatch, fluoxetine and venlafaxine adversely affected C-start performance of larvae exposed as embryos. Conversely, larvae exposed for 12 d posthatch did not exhibit altered escape responses when exposed to fluoxetine but were affected by venlafaxine and bupropion exposure. Mixtures of these four antidepressant pharmaceuticals slowed predator avoidance behaviors in larval fathead minnows regardless of the exposure window. The direct impact of reduced C-start performance on survival and, ultimately, reproductive fitness provides an avenue to assess the ecological relevance of exposure in an assay of relatively short duration.
Antidepressants at environmentally relevant concentrations affect predator avoidance behavior of larval fathead minnows (Pimephales promelas)

USGS Publications Warehouse

Painter, M.M.; Buerkley, M.A.; Julius, M.L.; Vajda, A.M.; Norris, D.O.; Barber, L.B.; Furlong, E.T.; Schultz, M.M.; Schoenfuss, H.L.

2009-01-01

The effects of embryonic and larval exposure to environmentally relevant (ng/L) concentrations of common antidepressants, fluoxetine, sertraline, venlafaxine, and bupropion (singularly and in mixture) on C-start escape behavior were evaluated in fathead minnows (Pimephales promelas). Embryos (postfertilization until hatching) were exposed for 5 d and, after hatching, were allowed to grow in control well water until 12 d old. Similarly, posthatch fathead minnows were exposed for 12 d to these compounds. High-speed (1,000 frames/s) video recordings of escape behavior were collected and transferred to National Institutes of Health Image for frame-by-frame analysis of latency periods, escape velocities, and total escape response (combination of latency period and escape velocity). When tested 12 d posthatch, fluoxetine and venlafaxine adversely affected C-start performance of larvae exposed as embryos. Conversely, larvae exposed for 12 d posthatch did not exhibit altered escape responses when exposed to fluoxetine but were affected by venlafaxine and bupropion exposure. Mixtures of these four antidepressant pharmaceuticals slowed predator avoidance behaviors in larval fathead minnows regardless of the exposure window. The direct impact of reduced C-start performance on survival and, ultimately, reproductive fitness provides an avenue to assess the ecological relevance of exposure in an assay of relatively short duration. ?? 2009 SETAC.
EUGENE'HOM: A generic similarity-based gene finder using multiple homologous sequences.

PubMed

Foissac, Sylvain; Bardou, Philippe; Moisan, Annick; Cros, Marie-Josée; Schiex, Thomas

2003-07-01

EUGENE'HOM is a gene prediction software for eukaryotic organisms based on comparative analysis. EUGENE'HOM is able to take into account multiple homologous sequences from more or less closely related organisms. It integrates the results of TBLASTX analysis, splice site and start codon prediction and a robust coding/non-coding probabilistic model which allows EUGENE'HOM to handle sequences from a variety of organisms. The current target of EUGENE'HOM is plant sequences. The EUGENE'HOM web site is available at http://genopole.toulouse.inra.fr/bioinfo/eugene/EuGeneHom/cgi-bin/EuGeneHom.pl.

Demonstration of GTG as an endogenous initiation codon for a human mRNA transcript revealed by molecular cloning of the serpin endopin 2B.

PubMed

Hwang, Shin-Rong; Garza, Christina Z; Wegrzyn, Jill; Hook, Vivian Y H

2004-08-16

This study demonstrates utilization of the novel GTG initiation codon for translation of a human mRNA transcript that encodes the serpin endopin 2B, a protease inhibitor. Molecular cloning revealed the nucleotide sequence of the human endopin 2B cDNA. Its deduced primary sequence shows high homology to bovine endopin 2A that possesses cross-class protease inhibition of elastase and papain. Notably, the human endopin 2B cDNA sequence revealed GTG as the predicted translation initiation codon; the predicted translation product of 46 kDa endopin 2B was produced by in vitro translation of 35S-endopin 2B with mammalian (rabbit) protein translation components. Importantly, bioinformatic studies demonstrated the presence of the entire human endopin 2B cDNA sequence with GTG as initiation codon within the human genome on chromosome 14. Further evidence for GTG as a functional initiation codon was illustrated by GTG-mediated in vitro translation of the heterologous protein EGFP, and by GTG-mediated expression of EGFP in mammalian PC12 cells. Mutagenesis of GTG to GTC resulted in the absence of EGFP expression in PC12 cells, indicating the function of GTG as an initiation codon. In addition, it was apparent that the GTG initiation codon produces lower levels of translated protein compared to ATG as initiation codon. Significantly, GTG-mediated translation of endopin 2B demonstrates a functional human gene product not previously predicted from initial analyses of the human genome. Further analyses based on GTG as an alternative initiation codon may predict new candidate genes of the human genome.
Emergent Rules for Codon Choice Elucidated by Editing Rare Arginine Codons in Escherichia coli

DTIC Science & Technology

2016-09-20

alternative codons are more likely to be viable. To evaluate synonymous and nonsynonymous alternatives to essential AGRs further, we imple- mented a CRISPR ... Crispr -assisted MAGE). First, we designed oligos that changed not only the target AGR codon to NNN but also made several synonymous changes at least 50...nt downstream that would disrupt a 20-bp CRISPR target lo- cus. MAGE was used to replace each AGR with NNN in parallel, and CRISPR /cas9 was used to
Comparison of codon usage bias across Leishmania and Trypanosomatids to understand mRNA secondary structure, relative protein abundance and pathway functions.

PubMed

Subramanian, Abhishek; Sarkar, Ram Rup

2015-10-01

Understanding the variations in gene organization and its effect on the phenotype across different Leishmania species, and to study differential clinical manifestations of parasite within the host, we performed large scale analysis of codon usage patterns between Leishmania and other known Trypanosomatid species. We present the causes and consequences of codon usage bias in Leishmania genomes with respect to mutational pressure, translational selection and amino acid composition bias. We establish GC bias at wobble position that governs codon usage bias across Leishmania species, rather than amino acid composition bias. We found that, within Leishmania, homogenous codon context coding for less frequent amino acid pairs and codons avoiding formation of folding structures in mRNA are essentially chosen. We predicted putative differences in global expression between genes belonging to specific pathways across Leishmania. This explains the role of evolution in shaping the otherwise conserved genome to demonstrate species-specific function-level differences for efficient survival. Copyright © 2015 Elsevier Inc. All rights reserved.
Elements in the murine c-mos messenger RNA 5'-untranslated region repress translation of downstream coding sequences.

PubMed

Steel, L F; Telly, D L; Leonard, J; Rice, B A; Monks, B; Sawicki, J A

1996-10-01

Murine c-mos transcripts isolated from testes have 5'-untranslated regions (5'UTRs) of approximately 300 nucleotides with a series of four overlapping open reading frames (ORFs) upstream of the AUG codon that initiates the Mos ORF. Ovarian c-mos transcripts have shorter 5'UTRs (70-80 nucleotides) and contain only 1-2 of the upstream ORFs (uORFs). To test whether these 5'UTRs affect translational efficiency, we have constructed plasmids for the expression of chimeric transcripts with a mos-derived 5'UTR fused to the Escherichia coli beta-galactosidase coding region. Translational efficiency has been evaluated by measuring beta-galactosidase activity NIH3T3 cells transiently transfected with these plasmids and with plasmids where various mutations have been introduced into the 5'UTR. We show that the 5'UTR characteristic of testis-specific c-mos mRNA strongly represses translation relative to the translation of transcripts that contain a 5'UTR derived from beta-globin mRNA, and this is mainly due to the four uORFs. Each of the four upstream AUG triplets can be recognized as a start site for translation, and no single uAUG dominates the repressive effect. The uORFs repress translation by a mechanism that is not affected by the amino acid sequence in the COOH-terminal region of the uORF-encoded peptides. The very short uORF (AUGUGA) present in ovary-specific transcripts does not repress translation. Staining of testis sections from transgenic mice carrying chimeric beta-galactosidase transgene constructs, which contain a mos 5'UTR with or without the uATGs, suggests that the uORFs can dramatically change the pattern of expression in spermatogenic cells.
Near-cognate suppression of amber, opal and quadruplet codons competes with aminoacyl-tRNAPyl for genetic code expansion

PubMed Central

O’Donoghue, Patrick; Prat, Laure; Heinemann, Ilka U.; Ling, Jiqiang; Odoi, Keturah; Liu, Wenshe R.; Söll, Dieter

2012-01-01

Over 300 amino acids are found in proteins in nature, yet typically only 20 are genetically encoded. Reassigning stop codons and use of quadruplet codons emerged as the main avenues for genetically encoding non-canonical amino acids (NCAAs). Canonical aminoacyl-tRNAs with near-cognate anticodons also read these codons to some extent. This background suppression leads to ‘statistical protein’ that contains some natural amino acid(s) at a site intended for NCAA. We characterize near-cognate suppression of amber, opal and a quadruplet codon in common Escherichia coli laboratory strains and find that the PylRS/tRNAPyl orthogonal pair cannot completely outcompete contamination by natural amino acids. PMID:23036644
Importance of codon usage for the temporal regulation of viral gene expression

PubMed Central

Shin, Young C.; Bischof, Georg F.; Lauer, William A.; Desrosiers, Ronald C.

2015-01-01

The glycoproteins of herpesviruses and of HIV/SIV are made late in the replication cycle and are derived from transcripts that use an unusual codon usage that is quite different from that of the host cell. Here we show that the actions of natural transinducers from these two different families of persistent viruses (Rev of SIV and ORF57 of the rhesus monkey rhadinovirus) are dependent on the nature of the skewed codon usage. In fact, the transinducibility of expression of these glycoproteins by Rev and by ORF57 can be flipped simply by changing the nature of the codon usage. Even expression of a luciferase reporter could be made Rev dependent or ORF57 dependent by distinctive changes to its codon usage. Our findings point to a new general principle in which different families of persisting viruses use a poor codon usage that is skewed in a distinctive way to temporally regulate late expression of structural gene products. PMID:26504241
Identification and characterization of Kaposi's sarcoma-associated herpesvirus open reading frame 11 promotor activation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chen, Lei

2008-01-01

Open reading frame 11 (ORF11) of Kaposi's sarcoma-associated herpesvirus belongs to a herpesviral homologous protein family shared by some members of the gamma- herpesvirus subfamily. Little is known about this ORF11 homologous protein family. We have characterized an unknown open reading frame, ORF11, located adjacent and in the opposite orientation to a well-characterized viral IL-6 gene. Northern blot analysis reveals that ORF11 is expressed during the KSHV lytic cycle with delayed-early transcription kinetics. We have determined the 5{prime} and 3{prime} untranslated region of the unspliced ORF11 transcript and identified both the transcription start site and the transcription termination site. Coremore » promoter region, representing ORF11 promoter activity, was mapped to a 159nt fragment 5{prime} most proximal to the transcription start site. A functional TATA box was identified in the core promoter region. Interestingly, we found that ORF11 transcriptional activation is not responsive to Rta, the KSHV lytic switch protein. We also discovered that part of the ORF11 promoter region, the 209nt fragment upstream of the transcription start site, was repressed by phorbol esters. Our data help to understand transcription regulation of ORF11 and to elucidate roles of ORF11 in KSHV pathogenesis and life cycle.« less
Stop codons in the hepatitis B surface proteins are enriched during antiviral therapy and are associated with host cell apoptosis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Colledge, Danielle; Soppe, Sally; Yuen, Lilly

Premature stop codons in the hepatitis B virus (HBV) surface protein can be associated with nucleos(t)ide analogue resistance due to overlap of the HBV surface and polymerase genes. The aim of this study was to determine the effect of the replication of three common surface stop codon variants on the hepatocyte. Cell lines were transfected with infectious HBV clones encoding surface stop codons rtM204I/sW196*, rtA181T/sW172*, rtV191I/sW182*, and a panel of substitutions in the surface proteins. HBsAg was measured by Western blotting. Proliferation and apoptosis were measured using flow cytometry. All three surface stop codon variants were defective in HBsAg secretion.more » Cells transfected with these variants were less proliferative and had higher levels of apoptosis than those transfected with variants that did not encode surface stop codons. The most cytopathic variant was rtM204I/sW196*. Replication of HBV encoding surface stop codons was toxic to the cell and promoted apoptosis, exacerbating disease progression. - Highlights: •Under normal circumstances, HBV replication is not cytopathic. •Premature stop codons in the HBV surface protein can be selected and enriched during nucleos(t)ide analogue therapy. •Replication of these variants can be cytopathic to the cell and promote apoptosis. •Inadequate antiviral therapy may actually promote disease progression.« less
Planning actions in robot automated operations

NASA Technical Reports Server (NTRS)

Das, A.

1988-01-01

Action planning in robot automated operations requires intelligent task level programming. Invoking intelligence necessiates a typical blackboard based architecture, where, a plan is a vector between the start frame and the goal frame. This vector is composed of partially ordered bases. A partial ordering of bases presents good and bad sides in action planning. Partial ordering demands the use of a temporal data base management system.
Comparative Genomic Analysis MERS CoV Isolated from Humans and Camels with Special Reference to Virus Encoded Helicase.

PubMed

Alnazawi, Mohamed; Altaher, Abdallah; Kandeel, Mahmoud

2017-01-01

Middle East Respiratory Syndrome Coronavirus (MERS CoV) is a new emerging viral disease characterized by high fatality rate. Understanding MERS CoV genetic aspects and codon usage pattern is important to understand MERS CoV survival, adaptation, evolution, resistance to innate immunity, and help in finding the unique aspects of the virus for future drug discovery experiments. In this work, we provide comprehensive analysis of 238 MERS CoV full genomes comprised of human (hMERS) and camel (cMERS) isolates of the virus. MERS CoV genome shaping seems to be under compositional and mutational bias, as revealed by preference of A/T over G/C nucleotides, preferred codons, nucleotides at the third position of codons (NT3s), relative synonymous codon usage, hydropathicity (Gravy), and aromaticity (Aromo) indices. Effective number of codons (ENc) analysis reveals a general slight codon usage bias. Codon adaptation index reveals incomplete adaptation to host environment. MERS CoV showed high ability to resist the innate immune response by showing lower CpG frequencies. Neutrality evolution analysis revealed a more significant role of mutation pressure in cMERS over hMERS. Correspondence analysis revealed that MERS CoV genomes have three genetic clusters, which were distinct in their codon usage, host, and geographic distribution. Additionally, virtual screening and binding experiments were able to identify three new virus-encoded helicase binding compounds. These compounds can be used for further optimization of inhibitors.
GC-Content of Synonymous Codons Profoundly Influences Amino Acid Usage

PubMed Central

Li, Jing; Zhou, Jun; Wu, Ying; Yang, Sihai; Tian, Dacheng

2015-01-01

Amino acids typically are encoded by multiple synonymous codons that are not used with the same frequency. Codon usage bias has drawn considerable attention, and several explanations have been offered, including variation in GC-content between species. Focusing on a simple parameter—combined GC proportion of all the synonymous codons for a particular amino acid, termed GCsyn—we try to deepen our understanding of the relationship between GC-content and amino acid/codon usage in more details. We analyzed 65 widely distributed representative species and found a close association between GCsyn, GC-content, and amino acids usage. The overall usages of the four amino acids with the greatest GCsyn and the five amino acids with the lowest GCsyn both vary with the regional GC-content, whereas the usage of the remaining 11 amino acids with intermediate GCsyn is less variable. More interesting, we discovered that codon usage frequencies are nearly constant in regions with similar GC-content. We further quantified the effects of regional GC-content variation (low to high) on amino acid usage and found that GC-content determines the usage variation of amino acids, especially those with extremely high GCsyn, which accounts for 76.7% of the changed GC-content for those regions. Our results suggest that GCsyn correlates with GC-content and has impact on codon/amino acid usage. These findings suggest a novel approach to understanding the role of codon and amino acid usage in shaping genomic architecture and evolutionary patterns of organisms. PMID:26248983
Analyzing gene expression from relative codon usage bias in Yeast genome: a statistical significance and biological relevance.

PubMed

Das, Shibsankar; Roymondal, Uttam; Sahoo, Satyabrata

2009-08-15

Based on the hypothesis that highly expressed genes are often characterized by strong compositional bias in terms of codon usage, there are a number of measures currently in use that quantify codon usage bias in genes, and hence provide numerical indices to predict the expression levels of genes. With the recent advent of expression measure from the score of the relative codon usage bias (RCBS), we have explicitly tested the performance of this numerical measure to predict the gene expression level and illustrate this with an analysis of Yeast genomes. In contradiction with previous other studies, we observe a weak correlations between GC content and RCBS, but a selective pressure on the codon preferences in highly expressed genes. The assertion that the expression of a given gene depends on the score of relative codon usage bias (RCBS) is supported by the data. We further observe a strong correlation between RCBS and protein length indicating natural selection in favour of shorter genes to be expressed at higher level. We also attempt a statistical analysis to assess the strength of relative codon bias in genes as a guide to their likely expression level, suggesting a decrease of the informational entropy in the highly expressed genes.
Codon Usage Bias and Determining Forces in Taenia solium Genome.

PubMed

Yang, Xing; Ma, Xusheng; Luo, Xuenong; Ling, Houjun; Zhang, Xichen; Cai, Xuepeng

2015-12-01

The tapeworm Taenia solium is an important human zoonotic parasite that causes great economic loss and also endangers public health. At present, an effective vaccine that will prevent infection and chemotherapy without any side effect remains to be developed. In this study, codon usage patterns in the T. solium genome were examined through 8,484 protein-coding genes. Neutrality analysis showed that T. solium had a narrow GC distribution, and a significant correlation was observed between GC12 and GC3. Examination of an NC (ENC vs GC3s)-plot showed a few genes on or close to the expected curve, but the majority of points with low-ENC (the effective number of codons) values were detected below the expected curve, suggesting that mutational bias plays a major role in shaping codon usage. The Parity Rule 2 plot (PR2) analysis showed that GC and AT were not used proportionally. We also identified 26 optimal codons in the T. solium genome, all of which ended with either a G or C residue. These optimal codons in the T. solium genome are likely consistent with tRNAs that are highly expressed in the cell, suggesting that mutational and translational selection forces are probably driving factors of codon usage bias in the T. solium genome.
Genome-wide analysis reveals class and gene specific codon usage adaptation in avian paramyxoviruses 1

USDA-ARS?s Scientific Manuscript database

In order to characterize the evolutionary adaptations of avian paramyxovirus 1 (APMV-1) genomes, we have compared codon usage and codon adaptation indexes among groups of Newcastle disease viruses that differ in biological, ecological, and genetic characteristics. We have used available GenBank com...
Theoretical foundations for quantitative paleogenetics. III - The molecular divergence of nucleic acids and proteins for the case of genetic events of unequal probability

NASA Technical Reports Server (NTRS)

Holmquist, R.; Pearl, D.

1980-01-01

Theoretical equations are derived for molecular divergence with respect to gene and protein structure in the presence of genetic events with unequal probabilities: amino acid and base compositions, the frequencies of nucleotide replacements, the usage of degenerate codons, the distribution of fixed base replacements within codons and the distribution of fixed base replacements among codons. Results are presented in the form of tables relating the probabilities of given numbers of codon base changes with respect to the original codon for the alpha hemoglobin, beta hemoglobin, myoglobin, cytochrome c and parvalbumin group gene families. Application of the calculations to the rabbit alpha and beta hemoglobin mRNAs and proteins indicates that the genes are separated by about 425 fixed based replacements distributed over 114 codon sites, which is a factor of two greater than previous estimates. The theoretical results also suggest that many more base replacements are required to effect a given gene or protein structural change than previously believed.
Ribosome reinitiation at leader peptides increases translation of bacterial proteins.

PubMed

Korolev, Semen A; Zverkov, Oleg A; Seliverstov, Alexandr V; Lyubetsky, Vassily A

2016-04-16

Short leader genes usually do not encode stable proteins, although their importance in expression control of bacterial genomes is widely accepted. Such genes are often involved in the control of attenuation regulation. However, the abundance of leader genes suggests that their role in bacteria is not limited to regulation. Specifically, we hypothesize that leader genes increase the expression of protein-coding (structural) genes via ribosome reinitiation at the leader peptide in the case of a short distance between the stop codon of the leader gene and the start codon of the structural gene. For instance, in Actinobacteria, the frequency of leader genes at a distance of 10-11 bp is about 70 % higher than the mean frequency within the 1 to 65 bp range; and it gradually decreases as the range grows longer. A pronounced peak of this frequency-distance relationship is also observed in Proteobacteria, Bacteroidetes, Spirochaetales, Acidobacteria, the Deinococcus-Thermus group, and Planctomycetes. In contrast, this peak falls to the distance of 15-16 bp and is not very pronounced in Firmicutes; and no such peak is observed in cyanobacteria and tenericutes. Generally, this peak is typical for many bacteria. Some leader genes located close to a structural gene probably play a regulatory role as well.
Influence of codon usage bias on FGLamide-allatostatin mRNA secondary structure.

PubMed

Martínez-Pérez, Francisco; Bendena, William G; Chang, Belinda S W; Tobe, Stephen S

2011-03-01

The FGLamide allatostatins (ASTs) are invertebrate neuropeptides which inhibit juvenile hormone biosynthesis in Dictyoptera and related orders. They also show myomodulatory activity. FGLamide AST nucleotide frequencies and codon bias were investigated with respect to possible effects on mRNA secondary structure. 367 putative FGLamide ASTs and their potential endoproteolytic cleavage sites were identified from 40 species of crustaceans, chelicerates and insects. Among these, 55% comprised only 11 amino acids. An FGLamide AST consensus was identified to be (X)(1→16)Y(S/A/N/G)FGLGKR, with a strong bias for the codons UUU encoding for Phe and AAA for Lys, which can form strong Watson-Crick pairing in all peptides analyzed. The physical distance between these codons favor a loop structure from Ser/Ala-Phe to Lys-Arg. Other loop and hairpin loops were also inferred from the codon frequencies in the N-terminal motif, and the first amino acids from the C-terminal motif, or the dibasic potential endoproteolytic cleavage site. Our results indicate that nucleotide frequencies and codon usage bias in FGLamide ASTs tend to favor mRNA folds in the codon sequence in the C-terminal active peptide core and at the dibasic potential endoproteolytic cleavage site. Copyright © 2010 Elsevier Inc. All rights reserved.
Enhanced expression of codon optimized Mycobacterium avium subsp. paratuberculosis antigens in Lactobacillus salivarius

USDA-ARS?s Scientific Manuscript database

We have previously identified the mycobacterial high G+C codon usage bias as a limiting factor in heterologous expression of MAP proteins from Lb.salivarius, and demonstrated that codon optimisation of a synthetic coding gene greatly enhances MAP protein production. Here, we effectively demonstrate ...
Recent evidence for evolution of the genetic code

NASA Technical Reports Server (NTRS)

Osawa, S.; Jukes, T. H.; Watanabe, K.; Muto, A.

1992-01-01

The genetic code, formerly thought to be frozen, is now known to be in a state of evolution. This was first shown in 1979 by Barrell et al. (G. Barrell, A. T. Bankier, and J. Drouin, Nature [London] 282:189-194, 1979), who found that the universal codons AUA (isoleucine) and UGA (stop) coded for methionine and tryptophan, respectively, in human mitochondria. Subsequent studies have shown that UGA codes for tryptophan in Mycoplasma spp. and in all nonplant mitochondria that have been examined. Universal stop codons UAA and UAG code for glutamine in ciliated protozoa (except Euplotes octacarinatus) and in a green alga, Acetabularia. E. octacarinatus uses UAA for stop and UGA for cysteine. Candida species, which are yeasts, use CUG (leucine) for serine. Other departures from the universal code, all in nonplant mitochondria, are CUN (leucine) for threonine (in yeasts), AAA (lysine) for asparagine (in platyhelminths and echinoderms), UAA (stop) for tyrosine (in planaria), and AGR (arginine) for serine (in several animal orders) and for stop (in vertebrates). We propose that the changes are typically preceded by loss of a codon from all coding sequences in an organism or organelle, often as a result of directional mutation pressure, accompanied by loss of the tRNA that translates the codon. The codon reappears later by conversion of another codon and emergence of a tRNA that translates the reappeared codon with a different assignment. Changes in release factors also contribute to these revised assignments. We also discuss the use of UGA (stop) as a selenocysteine codon and the early history of the code.
The complete mitochondrial genome of the butterfly Apatura metis (Lepidoptera: Nymphalidae).

PubMed

Zhang, Min; Nie, Xinping; Cao, Tianwen; Wang, Juping; Li, Tao; Zhang, Xiaonan; Guo, Yaping; Ma, Enbo; Zhong, Yang

2012-06-01

As an important pest in the Slender Leaved Willow (Salix alba), Apatura metis is called Freyer's purple emperor, and its mitochondrial genome is 15,236 bp long. The encoded genes for 22 tRNA genes, two ribosomal RNA (rrnL and rrnS) genes, and 13 protein-coding genes (PCGs), and a control region in the A. metis mitochondria are highly homologous to other lepidopteran species. The mitochondrial genome of A. metis is biased toward a high A + T content (A + T = 80.5%). All protein-coding genes, except for COI begins with the CGA codon as observed in other lepidopterans, start with a typical ATN initiation codon. All tRNAs show the classic clover-leaf structure, except that the dihydrouridine (DHU) arm of tRNA(Ser(AGN)) forms a simple loop. The A. metis A + T-rich region contains some conserved structures including a structure combining the motif 'ATAGA' and 19 bp poly (T) stretch, which is similar to those found in other lepidopteran mitogenomes. The phylogenetic analyses of lepidopterans based on mitogenomes sequences demonstrate that each of the six superfamilies is monophyletic, and the relationship among them is (((Noctuoidea + (Geometroidea + Bombycoidea)) + Pyraloidea) + Papilionoidea) + Tortricoidea. In Papilionoidea group, our conclusion argues that ((Lycaenidae + Pieridae) + Nymphalidae) + Papilionidae.

Framing effects under cognitive load: the role of working memory in risky decisions.

PubMed

Whitney, Paul; Rinehart, Christa A; Hinson, John M

2008-12-01

Framing effects occur in a wide range of laboratory and natural decision contexts, but the underlying processes that produce framing effects are not well understood. We explored the role of working memory (WM) in framing by manipulating WM loads during risky decisions. After starting with a hypothetical stake of money, participants were then presented a lesser amount that they could keep for certain (positive frame) or lose for certain (negative frame). They made a choice between the sure amount and a gamble in which they could either keep or lose all of the original stake. On half of the trials, the choice was made while maintaining a concurrent WM load of random letters. In both load and no-load conditions, we replicated the typical finding of risk aversion with positive frames and risk seeking with negative frames. In addition, people made fewer decisions to accept the gamble under conditions of higher cognitive load. The data are congruent with a dual-process reasoning framework in which people employ a heuristic to make satisfactory decisions with minimal effort.
Codon usage bias reveals genomic adaptations to environmental conditions in an acidophilic consortium.

PubMed

Hart, Andrew; Cortés, María Paz; Latorre, Mauricio; Martinez, Servet

2018-01-01

The analysis of codon usage bias has been widely used to characterize different communities of microorganisms. In this context, the aim of this work was to study the codon usage bias in a natural consortium of five acidophilic bacteria used for biomining. The codon usage bias of the consortium was contrasted with genes from an alternative collection of acidophilic reference strains and metagenome samples. Results indicate that acidophilic bacteria preferentially have low codon usage bias, consistent with both their capacity to live in a wide range of habitats and their slow growth rate, a characteristic probably acquired independently from their phylogenetic relationships. In addition, the analysis showed significant differences in the unique sets of genes from the autotrophic species of the consortium in relation to other acidophilic organisms, principally in genes which code for proteins involved in metal and oxidative stress resistance. The lower values of codon usage bias obtained in this unique set of genes suggest higher transcriptional adaptation to living in extreme conditions, which was probably acquired as a measure for resisting the elevated metal conditions present in the mine.
Genome-wide transcription start site profiling in biofilm-grown Burkholderia cenocepacia J2315.

PubMed

Sass, Andrea M; Van Acker, Heleen; Förstner, Konrad U; Van Nieuwerburgh, Filip; Deforce, Dieter; Vogel, Jörg; Coenye, Tom

2015-10-13

Burkholderia cenocepacia is a soil-dwelling Gram-negative Betaproteobacterium with an important role as opportunistic pathogen in humans. Infections with B. cenocepacia are very difficult to treat due to their high intrinsic resistance to most antibiotics. Biofilm formation further adds to their antibiotic resistance. B. cenocepacia harbours a large, multi-replicon genome with a high GC-content, the reference genome of strain J2315 includes 7374 annotated genes. This study aims to annotate transcription start sites and identify novel transcripts on a whole genome scale. RNA extracted from B. cenocepacia J2315 biofilms was analysed by differential RNA-sequencing and the resulting dataset compared to data derived from conventional, global RNA-sequencing. Transcription start sites were annotated and further analysed according to their position relative to annotated genes. Four thousand ten transcription start sites were mapped over the whole B. cenocepacia genome and the primary transcription start site of 2089 genes expressed in B. cenocepacia biofilms were defined. For 64 genes a start codon alternative to the annotated one was proposed. Substantial antisense transcription for 105 genes and two novel protein coding sequences were identified. The distribution of internal transcription start sites can be used to identify genomic islands in B. cenocepacia. A potassium pump strongly induced only under biofilm conditions was found and 15 non-coding small RNAs highly expressed in biofilms were discovered. Mapping transcription start sites across the B. cenocepacia genome added relevant information to the J2315 annotation. Genes and novel regulatory RNAs putatively involved in B. cenocepacia biofilm formation were identified. These findings will help in understanding regulation of B. cenocepacia biofilm formation.
Stop codon readthrough generates a C-terminally extended variant of the human vitamin D receptor with reduced calcitriol response

PubMed Central

Loughran, Gary; Jungreis, Irwin; Tzani, Ioanna; Power, Michael; Dmitriev, Ruslan I.; Ivanov, Ivaylo P.; Kellis, Manolis; Atkins, John F.

2018-01-01

Although stop codon readthrough is used extensively by viruses to expand their gene expression, verified instances of mammalian readthrough have only recently been uncovered by systems biology and comparative genomics approaches. Previously, our analysis of conserved protein coding signatures that extend beyond annotated stop codons predicted stop codon readthrough of several mammalian genes, all of which have been validated experimentally. Four mRNAs display highly efficient stop codon readthrough, and these mRNAs have a UGA stop codon immediately followed by CUAG (UGA_CUAG) that is conserved throughout vertebrates. Extending on the identification of this readthrough motif, we here investigated stop codon readthrough, using tissue culture reporter assays, for all previously untested human genes containing UGA_CUAG. The readthrough efficiency of the annotated stop codon for the sequence encoding vitamin D receptor (VDR) was 6.7%. It was the highest of those tested but all showed notable levels of readthrough. The VDR is a member of the nuclear receptor superfamily of ligand-inducible transcription factors, and it binds its major ligand, calcitriol, via its C-terminal ligand-binding domain. Readthrough of the annotated VDR mRNA results in a 67 amino acid–long C-terminal extension that generates a VDR proteoform named VDRx. VDRx may form homodimers and heterodimers with VDR but, compared with VDR, VDRx displayed a reduced transcriptional response to calcitriol even in the presence of its partner retinoid X receptor. PMID:29386352
Determinants of translation speed are randomly distributed across transcripts resulting in a universal scaling of protein synthesis times

NASA Astrophysics Data System (ADS)

Sharma, Ajeet K.; Ahmed, Nabeel; O'Brien, Edward P.

2018-02-01

Ribosome profiling experiments have found greater than 100-fold variation in ribosome density along mRNA transcripts, indicating that individual codon elongation rates can vary to a similar degree. This wide range of elongation times, coupled with differences in codon usage between transcripts, suggests that the average codon translation-rate per gene can vary widely. Yet, ribosome run-off experiments have found that the average codon translation rate for different groups of transcripts in mouse stem cells is constant at 5.6 AA/s. How these seemingly contradictory results can be reconciled is the focus of this study. Here, we combine knowledge of the molecular factors shown to influence translation speed with genomic information from Escherichia coli, Saccharomyces cerevisiae and Homo sapiens to simulate the synthesis of cytosolic proteins in these organisms. The model recapitulates a near constant average translation rate, which we demonstrate arises because the molecular determinants of translation speed are distributed nearly randomly amongst most of the transcripts. Consequently, codon translation rates are also randomly distributed and fast-translating segments of a transcript are likely to be offset by equally probable slow-translating segments, resulting in similar average elongation rates for most transcripts. We also show that the codon usage bias does not significantly affect the near random distribution of codon translation rates because only about 10 % of the total transcripts in an organism have high codon usage bias while the rest have little to no bias. Analysis of Ribo-Seq data and an in vivo fluorescent assay supports these conclusions.
Insight into pattern of codon biasness and nucleotide base usage in serotonin receptor gene family from different mammalian species.

PubMed

Dass, J Febin Prabhu; Sudandiradoss, C

2012-07-15

5-HT (5-Hydroxy-tryptamine) or serotonin receptors are found both in central and peripheral nervous system as well as in non-neuronal tissues. In the animal and human nervous system, serotonin produces various functional effects through a variety of membrane bound receptors. In this study, we focus on 5-HT receptor family from different mammals and examined the factors that account for codon and nucleotide usage variation. A total of 110 homologous coding sequences from 11 different mammalian species were analyzed using relative synonymous codon usage (RSCU), correspondence analysis (COA) and hierarchical cluster analysis together with nucleotide base usage frequency of chemically similar amino acid codons. The mean effective number of codon (ENc) value of 37.06 for 5-HT(6) shows very high codon bias within the family and may be due to high selective translational efficiency. The COA and Spearman's rank correlation reveals that the nucleotide compositional mutation bias as the major factors influencing the codon usage in serotonin receptor genes. The hierarchical cluster analysis suggests that gene function is another dominant factor that affects the codon usage bias, while species is a minor factor. Nucleotide base usage was reported using Goldman, Engelman, Stietz (GES) scale reveals the presence of high uracil (>45%) content at functionally important hydrophobic regions. Our in silico approach will certainly help for further investigations on critical inference on evolution, structure, function and gene expression aspects of 5-HT receptors family which are potential antipsychotic drug targets. Copyright © 2012 Elsevier B.V. All rights reserved.
The complete mitochondrial genome of the mudsnail Cipangopaludina cathayensis (Gastropoda: Viviparidae).

PubMed

Yang, Huirong; Zhang, Jia-En; Luo, Hao; Luo, Mingzhu; Guo, Jing; Deng, Zhixin; Zhao, Benliang

2016-05-01

We present the complete mitochondrial genome of Cipangopaludina cathayensis in this study. The mitochondrial genome is 17,157 bp in length, containing 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes. All of them are encoded on the heavy strand except 7 tRNA genes on the light strand. Overall nucleotide compositions of the light strand are 44.51% of A, 26.74% of T, 20.48% of C and 8.28% of G. All the protein-coding genes start with ATG initiation codon except ATP6 with ATA and ND4 with TTG, and 2 types of termination codons are TAA (ATP6, ND2, COX1, COX2, ATP8, ND1, ND6, Cytb, COX3, ND4) and TAG (ND4L, ND5, ND3). There are 29 intergenic spacers and 5 gene overlaps. The tandem repeat sequences are observed in COX2, tRNA(Asp), ATP6, tRNA(Cys), S-rRNA, ND1, Cytb, ND4 and COX3 genes. Gene arrangement and distribution are different from the typical vertebrates. The absence of D-loop is consistent with the Gastropoda, but at least one lengthy non-coding region is essential regulatory element for the initiation of transcription and replication.
Complete mitochondrial genome of Bactrocera arecae (Insecta: Tephritidae) by next-generation sequencing and molecular phylogeny of Dacini tribe

PubMed Central

Yong, Hoi-Sen; Song, Sze-Looi; Lim, Phaik-Eem; Chan, Kok-Gan; Chow, Wan-Loo; Eamsobhana, Praphathip

2015-01-01

The whole mitochondrial genome of the pest fruit fly Bactrocera arecae was obtained from next-generation sequencing of genomic DNA. It had a total length of 15,900 bp, consisting of 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes and a non-coding region (A + T-rich control region). The control region (952 bp) was flanked by rrnS and trnI genes. The start codons included 6 ATG, 3 ATT and 1 each of ATA, ATC, GTG and TCG. Eight TAA, two TAG, one incomplete TA and two incomplete T stop codons were represented in the protein-coding genes. The cloverleaf structure for trnS1 lacked the D-loop, and that of trnN and trnF lacked the TΨC-loop. Molecular phylogeny based on 13 protein-coding genes was concordant with 37 mitochondrial genes, with B. arecae having closest genetic affinity to B. tryoni. The subgenus Bactrocera of Dacini tribe and the Dacinae subfamily (Dacini and Ceratitidini tribes) were monophyletic. The whole mitogenome of B. arecae will serve as a useful dataset for studying the genetics, systematics and phylogenetic relationships of the many species of Bactrocera genus in particular, and tephritid fruit flies in general. PMID:26472633
A condition-specific codon optimization approach for improved heterologous gene expression in Saccharomyces cerevisiae

PubMed Central

2014-01-01

Background Heterologous gene expression is an important tool for synthetic biology that enables metabolic engineering and the production of non-natural biologics in a variety of host organisms. The translational efficiency of heterologous genes can often be improved by optimizing synonymous codon usage to better match the host organism. However, traditional approaches for optimization neglect to take into account many factors known to influence synonymous codon distributions. Results Here we define an alternative approach for codon optimization that utilizes systems level information and codon context for the condition under which heterologous genes are being expressed. Furthermore, we utilize a probabilistic algorithm to generate multiple variants of a given gene. We demonstrate improved translational efficiency using this condition-specific codon optimization approach with two heterologous genes, the fluorescent protein-encoding eGFP and the catechol 1,2-dioxygenase gene CatA, expressed in S. cerevisiae. For the latter case, optimization for stationary phase production resulted in nearly 2.9-fold improvements over commercial gene optimization algorithms. Conclusions Codon optimization is now often a standard tool for protein expression, and while a variety of tools and approaches have been developed, they do not guarantee improved performance for all hosts of applications. Here, we suggest an alternative method for condition-specific codon optimization and demonstrate its utility in Saccharomyces cerevisiae as a proof of concept. However, this technique should be applicable to any organism for which gene expression data can be generated and is thus of potential interest for a variety of applications in metabolic and cellular engineering. PMID:24636000
Reengineering a transmembrane protein to treat muscular dystrophy using exon skipping.

PubMed

Gao, Quan Q; Wyatt, Eugene; Goldstein, Jeff A; LoPresti, Peter; Castillo, Lisa M; Gazda, Alec; Petrossian, Natalie; Earley, Judy U; Hadhazy, Michele; Barefield, David Y; Demonbreun, Alexis R; Bönnemann, Carsten; Wolf, Matthew; McNally, Elizabeth M

2015-11-02

Exon skipping uses antisense oligonucleotides as a treatment for genetic diseases. The antisense oligonucleotides used for exon skipping are designed to bypass premature stop codons in the target RNA and restore reading frame disruption. Exon skipping is currently being tested in humans with dystrophin gene mutations who have Duchenne muscular dystrophy. For Duchenne muscular dystrophy, the rationale for exon skipping derived from observations in patients with naturally occurring dystrophin gene mutations that generated internally deleted but partially functional dystrophin proteins. We have now expanded the potential for exon skipping by testing whether an internal, in-frame truncation of a transmembrane protein γ-sarcoglycan is functional. We generated an internally truncated γ-sarcoglycan protein that we have termed Mini-Gamma by deleting a large portion of the extracellular domain. Mini-Gamma provided functional and pathological benefits to correct the loss of γ-sarcoglycan in a Drosophila model, in heterologous cell expression studies, and in transgenic mice lacking γ-sarcoglycan. We generated a cellular model of human muscle disease and showed that multiple exon skipping could be induced in RNA that encodes a mutant human γ-sarcoglycan. Since Mini-Gamma represents removal of 4 of the 7 coding exons in γ-sarcoglycan, this approach provides a viable strategy to treat the majority of patients with γ-sarcoglycan gene mutations.
Reengineering a transmembrane protein to treat muscular dystrophy using exon skipping

PubMed Central

Gao, Quan Q.; Wyatt, Eugene; Goldstein, Jeff A.; LoPresti, Peter; Castillo, Lisa M.; Gazda, Alec; Petrossian, Natalie; Earley, Judy U.; Hadhazy, Michele; Barefield, David Y.; Demonbreun, Alexis R.; Bönnemann, Carsten; Wolf, Matthew; McNally, Elizabeth M.

2015-01-01

Exon skipping uses antisense oligonucleotides as a treatment for genetic diseases. The antisense oligonucleotides used for exon skipping are designed to bypass premature stop codons in the target RNA and restore reading frame disruption. Exon skipping is currently being tested in humans with dystrophin gene mutations who have Duchenne muscular dystrophy. For Duchenne muscular dystrophy, the rationale for exon skipping derived from observations in patients with naturally occurring dystrophin gene mutations that generated internally deleted but partially functional dystrophin proteins. We have now expanded the potential for exon skipping by testing whether an internal, in-frame truncation of a transmembrane protein γ-sarcoglycan is functional. We generated an internally truncated γ-sarcoglycan protein that we have termed Mini-Gamma by deleting a large portion of the extracellular domain. Mini-Gamma provided functional and pathological benefits to correct the loss of γ-sarcoglycan in a Drosophila model, in heterologous cell expression studies, and in transgenic mice lacking γ-sarcoglycan. We generated a cellular model of human muscle disease and showed that multiple exon skipping could be induced in RNA that encodes a mutant human γ-sarcoglycan. Since Mini-Gamma represents removal of 4 of the 7 coding exons in γ-sarcoglycan, this approach provides a viable strategy to treat the majority of patients with γ-sarcoglycan gene mutations. PMID:26457733
Lack of correlation between p53 codon 72 polymorphism and anal cancer risk

PubMed Central

Contu, Simone S; Agnes, Grasiela; Damin, Andrea P; Contu, Paulo C; Rosito, Mário A; Alexandre, Claudio O; Damin, Daniel C

2009-01-01

AIM: To investigate the potential role of p53 codon 72 polymorphism as a risk factor for development of anal cancer. METHODS: Thirty-two patients with invasive anal carcinoma and 103 healthy blood donors were included in the study. p53 codon 72 polymorphism was analyzed in blood samples through polymerase chain reaction-restriction fragment length polymorphism and DNA sequencing. RESULTS: The relative frequency of each allele was 0.60 for Arg and 0.40 for Pro in patients with anal cancer, and 0.61 for Arg and 0.39 for Pro in normal controls. No significant differences in distribution of the codon 72 genotypes between patients and controls were found. CONCLUSION: These results do not support a role for the p53 codon 72 polymorphism in anal carcinogenesis. PMID:19777616
Energetics of codon-anticodon recognition on the small ribosomal subunit.

PubMed

Almlöf, Martin; Andér, Martin; Aqvist, Johan

2007-01-09

Recent crystal structures of the small ribosomal subunit have made it possible to examine the detailed energetics of codon recognition on the ribosome by computational methods. The binding of cognate and near-cognate anticodon stem loops to the ribosome decoding center, with mRNA containing the Phe UUU and UUC codons, are analyzed here using explicit solvent molecular dynamics simulations together with the linear interaction energy (LIE) method. The calculated binding free energies are in excellent agreement with experimental binding constants and reproduce the relative effects of mismatches in the first and second codon position versus a mismatch at the wobble position. The simulations further predict that the Leu2 anticodon stem loop is about 10 times more stable than the Ser stem loop in complex with the Phe UUU codon. It is also found that the ribosome significantly enhances the intrinsic stability differences of codon-anticodon complexes in aqueous solution. Structural analysis of the simulations confirms the previously suggested importance of the universally conserved nucleotides A1492, A1493, and G530 in the decoding process.
Random codon re-encoding induces stable reduction of replicative fitness of Chikungunya virus in primate and mosquito cells.

PubMed

Nougairede, Antoine; De Fabritus, Lauriane; Aubry, Fabien; Gould, Ernest A; Holmes, Edward C; de Lamballerie, Xavier

2013-02-01

Large-scale codon re-encoding represents a powerful method of attenuating viruses to generate safe and cost-effective vaccines. In contrast to specific approaches of codon re-encoding which modify genome-scale properties, we evaluated the effects of random codon re-encoding on the re-emerging human pathogen Chikungunya virus (CHIKV), and assessed the stability of the resultant viruses during serial in cellulo passage. Using different combinations of three 1.4 kb randomly re-encoded regions located throughout the CHIKV genome six codon re-encoded viruses were obtained. Introducing a large number of slightly deleterious synonymous mutations reduced the replicative fitness of CHIKV in both primate and arthropod cells, demonstrating the impact of synonymous mutations on fitness. Decrease of replicative fitness correlated with the extent of re-encoding, an observation that may assist in the modulation of viral attenuation. The wild-type and two re-encoded viruses were passaged 50 times either in primate or insect cells, or in each cell line alternately. These viruses were analyzed using detailed fitness assays, complete genome sequences and the analysis of intra-population genetic diversity. The response to codon re-encoding and adaptation to culture conditions occurred simultaneously, resulting in significant replicative fitness increases for both re-encoded and wild type viruses. Importantly, however, the most re-encoded virus failed to recover its replicative fitness. Evolution of these viruses in response to codon re-encoding was largely characterized by the emergence of both synonymous and non-synonymous mutations, sometimes located in genomic regions other than those involving re-encoding, and multiple convergent and compensatory mutations. However, there was a striking absence of codon reversion (<0.4%). Finally, multiple mutations were rapidly fixed in primate cells, whereas mosquito cells acted as a brake on evolution. In conclusion, random codon re-encoding provides important information on the evolution and genetic stability of CHIKV viruses and could be exploited to develop a safe, live attenuated CHIKV vaccine.
Generation of obese rat model by transcription activator-like effector nucleases targeting the leptin receptor gene.

PubMed

Chen, Yuting; Lu, Wenqing; Gao, Na; Long, Yi; Shao, Yanjiao; Liu, Meizhen; Chen, Huaqing; Ye, Shixin; Ma, Xueyun; Liu, Mingyao; Li, Dali

2017-02-01

The laboratory rat is a valuable mammalian model organism for basic research and drug discovery. Here we demonstrate an efficient methodology by applying transcription activator-like effector nucleases (TALENs) technology to generate Leptin receptor (Lepr) knockout rats on the Sprague Dawley (SD) genetic background. Through direct injection of in vitro transcribed mRNA of TALEN pairs into SD rat zygotes, somatic mutations were induced in two of three resulting pups. One of the founders carrying bi-allelic mutation exhibited early onset of obesity and infertility. The other founder carried a chimeric mutation which was efficiently transmitted to the progenies. Through phenotyping of the resulting three lines of rats bearing distinct mutations in the Lepr locus, we found that the strains with a frame-shifted or premature stop codon mutation led to obesity and metabolic disorders. However, no obvious defect was observed in a strain with an in-frame 57 bp deletion in the extracellular domain of Lepr. This suggests the deleted amino acids do not significantly affect Lepr structure and function. This is the first report of generating the Lepr mutant obese rat model in SD strain through a reverse genetic approach. This suggests that TALEN is an efficient and powerful gene editing technology for the generation of disease models.
Hierarchical video summarization

NASA Astrophysics Data System (ADS)

Ratakonda, Krishna; Sezan, M. Ibrahim; Crinon, Regis J.

1998-12-01

We address the problem of key-frame summarization of vide in the absence of any a priori information about its content. This is a common problem that is encountered in home videos. We propose a hierarchical key-frame summarization algorithm where a coarse-to-fine key-frame summary is generated. A hierarchical key-frame summary facilitates multi-level browsing where the user can quickly discover the content of the video by accessing its coarsest but most compact summary and then view a desired segment of the video with increasingly more detail. At the finest level, the summary is generated on the basis of color features of video frames, using an extension of a recently proposed key-frame extraction algorithm. The finest level key-frames are recursively clustered using a novel pairwise K-means clustering approach with temporal consecutiveness constraint. We also address summarization of MPEG-2 compressed video without fully decoding the bitstream. We also propose efficient mechanisms that facilitate decoding the video when the hierarchical summary is utilized in browsing and playback of video segments starting at selected key-frames.
Idiosyncratic recognition of UUG/UUA codons by modified nucleoside 5-taurinomethyluridine, τm5U present at 'wobble' position in anticodon loop of tRNALeu: A molecular modeling approach.

PubMed

Kamble, Asmita S; Fandilolu, Prayagraj M; Sambhare, Susmit B; Sonawane, Kailas D

2017-01-01

Lack of naturally occurring modified nucleoside 5-taurinomethyluridine (τm5U) at the 'wobble' 34th position in tRNALeu causes mitochondrial myopathy, encephalopathy, lactic acidosis and stroke-like episodes (MELAS). The τm5U34 specifically recognizes UUG and UUA codons. Structural consequences of τm5U34 to read cognate codons have not been studied so far in detail at the atomic level. Hence, 50ns multiple molecular dynamics (MD) simulations of various anticodon stem loop (ASL) models of tRNALeu in presence and absence of τm5U34 along with UUG and UUA codons were performed to explore the dynamic behaviour of τm5U34 during codon recognition process. The MD simulation results revealed that τm5U34 recognizes G/A ending codons by 'wobble' as well as a novel 'single' hydrogen bonding interactions. RMSD and RMSF values indicate the comparative stability of the ASL models containing τm5U34 modification over the other models, lacking τm5U34. Another MD simulation study of 55S mammalian mitochondrial rRNA with tRNALeu showed crucial interactions between the A-site residues, A918, A919, G256 and codon-anticodon bases. Thus, these results could improve our understanding about the decoding efficiency of human mt tRNALeu with τm5U34 to recognize UUG and UUA codons.
Idiosyncratic recognition of UUG/UUA codons by modified nucleoside 5-taurinomethyluridine, τm5U present at ‘wobble’ position in anticodon loop of tRNALeu: A molecular modeling approach

PubMed Central

Kamble, Asmita S.; Fandilolu, Prayagraj M.; Sambhare, Susmit B.; Sonawane, Kailas D.

2017-01-01

Lack of naturally occurring modified nucleoside 5-taurinomethyluridine (τm5U) at the ‘wobble’ 34th position in tRNALeu causes mitochondrial myopathy, encephalopathy, lactic acidosis and stroke-like episodes (MELAS). The τm5U34 specifically recognizes UUG and UUA codons. Structural consequences of τm5U34 to read cognate codons have not been studied so far in detail at the atomic level. Hence, 50ns multiple molecular dynamics (MD) simulations of various anticodon stem loop (ASL) models of tRNALeu in presence and absence of τm5U34 along with UUG and UUA codons were performed to explore the dynamic behaviour of τm5U34 during codon recognition process. The MD simulation results revealed that τm5U34 recognizes G/A ending codons by ‘wobble’ as well as a novel ‘single’ hydrogen bonding interactions. RMSD and RMSF values indicate the comparative stability of the ASL models containing τm5U34 modification over the other models, lacking τm5U34. Another MD simulation study of 55S mammalian mitochondrial rRNA with tRNALeu showed crucial interactions between the A-site residues, A918, A919, G256 and codon-anticodon bases. Thus, these results could improve our understanding about the decoding efficiency of human mt tRNALeu with τm5U34 to recognize UUG and UUA codons. PMID:28453549
Reducing codon redundancy and screening effort of combinatorial protein libraries created by saturation mutagenesis.

PubMed

Kille, Sabrina; Acevedo-Rocha, Carlos G; Parra, Loreto P; Zhang, Zhi-Gang; Opperman, Diederik J; Reetz, Manfred T; Acevedo, Juan Pablo

2013-02-15

Saturation mutagenesis probes define sections of the vast protein sequence space. However, even if randomization is limited this way, the combinatorial numbers problem is severe. Because diversity is created at the codon level, codon redundancy is a crucial factor determining the necessary effort for library screening. Additionally, due to the probabilistic nature of the sampling process, oversampling is required to ensure library completeness as well as a high probability to encounter all unique variants. Our trick employs a special mixture of three primers, creating a degeneracy of 22 unique codons coding for the 20 canonical amino acids. Therefore, codon redundancy and subsequent screening effort is significantly reduced, and a balanced distribution of codon per amino acid is achieved, as demonstrated exemplarily for a library of cyclohexanone monooxygenase. We show that this strategy is suitable for any saturation mutagenesis methodology to generate less-redundant libraries.
The first two mitochondrial genomes from Taeniopterygidae (Insecta: Plecoptera): Structural features and phylogenetic implications.

PubMed

Chen, Zhi-Teng; Du, Yu-Zhou

2018-05-01

The complete mitochondrial genomes (mitogenomes) of Taeniopteryx ugola and Doddsia occidentalis (Plecoptera: Taeniopterygidae) were firstly sequenced from the family Taeniopterygidae. The 15,353-bp long mitogenome of T. ugola and the 16,020-bp long mitogenome of D. occidentalis each contained 37 genes including 13 protein-coding genes (PCGs), 22 transfer RNA genes (tRNAs), two ribosomal RNA genes (rRNAs) and a control region (CR). The mitochondrial gene arrangement of the two taeniopterygids and other stoneflies was identical with the putative ancestral mitogenome of Drosophila yakuba. Most PCGs used standard ATN start codons and TAN termination codons. Twenty-one of the 22 tRNAs in each mitogenome could fold into the cloverleaf secondary structures, while the dihydrouridine (DHU) arm of trnSer (AGN) was reduced or absent. Stem-loop (SL) structures, poly-T stretch, poly-[AT] n stretch and tandem repeats were found in the CRs of the two mitogenomes. The phylogenetic analyses using Bayesian inference (BI) and maximum likelihood methods (ML) generated identical results, both supporting the monophyly of all stonefly families and the two infraorders, Systellognatha and Euholognatha. Taeniopterygidae was grouped with another two families from Euholognatha. The relationships within Plecoptera were recovered as (((Perlidae+Peltoperlidae)+((Pteronarcyidae+Chloroperlidae)+Styloperlidae))+((Capniidae+Taeniopterygidae)+Nemouridae))+Gripopterygidae. Copyright © 2017 Elsevier B.V. All rights reserved.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.