Pek, Han Bin; Klement, Maximilian; Ang, Kok Siong; Chung, Bevan Kai-Sheng; Ow, Dave Siak-Wei; Lee, Dong-Yup
2015-01-01
Various isoforms of invertases from prokaryotes, fungi, and higher plants has been expressed in Escherichia coli, and codon optimisation is a widely-adopted strategy for improvement of heterologous enzyme expression. Successful synthetic gene design for recombinant protein expression can be done by matching its translational elongation rate against heterologous host organisms via codon optimization. Amongst the various design parameters considered for the gene synthesis, codon context bias has been relatively overlooked compared to individual codon usage which is commonly adopted in most of codon optimization tools. In addition, matching the rates of transcription and translation based on secondary structure may lead to enhanced protein folding. In this study, we evaluated codon context fitness as design criterion for improving the expression of thermostable invertase from Thermotoga maritima in Escherichia coli and explored the relevance of secondary structure regions for folding and expression. We designed three coding sequences by using (1) a commercial vendor optimized gene algorithm, (2) codon context for the whole gene, and (3) codon context based on the secondary structure regions. Then, the codon optimized sequences were transformed and expressed in E. coli. From the resultant enzyme activities and protein yield data, codon context fitness proved to have the highest activity as compared to the wild-type control and other criteria while secondary structure-based strategy is comparable to the control. Codon context bias was shown to be a relevant parameter for enhancing enzyme production in Escherichia coli by codon optimization. Thus, we can effectively design synthetic genes within heterologous host organisms using this criterion. Copyright © 2015 Elsevier Inc. All rights reserved.
USDA-ARS?s Scientific Manuscript database
We have previously identified the mycobacterial high G+C codon usage bias as a limiting factor in heterologous expression of MAP proteins from Lb.salivarius, and demonstrated that codon optimisation of a synthetic coding gene greatly enhances MAP protein production. Here, we effectively demonstrate ...
Codon Optimization to Enhance Expression Yields Insights into Chloroplast Translation1[OPEN
Chan, Hui-Ting; Williams-Carrier, Rosalind; Barkan, Alice
2016-01-01
Codon optimization based on psbA genes from 133 plant species eliminated 105 (human clotting factor VIII heavy chain [FVIII HC]) and 59 (polio VIRAL CAPSID PROTEIN1 [VP1]) rare codons; replacement with only the most highly preferred codons decreased transgene expression (77- to 111-fold) when compared with the codon usage hierarchy of the psbA genes. Targeted proteomic quantification by parallel reaction monitoring analysis showed 4.9- to 7.1-fold or 22.5- to 28.1-fold increase in FVIII or VP1 codon-optimized genes when normalized with stable isotope-labeled standard peptides (or housekeeping protein peptides), but quantitation using western blots showed 6.3- to 8-fold or 91- to 125-fold increase of transgene expression from the same batch of materials, due to limitations in quantitative protein transfer, denaturation, solubility, or stability. Parallel reaction monitoring, to our knowledge validated here for the first time for in planta quantitation of biopharmaceuticals, is especially useful for insoluble or multimeric proteins required for oral drug delivery. Northern blots confirmed that the increase of codon-optimized protein synthesis is at the translational level rather than any impact on transcript abundance. Ribosome footprints did not increase proportionately with VP1 translation or even decreased after FVIII codon optimization but is useful in diagnosing additional rate-limiting steps. A major ribosome pause at CTC leucine codons in the native gene of FVIII HC was eliminated upon codon optimization. Ribosome stalls observed at clusters of serine codons in the codon-optimized VP1 gene provide an opportunity for further optimization. In addition to increasing our understanding of chloroplast translation, these new tools should help to advance this concept toward human clinical studies. PMID:27465114
Balanced Codon Usage Optimizes Eukaryotic Translational Efficiency
Qian, Wenfeng; Yang, Jian-Rong; Pearson, Nathaniel M.; Maclean, Calum; Zhang, Jianzhi
2012-01-01
Cellular efficiency in protein translation is an important fitness determinant in rapidly growing organisms. It is widely believed that synonymous codons are translated with unequal speeds and that translational efficiency is maximized by the exclusive use of rapidly translated codons. Here we estimate the in vivo translational speeds of all sense codons from the budding yeast Saccharomyces cerevisiae. Surprisingly, preferentially used codons are not translated faster than unpreferred ones. We hypothesize that this phenomenon is a result of codon usage in proportion to cognate tRNA concentrations, the optimal strategy in enhancing translational efficiency under tRNA shortage. Our predicted codon–tRNA balance is indeed observed from all model eukaryotes examined, and its impact on translational efficiency is further validated experimentally. Our study reveals a previously unsuspected mechanism by which unequal codon usage increases translational efficiency, demonstrates widespread natural selection for translational efficiency, and offers new strategies to improve synthetic biology. PMID:22479199
Shin, Young C.; Desrosiers, Ronald C.
2011-01-01
Open reading frame 57 (ORF57) of gamma-2 herpesviruses is a key regulator of viral gene expression. It has been reported to enhance the expression of viral genes by transcriptional, posttranscriptional, or translational activation mechanisms. Previously we have shown that the expression of gH and gL of rhesus monkey rhadinovirus (RRV), a close relative of the human Kaposi's sarcoma-associated herpesvirus (KSHV), could be dramatically rescued by codon optimization as well as by ORF57 coexpression (J. P. Bilello, J. S. Morgan, and R. C. Desrosiers, J. Virol. 82:7231–7237, 2008). We show here that ORF57 coexpression and codon optimization had similar effects, except that the rescue of expression by codon optimization was temporally delayed relative to that of ORF57 coexpression. The transfection of gL mRNA directly into cells with or without ORF57 coexpression and with or without codon optimization recapitulated the effects of these modes of induction on transfected DNA. These findings suggested an important role for the enhancement of mRNA stability and/or the translation of mRNA for these very different modes of induced expression. This conclusion was confirmed by several different measures of gH and gL mRNA stability and accumulation with or without ORF57 coexpression and with or without codon optimization. Our results indicate that RRV gH and gL expression is severely limited by the stability of the mRNA and that ORF57 coexpression and codon optimization independently induce gH and gL expression principally by allowing accumulation and translation of these mRNAs. PMID:21613403
Leroch, Michaela; Mernke, Dennis; Koppenhoefer, Dieter; Schneider, Prisca; Mosbach, Andreas; Doehlemann, Gunther; Hahn, Matthias
2011-05-01
The green fluorescent protein (GFP) and its variants have been widely used in modern biology as reporters that allow a variety of live-cell imaging techniques. So far, GFP has rarely been used in the gray mold fungus Botrytis cinerea because of low fluorescence intensity. The codon usage of B. cinerea genes strongly deviates from that of commonly used GFP-encoding genes and reveals a lower GC content than other fungi. In this study, we report the development and use of a codon-optimized version of the B. cinerea enhanced GFP (eGFP)-encoding gene (Bcgfp) for improved expression in B. cinerea. Both the codon optimization and, to a smaller extent, the insertion of an intron resulted in higher mRNA levels and increased fluorescence. Bcgfp was used for localization of nuclei in germinating spores and for visualizing host penetration. We further demonstrate the use of promoter-Bcgfp fusions for quantitative evaluation of various toxic compounds as inducers of the atrB gene encoding an ABC-type drug efflux transporter of B. cinerea. In addition, a codon-optimized mCherry-encoding gene was constructed which yielded bright red fluorescence in B. cinerea.
An expanded genetic code in mammalian cells with a functional quadruplet codon.
Niu, Wei; Schultz, Peter G; Guo, Jiantao
2013-07-19
We have utilized in vitro evolution to identify tRNA variants with significantly enhanced activity for the incorporation of unnatural amino acids into proteins in response to a quadruplet codon in both bacterial and mammalian cells. This approach will facilitate the creation of an optimized and standardized system for the genetic incorporation of unnatural amino acids using quadruplet codons, which will allow the biosynthesis of biopolymers that contain multiple unnatural building blocks.
Villada, Juan C.; Brustolini, Otávio José Bernardes
2017-01-01
Abstract Gene codon optimization may be impaired by the misinterpretation of frequency and optimality of codons. Although recent studies have revealed the effects of codon usage bias (CUB) on protein biosynthesis, an integrated perspective of the biological role of individual codons remains unknown. Unlike other previous studies, we show, through an integrated framework that attributes of codons such as frequency, optimality and positional dependency should be combined to unveil individual codon contribution for protein biosynthesis. We designed a codon quantification method for assessing CUB as a function of position within genes with a novel constraint: the relativity of position-dependent codon usage shaped by coding sequence length. Thus, we propose a new way of identifying the enrichment, depletion and non-uniform positional distribution of codons in different regions of yeast genes. We clustered codons that shared attributes of frequency and optimality. The cluster of non-optimal codons with rare occurrence displayed two remarkable characteristics: higher codon decoding time than frequent–non-optimal cluster and enrichment at the 5′-end region, where optimal codons with the highest frequency are depleted. Interestingly, frequent codons with non-optimal adaptation to tRNAs are uniformly distributed in the Saccharomyces cerevisiae genes, suggesting their determinant role as a speed regulator in protein elongation. PMID:28449100
Villada, Juan C; Brustolini, Otávio José Bernardes; Batista da Silveira, Wendel
2017-08-01
Gene codon optimization may be impaired by the misinterpretation of frequency and optimality of codons. Although recent studies have revealed the effects of codon usage bias (CUB) on protein biosynthesis, an integrated perspective of the biological role of individual codons remains unknown. Unlike other previous studies, we show, through an integrated framework that attributes of codons such as frequency, optimality and positional dependency should be combined to unveil individual codon contribution for protein biosynthesis. We designed a codon quantification method for assessing CUB as a function of position within genes with a novel constraint: the relativity of position-dependent codon usage shaped by coding sequence length. Thus, we propose a new way of identifying the enrichment, depletion and non-uniform positional distribution of codons in different regions of yeast genes. We clustered codons that shared attributes of frequency and optimality. The cluster of non-optimal codons with rare occurrence displayed two remarkable characteristics: higher codon decoding time than frequent-non-optimal cluster and enrichment at the 5'-end region, where optimal codons with the highest frequency are depleted. Interestingly, frequent codons with non-optimal adaptation to tRNAs are uniformly distributed in the Saccharomyces cerevisiae genes, suggesting their determinant role as a speed regulator in protein elongation. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Generate Optimized Genetic Rhythm for Enzyme Expression in Non-native systems
DOE Office of Scientific and Technical Information (OSTI.GOV)
2016-11-03
Most amino acids are represented by more than one codon, resulting in redundancy in the genetic code. Silent codon substitutions that do not alter the amino acid sequence still have an effect on protein expression. We have developed an algorithm, GoGREEN, to enhance the expression of foreign proteins in a host organism. GoGREEN selects codons according to frequency patterns seen in the gene of interest using the codon usage table from the host organism. GoGREEN is also designed to accommodate gaps in the sequence.This software takes for input (1) the aligned protein sequences for genes the user wishes to express,more » (2) the codon usage table for the host organism, (3) and the DNA sequence for the target protein found in the host organism. The program will select codons based on codon usage patterns for the target DNA sequence. The program will also select codons for “gaps” found in the aligned protein sequences using the codon usage table from the host organism.« less
A codon-optimized green fluorescent protein for live cell imaging in Zymoseptoria tritici☆
Kilaru, S.; Schuster, M.; Studholme, D.; Soanes, D.; Lin, C.; Talbot, N.J.; Steinberg, G.
2015-01-01
Fluorescent proteins (FPs) are powerful tools to investigate intracellular dynamics and protein localization. Cytoplasmic expression of FPs in fungal pathogens allows greater insight into invasion strategies and the host-pathogen interaction. Detection of their fluorescent signal depends on the right combination of microscopic setup and signal brightness. Slow rates of photo-bleaching are pivotal for in vivo observation of FPs over longer periods of time. Here, we test green-fluorescent proteins, including Aequorea coerulescens GFP (AcGFP), enhanced GFP (eGFP) from Aequorea victoria and a novel Zymoseptoria tritici codon-optimized eGFP (ZtGFP), for their usage in conventional and laser-enhanced epi-fluorescence, and confocal laser-scanning microscopy. We show that eGFP, expressed cytoplasmically in Z. tritici, is significantly brighter and more photo-stable than AcGFP. The codon-optimized ZtGFP performed even better than eGFP, showing significantly slower bleaching and a 20–30% further increase in signal intensity. Heterologous expression of all GFP variants did not affect pathogenicity of Z. tritici. Our data establish ZtGFP as the GFP of choice to investigate intracellular protein dynamics in Z. tritici, but also infection stages of this wheat pathogen inside host tissue. PMID:26092799
Development of a codon optimization strategy using the efor RED reporter gene as a test case
NASA Astrophysics Data System (ADS)
Yip, Chee-Hoo; Yarkoni, Orr; Ajioka, James; Wan, Kiew-Lian; Nathan, Sheila
2018-04-01
Synthetic biology is a platform that enables high-level synthesis of useful products such as pharmaceutically related drugs, bioplastics and green fuels from synthetic DNA constructs. Large-scale expression of these products can be achieved in an industrial compliant host such as Escherichia coli. To maximise the production of recombinant proteins in a heterologous host, the genes of interest are usually codon optimized based on the codon usage of the host. However, the bioinformatics freeware available for standard codon optimization might not be ideal in determining the best sequence for the synthesis of synthetic DNA. Synthesis of incorrect sequences can prove to be a costly error and to avoid this, a codon optimization strategy was developed based on the E. coli codon usage using the efor RED reporter gene as a test case. This strategy replaces codons encoding for serine, leucine, proline and threonine with the most frequently used codons in E. coli. Furthermore, codons encoding for valine and glycine are substituted with the second highly used codons in E. coli. Both the optimized and original efor RED genes were ligated to the pJS209 plasmid backbone using Gibson Assembly and the recombinant DNAs were transformed into E. coli E. cloni 10G strain. The fluorescence intensity per cell density of the optimized sequence was improved by 20% compared to the original sequence. Hence, the developed codon optimization strategy is proposed when designing an optimal sequence for heterologous protein production in E. coli.
Sun, Yu; Tamarit, Daniel
2017-01-01
Abstract The major codon preference model suggests that codons read by tRNAs in high concentrations are preferentially utilized in highly expressed genes. However, the identity of the optimal codons differs between species although the forces driving such changes are poorly understood. We suggest that these questions can be tackled by placing codon usage studies in a phylogenetic framework and that bacterial genomes with extreme nucleotide composition biases provide informative model systems. Switches in the background substitution biases from GC to AT have occurred in Gardnerella vaginalis (GC = 32%), and from AT to GC in Lactobacillus delbrueckii (GC = 62%) and Lactobacillus fermentum (GC = 63%). We show that despite the large effects on codon usage patterns by these switches, all three species evolve under selection on synonymous sites. In G. vaginalis, the dramatic codon frequency changes coincide with shifts of optimal codons. In contrast, the optimal codons have not shifted in the two Lactobacillus genomes despite an increased fraction of GC-ending codons. We suggest that all three species are in different phases of an on-going shift of optimal codons, and attribute the difference to a stronger background substitution bias and/or longer time since the switch in G. vaginalis. We show that comparative and correlative methods for optimal codon identification yield conflicting results for genomes in flux and discuss possible reasons for the mispredictions. We conclude that switches in the direction of the background substitution biases can drive major shifts in codon preference patterns even under sustained selection on synonymous codon sites. PMID:27540085
Codon optimization underpins generalist parasitism in fungi
Badet, Thomas; Peyraud, Remi; Mbengue, Malick; Navaud, Olivier; Derbyshire, Mark; Oliver, Richard P; Barbacci, Adelin; Raffaele, Sylvain
2017-01-01
The range of hosts that parasites can infect is a key determinant of the emergence and spread of disease. Yet, the impact of host range variation on the evolution of parasite genomes remains unknown. Here, we show that codon optimization underlies genome adaptation in broad host range parasites. We found that the longer proteins encoded by broad host range fungi likely increase natural selection on codon optimization in these species. Accordingly, codon optimization correlates with host range across the fungal kingdom. At the species level, biased patterns of synonymous substitutions underpin increased codon optimization in a generalist but not a specialist fungal pathogen. Virulence genes were consistently enriched in highly codon-optimized genes of generalist but not specialist species. We conclude that codon optimization is related to the capacity of parasites to colonize multiple hosts. Our results link genome evolution and translational regulation to the long-term persistence of generalist parasitism. DOI: http://dx.doi.org/10.7554/eLife.22472.001 PMID:28157073
Non-uniqueness of factors constraint on the codon usage in Bombyx mori.
Jia, Xian; Liu, Shuyu; Zheng, Hao; Li, Bo; Qi, Qi; Wei, Lei; Zhao, Taiyi; He, Jian; Sun, Jingchen
2015-05-06
The analysis of codon usage is a good way to understand the genetic and evolutionary characteristics of an organism. However, there are only a few reports related with the codon usage of the domesticated silkworm, Bombyx mori (B. mori). Hence, the codon usage of B. mori was analyzed here to reveal the constraint factors and it could be helpful to improve the bioreactor based on B. mori. A total of 1,097 annotated mRNA sequences from B. mori were analyzed, revealing there is only a weak codon bias. It also shows that the gene expression level is related to the GC content, and the amino acids with higher general average hydropathicity (GRAVY) and aromaticity (Aromo). And the genes on the primary axis are strongly positively correlated with the GC content, and GC3s. Meanwhile, the effective number of codons (ENc) is strongly correlated with codon adaptation index (CAI), gene length, and Aromo values. However, the ENc values are correlated with the second axis, which indicates that the codon usage in B. mori is affected by not only mutation pressure and natural selection, but also nucleotide composition and the gene expression level. It is also associated with Aromo values, and gene length. Additionally, B. mori has a greater relative discrepancy in codon preferences with Drosophila melanogaster (D. melanogaster) or Saccharomyces cerevisiae (S. cerevisiae) than with Arabidopsis thaliana (A. thaliana), Escherichia coli (E. coli), or Caenorhabditis elegans (C. elegans). The codon usage bias in B. mori is relatively weak, and many influence factors are found here, such as nucleotide composition, mutation pressure, natural selection, and expression level. Additionally, it is also associated with Aromo values, and gene length. Among them, natural selection might play a major role. Moreover, the "optimal codons" of B. mori are all encoded by G and C, which provides useful information for enhancing the gene expression in B. mori through codon optimization.
NASA Astrophysics Data System (ADS)
Villanueva, Eneko; Martí-Solano, Maria; Fillat, Cristina
2016-06-01
Codon usage adaptation of lytic viruses to their hosts is determinant for viral fitness. In this work, we analyzed the codon usage of adenoviral proteins by principal component analysis and assessed their codon adaptation to the host. We observed a general clustering of adenoviral proteins according to their function. However, there was a significant variation in the codon preference between the host-interacting fiber protein and the rest of structural late phase proteins, with a non-optimal codon usage of the fiber. To understand the impact of codon bias in the fiber, we optimized the Adenovirus-5 fiber to the codon usage of the hexon structural protein. The optimized fiber displayed increased expression in a non-viral context. However, infection with adenoviruses containing the optimized fiber resulted in decreased expression of the fiber and of wild-type structural proteins. Consequently, this led to a drastic reduction in viral release. The insertion of an exogenous optimized protein as a late gene in the adenovirus with the optimized fiber further interfered with viral fitness. These results highlight the importance of balancing codon usage in viral proteins to adequately exploit cellular resources for efficient infection and open new opportunities to regulate viral fitness for virotherapy and vaccine development.
Evolution of Synonymous Codon Usage in Neurospora tetrasperma and Neurospora discreta
Whittle, C. A.; Sun, Y.; Johannesson, H.
2011-01-01
Neurospora comprises a primary model system for the study of fungal genetics and biology. In spite of this, little is known about genome evolution in Neurospora. For example, the evolution of synonymous codon usage is largely unknown in this genus. In the present investigation, we conducted a comprehensive analysis of synonymous codon usage and its relationship to gene expression and gene length (GL) in Neurospora tetrasperma and Neurospora discreta. For our analysis, we examined codon usage among 2,079 genes per organism and assessed gene expression using large-scale expressed sequenced tag (EST) data sets (279,323 and 453,559 ESTs for N. tetrasperma and N. discreta, respectively). Data on relative synonymous codon usage revealed 24 codons (and two putative codons) that are more frequently used in genes with high than with low expression and thus were defined as optimal codons. Although codon-usage bias was highly correlated with gene expression, it was independent of selectively neutral base composition (introns); thus demonstrating that translational selection drives synonymous codon usage in these genomes. We also report that GL (coding sequences [CDS]) was inversely associated with optimal codon usage at each gene expression level, with highly expressed short genes having the greatest frequency of optimal codons. Optimal codon frequency was moderately higher in N. tetrasperma than in N. discreta, which might be due to variation in selective pressures and/or mating systems. PMID:21402862
Evolutionary conservation of codon optimality reveals hidden signatures of cotranslational folding.
Pechmann, Sebastian; Frydman, Judith
2013-02-01
The choice of codons can influence local translation kinetics during protein synthesis. Whether codon preference is linked to cotranslational regulation of polypeptide folding remains unclear. Here, we derive a revised translational efficiency scale that incorporates the competition between tRNA supply and demand. Applying this scale to ten closely related yeast species, we uncover the evolutionary conservation of codon optimality in eukaryotes. This analysis reveals universal patterns of conserved optimal and nonoptimal codons, often in clusters, which associate with the secondary structure of the translated polypeptides independent of the levels of expression. Our analysis suggests an evolved function for codon optimality in regulating the rhythm of elongation to facilitate cotranslational polypeptide folding, beyond its previously proposed role of adapting to the cost of expression. These findings establish how mRNA sequences are generally under selection to optimize the cotranslational folding of corresponding polypeptides.
Lorenz, Felix K. M.; Wilde, Susanne; Voigt, Katrin; Kieback, Elisa; Mosetter, Barbara; Schendel, Dolores J.; Uckert, Wolfgang
2015-01-01
Codon optimization of nucleotide sequences is a widely used method to achieve high levels of transgene expression for basic and clinical research. Until now, immunological side effects have not been described. To trigger T cell responses against human papillomavirus, we incubated T cells with dendritic cells that were pulsed with RNA encoding the codon-optimized E7 oncogene. All T cell receptors isolated from responding T cell clones recognized target cells expressing the codon-optimized E7 gene but not the wild type E7 sequence. Epitope mapping revealed recognition of a cryptic epitope from the +3 alternative reading frame of codon-optimized E7, which is not encoded by the wild type E7 sequence. The introduction of a stop codon into the +3 alternative reading frame protected the transgene product from recognition by T cell receptor gene-modified T cells. This is the first experimental study demonstrating that codon optimization can render a transgene artificially immunogenic through generation of a dominant cryptic epitope. This finding may be of great importance for the clinical field of gene therapy to avoid rejection of gene-corrected cells and for the design of DNA- and RNA-based vaccines, where codon optimization may artificially add a strong immunogenic component to the vaccine. PMID:25799237
Johnston, Christopher D; Bannantine, John P; Govender, Rodney; Endersen, Lorraine; Pletzer, Daniel; Weingart, Helge; Coffey, Aidan; O'Mahony, Jim; Sleator, Roy D
2014-01-01
It is well documented that open reading frames containing high GC content show poor expression in A+T rich hosts. Specifically, G+C-rich codon usage is a limiting factor in heterologous expression of Mycobacterium avium subsp. paratuberculosis (MAP) proteins using Lactobacillus salivarius. However, re-engineering opening reading frames through synonymous substitutions can offset codon bias and greatly enhance MAP protein production in this host. In this report, we demonstrate that codon-usage manipulation of MAP2121c can enhance the heterologous expression of the major membrane protein (MMP), analogous to the form in which it is produced natively by MAP bacilli. When heterologously over-expressed, antigenic determinants were preserved in synthetic MMP proteins as shown by monoclonal antibody mediated ELISA. Moreover, MMP is a membrane protein in MAP, which is also targeted to the cellular surface of recombinant L. salivarius at levels comparable to MAP. Additionally, we previously engineered MAP3733c (encoding MptD) and show herein that MptD displays the tendency to associate with the cytoplasmic membrane boundary under confocal microscopy and the intracellularly accumulated protein selectively adheres to the MptD-specific bacteriophage fMptD. This work demonstrates there is potential for L. salivarius as a viable antigen delivery vehicle for MAP, which may provide an effective mucosal vaccine against Johne's disease.
2014-01-01
Background Heterologous gene expression is an important tool for synthetic biology that enables metabolic engineering and the production of non-natural biologics in a variety of host organisms. The translational efficiency of heterologous genes can often be improved by optimizing synonymous codon usage to better match the host organism. However, traditional approaches for optimization neglect to take into account many factors known to influence synonymous codon distributions. Results Here we define an alternative approach for codon optimization that utilizes systems level information and codon context for the condition under which heterologous genes are being expressed. Furthermore, we utilize a probabilistic algorithm to generate multiple variants of a given gene. We demonstrate improved translational efficiency using this condition-specific codon optimization approach with two heterologous genes, the fluorescent protein-encoding eGFP and the catechol 1,2-dioxygenase gene CatA, expressed in S. cerevisiae. For the latter case, optimization for stationary phase production resulted in nearly 2.9-fold improvements over commercial gene optimization algorithms. Conclusions Codon optimization is now often a standard tool for protein expression, and while a variety of tools and approaches have been developed, they do not guarantee improved performance for all hosts of applications. Here, we suggest an alternative method for condition-specific codon optimization and demonstrate its utility in Saccharomyces cerevisiae as a proof of concept. However, this technique should be applicable to any organism for which gene expression data can be generated and is thus of potential interest for a variety of applications in metabolic and cellular engineering. PMID:24636000
Relative codon adaptation: a generic codon bias index for prediction of gene expression.
Fox, Jesse M; Erill, Ivan
2010-06-01
The development of codon bias indices (CBIs) remains an active field of research due to their myriad applications in computational biology. Recently, the relative codon usage bias (RCBS) was introduced as a novel CBI able to estimate codon bias without using a reference set. The results of this new index when applied to Escherichia coli and Saccharomyces cerevisiae led the authors of the original publications to conclude that natural selection favours higher expression and enhanced codon usage optimization in short genes. Here, we show that this conclusion was flawed and based on the systematic oversight of an intrinsic bias for short sequences in the RCBS index and of biases in the small data sets used for validation in E. coli. Furthermore, we reveal that how the RCBS can be corrected to produce useful results and how its underlying principle, which we here term relative codon adaptation (RCA), can be made into a powerful reference-set-based index that directly takes into account the genomic base composition. Finally, we show that RCA outperforms the codon adaptation index (CAI) as a predictor of gene expression when operating on the CAI reference set and that this improvement is significantly larger when analysing genomes with high mutational bias.
Park, Soohyun; Pack, Seung Pil; Lee, Jinwon
2012-08-01
We examined the expression of the phosphoenolpyruvate carboxylase (PEPC) gene from marine bacteria in Escherichia coli using codon optimization. The codon-optimized PEPC gene was expressed in the E. coli K-12 strain W3110. SDS-PAGE analysis revealed that the codon-optimized PEPC gene was only expressed in E. coli, and measurement of enzyme activity indicated the highest PEPC activity in the E. coli SGJS112 strain that contained the codon-optimized PEPC gene. In fermentation assays, the E. coli SGJS112 produced the highest yield of oxaloacetate using glucose as the source and produced a 20-times increase in the yield of malate compared to the control. We concluded that the codon optimization enabled E. coli to express the PEPC gene derived from the Glaciecola sp. HTCC2999. Also, the expressed protein exhibited an enzymatic activity similar to that of E. coli PEPC and increased the yield of oxaloacetate and malate in an E. coli system.
Liu, Cunbao; Yang, Xu; Yao, Yufeng; Huang, Weiwei; Sun, Wenjia; Ma, Yanbing
2014-05-01
Two versions of an optimized gene that encodes human papilloma virus type 16 major protein L1 were designed according to the codon usage frequency of Pichia pastoris. Y16 was highly expressed in both P. pastoris and Hansenula polymorpha. M16 expression was as efficient as that of Y16 in P. pastoris, but merely detectable in H. polymorpha even though transcription levels of M16 and Y16 were similar. H. polymorpha had a unique codon usage frequency that contains many more rare codons than Saccharomyces cerevisiae or P. pastoris. These findings indicate that even codon-optimized genes that are expressed well in S. cerevisiae and P. pastoris may be inefficiently expressed in H. polymorpha; thus rare codons must be avoided when universal optimized gene versions are designed to facilitate expression in a variety of yeast expression systems, especially H. polymorpha is involved.
Zhao, Fangzhou; Yu, Chien-Hung; Liu, Yi
2017-08-21
Codon usage biases are found in all eukaryotic and prokaryotic genomes and have been proposed to regulate different aspects of translation process. Codon optimality has been shown to regulate translation elongation speed in fungal systems, but its effect on translation elongation speed in animal systems is not clear. In this study, we used a Drosophila cell-free translation system to directly compare the velocity of mRNA translation elongation. Our results demonstrate that optimal synonymous codons speed up translation elongation while non-optimal codons slow down translation. In addition, codon usage regulates ribosome movement and stalling on mRNA during translation. Finally, we show that codon usage affects protein structure and function in vitro and in Drosophila cells. Together, these results suggest that the effect of codon usage on translation elongation speed is a conserved mechanism from fungi to animals that can affect protein folding in eukaryotic organisms. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Sun, Xianhua; Xue, Xianli; Li, Mengzhu; Gao, Fei; Hao, Zhenzhen; Huang, Huoqing; Luo, Huiying; Qin, Lina; Yao, Bin; Su, Xiaoyun
2017-12-20
Cellulase and mannanase are both important enzyme additives in animal feeds. Expressing the two enzymes simultaneously within one microbial host could potentially lead to cost reductions in the feeding of animals. For this purpose, we codon-optimized the Aspergillus niger Man5A gene to the codon-usage bias of Trichoderma reesei. By comparing the free energies and the local structures of the nucleotide sequences, one optimized sequence was finally selected and transformed into the T. reesei pyridine-auxotrophic strain TU-6. The codon-optimized gene was expressed to a higher level than the original one. Further expressing the codon-optimized gene in a mutated T. reesei strain through fed-batch cultivation resulted in coproduction of cellulase and mannanase up to 1376 U·mL -1 and 1204 U·mL -1 , respectively.
Pal, Shilpee; Sarkar, Indrani; Roy, Ayan; Mohapatra, Pradeep K Das; Mondal, Keshab C; Sen, Arnab
2018-02-01
The present study has been aimed to the comparative analysis of high GC composition containing Corynebacterium genomes and their evolutionary study by exploring codon and amino acid usage patterns. Phylogenetic study by MLSA approach, indel analysis and BLAST matrix differentiated Corynebacterium species in pathogenic and non-pathogenic clusters. Correspondence analysis on synonymous codon usage reveals that, gene length, optimal codon frequencies and tRNA abundance affect the gene expression of Corynebacterium. Most of the optimal codons as well as translationally optimal codons are C ending i.e. RNY (R-purine, N-any nucleotide base, and Y-pyrimidine) and reveal translational selection pressure on codon bias of Corynebacterium. Amino acid usage is affected by hydrophobicity, aromaticity, protein energy cost, etc. Highly expressed genes followed the cost minimization hypothesis and are less diverged at their synonymous positions of codons. Functional analysis of core genes shows significant difference in pathogenic and non-pathogenic Corynebacterium. The study reveals close relationship between non-pathogenic and opportunistic pathogenic Corynebaterium as well as between molecular evolution and survival niches of the organism.
Stachyra, Anna; Redkiewicz, Patrycja; Kosson, Piotr; Protasiuk, Anna; Góra-Sochacka, Anna; Kudla, Grzegorz; Sirko, Agnieszka
2016-08-26
Highly pathogenic avian influenza viruses are a serious threat to domestic poultry and can be a source of new human pandemic and annual influenza strains. Vaccination is the main strategy of protection against influenza, thus new generation vaccines, including DNA vaccines, are needed. One promising approach for enhancing the immunogenicity of a DNA vaccine is to maximize its expression in the immunized host. The immunogenicity of three variants of a DNA vaccine encoding hemagglutinin (HA) from the avian influenza virus A/swan/Poland/305-135V08/2006 (H5N1) was compared in two animal models, mice (BALB/c) and chickens (broilers and layers). One variant encoded the wild type HA while the other two encoded HA without proteolytic site between HA1 and HA2 subunits and differed in usage of synonymous codons. One of them was enriched for codons preferentially used in chicken genes, while in the other modified variant the third position of codons was occupied in almost 100 % by G or C nucleotides. The variant of the DNA vaccine containing almost 100 % of the GC content in the third position of codons stimulated strongest immune response in two animal models, mice and chickens. These results indicate that such modification can improve not only gene expression but also immunogenicity of DNA vaccine. Enhancement of the GC content in the third position of the codon might be a good strategy for development of a variant of a DNA vaccine against influenza that could be highly effective in distant hosts, such as birds and mammals, including humans.
Kianmehr, Anvarsadat; Golavar, Raziyeh; Rouintan, Mandana; Mahrooz, Abdolkarim; Fard-Esfahani, Pezhman; Oladnabi, Morteza; Khajeniazi, Safoura; Mostafavi, Seyede Samaneh; Omidinia, Eskandar
2016-02-01
Darbepoetin alfa is an engineered and hyperglycosylated analog of recombinant human erythropoietin (EPO) which is used as a drug in treating anemia in patients with chronic kidney failure and cancer. This study desribes the secretory expression of a codon-optimized recombinant form of darbepoetin alfa in Leishmania tarentolae T7-TR. Synthetic codon-optimized gene was amplified by PCR and cloned into the pLEXSY-I-blecherry3 vector. The resultant expression vector, pLEXSYDarbo, was purified, digested, and electroporated into the L. tarentolae. Expression of recombinant darbepoetin alfa was evaluated by ELISA, reverse-transcription PCR (RT-PCR), Western blotting, and biological activity. After codon optimization, codon adaptation index (CAI) of the gene raised from 0.50 to 0.99 and its GC% content changed from 56% to 58%. Expression analysis confirmed the presence of a protein band at 40 kDa. Furthermore, reticulocyte experiment results revealed that the activity of expressed darbepoetin alfa was similar to that of its equivalent expressed in Chinese hamster ovary (CHO) cells. These data suggested that the codon optimization and expression in L. tarentolae host provided an efficient approach for high level expression of darbepoetin alfa. Copyright © 2015 Elsevier Inc. All rights reserved.
Codon usage affects the structure and function of the Drosophila circadian clock protein PERIOD.
Fu, Jingjing; Murphy, Katherine A; Zhou, Mian; Li, Ying H; Lam, Vu H; Tabuloc, Christine A; Chiu, Joanna C; Liu, Yi
2016-08-01
Codon usage bias is a universal feature of all genomes, but its in vivo biological functions in animal systems are not clear. To investigate the in vivo role of codon usage in animals, we took advantage of the sensitivity and robustness of the Drosophila circadian system. By codon-optimizing parts of Drosophila period (dper), a core clock gene that encodes a critical component of the circadian oscillator, we showed that dper codon usage is important for circadian clock function. Codon optimization of dper resulted in conformational changes of the dPER protein, altered dPER phosphorylation profile and stability, and impaired dPER function in the circadian negative feedback loop, which manifests into changes in molecular rhythmicity and abnormal circadian behavioral output. This study provides an in vivo example that demonstrates the role of codon usage in determining protein structure and function in an animal system. These results suggest a universal mechanism in eukaryotes that uses a codon usage "code" within genetic codons to regulate cotranslational protein folding. © 2016 Fu et al.; Published by Cold Spring Harbor Laboratory Press.
Zheng, Desong; Sun, Quanxi; Liu, Jiang; Li, Yaxiao; Hua, Jinping
2016-01-01
Eicosapentaenoic acid (EPA, 20:5Δ5,8,11,14,17) and Docosahexaenoic acid (DHA, 22:6Δ4,7,10,13,16,19) are nutritionally beneficial to human health. Transgenic production of EPA and DHA in oilseed crops by transferring genes originating from lower eukaryotes, such as microalgae and fungi, has been attempted in recent years. However, the low yield of EPA and DHA produced in these transgenic crops is a major hurdle for the commercialization of these transgenics. Many factors can negatively affect transgene expression, leading to a low level of converted fatty acid products. Among these the codon bias between the transgene donor and the host crop is one of the major contributing factors. Therefore, we carried out codon optimization of a fatty acid delta-6 desaturase gene PinD6 from the fungus Phytophthora infestans, and a delta-9 elongase gene, IgASE1 from the microalga Isochrysis galbana for expression in Saccharomyces cerevisiae and Arabidopsis respectively. These are the two key genes encoding enzymes for driving the first catalytic steps in the Δ6 desaturation/Δ6 elongation and the Δ9 elongation/Δ8 desaturation pathways for EPA/DHA biosynthesis. Hence expression levels of these two genes are important in determining the final yield of EPA/DHA. Via PCR-based mutagenesis we optimized the least preferred codons within the first 16 codons at their N-termini, as well as the most biased CGC codons (coding for arginine) within the entire sequences of both genes. An expression study showed that transgenic Arabidopsis plants harbouring the codon-optimized IgASE1 contained 64% more elongated fatty acid products than plants expressing the native IgASE1 sequence, whilst Saccharomyces cerevisiae expressing the codon optimized PinD6 yielded 20 times more desaturated products than yeast expressing wild-type (WT) PinD6. Thus the codon optimization strategy we developed here offers a simple, effective and low-cost alternative to whole gene synthesis for high expression of foreign genes in yeast and Arabidopsis. PMID:27433934
Complex codon usage pattern and compositional features of retroviruses.
RoyChoudhury, Sourav; Mukherjee, Debaprasad
2013-01-01
Retroviruses infect a wide range of organisms including humans. Among them, HIV-1, which causes AIDS, has now become a major threat for world health. Some of these viruses are also potential gene transfer vectors. In this study, the patterns of synonymous codon usage in retroviruses have been studied through multivariate statistical methods on ORFs sequences from the available 56 retroviruses. The principal determinant for evolution of the codon usage pattern in retroviruses seemed to be the compositional constraints, while selection for translation of the viral genes plays a secondary role. This was further supported by multivariate analysis on relative synonymous codon usage. Thus, it seems that mutational bias might have dominated role over translational selection in shaping the codon usage of retroviruses. Codon adaptation index was used to identify translationally optimal codons among genes from retroviruses. The comparative analysis of the preferred and optimal codons among different retroviral groups revealed that four codons GAA, AAA, AGA, and GGA were significantly more frequent in most of the retroviral genes inspite of some differences. Cluster analysis also revealed that phylogenetically related groups of retroviruses have probably evolved their codon usage in a concerted manner under the influence of their nucleotide composition.
Haq, Ikram Ul; Akram, Fatima
2017-09-01
Commonly, unintentional induction and inadvertently preparing medium for engineered Escherichia coli BL21 CodonPlus (DE3)-RIPL, give poor or variable yields of heterologous proteins. Therefore, to enhance the activity and production of an industrially relevant recombinant processive endo-1,4-β-glucanase (CenC) propagated in Escherichia coli BL21 CodonPlus(DE3)-RIPL through various cultivation and induction strategies. Investigation of various growth media and induction parameters revealed that high-cell-density and optimal CenC expression were obtained in ZYBM9 medium induced either with 0.5 mM IPTG/150 mM lactose, after 6 h induction at 37 °C; and before induction, bacterial cells were given heat shock (42 °C) for 1 h when culture density (OD 600nm ) reached at 0.6. Intracellular enzyme activity was enhanced by 6.67 and 3.20-fold in ZYBM9 and 3×ZYBM9 medium, respectively, under optimal conditions. Using YNG auto-induction medium, activity was 2.5-fold increased after 10 h incubation at 37 °C. Approximately similar results were obtained by transferring the optimized process at the bioreactor level. Results showed that the effective process strategy is essential to enhance recombinant bacterial cell mass and enzyme production from small to large-scale. To the best of our knowledge, this is the first ever report on enhanced production of thermostable processive endo-1,4-β-glucanase cloned from Ruminiclostridium thermocellum, which is a suitable candidate for industrial applications. Graphical Abstract Flow Chart Summary of Enhanced Production of a Recombinant Multidomain Thermostable GH9 Processive Endo-1,4-β-glucanase from Ruminiclostridium thermocellum.
Construction of the yeast whole-cell Rhizopus oryzae lipase biocatalyst with high activity.
Chen, Mei-ling; Guo, Qin; Wang, Rui-zhi; Xu, Juan; Zhou, Chen-wei; Ruan, Hui; He, Guo-qing
2011-07-01
Surface display is effectively utilized to construct a whole-cell biocatalyst. Codon optimization has been proven to be effective in maximizing production of heterologous proteins in yeast. Here, the cDNA sequence of Rhizopus oryzae lipase (ROL) was optimized and synthesized according to the codon bias of Saccharomyces cerevisiae, and based on the Saccharomyces cerevisiae cell surface display system with α-agglutinin as an anchor, recombinant yeast displaying fully codon-optimized ROL with high activity was successfully constructed. Compared with the wild-type ROL-displaying yeast, the activity of the codon-optimized ROL yeast whole-cell biocatalyst (25 U/g dried cells) was 12.8-fold higher in a hydrolysis reaction using p-nitrophenyl palmitate (pNPP) as the substrate. To our knowledge, this was the first attempt to combine the techniques of yeast surface display and codon optimization for whole-cell biocatalyst construction. Consequently, the yeast whole-cell ROL biocatalyst was constructed with high activity. The optimum pH and temperature for the yeast whole-cell ROL biocatalyst were pH 7.0 and 40 °C. Furthermore, this whole-cell biocatalyst was applied to the hydrolysis of tributyrin and the resulted conversion of butyric acid reached 96.91% after 144 h.
Analysis of transcriptome data reveals multifactor constraint on codon usage in Taenia multiceps.
Huang, Xing; Xu, Jing; Chen, Lin; Wang, Yu; Gu, Xiaobin; Peng, Xuerong; Yang, Guangyou
2017-04-20
Codon usage bias (CUB) is an important evolutionary feature in genomes that has been widely observed in many organisms. However, the synonymous codon usage pattern in the genome of T. multiceps remains to be clarified. In this study, we analyzed the codon usage of T. multiceps based on the transcriptome data to reveal the constraint factors and to gain an improved understanding of the mechanisms that shape synonymous CUB. Analysis of a total of 8,620 annotated mRNA sequences from T. multiceps indicated only a weak codon bias, with mean GC and GC3 content values of 49.29% and 51.43%, respectively. Our analysis indicated that nucleotide composition, mutational pressure, natural selection, gene expression level, amino acids with grand average of hydropathicity (GRAVY) and aromaticity (Aromo) and the effective selection of amino-acids all contributed to the codon usage in T. multiceps. Among these factors, natural selection was implicated as the major factor affecting the codon usage variation in T. multiceps. The codon usage of ribosome genes was affected mainly by mutations, while the essential genes were affected mainly by selection. In addition, 21codons were identified as "optimal codons". Overall, the optimal codons were GC-rich (GC:AU, 41:22), and ended with G or C (except CGU). Furthermore, different degrees of variation in codon usage were found between T. multiceps and Escherichia coli, yeast, Homo sapiens. However, little difference was found between T. multiceps and Taenia pisiformis. In this study, the codon usage pattern of T. multiceps was analyzed systematically and factors affected CUB were also identified. This is the first study of codon biology in T. multiceps. Understanding the codon usage pattern in T. multiceps can be helpful for the discovery of new genes, molecular genetic engineering and evolutionary studies.
Yin, Changchuan
2015-04-01
To apply digital signal processing (DSP) methods to analyze DNA sequences, the sequences first must be specially mapped into numerical sequences. Thus, effective numerical mappings of DNA sequences play key roles in the effectiveness of DSP-based methods such as exon prediction. Despite numerous mappings of symbolic DNA sequences to numerical series, the existing mapping methods do not include the genetic coding features of DNA sequences. We present a novel numerical representation of DNA sequences using genetic codon context (GCC) in which the numerical values are optimized by simulation annealing to maximize the 3-periodicity signal to noise ratio (SNR). The optimized GCC representation is then applied in exon and intron prediction by Short-Time Fourier Transform (STFT) approach. The results show the GCC method enhances the SNR values of exon sequences and thus increases the accuracy of predicting protein coding regions in genomes compared with the commonly used 4D binary representation. In addition, this study offers a novel way to reveal specific features of DNA sequences by optimizing numerical mappings of symbolic DNA sequences.
Kwon, Inchan; Choi, Eun Sil
2016-01-01
Multiple-site-specific incorporation of a noncanonical amino acid into a recombinant protein would be a very useful technique to generate multiple chemical handles for bioconjugation and multivalent binding sites for the enhanced interaction. Previously combination of a mutant yeast phenylalanyl-tRNA synthetase variant and the yeast phenylalanyl-tRNA containing the AAA anticodon was used to incorporate a noncanonical amino acid into multiple UUU phenylalanine (Phe) codons in a site-specific manner. However, due to the less selective codon recognition of the AAA anticodon, there was significant misincorporation of a noncanonical amino acid into unwanted UUC Phe codons. To enhance codon selectivity, we explored degenerate leucine (Leu) codons instead of Phe degenerate codons. Combined use of the mutant yeast phenylalanyl-tRNA containing the CAA anticodon and the yPheRS_naph variant allowed incorporation of a phenylalanine analog, 2-naphthylalanine, into murine dihydrofolate reductase in response to multiple UUG Leu codons, but not to other Leu codon sites. Despite the moderate UUG codon occupancy by 2-naphthylalaine, these results successfully demonstrated that the concept of forced ambiguity of the genetic code can be achieved for the Leu codons, available for multiple-site-specific incorporation. PMID:27028506
Kwon, Inchan; Choi, Eun Sil
2016-01-01
Multiple-site-specific incorporation of a noncanonical amino acid into a recombinant protein would be a very useful technique to generate multiple chemical handles for bioconjugation and multivalent binding sites for the enhanced interaction. Previously combination of a mutant yeast phenylalanyl-tRNA synthetase variant and the yeast phenylalanyl-tRNA containing the AAA anticodon was used to incorporate a noncanonical amino acid into multiple UUU phenylalanine (Phe) codons in a site-specific manner. However, due to the less selective codon recognition of the AAA anticodon, there was significant misincorporation of a noncanonical amino acid into unwanted UUC Phe codons. To enhance codon selectivity, we explored degenerate leucine (Leu) codons instead of Phe degenerate codons. Combined use of the mutant yeast phenylalanyl-tRNA containing the CAA anticodon and the yPheRS_naph variant allowed incorporation of a phenylalanine analog, 2-naphthylalanine, into murine dihydrofolate reductase in response to multiple UUG Leu codons, but not to other Leu codon sites. Despite the moderate UUG codon occupancy by 2-naphthylalaine, these results successfully demonstrated that the concept of forced ambiguity of the genetic code can be achieved for the Leu codons, available for multiple-site-specific incorporation.
Algorithms for optimizing cross-overs in DNA shuffling.
He, Lu; Friedman, Alan M; Bailey-Kellogg, Chris
2012-03-21
DNA shuffling generates combinatorial libraries of chimeric genes by stochastically recombining parent genes. The resulting libraries are subjected to large-scale genetic selection or screening to identify those chimeras with favorable properties (e.g., enhanced stability or enzymatic activity). While DNA shuffling has been applied quite successfully, it is limited by its homology-dependent, stochastic nature. Consequently, it is used only with parents of sufficient overall sequence identity, and provides no control over the resulting chimeric library. This paper presents efficient methods to extend the scope of DNA shuffling to handle significantly more diverse parents and to generate more predictable, optimized libraries. Our CODNS (cross-over optimization for DNA shuffling) approach employs polynomial-time dynamic programming algorithms to select codons for the parental amino acids, allowing for zero or a fixed number of conservative substitutions. We first present efficient algorithms to optimize the local sequence identity or the nearest-neighbor approximation of the change in free energy upon annealing, objectives that were previously optimized by computationally-expensive integer programming methods. We then present efficient algorithms for more powerful objectives that seek to localize and enhance the frequency of recombination by producing "runs" of common nucleotides either overall or according to the sequence diversity of the resulting chimeras. We demonstrate the effectiveness of CODNS in choosing codons and allocating substitutions to promote recombination between parents targeted in earlier studies: two GAR transformylases (41% amino acid sequence identity), two very distantly related DNA polymerases, Pol X and β (15%), and beta-lactamases of varying identity (26-47%). Our methods provide the protein engineer with a new approach to DNA shuffling that supports substantially more diverse parents, is more deterministic, and generates more predictable and more diverse chimeric libraries.
2007-01-01
Background The usage of synonymous codons shows considerable variation among mammalian genes. How and why this usage is non-random are fundamental biological questions and remain controversial. It is also important to explore whether mammalian genes that are selectively expressed at different developmental stages bear different molecular features. Results In two models of mouse stem cell differentiation, we established correlations between codon usage and the patterns of gene expression. We found that the optimal codons exhibited variation (AT- or GC-ending codons) in different cell types within the developmental hierarchy. We also found that genes that were enriched (developmental-pivotal genes) or specifically expressed (developmental-specific genes) at different developmental stages had different patterns of codon usage and local genomic GC (GCg) content. Moreover, at the same developmental stage, developmental-specific genes generally used more GC-ending codons and had higher GCg content compared with developmental-pivotal genes. Further analyses suggest that the model of translational selection might be consistent with the developmental stage-related patterns of codon usage, especially for the AT-ending optimal codons. In addition, our data show that after human-mouse divergence, the influence of selective constraints is still detectable. Conclusion Our findings suggest that developmental stage-related patterns of gene expression are correlated with codon usage (GC3) and GCg content in stem cell hierarchies. Moreover, this paper provides evidence for the influence of natural selection at synonymous sites in the mouse genome and novel clues for linking the molecular features of genes to their patterns of expression during mammalian ontogenesis. PMID:17349061
Hiwasa-Tanase, Kyoko; Nyarubona, Mpanja; Hirai, Tadayoshi; Kato, Kazuhisa; Ichikawa, Takanari; Ezura, Hiroshi
2011-01-01
In our previous study, a transgenic tomato line that expressed the MIR gene under control of the cauliflower mosaic virus 35S promoter and the nopaline synthase terminator (tNOS) produced the taste-modifying protein miraculin (MIR). However, the concentration of MIR in the tomatoes was lower than that in the MIR gene's native miracle fruit. To increase MIR production, the native MIR terminator (tMIR) was used and a synthetic gene encoding MIR protein (sMIR) was designed to optimize its codon usage for tomato. Four different combinations of these genes and terminators (MIR-tNOS, MIR-tMIR, sMIR-tNOS and sMIR-tMIR) were constructed and used for transformation. The average MIR concentrations in MIR-tNOS, MIR-tMIR, sMIR-tNOS and sMIR-tMIR fruits were 131, 197, 128 and 287 μg/g fresh weight, respectively. The MIR concentrations using tMIR were higher than those using tNOS. The highest MIR accumulation was detected in sMIR-tMIR fruits. On the other hand, the MIR concentration was largely unaffected by sMIR-tNOS. The expression levels of both MIR and sMIR mRNAs terminated by tMIR tended to be higher than those terminated by tNOS. Read-through mRNA transcripts terminated by tNOS were much longer than those terminated by tMIR. These results suggest that tMIR enhances mRNA expression and permits the multiplier effect of optimized codon usage.
Shao, M; Sha, Z; Zhang, X; Rao, Z; Xu, M; Yang, T; Xu, Z; Yang, S
2017-01-01
3-ketosteroid-Δ 1 -dehydrogenase (KSDD), a flavin adenine dinucleotide (FAD)-dependent enzyme involved in sterol metabolism, specifically catalyses the conversion of androst-4-ene-3,17-dione (AD) to androst-1,4-diene-3,17-dione (ADD). However, the low KSDD activity and the toxic effects of hydrogen peroxide (H 2 O 2 ) generated during the biotransformation of AD to ADD with FAD regeneration hinder its application on AD conversion. The aim of this work was to improve KSDD activity and eliminate the toxic effects of the generated H 2 O 2 to enhance ADD production. The ksdd gene obtained from Mycobacterium neoaurum JC-12 was codon-optimized to increase its expression level in Bacillus subtilis, and the KSDD activity reached 12·3 U mg -1 , which was sevenfold of that of codon-unoptimized gene. To improve AD conversion, catalase was co-expressed with KSDD in B. subtilis 168/pMA5-ksdd opt -katA to eliminate the toxic effects of H 2 O 2 generated during AD conversion. Finally, under optimized bioconversion conditions, fed-batch strategy was carried out and the ADD yield improved to 8·76 g l -1 . This work demonstrates the potential to improve enzyme activity by codon-optimization and eliminate the toxic effects of H 2 O 2 by co-expressing catalase. This study showed the highest ADD productivity ever reported and provides a promising strain for efficient ADD production in the pharmaceutical industry. © 2016 The Society for Applied Microbiology.
Inouye, Satoshi; Suzuki, Takahiro
2016-12-01
The protein expressions of three preferred human codon-optimized Gaussia luciferase genes (pGLuc, EpGLuc, and KpGLuc) were characterized in mammalian and bacterial cells by comparing them with those of wild-type Gaussia luciferase gene (wGLuc) and human codon-optimized Gaussia luciferase gene (hGLuc). Two synthetic genes of EpGLuc and KpGLuc containing the complete preferred human codons have an artificial open-reading frame; however, they had the similar protein expression levels to those of pGLuc and hGLuc in mammalian cells. In bacterial cells, the protein expressions of pGLuc, EpGLuc, and KpGLuc with approximately 65% GC content were the same and showed approximately 60% activities of wGLuc and hGLuc. The artificial open-reading frame in EpGLuc and KpGLuc did not affect the protein expression in mammalian and bacterial cells. Copyright © 2016 Elsevier Inc. All rights reserved.
Codon Usage Bias and Determining Forces in Taenia solium Genome.
Yang, Xing; Ma, Xusheng; Luo, Xuenong; Ling, Houjun; Zhang, Xichen; Cai, Xuepeng
2015-12-01
The tapeworm Taenia solium is an important human zoonotic parasite that causes great economic loss and also endangers public health. At present, an effective vaccine that will prevent infection and chemotherapy without any side effect remains to be developed. In this study, codon usage patterns in the T. solium genome were examined through 8,484 protein-coding genes. Neutrality analysis showed that T. solium had a narrow GC distribution, and a significant correlation was observed between GC12 and GC3. Examination of an NC (ENC vs GC3s)-plot showed a few genes on or close to the expected curve, but the majority of points with low-ENC (the effective number of codons) values were detected below the expected curve, suggesting that mutational bias plays a major role in shaping codon usage. The Parity Rule 2 plot (PR2) analysis showed that GC and AT were not used proportionally. We also identified 26 optimal codons in the T. solium genome, all of which ended with either a G or C residue. These optimal codons in the T. solium genome are likely consistent with tRNAs that are highly expressed in the cell, suggesting that mutational and translational selection forces are probably driving factors of codon usage bias in the T. solium genome.
Laguía-Becher, Melina; Martín, Valentina; Kraemer, Mauricio; Corigliano, Mariana; Yacono, María L; Goldman, Alejandra; Clemente, Marina
2010-07-15
Codon optimization and subcellular targeting were studied with the aim to increase the expression levels of the SAG178-322 antigen of Toxoplasma gondii in tobacco leaves. The expression of the tobacco-optimized and native versions of the SAG1 gene was explored by transient expression from the Agrobacterium tumefaciens binary expression vector, which allows targeting the recombinant protein to the endoplasmic reticulum (ER) and the apoplast. Finally, mice were subcutaneously and orally immunized with leaf extracts-SAG1 and the strategy of prime boost with rSAG1 expressed in Escherichia coli was used to optimize the oral immunization with leaf extracts-SAG1. Leaves agroinfiltrated with an unmodified SAG1 gene accumulated 5- to 10-fold more than leaves agroinfiltrated with a codon-optimized SAG1 gene. ER localization allowed the accumulation of higher levels of native SAG1. However, no significant differences were observed between the mRNA accumulations of the different versions of SAG1. Subcutaneous immunization with leaf extracts-SAG1 (SAG1) protected mice against an oral challenge with a non-lethal cyst dose, and this effect could be associated with the secretion of significant levels of IFN-gamma. The protection was increased when mice were ID boosted with rSAG1 (SAG1+boost). This group elicited a significant Th1 humoral and cellular immune response characterized by high levels of IFN-gamma. In an oral immunization assay, the SAG1+boost group showed a significantly lower brain cyst burden compared to the rest of the groups. Transient agroinfiltration was useful for the expression of all of the recombinant proteins tested. Our results support the usefulness of endoplasmic reticulum signal peptides in enhancing the production of recombinant proteins meant for use as vaccines. The results showed that this plant-produced protein has potential for use as vaccine and provides a potential means for protecting humans and animals against toxoplasmosis.
Codon Optimizing for Increased Membrane Protein Production: A Minimalist Approach.
Mirzadeh, Kiavash; Toddo, Stephen; Nørholm, Morten H H; Daley, Daniel O
2016-01-01
Reengineering a gene with synonymous codons is a popular approach for increasing production levels of recombinant proteins. Here we present a minimalist alternative to this method, which samples synonymous codons only at the second and third positions rather than the entire coding sequence. As demonstrated with two membrane-embedded transporters in Escherichia coli, the method was more effective than optimizing the entire coding sequence. The method we present is PCR based and requires three simple steps: (1) the design of two PCR primers, one of which is degenerate; (2) the amplification of a mini-library by PCR; and (3) screening for high-expressing clones.
Spatz, Stephen J; Volkening, Jeremy D; Mullis, Robert; Li, Fenglan; Mercado, John; Zsak, Laszlo
2013-10-01
Meleagrid herpesvirus type 1 (MeHV-1) is an ideal vector for the expression of antigens from pathogenic avian organisms in order to generate vaccines. Chicken parvovirus (ChPV) is a widespread infectious virus that causes serious disease in chickens. It is one of the etiological agents largely suspected in causing Runting Stunting Syndrome (RSS) in chickens. Initial attempts to express the wild-type gene encoding the capsid protein VP2 of ChPV by insertion into the thymidine kinase gene of MeHV-1 were unsuccessful. However, transient expression of a codon-optimized synthetic VP2 gene cloned into the bicistronic vector pIRES2-Ds-Red2, could be demonstrated by immunocytochemical staining of transfected chicken embryo fibroblasts (CEFs). Red fluorescence could also be detected in these transfected cells since the red fluorescent protein gene is downstream from the internal ribosome entry site (IRES). Strikingly, fluorescence could not be demonstrated in cells transiently transfected with the bicistronic vector containing the wild-type or non-codon-optimized VP2 gene. Immunocytochemical staining of these cells also failed to demonstrate expression of wild-type VP2, indicating that the lack of expression was at the RNA level and the VP2 protein was not toxic to CEFs. Chickens vaccinated with a DNA vaccine consisting of the bicistronic vector containing the codon-optimized VP2 elicited a humoral immune response as measured by a VP2-specific ELISA. This VP2 codon-optimized bicistronic cassette was rescued into the MeHV-1 genome generating a vectored vaccine against ChPV disease.
Liang, Bo; Ngwuta, Joan O.; Surman, Sonja; Kabatova, Barbora; Liu, Xiang; Lingemann, Matthias; Liu, Xueqiao; Yang, Lijuan; Herbert, Richard; Swerczek, Joanna; Chen, Man; Moin, Syed M.; Kumar, Azad; McLellan, Jason S.; Kwong, Peter D.; Graham, Barney S.; Collins, Peter L.
2017-01-01
ABSTRACT Respiratory syncytial virus (RSV) is the most important viral agent of severe pediatric respiratory tract disease worldwide, but it lacks a licensed vaccine or suitable antiviral drug. A live attenuated chimeric bovine/human parainfluenza virus type 3 (rB/HPIV3) was developed previously as a vector expressing RSV fusion (F) protein to confer bivalent protection against RSV and HPIV3. In a previous clinical trial in virus-naive children, rB/HPIV3 was well tolerated but the immunogenicity of wild-type RSV F was unsatisfactory. We previously modified RSV F with a designed disulfide bond (DS) to increase stability in the prefusion (pre-F) conformation and to be efficiently packaged in the vector virion. Here, we further stabilized pre-F by adding both disulfide and cavity-filling mutations (DS-Cav1), and we also modified RSV F codon usage to have a lower CpG content and a higher level of expression. This RSV F open reading frame was evaluated in rB/HPIV3 in three forms: (i) pre-F without vector-packaging signal, (ii) pre-F with vector-packaging signal, and (iii) secreted pre-F ectodomain trimer. Despite being efficiently expressed, the secreted pre-F was poorly immunogenic. DS-Cav1 stabilized pre-F, with or without packaging, induced higher titers of pre-F specific antibodies in hamsters, and improved the quality of RSV-neutralizing serum antibodies. Codon-optimized RSV F containing fewer CpG dinucleotides had higher F expression, replicated more efficiently in vivo, and was more immunogenic. The combination of DS-Cav1 pre-F stabilization, optimized codon usage, reduced CpG content, and vector packaging significantly improved vector immunogenicity and protective efficacy against RSV. This provides an improved vectored RSV vaccine candidate suitable for pediatric clinical evaluation. IMPORTANCE RSV and HPIV3 are the first and second leading viral causes of severe pediatric respiratory disease worldwide. Licensed vaccines or suitable antiviral drugs are not available. We are developing a chimeric rB/HPIV3 vector expressing RSV F as a bivalent RSV/HPIV3 vaccine and have been evaluating means to increase RSV F immunogenicity. In this study, we evaluated the effects of improved stabilization of F in the pre-F conformation and of codon optimization resulting in reduced CpG content and greater pre-F expression. Reduced CpG content dampened the interferon response to infection, promoting higher replication and increased F expression. We demonstrate that improved pre-F stabilization and strategic manipulation of codon usage, together with efficient pre-F packaging into vector virions, significantly increased F immunogenicity in the bivalent RSV/HPIV3 vaccine. The improved immunogenicity included induction of increased titers of high-quality complement-independent antibodies with greater pre-F site Ø binding and greater protection against RSV challenge. PMID:28539444
Charoensri, Nicha; Suphatrakul, Amporn; Sriburi, Rungtawan; Yasanga, Thippawan; Junjhon, Jiraphan; Keelapang, Poonsook; Utaipat, Utaiwan; Puttikhunt, Chunya; Kasinrerk, Watchara; Malasit, Prida; Sittisombut, Nopporn
2014-09-01
Recombinant virus-like particles (rVLPs) of flaviviruses are non-infectious particles released from cells expressing the envelope glycoproteins prM and E. Dengue virus rVLPs are recognized as a potential vaccine candidate, but large scale production of these particles is hindered by low yields and the occurrence of cytopathic effects. In an approach to improve the yield of rVLPs from transfected insect cells, several components of a dengue serotype 2 virus prM+E expression cassette were modified and the effect of these modifications was assessed during transient expression. Enhancement of extracellular rVLP levels by simultaneous substitutions of the prM signal peptide and the stem-anchor region of E with homologous cellular and viral counterparts, respectively, was further augmented by codon optimization. Extensive formation of multinucleated cells following transfection with the codon-optimized expression cassette was abrogated by introducing an E fusion loop mutation. This mutation also helped restore the extracellular E levels affected negatively by alteration of a charged residue at the pr-M junction, which was intended to promote maturation of rVLPs during export. Optimized expression cassettes generated in this multiple add-on modification approach should be useful in the generation of stably expressing clones and production of dengue virus rVLPs for immunogenicity studies. Copyright © 2014 Elsevier B.V. All rights reserved.
Cloning, Codon Optimization, and Expression of Yersinia intermedia Phytase Gene in E. coli.
Mirzaei, Maryam; Saffar, Behnaz; Shareghi, Behzad
2016-06-01
Phytate is an anti-nutritional factor in plants, which catches the most phosphorus contents and some vital minerals. Therefore, Phytase is added mainly as an additive to the monogastric animals' foods to hydrolyze phytate and increase absorption of phosphorus. Y. intermedia phytase is a new phytase with special characteristics such as high specific activity, pH stability, and thermostability. Our aim was to clone, express, and characterizea codon optimized Y. intermedia phytase gene in E. coli . The Y. intermedia phytase gene was optimized according to the codon usage in E. coli . The sequence was synthesized and sub-cloned in pET-22b (+) vector and transformed into E. coli Bl21 (DE3). The protein was expressed in the presence of IPTG at a final concentration of 1 mM at 30°C. The purification of recombinant protein was performed by Ni 2+ affinity chromatography. Phytase activity and stability were determined in various pH and temperatures. The codon optimized Y. intermedia phytase gene was sub-cloned successfully.The expression was confirmed by SDS-PAGE and Western blot analysis. The recombinant enzyme (approximately 45 kDa) was purified. Specific activity of enzyme was 3849 (U.mg -1 ) with optimal pH 5 and optimal temperature of 55°C. Thermostability (80°C for 15 min) and pH stability (3-6) of the enzyme were 56 and more than 80%, respectively. The results of the expression and enzyme characterization revealed that the optimized Y. intermedia phytase gene has a good potential to be produced commercially andto be applied in animals' foodsindustry.
Musto, H; Romero, H; Zavala, A; Jabbari, K; Bernardi, G
1999-07-01
We have analyzed the patterns of synonymous codon preferences of the nuclear genes of Plasmodium falciparum, a unicellular parasite characterized by an extremely GC-poor genome. When all genes are considered, codon usage is strongly biased toward A and T in third codon positions, as expected, but multivariate statistical analysis detects a major trend among genes. At one end genes display codon choices determined mainly by the extreme genome composition of this parasite, and very probably their expression level is low. At the other end a few genes exhibit an increased relative usage of a particular subset of codons, many of which are C-ending. Since the majority of these few genes is putatively highly expressed, we postulate that the increased C-ending codons are translationally optimal. In conclusion, while codon usage of the majority of P. falciparum genes is determined mainly by compositional constraints, a small number of genes exhibit translational selection.
Skoczinski, Pia; Volkenborn, Kristina; Fulton, Alexander; Bhadauriya, Anuseema; Nutschel, Christina; Gohlke, Holger; Knapp, Andreas; Jaeger, Karl-Erich
2017-09-25
Bacillus subtilis produces and secretes proteins in amounts of up to 20 g/l under optimal conditions. However, protein production can be challenging if transcription and cotranslational secretion are negatively affected, or the target protein is degraded by extracellular proteases. This study aims at elucidating the influence of a target protein on its own production by a systematic mutational analysis of the homologous B. subtilis model protein lipase A (LipA). We have covered the full natural diversity of single amino acid substitutions at 155 positions of LipA by site saturation mutagenesis excluding only highly conserved residues and qualitatively and quantitatively screened about 30,000 clones for extracellular LipA production. Identified variants with beneficial effects on production were sequenced and analyzed regarding B. subtilis growth behavior, extracellular lipase activity and amount as well as changes in lipase transcript levels. In total, 26 LipA variants were identified showing an up to twofold increase in either amount or activity of extracellular lipase. These variants harbor single amino acid or codon substitutions that did not substantially affect B. subtilis growth. Subsequent exemplary combination of beneficial single amino acid substitutions revealed an additive effect solely at the level of extracellular lipase amount; however, lipase amount and activity could not be increased simultaneously. Single amino acid and codon substitutions can affect LipA secretion and production by B. subtilis. Several codon-related effects were observed that either enhance lipA transcription or promote a more efficient folding of LipA. Single amino acid substitutions could improve LipA production by increasing its secretion or stability in the culture supernatant. Our findings indicate that optimization of the expression system is not sufficient for efficient protein production in B. subtilis. The sequence of the target protein should also be considered as an optimization target for successful protein production. Our results further suggest that variants with improved properties might be identified much faster and easier if mutagenesis is prioritized towards elements that contribute to enzymatic activity or structural integrity.
Expression of recombinant myostatin propeptide pPIC9K-Msp plasmid in Pichia pastoris.
Du, W; Xia, J; Zhang, Y; Liu, M J; Li, H B; Yan, X M; Zhang, J S; Li, N; Zhou, Z Y; Xie, W Z
2015-12-28
Myostatin propeptide can inhibit the biological activity of myostatin protein and promote muscle growth. To express myostatin propeptide in vitro with a higher biological activity, we performed codon optimization on the sheep myostatin propeptide gene sequence, and mutated aspartic acid-76 to alanine based on the codon usage bias of Pichia pastoris and the enhanced biological activity of myostatin propeptide mutant. Modified myostatin propeptide gene was cloned into the pPIC9K plasmid to form the recombinant plasmid pPIC9K-Msp. Recombinant plasmid pPIC9K-Msp was transformed into Pichia pastoris GS115 by electrotransformation. Transformed cells were screened, and methanol was used to induce expression. SDS-PAGE and western blotting were used to verify the successful expression of myostatin propeptide with biological activity in Pichia pastoris, providing the basis for characterization of this protein.
Human HLA-Ev (147) Expression in Transgenic Animals.
Matsuura, R; Maeda, A; Sakai, R; Eguchi, H; Lo, P-C; Hasuwa, H; Ikawa, M; Nakahata, K; Zenitani, M; Yamamichi, T; Umeda, S; Deguchi, K; Okuyama, H; Miyagawa, S
2016-05-01
In our previous study, we reported on the development of substituting S147C for HLA-E as a useful gene tool for xenotransplantation. In this study we exchanged the codon of HLA-Ev (147), checked its function, and established a line of transgenic mice. A new construct, a codon exchanging human HLA-Ev (147) + IRES + human beta 2-microgloblin, was established. The construct was subcloned into pCXN2 (the chick beta-actin promoter and cytomegalovirus enhancer) vector. Natural killer cell- and macrophage-mediated cytotoxicities were performed using the established the pig endothelial cell (PEC) line with the new gene. Transgenic mice with it were next produced using a micro-injection method. The expression of the molecule on PECs was confirmed by the transfection of the plasmid. The established molecules on PECs functioned well in regulating natural killer cell-mediated cytotoxicity and macrophage-mediated cytotoxicity. We have also successfully generated several lines of transgenic mice with this plasmid. The expression of HLA-Ev (147) in each mouse organ was confirmed by assessing the mRNA. The chick beta-actin promoter and cytomegalovirus enhancer resulted in a relatively broad expression of the gene in each organ, and a strong expression in the cases of the heart and lung. A synthetic HLA-Ev (147) gene with a codon usage optimized to a mammalian system represents a critical factor in the development of transgenic animals for xenotransplantation. Copyright © 2016 Elsevier Inc. All rights reserved.
Decoding Mechanisms by which Silent Codon Changes Influence Protein Biogenesis and Function
Bali, Vedrana; Bebok, Zsuzsanna
2015-01-01
Scope Synonymous codon usage has been a focus of investigation since the discovery of the genetic code and its redundancy. The occurrences of synonymous codons vary between species and within genes of the same genome, known as codon usage bias. Today, bioinformatics and experimental data allow us to compose a global view of the mechanisms by which the redundancy of the genetic code contributes to the complexity of biological systems from affecting survival in prokaryotes, to fine tuning the structure and function of proteins in higher eukaryotes. Studies analyzing the consequences of synonymous codon changes in different organisms have revealed that they impact nucleic acid stability, protein levels, structure and function without altering amino acid sequence. As such, synonymous mutations inevitably contribute to the pathogenesis of complex human diseases. Yet, fundamental questions remain unresolved regarding the impact of silent mutations in human disorders. In the present review we describe developments in this area concentrating on mechanisms by which synonymous mutations may affect protein function and human health. Purpose This synopsis illustrates the significance of synonymous mutations in disease pathogenesis. We review the different steps of gene expression affected by silent mutations, and assess the benefits and possible harmful effects of codon optimization applied in the development of therapeutic biologics. Physiological and medical relevance Understanding mechanisms by which synonymous mutations contribute to complex diseases such as cancer, neurodegeneration and genetic disorders, including the limitations of codon-optimized biologics, provides insight concerning interpretation of silent variants and future molecular therapies. PMID:25817479
Ma, Yan-Ping; Ke, Hao; Liang, Zhi-Ling; Liu, Zhen-Xing; Hao, Le; Ma, Jiang-Yao; Li, Yu-Gu
2016-02-24
Streptococcus agalactiae is an important human and animal pathogen. To better understand the genetic features and evolution of S. agalactiae, multiple factors influencing synonymous codon usage patterns in S. agalactiae were analyzed in this study. A- and U-ending rich codons were used in S. agalactiae function genes through the overall codon usage analysis, indicating that Adenine (A)/Thymine (T) compositional constraints might contribute an important role to the synonymous codon usage pattern. The GC3% against the effective number of codon (ENC) value suggested that translational selection was the important factor for codon bias in the microorganism. Principal component analysis (PCA) showed that (i) mutational pressure was the most important factor in shaping codon usage of all open reading frames (ORFs) in the S. agalactiae genome; (ii) strand specific mutational bias was not capable of influencing the codon usage bias in the leading and lagging strands; and (iii) gene length was not the important factor in synonymous codon usage pattern in this organism. Additionally, the high correlation between tRNA adaptation index (tAI) value and codon adaptation index (CAI), frequency of optimal codons (Fop) value, reinforced the role of natural selection for efficient translation in S. agalactiae. Comparison of synonymous codon usage pattern between S. agalactiae and susceptible hosts (human and tilapia) showed that synonymous codon usage of S. agalactiae was independent of the synonymous codon usage of susceptible hosts. The study of codon usage in S. agalactiae may provide evidence about the molecular evolution of the bacterium and a greater understanding of evolutionary relationships between S. agalactiae and its hosts.
Ma, Yan-Ping; Ke, Hao; Liang, Zhi-Ling; Liu, Zhen-Xing; Hao, Le; Ma, Jiang-Yao; Li, Yu-Gu
2016-01-01
Streptococcus agalactiae is an important human and animal pathogen. To better understand the genetic features and evolution of S. agalactiae, multiple factors influencing synonymous codon usage patterns in S. agalactiae were analyzed in this study. A- and U-ending rich codons were used in S. agalactiae function genes through the overall codon usage analysis, indicating that Adenine (A)/Thymine (T) compositional constraints might contribute an important role to the synonymous codon usage pattern. The GC3% against the effective number of codon (ENC) value suggested that translational selection was the important factor for codon bias in the microorganism. Principal component analysis (PCA) showed that (i) mutational pressure was the most important factor in shaping codon usage of all open reading frames (ORFs) in the S. agalactiae genome; (ii) strand specific mutational bias was not capable of influencing the codon usage bias in the leading and lagging strands; and (iii) gene length was not the important factor in synonymous codon usage pattern in this organism. Additionally, the high correlation between tRNA adaptation index (tAI) value and codon adaptation index (CAI), frequency of optimal codons (Fop) value, reinforced the role of natural selection for efficient translation in S. agalactiae. Comparison of synonymous codon usage pattern between S. agalactiae and susceptible hosts (human and tilapia) showed that synonymous codon usage of S. agalactiae was independent of the synonymous codon usage of susceptible hosts. The study of codon usage in S. agalactiae may provide evidence about the molecular evolution of the bacterium and a greater understanding of evolutionary relationships between S. agalactiae and its hosts. PMID:26927064
Kawaguchi, Kosuke; Yurimoto, Hiroya; Sakai, Yasuyoshi
2014-01-01
A codon-optimized Aspergillus niger pectin methylesterase (PME) gene was expressed in the methylotrophic yeast Canidia boidinii. The PME-producing strains showed better growth on pectin than the wild-type strains, suggesting that the PME-producing strains could efficiently utilize methyl ester moieties of pectin. On the other hand, overproduction of PME negatively affected the proliferation of C. boidinii on leaves of Arabidopsis thaliana.
Vertebrate codon bias indicates a highly GC-rich ancestral genome.
Nabiyouni, Maryam; Prakash, Ashwin; Fedorov, Alexei
2013-04-25
Two factors are thought to have contributed to the origin of codon usage bias in eukaryotes: 1) genome-wide mutational forces that shape overall GC-content and create context-dependent nucleotide bias, and 2) positive selection for codons that maximize efficient and accurate translation. Particularly in vertebrates, these two explanations contradict each other and cloud the origin of codon bias in the taxon. On the one hand, mutational forces fail to explain GC-richness (~60%) of third codon positions, given the GC-poor overall genomic composition among vertebrates (~40%). On the other hand, positive selection cannot easily explain strict regularities in codon preferences. Large-scale bioinformatic assessment, of nucleotide composition of coding and non-coding sequences in vertebrates and other taxa, suggests a simple possible resolution for this contradiction. Specifically, we propose that the last common vertebrate ancestor had a GC-rich genome (~65% GC). The data suggest that whole-genome mutational bias is the major driving force for generating codon bias. As the bias becomes prominent, it begins to affect translation and can result in positive selection for optimal codons. The positive selection can, in turn, significantly modulate codon preferences. Copyright © 2013 Elsevier B.V. All rights reserved.
The Relation of Codon Bias to Tissue-Specific Gene Expression in Arabidopsis thaliana
Camiolo, Salvatore; Farina, Lorenzo; Porceddu, Andrea
2012-01-01
The codon composition of coding sequences plays an important role in the regulation of gene expression. Herein, we report systematic differences in the usage of synonymous codons among Arabidopsis thaliana genes that are expressed specifically in distinct tissues. Although we observed that both regionally and transcriptionally associated mutational biases were associated significantly with codon bias, they could not explain the observed differences fully. Similarly, given that transcript abundances did not account for the differences in codon usage, it is unlikely that selection for translational efficiency can account exclusively for the observed codon bias. Thus, we considered the possible evolution of codon bias as an adaptive response to the different abundances of tRNAs in different tissues. Our analysis demonstrated that in some cases, codon usage in genes that were expressed in a broad range of tissues was influenced primarily by the tissue in which the gene was expressed maximally. On the basis of this finding we propose that genes that are expressed in certain tissues might show a tissue-specific compositional signature in relation to codon usage. These findings might have implications for the design of transgenes in relation to optimizing their expression. PMID:22865738
Seligmann, Hervé; Warthi, Ganesh
2017-01-01
A new codon property, codon directional asymmetry in nucleotide content (CDA), reveals a biologically meaningful genetic code dimension: palindromic codons (first and last nucleotides identical, codon structure XZX) are symmetric (CDA = 0), codons with structures ZXX/XXZ are 5'/3' asymmetric (CDA = - 1/1; CDA = - 0.5/0.5 if Z and X are both purines or both pyrimidines, assigning negative/positive (-/+) signs is an arbitrary convention). Negative/positive CDAs associate with (a) Fujimoto's tetrahedral codon stereo-table; (b) tRNA synthetase class I/II (aminoacylate the 2'/3' hydroxyl group of the tRNA's last ribose, respectively); and (c) high/low antiparallel (not parallel) betasheet conformation parameters. Preliminary results suggest CDA-whole organism associations (body temperature, developmental stability, lifespan). Presumably, CDA impacts spatial kinetics of codon-anticodon interactions, affecting cotranslational protein folding. Some synonymous codons have opposite CDA sign (alanine, leucine, serine, and valine), putatively explaining how synonymous mutations sometimes affect protein function. Correlations between CDA and tRNA synthetase classes are weaker than between CDA and antiparallel betasheet conformation parameters. This effect is stronger for mitochondrial genetic codes, and potentially drives mitochondrial codon-amino acid reassignments. CDA reveals information ruling nucleotide-protein relations embedded in reversed (not reverse-complement) sequences (5'-ZXX-3'/5'-XXZ-3').
Codon usage patterns in Nematoda: analysis based on over 25 million codons in thirty-two species
2006-01-01
Background Codon usage has direct utility in molecular characterization of species and is also a marker for molecular evolution. To understand codon usage within the diverse phylum Nematoda, we analyzed a total of 265,494 expressed sequence tags (ESTs) from 30 nematode species. The full genomes of Caenorhabditis elegans and C. briggsae were also examined. A total of 25,871,325 codons were analyzed and a comprehensive codon usage table for all species was generated. This is the first codon usage table available for 24 of these organisms. Results Codon usage similarity in Nematoda usually persists over the breadth of a genus but then rapidly diminishes even within each clade. Globodera, Meloidogyne, Pristionchus, and Strongyloides have the most highly derived patterns of codon usage. The major factor affecting differences in codon usage between species is the coding sequence GC content, which varies in nematodes from 32% to 51%. Coding GC content (measured as GC3) also explains much of the observed variation in the effective number of codons (R = 0.70), which is a measure of codon bias, and it even accounts for differences in amino acid frequency. Codon usage is also affected by neighboring nucleotides (N1 context). Coding GC content correlates strongly with estimated noncoding genomic GC content (R = 0.92). On examining abundant clusters in five species, candidate optimal codons were identified that may be preferred in highly expressed transcripts. Conclusion Evolutionary models indicate that total genomic GC content, probably the product of directional mutation pressure, drives codon usage rather than the converse, a conclusion that is supported by examination of nematode genomes. PMID:26271136
2011-04-01
Sensitive Dual Color In Vivo Bioluminescence Imaging Using a New Red Codon Optimized Firefly Luciferase and a Green Click Beetle Luciferase Laura...20 nm). Spectral unmixing algorithms were applied to the images where good separation of signals was observed. Furthermore, HEK293 cells that...spectral emissions using a suitable spectral unmixing algorithm . This new D-luciferin-dependent reporter gene couplet opens up the possibility in the future
Enhancement of heterogeneous alkaline xylanase production in Pichia pastoris GS115
NASA Astrophysics Data System (ADS)
Zheng, Wei
2017-08-01
A series of strategies were applied to improve expression level of the recombinant alkaline xylanase from Bacillus pumilus G1-3 in Pichia pastoris GS115. Codon optimization of xylanase gene xynG1-3 from B. pumilus G1-3 were carried out for its heterogeneous expression in P. pastoris. The activity of xylanase encoded by optimized gene (xynG1-3-opt) was up to 33641 U/mL, which was 37% higher than that by wild-type (xynG1-3) gene. The results will greatly contribute to increasing the production of recombinant proteins in P. pastoris and improving the industrial production of the alkaline xylanase.
Takahara, Michiyo; Sakaue, Haruka; Onishi, Yukiko; Yamagishi, Marifu; Kida, Yuichiro; Sakaguchi, Masao
2013-01-11
Nascent chain release from membrane-bound ribosomes by the termination codon was investigated using a cell-free translation system from rabbit supplemented with rough microsomal membrane vesicles. Chain release was extremely slow when mRNA ended with only the termination codon. Tail extension after the termination codon enhanced the release of the nascent chain. Release reached plateau levels with tail extension of 10 bases. This requirement was observed with all termination codons: TAA, TGA and TAG. Rapid release was also achieved by puromycin even in the absence of the extension. Efficient translation termination cannot be achieved in the presence of only a termination codon on the mRNA. Tail extension might be required for correct positioning of the termination codon in the ribosome and/or efficient recognition by release factors. Copyright © 2012. Published by Elsevier Inc.
Grant-Klein, Rebecca J; Altamura, Louis A; Badger, Catherine V; Bounds, Callie E; Van Deusen, Nicole M; Kwilas, Steven A; Vu, Hong A; Warfield, Kelly L; Hooper, Jay W; Hannaman, Drew; Dupuy, Lesley C; Schmaljohn, Connie S
2015-01-01
Cynomolgus macaques were vaccinated by intramuscular electroporation with DNA plasmids expressing codon-optimized glycoprotein (GP) genes of Ebola virus (EBOV) or Marburg virus (MARV) or a combination of codon-optimized GP DNA vaccines for EBOV, MARV, Sudan virus and Ravn virus. When measured by ELISA, the individual vaccines elicited slightly higher IgG responses to EBOV or MARV than did the combination vaccines. No significant differences in immune responses of macaques given the individual or combination vaccines were measured by pseudovirion neutralization or IFN-γ ELISpot assays. Both the MARV and mixed vaccines were able to protect macaques from lethal MARV challenge (5/6 vs. 6/6). In contrast, a greater proportion of macaques vaccinated with the EBOV vaccine survived lethal EBOV challenge in comparison to those that received the mixed vaccine (5/6 vs. 1/6). EBOV challenge survivors had significantly higher pre-challenge neutralizing antibody titers than those that succumbed.
Genome-wide analysis of codon usage bias in four sequenced cotton species.
Wang, Liyuan; Xing, Huixian; Yuan, Yanchao; Wang, Xianlin; Saeed, Muhammad; Tao, Jincai; Feng, Wei; Zhang, Guihua; Song, Xianliang; Sun, Xuezhen
2018-01-01
Codon usage bias (CUB) is an important evolutionary feature in a genome which provides important information for studying organism evolution, gene function and exogenous gene expression. The CUB and its shaping factors in the nuclear genomes of four sequenced cotton species, G. arboreum (A2), G. raimondii (D5), G. hirsutum (AD1) and G. barbadense (AD2) were analyzed in the present study. The effective number of codons (ENC) analysis showed the CUB was weak in these four species and the four subgenomes of the two tetraploids. Codon composition analysis revealed these four species preferred to use pyrimidine-rich codons more frequently than purine-rich codons. Correlation analysis indicated that the base content at the third position of codons affect the degree of codon preference. PR2-bias plot and ENC-plot analyses revealed that the CUB patterns in these genomes and subgenomes were influenced by combined effects of translational selection, directional mutation and other factors. The translational selection (P2) analysis results, together with the non-significant correlation between GC12 and GC3, further revealed that translational selection played the dominant role over mutation pressure in the codon usage bias. Through relative synonymous codon usage (RSCU) analysis, we detected 25 high frequency codons preferred to end with T or A, and 31 low frequency codons inclined to end with C or G in these four species and four subgenomes. Finally, 19 to 26 optimal codons with 19 common ones were determined for each species and subgenomes, which preferred to end with A or T. We concluded that the codon usage bias was weak and the translation selection was the main shaping factor in nuclear genes of these four cotton genomes and four subgenomes.
A Simple Combinatorial Codon Mutagenesis Method for Targeted Protein Engineering.
Belsare, Ketaki D; Andorfer, Mary C; Cardenas, Frida S; Chael, Julia R; Park, Hyun June; Lewis, Jared C
2017-03-17
Directed evolution is a powerful tool for optimizing enzymes, and mutagenesis methods that improve enzyme library quality can significantly expedite the evolution process. Here, we report a simple method for targeted combinatorial codon mutagenesis (CCM). To demonstrate the utility of this method for protein engineering, CCM libraries were constructed for cytochrome P450 BM3 , pfu prolyl oligopeptidase, and the flavin-dependent halogenase RebH; 10-26 sites were targeted for codon mutagenesis in each of these enzymes, and libraries with a tunable average of 1-7 codon mutations per gene were generated. Each of these libraries provided improved enzymes for their respective transformations, which highlights the generality, simplicity, and tunability of CCM for targeted protein engineering.
Moustakas, A; Sonstegard, T S; Hackett, P B
1993-01-01
The Rous sarcoma virus (RSV) leader RNA has three short open reading frames (ORF1 to ORF3) which are conserved in all avian sarcoma-leukosis retroviruses. Effects on virus propagation were determined following three types of alterations in the ORFs: (i) replacement of AUG initiation codons in order to prohibit ORF translation, (ii) alterations of the codon context around the AUG initiation codon to enhance translation of the normally silent ORF3, and (iii) elongation of the ORF coding sequences. Mutagenesis of the AUG codons for ORF1 and ORF2 (AUG1 and AUG2) singly or together delayed the onset of viral replication and cell transformation. In contrast, mutagenesis of AUG3 almost completely suppressed these viral activities. Mutagenesis of ORF3 to enhance its translation inhibited viral propagation. When the mutant ORF3 included an additional frameshift mutation which extended the ORF beyond the initiation site for the gag, gag-pol, and env proteins, host cells were initially transformed but died soon thereafter. Elongation of ORF1 from 7 to 62 codons led to the accumulation of transformation-defective virus with a delayed onset of replication. In contrast, viruses with elongation of ORF1 from 7 to 30 codons, ORF2 from 16 to 48 codons, or ORF3 from 9 to 64 codons, without any alterations in the AUG context, exhibited wild-type phenotypes. These results are consistent with a model that translation of the ORFs is necessary to facilitate virus production. Images PMID:7685415
Al-Babili, Salim; Hoa, Tran Thi Cuc; Schaub, Patrick
2006-01-01
To increase the beta-carotene (provitamin A) content and thus the nutritional value of Golden Rice, the optimization of the enzymes employed, phytoene synthase (PSY) and the Erwinia uredovora carotene desaturase (CrtI), must be considered. CrtI was chosen for this study because this bacterial enzyme, unlike phytoene synthase, was expressed at barely detectable levels in the endosperm of the Golden Rice events investigated. The low protein amounts observed may be caused by either weak cauliflower mosaic virus 35S promoter activity in the endosperm or by inappropriate codon usage. The protein level of CrtI was increased to explore its potential for enhancing the flux of metabolites through the pathway. For this purpose, a synthetic CrtI gene with a codon usage matching that of rice storage proteins was generated. Rice plants were transformed to express the synthetic gene under the control of the endosperm-specific glutelin B1 promoter. In addition, transgenic plants expressing the original bacterial gene were generated, but the endosperm-specific glutelin B1 promoter was employed instead of the cauliflower mosaic virus 35S promoter. Independent of codon optimization, the use of the endosperm-specific promoter resulted in a large increase in bacterial desaturase production in the T(1) rice grains. However, this did not lead to a significant increase in the carotenoid content, suggesting that the bacterial enzyme is sufficiently active in rice endosperm even at very low levels and is not rate-limiting. The endosperm-specific expression of CrtI did not affect the carotenoid pattern in the leaves, which was observed upon its constitutive expression. Therefore, tissue-specific expression of CrtI represents the better option.
tRNA-mediated codon-biased translation in mycobacterial hypoxic persistence
NASA Astrophysics Data System (ADS)
Chionh, Yok Hian; McBee, Megan; Babu, I. Ramesh; Hia, Fabian; Lin, Wenwei; Zhao, Wei; Cao, Jianshu; Dziergowska, Agnieszka; Malkiewicz, Andrzej; Begley, Thomas J.; Alonso, Sylvie; Dedon, Peter C.
2016-11-01
Microbial pathogens adapt to the stress of infection by regulating transcription, translation and protein modification. We report that changes in gene expression in hypoxia-induced non-replicating persistence in mycobacteria--which models tuberculous granulomas--are partly determined by a mechanism of tRNA reprogramming and codon-biased translation. Mycobacterium bovis BCG responded to each stage of hypoxia and aerobic resuscitation by uniquely reprogramming 40 modified ribonucleosides in tRNA, which correlate with selective translation of mRNAs from families of codon-biased persistence genes. For example, early hypoxia increases wobble cmo5U in tRNAThr(UGU), which parallels translation of transcripts enriched in its cognate codon, ACG, including the DosR master regulator of hypoxic bacteriostasis. Codon re-engineering of dosR exaggerates hypoxia-induced changes in codon-biased DosR translation, with altered dosR expression revealing unanticipated effects on bacterial survival during hypoxia. These results reveal a coordinated system of tRNA modifications and translation of codon-biased transcripts that enhance expression of stress response proteins in mycobacteria.
tRNA-mediated codon-biased translation in mycobacterial hypoxic persistence
Chionh, Yok Hian; McBee, Megan; Babu, I. Ramesh; Hia, Fabian; Lin, Wenwei; Zhao, Wei; Cao, Jianshu; Dziergowska, Agnieszka; Malkiewicz, Andrzej; Begley, Thomas J.; Alonso, Sylvie; Dedon, Peter C.
2016-01-01
Microbial pathogens adapt to the stress of infection by regulating transcription, translation and protein modification. We report that changes in gene expression in hypoxia-induced non-replicating persistence in mycobacteria—which models tuberculous granulomas—are partly determined by a mechanism of tRNA reprogramming and codon-biased translation. Mycobacterium bovis BCG responded to each stage of hypoxia and aerobic resuscitation by uniquely reprogramming 40 modified ribonucleosides in tRNA, which correlate with selective translation of mRNAs from families of codon-biased persistence genes. For example, early hypoxia increases wobble cmo5U in tRNAThr(UGU), which parallels translation of transcripts enriched in its cognate codon, ACG, including the DosR master regulator of hypoxic bacteriostasis. Codon re-engineering of dosR exaggerates hypoxia-induced changes in codon-biased DosR translation, with altered dosR expression revealing unanticipated effects on bacterial survival during hypoxia. These results reveal a coordinated system of tRNA modifications and translation of codon-biased transcripts that enhance expression of stress response proteins in mycobacteria. PMID:27834374
Maksiutov, R A; Shchelkunov, S N
2011-01-01
Efficacy of candidate DNA-vaccines based on the variola virus natural gene A30L and artificial gene A30Lopt with modified codon usage, optimized for expression in mammalian cells, was tested. The groups of mice were intracutaneously immunized three times with three-week intervals with candidate DNA-vaccines: pcDNA_A30L or pcDNA_A30Lopt, and in three weeks after the last immunization all mice in the groups were intraperitoneally infected by the ectromelia virus K1 strain in 10 LD50 dose for the estimation of protection. It was shown that the DNA-vaccines based on natural gene A30L and codon-optimized gene A30Lopt elicited virus, thereby neutralizing the antibody response and protected mice from lethal intraperitoneal challenge with the ectromelia virus with lack of statistically significant difference.
Kim, Jung-Hun; Wang, Chonglong; Jang, Hui-Jung; Cha, Myeong-Seok; Park, Ju-Eon; Jo, Seon-Yeong; Choi, Eui-Sung; Kim, Seon-Won
2016-12-23
Isoprene, a volatile C5 hydrocarbon, is an important platform chemical used in the manufacturing of synthetic rubber for tires and various other applications, such as elastomers and adhesives. In this study, Escherichia coli MG1655 harboring Populus trichocarpa isoprene synthase (PtispS) and the exogenous mevalonate (MVA) pathway produced 80 mg/L isoprene. Codon optimization and optimal expression of the ispS gene via adjustment of the RBS strength and inducer concentration increased isoprene production to 199 and 337 mg/L, respectively. To augment expression of MVA pathway genes, the MVA pathway was cloned on a high-copy plasmid (pBR322 origin) with a strong promoter (P trc ), which resulted in an additional increase in isoprene production up to 956 mg/L. To reduce the formation of byproducts derived from acetyl-CoA (an initial substrate of the MVA pathway), nine relevant genes were deleted to generate the E. coli AceCo strain (E. coli MG1655 ΔackA-pta, poxB, ldhA, dld, adhE, pps, and atoDA). The AceCo strain harboring the ispS gene and MVA pathway showed enhanced isoprene production of 1832 mg/L in flask culture with reduced accumulation of byproducts. We achieved a 23-fold increase in isoprene production by codon optimization of PtispS, augmentation of the MVA pathway, and deletion of genes involved in byproduct formation.
Broadbent, Andrew J.; Santos, Celia P.; Anafu, Amanda; Wimmer, Eckard; Mueller, Steffen; Subbarao, Kanta
2015-01-01
Codon-pair bias de-optimization (CPBD) of viruses involves re-writing viral genes using statistically underrepresented codon pairs, without any changes to the amino acid sequence or codon usage. Previously, this technology has been used to attenuate the influenza A/Puerto Rico/8/34 (H1N1) virus. The de-optimized virus was immunogenic and protected inbred mice from challenge. In order to assess whether CPBD could be used to produce a live vaccine against a clinically relevant influenza virus, we generated an influenza A/California/07/2009 pandemic H1N1 (2009 pH1N1) virus with de-optimized HA and NA gene segments (2009 pH1N1-(HA+NA)Min), and evaluated viral replication and protein expression in MDCK cells, and attenuation, immunogenicity, and efficacy in outbred ferrets. The 2009 pH1N1-(HA+NA)Min virus grew to a similar titer as the 2009 pH1N1 wild type (wt) virus in MDCK cells (~106 TCID50/ml), despite reduced HA and NA protein expression on western blot. In ferrets, intranasal inoculation of 2009 pH1N1-(HA+NA)Min virus at doses ranging from 103 to 105 TCID50 led to seroconversion in all animals and protection from challenge with the 2009 pH1N1 wt virus 28 days later. The 2009 pH1N1-(HA+NA)Min virus did not cause clinical illness in ferrets, but replicated to a similar titer as the wt virus in the upper and lower respiratory tract, suggesting that de-optimization of additional gene segments may be warranted for improved attenuation. Taken together, our data demonstrate the potential of using CPBD technology for the development of a live influenza virus vaccine if the level of attenuation is optimized. PMID:26655630
Williams, N P; Mueller, P P; Hinnebusch, A G
1988-01-01
Translational control of GCN4 expression in the yeast Saccharomyces cerevisiae is mediated by multiple AUG codons present in the leader of GCN4 mRNA, each of which initiates a short open reading frame of only two or three codons. Upstream AUG codons 3 and 4 are required to repress GCN4 expression in normal growth conditions; AUG codons 1 and 2 are needed to overcome this repression in amino acid starvation conditions. We show that the regulatory function of AUG codons 1 and 2 can be qualitatively mimicked by the AUG codons of two heterologous upstream open reading frames (URFs) containing the initiation regions of the yeast genes PGK and TRP1. These AUG codons inhibit GCN4 expression when present singly in the mRNA leader; however, they stimulate GCN4 expression in derepressing conditions when inserted upstream from AUG codons 3 and 4. This finding supports the idea that AUG codons 1 and 2 function in the control mechanism as translation initiation sites and further suggests that suppression of the inhibitory effects of AUG codons 3 and 4 is a general consequence of the translation of URF 1 and 2 sequences upstream. Several observations suggest that AUG codons 3 and 4 are efficient initiation sites; however, these sequences do not act as positive regulatory elements when placed upstream from URF 1. This result suggests that efficient translation is only one of the important properties of the 5' proximal URFs in GCN4 mRNA. We propose that a second property is the ability to permit reinitiation following termination of translation and that URF 1 is optimized for this regulatory function. Images PMID:3065626
Revelation of Influencing Factors in Overall Codon Usage Bias of Equine Influenza Viruses
Bhatia, Sandeep; Sood, Richa; Selvaraj, Pavulraj
2016-01-01
Equine influenza viruses (EIVs) of H3N8 subtype are culprits of severe acute respiratory infections in horses, and are still responsible for significant outbreaks worldwide. Adaptability of influenza viruses to a particular host is significantly influenced by their codon usage preference, due to an absolute dependence on the host cellular machinery for their replication. In the present study, we analyzed genome-wide codon usage patterns in 92 EIV strains, including both H3N8 and H7N7 subtypes by computing several codon usage indices and applying multivariate statistical methods. Relative synonymous codon usage (RSCU) analysis disclosed bias of preferred synonymous codons towards A/U-ended codons. The overall codon usage bias in EIVs was slightly lower, and mainly affected by the nucleotide compositional constraints as inferred from the RSCU and effective number of codon (ENc) analysis. Our data suggested that codon usage pattern in EIVs is governed by the interplay of mutation pressure, natural selection from its hosts and undefined factors. The H7N7 subtype was found less fit to its host (horse) in comparison to H3N8, by possessing higher codon bias, lower mutation pressure and much less adaptation to tRNA pool of equine cells. To the best of our knowledge, this is the first report describing the codon usage analysis of the complete genomes of EIVs. The outcome of our study is likely to enhance our understanding of factors involved in viral adaptation, evolution, and fitness towards their hosts. PMID:27119730
Evolutionary Consequences of DNA Methylation in a Basal Metazoan
Dixon, Groves B.; Bay, Line K.; Matz, Mikhail V.
2016-01-01
Gene body methylation (gbM) is an ancestral and widespread feature in Eukarya, yet its adaptive value and evolutionary implications remain unresolved. The occurrence of gbM within protein-coding sequences is particularly puzzling, because methylation causes cytosine hypermutability and hence is likely to produce deleterious amino acid substitutions. We investigate this enigma using an evolutionarily basal group of Metazoa, the stony corals (order Scleractinia, class Anthozoa, phylum Cnidaria). We show that patterns of coral gbM are similar to other invertebrate species, predicting wide and active transcription and slower sequence evolution. We also find a strong correlation between gbM and codon bias, resulting from systematic replacement of CpG bearing codons. We conclude that gbM has strong effects on codon evolution and speculate that this may influence establishment of optimal codons. PMID:27189563
Farshadpour, Fatemeh; Makvandi, Manoochehr; Taherkhani, Reza
2015-01-01
Background: Hepatitis E Virus (HEV) is the causative agent of enterically transmitted acute hepatitis and has high mortality rate of up to 30% among pregnant women. Therefore, development of a novel vaccine is a desirable goal. Objectives: The aim of this study was to construct tPAsp-PADRE-truncated open reading frame 2 (ORF2) and truncated ORF2 DNA plasmid, which can assist future studies with the preparation of an effective vaccine against Hepatitis E Virus. Materials and Methods: A synthetic codon-optimized gene cassette encoding tPAsp-PADRE-truncated ORF2 protein was designed, constructed and analyzed by some bioinformatics software. Furthermore, a codon-optimized truncated ORF2 gene was amplified by the polymerase chain reaction (PCR), with a specific primer from the previous construct. The constructs were sub-cloned in the pVAX1 expression vector and finally expressed in eukaryotic cells. Results: Sequence analysis and bioinformatics studies of the codon-optimized gene cassette revealed that codon adaptation index (CAI), GC content, and frequency of optimal codon usage (Fop) value were improved, and performance of the secretory signal was confirmed. Cloning and sub-cloning of the tPAsp-PADRE-truncated ORF2 gene cassette and truncated ORF2 gene were confirmed by colony PCR, restriction enzymes digestion and DNA sequencing of the recombinant plasmids pVAX-tPAsp-PADRE-truncated ORF2 (aa 112-660) and pVAX-truncated ORF2 (aa 112-660). The expression of truncated ORF2 protein in eukaryotic cells was approved by an Immunofluorescence assay (IFA) and the reverse transcriptase polymerase chain reaction (RT-PCR) method. Conclusions: The results of this study demonstrated that the tPAsp-PADRE-truncated ORF2 gene cassette and the truncated ORF2 gene in recombinant plasmids are successfully expressed in eukaryotic cells. The immunogenicity of the two recombinant plasmids with different formulations will be evaluated as a novel DNA vaccine in future investigations. PMID:26865938
Park, Soohyun; Chang, Kwang Suk; Jin, Eonseon; Pack, Seung Pil; Lee, Jinwon
2013-01-01
A new phosphoenolpyruvate carboxylase (PEPC) gene of Dunaliella salina is identified using homology analysis was conducted using PEPC gene of Chlamydomonas reinhardtii and Arabidopsis thaliana. Recombinant E. coli SGJS115 with increased production of malate and oxaloacetate was developed by introducing codon-optimized phosphoenolpyruvate carboxylase2 (OPDSPEPC2) gene of Dunaliella salina. E. coli SGJS115 yielded a 9.9 % increase in malate production. In addition, E. coli SGJS115 exhibited two times increase in the yield of oxaloacetate over the E. coli SGJS114 having identified PEPC2 gene obtained from Dunaliella salina.
Sequence Polishing Library (SPL) v10.0
DOE Office of Scientific and Technical Information (OSTI.GOV)
Oberortner, Ernst
The Sequence Polishing Library (SPL) is a suite of software tools in order to automate "Design for Synthesis and Assembly" workflows. Specifically: The SPL "Converter" tool converts files among the following sequence data exchange formats: CSV, FASTA, GenBank, and Synthetic Biology Open Language (SBOL); The SPL "Juggler" tool optimizes the codon usages of DNA coding sequences according to an optimization strategy, a user-specific codon usage table and genetic code. In addition, the SPL "Juggler" can translate amino acid sequences into DNA sequences.:The SPL "Polisher" verifies NA sequences against DNA synthesis constraints, such as GC content, repeating k-mers, and restriction sites.more » In case of violations, the "Polisher" reports the violations in a comprehensive manner. The "Polisher" tool can also modify the violating regions according to an optimization strategy, a user-specific codon usage table and genetic code;The SPL "Partitioner" decomposes large DNA sequences into smaller building blocks with partial overlaps that enable an efficient assembly. The "Partitioner" enables the user to configure the characteristics of the overlaps, which are mostly determined by the utilized assembly protocol, such as length, GC content, or melting temperature.« less
Ilegems, Erwin; Pick, Horst M.; Vogel, Horst
2002-01-01
A reporter assay was developed to detect and quantify nonsense codon suppression by chemically aminoacylated tRNAs in mammalian cells. It is based on the cellular expression of the enhanced green fluorescent protein (EGFP) as a reporter for the site-specific amino acid incorporation in its sequence using an orthogonal suppressor tRNA derived from Escherichia coli. Suppression of an engineered amber codon at position 64 in the EGFP run-off transcript could be achieved by the incorporation of a leucine via an in vitro aminoacylated suppressor tRNA. Microinjection of defined amounts of mutagenized EGFP mRNA and suppressor tRNA into individual cells allowed us to accurately determine suppression efficiencies by measuring the EGFP fluorescence intensity in individual cells using laser-scanning confocal microscopy. Control experiments showed the absence of natural suppression or aminoacylation of the synthetic tRNA by endogenous aminoacyl-tRNA synthetases. This reporter assay opens the way for the optimization of essential experimental parameters for expanding the scope of the suppressor tRNA technology to different cell types. PMID:12466560
Ahn, Jin-Ho; Hwang, Mi-Yeon; Lee, Kyung-Ho; Choi, Cha-Yong; Kim, Dong-Myung
2007-01-01
This study developed a method to boost the expression of recombinant proteins in a cell-free protein synthesis system without leaving additional amino acid residues. It was found that the nucleotide sequences of the signal peptides serve as an efficient downstream box to stimulate protein synthesis when they were fused upstream of the target genes. The extent of stimulation was critically affected by the identity of the second codons of the signal sequences. Moreover, the yield of the synthesized protein was enhanced by as much as 10 times in the presence of an optimal second codon. The signal peptides were in situ cleaved and the target proteins were produced in their native sizes by carrying out the cell-free synthesis reactions in the presence of Triton X-100, most likely through the activation of signal peptidase in the S30 extract. The amplification of the template DNA and the addition of the signal sequences were accomplished by PCR. Hence, elevated levels of recombinant proteins were generated within several hours. PMID:17185295
Alnazawi, Mohamed; Altaher, Abdallah; Kandeel, Mahmoud
2017-01-01
Middle East Respiratory Syndrome Coronavirus (MERS CoV) is a new emerging viral disease characterized by high fatality rate. Understanding MERS CoV genetic aspects and codon usage pattern is important to understand MERS CoV survival, adaptation, evolution, resistance to innate immunity, and help in finding the unique aspects of the virus for future drug discovery experiments. In this work, we provide comprehensive analysis of 238 MERS CoV full genomes comprised of human (hMERS) and camel (cMERS) isolates of the virus. MERS CoV genome shaping seems to be under compositional and mutational bias, as revealed by preference of A/T over G/C nucleotides, preferred codons, nucleotides at the third position of codons (NT3s), relative synonymous codon usage, hydropathicity (Gravy), and aromaticity (Aromo) indices. Effective number of codons (ENc) analysis reveals a general slight codon usage bias. Codon adaptation index reveals incomplete adaptation to host environment. MERS CoV showed high ability to resist the innate immune response by showing lower CpG frequencies. Neutrality evolution analysis revealed a more significant role of mutation pressure in cMERS over hMERS. Correspondence analysis revealed that MERS CoV genomes have three genetic clusters, which were distinct in their codon usage, host, and geographic distribution. Additionally, virtual screening and binding experiments were able to identify three new virus-encoded helicase binding compounds. These compounds can be used for further optimization of inhibitors.
Hui, A; Hayflick, J; Dinkelspiel, K; de Boer, H A
1984-01-01
The effect on the translation efficiency of various mutations in the three bases (the -1 triplet) that precede the AUG start codon of the beta-galactosidase mRNA in Escherichia coli was studied. Of the 39 mutants examined, the level of expression varies over a 20-fold range. The most favorable combinations of bases in the -1 triplet are UAU and CUU. The expression levels in the mutants with UUC, UCA or AGG as the -1 triplet are 20-fold lower than those with UAU or CUU. In general, a U residue immediately preceding the start codon is more favorable for expression than any other base; furthermore, an A residue at the -2 position enhances the translation efficiency in most instances. In both cases, however, the degree of enhancement depends on its context, i.e. the neighboring bases. Although the rules derived from this study are complex, the results show that mutations in any of the three bases preceding the start codon can strongly affect the translational efficiency of the beta-galactosidase mRNA. PMID:6425057
Zhou, Hao; Yan, Bing; Chen, Shun; Wang, Mingshu; Jia, Renyong; Cheng, Anchun
2015-10-01
Tembusu virus (TMUV) is a single-stranded, positive-sense RNA virus. As reported, TMUV infection has resulted in significant poultry losses, and the virus may also pose a threat to public health. To characterize TMUV evolutionarily and to understand the factors accounting for codon usage properties, we performed, for the first time, a comprehensive analysis of codon usage bias for the genomes of 60 TMUV strains. The most recently published TMUV strains were found to be widely distributed in coastal cities of southeastern China. Codon preference among TMUV genomes exhibits a low bias (effective number of codons (ENC)=53.287) and is maintained at a stable level. ENC-GC3 plots and the high correlation between composition constraints and principal component factor analysis of codon usage demonstrated that mutation pressure dominates over natural selection pressure in shaping the TMUV coding sequence composition. The high correlation between the major components of the codon usage pattern and hydrophobicity (Gravy) or aromaticity (Aromo) was obvious, indicating that properties of viral proteins also account for the observed variation in TMUV codon usage. Principal component analysis (PCA) showed that CQW1 isolated from Chongqing may have evolved from GX2013H or GX2013G isolated from Guangxi, thus indicating that TMUV likely disseminated from southeastern China to the mainland. Moreover, the preferred codons encoding eight amino acids were consistent with the optimal codons for human cells, indicating that TMUV may pose a threat to public health due to possible cross-species transmission (birds to birds or birds to humans). The results of this study not only have theoretical value for uncovering the characteristics of synonymous codon usage patterns in TMUV genomes but also have significant meaning with regard to the molecular evolutionary tendencies of TMUV. Copyright © 2015 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Tang, Nicholas C.; Chilkoti, Ashutosh
2016-04-01
Most genes are synthesized using seamless assembly methods that rely on the polymerase chain reaction (PCR). However, PCR of genes encoding repetitive proteins either fails or generates nonspecific products. Motivated by the need to efficiently generate new protein polymers through high-throughput gene synthesis, here we report a codon-scrambling algorithm that enables the PCR-based gene synthesis of repetitive proteins by exploiting the codon redundancy of amino acids and finding the least-repetitive synonymous gene sequence. We also show that the codon-scrambling problem is analogous to the well-known travelling salesman problem, and obtain an exact solution to it by using De Bruijn graphs and a modern mixed integer linear programme solver. As experimental proof of the utility of this approach, we use it to optimize the synthetic genes for 19 repetitive proteins, and show that the gene fragments are amenable to PCR-based gene assembly and recombinant expression.
Ma, X X; Feng, Y P; Gu, Y X; Zhou, J H; Ma, Z R
2016-06-01
As for the alternative AUGs in foot-and-mouth disease virus (FMDV), nucleotide bias of the context flanking the AUG(2nd) could be used as a strong signal to initiate translation. To determine the role of the specific nucleotide context, dicistronic reporter constructs were engineered to contain different versions of nucleotide context linking between internal ribosome entry site (IRES) and downstream gene. The results indicate that under FMDV IRES-dependent mechanism, the nucleotide contexts flanking start codon can influence the translation initiation efficiencies. The most optimal sequences for both start codons have proved to be UUU AUG(1st) AAC and AAG AUG(2nd) GAA.
Energetics of codon-anticodon recognition on the small ribosomal subunit.
Almlöf, Martin; Andér, Martin; Aqvist, Johan
2007-01-09
Recent crystal structures of the small ribosomal subunit have made it possible to examine the detailed energetics of codon recognition on the ribosome by computational methods. The binding of cognate and near-cognate anticodon stem loops to the ribosome decoding center, with mRNA containing the Phe UUU and UUC codons, are analyzed here using explicit solvent molecular dynamics simulations together with the linear interaction energy (LIE) method. The calculated binding free energies are in excellent agreement with experimental binding constants and reproduce the relative effects of mismatches in the first and second codon position versus a mismatch at the wobble position. The simulations further predict that the Leu2 anticodon stem loop is about 10 times more stable than the Ser stem loop in complex with the Phe UUU codon. It is also found that the ribosome significantly enhances the intrinsic stability differences of codon-anticodon complexes in aqueous solution. Structural analysis of the simulations confirms the previously suggested importance of the universally conserved nucleotides A1492, A1493, and G530 in the decoding process.
Simple-MSSM: a simple and efficient method for simultaneous multi-site saturation mutagenesis.
Cheng, Feng; Xu, Jian-Miao; Xiang, Chao; Liu, Zhi-Qiang; Zhao, Li-Qing; Zheng, Yu-Guo
2017-04-01
To develop a practically simple and robust multi-site saturation mutagenesis (MSSM) method that enables simultaneously recombination of amino acid positions for focused mutant library generation. A general restriction enzyme-free and ligase-free MSSM method (Simple-MSSM) based on prolonged overlap extension PCR (POE-PCR) and Simple Cloning techniques. As a proof of principle of Simple-MSSM, the gene of eGFP (enhanced green fluorescent protein) was used as a template gene for simultaneous mutagenesis of five codons. Forty-eight randomly selected clones were sequenced. Sequencing revealed that all the 48 clones showed at least one mutant codon (mutation efficiency = 100%), and 46 out of the 48 clones had mutations at all the five codons. The obtained diversities at these five codons are 27, 24, 26, 26 and 22, respectively, which correspond to 84, 75, 81, 81, 69% of the theoretical diversity offered by NNK-degeneration (32 codons; NNK, K = T or G). The enzyme-free Simple-MSSM method can simultaneously and efficiently saturate five codons within one day, and therefore avoid missing interactions between residues in interacting amino acid networks.
Thakur, Anil; Hinnebusch, Alan G
2018-05-01
The eukaryotic 43S preinitiation complex (PIC), bearing initiator methionyl transfer RNA (Met-tRNA i ) in a ternary complex (TC) with eukaryotic initiation factor 2 (eIF2)-GTP, scans the mRNA leader for an AUG codon in favorable context. AUG recognition evokes rearrangement from an open PIC conformation with TC in a "P OUT " state to a closed conformation with TC more tightly bound in a "P IN " state. eIF1 binds to the 40S subunit and exerts a dual role of enhancing TC binding to the open PIC conformation while antagonizing the P IN state, necessitating eIF1 dissociation for start codon selection. Structures of reconstituted PICs reveal juxtaposition of eIF1 Loop 2 with the Met-tRNA i D loop in the P IN state and predict a distortion of Loop 2 from its conformation in the open complex to avoid a clash with Met-tRNA i We show that Ala substitutions in Loop 2 increase initiation at both near-cognate UUG codons and AUG codons in poor context. Consistently, the D71A-M74A double substitution stabilizes TC binding to 48S PICs reconstituted with mRNA harboring a UUG start codon, without affecting eIF1 affinity for 40S subunits. Relatively stronger effects were conferred by arginine substitutions; and no Loop 2 substitutions perturbed the rate of TC loading on scanning 40S subunits in vivo. Thus, Loop 2-D loop interactions specifically impede Met-tRNA i accommodation in the P IN state without influencing the P OUT mode of TC binding; and Arg substitutions convert the Loop 2-tRNA i clash to an electrostatic attraction that stabilizes P IN and enhances selection of poor start codons in vivo.
Optimizing doped libraries by using genetic algorithms
NASA Astrophysics Data System (ADS)
Tomandl, Dirk; Schober, Andreas; Schwienhorst, Andreas
1997-01-01
The insertion of random sequences into protein-encoding genes in combination with biologicalselection techniques has become a valuable tool in the design of molecules that have usefuland possibly novel properties. By employing highly effective screening protocols, a functionaland unique structure that had not been anticipated can be distinguished among a hugecollection of inactive molecules that together represent all possible amino acid combinations.This technique is severely limited by its restriction to a library of manageable size. Oneapproach for limiting the size of a mutant library relies on `doping schemes', where subsetsof amino acids are generated that reveal only certain combinations of amino acids in a proteinsequence. Three mononucleotide mixtures for each codon concerned must be designed, suchthat the resulting codons that are assembled during chemical gene synthesis represent thedesired amino acid mixture on the level of the translated protein. In this paper we present adoping algorithm that `reverse translates' a desired mixture of certain amino acids into threemixtures of mononucleotides. The algorithm is designed to optimally bias these mixturestowards the codons of choice. This approach combines a genetic algorithm with localoptimization strategies based on the downhill simplex method. Disparate relativerepresentations of all amino acids (and stop codons) within a target set can be generated.Optional weighing factors are employed to emphasize the frequencies of certain amino acidsand their codon usage, and to compensate for reaction rates of different mononucleotidebuilding blocks (synthons) during chemical DNA synthesis. The effect of statistical errors thataccompany an experimental realization of calculated nucleotide mixtures on the generatedmixtures of amino acids is simulated. These simulations show that the robustness of differentoptima with respect to small deviations from calculated values depends on their concomitantfitness. Furthermore, the calculations probe the fitness landscape locally and allow apreliminary assessment of its structure.
Metabolic engineering of Escherichia coli for production of valerenadiene.
Nybo, S Eric; Saunders, Jacqueline; McCormick, Sean P
2017-11-20
Valeriana officinalis is a medicinal herb which produces a suite of compounds in its root tissue useful for treatment of anxiety and insomnia. The sesquiterpene components of the root extract, valerenic acid and valerena-1,10-diene, are thought to contribute to most of the observed anxiolytic of Valerian root preparations. However, valerenic acid and its biosynthetic intermediates are only produced in low quantities in the roots of V. officinalis. Thus, in this report, Escherichia coli was metabolically engineered to produce substantial quantities of valerena-1,10-diene in shake flask fermentations with decane overlay. Expression of the wildtype valerenadiene synthase gene (pZE-wvds) resulted in production of 12μg/mL in LB cultures using endogenous FPP metabolism. Expression of a codon-optimized version of the valerenadiene synthase gene (pZE-cvds) resulted in 3-fold higher titers of valerenadiene (32μg/mL). Co-expression of pZE-cvds with an engineered methyl erythritol phosphate (MEP) pathway improved valerenadiene titers 65-fold to 2.09mg/L valerenadiene. Optimization of the fermentation medium to include glycerol supplementation enhanced yields by another 5.5-fold (11.0mg/L valerenadiene). The highest production of valerenadiene resulted from engineering the codon-optimized valerenadiene synthase gene under strong P trc and P T7 promoters and via co-expression of an exogenous mevalonate (MVA) pathway. These efforts resulted in an E. coli production strain that produced 62.0mg/L valerenadiene (19.4mg/L/OD 600 specific productivity). This E. coli production platform will serve as the foundation for the synthesis of novel valerenic acid analogues potentially useful for the treatment of anxiety disorders. Copyright © 2017 Elsevier B.V. All rights reserved.
Gao, Zhaowei; Li, Zhuofu; Zhang, Yuhong; Huang, Huoqing; Li, Mu; Zhou, Liwei; Tang, Yunming; Yao, Bin; Zhang, Wei
2012-03-01
The glucose oxidase (GOD) gene from Penicillium notatum was expressed in Pichia pastoris. The 1,815 bp gene, god-w, encodes 604 amino acids. Recombinant GOD-w had optimal activity at 35-40°C and pH 6.2 and was stable, from pH 3 to 7 maintaining >75% maximum activity after incubation at 50°C for 1 h. GOD-w worked as well as commercial GODs to improve bread making. To achieve high-level expression of recombinant GOD in P. pastoris, 272 nucleotides involving 228 residues were mutated, consistent with the codon bias of P. pastoris. The optimized recombinant GOD-m yielded 615 U ml(-1) (2.5 g protein l(-1)) in a 3 l fermentor--410% higher than GOD-w (148 U ml(-1)), and thus is a low-cost alternative for the bread baking industry.
MacDonald, Chris; Piper, Robert C.
2015-01-01
Here we expand the set of tools for genetically manipulating Saccharomyces cerevisiae. We show that puromycin-resistance can be achieved in yeast through expression of a bacterial puromycin-resistance gene optimized to the yeast codon bias, which in turn serves as an easy to use dominant genetic marker suitable for gene disruption. We have constructed a similar DNA cassette expressing yeast codon-optimized mutant human dihydrofolate reductase (DHFR) that confers resistance to methotrexate and can also be used as a dominant selectable marker. Both of these drug-resistant marker cassettes are flanked by loxP sites allowing for their excision from the genome following expression of cre-recombinase. Finally, we have created a series of plasmids for low-level constitutive expression of cre-recombinase in yeast that allows for efficient excision of loxP-flanked markers. PMID:25688547
Absence of classical heat shock response in the citrus pathogen Xylella fastidiosa.
Martins-de-Souza, Daniel; Martins, Daniel; Astua-Monge, Gustavo; Coletta-Filho, Helvécio Della; Winck, Flavia Vischi; Baldasso, Paulo Aparecido; de Oliveira, Bruno Menezes; Marangoni, Sérgio; Machado, Marcos Antônio; Novello, José Camillo; Smolka, Marcus Bustamante
2007-02-01
The fastidious bacterium Xylella fastidiosa is associated with important crop diseases worldwide. We have recently shown that X. fastidiosa is a peculiar organism having unusually low values of gene codon bias throughout its genome and, unexpectedly, in the group of the most abundant proteins. Here, we hypothesized that the lack of codon usage optimization in X. fastidiosa would incapacitate this organism to undergo quick and massive changes in protein expression as occurs in a classical stress response. Proteomic analysis of the response to heat stress in X. fastidiosa revealed that no changes in protein expression can be detected. Moreover, stress-inducible proteins identified in the closely related citrus pathogen Xanthomonas axonopodis pv citri were found to be constitutively expressed in X. fastidiosa. These proteins have extremely high codon bias values in the X. citri and other well-studied organisms, but low values in X. fastidiosa. Because biased codon usage is well known to correlate to the rate of protein synthesis, we speculate that the peculiar codon bias distribution in X. fastidiosa is related to the absence of a classical stress response, and, probably, alternative strategies for survival of X. fastidiosa under stressfull conditions.
Site-specific incorporation of 4-iodo-L-phenylalanine through opal suppression.
Kodama, Koichiro; Nakayama, Hiroshi; Sakamoto, Kensaku; Fukuzawa, Seketsu; Kigawa, Takanori; Yabuki, Takashi; Kitabatake, Makoto; Takio, Koji; Yokoyama, Shigeyuki
2010-08-01
A variety of unique codons have been employed to expand the genetic code. The use of the opal (UGA) codon is promising, but insufficient information is available about the UGA suppression approach, which facilitates the incorporation of non-natural amino acids through suppression of the UGA codon. In this study, the UGA codon was used to incorporate 4-iodo-l-phenylalanine into position 32 of the Ras protein in an Escherichia coli cell-free translation system. The undesired incorporation of tryptophan in response to the UGA codon was completely repressed by the addition of indolmycin. The minor amount (3%) of contaminating 4-bromo-l-phenylalanine in the building block 4-iodo-l-phenylalanine led to the significant incorporation of 4-bromo-l-phenylalanine (21%), and this problem was solved by using a purified 4-iodo-l-phenylalanine sample. Optimization of the incubation time was also important, since the undesired incorporation of free phenylalanine increased during the cell-free translation reaction. The 4-iodo-l-phenylalanine residue can be used for the chemoselective modification of proteins. This method will contribute to advancements in protein engineering studies with non-natural amino acid substitutions.
Heinemann, Ilka U.; Rovner, Alexis J.; Aerni, Hans R.; Rogulina, Svetlana; Cheng, Laura; Olds, William; Fischer, Jonathan T.; Söll, Dieter; Isaacs, Farren J.; Rinehart, Jesse
2012-01-01
Genetically encoded phosphoserine incorporation programmed by the UAG codon was achieved by addition of engineered elongation factor and an archaeal aminoacyl-tRNA synthetase to the normal Escherichia coli translation machinery (Park (2011) Science 333, 1151). However, protein yield suffers from expression of the orthogonal phosphoserine translation system and competition with release factor 1 (RF-1). In a strain lacking RF-1, phosphoserine phosphatase, and where 7 UAG codons residing in essential genes were converted to UAA, phosphoserine incorporation into GFP and WNK4 was significantly elevated, but with an accompanying loss in cellular fitness and viability. PMID:22982858
Analysis of base and codon usage by rubella virus.
Zhou, Yumei; Chen, Xianfeng; Ushijima, Hiroshi; Frey, Teryl K
2012-05-01
Rubella virus (RUBV), a small, plus-strand RNA virus that is an important human pathogen, has the unique feature that the GC content of its genome (70%) is the highest (by 20%) among RNA viruses. To determine the effect of this GC content on genomic evolution, base and codon usage were analyzed across viruses from eight diverse genotypes of RUBV. Despite differences in frequency of codon use, the favored codons in the RUBV genome matched those in the human genome for 18 of the 20 amino acids, indicating adaptation to the host. Although usage patterns were conserved in corresponding genes in the diverse genotypes, within-genome comparison revealed that both base and codon usages varied regionally, particularly in the hypervariable region (HVR) of the P150 replicase gene. While directional mutation pressure was predominant in determining base and codon usage within most of the genome (with the strongest tendency being towards C's at third codon positions), natural selection was predominant in the HVR region. The GC content of this region was the highest in the genome (>80%), and it was not clear if selection at the nucleotide level accompanied selection at the amino acid level. Dinucleotide frequency analysis of the RUBV genome revealed that TpA usage was lower than expected, similar to mammalian genes; however, CpG usage was not suppressed, and TpG usage was not enhanced, as is the case in mammalian genes.
Manufacture of TATB and TNT from Biosynthesized Phloroglucinols
2010-07-01
the microbial synthesis of mono-O-methylphloroglucinols, phloroglucinol O-methyl transferase (POMT) from Rosa chinensis var. spontanea has been...successfully de novo synthesized in codon-optimized form for expression in E. coli, which is the host currently used for microbial synthesis of...efforts had been made in both strain development and optimizing fermentation conditions for microbial phloroglucinol synthesis . Under optimized resin
van Til, Niek P; de Boer, Helen; Mashamba, Nomusa; Wabik, Agnieszka; Huston, Marshall; Visser, Trudi P; Fontana, Elena; Poliani, Pietro Luigi; Cassani, Barbara; Zhang, Fang; Thrasher, Adrian J; Villa, Anna; Wagemaker, Gerard
2012-10-01
Recombination activating gene 2 (RAG2) deficiency results in severe combined immunodeficiency (SCID) with complete lack of T and B lymphocytes. Initial gammaretroviral gene therapy trials for other types of SCID proved effective, but also revealed the necessity of safe vector design. We report the development of lentiviral vectors with the spleen focus forming virus (SF) promoter driving codon-optimized human RAG2 (RAG2co), which improved phenotype amelioration compared to native RAG2 in Rag2(-/-) mice. With the RAG2co therapeutic transgene, T-cell receptor (TCR) and immunoglobulin repertoire, T-cell mitogen responses, plasma immunoglobulin levels and T-cell dependent and independent specific antibody responses were restored. However, the thymus double positive T-cell population remained subnormal, possibly due to the SF virus derived element being sensitive to methylation/silencing in the thymus, which was prevented by replacing the SF promoter by the previously reported silencing resistant element (ubiquitous chromatin opening element (UCOE)), and also improved B-cell reconstitution to eventually near normal levels. Weak cellular promoters were effective in T-cell reconstitution, but deficient in B-cell reconstitution. We conclude that immune functions are corrected in Rag2(-/-) mice by genetic modification of stem cells using the UCOE driven codon-optimized RAG2, providing a valid optional vector for clinical implementation.
Yao, Li; Yu, Lin-Lu; Zhang, Jun-Jie; Xie, Xiang-Ting; Tao, Qing; Yan, Xin; Hong, Qing; Qiu, Ji-Guo
2016-01-01
ABSTRACT Sphingomonas sp. strain Ndbn-20 degrades and utilizes the herbicide dicamba as its sole carbon and energy source. In the present study, a tetrahydrofolate (THF)-dependent dicamba methyltransferase gene, dmt, was cloned from the strain, and three other genes, metF, dhc, and purU, which are involved in THF metabolism, were found to be located downstream of dmt. A transcriptional study revealed that the four genes constituted one transcriptional unit that was constitutively transcribed. Lysates of cells grown with glucose or dicamba exhibited almost the same activities, which further suggested that the dmt gene is constitutively expressed in the strain. Dmt shared 46% and 45% identities with the methyltransferases DesA and LigM from Sphingomonas paucimobilis SYK-6, respectively. The purified Dmt catalyzed the transfer of methyl from dicamba to THF to form the herbicidally inactive metabolite 3,6-dichlorosalicylic acid (DCSA) and 5-methyl-THF. The activity of Dmt was inhibited by 5-methyl-THF but not by DCSA. The introduction of a codon-optimized dmt gene into Arabidopsis thaliana enhanced resistance against dicamba. In conclusion, this study identified a THF-dependent dicamba methyltransferase, Dmt, with potential applications for the genetic engineering of dicamba-resistant crops. IMPORTANCE Dicamba is a very important herbicide that is widely used to control more than 200 types of broadleaf weeds and is a suitable target herbicide for the engineering of herbicide-resistant transgenic crops. A study of the mechanism of dicamba metabolism by soil microorganisms will benefit studies of its dissipation, transformation, and migration in the environment. This study identified a THF-dependent methyltransferase, Dmt, capable of catalyzing dicamba demethylation in Sphingomonas sp. Ndbn-20, and a preliminary study of its enzymatic characteristics was performed. Introduction of a codon-optimized dmt gene into Arabidopsis thaliana enhanced resistance against dicamba, suggesting that the dmt gene has potential applications for the genetic engineering of herbicide-resistant crops. PMID:27422839
Effect of DNA sequence of Fab fragment on yield characteristics and cell growth of E. coli.
Kulmala, Antti; Huovinen, Tuomas; Lamminmäki, Urpo
2017-06-19
Codon usage is one of the factors influencing recombinant protein expression. We were interested in the codon usage of an antibody Fab fragment gene exhibiting extreme toxicity in the E. coli host. The toxic synthetic human Fab gene contained domains optimized by the "one amino acid-one codon" method. We redesigned five segments of the Fab gene with a "codon harmonization" method described by Angov et al. and studied the effects of these changes on cell viability, Fab yield and display on filamentous phage using different vectors and bacterial strains. The harmonization considerably reduced toxicity, increased Fab expression from negligible levels to 10 mg/l, and restored the display on phage. Testing the impact of the individual redesigned segments revealed that the most significant effects were conferred by changes in the constant domain of the light chain. For some of the Fab gene variants, we also observed striking differences in protein yields when cloned from a chloramphenicol resistant vector into an identical vector, except with ampicillin resistance. In conclusion, our results show that the expression of a heterodimeric secretory protein can be improved by harmonizing selected DNA segments by synonymous codons and reveal additional complexity involved in heterologous protein expression.
Optimizing antibody expression: The nuts and bolts.
Ayyar, B Vijayalakshmi; Arora, Sushrut; Ravi, Shiva Shankar
2017-03-01
Antibodies are extensively utilized entities in biomedical research, and in the development of diagnostics and therapeutics. Many of these applications require high amounts of antibodies. However, meeting this ever-increasing demand of antibodies in the global market is one of the outstanding challenges. The need to maintain a balance between demand and supply of antibodies has led the researchers to discover better means and methods for optimizing their expression. These strategies aim to increase the volumetric productivity of the antibodies along with the reduction of associated manufacturing costs. Recent years have witnessed major advances in recombinant protein technology, owing to the introduction of novel cloning strategies, gene manipulation techniques, and an array of cell and vector engineering techniques, together with the progress in fermentation technologies. These innovations were also highly beneficial for antibody expression. Antibody expression depends upon the complex interplay of multiple factors that may require fine tuning at diverse levels to achieve maximum yields. However, each antibody is unique and requires individual consideration and customization for optimizing the associated expression parameters. This review provides a comprehensive overview of several state-of-the-art approaches, such as host selection, strain engineering, codon optimization, gene optimization, vector modification and process optimization that are deemed suitable for enhancing antibody expression. Copyright © 2017 Elsevier Inc. All rights reserved.
Global analysis of translation termination in E. coli.
Baggett, Natalie E; Zhang, Yan; Gross, Carol A
2017-03-01
Terminating protein translation accurately and efficiently is critical for both protein fidelity and ribosome recycling for continued translation. The three bacterial release factors (RFs) play key roles: RF1 and 2 recognize stop codons and terminate translation; and RF3 promotes disassociation of bound release factors. Probing release factors mutations with reporter constructs containing programmed frameshifting sequences or premature stop codons had revealed a propensity for readthrough or frameshifting at these specific sites, but their effects on translation genome-wide have not been examined. We performed ribosome profiling on a set of isogenic strains with well-characterized release factor mutations to determine how they alter translation globally. Consistent with their known defects, strains with increasingly severe release factor defects exhibit increasingly severe accumulation of ribosomes over stop codons, indicative of an increased duration of the termination/release phase of translation. Release factor mutant strains also exhibit increased occupancy in the region following the stop codon at a significant number of genes. Our global analysis revealed that, as expected, translation termination is generally efficient and accurate, but that at a significant number of genes (≥ 50) the ribosome signature after the stop codon is suggestive of translation past the stop codon. Even native E. coli K-12 exhibits the ribosome signature suggestive of protein extension, especially at UGA codons, which rely exclusively on the reduced function RF2 variant of the K-12 strain for termination. Deletion of RF3 increases the severity of the defect. We unambiguously demonstrate readthrough and frameshifting protein extensions and their further accumulation in mutant strains for a few select cases. In addition to enhancing recoding, ribosome accumulation over stop codons disrupts attenuation control of biosynthetic operons, and may alter expression of some overlapping genes. Together, these functional alterations may either augment the protein repertoire or produce deleterious proteins.
Construction of an efficient Escherichia coli whole-cell biocatalyst for D-mannitol production.
Reshamwala, Shamlan M S; Pagar, Sandip K; Velhal, Vishal S; Maranholakar, Vijay M; Talangkar, Vishal G; Lali, Arvind M
2014-12-01
Mannitol is a six carbon sugar alcohol that finds applications in the pharmaceutical and food industries. A novel Escherichia coli strain capable of converting D-glucose to D-mannitol has been constructed, wherein native mannitol-1-phosphate dehydrogenase (MtlD) and codon-optimized Eimeria tenella mannitol-1-phosphatase (M1Pase) have been overexpressed. Codon-optimized Pseudomonas stutzeri phosphite dehydrogenase (PtxD) was overexpressed for cofactor (NADH) regeneration with the concomitant oxidation of phosphite to phosphate. Whole-cell biotransformation using resting cells in a medium containing D-glucose and equimolar sodium phosphite resulted in d-mannitol yield of 87 mol%. Thus, production of an industrially relevant biochemical without using complex media components and elaborate process control mechanisms has been demonstrated. Copyright © 2014 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.
Regions of extreme synonymous codon selection in mammalian genes
Schattner, Peter; Diekhans, Mark
2006-01-01
Recently there has been increasing evidence that purifying selection occurs among synonymous codons in mammalian genes. This selection appears to be a consequence of either cis-regulatory motifs, such as exonic splicing enhancers (ESEs), or mRNA secondary structures, being superimposed on the coding sequence of the gene. We have developed a program to identify regions likely to be enriched for such motifs by searching for extended regions of extreme codon conservation between homologous genes of related species. Here we present the results of applying this approach to five mammalian species (human, chimpanzee, mouse, rat and dog). Even with very conservative selection criteria, we find over 200 regions of extreme codon conservation, ranging in length from 60 to 178 codons. The regions are often found within genes involved in DNA-binding, RNA-binding or zinc-ion-binding. They are highly depleted for synonymous single nucleotide polymorphisms (SNPs) but not for non-synonymous SNPs, further indicating that the observed codon conservation is being driven by negative selection. Forty-three percent of the regions overlap conserved alternative transcript isoforms and are enriched for known ESEs. Other regions are enriched for TpA dinucleotides and may contain conserved motifs/structures relating to mRNA stability and/or degradation. We anticipate that this tool will be useful for detecting regions enriched in other classes of coding-sequence motifs and structures as well. PMID:16556911
Designing logical codon reassignment - Expanding the chemistry in biology.
Dumas, Anaëlle; Lercher, Lukas; Spicer, Christopher D; Davis, Benjamin G
2015-01-01
Over the last decade, the ability to genetically encode unnatural amino acids (UAAs) has evolved rapidly. The programmed incorporation of UAAs into recombinant proteins relies on the reassignment or suppression of canonical codons with an amino-acyl tRNA synthetase/tRNA (aaRS/tRNA) pair, selective for the UAA of choice. In order to achieve selective incorporation, the aaRS should be selective for the designed tRNA and UAA over the endogenous amino acids and tRNAs. Enhanced selectivity has been achieved by transferring an aaRS/tRNA pair from another kingdom to the organism of interest, and subsequent aaRS evolution to acquire enhanced selectivity for the desired UAA. Today, over 150 non-canonical amino acids have been incorporated using such methods. This enables the introduction of a large variety of structures into proteins, in organisms ranging from prokaryote, yeast and mammalian cells lines to whole animals, enabling the study of protein function at a level that could not previously be achieved. While most research to date has focused on the suppression of 'non-sense' codons, recent developments are beginning to open up the possibility of quadruplet codon decoding and the more selective reassignment of sense codons, offering a potentially powerful tool for incorporating multiple amino acids. Here, we aim to provide a focused review of methods for UAA incorporation with an emphasis in particular on the different tRNA synthetase/tRNA pairs exploited or developed, focusing upon the different UAA structures that have been incorporated and the logic behind the design and future creation of such systems. Our hope is that this will help rationalize the design of systems for incorporation of unexplored unnatural amino acids, as well as novel applications for those already known.
Hong, Jin-Bon; Chou, Fu-Ju; Ku, Amy T; Fan, Hsiang-Hsuan; Lee, Tung-Lung; Huang, Yung-Hsin; Yang, Tsung-Lin; Su, I-Chang; Yu, I-Shing; Lin, Shu-Wha; Chien, Chung-Liang; Ho, Hong-Nerng; Chen, You-Tzung
2014-01-01
PiggyBac is a prevalent transposon system used to deliver transgenes and functionally explore the mammalian untouched genomic territory. The important features of piggyBac transposon are the relatively low insertion site preference and the ability of seamless removal from genome, which allow its potential uses in functional genomics and regenerative medicine. Efforts to increase its transposition efficiency in mammals were made through engineering the corresponding transposase (PBase) codon usage to enhance its expression level and through screening for mutant PBase variants with increased enzyme activity. To improve the safety for its potential use in regenerative medicine applications, site-specific transposition was achieved by using engineered zinc finger- and Gal4-fused PBases. An excision-prone PBase variant has also been successfully developed. Here we describe the construction of a nucleolus-predominant PBase, NP-mPB, by adding a nucleolus-predominant (NP) signal peptide from HIV-1 TAT protein to a mammalian codon-optimized PBase (mPB). Although there is a predominant fraction of the NP-mPB-tGFP fusion proteins concentrated in the nucleoli, an insertion site preference toward nucleolar organizer regions is not detected. Instead a 3-4 fold increase in piggyBac transposition efficiency is reproducibly observed in mouse and human cells.
Lathe, R
1985-05-05
Synthetic probes deduced from amino acid sequence data are widely used to detect cognate coding sequences in libraries of cloned DNA segments. The redundancy of the genetic code dictates that a choice must be made between (1) a mixture of probes reflecting all codon combinations, and (2) a single longer "optimal" probe. The second strategy is examined in detail. The frequency of sequences matching a given probe by chance alone can be determined and also the frequency of sequences closely resembling the probe and contributing to the hybridization background. Gene banks cannot be treated as random associations of the four nucleotides, and probe sequences deduced from amino acid sequence data occur more often than predicted by chance alone. Probe lengths must be increased to confer the necessary specificity. Examination of hybrids formed between unique homologous probes and their cognate targets reveals that short stretches of perfect homology occurring by chance make a significant contribution to the hybridization background. Statistical methods for improving homology are examined, taking human coding sequences as an example, and considerations of codon utilization and dinucleotide frequencies yield an overall homology of greater than 82%. Recommendations for probe design and hybridization are presented, and the choice between using multiple probes reflecting all codon possibilities and a unique optimal probe is discussed.
Immunogenicity of virus-like particles containing modified goose parvovirus VP2 protein.
Chen, Zongyan; Li, Chuanfeng; Zhu, Yingqi; Wang, Binbin; Meng, Chunchun; Liu, Guangqing
2012-10-01
The major capsid protein VP2 of goose parvovirus (GPV) expressed using a baculovirus expression system (BES) assembles into virus-like particles (VLPs). To optimize VP2 gene expression in Sf9 cells, we converted wild-type VP2 (VP2) codons into codons that are more common in insect genes. This change greatly increased VP2 protein production in Sf9 cells. The protein generated from the codon-optimized VP2 (optVP2) was detected by immunoblotting and an indirect immunofluorescence assay (IFA). Transmission electron microscopy analysis revealed the formation of VLPs. These findings indicate that optVP2 yielded stable and high-quality VLPs. Immunogenicity assays revealed that the VLPs are highly immunogenic, elicit a high level of neutralizing antibodies and provide protection against lethal challenge. The antibody levels appeared to be directly related to the number of GP-Ag-positive hepatocytes. The variation trends for GP-Ag-positive hepatocytes were similar in the vaccine groups. In comparison with the control group, the optVP2 VLPs groups exhibited obviously better responses. These data indicate that the VLPs retained immunoreactivity and had strong immunogenicity in susceptible geese. Thus, GPV optVP2 appears to be a good candidate for the vaccination of goslings. Copyright © 2012 Elsevier B.V. All rights reserved.
Emerging engineering principles for yield improvement in microbial cell design.
Comba, Santiago; Arabolaza, Ana; Gramajo, Hugo
2012-01-01
Metabolic Engineering has undertaken a rapid transformation in the last ten years making real progress towards the production of a wide range of molecules and fine chemicals using a designed cellular host. However, the maximization of product yields through pathway optimization is a constant and central challenge of this field. Traditional methods used to improve the production of target compounds from engineered biosynthetic pathways in non-native hosts include: codon usage optimization, elimination of the accumulation of toxic intermediates or byproducts, enhanced production of rate-limiting enzymes, selection of appropriate promoter and ribosome binding sites, application of directed evolution of enzymes, and chassis re-circuit. Overall, these approaches tend to be specific for each engineering project rather than a systematic practice based on a more generalizable strategy. In this mini-review, we highlight some novel and extensive approaches and tools intended to address the improvement of a target product formation, founded in sophisticated principles such as dynamic control, pathway genes modularization, and flux modeling.
Emerging engineering principles for yield improvement in microbial cell design
Comba, Santiago; Arabolaza, Ana; Gramajo, Hugo
2012-01-01
Metabolic Engineering has undertaken a rapid transformation in the last ten years making real progress towards the production of a wide range of molecules and fine chemicals using a designed cellular host. However, the maximization of product yields through pathway optimization is a constant and central challenge of this field. Traditional methods used to improve the production of target compounds from engineered biosynthetic pathways in non-native hosts include: codon usage optimization, elimination of the accumulation of toxic intermediates or byproducts, enhanced production of rate-limiting enzymes, selection of appropriate promoter and ribosome binding sites, application of directed evolution of enzymes, and chassis re-circuit. Overall, these approaches tend to be specific for each engineering project rather than a systematic practice based on a more generalizable strategy. In this mini-review, we highlight some novel and extensive approaches and tools intended to address the improvement of a target product formation, founded in sophisticated principles such as dynamic control, pathway genes modularization, and flux modeling. PMID:24688676
Diecke, Sebastian; Lisowski, Leszek; Kooreman, Nigel G; Wu, Joseph C
2014-01-01
The ability to induce pluripotency in somatic cells is one of the most important scientific achievements in the fields of stem cell research and regenerative medicine. This technique allows researchers to obtain pluripotent stem cells without the controversial use of embryos, providing a novel and powerful tool for disease modeling and drug screening approaches. However, using viruses for the delivery of reprogramming genes and transcription factors may result in integration into the host genome and cause random mutations within the target cell, thus limiting the use of these cells for downstream applications. To overcome this limitation, various non-integrating techniques, including Sendai virus, mRNA, minicircle, and plasmid-based methods, have recently been developed. Utilizing a newly developed codon optimized 4-in-1 minicircle (CoMiC), we were able to reprogram human adult fibroblasts using chemically defined media and without the need for feeder cells.
Xu, Dong-Qing; Mattox, William
2006-01-01
Exonic splicing enhancers (ESEs) are sequences that facilitate recognition of splice sites and prevent exon-skipping. Because ESEs are often embedded within proteincoding sequences, alterations in them can also often be interpreted as nonsense, missense or silent mutations. To correctly interpret exonic mutations and their roles in disease, it is important to develop strategies that identify ESE mutations. Potential ESEs can be found computationally in many exons but it has proven difficult to predict if a given mutation will have effects on splicing based on sequence alone. Here we describe a flexible in vitro method that can be used to functionally compare the effects of multiple sequence variants on ESE activity in a single in vitro splicing reaction. We have applied this method in parallel with conventional splicing assays to test for a splicing enhancer in exon 17 of the human MLH1 gene. Point mutations associated with hereditary nonpolyposis colorectal cancer (HNPCC) have previously been found to correlate with exon-skipping in both lymphocytes and tumors from patients. We show that sequences from this exon can replace an ESE from the mouse IgM gene to support RNA splicing in HeLa nuclear extracts. ESE activity was reduced by HNPCC point mutations in codon 659 indicating that their primary effect is on splicing. Surprisingly the strongest enhancer function mapped to a different region of the exon upstream of this codon. Together our results indicate that HNPCC point mutations in codon 659 affect an auxillary element that augments the enhancer function to ensure exon inclusion. PMID:16357104
Vasquez, Kevin A; Hatridge, Taylor A; Curtis, Nicholas C; Contreras, Lydia M
2016-02-19
Recent studies have demonstrated that effective protein production requires coordination of multiple cotranslational cellular processes, which are heavily affected by translation timing. Until recently, protein engineering has focused on codon optimization to maximize protein production rates, mostly considering the effect of tRNA abundance. However, as it relates to complex multidomain proteins, it has been hypothesized that strategic translational pauses between domains and between distinct individual structural motifs can prevent interactions between nascent chain fragments that generate kinetically trapped misfolded peptides and thereby enhance protein yields. In this study, we introduce synthetic transient pauses between structural domains in a heterologous model protein based on designed patterns of affinity between the mRNA and the anti-Shine-Dalgarno (aSD) sequence on the ribosome. We demonstrate that optimizing translation attenuation at domain boundaries can predictably affect solubility patterns in bacteria. Exploration of the affinity space showed that modifying less than 1% of the nucleotides (on a small 12 amino acid linker) can vary soluble protein yields up to ∼7-fold without altering the primary sequence of the protein. In the context of longer linkers, where a larger number of distinct structural motifs can fold outside the ribosome, optimal synonymous codon variations resulted in an additional 2.1-fold increase in solubility, relative to that of nonoptimized linkers of the same length. While rational construction of 54 linkers of various affinities showed a significant correlation between protein solubility and predicted affinity, only weaker correlations were observed between tRNA abundance and protein solubility. We also demonstrate that naturally occurring high-affinity clusters are present between structural domains of β-galactosidase, one of Escherichia coli's largest native proteins. Interdomain ribosomal affinity is an important factor that has not previously been explored in the context of protein engineering.
Mezzanotte, Laura; Que, Ivo; Kaijzel, Eric; Branchini, Bruce; Roda, Aldo; Löwik, Clemens
2011-04-22
Despite a plethora of bioluminescent reporter genes being cloned and used for cell assays and molecular imaging purposes, the simultaneous monitoring of multiple events in small animals is still challenging. This is partly attributable to the lack of optimization of cell reporter gene expression as well as too much spectral overlap of the color-coupled reporter genes. A new red emitting codon-optimized luciferase reporter gene mutant of Photinus pyralis, Ppy RE8, has been developed and used in combination with the green click beetle luciferase, CBG99. Human embryonic kidney cells (HEK293) were transfected with vectors that expressed red Ppy RE8 and green CBG99 luciferases. Populations of red and green emitting cells were mixed in different ratios. After addition of the shared single substrate, D-luciferin, bioluminescent (BL) signals were imaged with an ultrasensitive cooled CCD camera using a series of band pass filters (20 nm). Spectral unmixing algorithms were applied to the images where good separation of signals was observed. Furthermore, HEK293 cells that expressed the two luciferases were injected at different depth in the animals. Spectrally-separate images and quantification of the dual BL signals in a mixed population of cells was achieved when cells were either injected subcutaneously or directly into the prostate. We report here the re-engineering of different luciferase genes for in vitro and in vivo dual color imaging applications to address the technical issues of using dual luciferases for imaging. In respect to previously used dual assays, our study demonstrated enhanced sensitivity combined with spatially separate BL spectral emissions using a suitable spectral unmixing algorithm. This new D-luciferin-dependent reporter gene couplet opens up the possibility in the future for more accurate quantitative gene expression studies in vivo by simultaneously monitoring two events in real time.
Mezzanotte, Laura; Que, Ivo; Kaijzel, Eric; Branchini, Bruce; Roda, Aldo; Löwik, Clemens
2011-01-01
Background Despite a plethora of bioluminescent reporter genes being cloned and used for cell assays and molecular imaging purposes, the simultaneous monitoring of multiple events in small animals is still challenging. This is partly attributable to the lack of optimization of cell reporter gene expression as well as too much spectral overlap of the color-coupled reporter genes. A new red emitting codon-optimized luciferase reporter gene mutant of Photinus pyralis, Ppy RE8, has been developed and used in combination with the green click beetle luciferase, CBG99. Principal Findings Human embryonic kidney cells (HEK293) were transfected with vectors that expressed red Ppy RE8 and green CBG99 luciferases. Populations of red and green emitting cells were mixed in different ratios. After addition of the shared single substrate, D-luciferin, bioluminescent (BL) signals were imaged with an ultrasensitive cooled CCD camera using a series of band pass filters (20 nm). Spectral unmixing algorithms were applied to the images where good separation of signals was observed. Furthermore, HEK293 cells that expressed the two luciferases were injected at different depth in the animals. Spectrally-separate images and quantification of the dual BL signals in a mixed population of cells was achieved when cells were either injected subcutaneously or directly into the prostate. Significance We report here the re-engineering of different luciferase genes for in vitro and in vivo dual color imaging applications to address the technical issues of using dual luciferases for imaging. In respect to previously used dual assays, our study demonstrated enhanced sensitivity combined with spatially separate BL spectral emissions using a suitable spectral unmixing algorithm. This new D-luciferin-dependent reporter gene couplet opens up the possibility in the future for more accurate quantitative gene expression studies in vivo by simultaneously monitoring two events in real time. PMID:21544210
Does adaptation to vertebrate codon usage relate to flavivirus emergence potential?
Freire, Caio César de Melo
2018-01-01
Codon adaptation index (CAI) is a measure of synonymous codon usage biases given a usage reference. Through mutation, selection, and drift, viruses can optimize their replication efficiency and produce more offspring, which could increase the chance of secondary transmission. To evaluate how higher CAI towards the host has been associated with higher viral titers, we explored temporal trends of several historic and extensively sequenced zoonotic flaviviruses and relationships within the genus itself. To showcase evolutionary and epidemiological relationships associated with silent, adaptive synonymous changes of viruses, we used codon usage tables from human housekeeping and antiviral immune genes, as well as tables from arthropod vectors and vertebrate species involved in the flavivirus maintenance cycle. We argue that temporal trends of CAI changes could lead to a better understanding of zoonotic emergences, evolutionary dynamics, and host adaptation. CAI appears to help illustrate historically relevant trends of well-characterized viruses, in different viral species and genetic diversity within a single species. CAI can be a useful tool together with in vivo and in vitro kinetics, phylodynamics, and additional functional genomics studies to better understand species trafficking and viral emergence in a new host. PMID:29385205
The Role of +4U as an Extended Translation Termination Signal in Bacteria
Wei, Yulong; Xia, Xuhua
2017-01-01
Termination efficiency of stop codons depends on the first 3′ flanking (+4) base in bacteria and eukaryotes. In both Escherichia coli and Saccharomyces cerevisiae, termination read-through is reduced in the presence of +4U; however, the molecular mechanism underlying +4U function is poorly understood. Here, we perform comparative genomics analysis on 25 bacterial species (covering Actinobacteria, Bacteriodetes, Cyanobacteria, Deinococcus-Thermus, Firmicutes, Proteobacteria, and Spirochaetae) with bioinformatics approaches to examine the influence of +4U in bacterial translation termination by contrasting highly- and lowly-expressed genes (HEGs and LEGs, respectively). We estimated gene expression using the recently formulated Index of Translation Elongation, ITE, and identified stop codon near-cognate transfer RNAs (tRNAs) from well-annotated genomes. We show that +4U was consistently overrepresented in UAA-ending HEGs relative to LEGs. The result is consistent with the interpretation that +4U enhances termination mainly for UAA. Usage of +4U decreases in GC-rich species where most stop codons are UGA and UAG, with few UAA-ending genes, which is expected if UAA usage in HEGs drives up +4U usage. In HEGs, +4U usage increases significantly with abundance of UAA nc_tRNAs (near-cognate tRNAs that decode codons differing from UAA by a single nucleotide), particularly those with a mismatch at the first stop codon site. UAA is always the preferred stop codon in HEGs, and our results suggest that UAAU is the most efficient translation termination signal in bacteria. PMID:27903612
Vladimirov, N V; Likhoshvaĭ, V A; Matushkin, Iu G
2007-01-01
Gene expression is known to correlate with degree of codon bias in many unicellular organisms. However, such correlation is absent in some organisms. Recently we demonstrated that inverted complementary repeats within coding DNA sequence must be considered for proper estimation of translation efficiency, since they may form secondary structures that obstruct ribosome movement. We have developed a program for estimation of potential coding DNA sequence expression in defined unicellular organism using its genome sequence. The program computes elongation efficiency index. Computation is based on estimation of coding DNA sequence elongation efficiency, taking into account three key factors: codon bias, average number of inverted complementary repeats, and free energy of potential stem-loop structures formed by the repeats. The influence of these factors on translation is numerically estimated. An optimal proportion of these factors is computed for each organism individually. Quantitative translational characteristics of 384 unicellular organisms (351 bacteria, 28 archaea, 5 eukaryota) have been computed using their annotated genomes from NCBI GenBank. Five potential evolutionary strategies of translational optimization have been determined among studied organisms. A considerable difference of preferred translational strategies between Bacteria and Archaea has been revealed. Significant correlations between elongation efficiency index and gene expression levels have been shown for two organisms (S. cerevisiae and H. pylori) using available microarray data. The proposed method allows to estimate numerically the coding DNA sequence translation efficiency and to optimize nucleotide composition of heterologous genes in unicellular organisms. http://www.mgs.bionet.nsc.ru/mgs/programs/eei-calculator/.
Tran, Tuan-Anh; Vo, Nam Tri; Nguyen, Hoang Duc; Pham, Bao The
2015-12-01
Recombinant proteins play an important role in many aspects of life and have generated a huge income, notably in the industrial enzyme business. A gene is introduced into a vector and expressed in a host organism-for example, E. coli-to obtain a high productivity of target protein. However, transferred genes from particular organisms are not usually compatible with the host's expression system because of various reasons, for example, codon usage bias, GC content, repetitive sequences, and secondary structure. The solution is developing programs to optimize for designing a nucleotide sequence whose origin is from peptide sequences using properties of highly expressed genes (HEGs) of the host organism. Existing data of HEGs determined by practical and computer-based methods do not satisfy for qualifying and quantifying. Therefore, the demand for developing a new HEG prediction method is critical. We proposed a new method for predicting HEGs and criteria to evaluate gene optimization. Codon usage bias was weighted by amplifying the difference between HEGs and non-highly expressed genes (non-HEGs). The number of predicted HEGs is 5% of the genome. In comparison with Puigbò's method, the result is twice as good as Puigbò's one, in kernel ratio and kernel sensitivity. Concerning transcription/translation factor proteins (TF), the proposed method gives low TF sensitivity, while Puigbò's method gives moderate one. In summary, the results indicated that the proposed method can be a good optional applying method to predict optimized genes for particular organisms, and we generated an HEG database for further researches in gene design.
Expression of a functional cold active β-galactosidase from Planococcus sp-L4 in Pichia pastoris.
Mahdian, Seyed Mohammad Amin; Karimi, Ehsan; Tanipour, Mohammad Hossein; Parizadeh, Seyed Mohammad Reza; Ghayour-Mobarhan, Majid; Bazaz, Mojtaba Mousavi; Mashkani, Baratali
2016-09-01
Lactase deficiency problem discourages many adults from consuming milk as a major source of micro- and macronutrients. Enzymatic hydrolysis of lactose is an ideal solution for this problem but such processing adds significant costs. In this study, a cold active β-galactosidase from Planococcus sp-L4 (bgal) was optimized for expression of recombinant "BGalP" in Pichia pastoris. As a result of codon optimization, the codon adaptation index was improved from 0.58 to 0.85 after replacing rare codons. After transformation of two P. pastoris strains (KM71H and GS115), the activity of BGalP enzyme was measured in the culture supernatants using ortho-Nitrophenyl-β-galactoside (ONPG). Maximal activity was recorded as 3.7U/ml on day 11 in KM71H clone #2 which was 20% higher than the best GS115 clone. Activity measurements under different conditions indicated optimal activity at pH 6.5. It was active at temperatures ranging from 0 to 55°C with deactivation occurring at or above 60°C. Protein analysis of the crude ultra-filtrate showed the enzyme was ∼75kDa and was the major constituent (85%) of the sample. This enzyme have the potential to find utility for the breakdown of lactose in chilled milk and subsequently can be deactivated by pasteurization. The use of BGalP would minimize energy consumption thus decreasing cost and also help to preserve the nutritional elements of the milk. Copyright © 2015 Elsevier Inc. All rights reserved.
The impact of KRAS mutations on VEGF-A production and tumour vascular network
2013-01-01
Background The malignant potential of tumour cells may be influenced by the molecular nature of KRAS mutations being codon 13 mutations less aggressive than codon 12 ones. Their metabolic profile is also different, with an increased anaerobic glycolytic metabolism in cells harbouring codon 12 KRAS mutations compared with cells containing codon 13 mutations. We hypothesized that this distinct metabolic behaviour could be associated with different HIF-1α expression and a distinct angiogenic profile. Methods Codon13 KRAS mutation (ASP13) or codon12 KRAS mutation (CYS12) NIH3T3 transfectants were analyzed in vitro and in vivo. Expression of HIF-1α, and VEGF-A was studied at RNA and protein levels. Regulation of VEGF-A promoter activity was assessed by means of luciferase assays using different plasmid constructs. Vascular network was assessed in tumors growing after subcutaneous inoculation. Non parametric statistics were used for analysis of results. Results Our results show that in normoxic conditions ASP13 transfectants exhibited less HIF-1α protein levels and activity than CYS12. In contrast, codon 13 transfectants exhibited higher VEGF-A mRNA and protein levels and enhanced VEGF-A promoter activity. These differences were due to a differential activation of Sp1/AP2 transcription elements of the VEGF-A promoter associated with increased ERKs signalling in ASP13 transfectants. Subcutaneous CYS12 tumours expressed less VEGF-A and showed a higher microvessel density (MVD) than ASP13 tumours. In contrast, prominent vessels were only observed in the latter. Conclusion Subtle changes in the molecular nature of KRAS oncogene activating mutations occurring in tumour cells have a major impact on the vascular strategy devised providing with new insights on the role of KRAS mutations on angiogenesis. PMID:23506169
Sarrazin, Sandrine; Starck, Joëlle; Gonnet, Colette; Doubeikovski, Alexandre; Melet, Fabrice; Morle, François
2000-01-01
The proto-oncogene Fli-1 encodes a transcription factor of the ets family whose overexpression is associated with multiple virally induced leukemias in mouse, inhibits murine and avian erythroid cell differentiation, and induces drastic perturbations of early development in Xenopus. This study demonstrates the surprisingly sophisticated regulation of Fli-1 mRNA translation. We establish that two FLI-1 protein isoforms (of 51 and 48 kDa) detected by Western blotting in vivo are synthesized by alternative translation initiation through the use of two highly conserved in-frame initiation codons, AUG +1 and AUG +100. Furthermore, we show that the synthesis of these two FLI-1 isoforms is regulated by two short overlapping 5′ upstream open reading frames (uORF) beginning at two highly conserved upstream initiation codons, AUG −41 and GUG −37, and terminating at two highly conserved stop codons, UGA +35 and UAA +15. The mutational analysis of these two 5′ uORF revealed that each of them negatively regulates FLI-1 protein synthesis by precluding cap-dependent scanning to the 48- and 51-kDa AUG codons. Simultaneously, the translation termination of the two 5′ uORF appears to enhance 48-kDa protein synthesis, by allowing downstream reinitiation at the 48-kDa AUG codon, and 51-kDa protein synthesis, by allowing scanning ribosomes to pile up and consequently allowing upstream initiation at the 51-kDa AUG codon. To our knowledge, this is the first example of a cellular mRNA displaying overlapping 5′ uORF whose translation termination appears to be involved in the positive control of translation initiation at both downstream and upstream initiation codons. PMID:10757781
Mutations in eukaryotic release factors 1 and 3 act as general nonsense suppressors in Drosophila.
Chao, Anna T; Dierick, Herman A; Addy, Tracie M; Bejsovec, Amy
2003-01-01
In a screen for suppressors of the Drosophila wingless(PE4) nonsense allele, we isolated mutations in the two components that form eukaryotic release factor. eRF1 and eRF3 comprise the translation termination complex that recognizes stop codons and catalyzes the release of nascent polypeptide chains from ribosomes. Mutations disrupting the Drosophila eRF1 and eRF3 show a strong maternal-effect nonsense suppression due to readthrough of stop codons and are zygotically lethal during larval stages. We tested nonsense mutations in wg and in other embryonically acting genes and found that different stop codons can be suppressed but only a subset of nonsense alleles are subject to suppression. We suspect that the context of the stop codon is significant: nonsense alleles sensitive to suppression by eRF1 and eRF3 encode stop codons that are immediately followed by a cytidine. Such suppressible alleles appear to be intrinsically weak, with a low level of readthrough that is enhanced when translation termination is disrupted. Thus the eRF1 and eRF3 mutations provide a tool for identifying nonsense alleles that are leaky. Our findings have important implications for assigning null mutant phenotypes and for selecting appropriate alleles to use in suppressor screens. PMID:14573473
Floquet, Célia; Hatin, Isabelle; Rousset, Jean-Pierre; Bidou, Laure
2012-01-01
The efficiency of translation termination depends on the nature of the stop codon and the surrounding nucleotides. Some molecules, such as aminoglycoside antibiotics (gentamicin), decrease termination efficiency and are currently being evaluated for diseases caused by premature termination codons. However, the readthrough response to treatment is highly variable and little is known about the rules governing readthrough level and response to aminoglycosides. In this study, we carried out in-depth statistical analysis on a very large set of nonsense mutations to decipher the elements of nucleotide context responsible for modulating readthrough levels and gentamicin response. We quantified readthrough for 66 sequences containing a stop codon, in the presence and absence of gentamicin, in cultured mammalian cells. We demonstrated that the efficiency of readthrough after treatment is determined by the complex interplay between the stop codon and a larger sequence context. There was a strong positive correlation between basal and induced readthrough levels, and a weak negative correlation between basal readthrough level and gentamicin response (i.e. the factor of increase from basal to induced readthrough levels). The identity of the stop codon did not affect the response to gentamicin treatment. In agreement with a previous report, we confirm that the presence of a cytosine in +4 position promotes higher basal and gentamicin-induced readthrough than other nucleotides. We highlight for the first time that the presence of a uracil residue immediately upstream from the stop codon is a major determinant of the response to gentamicin. Moreover, this effect was mediated by the nucleotide itself, rather than by the amino-acid or tRNA corresponding to the −1 codon. Finally, we point out that a uracil at this position associated with a cytosine at +4 results in an optimal gentamicin-induced readthrough, which is the therapeutically relevant variable. PMID:22479203
Global analysis of translation termination in E. coli
Baggett, Natalie E.
2017-01-01
Terminating protein translation accurately and efficiently is critical for both protein fidelity and ribosome recycling for continued translation. The three bacterial release factors (RFs) play key roles: RF1 and 2 recognize stop codons and terminate translation; and RF3 promotes disassociation of bound release factors. Probing release factors mutations with reporter constructs containing programmed frameshifting sequences or premature stop codons had revealed a propensity for readthrough or frameshifting at these specific sites, but their effects on translation genome-wide have not been examined. We performed ribosome profiling on a set of isogenic strains with well-characterized release factor mutations to determine how they alter translation globally. Consistent with their known defects, strains with increasingly severe release factor defects exhibit increasingly severe accumulation of ribosomes over stop codons, indicative of an increased duration of the termination/release phase of translation. Release factor mutant strains also exhibit increased occupancy in the region following the stop codon at a significant number of genes. Our global analysis revealed that, as expected, translation termination is generally efficient and accurate, but that at a significant number of genes (≥ 50) the ribosome signature after the stop codon is suggestive of translation past the stop codon. Even native E. coli K-12 exhibits the ribosome signature suggestive of protein extension, especially at UGA codons, which rely exclusively on the reduced function RF2 variant of the K-12 strain for termination. Deletion of RF3 increases the severity of the defect. We unambiguously demonstrate readthrough and frameshifting protein extensions and their further accumulation in mutant strains for a few select cases. In addition to enhancing recoding, ribosome accumulation over stop codons disrupts attenuation control of biosynthetic operons, and may alter expression of some overlapping genes. Together, these functional alterations may either augment the protein repertoire or produce deleterious proteins. PMID:28301469
Datta, Dibyadyuti; Bansal, Geetha P; Gerloff, Dietlind L; Ellefsen, Barry; Hannaman, Drew; Kumar, Nirbhay
2017-01-05
Pfs48/45 and Pfs25 are leading candidates for the development of Plasmodium falciparum transmission blocking vaccines (TBV). Expression of Pfs48/45 in the erythrocytic sexual stages and presentation to the immune system during infection in the human host also makes it ideal for natural boosting. However, it has been challenging to produce a fully folded, functionally active Pfs48/45, using various protein expression platforms. In this study, we demonstrate that full-length Pfs48/45 encoded by DNA plasmids is able to induce significant transmission reducing immune responses. DNA plasmids encoding Pfs48/45 based on native (WT), codon optimized (SYN), or codon optimized and mutated (MUT1 and MUT2), to prevent any asparagine (N)-linked glycosylation were compared with or without intramuscular electroporation (EP). EP significantly enhanced antibody titers and transmission blocking activity elicited by immunization with SYN Pfs48/45 DNA vaccine. Mosquito membrane feeding assays also revealed improved functional immunogenicity of SYN Pfs48/45 (N-glycosylation sites intact) as compared to MUT1 or MUT2 Pfs48/45 DNA plasmids (all N-glycosylation sites mutated). Boosting with recombinant Pfs48/45 protein after immunization with each of the different DNA vaccines resulted in significant boosting of antibody response and improved transmission reducing capabilities of all four DNA vaccines. Finally, immunization with a combination of DNA plasmids (SYN Pfs48/45 and SYN Pfs25) also provides support for the possibility of combining antigens targeting different life cycle stages in the parasite during transmission through mosquitoes. Copyright © 2016 Elsevier Ltd. All rights reserved.
Biotechnical paving of recombinant enterocin A as the candidate of anti-Listeria agent.
Hu, Xiaoyuan; Mao, Ruoyu; Zhang, Yong; Teng, Da; Wang, Xiumin; Xi, Di; Huang, Jianzhong; Wang, Jianhua
2014-08-28
Enterocin A is a classic IIa bacteriocin isolated firstly from Enterococcus faecium CTC492 with selective antimicrobial activity against Listeria strains. However, the application of enterocin A as an anti-Listeria agent has been limited due to its very low native yield. The present work describes high production of enterocin A through codon optimization strategy and its character study. The gene sequence of enterocin A was optimized based on preferential codon usage in Pichia pastoris to increase its expression efficiency. The highest anti-Listeria activity reached 51,200 AU/ml from 180 mg/l of total protein after 24 h of induction in a 5-L fermenter. Recombinant enterocin A (rEntA), purified by gel filtration chromatography, showed very strong activity against Listeria ivanovii ATCC 19119 with a low MIC of 20 ng/ml. In addition, the rEntA killed over 99% of tested L. ivanovii ATCC19119 within 4 h when exposed to 4 × MIC (80 ng/ml). Moreover, it showed high stability under a wide pH range (2-10) and maintained full activity after 1 h of treatment at 80°C within a pH range of 2-8. Its antimicrobial activity was enhanced at 25 and 50 mM NaCl, while 100-400 mM NaCl had little effect on the bactericidal ability of rEntA. The EntA was successfully expressed in P. pastoris, and this feasible system could pave the pre-industrial technological path of rEntA as a competent candidate as an anti-Listeria agent. Furthermore, it showed high stability under wide ranges of conditions, which could be potential as the new candidate of anti-Listeria agent.
A Candidate H1N1 Pandemic Influenza Vaccine Elicits Protective Immunity in Mice
Steitz, Julia; Barlow, Peter G.; Hossain, Jaber; Kim, Eun; Okada, Kaori; Kenniston, Tom; Rea, Sheri; Donis, Ruben O.; Gambotto, Andrea
2010-01-01
Background In 2009 a new pandemic disease appeared and spread globally. The recent emergence of the pandemic influenza virus H1N1 first isolated in Mexico and USA raised concerns about vaccine availability. We here report our development of an adenovirus-based influenza H1N1 vaccine tested for immunogenicity and efficacy to confer protection in animal model. Methods We generated two adenovirus(Ad5)-based influenza vaccine candidates encoding the wildtype or a codon-optimized hemagglutinin antigen (HA) from the recently emerged swine influenza isolate A/California/04/2009 (H1N1)pdm. After verification of antigen expression, immunogenicity of the vaccine candidates were tested in a mouse model using dose escalations for subcutaneous immunization. Sera of immunized animals were tested in microneutalization and hemagglutination inhibition assays for the presence of HA-specific antibodies. HA-specific T-cells were measured in IFNγ Elispot assays. The efficiency of the influenza vaccine candidates were evaluated in a challenge model by measuring viral titer in lung and nasal turbinate 3 days after inoculation of a homologous H1N1 virus. Conclusions/Significance A single immunization resulted in robust cellular and humoral immune response. Remarkably, the intensity of the immune response was substantially enhanced with codon-optimized antigen, indicating the benefit of manipulating the genetic code of HA antigens in the context of recombinant influenza vaccine design. These results highlight the value of advanced technologies in vaccine development and deployment in response to infections with pandemic potential. Our study emphasizes the potential of an adenoviral-based influenza vaccine platform with the benefits of speed of manufacture and efficacy of a single dose immunization. PMID:20463955
Ederveen, Thomas H. A.; Overmars, Lex; van Hijum, Sacha A. F. T.
2013-01-01
Nowadays, prokaryotic genomes are sequenced faster than the capacity to manually curate gene annotations. Automated genome annotation engines provide users a straight-forward and complete solution for predicting ORF coordinates and function. For many labs, the use of AGEs is therefore essential to decrease the time necessary for annotating a given prokaryotic genome. However, it is not uncommon for AGEs to provide different and sometimes conflicting predictions. Combining multiple AGEs might allow for more accurate predictions. Here we analyzed the ab initio open reading frame (ORF) calling performance of different AGEs based on curated genome annotations of eight strains from different bacterial species with GC% ranging from 35–52%. We present a case study which demonstrates a novel way of comparative genome annotation, using combinations of AGEs in a pre-defined order (or path) to predict ORF start codons. The order of AGE combinations is from high to low specificity, where the specificity is based on the eight genome annotations. For each AGE combination we are able to derive a so-called projected confidence value, which is the average specificity of ORF start codon prediction based on the eight genomes. The projected confidence enables estimating likeliness of a correct prediction for a particular ORF start codon by a particular AGE combination, pinpointing ORFs notoriously difficult to predict start codons. We correctly predict start codons for 90.5±4.8% of the genes in a genome (based on the eight genomes) with an accuracy of 81.1±7.6%. Our consensus-path methodology allows a marked improvement over majority voting (9.7±4.4%) and with an optimal path ORF start prediction sensitivity is gained while maintaining a high specificity. PMID:23675487
New Universal Rules of Eukaryotic Translation Initiation Fidelity
Zur, Hadas; Tuller, Tamir
2013-01-01
The accepted model of eukaryotic translation initiation begins with the scanning of the transcript by the pre-initiation complex from the 5′end until an ATG codon with a specific nucleotide (nt) context surrounding it is recognized (Kozak rule). According to this model, ATG codons upstream to the beginning of the ORF should affect translation. We perform for the first time, a genome-wide statistical analysis, uncovering a new, more comprehensive and quantitative, set of initiation rules for improving the cost of translation and its efficiency. Analyzing dozens of eukaryotic genomes, we find that in all frames there is a universal trend of selection for low numbers of ATG codons; specifically, 16–27 codons upstream, but also 5–11 codons downstream of the START ATG, include less ATG codons than expected. We further suggest that there is selection for anti optimal ATG contexts in the vicinity of the START ATG. Thus, the efficiency and fidelity of translation initiation is encoded in the 5′UTR as required by the scanning model, but also at the beginning of the ORF. The observed nt patterns suggest that in all the analyzed organisms the pre-initiation complex often misses the START ATG of the ORF, and may start translation from an alternative initiation start-site. Thus, to prevent the translation of undesired proteins, there is selection for nucleotide sequences with low affinity to the pre-initiation complex near the beginning of the ORF. With the new suggested rules we were able to obtain a twice higher correlation with ribosomal density and protein levels in comparison to the Kozak rule alone (e.g. for protein levels r = 0.7 vs. r = 0.31; p<10−12). PMID:23874179
Analyses of frameshifting at UUU-pyrimidine sites.
Schwartz, R; Curran, J F
1997-05-15
Others have recently shown that the UUU phenylalanine codon is highly frameshift-prone in the 3'(rightward) direction at pyrimidine 3'contexts. Here, several approaches are used to analyze frameshifting at such sites. The four permutations of the UUU/C (phenylalanine) and CGG/U (arginine) codon pairs were examined because they vary greatly in their expected frameshifting tendencies. Furthermore, these synonymous sites allow direct tests of the idea that codon usage can control frameshifting. Frameshifting was measured for these dicodons embedded within each of two broader contexts: the Escherichia coli prfB (RF2 gene) programmed frameshift site and a 'normal' message site. The principal difference between these contexts is that the programmed frameshift contains a purine-rich sequence upstream of the slippery site that can base pair with the 3'end of 16 S rRNA (the anti-Shine-Dalgarno) to enhance frameshifting. In both contexts frameshift frequencies are highest if the slippery tRNAPhe is capable of stable base pairing in the shifted reading frame. This requirement is less stringent in the RF2 context, as if the Shine-Dalgarno interaction can help stabilize a quasi-stable rephased tRNA:message complex. It was previously shown that frameshifting in RF2 occurs more frequently if the codon 3'to the slippery site is read by a rare tRNA. Consistent with that earlier work, in the RF2 context frameshifting occurs substantially more frequently if the arginine codon is CGG, which is read by a rare tRNA. In contrast, in the 'normal' context frameshifting is only slightly greater at CGG than at CGU. It is suggested that the Shine-Dalgarno-like interaction elevates frameshifting specifically during the pause prior to translation of the second codon, which makes frameshifting exquisitely sensitive to the rate of translation of that codon. In both contexts frameshifting increases in a mutant strain that fails to modify tRNA base A37, which is 3'of the anticodon. Thus, those base modifications may limit frameshifting at UUU codons. Finally, statistical analyses show that UUU Ynn dicodons are extremely rare in E.coli genes that have highly biased codon usage.
Analyses of frameshifting at UUU-pyrimidine sites.
Schwartz, R; Curran, J F
1997-01-01
Others have recently shown that the UUU phenylalanine codon is highly frameshift-prone in the 3'(rightward) direction at pyrimidine 3'contexts. Here, several approaches are used to analyze frameshifting at such sites. The four permutations of the UUU/C (phenylalanine) and CGG/U (arginine) codon pairs were examined because they vary greatly in their expected frameshifting tendencies. Furthermore, these synonymous sites allow direct tests of the idea that codon usage can control frameshifting. Frameshifting was measured for these dicodons embedded within each of two broader contexts: the Escherichia coli prfB (RF2 gene) programmed frameshift site and a 'normal' message site. The principal difference between these contexts is that the programmed frameshift contains a purine-rich sequence upstream of the slippery site that can base pair with the 3'end of 16 S rRNA (the anti-Shine-Dalgarno) to enhance frameshifting. In both contexts frameshift frequencies are highest if the slippery tRNAPhe is capable of stable base pairing in the shifted reading frame. This requirement is less stringent in the RF2 context, as if the Shine-Dalgarno interaction can help stabilize a quasi-stable rephased tRNA:message complex. It was previously shown that frameshifting in RF2 occurs more frequently if the codon 3'to the slippery site is read by a rare tRNA. Consistent with that earlier work, in the RF2 context frameshifting occurs substantially more frequently if the arginine codon is CGG, which is read by a rare tRNA. In contrast, in the 'normal' context frameshifting is only slightly greater at CGG than at CGU. It is suggested that the Shine-Dalgarno-like interaction elevates frameshifting specifically during the pause prior to translation of the second codon, which makes frameshifting exquisitely sensitive to the rate of translation of that codon. In both contexts frameshifting increases in a mutant strain that fails to modify tRNA base A37, which is 3'of the anticodon. Thus, those base modifications may limit frameshifting at UUU codons. Finally, statistical analyses show that UUU Ynn dicodons are extremely rare in E.coli genes that have highly biased codon usage. PMID:9115369
Dean, Kimberly M; Grayhack, Elizabeth J
2012-12-01
We have developed a robust and sensitive method, called RNA-ID, to screen for cis-regulatory sequences in RNA using fluorescence-activated cell sorting (FACS) of yeast cells bearing a reporter in which expression of both superfolder green fluorescent protein (GFP) and yeast codon-optimized mCherry red fluorescent protein (RFP) is driven by the bidirectional GAL1,10 promoter. This method recapitulates previously reported progressive inhibition of translation mediated by increasing numbers of CGA codon pairs, and restoration of expression by introduction of a tRNA with an anticodon that base pairs exactly with the CGA codon. This method also reproduces effects of paromomycin and context on stop codon read-through. Five key features of this method contribute to its effectiveness as a selection for regulatory sequences: The system exhibits greater than a 250-fold dynamic range, a quantitative and dose-dependent response to known inhibitory sequences, exquisite resolution that allows nearly complete physical separation of distinct populations, and a reproducible signal between different cells transformed with the identical reporter, all of which are coupled with simple methods involving ligation-independent cloning, to create large libraries. Moreover, we provide evidence that there are sequences within a 9-nt library that cause reduced GFP fluorescence, suggesting that there are novel cis-regulatory sequences to be found even in this short sequence space. This method is widely applicable to the study of both RNA-mediated and codon-mediated effects on expression.
[Prokaryotic expression of recombinant prochymosin gene and its antiserum preparation].
Li, Xin-ping; Liu, Huan-huan; Pu, Yan; Zhang, Fu-chun; Li, Yi-jie
2012-07-01
To optimize the prochymosin (pCHY) gene codons and express the gene in Escherichia coli (E.coli), and to prepare its antiserum and detect chymosin protein specifically. According to codon usage bias of E.coli, prochymosin gene sequence was synthesized based on the conserved sequences of prochymosin gene from bovine, lamb and camel, and then cloned into the plasmid pET-30a and pcDNA3-AAT-COMP-C3d3 (pcD-ACC), respectively. pET-30a-pCHY was expressed, as the detected antigen, in E.coli BL21(DE3) after IPTG induction. RT-PCR was used to detect prochymosin mRNA expression in liver from the mice injected pcDNA3-AAT-COMP-pCHY-C3d3(pACCC) by hydrodynamics-based transfection method. To prepare the antiserum of prochymosin, pACCC and GST-pCHY proteins were used to immunize New Zealand rabbits in accordance with DNA prime-protein boost strategy. Antibody levels were tested by ELISA. Western blotting showed the molecular weight of His-pCHY protein was about 55 000, similar to the expected molecular size. ELISA demonstrated that the titer level of prochymosin antiserum was high. Based on the codon optimization, we have obtained high-titer prochymosin antiserum through DNA vaccine vector pcD-ACC combined with DNA prime-protein boost strategy, similar to that by protein vaccine.
DyNAVacS: an integrative tool for optimized DNA vaccine design.
Harish, Nagarajan; Gupta, Rekha; Agarwal, Parul; Scaria, Vinod; Pillai, Beena
2006-07-01
DNA vaccines have slowly emerged as keystones in preventive immunology due to their versatility in inducing both cell-mediated as well as humoral immune responses. The design of an efficient DNA vaccine, involves choice of a suitable expression vector, ensuring optimal expression by codon optimization, engineering CpG motifs for enhancing immune responses and providing additional sequence signals for efficient translation. DyNAVacS is a web-based tool created for rapid and easy design of DNA vaccines. It follows a step-wise design flow, which guides the user through the various sequential steps in the design of the vaccine. Further, it allows restriction enzyme mapping, design of primers spanning user specified sequences and provides information regarding the vectors currently used for generation of DNA vaccines. The web version uses Apache HTTP server. The interface was written in HTML and utilizes the Common Gateway Interface scripts written in PERL for functionality. DyNAVacS is an integrated tool consisting of user-friendly programs, which require minimal information from the user. The software is available free of cost, as a web based application at URL: http://miracle.igib.res.in/dynavac/.
Borggren, Marie; Vinner, Lasse; Andresen, Betina Skovgaard; Grevstad, Berit; Repits, Johanna; Melchers, Mark; Elvang, Tara Laura; Sanders, Rogier W; Martinon, Frédéric; Dereuddre-Bosquet, Nathalie; Bowles, Emma Joanne; Stewart-Jones, Guillaume; Biswas, Priscilla; Scarlatti, Gabriella; Jansson, Marianne; Heyndrickx, Leo; Grand, Roger Le; Fomsgaard, Anders
2013-07-19
HIV-1 DNA vaccines have many advantageous features. Evaluation of HIV-1 vaccine candidates often starts in small animal models before macaque and human trials. Here, we selected and optimized DNA vaccine candidates through systematic testing in rabbits for the induction of broadly neutralizing antibodies (bNAb). We compared three different animal models: guinea pigs, rabbits and cynomolgus macaques. Envelope genes from the prototype isolate HIV-1 Bx08 and two elite neutralizers were included. Codon-optimized genes, encoded secreted gp140 or membrane bound gp150, were modified for expression of stabilized soluble trimer gene products, and delivered individually or mixed. Specific IgG after repeated i.d. inoculations with electroporation confirmed in vivo expression and immunogenicity. Evaluations of rabbits and guinea pigs displayed similar results. The superior DNA construct in rabbits was a trivalent mix of non-modified codon-optimized gp140 envelope genes. Despite NAb responses with some potency and breadth in guinea pigs and rabbits, the DNA vaccinated macaques displayed less bNAb activity. It was concluded that a trivalent mix of non-modified gp140 genes from rationally selected clinical isolates was, in this study, the best option to induce high and broad NAb in the rabbit model, but this optimization does not directly translate into similar responses in cynomolgus macaques.
Borggren, Marie; Vinner, Lasse; Andresen, Betina Skovgaard; Grevstad, Berit; Repits, Johanna; Melchers, Mark; Elvang, Tara Laura; Sanders, Rogier W; Martinon, Frédéric; Dereuddre-Bosquet, Nathalie; Bowles, Emma Joanne; Stewart-Jones, Guillaume; Biswas, Priscilla; Scarlatti, Gabriella; Jansson, Marianne; Heyndrickx, Leo; Le Grand, Roger; Fomsgaard, Anders
2013-01-01
HIV-1 DNA vaccines have many advantageous features. Evaluation of HIV-1 vaccine candidates often starts in small animal models before macaque and human trials. Here, we selected and optimized DNA vaccine candidates through systematic testing in rabbits for the induction of broadly neutralizing antibodies (bNAb). We compared three different animal models: guinea pigs, rabbits and cynomolgus macaques. Envelope genes from the prototype isolate HIV-1 Bx08 and two elite neutralizers were included. Codon-optimized genes, encoded secreted gp140 or membrane bound gp150, were modified for expression of stabilized soluble trimer gene products, and delivered individually or mixed. Specific IgG after repeated i.d. inoculations with electroporation confirmed in vivo expression and immunogenicity. Evaluations of rabbits and guinea pigs displayed similar results. The superior DNA construct in rabbits was a trivalent mix of non-modified codon-optimized gp140 envelope genes. Despite NAb responses with some potency and breadth in guinea pigs and rabbits, the DNA vaccinated macaques displayed less bNAb activity. It was concluded that a trivalent mix of non-modified gp140 genes from rationally selected clinical isolates was, in this study, the best option to induce high and broad NAb in the rabbit model, but this optimization does not directly translate into similar responses in cynomolgus macaques. PMID:26344115
Au, Hilda H T; Jan, Eric
2012-01-01
The intergenic region internal ribosome entry site (IGR IRES) of the Dicistroviridae family adopts an overlapping triple pseudoknot structure to directly recruit the 80S ribosome in the absence of initiation factors. The pseudoknot I (PKI) domain of the IRES mimics a tRNA-like codon:anticodon interaction in the ribosomal P site to direct translation initiation from a non-AUG initiation codon in the A site. In this study, we have performed a comprehensive mutational analysis of this region to delineate the molecular parameters that drive IRES translation. We demonstrate that IRES-mediated translation can initiate at an alternate adjacent and overlapping start site, provided that basepairing interactions within PKI remain intact. Consistent with this, IGR IRES translation tolerates increases in the variable loop region that connects the anticodon- and codon-like elements within the PKI domain, as IRES activity remains relatively robust up to a 4-nucleotide insertion in this region. Finally, elements from an authentic tRNA anticodon stem-loop can functionally supplant corresponding regions within PKI. These results verify the importance of the codon:anticodon interaction of the PKI domain and further define the specific elements within the tRNA-like domain that contribute to optimal initiator Met-tRNA(i)-independent IRES translation.
Insights into Factorless Translational Initiation by the tRNA-Like Pseudoknot Domain of a Viral IRES
Au, Hilda H. T.; Jan, Eric
2012-01-01
The intergenic region internal ribosome entry site (IGR IRES) of the Dicistroviridae family adopts an overlapping triple pseudoknot structure to directly recruit the 80S ribosome in the absence of initiation factors. The pseudoknot I (PKI) domain of the IRES mimics a tRNA-like codon:anticodon interaction in the ribosomal P site to direct translation initiation from a non-AUG initiation codon in the A site. In this study, we have performed a comprehensive mutational analysis of this region to delineate the molecular parameters that drive IRES translation. We demonstrate that IRES-mediated translation can initiate at an alternate adjacent and overlapping start site, provided that basepairing interactions within PKI remain intact. Consistent with this, IGR IRES translation tolerates increases in the variable loop region that connects the anticodon- and codon-like elements within the PKI domain, as IRES activity remains relatively robust up to a 4-nucleotide insertion in this region. Finally, elements from an authentic tRNA anticodon stem-loop can functionally supplant corresponding regions within PKI. These results verify the importance of the codon:anticodon interaction of the PKI domain and further define the specific elements within the tRNA-like domain that contribute to optimal initiator Met-tRNAi-independent IRES translation. PMID:23236506
Chretien, Anne-Sophie; Harlé, Alexandre; Meyer-Lefebvre, Magali; Rouyer, Marie; Husson, Marie; Ramacci, Carole; Harter, Valentin; Genin, Pascal; Leroux, Agnès; Merlin, Jean-Louis
2013-02-01
KRAS mutation detection represents a crucial issue in metastatic colorectal cancer (mCRC). The optimization of KRAS mutation detection delay enabling rational prescription of first-line treatment in mCRC including anti-EGFR-targeted therapy requires robust and rapid molecular biology techniques. Routine analysis of mutations in codons 12 and 13 on 674 paraffin-embedded tissue specimens of mCRC has been performed for KRAS mutations detection using three molecular biology techniques, that is, high-resolution melting (HRM), polymerase chain reaction restriction fragment length polymorphism (PCR-RFLP), and allelic discrimination PCR (TaqMan PCR). Discordant cases were assessed with COBAS 4800 KRAS CE-IVD assay. Among the 674 tumor specimens, 1.5% (10/674) had excessive DNA degradation and could not be analyzed. KRAS mutations were detected in 38.0% (256/674) of the analysable specimens (82.4% in codon 12 and 17.6% in codon 13). Among 613 specimens in whom all three techniques were used, 12 (2.0%) cases of discordance between the three techniques were observed. 83.3% (10/12) of the discordances were due to PCR-RFLP as confirmed by COBAS 4800 retrospective analysis. The three techniques were statistically comparable (κ > 0.9; P < 0.001). From these results, optimization of the routine procedure consisted of proceeding to systematic KRAS detection using HRM and TaqMan and PCR-RFLP in case of discordance and allowed significant decrease in delays. The results showed an excellent correlation between the three techniques. Using HRM and TaqMan warrants high-quality and rapid-routine KRAS mutation detection in paraffin-embedded tumor specimens. The new procedure allowed a significant decrease in delays for reporting results, enabling rational prescription of first-line-targeted therapy in mCRC.
Liu, Song; Wang, Miao; Du, Guocheng; Chen, Jian
2016-10-28
Transglutaminases (TGase), which are synthesized as a zymogen (pro-TGase) in Streptomyces sp., are important enzymes in the food industry. Because this pro-peptide is essential for the correct folding of Streptomyces TGase, TGase is usually expressed in an inactive pro-TGase form, which is then converted to active TGase by the addition of activating proteases in vitro. In this study, Streptomyces hygroscopicus TGase was actively produced by Streptomyces lividans through promoter engineering and codon optimization. A gene fragment (tg1, 2.6 kb) that encoded the pro-TGase and its endogenous promoter region, signal peptide and terminator was amplified from S. hygroscopicus WSH03-13 and cloned into plasmid pIJ86, which resulted in pIJ86/tg1. After fermentation for 2 days, S. lividans TK24 that harbored pIJ86/tg1 produced 1.8 U/mL of TGase, and a clear TGase band (38 kDa) was detected in the culture supernatant. These results indicated that the pro-TGase was successfully expressed and correctly processed into active TGase in S. lividans TK24 by using the TGase promoter. Based on deletion analysis, the complete sequence of the TGase promoter is restricted to the region from -693 to -48. We also identified a negative element (-198 to -148) in the TGase promoter, and the deletion of this element increased the TGase production by 81.3 %, in contrast to the method by which S. lividans expresses pIJ86/tg1. Combining the deletion of the negative element of the promoter and optimization of the gene codons, the yield and productivity of TGase reached 5.73 U/mL and 0.14 U/mL/h in the recombinant S. lividans, respectively. We constructed an active TGase-producing strain that had a high yield and productivity, and the optimized TGase promoter could be a good candidate promoter for the expression of other proteins in Streptomyces.
Sanli, G; Blaber, S I; Blaber, M
2001-01-01
Corynebacteria codon usage exhibits an overall GC content of 67%, and a wobble-position GC content of 88%. Escherichia coli, on the other hand has an overall GC content of 51%, and a wobble-position GC content of 55%. The high GC content of Corynebacteria genes results in an unfavorable codon preference for heterologous expression, and can present difficulties for polymerase-based manipulations due to secondary-structure effects. Since these characteristics are due primarily to base composition at the wobble-position, synthetic genes can, in principle, be designed to eliminate these problems and retain the wild-type amino acid sequence. Such genes would obviate the need for special additives or bases during in vitro polymerase-based manipulation and mutant host strains containing uncommon tRNA's for heterologous expression. We have evaluated synthetic genes with reduced wobble-position G/C content using two variants of the enzyme 2,5-diketo-D-gluconic acid reductase (2,5-DKGR A and B) from Corynebacterium. The wild-type genes are refractory to polymerase-based manipulations and exhibit poor heterologous expression in enteric bacteria. The results indicate that a subset of codons for five amino acids (alanine, arginine, glutamate, glycine and valine) contribute the greatest contribution to reduction in G/C content at the wobble-position. Furthermore, changes in codons for two amino acids (leucine and proline) enhance bias for expression in enteric bacteria without affecting the overall G/C content. The synthetic genes are readily amplified using polymerase-based methodologies, and exhibit high levels of heterologous expression in E. coli.
High-level expression of Camelid nanobodies in Nicotiana benthamiana.
Teh, Yi-Hui Audrey; Kavanagh, Tony A
2010-08-01
Nanobodies (or VHHs) are single-domain antigen-binding fragments derived from Camelid heavy chain-only antibodies. Their small size, monomeric behaviour, high stability and solubility, and ability to bind epitopes not accessible to conventional antibodies make them especially suitable for many therapeutic and biotechnological applications. Here we describe high-level expression, in Nicotiana benthamiana, of three versions of an anti-hen egg white lysozyme (HEWL) nanobody which include the original VHH from an immunized library (cAbLys3), a codon-optimized derivative, and a codon-optimized hybrid nanobody comprising the CDRs of cAbLys3 grafted onto an alternative 'universal' nanobody framework. His6- and StrepII-tagged derivatives of each nanobody were targeted for accumulation in the cytoplasm, chloroplast and apoplast using different pre-sequences. When targeted to the apoplast, intact functional nanobodies accumulated at an exceptionally high level (up to 30% total leaf protein), demonstrating the great potential of plants as a nanobody production system.
Wen, Jiexia; Pan, Sumin; Liang, Shuang; Zhong, Zhenyu; He, Ying; Lin, Hongyu; Li, Wenyan; Wang, Liyue; Li, Xiujin; Zhong, Fei
2013-01-01
Canine parvovirus (CPV) disease is an acute, highly infectious disease threatening the dog-raising industry. So far there are no effective therapeutic strategies to control this disease. Although the canine transferrin receptor (TfR) was identified as a receptor for CPV infection, whether extracellular domain of TfR (called soluble TfR (sTfR)) possesses anti-CPV activities remains elusive. Here, we used the recombinant sTfR prepared from HEK293T cells with codon-optimized gene structure to investigate its anti-CPV activity both in vitro and in vivo. Our results indicated that codon optimization could significantly improve sTfR expression in HEK293T cells. The prepared recombinant sTfR possessed a binding activity to both CPV and CPV VP2 capsid proteins and significantly inhibited CPV infection of cultured feline F81 cells and decreased the mortality of CPV-infected dogs, which indicates that the sTfR has the anti-CPV activity both in vitro and in vivo. PMID:24089666
Park, Soohyun; Hong, Soohye; Pack, Seung Pil; Lee, Jinwon
2014-02-01
Phosphoenolpyruvate carboxylase (PEPC) of Photobacterium profundum SS9 can be expressed and purified using the Escherichia coli expression system. In this study, a codon-optimized PEPC gene (OPPP) was used to increase expression levels. We confirmed OPPP expression and purified it from extracts of recombinant E. coli SGJS117 harboring the OPPP gene. The purified OPPP showed a specific activity value of 80.3 U/mg protein. The OPPP was stable under low temperature (5-30 °C) and weakly basic conditions (pH 8.5-10). The enzymatic ability of OPPP was investigated for in vitro production of oxaloacetate using phosphoenolpyruvate (PEP) and bicarbonate. Only samples containing the OPPP, PEP, and bicarbonate resulted in oxaloacetate production. OPPP production system using E. coli could be a platform technology to produce high yields of heterogeneous gene and provide the PEPC enzyme, which has high enzyme activity.
DNASynth: a software application to optimization of artificial gene synthesis
NASA Astrophysics Data System (ADS)
Muczyński, Jan; Nowak, Robert M.
2017-08-01
DNASynth is a client-server software application in which the client runs in a web browser. The aim of this program is to support and optimize process of artificial gene synthesizing using Ligase Chain Reaction. Thanks to LCR it is possible to obtain DNA strand coding defined by user peptide. The DNA sequence is calculated by optimization algorithm that consider optimal codon usage, minimal energy of secondary structures and minimal number of required LCR. Additionally absence of sequences characteristic for defined by user set of restriction enzymes is guaranteed. The presented software was tested on synthetic and real data.
Zhao, Qianqian; Liu, Fei; Hou, Zhongwen; Yuan, Chao; Zhu, Xiqiang
2014-03-01
A β-galactosidase gene from Aspergillus oryzae was engineered utilizing codon usage optimization to be constitutively and highly expressed in the Pichia pastoris SMD1168H strain in a high-cell-density fermentation. After fermentation for 96 h in a 50-L fermentor using glucose and glycerol as combined carbon sources, the recombinant enzyme in the culture supernatant had an activity of 4,239.07 U mL(-1) with o-nitrophenyl-β-D-galactopyranoside as the substrate, and produced a total of extracellular protein content of 7.267 g L(-1) in which the target protein (6.24 g L(-1)) occupied approximately 86 %. The recombinant β-galactosidase exhibited an excellent lactose hydrolysis ability. With 1,000 U of the enzyme in 100 mL milk, 92.44 % lactose was degraded within 24 h at 60 °C, and the enzyme could also accomplish the hydrolysis at low temperatures of 37, 25, and 10 °C. Thus, this engineered strain had significantly higher fermentation level of A. oryzae lactase than that before optimization and the β-galactosidase may have a good application potential in whey and milk industries.
Salvatori, Francesca; Pappadà, Mariangela; Breveglieri, Giulia; D'Aversa, Elisabetta; Finotti, Alessia; Lampronti, Ilaria; Gambari, Roberto; Borgatti, Monica
2018-05-15
Nonsense mutations promote premature translational termination, introducing stop codons within the coding region of mRNAs and causing inherited diseases, including thalassemia. For instance, in β 0 39 thalassemia the CAG (glutamine) codon is mutated to the UAG stop codon, leading to premature translation termination and to mRNA destabilization through the well described NMD (nonsense-mediated mRNA decay). In order to develop an approach facilitating translation and, therefore, protection from NMD, ribosomal read-through molecules, such as aminoglycoside antibiotics, have been tested on mRNAs carrying premature stop codons. These findings have introduced new hopes for the development of a pharmacological approach to the β 0 39 thalassemia therapy. While several strategies, designed to enhance translational read-through, have been reported to inhibit NMD efficiency concomitantly, experimental tools for systematic analysis of mammalian NMD inhibition by translational read-through are lacking. We developed a human cellular model of the β 0 39 thalassemia mutation with UPF-1 suppressed and showing a partial NMD suppression. This novel cellular model could be used for the screening of molecules exhibiting preferential read-through activity allowing a great rescue of the mutated transcripts.
Koutsoudakis, George; Urbanowicz, Richard A.; Mirza, Deeman; Ginkel, Corinne; Riebesehl, Nina; Calland, Noémie; Albecka, Anna; Price, Louisa; Hudson, Natalia; Descamps, Véronique; Backx, Matthijs; McClure, C. Patrick; Duverlie, Gilles; Pecheur, Eve-Isabelle; Dubuisson, Jean; Perez-del-Pulgar, Sofia; Forns, Xavier; Steinmann, Eike; Tarr, Alexander W.; Pietschmann, Thomas
2014-01-01
Serine is encoded by two divergent codon types, UCN and AGY, which are not interchangeable by a single nucleotide substitution. Switching between codon types therefore occurs via intermediates (threonine or cysteine) or via simultaneous tandem substitutions. Hepatitis C virus (HCV) chronically infects 2 to 3% of the global population. The highly variable glycoproteins E1 and E2 decorate the surface of the viral envelope, facilitate cellular entry, and are targets for host immunity. Comparative sequence analysis of globally sampled E1E2 genes, coupled with phylogenetic analysis, reveals the signatures of multiple archaic codon-switching events at seven highly conserved serine residues. Limited detection of intermediate phenotypes indicates that associated fitness costs restrict their fixation in divergent HCV lineages. Mutational pathways underlying codon switching were probed via reverse genetics, assessing glycoprotein functionality using multiple in vitro systems. These data demonstrate selection against intermediate phenotypes can act at the structural/functional level, with some intermediates displaying impaired virion assembly and/or decreased capacity for target cell entry. These effects act in residue/isolate-specific manner. Selection against intermediates is also provided by humoral targeting, with some intermediates exhibiting increased epitope exposure and enhanced neutralization sensitivity, despite maintaining a capacity for target cell entry. Thus, purifying selection against intermediates limits their frequencies in globally sampled strains, with divergent functional constraints at the protein level restricting the fixation of deleterious mutations. Overall our study provides an experimental framework for identification of barriers limiting viral substitutional evolution and indicates that serine codon-switching represents a genomic “fossil record” of historical purifying selection against E1E2 intermediate phenotypes. PMID:24173227
Application of SGT1-Hsp90 chaperone complex for soluble expression of NOD1 LRR domain in E. coli
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hong, Tae-Joon; Hahn, Ji-Sook
NOD1 is an intracellular sensor of innate immunity which is related to a number of inflammatory diseases. NOD1 is known to be difficult to express and purify for structural and biochemical studies. Based on the fact that Hsp90 and its cochaperone SGT1 are necessary for the stabilization and activation of NOD1 in mammals, SGT1 was chosen as a fusion partner of the leucine-rich repeat (LRR) domain of NOD1 for its soluble expression in Escherichia coli. Fusion of human SGT1 (hSGT1) to NOD1 LRR significantly enhanced the solubility, and the fusion protein was stabilized by coexpression of mouse Hsp90α. The expressionmore » level of hSGT1-NOD1 LRR was further enhanced by supplementation of rare codon tRNAs and exchange of antibiotic marker genes. - Highlights: • The NOD1 LRR domain was solubilized by SGT1 fusion in E. coli. • The coexpression of HSP90 stabilized the SGT1-NOD1 LRR fusion protein. • Several optimizations could enhance the expression level of the fusion protein.« less
Fletcher, Simon P; Ali, Iraj K; Kaminski, Ann; Digard, Paul; Jackson, Richard J
2002-01-01
Classical swine fever virus (CSFV) is a member of the pestivirus family, which shares many features in common with hepatitis C virus (HCV). It is shown here that CSFV has an exceptionally efficient cis-acting internal ribosome entry segment (IRES), which, like that of HCV, is strongly influenced by the sequences immediately downstream of the initiation codon, and is optimal with viral coding sequences in this position. Constructs that retained 17 or more codons of viral coding sequence exhibited full IRES activity, but with only 12 codons, activity was approximately 66% of maximum in vitro (though close to maximum in transfected BHK cells), whereas with just 3 codons or fewer, the activity was only approximately 15% of maximum. The minimal coding region elements required for high activity were exchanged between HCV and CSFV. Although maximum activity was observed in each case with the homologous combination of coding region and 5' UTR, the heterologous combinations were sufficiently active to rule out a highly specific functional interplay between the 5' UTR and coding sequences. On the other hand, inversion of the coding sequences resulted in low IRES activity, particularly with the HCV coding sequences. RNA structure probing showed that the efficiency of internal initiation of these chimeric constructs correlated most closely with the degree of single-strandedness of the region around and immediately downstream of the initiation codon. The low activity IRESs could not be rescued by addition of supplementary eIF4A (the initiation factor with ATP-dependent RNA helicase activity). The extreme sensitivity to secondary structure around the initiation codon is likely to be due to the fact that the eIF4F complex (which has eIF4A as one of its subunits) is not required for and does not participate in initiation on these IRESs. PMID:12515388
Gene Composer: database software for protein construct design, codon engineering, and gene synthesis
Lorimer, Don; Raymond, Amy; Walchli, John; Mixon, Mark; Barrow, Adrienne; Wallace, Ellen; Grice, Rena; Burgin, Alex; Stewart, Lance
2009-01-01
Background To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. Results An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. Conclusion We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene assembly procedure with mis-match specific endonuclease error correction in combination with PIPE cloning. In a sister manuscript we present data on how Gene Composer designed genes and protein constructs can result in improved protein production for structural studies. PMID:19383142
Lorimer, Don; Raymond, Amy; Walchli, John; Mixon, Mark; Barrow, Adrienne; Wallace, Ellen; Grice, Rena; Burgin, Alex; Stewart, Lance
2009-04-21
To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene assembly procedure with mis-match specific endonuclease error correction in combination with PIPE cloning. In a sister manuscript we present data on how Gene Composer designed genes and protein constructs can result in improved protein production for structural studies.
2010-01-01
The canonical genetic code is on a sub-optimal adaptive peak with respect to its ability to minimize errors, and is close to, but not quite, optimal. This is demonstrated by the near-total adjacency of synonymous codons, the similarity of adjacent codons, and comparisons of frequency of amino acid usage with number of codons in the code for each amino acid. As a rare empirical example of an adaptive peak in nature, it shows adaptive peaks are real, not merely theoretical. The evolution of deviant genetic codes illustrates how populations move from a lower to a higher adaptive peak. This is done by the use of “adaptive bridges,” neutral pathways that cross over maladaptive valleys by virtue of masking of the phenotypic expression of some maladaptive aspects in the genotype. This appears to be the general mechanism by which populations travel from one adaptive peak to another. There are multiple routes a population can follow to cross from one adaptive peak to another. These routes vary in the probability that they will be used, and this probability is determined by the number and nature of the mutations that happen along each of the routes. A modification of the depiction of adaptive landscapes showing genetic distances and probabilities of travel along their multiple possible routes would throw light on this important concept. PMID:20711776
Shi, Feng; Luan, Mingyue; Li, Yongfu
2018-04-18
Glutamate decarboxylase (GAD) converts L-glutamate (Glu) into γ-aminobutyric acid (GABA). Corynebacterium glutamicum that expresses exogenous GAD gene, gadB2 or gadB1, can synthesize GABA from its own produced Glu. To enhance GABA production in C. glutamicum, ribosomal binding site (RBS) sequence and promoter were searched and optimized for increasing the expression efficiency of gadB2. R4 exhibited the highest strength among RBS sequences tested, with 6 nt the optimal aligned spacing (AS) between RBS and start codon. This combination of RBS sequence and AS contributed to gadB2 expression, increased GAD activity by 156% and GABA production by 82% compared to normal strong RBS and AS combination. Then, a series of native promoters were selected for transcribing gadB2 under optimal RBS and AS combination. P dnaK , P dtsR , P odhI and P clgR expressed gadB2 and produced GABA as effectively as widely applied P tuf and P cspB promoters and more effectively than P sod promoter. However, each native promoter did not work as well as the synthetic strong promoter P tacM , which produced 20.2 ± 0.3 g/L GABA. Even with prolonged length and bicistronic architecture, the strength of P dnaK did not enhance. Finally, gadB2 and mutant gadB1 were co-expressed under the optimal promoter and RBS combination, thus converted Glu into GABA completely and improved GABA production to more than 25 g/L. This study provides useful promoters and RBS sequences for gene expression in C. glutamicum.
The Role of ABC Proteins in Drug-Resistant Breast Cancer Cells
2007-04-01
and a biotin acceptor domain) under control of the alcohol oxidase promoter (Figure 2). Upon methanol induction, the yeast expressed high levels of...as native cDNA. Therefore, we backtranslated the protein into a nucleotide sequence codon-optimized for expression in Pichia pastoris yeast. Yeast
Komatsubara, Akira T.; Matsuda, Michiyuki; Aoki, Kazuhiro
2015-01-01
Biosensors based on the principle of Förster (or fluorescence) resonance energy transfer (FRET) have been developed to visualize spatio-temporal dynamics of signalling molecules in living cells. Many of them adopt a backbone of intramolecular FRET biosensor with a cyan fluorescent protein (CFP) and yellow fluorescent protein (YFP) as donor and acceptor, respectively. However, there remains the difficulty of establishing cells stably expressing FRET biosensors with a YFP and CFP pair by lentiviral or retroviral gene transfer, due to the high incidence of recombination between YFP and CFP genes. To address this, we examined the effects of codon-diversification of YFP on the recombination of FRET biosensors introduced by lentivirus or retrovirus. The YFP gene that was fully codon-optimized to E.coli evaded the recombination in lentiviral or retroviral gene transfer, but the partially codon-diversified YFP did not. Further, the length of spacer between YFP and CFP genes clearly affected recombination efficiency, suggesting that the intramolecular template switching occurred in the reverse-transcription process. The simple mathematical model reproduced the experimental data sufficiently, yielding a recombination rate of 0.002–0.005 per base. Together, these results show that the codon-diversified YFP is a useful tool for expressing FRET biosensors by lentiviral or retroviral gene transfer. PMID:26290434
NASA Astrophysics Data System (ADS)
Monajjemi, M.; Razavian, M. H.; Mollaamin, F.; Naderi, F.; Honarparvar, B.
2008-12-01
Quantum-chemical solvent effect theories describe the electronic structure of a molecular subsystem embedded in a solvent or other molecular environment. The solvation of biomolecules is important in molecular biology, since numerous processes involve proteins interacting in changing solvent-solute systems. In this theoretical study, we focus on mRNA-tRNA base pairs as a fundamental step in protein synthesis influenced by hydrogen bonding between two antiparallel trinucleotides, namely, the mRNA codon and tRNA anticodon. We use the mean reaction field theories, which describe electrostatic and polarization interactions between solute and solvent in the AAA, UUU, AAG, and UUC triplex sequences optimized in various solvent media such as water, dimethylsulfoxide, methanol, ethanol, and cyclopean using the self-consistent reaction field model. This process depends on either the reaction potential function of the solvent or charge transfer operators that appear in solute-solvent interaction. Because of codon and anticodon biological criteria, we performed nonempirical quantum-mechanical calculations at the BLYP and B3LYP/3-21G, 6-31G, and 6-31G* levels of theory in the gas phase and five solvents at three temperatures. Finally, to obtain more information, we calculated thermochemical parameters to find that the dielectric constant of solvents plays an important role in the displacement of amino acid sequences on codon-anticodon residues in proteins, which can cause some mutations in humans.
Elena, Claudia; Ravasi, Pablo; Castelli, María E.; Peirú, Salvador; Menzella, Hugo G.
2014-01-01
The efficient production of functional proteins in heterologous hosts is one of the major bases of modern biotechnology. Unfortunately, many genes are difficult to express outside their original context. Due to their apparent “silent” nature, synonymous codon substitutions have long been thought to be trivial. In recent years, this dogma has been refuted by evidence that codon replacement can have a significant impact on gene expression levels and protein folding. In the past decade, considerable advances in the speed and cost of gene synthesis have facilitated the complete redesign of entire gene sequences, dramatically improving the likelihood of high protein expression. This technology significantly impacts the economic feasibility of microbial-based biotechnological processes by, for example, increasing the volumetric productivities of recombinant proteins or facilitating the redesign of novel biosynthetic routes for the production of metabolites. This review discusses the current applications of this technology, particularly those regarding the production of small molecules and industrially relevant recombinant enzymes. Suggestions for future research and potential uses are provided as well. PMID:24550894
Strutton, Benjamin; Jaffé, Stephen R P; Pandhal, Jagroop; Wright, Phillip C
2018-01-01
Although Escherichia coli has been engineered to perform N-glycosylation of recombinant proteins, an optimal glycosylating strain has not been created. By inserting a codon optimised Campylobacter oligosaccharyltransferase onto the E. coli chromosome, we created a glycoprotein platform strain, where the target glycoprotein, sugar synthesis and glycosyltransferase enzymes, can be inserted using expression vectors to produce the desired homogenous glycoform. To assess the functionality and glycoprotein producing capacity of the chromosomally based OST, a combined Western blot and parallel reaction monitoring mass spectrometry approach was applied, with absolute quantification of glycoprotein. We demonstrated that chromosomal oligosaccharyltransferase remained functional and facilitated N-glycosylation. Although the engineered strain produced less total recombinant protein, the glycosylation efficiency increased by 85%, and total glycoprotein production was enhanced by 17%. Copyright © 2017 Elsevier Inc. All rights reserved.
Zhang, Jianfeng; Jex, Edward; Feng, Tsungwei; Sivko, Gloria S; Baillie, Leslie W; Goldman, Stanley; Van Kampen, Kent R; Tang, De-chu C
2013-01-01
Bacillus anthracis is the causative agent of anthrax, and its spores have been developed into lethal bioweapons. To mitigate an onslaught from airborne anthrax spores that are maliciously disseminated, it is of paramount importance to develop a rapid-response anthrax vaccine that can be mass administered by nonmedical personnel during a crisis. We report here that intranasal instillation of a nonreplicating adenovirus vector encoding B. anthracis protective antigen could confer rapid and sustained protection against inhalation anthrax in mice in a single-dose regimen in the presence of preexisting adenovirus immunity. The potency of the vaccine was greatly enhanced when codons of the antigen gene were optimized to match the tRNA pool found in human cells. In addition, an adenovirus vector encoding lethal factor can confer partial protection against inhalation anthrax and might be coadministered with a protective antigen-based vaccine.
Jex, Edward; Feng, Tsungwei; Sivko, Gloria S.; Baillie, Leslie W.; Goldman, Stanley; Van Kampen, Kent R.; Tang, De-chu C.
2013-01-01
Bacillus anthracis is the causative agent of anthrax, and its spores have been developed into lethal bioweapons. To mitigate an onslaught from airborne anthrax spores that are maliciously disseminated, it is of paramount importance to develop a rapid-response anthrax vaccine that can be mass administered by nonmedical personnel during a crisis. We report here that intranasal instillation of a nonreplicating adenovirus vector encoding B. anthracis protective antigen could confer rapid and sustained protection against inhalation anthrax in mice in a single-dose regimen in the presence of preexisting adenovirus immunity. The potency of the vaccine was greatly enhanced when codons of the antigen gene were optimized to match the tRNA pool found in human cells. In addition, an adenovirus vector encoding lethal factor can confer partial protection against inhalation anthrax and might be coadministered with a protective antigen-based vaccine. PMID:23100479
Wei, Junhong; Tian, Jinjin; Pan, Guoqing; Xie, Jie; Bao, Jialing; Zhou, Zeyang
2017-06-01
To develop a reliable and easy to use expression system for antibiotic production improvement of Streptomyces. A two-compound T7 RNA polymerase-dependent gene expression system was developed to fulfill this demand. In this system, the T7 RNA polymerase coding sequence was optimized based on the codon usage of Streptomyces coelicolor. To evaluate the functionality of this system, we constructed an activator gene overexpression strain for enhancement of actinorhodin production. By overexpression of the positive regulator actII-ORF4 with this system, the maximum actinorhodin yield of engineered strain was 15-fold higher and the fermentation time was decreased by 48 h. The modified two-compound T7 expression system improves both antibiotic production and accelerates the fermentation process in Streptomyces. This provides a general and useful strategy for strain improvement of important antibiotic producing Streptomyces strains.
Liang, Bo; Surman, Sonja; Amaro-Carambot, Emerito; Kabatova, Barbora; Mackow, Natalie; Lingemann, Matthias; Yang, Lijuan; McLellan, Jason S.; Graham, Barney S.; Kwong, Peter D.; Schaap-Nutt, Anne; Collins, Peter L.
2015-01-01
ABSTRACT Respiratory syncytial virus (RSV) and human parainfluenza virus type 3 (HPIV3) are the first and second leading viral agents of severe respiratory tract disease in infants and young children worldwide. Vaccines are not available, and an RSV vaccine is particularly needed. A live attenuated chimeric recombinant bovine/human PIV3 (rB/HPIV3) vector expressing the RSV fusion (F) glycoprotein from an added gene has been under development as a bivalent vaccine against RSV and HPIV3. Previous clinical evaluation of this vaccine candidate suggested that increased genetic stability and immunogenicity of the RSV F insert were needed. This was investigated in the present study. RSV F expression was enhanced 5-fold by codon optimization and by modifying the amino acid sequence to be identical to that of an early passage of the original clinical isolate. This conferred a hypofusogenic phenotype that presumably reflects the original isolate. We then compared vectors expressing stabilized prefusion and postfusion versions of RSV F. In a hamster model, prefusion F induced increased quantity and quality of RSV-neutralizing serum antibodies and increased protection against wild-type (wt) RSV challenge. In contrast, a vector expressing the postfusion F was less immunogenic and protective. The genetic stability of the RSV F insert was high and was not affected by enhanced expression or the prefusion or postfusion conformation of RSV F. These studies provide an improved version of the previously well-tolerated rB/HPIV3-RSV F vaccine candidate that induces a superior RSV-neutralizing serum antibody response. IMPORTANCE Respiratory syncytial virus (RSV) and human parainfluenza virus type 3 (HPIV3) are two major causes of pediatric pneumonia and bronchiolitis. The rB/HPIV3 vector expressing RSV F protein is a candidate bivalent live vaccine against HPIV3 and RSV. Previous clinical evaluation indicated the need to increase the immunogenicity and genetic stability of the RSV F insert. Here, we increased RSV F expression by codon optimization and by modifying the RSV F amino acid sequence to conform to that of an early passage of the original isolate. This resulted in a hypofusogenic phenotype, which likely represents the original phenotype before adaptation to cell culture. We also included stabilized versions of prefusion and postfusion RSV F protein. Prefusion RSV F induced a larger quantity and higher quality of RSV-neutralizing serum antibodies and was highly protective. This provides an improved candidate for further clinical evaluation. PMID:26157122
Liang, Bo; Surman, Sonja; Amaro-Carambot, Emerito; Kabatova, Barbora; Mackow, Natalie; Lingemann, Matthias; Yang, Lijuan; McLellan, Jason S; Graham, Barney S; Kwong, Peter D; Schaap-Nutt, Anne; Collins, Peter L; Munir, Shirin
2015-09-01
Respiratory syncytial virus (RSV) and human parainfluenza virus type 3 (HPIV3) are the first and second leading viral agents of severe respiratory tract disease in infants and young children worldwide. Vaccines are not available, and an RSV vaccine is particularly needed. A live attenuated chimeric recombinant bovine/human PIV3 (rB/HPIV3) vector expressing the RSV fusion (F) glycoprotein from an added gene has been under development as a bivalent vaccine against RSV and HPIV3. Previous clinical evaluation of this vaccine candidate suggested that increased genetic stability and immunogenicity of the RSV F insert were needed. This was investigated in the present study. RSV F expression was enhanced 5-fold by codon optimization and by modifying the amino acid sequence to be identical to that of an early passage of the original clinical isolate. This conferred a hypofusogenic phenotype that presumably reflects the original isolate. We then compared vectors expressing stabilized prefusion and postfusion versions of RSV F. In a hamster model, prefusion F induced increased quantity and quality of RSV-neutralizing serum antibodies and increased protection against wild-type (wt) RSV challenge. In contrast, a vector expressing the postfusion F was less immunogenic and protective. The genetic stability of the RSV F insert was high and was not affected by enhanced expression or the prefusion or postfusion conformation of RSV F. These studies provide an improved version of the previously well-tolerated rB/HPIV3-RSV F vaccine candidate that induces a superior RSV-neutralizing serum antibody response. Respiratory syncytial virus (RSV) and human parainfluenza virus type 3 (HPIV3) are two major causes of pediatric pneumonia and bronchiolitis. The rB/HPIV3 vector expressing RSV F protein is a candidate bivalent live vaccine against HPIV3 and RSV. Previous clinical evaluation indicated the need to increase the immunogenicity and genetic stability of the RSV F insert. Here, we increased RSV F expression by codon optimization and by modifying the RSV F amino acid sequence to conform to that of an early passage of the original isolate. This resulted in a hypofusogenic phenotype, which likely represents the original phenotype before adaptation to cell culture. We also included stabilized versions of prefusion and postfusion RSV F protein. Prefusion RSV F induced a larger quantity and higher quality of RSV-neutralizing serum antibodies and was highly protective. This provides an improved candidate for further clinical evaluation. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Jacobo, Sarah Melissa P; Deangelis, Margaret M; Kim, Ivana K; Kazlauskas, Andrius
2013-05-01
Synonymous single nucleotide polymorphisms (SNPs) within a transcript's coding region produce no change in the amino acid sequence of the protein product and are therefore intuitively assumed to have a neutral effect on protein function. We report that two common variants of high-temperature requirement A1 (HTRA1) that increase the inherited risk of neovascular age-related macular degeneration (NvAMD) harbor synonymous SNPs within exon 1 of HTRA1 that convert common codons for Ala34 and Gly36 to less frequently used codons. The frequent-to-rare codon conversion reduced the mRNA translation rate and appeared to compromise HtrA1's conformation and function. The protein product generated from the SNP-containing cDNA displayed enhanced susceptibility to proteolysis and a reduced affinity for an anti-HtrA1 antibody. The NvAMD-associated synonymous polymorphisms lie within HtrA1's putative insulin-like growth factor 1 (IGF-1) binding domain. They reduced HtrA1's abilities to associate with IGF-1 and to ameliorate IGF-1-stimulated signaling events and cellular responses. These observations highlight the relevance of synonymous codon usage to protein function and implicate homeostatic protein quality control mechanisms that may go awry in NvAMD.
Trinucleotide cassettes increase diversity of T7 phage-displayed peptide library.
Krumpe, Lauren R H; Schumacher, Kathryn M; McMahon, James B; Makowski, Lee; Mori, Toshiyuki
2007-10-05
Amino acid sequence diversity is introduced into a phage-displayed peptide library by randomizing library oligonucleotide DNA. We recently evaluated the diversity of peptide libraries displayed on T7 lytic phage and M13 filamentous phage and showed that T7 phage can display a more diverse amino acid sequence repertoire due to differing processes of viral morphogenesis. In this study, we evaluated and compared the diversity of a 12-mer T7 phage-displayed peptide library randomized using codon-corrected trinucleotide cassettes with a T7 and an M13 12-mer phage-displayed peptide library constructed using the degenerate codon randomization method. We herein demonstrate that the combination of trinucleotide cassette amino acid codon randomization and T7 phage display construction methods resulted in a significant enhancement to the functional diversity of a 12-mer peptide library. This novel library exhibited superior amino acid uniformity and order-of-magnitude increases in amino acid sequence diversity as compared to degenerate codon randomized peptide libraries. Comparative analyses of the biophysical characteristics of the 12-mer peptide libraries revealed the trinucleotide cassette-randomized library to be a unique resource. The combination of T7 phage display and trinucleotide cassette randomization resulted in a novel resource for the potential isolation of binding peptides for new and previously studied molecular targets.
Huang, Kun; Caplan, Jeff; Sweigard, James A; Czymmek, Kirk J; Donofrio, Nicole M
2017-02-01
Reactive oxygen species (ROS) production and breakdown have been studied in detail in plant-pathogenic fungi, including the rice blast fungus, Magnaporthe oryzae; however, the examination of the dynamic process of ROS production in real time has proven to be challenging. We resynthesized an existing ROS sensor, called HyPer, to exhibit optimized codon bias for fungi, specifically Neurospora crassa, and used a combination of microscopy and plate reader assays to determine whether this construct could detect changes in fungal ROS during the plant infection process. Using confocal microscopy, we were able to visualize fluctuating ROS levels during the formation of an appressorium on an artificial hydrophobic surface, as well as during infection on host leaves. Using the plate reader, we were able to ascertain measurements of hydrogen peroxide (H 2 O 2 ) levels in conidia as detected by the MoHyPer sensor. Overall, by the optimization of codon usage for N. crassa and related fungal genomes, the MoHyPer sensor can be used as a robust, dynamic and powerful tool to both monitor and quantify H 2 O 2 dynamics in real time during important stages of the plant infection process. © 2016 BSPP AND JOHN WILEY & SONS LTD.
Stable, fertile, high polyhydroxyalkanoate producing plants and methods of producing them
Bohmert-Tatarev, Karen; McAvoy, Susan; Peoples, Oliver P.; Snell, Kristi D.
2015-08-04
Transgenic plants that produce high levels of polyhydroxybutyrate and methods of producing them are provided. In a preferred embodiment the transgenic plants are produced using plastid transformation technologies and utilize genes which are codon optimized. Stably transformed plants able to produce greater than 10% dwt PHS in tissues are also provided.
Analysis of synonymous codon usage patterns in the genus Rhizobium.
Wang, Xinxin; Wu, Liang; Zhou, Ping; Zhu, Shengfeng; An, Wei; Chen, Yu; Zhao, Lin
2013-11-01
The codon usage patterns of rhizobia have received increasing attention. However, little information is available regarding the conserved features of the codon usage patterns in a typical rhizobial genus. The codon usage patterns of six completely sequenced strains belonging to the genus Rhizobium were analysed as model rhizobia in the present study. The relative neutrality plot showed that selection pressure played a role in codon usage in the genus Rhizobium. Spearman's rank correlation analysis combined with correspondence analysis (COA) showed that the codon adaptation index and the effective number of codons (ENC) had strong correlation with the first axis of the COA, which indicated the important role of gene expression level and the ENC in the codon usage patterns in this genus. The relative synonymous codon usage of Cys codons had the strongest correlation with the second axis of the COA. Accordingly, the usage of Cys codons was another important factor that shaped the codon usage patterns in Rhizobium genomes and was a conserved feature of the genus. Moreover, the comparison of codon usage between highly and lowly expressed genes showed that 20 unique preferred codons were shared among Rhizobium genomes, revealing another conserved feature of the genus. This is the first report of the codon usage patterns in the genus Rhizobium.
Prabha, Ratna; Singh, Dhananjaya P; Sinha, Swati; Ahmad, Khurshid; Rai, Anil
2017-04-01
With the increasing accumulation of genomic sequence information of prokaryotes, the study of codon usage bias has gained renewed attention. The purpose of this study was to examine codon selection pattern within and across cyanobacterial species belonging to diverse taxonomic orders and habitats. We performed detailed comparative analysis of cyanobacterial genomes with respect to codon bias. Our analysis reflects that in cyanobacterial genomes, A- and/or T-ending codons were used predominantly in the genes whereas G- and/or C-ending codons were largely avoided. Variation in the codon context usage of cyanobacterial genes corresponded to the clustering of cyanobacteria as per their GC content. Analysis of codon adaptation index (CAI) and synonymous codon usage order (SCUO) revealed that majority of genes are associated with low codon bias. Codon selection pattern in cyanobacterial genomes reflected compositional constraints as major influencing factor. It is also identified that although, mutational constraint may play some role in affecting codon usage bias in cyanobacteria, compositional constraint in terms of genomic GC composition coupled with environmental factors affected codon selection pattern in cyanobacterial genomes. Copyright © 2016 Elsevier B.V. All rights reserved.
Ríos-Fránquez, Francisco Javier; González-Bautista, Enrique; Ponce-Noyola, Teresa; Ramos-Valdivia, Ana Carmela; Poggi-Varaldo, Héctor Mario; García-Mena, Jaime; Martinez, Alfredo
2017-05-01
Bioethanol is one of the main biofuels produced from the fermentation of saccharified agricultural waste; however, this technology needs to be optimized for profitability. Because the commonly used ethanologenic yeast strains are unable to assimilate cellobiose, several efforts have been made to express cellulose hydrolytic enzymes in these yeasts to produce ethanol from lignocellulose. The C. flavigenabglA gene encoding β-glucosidase catalytic subunit was optimized for preferential codon usage in S. cerevisiae. The optimized gene, cloned into the episomal vector pRGP-1, was expressed, which led to the secretion of an active β-glucosidase in transformants of the S. cerevisiae diploid strain 2-24D. The volumetric and specific extracellular enzymatic activities using pNPG as substrate were 155 IU L -1 and 222 IU g -1 , respectively, as detected in the supernatant of the cultures of the S. cerevisiae RP2-BGL transformant strain growing in cellobiose (20 g L -1 ) as the sole carbon source for 48 h. Ethanol production was 5 g L -1 after 96 h of culture, which represented a yield of 0.41 g g -1 of substrate consumed (12 g L -1 ), equivalent to 76% of the theoretical yield. The S. cerevisiae RP2-BGL strain expressed the β-glucosidase extracellularly and produced ethanol from cellobiose, which makes this microorganism suitable for application in ethanol production processes with saccharified lignocellulose.
Wald, Naama; Alroy, Maya; Botzman, Maya; Margalit, Hanah
2012-01-01
Synonymous codons are unevenly distributed among genes, a phenomenon termed codon usage bias. Understanding the patterns of codon bias and the forces shaping them is a major step towards elucidating the adaptive advantage codon choice can confer at the level of individual genes and organisms. Here, we perform a large-scale analysis to assess codon usage bias pattern of pyrimidine-ending codons in highly expressed genes in prokaryotes. We find a bias pattern linked to the degeneracy of the encoded amino acid. Specifically, we show that codon-pairs that encode two- and three-fold degenerate amino acids are biased towards the C-ending codon while codons encoding four-fold degenerate amino acids are biased towards the U-ending codon. This codon usage pattern is widespread in prokaryotes, and its strength is correlated with translational selection both within and between organisms. We show that this bias is associated with an improved correspondence with the tRNA pool, avoidance of mis-incorporation errors during translation and moderate stability of codon–anticodon interaction, all consistent with more efficient translation. PMID:22581775
Krefft, Daria; Papkov, Aliaksei; Zylicz-Stachula, Agnieszka; Skowron, Piotr M
2017-01-01
Obtaining thermostable enzymes (thermozymes) is an important aspect of biotechnology. As thermophiles have adapted their genomes to high temperatures, their cloned genes' expression in mesophiles is problematic. This is mainly due to their high GC content, which leads to the formation of unfavorable secondary mRNA structures and codon usage in Escherichia coli (E. coli). RM.TthHB27I is a member of a family of bifunctional thermozymes, containing a restriction endonuclease (REase) and a methyltransferase (MTase) in a single polypeptide. Thermus thermophilus HB27 (T. thermophilus) produces low amounts of RM.TthHB27I with a unique DNA cleavage specificity. We have previously cloned the wild type (wt) gene into E. coli, which increased the production of RM.TthHB27I over 100-fold. However, its enzymatic activities were extremely low for an ORF expressed under a T7 promoter. We have designed and cloned a fully synthetic tthHB27IRM gene, using a modified 'codon randomization' strategy. Codons with a high GC content and of low occurrence in E. coli were eliminated. We incorporated a stem-loop circuit, devised to negatively control the expression of this highly toxic gene by partially hiding the ribosome-binding site (RBS) and START codon in mRNA secondary structures. Despite having optimized 59% of codons, the amount of produced RM.TthHB27I protein was similar for both recombinant tthHB27IRM gene variants. Moreover, the recombinant wt RM.TthHB27I is very unstable, while the RM.TthHB27I resulting from the expression of the synthetic gene exhibited enzymatic activities and stability equal to the native thermozyme isolated from T. thermophilus. Thus, we have developed an efficient purification protocol using the synthetic tthHB27IRM gene variant only. This suggests the effect of co-translational folding kinetics, possibly affected by the frequency of translational errors. The availability of active RM.TthHB27I is of practical importance in molecular biotechnology, extending the palette of available REase specificities.
USDA-ARS?s Scientific Manuscript database
MRS926 is a livestock-associated methicillin-resistant Staphylococcus aureus (MRSA) strain of sequence type (ST) 398. In order to facilitate in vitro and in vivo studies of this strain, we sought to tag it with a fluorescent marker. We cloned a codon-optimized gene for TurboGFP into a shuttle vector...
USDA-ARS?s Scientific Manuscript database
Influenza A virus (IAV) in swine constitutes a major economic burden for producers as well as a potential threat to public health. Whole inactivated virus vaccines (WIV) are the predominant countermeasure employed to control IAV in swine herds in the United States despite the superior protection, an...
Chen, Augustine; Kao, Y. F.; Brown, Chris M.
2005-01-01
The human hepatitis B virus (HBV) has a compact genome encoding four major overlapping coding regions: the core, polymerase, surface and X. The polymerase initiation codon is preceded by the partially overlapping core and four or more upstream initiation codons. There is evidence that several mechanisms are used to enable the synthesis of the polymerase protein, including leaky scanning and ribosome reinitiation. We have examined the first AUG in the pregenomic RNA, it precedes that of the core. It initiates an uncharacterized short upstream open reading frame (uORF), highly conserved in all HBV subtypes, we designated the C0 ORF. This arrangement suggested that expression of the core and polymerase may be affected by this uORF. Initiation at the C0 ORF was confirmed in reporter constructs in transfected cells. The C0 ORF had an inhibitory role in downstream expression from the core initiation site in HepG2 cells and in vitro, but also stimulated reinitiation at the polymerase start when in an optimal context. Our results indicate that the C0 ORF is a determinant in balancing the synthesis of the core and polymerase proteins. PMID:15731337
Partial attenuation of Marek's disease virus by manipulation of Di-codon bias
USDA-ARS?s Scientific Manuscript database
All species studied to date demonstrate a preference for certain codons over other synonymous codons (codon bias), a preference which is also observed for pairs of codons (di-codon bias). Previous studies using poliovirus and influenza virus as models have demonstrated the ability to cause attenuat...
Transcriptome Analysis of Core Dinoflagellates Reveals a Universal Bias towards "GC" Rich Codons.
Williams, Ernest; Place, Allen; Bachvaroff, Tsvetan
2017-04-27
Although dinoflagellates are a potential source of pharmaceuticals and natural products, the mechanisms for regulating and producing these compounds are largely unknown because of extensive post-transcriptional control of gene expression. One well-documented mechanism for controlling gene expression during translation is codon bias, whereby specific codons slow or even terminate protein synthesis. Approximately 10,000 annotatable genes from fifteen "core" dinoflagellate transcriptomes along a range of overall guanine and cytosine (GC) content were used for codonW analysis to determine the relative synonymous codon usage (RSCU) and the GC content at each codon position. GC bias in the analyzed dataset and at the third codon position varied from 51% and 54% to 66% and 88%, respectively. Codons poor in GC were observed to be universally absent, but bias was most pronounced for codons ending in uracil followed by adenine (UA). GC bias at the third codon position was able to explain low abundance codons as well as the low effective number of codons. Thus, we propose that a bias towards codons rich in GC bases is a universal feature of core dinoflagellates, possibly relating to their unique chromosome structure, and not likely a major mechanism for controlling gene expression.
Gubbens, Jacob; Kim, Soo Jung; Yang, Zhongying; Johnson, Arthur E.; Skach, William R.
2010-01-01
Amber suppressor tRNAs are widely used to incorporate nonnatural amino acids into proteins to serve as probes of structure, environment, and function. The utility of this approach would be greatly enhanced if multiple probes could be simultaneously incorporated at different locations in the same protein without other modifications. Toward this end, we have developed amber, opal, and ochre suppressor tRNAs derived from Escherichia coli, and yeast tRNACys that incorporate a chemically modified cysteine residue with high selectivity at the cognate UAG, UGA, and UAA stop codons in an in vitro translation system. These synthetic tRNAs were aminoacylated in vitro, and the labile aminoacyl bond was stabilized by covalently attaching a fluorescent dye to the cysteine sulfhydryl group. Readthrough efficiency (amber > opal > ochre) was substantially improved by eRF1/eRF3 inhibition with an RNA aptamer, thus overcoming an intrinsic hierarchy in stop codon selection that limits UGA and UAA termination suppression in higher eukaryotic translation systems. This approach now allows concurrent incorporation of two different modified amino acids at amber and opal codons with a combined apparent readthrough efficiency of up to 25% when compared with the parent protein lacking a stop codon. As such, it significantly expands the possibilities for incorporating nonnative amino acids for protein structure/function studies. PMID:20581130
The p40 Subunit of Interleukin (IL)-12 Promotes Stabilization and Export of the p35 Subunit
Jalah, Rashmi; Rosati, Margherita; Ganneru, Brunda; Pilkington, Guy R.; Valentin, Antonio; Kulkarni, Viraj; Bergamaschi, Cristina; Chowdhury, Bhabadeb; Zhang, Gen-Mu; Beach, Rachel Kelly; Alicea, Candido; Broderick, Kate E.; Sardesai, Niranjan Y.; Pavlakis, George N.; Felber, Barbara K.
2013-01-01
IL-12 is a 70-kDa heterodimeric cytokine composed of the p35 and p40 subunits. To maximize cytokine production from plasmid DNA, molecular steps controlling IL-12p70 biosynthesis at the posttranscriptional and posttranslational levels were investigated. We show that the combination of RNA/codon-optimized gene sequences and fine-tuning of the relative expression levels of the two subunits within a cell resulted in increased production of the IL-12p70 heterodimer. We found that the p40 subunit plays a critical role in enhancing the stability, intracellular trafficking, and export of the p35 subunit. This posttranslational regulation mediated by the p40 subunit is conserved in mammals. Based on these findings, dual gene expression vectors were generated, producing an optimal ratio of the two subunits, resulting in a ∼1 log increase in human, rhesus, and murine IL-12p70 production compared with vectors expressing the wild type sequences. Such optimized DNA plasmids also produced significantly higher levels of systemic bioactive IL-12 upon in vivo DNA delivery in mice compared with plasmids expressing the wild type sequences. A single therapeutic injection of an optimized murine IL-12 DNA plasmid showed significantly more potent control of tumor development in the B16 melanoma cancer model in mice. Therefore, the improved IL-12p70 DNA vectors have promising potential for in vivo use as molecular vaccine adjuvants and in cancer immunotherapy. PMID:23297419
Yamamoto, Itaru; Nosho, Katsuhiko; Kanno, Shinichi; Igarashi, Hisayoshi; Kurihara, Hiroyoshi; Ishigami, Keisuke; Ishiguro, Kazuya; Mitsuhashi, Kei; Maruyama, Reo; Koide, Hideyuki; Okuda, Hiroyuki; Hasegawa, Tadashi; Sukawa, Yasutaka; Okita, Kenji; Takemasa, Ichiro; Yamamoto, Hiroyuki; Shinomura, Yasuhisa; Nakase, Hiroshi
2017-03-14
The polycomb group protein enhancer of zeste homolog 2 (EZH2) is a methyltransferase that suppresses microRNA-31 (miR-31) in various human malignancies including colorectal cancer. We recently suggested that miR-31 regulates the signaling pathway downstream of epidermal growth factor receptor (EGFR) in colorectal cancer. Therefore, we conducted this study for assessing the relationship between EZH2 expression and clinical outcomes in patients with colorectal cancer treated with anti-EGFR therapeutics. We immunohistochemically evaluated EZH2 expression and assessed miR-31 and gene mutations [KRAS (codon 61/146), NRAS (codon 12/13/61), and BRAF (codon 600)] in 109 patients with colorectal cancer harboring KRAS (codon 12/13) wild-type. We also evaluated the progression-free survival (PFS) and overall survival (OS). In the result, low EZH2 expression was significantly associated with shorter PFS (log-rank test: P = 0.023) and OS (P = 0.036) in patients with colorectal cancer. In the low-miR-31-expression group and the KRAS (codon 61/146), NRAS, and BRAF wild-type groups, a significantly shorter PFS (P = 0.022, P = 0.039, P = 0.021, and P = 0.036, respectively) was observed in the EZH2 low-expression groups than in the high-expression groups. In the multivariate analysis, low EZH2 expression was associated with a shorter PFS (P = 0.046), independent of the mutational status and miR-31. In conclusion, EZH2 expression was associated with survival in patients with colorectal cancer who were treated with anti-EGFR therapeutics. Moreover, low EZH2 expression was independently associated with shorter PFS in patients with cancer, suggesting that EZH2 expression is a useful additional prognostic biomarker for anti-EGFR therapy.
Nakamura, Masayuki; Sugiura, Masahiro
2007-01-01
Codon usage in chloroplasts is different from that in prokaryotic and eukaryotic nuclear genomes. However, no experimental approach has been made to analyse the translation efficiency of individual codons in chloroplasts. We devised an in vitro assay for translation efficiencies using synthetic mRNAs, and measured the translation efficiencies of five synonymous codon groups in tobacco chloroplasts. Among four alanine codons (GCN, where N is U, C, A or G), GCU was the most efficient for translation, whereas the chloroplast genome lacks tRNA genes corresponding to GCU. Phenylalanine and tyrosine are each encoded by two codons (UUU/C and UAU/C, respectively). Phenylalanine UUC and tyrosine UAC were translated more than twice as efficiently than UUU and UAU, respectively, contrary to their codon usage, whereas translation efficiencies of synonymous codons for alanine, aspartic acid and asparagine were parallel to their codon usage. These observations indicate that translation efficiencies of individual codons are not always correlated with codon usage in vitro in chloroplasts. This raises an important issue for foreign gene expression in chloroplasts.
Functional Versatility of AGY Serine Codons in Immunoglobulin Variable Region Genes
Detanico, Thiago; Phillips, Matthew; Wysocki, Lawrence J.
2016-01-01
In systemic autoimmunity, autoantibodies directed against nuclear antigens (Ags) often arise by somatic hypermutation (SHM) that converts AGT and AGC (AGY) Ser codons into Arg codons. This can occur by three different single-base changes. Curiously, AGY Ser codons are far more abundant in complementarity-determining regions (CDRs) of IgV-region genes than expected for random codon use or from species-specific codon frequency data. CDR AGY codons are also more abundant than TCN Ser codons. We show that these trends hold even in cartilaginous fishes. Because AGC is a preferred target for SHM by activation-induced cytidine deaminase, we asked whether the AGY abundance was solely due to a selection pressure to conserve high mutability in CDRs regardless of codon context but found that this was not the case. Instead, AGY triplets were selectively enriched in the Ser codon reading frame. Motivated by reports implicating a functional role for poly/autoreactive specificities in antiviral antibodies, we also analyzed mutations at AGY in antibodies directed against a number of different viruses and found that mutations producing Arg codons in antiviral antibodies were indeed frequent. Unexpectedly, however, we also found that AGY codons mutated often to encode nearly all of the amino acids that are reported to provide the most frequent contacts with Ag. In many cases, mutations producing codons for these alternative amino acids in antiviral antibodies were more frequent than those producing Arg codons. Mutations producing each of these key amino acids required only single-base changes in AGY. AGY is the only codon group in which two-thirds of random mutations generate codons for these key residues. Finally, by directly analyzing X-ray structures of immune complexes from the RCSB protein database, we found that Ag-contact residues generated via SHM occurred more often at AGY than at any other codon group. Thus, preservation of AGY codons in antibody genes appears to have been driven by their exceptional functional versatility, despite potential autoreactive consequences. PMID:27920779
Chloroplast DNA codon use: evidence for selection at the psb A locus based on tRNA availability.
Morton, B R
1993-09-01
Codon use in the three sequenced chloroplast genomes (Marchantia, Oryza, and Nicotiana) is examined. The chloroplast has a bias in that codons NNA and NNT are favored over synonymous NNC and NNG codons. This appears to be a consequence of an overall high A + T content of the genome. This pattern of codon use is not followed by the psb A gene of all three genomes and other psb A sequences examined. In this gene, the codon use favors NNC over NNT for twofold degenerate amino acids. In each case the only tRNA coded by the genome is complementary to the NNC codon. This codon use is similar to the codon use by chloroplast genes examined from Chlamydomonas reinhardtii. Since psb A is the major translation product of the chloroplast, this suggests that selection is acting on the codon use of this gene to adapt codons to tRNA availability, as previously suggested for unicellular organisms.
Optimization of mNeonGreen for Homo sapiens increases its fluorescent intensity in mammalian cells.
Tanida-Miyake, Emiko; Koike, Masato; Uchiyama, Yasuo; Tanida, Isei
2018-01-01
Green fluorescent protein (GFP) is tremendously useful for investigating many cellular and intracellular events. The monomeric GFP mNeonGreen is about 3- to 5-times brighter than GFP and monomeric enhanced GFP and shows high photostability. The maturation half-time of mNeonGreen is about 3-fold faster than that of monomeric enhanced GFP. However, the cDNA sequence encoding mNeonGreen contains some codons that are rarely used in Homo sapiens. For better expression of mNeonGreen in human cells, we synthesized a human-optimized cDNA encoding mNeonGreen and generated an expression plasmid for humanized mNeonGreen under the control of the cytomegalovirus promoter. The resultant plasmid was introduced into HEK293 cells. The fluorescent intensity of humanized mNeonGreen was about 1.4-fold higher than that of the original mNeonGreen. The humanized mNeonGreen with a mitochondria-targeting signal showed mitochondrial distribution of mNeonGreen. We further generated an expression vector of humanized mNeonGreen with 3xFLAG tags at its carboxyl terminus as these tags are useful for immunological analyses. The 3xFLAG-tagged mNeonGreen was recognized well with an anti-FLAG-M2 antibody. These plasmids for the expression of humanized mNeonGreen and mNeonGreen-3xFLAG are useful tools for biological studies in mammalian cells using mNeonGreen.
Rezgui, Vanessa Anissa Nathalie; Tyagi, Kshitiz; Ranjan, Namit; Konevega, Andrey L; Mittelstaet, Joerg; Rodnina, Marina V; Peter, Matthias; Pedrioli, Patrick G A
2013-07-23
tRNA modifications are crucial to ensure translation efficiency and fidelity. In eukaryotes, the URM1 and ELP pathways increase cellular resistance to various stress conditions, such as nutrient starvation and oxidative agents, by promoting thiolation and methoxycarbonylmethylation, respectively, of the wobble uridine of cytoplasmic (tK(UUU)), (tQ(UUG)), and (tE(UUC)). Although in vitro experiments have implicated these tRNA modifications in modulating wobbling capacity and translation efficiency, their exact in vivo biological roles remain largely unexplored. Using a combination of quantitative proteomics and codon-specific translation reporters, we find that translation of a specific gene subset enriched for AAA, CAA, and GAA codons is impaired in the absence of URM1- and ELP-dependent tRNA modifications. Moreover, in vitro experiments using native tRNAs demonstrate that both modifications enhance binding of tK(UUU) to the ribosomal A-site. Taken together, our data suggest that tRNA thiolation and methoxycarbonylmethylation regulate translation of genes with specific codon content.
Klingbeil, Katharina; Lange, Elke; Teifke, Jens P; Mettenleiter, Thomas C; Fuchs, Walter
2014-04-01
Pigs can be severely harmed by influenza, and represent important reservoir hosts, in which new human pathogens such as the recent pandemic swine-origin H1N1 influenza A virus can arise by mutation and reassortment of genome segments. To obtain novel, safe influenza vaccines for pigs, and to investigate the antigen-specific immune response, we modified an established live-virus vaccine against Aujeszky's disease of swine, pseudorabies virus (PrV) strain Bartha (PrV-Ba), to serve as vector for the expression of haemagglutinin (HA) of swine-origin H1N1 virus. To facilitate transgene insertion, the genome of PrV-Ba was cloned as a bacterial artificial chromosome. HA expression occurred under control of the human or murine cytomegalovirus immediate early promoters (P-HCMV, P-MCMV), but could be substantially enhanced by synthetic introns and adaptation of the codon usage to that of PrV. However, despite abundant expression, the heterologous glycoprotein was not detectably incorporated into mature PrV particles. Replication of HA-expressing PrV in cell culture was only slightly affected compared to that of the parental virus strain. A single immunization of pigs with the PrV vector expressing the codon-optimized HA gene under control of P-MCMV induced high levels of HA-specific antibodies. The vaccinated animals were protected from clinical signs after challenge with a related swine-origin H1N1 influenza A virus, and challenge virus shedding was significantly reduced.
Nasrullah, Izza; Butt, Azeem M; Tahir, Shifa; Idrees, Muhammad; Tong, Yigang
2015-08-26
The Marburg virus (MARV) has a negative-sense single-stranded RNA genome, belongs to the family Filoviridae, and is responsible for several outbreaks of highly fatal hemorrhagic fever. Codon usage patterns of viruses reflect a series of evolutionary changes that enable viruses to shape their survival rates and fitness toward the external environment and, most importantly, their hosts. To understand the evolution of MARV at the codon level, we report a comprehensive analysis of synonymous codon usage patterns in MARV genomes. Multiple codon analysis approaches and statistical methods were performed to determine overall codon usage patterns, biases in codon usage, and influence of various factors, including mutation pressure, natural selection, and its two hosts, Homo sapiens and Rousettus aegyptiacus. Nucleotide composition and relative synonymous codon usage (RSCU) analysis revealed that MARV shows mutation bias and prefers U- and A-ended codons to code amino acids. Effective number of codons analysis indicated that overall codon usage among MARV genomes is slightly biased. The Parity Rule 2 plot analysis showed that GC and AU nucleotides were not used proportionally which accounts for the presence of natural selection. Codon usage patterns of MARV were also found to be influenced by its hosts. This indicates that MARV have evolved codon usage patterns that are specific to both of its hosts. Moreover, selection pressure from R. aegyptiacus on the MARV RSCU patterns was found to be dominant compared with that from H. sapiens. Overall, mutation pressure was found to be the most important and dominant force that shapes codon usage patterns in MARV. To our knowledge, this is the first detailed codon usage analysis of MARV and extends our understanding of the mechanisms that contribute to codon usage and evolution of MARV.
Transcriptome Analysis of Core Dinoflagellates Reveals a Universal Bias towards “GC” Rich Codons
Williams, Ernest; Place, Allen; Bachvaroff, Tsvetan
2017-01-01
Although dinoflagellates are a potential source of pharmaceuticals and natural products, the mechanisms for regulating and producing these compounds are largely unknown because of extensive post-transcriptional control of gene expression. One well-documented mechanism for controlling gene expression during translation is codon bias, whereby specific codons slow or even terminate protein synthesis. Approximately 10,000 annotatable genes from fifteen “core” dinoflagellate transcriptomes along a range of overall guanine and cytosine (GC) content were used for codonW analysis to determine the relative synonymous codon usage (RSCU) and the GC content at each codon position. GC bias in the analyzed dataset and at the third codon position varied from 51% and 54% to 66% and 88%, respectively. Codons poor in GC were observed to be universally absent, but bias was most pronounced for codons ending in uracil followed by adenine (UA). GC bias at the third codon position was able to explain low abundance codons as well as the low effective number of codons. Thus, we propose that a bias towards codons rich in GC bases is a universal feature of core dinoflagellates, possibly relating to their unique chromosome structure, and not likely a major mechanism for controlling gene expression. PMID:28448468
XPD polymorphisms: effects on DNA repair proficiency.
Lunn, R M; Helzlsouer, K J; Parshad, R; Umbach, D M; Harris, E L; Sanford, K K; Bell, D A
2000-04-01
XPD codes for a DNA helicase involved in transcription and nucleotide excision repair. Rare XPD mutations diminish nucleotide excision repair resulting in hypersensitivity to UV light and increased risk of skin cancer. Several polymorphisms in this gene have been identified but their impact on DNA repair is not known. We compared XPD genotypes at codons 312 and 751 with DNA repair proficiency in 31 women. XPD genotypes were measured by PCR-RFLP. DNA repair proficiency was assessed using a cytogenetic assay that detects X-ray induced chromatid aberrations (breaks and gaps). Chromatid aberrations were scored per 100 metaphase cells following incubation at 37 degrees C (1.5 h after irradiation) to allow for repair of DNA damage. Individuals with the Lys/Lys codon 751 XPD genotype had a higher number of chromatid aberrations (132/100 metaphase cells) than those having a 751Gln allele (34/100 metaphase cells). Individuals having greater than 60 chromatid breaks plus gaps were categorized as having sub-optimal repair. Possessing a Lys/Lys751 genotype increased the risk of sub-optimal DNA repair (odds ratio = 7.2, 95% confidence interval = 1.01-87.7). The Asp312Asn XPD polymorphism did not appear to affect DNA repair proficiency. These results suggest that the Lys751 (common) allele may alter the XPD protein product resulting in sub-optimal repair of X-ray-induced DNA damage.
NASA Astrophysics Data System (ADS)
Malla, Spundana; Kadimisetty, Karteek; Fu, You-Jun; Choudhary, Dharamainder; Schenkman, John B.; Rusling, James F.
2017-01-01
Methylation of cytosine (C) at C-phosphate-guanine (CpG) sites enhances reactivity of DNA towards electrophiles. Mutations at CpG sites on the p53 tumor suppressor gene that can result from these adductions are in turn correlated with specific cancers. Here we describe the first restriction-enzyme-assisted LC-MS/MS sequencing study of the influence of methyl cytosines (MeC) on kinetics of p53 gene adduction by model metabolite benzo[a]pyrene-7,8-dihydrodiol-9,10-epoxide (BPDE), using methodology applicable to correlate gene damage sites for drug and pollutant metabolites with mutation sites. This method allows direct kinetic measurements by LC-MS/MS sequencing for oligonucleotides longer than 20 base pairs (bp). We used MeC and non-MeC (C) versions of a 32 bp exon 7 fragment of the p53 gene. Methylation of 19 cytosines increased the rate constant 3-fold for adduction on G at the major reactive CpG in codon 248 vs. the non-MeC fragment. Rate constants for non-CpG codons 244 and 243 were not influenced significantly by MeC. Conformational and hydrophobicity changes in the MeC-p53 exon 7 fragment revealed by CD spectra and molecular modeling increase the BPDE binding constant to G in codon 248 consistent with a pathway in which preceding reactant binding greatly facilitates the rate of covalent SN2 coupling.
A mechanism for exon skipping caused by nonsense or missense mutations in BRCA1 and other genes.
Liu, H X; Cartegni, L; Zhang, M Q; Krainer, A R
2001-01-01
Point mutations can generate defective and sometimes harmful proteins. The nonsense-mediated mRNA decay (NMD) pathway minimizes the potential damage caused by nonsense mutations. In-frame nonsense codons located at a minimum distance upstream of the last exon-exon junction are recognized as premature termination codons (PTCs), targeting the mRNA for degradation. Some nonsense mutations cause skipping of one or more exons, presumably during pre-mRNA splicing in the nucleus; this phenomenon is termed nonsense-mediated altered splicing (NAS), and its underlying mechanism is unclear. By analyzing NAS in BRCA1, we show here that inappropriate exon skipping can be reproduced in vitro, and results from disruption of a splicing enhancer in the coding sequence. Enhancers can be disrupted by single nonsense, missense and translationally silent point mutations, without recognition of an open reading frame as such. These results argue against a nuclear reading-frame scanning mechanism for NAS. Coding-region single-nucleotide polymorphisms (cSNPs) within exonic splicing enhancers or silencers may affect the patterns or efficiency of mRNA splicing, which may in turn cause phenotypic variability and variable penetrance of mutations elsewhere in a gene.
Characterization of the porcine epidemic diarrhea virus codon usage bias.
Chen, Ye; Shi, Yuzhen; Deng, Hongjuan; Gu, Ting; Xu, Jian; Ou, Jinxin; Jiang, Zhiguo; Jiao, Yiren; Zou, Tan; Wang, Chong
2014-12-01
Porcine epidemic diarrhea virus (PEDV) has been responsible for several recent outbreaks of porcine epidemic diarrhea (PED) and has caused great economic loss in the swine-raising industry. Considering the significance of PEDV, a systemic analysis was performed to study its codon usage patterns. The relative synonymous codon usage value of each codon revealed that codon usage bias exists and that PEDV tends to use codons that end in T. The mean ENC value of 47.91 indicates that the codon usage bias is low. However, we still wanted to identify the cause of this codon usage bias. A correlation analysis between the codon compositions (A3s, T3s, G3s, C3s, and GC3s), the ENC values, and the nucleotide contents (A%, T%, G%, C%, and GC%) indicated that mutational bias plays role in shaping the PEDV codon usage bias. This was further confirmed by a principal component analysis between the codon compositions and the axis values. Using the Gravy, Aroma, and CAI values, a role of natural selection in the PEDV codon usage pattern was also identified. Neutral analysis indicated that natural selection pressure plays a more important role than mutational bias in codon usage bias. Natural selection also plays an increasingly significant role during PEDV evolution. Additionally, gene function and geographic distribution also influence the codon usage bias to a degree. Copyright © 2014 Elsevier B.V. All rights reserved.
Addepalli, Balasubrahmanym; Lesner, Nicholas P.; Limbach, Patrick A.
2015-01-01
A codon-optimized recombinant ribonuclease, MC1 is characterized for its uridine-specific cleavage ability to map nucleoside modifications in RNA. The published MC1 amino acid sequence, as noted in a previous study, was used as a template to construct a synthetic gene with a natural codon bias favoring expression in Escherichia coli. Following optimization of various expression conditions, the active recombinant ribonuclease was successfully purified as a C-terminal His-tag fusion protein from E. coli [Rosetta 2(DE3)] cells. The isolated protein was tested for its ribonuclease activity against oligoribonucleotides and commercially available E. coli tRNATyr I. Analysis of MC1 digestion products by ion-pairing reverse phase liquid-chromatography coupled with mass spectrometry (IP-RP-LC-MS) revealed enzymatic cleavage of RNA at the 5′-termini of uridine and pseudouridine, but cleavage was absent if the uridine was chemically modified or preceded by a nucleoside with a bulky modification. Furthermore, the utility of this enzyme to generate complementary digestion products to other common endonucleases, such as RNase T1, which enables the unambiguous mapping of modified residues in RNA is demonstrated. PMID:26221047
Hohn, Oliver; Hanke, Kirsten; Lausch, Veronika; Zimmermann, Anja; Mostafa, Saeed; Bannert, Norbert
2014-11-11
The HERV-K(HML-2) family contains the most recently integrated and best preserved endogenized proviral sequences in the human genome. All known elements have nevertheless been subjected to mutations or deletions that render expressed particles non-infectious. Moreover, these post-insertional mutations hamper the analysis of the general biological properties of this ancient virus family. The expression of consensus sequences and sequences of elements with reverted post-insertional mutations has therefore been very instrumental in overcoming this limitation. We investigated the particle morphology of a recently reconstituted HERV-K113 element termed oriHERV-K113 using thin-section electron microscopy (EM) and could demonstrate that strong overexpression by substitution of the 5'LTR for a CMV promoter and partial codon optimization altered the virus assembly type and morphology. This included a conversion from the regular C-type to an A-type morphology with a mass of cytoplasmic immature cores tethered to the cell membrane and the membranes of vesicles. Overexpression permitted the release and maturation of virions but reduced the envelope content. A weaker boost of virus expression by Staufen-1 was not sufficient to induce these morphological alterations.
Hohn, Oliver; Hanke, Kirsten; Lausch, Veronika; Zimmermann, Anja; Mostafa, Saeed; Bannert, Norbert
2014-01-01
The HERV-K(HML-2) family contains the most recently integrated and best preserved endogenized proviral sequences in the human genome. All known elements have nevertheless been subjected to mutations or deletions that render expressed particles non-infectious. Moreover, these post-insertional mutations hamper the analysis of the general biological properties of this ancient virus family. The expression of consensus sequences and sequences of elements with reverted post-insertional mutations has therefore been very instrumental in overcoming this limitation. We investigated the particle morphology of a recently reconstituted HERV-K113 element termed oriHERV-K113 using thin-section electron microscopy (EM) and could demonstrate that strong overexpression by substitution of the 5'LTR for a CMV promoter and partial codon optimization altered the virus assembly type and morphology. This included a conversion from the regular C-type to an A-type morphology with a mass of cytoplasmic immature cores tethered to the cell membrane and the membranes of vesicles. Overexpression permitted the release and maturation of virions but reduced the envelope content. A weaker boost of virus expression by Staufen-1 was not sufficient to induce these morphological alterations. PMID:25393897
Genome-wide analysis of codon usage bias in Ebolavirus.
Cristina, Juan; Moreno, Pilar; Moratorio, Gonzalo; Musto, Héctor
2015-01-22
Ebola virus (EBOV) is a member of the family Filoviridae and its genome consists of a 19-kb, single-stranded, negative sense RNA. EBOV is subdivided into five distinct species with different pathogenicities, being Zaire ebolavirus (ZEBOV) the most lethal species. The interplay of codon usage among viruses and their hosts is expected to affect overall viral survival, fitness, evasion from host's immune system and evolution. In the present study, we performed comprehensive analyses of codon usage and composition of ZEBOV. Effective number of codons (ENC) indicates that the overall codon usage among ZEBOV strains is slightly biased. Different codon preferences in ZEBOV genes in relation to codon usage of human genes were found. Highly preferred codons are all A-ending triplets, which strongly suggests that mutational bias is a main force shaping codon usage in ZEBOV. Dinucleotide composition also plays a role in the overall pattern of ZEBOV codon usage. ZEBOV does not seem to use the most abundant tRNAs present in the human cells for most of their preferred codons. Copyright © 2014 Elsevier B.V. All rights reserved.
Colagrossi, Luna; Hermans, Lucas E; Salpini, Romina; Di Carlo, Domenico; Pas, Suzan D; Alvarez, Marta; Ben-Ari, Ziv; Boland, Greet; Bruzzone, Bianca; Coppola, Nicola; Seguin-Devaux, Carole; Dyda, Tomasz; Garcia, Federico; Kaiser, Rolf; Köse, Sukran; Krarup, Henrik; Lazarevic, Ivana; Lunar, Maja M; Maylin, Sarah; Micheli, Valeria; Mor, Orna; Paraschiv, Simona; Paraskevis, Dimitros; Poljak, Mario; Puchhammer-Stöckl, Elisabeth; Simon, François; Stanojevic, Maja; Stene-Johansen, Kathrine; Tihic, Nijaz; Trimoulet, Pascale; Verheyen, Jens; Vince, Adriana; Lepej, Snjezana Zidovec; Weis, Nina; Yalcinkaya, Tülay; Boucher, Charles A B; Wensing, Annemarie M J; Perno, Carlo F; Svicher, Valentina
2018-06-01
HBsAg immune-escape mutations can favor HBV-transmission also in vaccinated individuals, promote immunosuppression-driven HBV-reactivation, and increase fitness of drug-resistant strains. Stop-codons can enhance HBV oncogenic-properties. Furthermore, as a consequence of the overlapping structure of HBV genome, some immune-escape mutations or stop-codons in HBsAg can derive from drug-resistance mutations in RT. This study is aimed at gaining insight in prevalence and characteristics of immune-associated escape mutations, and stop-codons in HBsAg in chronically HBV-infected patients experiencing nucleos(t)ide analogues (NA) in Europe. This study analyzed 828 chronically HBV-infected European patients exposed to ≥ 1 NA, with detectable HBV-DNA and with an available HBsAg-sequence. The immune-associated escape mutations and the NA-induced immune-escape mutations sI195M, sI196S, and sE164D (resulting from drug-resistance mutation rtM204 V, rtM204I, and rtV173L) were retrieved from literature and examined. Mutations were defined as an aminoacid substitution with respect to a genotype A or D reference sequence. At least one immune-associated escape mutation was detected in 22.1% of patients with rising temporal-trend. By multivariable-analysis, genotype-D correlated with higher selection of ≥ 1 immune-associated escape mutation (OR[95%CI]:2.20[1.32-3.67], P = 0.002). In genotype-D, the presence of ≥ 1 immune-associated escape mutations was significantly higher in drug-exposed patients with drug-resistant strains than with wild-type virus (29.5% vs 20.3% P = 0.012). Result confirmed by analysing drug-naïve patients (29.5% vs 21.2%, P = 0.032). Strong correlation was observed between sP120T and rtM204I/V (P < 0.001), and their co-presence determined an increased HBV-DNA. At least one NA-induced immune-escape mutation occurred in 28.6% of patients, and their selection correlated with genotype-A (OR[95%CI]:2.03[1.32-3.10],P = 0.001). Finally, stop-codons are present in 8.4% of patients also at HBsAg-positions 172 and 182, described to enhance viral oncogenic-properties. Immune-escape mutations and stop-codons develop in a large fraction of NA-exposed patients from Europe. This may represent a potential threat for horizontal and vertical HBV transmission also to vaccinated persons, and fuel drug-resistance emergence.
Minigene-like inhibition of protein synthesis mediated by hungry codons near the start codon
Jacinto-Loeza, Eva; Vivanco-Domínguez, Serafín; Guarneros, Gabriel; Hernández-Sánchez, Javier
2008-01-01
Rare AGA or AGG codons close to the initiation codon inhibit protein synthesis by a tRNA-sequestering mechanism as toxic minigenes do. To further understand this mechanism, a parallel analysis of protein synthesis and peptidyl-tRNA accumulation was performed using both a set of lacZ constructs where AGAAGA codons were moved codon by codon from +2, +3 up to +7, +8 positions and a series of 3–8 codon minigenes containing AGAAGA codons before the stop codon. β-Galactosidase synthesis from the AGAAGA lacZ constructs (in a Pth defective in vitro system without exogenous tRNA) diminished as the AGAAGA codons were closer to AUG codon. Likewise, β-galactosidase expression from the reporter +7 AGA lacZ gene (plus tRNA, 0.25 μg/μl) waned as the AGAAGAUAA minigene shortened. Pth counteracted both the length-dependent minigene effect on the expression of β-galactosidase from the +7 AGA lacZ reporter gene and the positional effect from the AGAAGA lacZ constructs. The +2, +3 AGAAGA lacZ construct and the shortest +2, +3 AGAAGAUAA minigene accumulated the highest percentage of peptidyl-tRNAArg4. These observations lead us to propose that hungry codons at early positions, albeit with less strength, inhibit protein synthesis by a minigene-like mechanism involving accumulation of peptidyl-tRNA. PMID:18583364
Synonymous codon usage of genes in polymerase complex of Newcastle disease virus.
Kumar, Chandra Shekhar; Kumar, Sachin
2017-06-01
Newcastle disease virus (NDV) is pathogenic to both avian and non-avian species but extensively finds poultry as its primary host and causes heavy economic losses in the poultry industry. In this study, a total of 186 polymerase complex comprising of nucleoprotein (N), phosphoprotein (P), and large polymerase (L) genes of NDV was analyzed for synonymous codon usage. The relative synonymous codon usage and effective number of codons (ENC) values were used to estimate codon usage variation in each gene. Correspondence analysis (COA) was used to study the major trend in codon usage variation. Analyzing the ENC plot values against GC3s (at synonymous third codon position) we concluded that mutational pressure was the main factor determining codon usage bias than translational selection in NDV N, P, and L genes. Moreover, correlation analysis indicated, that aromaticity of N, P, and L genes also influenced the codon usage variation. The varied distribution of pathotypes for N, P, and L gene clearly suggests that change in codon usage for NDV is pathotype specific. The codon usage preference similarity in N, P, and L gene might be detrimental for polymerase complex functioning. The study represents a comprehensive analysis to date of N, P, and L genes codon usage pattern of NDV and provides a basic understanding of the mechanisms for codon usage bias. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Lal, Devi; Verma, Mansi; Behura, Susanta K; Lal, Rup
2016-10-01
Actinobacteria are Gram-positive bacteria commonly found in soil, freshwater and marine ecosystems. In this investigation, bias in codon usages of ninety actinobacterial genomes was analyzed by estimating different indices of codon bias such as Nc (effective number of codons), SCUO (synonymous codon usage order), RSCU (relative synonymous codon usage), as well as sequence patterns of codon contexts. The results revealed several characteristic features of codon usage in Actinobacteria, as follows: 1) C- or G-ending codons are used frequently in comparison with A- and U ending codons; 2) there is a direct relationship of GC content with use of specific amino acids such as alanine, proline and glycine; 3) there is an inverse relationship between GC content and Nc estimates, 4) there is low SCUO value (<0.5) for most genes; and 5) GCC-GCC, GCC-GGC, GCC-GAG and CUC-GAC are the frequent context sequences among codons. This study highlights the fact that: 1) in Actinobacteria, extreme GC content and codon bias are driven by mutation rather than natural selection; (2) traits like aerobicity are associated with effective natural selection and therefore low GC content and low codon bias, demonstrating the role of both mutational bias and translational selection in shaping the habitat and phenotype of actinobacterial species. Copyright © 2016 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
A detailed analysis of codon usage patterns and influencing factors in Zika virus.
Singh, Niraj K; Tyagi, Anuj
2017-07-01
Recent outbreaks of Zika virus (ZIKV) in Africa, Latin America, Europe, and Southeast Asia have resulted in serious health concerns. To understand more about evolution and transmission of ZIKV, detailed codon usage analysis was performed for all available strains. A high effective number of codons (ENC) value indicated the presence of low codon usage bias in ZIKV. The effect of mutational pressure on codon usage bias was confirmed by significant correlations between nucleotide compositions at third codon positions and ENCs. Correlation analysis between Gravy values, Aroma values and nucleotide compositions at third codon positions also indicated some influence of natural selection. However, the low codon adaptation index (CAI) value of ZIKV with reference to human and mosquito indicated poor adaptation of ZIKV codon usage towards its hosts, signifying that natural selection has a weaker influence than mutational pressure. Additionally, relative dinucleotide frequencies, geographical distribution, and evolutionary processes also influenced the codon usage pattern to some extent.
CodonLogo: a sequence logo-based viewer for codon patterns.
Sharma, Virag; Murphy, David P; Provan, Gregory; Baranov, Pavel V
2012-07-15
Conserved patterns across a multiple sequence alignment can be visualized by generating sequence logos. Sequence logos show each column in the alignment as stacks of symbol(s) where the height of a stack is proportional to its informational content, whereas the height of each symbol within the stack is proportional to its frequency in the column. Sequence logos use symbols of either nucleotide or amino acid alphabets. However, certain regulatory signals in messenger RNA (mRNA) act as combinations of codons. Yet no tool is available for visualization of conserved codon patterns. We present the first application which allows visualization of conserved regions in a multiple sequence alignment in the context of codons. CodonLogo is based on WebLogo3 and uses the same heuristics but treats codons as inseparable units of a 64-letter alphabet. CodonLogo can discriminate patterns of codon conservation from patterns of nucleotide conservation that appear indistinguishable in standard sequence logos. The CodonLogo source code and its implementation (in a local version of the Galaxy Browser) are available at http://recode.ucc.ie/CodonLogo and through the Galaxy Tool Shed at http://toolshed.g2.bx.psu.edu/.
Chhapekar, Sushil; Raghavendrarao, Sanagala; Pavan, Gadamchetty; Ramakrishna, Chopperla; Singh, Vivek Kumar; Phanindra, Mullapudi Lakshmi Venkata; Dhandapani, Gurusamy; Sreevathsa, Rohini; Ananda Kumar, Polumetla
2015-05-01
Highly tolerant herbicide-resistant transgenic rice was developed by expressing codon-modified synthetic CP4--EPSPS. The transformants could tolerate up to 1% commercial glyphosate and has the potential to be used for DSR (direct-seeded rice). Weed infestation is one of the major biotic stress factors that is responsible for yield loss in direct-seeded rice (DSR). Herbicide-resistant rice has potential to improve the efficiency of weed management under DSR. Hence, the popular indica rice cultivar IR64, was genetically modified using Agrobacterium-mediated transformation with a codon-optimized CP4-EPSPS (5-enolpyruvylshikimate-3-phosphate synthase) gene, with N-terminal chloroplast targeting peptide from Petunia hybrida. Integration of the transgenes in the selected rice plants was confirmed by Southern hybridization and expression by Northern and herbicide tolerance assays. Transgenic plants showed EPSPS enzyme activity even at high concentrations of glyphosate, compared to untransformed control plants. T0, T1 and T2 lines were tested by herbicide bioassay and it was confirmed that the transgenic rice could tolerate up to 1% of commercial Roundup, which is five times more in dose used to kill weeds under field condition. All together, the transgenic rice plants developed in the present study could be used efficiently to overcome weed menace.
Aydemir, Cumhur; Onay, Huseyin; Oguz, Serife Suna; Ozdemir, Taha Resid; Erdeve, Omer; Ozkinay, Ferda; Dilmen, Ugur
2011-09-01
Preterm neonates are susceptible to infection due to a combination of sub-optimal immunity and increased exposure to invasive organisms. Invasive fungal infections are associated with significant morbidity and mortality among preterm infants cared for in the neonatal intensive care unit (NICU). Mannose-binding lectin (MBL) is a component of the innate immune system, which may be especially important in the neonatal setting. The objective of this study was to investigate the presence of any association between MBL gene polymorphism and nosocomial invasive fungal infection in preterm neonates. Codon 54 (B allele) polymorphism in exon 1 of the MBL gene was investigated in 31 patients diagnosed as nosocomial invasive fungal infection and 30 control preterm neonates. AB genotype was determined in 26% and 30% of patient and control groups, respectively, and the difference was not statistically significant. AA genotype was determined in 74% of the patient group and in 67% of the control group, and the difference was not statistically significant. B allele frequency was not different significantly in the patient group (13%) compared to the control group (18%). In our study, no relationship was found between MBL codon 54 gene polymorphism and the risk of nosocomial invasive fungal infection in preterm neonates in NICU.
Rationalizing context-dependent performance of dynamic RNA regulatory devices.
Kent, Ross; Halliwell, Samantha; Young, Kate; Swainston, Neil; Dixon, Neil
2018-06-21
The ability of RNA to sense, regulate and store information is an attractive attribute for a variety of functional applications including the development of regulatory control devices for synthetic biology. RNA folding and function is known to be highly context sensitive, which limits the modularity and reuse of RNA regulatory devices to control different heterologous sequences and genes. We explored the cause and effect of sequence context sensitivity for translational ON riboswitches located in the 5' UTR, by constructing and screening a library of N-terminal synonymous codon variants. By altering the N-terminal codon usage we were able to obtain RNA devices with a broad range of functional performance properties (ON, OFF, fold-change). Linear regression and calculated metrics were used to rationalize the major determining features leading to optimal riboswitch performance, and to identify multiple interactions between the explanatory metrics. Finally, partial least squared (PLS) analysis was employed in order to understand the metrics and their respective effect on performance. This PLS model was shown to provide good explanation of our library. This study provides a novel multi-variant analysis framework by which to rationalize the codon context performance of allosteric RNA-devices. The framework will also serve as a platform for future riboswitch context engineering endeavors.
Daniell, Henry; Chan, Hui-Ting; Pasoreck, Elise K.
2017-01-01
Plastid-made biopharmaceuticals treat major metabolic or genetic disorders, including Alzheimer’s, diabetes, hypertension, hemophilia, and retinopathy. Booster vaccines made in chloroplasts prevent global infectious diseases, such as tuberculosis, malaria, cholera, and polio, and biological threats, such as anthrax and plague. Recent advances in this field include commercial-scale production of human therapeutic proteins in FDA-approved cGMP facilities, development of tags to deliver protein drugs to targeted human cells or tissues, methods to deliver precise doses, and long-term stability of protein drugs at ambient temperature, maintaining their efficacy. Codon optimization utilizing valuable information from sequenced chloroplast genomes enhanced expression of eukaryotic human or viral genes in chloroplasts and offered unique insights into translation in chloroplasts. Support from major biopharmaceutical companies, development of hydroponic production systems, and evaluation by regulatory agencies, including the CDC, FDA, and USDA, augur well for advancing this novel concept to the clinic and revolutionizing affordable healthcare. PMID:27893966
Cheng, Zhuan; Jiang, Jiaqi; Wu, Hui; Li, Zhimin; Ye, Qin
2016-01-01
In this study, production of 3-HP via malonyl-CoA was investigated by using metabolically engineered Escherichia coli carrying heterogeneous acetyl-CoA carboxylase (Acc) from Corynebacterium glutamicum and codon-optimized malonyl-CoA reductase (MCR) from Chloroflexus aurantiacus. Three engineered E. coli strains with different host-vector systems were constructed and investigated. The results indicated that the combination of E. coli BL21(DE3) and pET28a was the most efficient host-vector system for 3-HP production, and the highest concentration of 3-HP attained in shake flask cultivation reached 1.80g/L by the strain BE-MDA with induction at 0.25mM IPTG and 25°C, and supplementation of NaHCO3 and biotin. In fed-batch fermentation performed in a 5-L reactor, the concentration of 3-HP achieved 10.08g/L in 36h. Copyright © 2015 Elsevier Ltd. All rights reserved.
Codon 219 polymorphism of PRNP in healthy caucasians and Creutzfeldt-Jakob disease patients
DOE Office of Scientific and Technical Information (OSTI.GOV)
Petraroli, R.; Pocchiari, M.
1996-04-01
A number of point and insert mutations of the PrP gene (PRNP) have been linked to familial Creutzfeldt-Jakob disease (CJD) and Gerstmann-Straussler-Scheinker disease (GSS). Moreover, the methionine/valine homozygosity at the polymorphic codon 129 of PRNP may cause a predisposition to sporadic and iatrogenic CJD or may control the age at onset of familial cases carrying either the 144-bp insertion or codon 178, codon 198, and codon 210 pathogenic mutations in PRNP. In addition, the association of methionine or valine at codon 129 and the point mutation at codon 178 on the same allele seem to play an important role inmore » determining either fatal familial insomnia or CJD. However, it is noteworthy that a relationship between codon 129 polymorphism and accelerated pathogenesis (early age at onset or shorter duration of the disease) has not been seen in familial CJD patients with codon 200 mutation or in GSS patients with codon 102 mutation, arguing that other, as yet unidentified, gene products or environmental factors, or both, may influence the clinical expression of these diseases. 17 refs.« less
Molecular Genetic Analysis and Evolution of Segment 7 in Rice Black-Streaked Dwarf Virus in China
Chen, Yanping; Wu, Jirong; Meng, Qingchang; Han, Xiaohua; Hao, Zhuanfang; Li, Mingshun; Yong, Hongjun; Zhang, Degui; Zhang, Shihuang; Li, Xinhai
2015-01-01
Rice black-streaked dwarf virus (RBSDV) causes maize rough dwarf disease or rice black-streaked dwarf disease and can lead to severe yield losses in maize and rice. To analyse RBSDV evolution, codon usage bias and genetic structure were investigated in 111 maize and rice RBSDV isolates from eight geographic locations in 2013 and 2014. The linear dsRNA S7 is A+U rich, with overall codon usage biased toward codons ending with A (A3s, S7-1: 32.64%, S7-2: 29.95%) or U (U3s, S7-1: 44.18%, S7-2: 46.06%). Effective number of codons (Nc) values of 45.63 in S7-1 (the first open reading frame of S7) and 39.96 in S7-2 (the second open reading frame of S7) indicate low degrees of RBSDV-S7 codon usage bias, likely driven by mutational bias regardless of year, host, or geographical origin. Twelve optimal codons were detected in S7. The nucleotide diversity (π) of S7 sequences in 2013 isolates (0.0307) was significantly higher than in 2014 isolates (0.0244, P = 0.0226). The nucleotide diversity (π) of S7 sequences in isolates from Jinan (0.0391) was higher than that from the other seven locations (P < 0.01). Only one S7 recombinant was detected in Baoding. RBSDV isolates could be phylogenetically classified into two groups according to S7 sequences, and further classified into two subgroups. S7-1 and S7-2 were under negative and purifying selection, with respective Ka/Ks ratios of 0.0179 and 0.0537. These RBSDV populations were expanding (P < 0.01) as indicated by negative values for Tajima's D, Fu and Li's D, and Fu and Li's F. Genetic differentiation was detected in six RBSDV subpopulations (P < 0.05). Absolute Fst (0.0790) and Nm (65.12) between 2013 and 2014, absolute Fst (0.1720) and Nm (38.49) between maize and rice, and absolute Fst values of 0.0085-0.3069 and Nm values of 0.56-29.61 among these eight geographic locations revealed frequent gene flow between subpopulations. Gene flow between 2013 and 2014 was the most frequent. PMID:26121638
Butanaev, A M
1994-01-01
The hygromycin phosphotransferase gene (hpt) from E. coli under the control of the SV40 early promoter was used as a dominant selectable marker for transformation of Chlamydomonas reinhardtii. Cells were transformed by electroporation (pulse length, 2 ms, field strength, 1 kV/cm). The culture growth phase was a crucial parameter for transformation (optimal density approximately 10(6) cells/ml). It was possible to obtain approximately 10(3) Hyg-resistant colonies under these conditions. Foreign DNA integrated into the Chlamydomonas genome was maintained for at least 8 months but the Hyg-resistant phenotype of the transformed clones was unstable. The frequency of codon usage in the hpt gene was compared with the one in Chlamydomonas nuclear genes. It is supposed that highly biased codon usage in Chlamydomonas does not preclude expression. Advantages of this selection system for studying Chlamydomonas transformation by heterologous genes are discussed.
Overkamp, Wout; Beilharz, Katrin; Detert Oude Weme, Ruud; Solopova, Ana; Karsens, Harma; Kovács, Ákos T.; Kok, Jan
2013-01-01
Green fluorescent protein (GFP) offers efficient ways of visualizing promoter activity and protein localization in vivo, and many different variants are currently available to study bacterial cell biology. Which of these variants is best suited for a certain bacterial strain, goal, or experimental condition is not clear. Here, we have designed and constructed two “superfolder” GFPs with codon adaptation specifically for Bacillus subtilis and Streptococcus pneumoniae and have benchmarked them against five other previously available variants of GFP in B. subtilis, S. pneumoniae, and Lactococcus lactis, using promoter-gfp fusions. Surprisingly, the best-performing GFP under our experimental conditions in B. subtilis was the one codon optimized for S. pneumoniae and vice versa. The data and tools described in this study will be useful for cell biology studies in low-GC-rich Gram-positive bacteria. PMID:23956387
USDA-ARS?s Scientific Manuscript database
Meleagrid herpesvirus type 1 (MeHV-1) is an ideal vector for the expression of antigens from pathogenic avian organisms in order to generate vaccines. Chicken parvovirus (ChPV) is a widespread infectious virus that causes serious disease in chickens. It is one of the etiological agents largely suspe...
Dammeyer, Thorben; Steinwand, Miriam; Krüger, Sarah-C; Dübel, Stefan; Hust, Michael; Timmis, Kenneth N
2011-02-21
Recombinant antibody fragments have a wide range of applications in research, diagnostics and therapy. For many of these, small fragments like single chain fragment variables (scFv) function well and can be produced inexpensively in bacterial expression systems. Although Escherichia coli K-12 production systems are convenient, yields of different fragments, even those produced from codon-optimized expression systems, vary significantly. Where yields are inadequate, alternative production systems are needed. Pseudomonas putida strain KT2440 is a versatile biosafety strain known for good expression of heterologous genes, so we have explored its utility as a cell factory for production of scFvs. We have generated new broad host range scFv expression constructs and assessed their production in the Pseudomonas putida KT2440 host. Two scFvs bind either to human C-reactive protein or to mucin1, proteins of significant medical diagnostic and therapeutic interest, whereas a third is a model anti-lysozyme scFv. The KT2440 antibody expression systems produce scFvs targeted to the periplasmic space that were processed precisely and were easily recovered and purified by single-step or tandem affinity chromatography. The influence of promoter system, codon optimization for P. putida, and medium on scFv yield was examined. Yields of up to 3.5 mg/l of pure, soluble, active scFv fragments were obtained from shake flask cultures of constructs based on the original codon usage and expressed from the Ptac expression system, yields that were 2.5-4 times higher than those from equivalent cultures of an E. coli K-12 expression host. Pseudomonas putida KT2440 is a good cell factory for the production of scFvs, and the broad host range constructs we have produced allow yield assessment in a number of different expression hosts when yields in one initially selected are insufficient. High cell density cultivation and further optimization and refinement of the KT2440 cell factory will achieve additional increases in the yields of scFvs.
Emergent rules for codon choice elucidated by editing rare arginine codons in Escherichia coli
Napolitano, Michael G.; Landon, Matthieu; Gregg, Christopher J.; Lajoie, Marc J.; Govindarajan, Lakshmi; Mosberg, Joshua A.; Kuznetsov, Gleb; Goodman, Daniel B.; Vargas-Rodriguez, Oscar; Isaacs, Farren J.; Söll, Dieter; Church, George M.
2016-01-01
The degeneracy of the genetic code allows nucleic acids to encode amino acid identity as well as noncoding information for gene regulation and genome maintenance. The rare arginine codons AGA and AGG (AGR) present a case study in codon choice, with AGRs encoding important transcriptional and translational properties distinct from the other synonymous alternatives (CGN). We created a strain of Escherichia coli with all 123 instances of AGR codons removed from all essential genes. We readily replaced 110 AGR codons with the synonymous CGU codons, but the remaining 13 “recalcitrant” AGRs required diversification to identify viable alternatives. Successful replacement codons tended to conserve local ribosomal binding site-like motifs and local mRNA secondary structure, sometimes at the expense of amino acid identity. Based on these observations, we empirically defined metrics for a multidimensional “safe replacement zone” (SRZ) within which alternative codons are more likely to be viable. To evaluate synonymous and nonsynonymous alternatives to essential AGRs further, we implemented a CRISPR/Cas9-based method to deplete a diversified population of a wild-type allele, allowing us to evaluate exhaustively the fitness impact of all 64 codon alternatives. Using this method, we confirmed the relevance of the SRZ by tracking codon fitness over time in 14 different genes, finding that codons that fall outside the SRZ are rapidly depleted from a growing population. Our unbiased and systematic strategy for identifying unpredicted design flaws in synthetic genomes and for elucidating rules governing codon choice will be crucial for designing genomes exhibiting radically altered genetic codes. PMID:27601680
Panicker, Indu S.; Browning, Glenn F.; Markham, Philip F.
2015-01-01
While the genomes of many Mycoplasma species have been sequenced, there are no collated data on translational start codon usage, and the effects of alternate start codons on gene expression have not been studied. Analysis of the annotated genomes found that ATG was the most prevalent translational start codon among Mycoplasma spp. However in Mycoplasma gallisepticum a GTG start codon is commonly used in the vlhA multigene family, which encodes a highly abundant, phase variable lipoprotein adhesin. Therefore, the effect of this alternate start codon on expression of a reporter PhoA lipoprotein was examined in M. gallisepticum. Mutation of the start codon from ATG to GTG resulted in a 2.5 fold reduction in the level of transcription of the phoA reporter, but the level of PhoA activity in the transformants containing phoA with a GTG start codon was only 63% of that of the transformants with a phoA with an ATG start codon, suggesting that GTG was a more efficient translational initiation codon. The effect of swapping the translational start codon in phoA reporter gene expression was less in M. gallisepticum than has been seen previously in Escherichia coli or Bacillus subtilis, suggesting the process of translational initiation in mycoplasmas may have some significant differences from those used in other bacteria. This is the first study of translational start codon usage in mycoplasmas and the impact of the use of an alternate start codon on expression in these bacteria. PMID:26010086
Immunomodulator-based enhancement of anti smallpox immune responses.
Martínez, Osmarie; Miranda, Eric; Ramírez, Maite; Santos, Saritza; Rivera, Carlos; Vázquez, Luis; Sánchez, Tomás; Tremblay, Raymond L; Ríos-Olivares, Eddy; Otero, Miguel
2015-01-01
The current live vaccinia virus vaccine used in the prevention of smallpox is contraindicated for millions of immune-compromised individuals. Although vaccination with the current smallpox vaccine produces protective immunity, it might result in mild to serious health complications for some vaccinees. Thus, there is a critical need for the production of a safe virus-free vaccine against smallpox that is available to everyone. For that reason, we investigated the impact of imiquimod and resiquimod (Toll-like receptors agonists), and the codon-usage optimization of the vaccinia virus A27L gene in the enhancement of the immune response, with intent of producing a safe, virus-free DNA vaccine coding for the A27 vaccinia virus protein. We analyzed the cellular-immune response by measuring the IFN-γ production of splenocytes by ELISPOT, the humoral-immune responses measuring total IgG and IgG2a/IgG1 ratios by ELISA, and the TH1 and TH2 cytokine profiles by ELISA, in mice immunized with our vaccine formulation. The proposed vaccine formulation enhanced the A27L vaccine-mediated production of IFN-γ on mouse spleens, and increased the humoral immunity with a TH1-biased response. Also, our vaccine induced a TH1 cytokine milieu, which is important against viral infections. These results support the efforts to find a new mechanism to enhance an immune response against smallpox, through the implementation of a safe, virus-free DNA vaccination platform.
Immunomodulator-Based Enhancement of Anti Smallpox Immune Responses
Martínez, Osmarie; Miranda, Eric; Ramírez, Maite; Santos, Saritza; Rivera, Carlos; Vázquez, Luis; Sánchez, Tomás; Tremblay, Raymond L.; Ríos-Olivares, Eddy; Otero, Miguel
2015-01-01
Background The current live vaccinia virus vaccine used in the prevention of smallpox is contraindicated for millions of immune-compromised individuals. Although vaccination with the current smallpox vaccine produces protective immunity, it might result in mild to serious health complications for some vaccinees. Thus, there is a critical need for the production of a safe virus-free vaccine against smallpox that is available to everyone. For that reason, we investigated the impact of imiquimod and resiquimod (Toll-like receptors agonists), and the codon-usage optimization of the vaccinia virus A27L gene in the enhancement of the immune response, with intent of producing a safe, virus-free DNA vaccine coding for the A27 vaccinia virus protein. Methods We analyzed the cellular-immune response by measuring the IFN-γ production of splenocytes by ELISPOT, the humoral-immune responses measuring total IgG and IgG2a/IgG1 ratios by ELISA, and the TH1 and TH2 cytokine profiles by ELISA, in mice immunized with our vaccine formulation. Results The proposed vaccine formulation enhanced the A27L vaccine-mediated production of IFN-γ on mouse spleens, and increased the humoral immunity with a TH1-biased response. Also, our vaccine induced a TH1 cytokine milieu, which is important against viral infections. Conclusion These results support the efforts to find a new mechanism to enhance an immune response against smallpox, through the implementation of a safe, virus-free DNA vaccination platform. PMID:25875833
Mealey, Robert H.; Leib, Steven R.; Littke, Matt H.; Wagner, Bettina; Horohov, David W.; McGuire, Travis C.
2009-01-01
Effective DNA-based vaccines against lentiviruses will likely induce CTL against conserved viral proteins. Equine infectious anemia virus (EIAV) infects horses worldwide, and serves as a useful model for lentiviral immune control. Although attenuated live EIAV vaccines have induced protective immune responses, DNA-based vaccines have not. In particular, DNA-based vaccines have had limited success in inducing CTL responses against intracellular pathogens in the horse. We hypothesized that priming with a codon-optimized plasmid encoding EIAV Gag p15/p26 with co-administration of a plasmid encoding an equine IL-2/IgG fusion protein as a molecular adjuvant, followed by boosting with a vaccinia vector expressing Gag p15/p26, would induce protective Gag-specific CTL responses. Although the regimen induced Gag-specific CTL in four of seven vaccinated horses, CTL were not detected until after the vaccinia boost, and protective effects were not observed in EIAV challenged vaccinates. Unexpectedly, vaccinates had significantly higher viral loads and more severe clinical disease, associated with the presence of vaccine-induced CTL. It was concluded that 1.) further optimization of the timing and route of DNA immunization was needed for efficient CTL priming in vivo, 2.) co-administration of the IL-2/IgG plasmid did not enhance CTL priming by the Gag p15/p26 plasmid, 3.) vaccinia vectors are useful for lentivirus-specific CTL induction in the horse, 4.) Gag-specific CTL alone are either insufficient or a more robust Gag-specific CTL response is needed to limit EIAV viremia and clinical disease, and 5.) CTL-inducing vaccines lacking envelope immunogens can result in lentiviral disease enhancement. Although the mechanisms for enhancement associated with this vaccine regimen remain to be elucidated, these results have important implications for development of lentivirus T cell vaccines. PMID:19368787
Bowen, D; Littlechild, J A; Fothergill, J E; Watson, H C; Hall, L
1988-01-01
Using oligonucleotide probes derived from amino acid sequencing information, the structural gene for phosphoglycerate kinase from the extreme thermophile, Thermus thermophilus, was cloned in Escherichia coli and its complete nucleotide sequence determined. The gene consists of an open reading frame corresponding to a protein of 390 amino acid residues (calculated Mr 41,791) with an extreme bias for G or C (93.1%) in the codon third base position. Comparison of the deduced amino acid sequence with that of the corresponding mesophilic yeast enzyme indicated a number of significant differences. These are discussed in terms of the unusual codon bias and their possible role in enhanced protein thermal stability. Images Fig. 1. PMID:3052437
Proteome Adaptation to High Temperatures in the Ectothermic Hydrothermal Vent Pompeii Worm
Jollivet, Didier; Mary, Jean; Gagnière, Nicolas; Tanguy, Arnaud; Fontanillas, Eric; Boutet, Isabelle; Hourdez, Stéphane; Segurens, Béatrice; Weissenbach, Jean; Poch, Olivier; Lecompte, Odile
2012-01-01
Taking advantage of the massive genome sequencing effort made on thermophilic prokaryotes, thermal adaptation has been extensively studied by analysing amino acid replacements and codon usage in these unicellular organisms. In most cases, adaptation to thermophily is associated with greater residue hydrophobicity and more charged residues. Both of these characteristics are positively correlated with the optimal growth temperature of prokaryotes. In contrast, little information has been collected on the molecular ‘adaptive’ strategy of thermophilic eukaryotes. The Pompeii worm A. pompejana, whose transcriptome has recently been sequenced, is currently considered as the most thermotolerant eukaryote on Earth, withstanding the greatest thermal and chemical ranges known. We investigated the amino-acid composition bias of ribosomal proteins in the Pompeii worm when compared to other lophotrochozoans and checked for putative adaptive changes during the course of evolution using codon-based Maximum likelihood analyses. We then provided a comparative analysis of codon usage and amino-acid replacements from a greater set of orthologous genes between the Pompeii worm and Paralvinella grasslei, one of its closest relatives living in a much cooler habitat. Analyses reveal that both species display the same high GC-biased codon usage and amino-acid patterns favoring both positively-charged residues and protein hydrophobicity. These patterns may be indicative of an ancestral adaptation to the deep sea and/or thermophily. In addition, the Pompeii worm displays a set of amino-acid change patterns that may explain its greater thermotolerance, with a significant increase in Tyr, Lys and Ala against Val, Met and Gly. Present results indicate that, together with a high content in charged residues, greater proportion of smaller aliphatic residues, and especially alanine, may be a different path for metazoans to face relatively ‘high’ temperatures and thus a novelty in thermophilic metazoans. PMID:22348046
Proteome adaptation to high temperatures in the ectothermic hydrothermal vent Pompeii worm.
Jollivet, Didier; Mary, Jean; Gagnière, Nicolas; Tanguy, Arnaud; Fontanillas, Eric; Boutet, Isabelle; Hourdez, Stéphane; Segurens, Béatrice; Weissenbach, Jean; Poch, Olivier; Lecompte, Odile
2012-01-01
Taking advantage of the massive genome sequencing effort made on thermophilic prokaryotes, thermal adaptation has been extensively studied by analysing amino acid replacements and codon usage in these unicellular organisms. In most cases, adaptation to thermophily is associated with greater residue hydrophobicity and more charged residues. Both of these characteristics are positively correlated with the optimal growth temperature of prokaryotes. In contrast, little information has been collected on the molecular 'adaptive' strategy of thermophilic eukaryotes. The Pompeii worm A. pompejana, whose transcriptome has recently been sequenced, is currently considered as the most thermotolerant eukaryote on Earth, withstanding the greatest thermal and chemical ranges known. We investigated the amino-acid composition bias of ribosomal proteins in the Pompeii worm when compared to other lophotrochozoans and checked for putative adaptive changes during the course of evolution using codon-based Maximum likelihood analyses. We then provided a comparative analysis of codon usage and amino-acid replacements from a greater set of orthologous genes between the Pompeii worm and Paralvinella grasslei, one of its closest relatives living in a much cooler habitat. Analyses reveal that both species display the same high GC-biased codon usage and amino-acid patterns favoring both positively-charged residues and protein hydrophobicity. These patterns may be indicative of an ancestral adaptation to the deep sea and/or thermophily. In addition, the Pompeii worm displays a set of amino-acid change patterns that may explain its greater thermotolerance, with a significant increase in Tyr, Lys and Ala against Val, Met and Gly. Present results indicate that, together with a high content in charged residues, greater proportion of smaller aliphatic residues, and especially alanine, may be a different path for metazoans to face relatively 'high' temperatures and thus a novelty in thermophilic metazoans.
Metabolic engineering of oleaginous yeast Yarrowia lipolytica for limonene overproduction.
Cao, Xuan; Lv, Yu-Bei; Chen, Jun; Imanaka, Tadayuki; Wei, Liu-Jing; Hua, Qiang
2016-01-01
Limonene, a monocyclic monoterpene, is known for its using as an important precursor of many flavoring, pharmaceutical, and biodiesel products. Currently, d-limonene has been produced via fractionation from essential oils or as a byproduct of orange juice production, however, considering the increasing need for limonene and a certain amount of pesticides may exist in the limonene obtained from the citrus industry, some other methods should be explored to produce limonene. To construct the limonene synthetic pathway in Yarrowia lipolytica , two genes encoding neryl diphosphate synthase 1 (NDPS1) and limonene synthase (LS) were codon-optimized and heterologously expressed in Y. lipolytica . Furthermore, to maximize limonene production, several genes involved in the MVA pathway were overexpressed, either in different copies of the same gene or in combination. Finally with the optimized pyruvic acid and dodecane concentration in flask culture, a maximum limonene titer and content of 23.56 mg/L and 1.36 mg/g DCW were achieved in the final engineered strain Po1f-LN-051, showing approximately 226-fold increase compared with the initial yield 0.006 mg/g DCW. This is the first report on limonene biosynthesis in oleaginous yeast Y. lipolytica by heterologous expression of codon-optimized tLS and tNDPS1 genes. To our knowledge, the limonene production 23.56 mg/L, is the highest limonene production level reported in yeast. In short, we demonstrate that Y. lipolytica provides a compelling platform for the overproduction of limonene derivatives, and even other monoterpenes.
SENCA: A Multilayered Codon Model to Study the Origins and Dynamics of Codon Usage
Pouyet, Fanny; Bailly-Bechet, Marc; Mouchiroud, Dominique; Guéguen, Laurent
2016-01-01
Gene sequences are the target of evolution operating at different levels, including the nucleotide, codon, and amino acid levels. Disentangling the impact of those different levels on gene sequences requires developing a probabilistic model with three layers. Here we present SENCA (site evolution of nucleotides, codons, and amino acids), a codon substitution model that separately describes 1) nucleotide processes which apply on all sites of a sequence such as the mutational bias, 2) preferences between synonymous codons, and 3) preferences among amino acids. We argue that most synonymous substitutions are not neutral and that SENCA provides more accurate estimates of selection compared with more classical codon sequence models. We study the forces that drive the genomic content evolution, intraspecifically in the core genome of 21 prokaryotes and interspecifically for five Enterobacteria. We retrieve the existence of a universal mutational bias toward AT, and that taking into account selection on synonymous codon usage has consequences on the measurement of selection on nonsynonymous substitutions. We also confirm that codon usage bias is mostly driven by selection on preferred codons. We propose new summary statistics to measure the relative importance of the different evolutionary processes acting on sequences. PMID:27401173
Barik, Sailen
2017-12-01
A significant number of proteins in all living species contains amino acid repeats (AARs) of various lengths and compositions, many of which play important roles in protein structure and function. Here, I have surveyed select homopolymeric single [(A)n] and double [(AB)n] AARs in the human proteome. A close examination of their codon pattern and analysis of RNA structure propensity led to the following set of empirical rules: (1) One class of amino acid repeats (Class I) uses a mixture of synonymous codons, some of which approximate the codon bias ratio in the overall human proteome; (2) The second class (Class II) disregards the codon bias ratio, and appears to have originated by simple repetition of the same codon (or just a few codons); and finally, (3) In all AARs (including Class I, Class II, and the in-betweens), the codons are chosen in a manner that precludes the formation of RNA secondary structure. It appears that the AAR genes have evolved by orchestrating a balance between codon usage and mRNA secondary structure. The insights gained here should provide a better understanding of AAR evolution and may assist in designing synthetic genes.
Synonymous codon usage patterns in different parasitic platyhelminth mitochondrial genomes.
Chen, L; Yang, D Y; Liu, T F; Nong, X; Huang, X; Xie, Y; Fu, Y; Zheng, W P; Zhang, R H; Wu, X H; Gu, X B; Wang, S X; Peng, X R; Yang, G Y
2013-02-27
We analyzed synonymous codon usage patterns of the mitochondrial genomes of 43 parasitic platyhelminth species. The relative synonymous codon usage, the effective number of codons (NC) and the frequency of G+C at the third synonymously variable coding position were calculated. Correspondence analysis was used to determine the major variation trends shaping the codon usage patterns. Among the mitochondrial genomes of 19 trematode species, the GC content of third codon positions varied from 0.151 to 0.592, with a mean of 0.295 ± 0.116. In cestodes, the mean GC content of third codon positions was 0.254 ± 0.044. A comparison of the nucleotide composition at 4-fold synonymous sites revealed that, on average, there was a greater abundance of codons ending on U (51.9%) or A (22.7%) than on C (6.3%) or G (19.14%). Twenty-two codons, including UUU, UUA and UUG, were frequently used. In the NC-plot, most of points were distributed well below or around the expected NC curve. In addition to compositional constraints, the degree of hydrophobicity and the aromatic amino acids also influenced codon usage in the mitochondrial genomes of these 43 parasitic platyhelminth species.
On fuzzy semantic similarity measure for DNA coding.
Ahmad, Muneer; Jung, Low Tang; Bhuiyan, Md Al-Amin
2016-02-01
A coding measure scheme numerically translates the DNA sequence to a time domain signal for protein coding regions identification. A number of coding measure schemes based on numerology, geometry, fixed mapping, statistical characteristics and chemical attributes of nucleotides have been proposed in recent decades. Such coding measure schemes lack the biologically meaningful aspects of nucleotide data and hence do not significantly discriminate coding regions from non-coding regions. This paper presents a novel fuzzy semantic similarity measure (FSSM) coding scheme centering on FSSM codons׳ clustering and genetic code context of nucleotides. Certain natural characteristics of nucleotides i.e. appearance as a unique combination of triplets, preserving special structure and occurrence, and ability to own and share density distributions in codons have been exploited in FSSM. The nucleotides׳ fuzzy behaviors, semantic similarities and defuzzification based on the center of gravity of nucleotides revealed a strong correlation between nucleotides in codons. The proposed FSSM coding scheme attains a significant enhancement in coding regions identification i.e. 36-133% as compared to other existing coding measure schemes tested over more than 250 benchmarked and randomly taken DNA datasets of different organisms. Copyright © 2015 Elsevier Ltd. All rights reserved.
Foley, Kendra C; Spear, Timothy T; Murray, David C; Nagato, Kaoru; Garrett-Mayer, Elizabeth; Nishimura, Michael I
2017-06-16
T cell receptor (TCR)-gene-modified T cells for adoptive cell transfer can mediate objective clinical responses in melanoma and other malignancies. When introducing a second TCR, mispairing between the endogenous and introduced α and β TCR chains limits expression of the introduced TCR, which can result in impaired efficacy or off-target reactivity and autoimmunity. One approach to promote proper TCR chain pairing involves modifications of the introduced TCR genes: introducing a disulfide bridge, substituting murine for human constant regions, codon optimization, TCR chain leucine zipper fusions, and a single-chain TCR. We have introduced these modifications into our hepatitis C virus (HCV) reactive TCR and utilize a marker gene, CD34t, which allows us to directly compare transduction efficiency with TCR expression and T cell function. Our results reveal that of the TCRs tested, T cells expressing the murine Cβ2 TCR or leucine zipper TCR have the highest levels of expression and the highest percentage of lytic and interferon-γ (IFN-γ)-producing T cells. Our studies give us a better understanding of how TCR modifications impact TCR expression and T cell function that may allow for optimization of TCR-modified T cells for adoptive cell transfer to treat patients with malignancies.
Behura, Susanta K; Severson, David W
2013-02-01
Codon usage bias refers to the phenomenon where specific codons are used more often than other synonymous codons during translation of genes, the extent of which varies within and among species. Molecular evolutionary investigations suggest that codon bias is manifested as a result of balance between mutational and translational selection of such genes and that this phenomenon is widespread across species and may contribute to genome evolution in a significant manner. With the advent of whole-genome sequencing of numerous species, both prokaryotes and eukaryotes, genome-wide patterns of codon bias are emerging in different organisms. Various factors such as expression level, GC content, recombination rates, RNA stability, codon position, gene length and others (including environmental stress and population size) can influence codon usage bias within and among species. Moreover, there has been a continuous quest towards developing new concepts and tools to measure the extent of codon usage bias of genes. In this review, we outline the fundamental concepts of evolution of the genetic code, discuss various factors that may influence biased usage of synonymous codons and then outline different principles and methods of measurement of codon usage bias. Finally, we discuss selected studies performed using whole-genome sequences of different insect species to show how codon bias patterns vary within and among genomes. We conclude with generalized remarks on specific emerging aspects of codon bias studies and highlight the recent explosion of genome-sequencing efforts on arthropods (such as twelve Drosophila species, species of ants, honeybee, Nasonia and Anopheles mosquitoes as well as the recent launch of a genome-sequencing project involving 5000 insects and other arthropods) that may help us to understand better the evolution of codon bias and its biological significance. © 2012 The Authors. Biological Reviews © 2012 Cambridge Philosophical Society.
Trotta, Edoardo
2016-05-17
The three stop codons UAA, UAG, and UGA signal the termination of mRNA translation. As a result of a mechanism that is not adequately understood, they are normally used with unequal frequencies. In this work, we showed that selective forces and mutational biases drive stop codon usage in the human genome. We found that, in respect to sense codons, stop codon usage was affected by stronger selective forces but was less influenced by neutral mutational biases. UGA is the most frequent termination codon in human genome. However, UAA was the preferred stop codon in genes with high breadth of expression, high level of expression, AT-rich coding sequences, housekeeping functions, and in gene ontology categories with the largest deviation from expected stop codon usage. Selective forces associated with the breadth and the level of expression favoured AT-rich sequences in the mRNA region including the stop site and its proximal 3'-UTR, but acted with scarce effects on sense codons, generating two regions, upstream and downstream of the stop codon, with strongly different base composition. By favouring low levels of GC-content, selection promoted labile local secondary structures at the stop site and its proximal 3'-UTR. The compositional and structural context favoured by selection was surprisingly emphasized in the class of ribosomal proteins and was consistent with sequence elements that increase the efficiency of translational termination. Stop codons were also heterogeneously distributed among chromosomes by a mechanism that was strongly correlated with the GC-content of coding sequences. In human genome, the nucleotide composition and the thermodynamic stability of stop codon site and its proximal 3'-UTR are correlated with the GC-content of coding sequences and with the breadth and the level of gene expression. In highly expressed genes stop codon usage is compositionally and structurally consistent with highly efficient translation termination signals.
2012-01-01
Background Influenza A virus (IAV) is a member of the family Orthomyxoviridae and contains eight segments of a single-stranded RNA genome with negative polarity. The first influenza pandemic of this century was declared in April of 2009, with the emergence of a novel H1N1 IAV strain (H1N1pdm) in Mexico and USA. Understanding the extent and causes of biases in codon usage is essential to the understanding of viral evolution. A comprehensive study to investigate the effect of selection pressure imposed by the human host on the codon usage of an emerging, pandemic IAV strain and the trends in viral codon usage involved over the pandemic time period is much needed. Results We performed a comprehensive codon usage analysis of 310 IAV strains from the pandemic of 2009. Highly biased codon usage for Ala, Arg, Pro, Thr and Ser were found. Codon usage is strongly influenced by underlying biases in base composition. When correspondence analysis (COA) on relative synonymous codon usage (RSCU) is applied, the distribution of IAV ORFs in the plane defined by the first two major dimensional factors showed that different strains are located at different places, suggesting that IAV codon usage also reflects an evolutionary process. Conclusions A general association between codon usage bias, base composition and poor adaptation of the virus to the respective host tRNA pool, suggests that mutational pressure is the main force shaping H1N1 pdm IAV codon usage. A dynamic process is observed in the variation of codon usage of the strains enrolled in these studies. These results suggest a balance of mutational bias and natural selection, which allow the virus to explore and re-adapt its codon usage to different environments. Recoding of IAV taking into account codon bias, base composition and adaptation to host tRNA may provide important clues to develop new and appropriate vaccines. PMID:23134595
Evaluating Sense Codon Reassignment with a Simple Fluorescence Screen.
Biddle, Wil; Schmitt, Margaret A; Fisk, John D
2015-12-22
Understanding the interactions that drive the fidelity of the genetic code and the limits to which modifications can be made without breaking the translational system has practical implications for understanding the molecular mechanisms of evolution as well as expanding the set of encodable amino acids, particularly those with chemistries not provided by Nature. Because 61 sense codons encode 20 amino acids, reassigning the meaning of sense codons provides an avenue for biosynthetic modification of proteins, furthering both fundamental and applied biochemical research. We developed a simple screen that exploits the absolute requirement for fluorescence of an active site tyrosine in green fluorescent protein (GFP) to probe the pliability of the degeneracy of the genetic code. Our screen monitors the restoration of the fluorophore of GFP by incorporation of a tyrosine in response to a sense codon typically assigned another meaning in the genetic code. We evaluated sense codon reassignment at four of the 21 sense codons read through wobble interactions in Escherichia coli using the Methanocaldococcus jannaschii orthogonal tRNA/aminoacyl tRNA synthetase pair originally developed and commonly used for amber stop codon suppression. By changing only the anticodon of the orthogonal tRNA, we achieved sense codon reassignment efficiencies between 1% (Phe UUU) and 6% (Lys AAG). Each of the orthogonal tRNAs preferentially decoded the codon traditionally read via a wobble interaction in E. coli with the exception of the orthogonal tRNA with an AUG anticodon, which incorporated tyrosine in response to both the His CAU and His CAC codons with approximately equal frequencies. We applied our screen in a high-throughput manner to evaluate a 10(9)-member combined tRNA/aminoacyl tRNA synthetase library to identify improved sense codon reassigning variants for the Lys AAG codon. A single rapid screen with the ability to broadly evaluate reassignable codons will facilitate identification and improvement of the combinations of sense codons and orthogonal pairs that display efficient reassignment.
Vendeix, Franck A P; Murphy, Frank V; Cantara, William A; Leszczyńska, Grażyna; Gustilo, Estella M; Sproat, Brian; Malkiewicz, Andrzej; Agris, Paul F
2012-03-02
Human tRNA(Lys3)(UUU) (htRNA(Lys3)(UUU)) decodes the lysine codons AAA and AAG during translation and also plays a crucial role as the primer for HIV-1 (human immunodeficiency virus type 1) reverse transcription. The posttranscriptional modifications 5-methoxycarbonylmethyl-2-thiouridine (mcm(5)s(2)U(34)), 2-methylthio-N(6)-threonylcarbamoyladenosine (ms(2)t(6)A(37)), and pseudouridine (Ψ(39)) in the tRNA's anticodon domain are critical for ribosomal binding and HIV-1 reverse transcription. To understand the importance of modified nucleoside contributions, we determined the structure and function of this tRNA's anticodon stem and loop (ASL) domain with these modifications at positions 34, 37, and 39, respectively (hASL(Lys3)(UUU)-mcm(5)s(2)U(34);ms(2)t(6)A(37);Ψ(39)). Ribosome binding assays in vitro revealed that the hASL(Lys3)(UUU)-mcm(5)s(2)U(34);ms(2)t(6)A(37);Ψ(39) bound AAA and AAG codons, whereas binding of the unmodified ASL(Lys3)(UUU) was barely detectable. The UV hyperchromicity, the circular dichroism, and the structural analyses indicated that Ψ(39) enhanced the thermodynamic stability of the ASL through base stacking while ms(2)t(6)A(37) restrained the anticodon to adopt an open loop conformation that is required for ribosomal binding. The NMR-restrained molecular-dynamics-derived solution structure revealed that the modifications provided an open, ordered loop for codon binding. The crystal structures of the hASL(Lys3)(UUU)-mcm(5)s(2)U(34);ms(2)t(6)A(37);Ψ(39) bound to the 30S ribosomal subunit with each codon in the A site showed that the modified nucleotides mcm(5)s(2)U(34) and ms(2)t(6)A(37) participate in the stability of the anticodon-codon interaction. Importantly, the mcm(5)s(2)U(34)·G(3) wobble base pair is in the Watson-Crick geometry, requiring unusual hydrogen bonding to G in which mcm(5)s(2)U(34) must shift from the keto to the enol form. The results unambiguously demonstrate that modifications pre-structure the anticodon as a key prerequisite for efficient and accurate recognition of cognate and wobble codons. Copyright © 2011 Elsevier Ltd. All rights reserved.
2014-01-01
Background KRAS mutations in codons 12 and 13 are established predictive biomarkers for anti-EGFR therapy in colorectal cancer. Previous studies suggest that KRAS codon 61 and 146 mutations may also predict resistance to anti-EGFR therapy in colorectal cancer. However, clinicopathological, molecular, and prognostic features of colorectal carcinoma with KRAS codon 61 or 146 mutation remain unclear. Methods We utilized a molecular pathological epidemiology database of 1267 colon and rectal cancers in the Nurse’s Health Study and the Health Professionals Follow-up Study. We examined KRAS mutations in codons 12, 13, 61 and 146 (assessed by pyrosequencing), in relation to clinicopathological features, and tumor molecular markers, including BRAF and PIK3CA mutations, CpG island methylator phenotype (CIMP), LINE-1 methylation, and microsatellite instability (MSI). Survival analyses were performed in 1067 BRAF-wild-type cancers to avoid confounding by BRAF mutation. Cox proportional hazards models were used to compute mortality hazard ratio, adjusting for potential confounders, including disease stage, PIK3CA mutation, CIMP, LINE-1 hypomethylation, and MSI. Results KRAS codon 61 mutations were detected in 19 cases (1.5%), and codon 146 mutations in 40 cases (3.2%). Overall KRAS mutation prevalence in colorectal cancers was 40% (=505/1267). Of interest, compared to KRAS-wild-type, overall, KRAS-mutated cancers more frequently exhibited cecal location (24% vs. 12% in KRAS-wild-type; P < 0.0001), CIMP-low (49% vs. 32% in KRAS-wild-type; P < 0.0001), and PIK3CA mutations (24% vs. 11% in KRAS-wild-type; P < 0.0001). These trends were evident irrespective of mutated codon, though statistical power was limited for codon 61 mutants. Neither KRAS codon 61 nor codon 146 mutation was significantly associated with clinical outcome or prognosis in univariate or multivariate analysis [colorectal cancer-specific mortality hazard ratio (HR) = 0.81, 95% confidence interval (CI) = 0.29-2.26 for codon 61 mutation; colorectal cancer-specific mortality HR = 0.86, 95% CI = 0.42-1.78 for codon 146 mutation]. Conclusions Tumors with KRAS mutations in codons 61 and 146 account for an appreciable proportion (approximately 5%) of colorectal cancers, and their clinicopathological and molecular features appear generally similar to KRAS codon 12 or 13 mutated cancers. To further assess clinical utility of KRAS codon 61 and 146 testing, large-scale trials are warranted. PMID:24885062
Therapy for Duchenne muscular dystrophy: renewed optimism from genetic approaches.
Fairclough, Rebecca J; Wood, Matthew J; Davies, Kay E
2013-06-01
Duchenne muscular dystrophy (DMD) is a devastating progressive disease for which there is currently no effective treatment except palliative therapy. There are several promising genetic approaches, including viral delivery of the missing dystrophin gene, read-through of translation stop codons, exon skipping to restore the reading frame and increased expression of the compensatory utrophin gene. The lessons learned from these approaches will be applicable to many other disorders.
Tsukamoto, Takashi; Inoue, Keiichi; Kandori, Hideki; Sudo, Yuki
2013-01-01
So far retinylidene proteins (∼rhodopsin) have not been discovered in thermophilic organisms. In this study we investigated and characterized a microbial rhodopsin derived from the extreme thermophilic bacterium Thermus thermophilus, which lives in a hot spring at around 75 °C. The gene for the retinylidene protein, named thermophilic rhodopsin (TR), was chemically synthesized with codon optimization. The codon optimized TR protein was functionally expressed in the cell membranes of Escherichia coli cells and showed active proton transport upon photoillumination. Spectroscopic measurements revealed that the purified TR bound only all-trans-retinal as a chromophore and showed an absorption maximum at 530 nm. In addition, TR exhibited both photocycle kinetics and pH-dependent absorption changes, which are characteristic of rhodopsins. Of note, time-dependent thermal denaturation experiments revealed that TR maintained its absorption even at 75 °C, and the denaturation rate constant of TR was much lower than those of other proton pumping rhodopsins such as archaerhodopsin-3 (200 ×), Haloquadratum walsbyi bacteriorhodopsin (by 10-times), and Gloeobacter rhodopsin (100 ×). Thus, these results suggest that microbial rhodopsins are also distributed among thermophilic organisms and have high stability. TR should allow the investigation of the molecular mechanisms of ion transport and protein folding. PMID:23740255
Tsukamoto, Takashi; Inoue, Keiichi; Kandori, Hideki; Sudo, Yuki
2013-07-26
So far retinylidene proteins (∼rhodopsin) have not been discovered in thermophilic organisms. In this study we investigated and characterized a microbial rhodopsin derived from the extreme thermophilic bacterium Thermus thermophilus, which lives in a hot spring at around 75 °C. The gene for the retinylidene protein, named thermophilic rhodopsin (TR), was chemically synthesized with codon optimization. The codon optimized TR protein was functionally expressed in the cell membranes of Escherichia coli cells and showed active proton transport upon photoillumination. Spectroscopic measurements revealed that the purified TR bound only all-trans-retinal as a chromophore and showed an absorption maximum at 530 nm. In addition, TR exhibited both photocycle kinetics and pH-dependent absorption changes, which are characteristic of rhodopsins. Of note, time-dependent thermal denaturation experiments revealed that TR maintained its absorption even at 75 °C, and the denaturation rate constant of TR was much lower than those of other proton pumping rhodopsins such as archaerhodopsin-3 (200 ×), Haloquadratum walsbyi bacteriorhodopsin (by 10-times), and Gloeobacter rhodopsin (100 ×). Thus, these results suggest that microbial rhodopsins are also distributed among thermophilic organisms and have high stability. TR should allow the investigation of the molecular mechanisms of ion transport and protein folding.
Codon adaptation and synonymous substitution rate in diatom plastid genes.
Morton, Brian R; Sorhannus, Ulf; Fox, Martin
2002-07-01
Diatom plastid genes are examined with respect to codon adaptation and rates of silent substitution (Ks). It is shown that diatom genes follow the same pattern of codon usage as other plastid genes studied previously. Highly expressed diatom genes display codon adaptation, or a bias toward specific major codons, and these major codons are the same as those in red algae, green algae, and land plants. It is also found that there is a strong correlation between Ks and variation in codon adaptation across diatom genes, providing the first evidence for such a relationship in the algae. It is argued that this finding supports the notion that the correlation arises from selective constraints, not from variation in mutation rate among genes. Finally, the diatom genes are examined with respect to variation in Ks among different synonymous groups. Diatom genes with strong codon adaptation do not show the same variation in synonymous substitution rate among codon groups as the flowering plant psbA gene which, previous studies have shown, has strong codon adaptation but unusually high rates of silent change in certain synonymous groups. The lack of a similar finding in diatoms supports the suggestion that the feature is unique to the flowering plant psbA due to recent relaxations in selective pressure in that lineage.
2014-01-01
Background mRNA translation involves simultaneous movement of multiple ribosomes on the mRNA and is also subject to regulatory mechanisms at different stages. Translation can be described by various codon-based models, including ODE, TASEP, and Petri net models. Although such models have been extensively used, the overlap and differences between these models and the implications of the assumptions of each model has not been systematically elucidated. The selection of the most appropriate modelling framework, and the most appropriate way to develop coarse-grained/fine-grained models in different contexts is not clear. Results We systematically analyze and compare how different modelling methodologies can be used to describe translation. We define various statistically equivalent codon-based simulation algorithms and analyze the importance of the update rule in determining the steady state, an aspect often neglected. Then a novel probabilistic Boolean network (PBN) model is proposed for modelling translation, which enjoys an exact numerical solution. This solution matches those of numerical simulation from other methods and acts as a complementary tool to analytical approximations and simulations. The advantages and limitations of various codon-based models are compared, and illustrated by examples with real biological complexities such as slow codons, premature termination and feedback regulation. Our studies reveal that while different models gives broadly similiar trends in many cases, important differences also arise and can be clearly seen, in the dependence of the translation rate on different parameters. Furthermore, the update rule affects the steady state solution. Conclusions The codon-based models are based on different levels of abstraction. Our analysis suggests that a multiple model approach to understanding translation allows one to ascertain which aspects of the conclusions are robust with respect to the choice of modelling methodology, and when (and why) important differences may arise. This approach also allows for an optimal use of analysis tools, which is especially important when additional complexities or regulatory mechanisms are included. This approach can provide a robust platform for dissecting translation, and results in an improved predictive framework for applications in systems and synthetic biology. PMID:24576337
Analysis of Synonymous Codon Usage Bias of Zika Virus and Its Adaption to the Hosts
Wang, Hongju; Liu, Siqing; Zhang, Bo
2016-01-01
Zika virus (ZIKV) is a mosquito-borne virus (arbovirus) in the family Flaviviridae, and the symptoms caused by ZIKV infection in humans include rash, fever, arthralgia, myalgia, asthenia and conjunctivitis. Codon usage bias analysis can reveal much about the molecular evolution and host adaption of ZIKV. To gain insight into the evolutionary characteristics of ZIKV, we performed a comprehensive analysis on the codon usage pattern in 46 ZIKV strains by calculating the effective number of codons (ENc), codon adaptation index (CAI), relative synonymous codon usage (RSCU), and other indicators. The results indicate that the codon usage bias of ZIKV is relatively low. Several lines of evidence support the hypothesis that translational selection plays a role in shaping the codon usage pattern of ZIKV. The results from a correspondence analysis (CA) indicate that other factors, such as base composition, aromaticity, and hydrophobicity may also be involved in shaping the codon usage pattern of ZIKV. Additionally, the results from a comparative analysis of RSCU between ZIKV and its hosts suggest that ZIKV tends to evolve codon usage patterns that are comparable to those of its hosts. Moreover, selection pressure from Homo sapiens on the ZIKV RSCU patterns was found to be dominant compared with that from Aedes aegypti and Aedes albopictus. Taken together, both natural translational selection and mutation pressure are important for shaping the codon usage pattern of ZIKV. Our findings contribute to understanding the evolution of ZIKV and its adaption to its hosts. PMID:27893824
Problem-Solving Test: The Effect of Synonymous Codons on Gene Expression
ERIC Educational Resources Information Center
Szeberenyi, Jozsef
2009-01-01
Terms to be familiar with before you start to solve the test: the genetic code, codon, degenerate codons, protein synthesis, aminoacyl-tRNA, anticodon, antiparallel orientation, wobble, unambiguous codons, ribosomes, initiation, elongation and termination of translation, peptidyl transferase, translocation, degenerate oligonucleotides, green…
Castro-Chavez, Fernando
2011-01-01
My previous theoretical research shows that the rotating circular genetic code is a viable tool to make easier to distinguish the rules of variation applied to the amino acid exchange; it presents a precise and positional bio-mathematical balance of codons, according to the amino acids they codify. Here, I demonstrate that when using the conventional or classic circular genetic code, a clearer pattern for the human codon usage per amino acid and per genome emerges. The most used human codons per amino acid were the ones ending with the three hydrogen bond nucleotides: C for 12 amino acids and G for the remaining 8, plus one codon for arginine ending in A that was used approximately with the same frequency than the one ending in G for this same amino acid (plus *). The most used codons in man fall almost all the time at the rightmost position, clockwise, ending either in C or in G within the circular genetic code. The human codon usage per genome is compared to other organisms such as fruit flies (Drosophila melanogaster), squid (Loligo pealei), and many others. The biosemiotic codon usage of each genomic population or ‘Theme’ is equated to a ‘molecular language’. The C/U choice or difference, and the G/A difference in the third nucleotide of the most used codons per amino acid are illustrated by comparing the most used codons per genome in humans and squids. The human distribution in the third position of most used codons is a 12-8-2, C-G-A, nucleotide ending signature, while the squid distribution in the third position of most used codons was an odd, or uneven, distribution in the third position of its most used codons: 13-6-3, U-A-G, as its nucleotide ending signature. These findings may help to design computational tools to compare human genomes, to determine the exchangeability between compatible codons and amino acids, and for the early detection of incompatible changes leading to hereditary diseases. PMID:22997484
Analysis of the synonymous codon usage bias in recently emerged enterovirus D68 strains.
Karniychuk, Uladzimir U
2016-09-02
Understanding the codon usage pattern of a pathogen and relationship between pathogen and host's codon usage patterns has fundamental and applied interests. Enterovirus D68 (EV-D68) is an emerging pathogen with a potentially high public health significance. In the present study, the synonymous codon usage bias of 27 recently emerged, and historical EV-D68 strains was analyzed. In contrast to previously studied enteroviruses (enterovirus 71 and poliovirus), EV-D68 and human host have a high discrepancy between favored codons. Analysis of viral synonymous codon usage bias metrics, viral nucleotide/dinucleotide compositional parameters, and viral protein properties showed that mutational pressure is more involved in shaping the synonymous codon usage bias of EV-D68 than translation selection. Computation of codon adaptation indices allowed to estimate expression potential of the EV-D68 genome in several commonly used laboratory animals. This approach requires experimental validation and may provide an auxiliary tool for the rational selection of laboratory animals to model emerging viral diseases. Enterovirus D68 genome compositional and codon usage data can be useful for further pathogenesis, animal model, and vaccine design studies. Copyright © 2016 Elsevier B.V. All rights reserved.
Differences in codon bias cannot explain differences in translational power among microbes.
Dethlefsen, Les; Schmidt, Thomas M
2005-01-06
Translational power is the cellular rate of protein synthesis normalized to the biomass invested in translational machinery. Published data suggest a previously unrecognized pattern: translational power is higher among rapidly growing microbes, and lower among slowly growing microbes. One factor known to affect translational power is biased use of synonymous codons. The correlation within an organism between expression level and degree of codon bias among genes of Escherichia coli and other bacteria capable of rapid growth is commonly attributed to selection for high translational power. Conversely, the absence of such a correlation in some slowly growing microbes has been interpreted as the absence of selection for translational power. Because codon bias caused by translational selection varies between rapidly growing and slowly growing microbes, we investigated whether observed differences in translational power among microbes could be explained entirely by differences in the degree of codon bias. Although the data are not available to estimate the effect of codon bias in other species, we developed an empirically-based mathematical model to compare the translation rate of E. coli to the translation rate of a hypothetical strain which differs from E. coli only by lacking codon bias. Our reanalysis of data from the scientific literature suggests that translational power can differ by a factor of 5 or more between E. coli and slowly growing microbial species. Using empirical codon-specific in vivo translation rates for 29 codons, and several scenarios for extrapolating from these data to estimates over all codons, we find that codon bias cannot account for more than a doubling of the translation rate in E. coli, even with unrealistic simplifying assumptions that exaggerate the effect of codon bias. With more realistic assumptions, our best estimate is that codon bias accelerates translation in E. coli by no more than 60% in comparison to microbes with very little codon bias. While codon bias confers a substantial benefit of faster translation and hence greater translational power, the magnitude of this effect is insufficient to explain observed differences in translational power among bacterial and archaeal species, particularly the differences between slowly growing and rapidly growing species. Hence, large differences in translational power suggest that the translational apparatus itself differs among microbes in ways that influence translational performance.
Leskiw, B K; Lawlor, E J; Fernandez-Abalos, J M; Chater, K F
1991-01-01
In Streptomyces coelicolor A3(2) and the related species Streptomyces lividans 66, aerial mycelium formation and antibiotic production are blocked by mutations in bldA, which specifies a tRNA(Leu)-like gene product which would recognize the UUA codon. Here we show that phenotypic expression of three disparate genes (carB, lacZ, and ampC) containing TTA codons depends strongly on bldA. Site-directed mutagenesis of carB, changing its two TTA codons to CTC (leucine) codons, resulted in bldA-independent expression; hence the bldA product is the principal tRNA for the UUA codon. Two other genes (hyg and aad) containing TTA codons show a medium-dependent reduction in phenotypic expression (hygromycin resistance and spectinomycin resistance, respectively) in bldA mutants. For hyg, evidence is presented that the UUA codon is probably being translated by a tRNA with an imperfectly matched anticodon, giving very low levels of gene product but relatively high resistance to hygromycin. It is proposed that TTA codons may be generally absent from genes expressed during vegetative growth and from the structural genes for differentiation and antibiotic production but present in some regulatory and resistance genes associated with the latter processes. The codon may therefore play a role in developmental regulation. Images PMID:1826053
Zhao, Yongchao; Zheng, Hao; Xu, Anying; Yan, Donghua; Jiang, Zijian; Qi, Qi; Sun, Jingchen
2016-08-24
Analysis of codon usage bias is an extremely versatile method using in furthering understanding of the genetic and evolutionary paths of species. Codon usage bias of envelope glycoprotein genes in nuclear polyhedrosis virus (NPV) has remained largely unexplored at present. Hence, the codon usage bias of NPV envelope glycoprotein was analyzed here to reveal the genetic and evolutionary relationships between different viral species in baculovirus genus. A total of 9236 codons from 18 different species of NPV of the baculovirus genera were used to perform this analysis. Glycoprotein of NPV exhibits weaker codon usage bias. Neutrality plot analysis and correlation analysis of effective number of codons (ENC) values indicate that natural selection is the main factor influencing codon usage bias, and that the impact of mutation pressure is relatively smaller. Another cluster analysis shows that the kinship or evolutionary relationships of these viral species can be divided into two broad categories despite all of these 18 species are from the same baculovirus genus. There are many elements that can affect codon bias, such as the composition of amino acids, mutation pressure, natural selection, gene expression level, and etc. In the meantime, cluster analysis also illustrates that codon usage bias of virus envelope glycoprotein can serve as an effective means of evolutionary classification in baculovirus genus.
Efficient initiation of mammalian mRNA translation at a CUG codon.
Dasso, M C; Jackson, R J
1989-01-01
Nucleotide substitutions were made at the initiation codon of an influenza virus NS cDNA clone in a vector carrying the bacteriophage T7 promoter. When capped mRNA transcripts of these constructs were translated in the rabbit reticulocyte lysate, a change in the initiation codon from...AUAAUGG...to...AUACUGG...reduced the in vitro translational efficiency by only 50-60%, and resulted in only a small increase in the yield of short products presumed to be initiated at downstream sites. Synthesis of the full-length product was initiated exclusively at the mutated codon, with negligible use either of in-frame upstream CUG or GUG codons, or of an in-frame downstream GUG codon. We conclude that CUG has the potential to function as an efficient initiation codon in mammalian systems, at least in certain contexts. Images PMID:2780285
Codon usage analysis of photolyase encoding genes of cyanobacteria inhabiting diverse habitats.
Rajneesh; Pathak, Jainendra; Kannaujiya, Vinod K; Singh, Shailendra P; Sinha, Rajeshwar P
2017-07-01
Nucleotide and amino acid compositions were studied to determine the genomic and structural relationship of photolyase gene in freshwater, marine and hot spring cyanobacteria. Among three habitats, photolyase encoding genes from hot spring cyanobacteria were found to have highest GC content. The genomic GC content was found to influence the codon usage and amino acid variability in photolyases. The third position of codon was found to have more effect on amino acid variability in photolyases than the first and second positions of codon. The variation of amino acids Ala, Asp, Glu, Gly, His, Leu, Pro, Gln, Arg and Val in photolyases of three different habitats was found to be controlled by first position of codon (G1C1). However, second position (G2C2) of codon regulates variation of Ala, Cys, Gly, Pro, Arg, Ser, Thr and Tyr contents in photolyases. Third position (G3C3) of codon controls incorporation of amino acids such as Ala, Phe, Gly, Leu, Gln, Pro, Arg, Ser, Thr and Tyr in photolyases from three habitats. Photolyase encoding genes of hot spring cyanobacteria have 85% codons with G or C at third position, whereas marine and freshwater cyanobacteria showed 82 and 60% codons, respectively, with G or C at third position. Principal component analysis (PCA) showed that GC content has a profound effect in separating the genes along the first major axis according to their RSCU (relative synonymous codon usage) values, and neutrality analysis indicated that mutational pressure has resulted in codon bias in photolyase genes of cyanobacteria.
Optimizing complex phenotypes through model-guided multiplex genome engineering
Kuznetsov, Gleb; Goodman, Daniel B.; Filsinger, Gabriel T.; ...
2017-05-25
Here, we present a method for identifying genomic modifications that optimize a complex phenotype through multiplex genome engineering and predictive modeling. We apply our method to identify six single nucleotide mutations that recover 59% of the fitness defect exhibited by the 63-codon E. coli strain C321.ΔA. By introducing targeted combinations of changes in multiplex we generate rich genotypic and phenotypic diversity and characterize clones using whole-genome sequencing and doubling time measurements. Regularized multivariate linear regression accurately quantifies individual allelic effects and overcomes bias from hitchhiking mutations and context-dependence of genome editing efficiency that would confound other strategies.
Optimizing complex phenotypes through model-guided multiplex genome engineering
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kuznetsov, Gleb; Goodman, Daniel B.; Filsinger, Gabriel T.
Here, we present a method for identifying genomic modifications that optimize a complex phenotype through multiplex genome engineering and predictive modeling. We apply our method to identify six single nucleotide mutations that recover 59% of the fitness defect exhibited by the 63-codon E. coli strain C321.ΔA. By introducing targeted combinations of changes in multiplex we generate rich genotypic and phenotypic diversity and characterize clones using whole-genome sequencing and doubling time measurements. Regularized multivariate linear regression accurately quantifies individual allelic effects and overcomes bias from hitchhiking mutations and context-dependence of genome editing efficiency that would confound other strategies.
Liu, Yanbin; Koh, Chong Mei John; Ngoh, Si Te; Ji, Lianghui
2015-10-26
Rhodosporidium and Rhodotorula are two genera of oleaginous red yeast with great potential for industrial biotechnology. To date, there is no effective method for inducible expression of proteins and RNAs in these hosts. We have developed a luciferase gene reporter assay based on a new codon-optimized LUC2 reporter gene (RtLUC2), which is flanked with CAR2 homology arms and can be integrated into the CAR2 locus in the nuclear genome at >90 % efficiency. We characterized the upstream DNA sequence of a D-amino acid oxidase gene (DAO1) from R. toruloides ATCC 10657 by nested deletions. By comparing the upstream DNA sequences of several putative DAO1 homologs of Basidiomycetous fungi, we identified a conserved DNA motif with a consensus sequence of AGGXXGXAGX11GAXGAXGG within a 0.2 kb region from the mRNA translation initiation site. Deletion of this motif led to strong mRNA transcription under non-inducing conditions. Interestingly, DAO1 promoter activity was enhanced about fivefold when the 108 bp intron 1 was included in the reporter construct. We identified a conserved CT-rich motif in the intron with a consensus sequence of TYTCCCYCTCCYCCCCACWYCCGA, deletion or point mutations of which drastically reduced promoter strength under both inducing and non-inducing conditions. Additionally, we created a selection marker-free DAO1-null mutant (∆dao1e) which displayed greatly improved inducible gene expression, particularly when both glucose and nitrogen were present in high levels. To avoid adding unwanted peptide to proteins to be expressed, we converted the original translation initiation codon to ATC and re-created a translation initiation codon at the start of exon 2. This promoter, named P DAO1-in1m1 , showed very similar luciferase activity to the wild-type promoter upon induction with D-alanine. The inducible system was tunable by adjusting the levels of inducers, carbon source and nitrogen source. The intron 1-containing DAO1 promoters coupled with a DAO1 null mutant makes an efficient and tight D-amino acid-inducible gene expression system in Rhodosporidium and Rhodotorula genera. The system will be a valuable tool for metabolic engineering and enzyme expression in these yeast hosts.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Vendeix, Franck A.P.; Murphy, IV, Frank V.; Cantara, William A.
Human tRNA Lys3 UUU (htRNA Lys3 UUU) decodes the lysine codons AAA and AAG during translation and also plays a crucial role as the primer for HIV-1 (human immunodeficiency virus type 1) reverse transcription. The posttranscriptional modifications 5-methoxycarbonylmethyl-2-thiouridine (mcm 5s 2U 34), 2-methylthio-N 6-threonylcarbamoyladenosine (ms 2t 6A 37), and pseudouridine (Ψ 39) in the tRNA's anticodon domain are critical for ribosomal binding and HIV-1 reverse transcription. To understand the importance of modified nucleoside contributions, we determined the structure and function of this tRNA's anticodon stem and loop (ASL) domain with these modifications at positions 34, 37, and 39, respectively (hASLmore » Lys3 UUU-mcm 5s 2U 34;ms 2t 6A 37;Ψ 39). Ribosome binding assays in vitro revealed that the hASL Lys3 UUU-mcm 5s 2U 34;ms 2t 6A 37;Ψ 39 bound AAA and AAG codons, whereas binding of the unmodified ASL Lys3 UUU was barely detectable. The UV hyperchromicity, the circular dichroism, and the structural analyses indicated that Ψ 39 enhanced the thermodynamic stability of the ASL through base stacking while ms 2t 6A 37 restrained the anticodon to adopt an open loop conformation that is required for ribosomal binding. The NMR-restrained molecular-dynamics-derived solution structure revealed that the modifications provided an open, ordered loop for codon binding. The crystal structures of the hASL Lys3 UUU-mcm 5s 2U 34;ms 2t 6A 37;Ψ 39 bound to the 30S ribosomal subunit with each codon in the A site showed that the modified nucleotides mcm 5s 2U 34 and ms 2t 6A 37 participate in the stability of the anticodon–codon interaction. Importantly, the mcm 5s 2U 34·G 3 wobble base pair is in the Watson–Crick geometry, requiring unusual hydrogen bonding to G in which mcm 5s 2U 34 must shift from the keto to the enol form. The results unambiguously demonstrate that modifications pre-structure the anticodon as a key prerequisite for efficient and accurate recognition of cognate and wobble codons.« less
Stringent Nucleotide Recognition by the Ribosome at the Middle Codon Position.
Liu, Wei; Shin, Dongwon; Ng, Martin; Sanbonmatsu, Karissa Y; Tor, Yitzhak; Cooperman, Barry S
2017-08-29
Accurate translation of the genetic code depends on mRNA:tRNA codon:anticodon base pairing. Here we exploit an emissive, isosteric adenosine surrogate that allows direct measurement of the kinetics of codon:anticodon University of California base formation during protein synthesis. Our results suggest that codon:anticodon base pairing is subject to tighter constraints at the middle position than at the 5'- and 3'-positions, and further suggest a sequential mechanism of formation of the three base pairs in the codon:anticodon helix.
A Novel Method for Determining the Level of Viable Disseminated Prostate Cancer Cells
2012-10-01
Metridia luciferase, for use in a real-time viability assay for mammalian cells. The coding region of the marine copepod gene has been codon optimized for...need for multiple replicates of plates in time course studies. Recently a naturally secreted luciferase was identified and cloned from the marine ...well solid white flat bottom polystyrene microplates (Corning, Cat#3917, Lowell, MA). After 24 hours, conditioned media was harvested and remaining
A common periodic table of codons and amino acids.
Biro, J C; Benyó, B; Sansom, C; Szlávecz, A; Fördös, G; Micsik, T; Benyó, Z
2003-06-27
A periodic table of codons has been designed where the codons are in regular locations. The table has four fields (16 places in each) one with each of the four nucleotides (A, U, G, C) in the central codon position. Thus, AAA (lysine), UUU (phenylalanine), GGG (glycine), and CCC (proline) were placed into the corners of the fields as the main codons (and amino acids) of the fields. They were connected to each other by six axes. The resulting nucleic acid periodic table showed perfect axial symmetry for codons. The corresponding amino acid table also displaced periodicity regarding the biochemical properties (charge and hydropathy) of the 20 amino acids and the position of the stop signals. The table emphasizes the importance of the central nucleotide in the codons and predicts that purines control the charge while pyrimidines determine the polarity of the amino acids. This prediction was experimentally tested.
Codon usage and amino acid usage influence genes expression level.
Paul, Prosenjit; Malakar, Arup Kumar; Chakraborty, Supriyo
2018-02-01
Highly expressed genes in any species differ in the usage frequency of synonymous codons. The relative recurrence of an event of the favored codon pair (amino acid pairs) varies between gene and genomes due to varying gene expression and different base composition. Here we propose a new measure for predicting the gene expression level, i.e., codon plus amino bias index (CABI). Our approach is based on the relative bias of the favored codon pair inclination among the genes, illustrated by analyzing the CABI score of the Medicago truncatula genes. CABI showed strong correlation with all other widely used measures (CAI, RCBS, SCUO) for gene expression analysis. Surprisingly, CABI outperforms all other measures by showing better correlation with the wet-lab data. This emphasizes the importance of the neighboring codons of the favored codon in a synonymous group while estimating the expression level of a gene.
Lara-Ramírez, Edgar E.; Salazar, Ma Isabel; López-López, María de Jesús; Salas-Benito, Juan Santiago; Sánchez-Varela, Alejandro
2014-01-01
The increasing number of dengue virus (DENV) genome sequences available allows identifying the contributing factors to DENV evolution. In the present study, the codon usage in serotypes 1–4 (DENV1–4) has been explored for 3047 sequenced genomes using different statistics methods. The correlation analysis of total GC content (GC) with GC content at the three nucleotide positions of codons (GC1, GC2, and GC3) as well as the effective number of codons (ENC, ENCp) versus GC3 plots revealed mutational bias and purifying selection pressures as the major forces influencing the codon usage, but with distinct pressure on specific nucleotide position in the codon. The correspondence analysis (CA) and clustering analysis on relative synonymous codon usage (RSCU) within each serotype showed similar clustering patterns to the phylogenetic analysis of nucleotide sequences for DENV1–4. These clustering patterns are strongly related to the virus geographic origin. The phylogenetic dependence analysis also suggests that stabilizing selection acts on the codon usage bias. Our analysis of a large scale reveals new feature on DENV genomic evolution. PMID:25136631
Di-codon Usage for Gene Classification
NASA Astrophysics Data System (ADS)
Nguyen, Minh N.; Ma, Jianmin; Fogel, Gary B.; Rajapakse, Jagath C.
Classification of genes into biologically related groups facilitates inference of their functions. Codon usage bias has been described previously as a potential feature for gene classification. In this paper, we demonstrate that di-codon usage can further improve classification of genes. By using both codon and di-codon features, we achieve near perfect accuracies for the classification of HLA molecules into major classes and sub-classes. The method is illustrated on 1,841 HLA sequences which are classified into two major classes, HLA-I and HLA-II. Major classes are further classified into sub-groups. A binary SVM using di-codon usage patterns achieved 99.95% accuracy in the classification of HLA genes into major HLA classes; and multi-class SVM achieved accuracy rates of 99.82% and 99.03% for sub-class classification of HLA-I and HLA-II genes, respectively. Furthermore, by combining codon and di-codon usages, the prediction accuracies reached 100%, 99.82%, and 99.84% for HLA major class classification, and for sub-class classification of HLA-I and HLA-II genes, respectively.
Yamada, Yuko; Matsugi, Jitsuhiro; Ishikura, Hisayuki
2003-04-15
The tRNA1Ser (anticodon VGA, V=uridin-5-oxyacetic acid) is essential for translation of the UCA codon in Escherichia coli. Here, we studied the translational abilities of serine tRNA derivatives, which have different bases from wild type at the first positions of their anticodons, using synthetic mRNAs containing the UCN (N=A, G, C, or U) codon. The tRNA1Ser(G34) having the anticodon GGA was able to read not only UCC and UCU codons but also UCA and UCG codons. This means that the formation of G-A or G-G pair allowed at the wobble position and these base pairs are noncanonical. The translational efficiency of the tRNA1Ser(G34) for UCA or UCG codon depends on the 2'-O-methylation of the C32 (Cm). The 2'-O-methylation of C32 may give rise to the space necessary for G-A or G-G base pair formation between the first position of anticodon and the third position of codon.
Benyo, B; Biro, J C; Benyo, Z
2004-01-01
The theory of "codon-amino acid coevolution" was first proposed by Woese in 1967. It suggests that there is a stereochemical matching - that is, affinity - between amino acids and certain of the base triplet sequences that code for those amino acids. We have constructed a common periodic table of codons and amino acids, where the nucleic acid table showed perfect axial symmetry for codons and the corresponding amino acid table also displayed periodicity regarding the biochemical properties (charge and hydrophobicity) of the 20 amino acids and the position of the stop signals. The table indicates that the middle (2/sup nd/) amino acid in the codon has a prominent role in determining some of the structural features of the amino acids. The possibility that physical contact between codons and amino acids might exist was tested on restriction enzymes. Many recognition site-like sequences were found in the coding sequences of these enzymes and as many as 73 examples of codon-amino acid co-location were observed in the 7 known 3D structures (December 2003) of endonuclease-nucleic acid complexes. These results indicate that the smallest possible units of specific nucleic acid-protein interaction are indeed the stereochemically compatible codons and amino acids.
Charles, Hubert; Calevro, Federica; Vinuelas, José; Fayard, Jean-Michel; Rahbe, Yvan
2006-01-01
Codon usage bias and relative abundances of tRNA isoacceptors were analysed in the obligate intracellular symbiotic bacterium, Buchnera aphidicola from the aphid Acyrthosiphon pisum, using a dedicated 35mer oligonucleotide microarray. Buchnera is archetypal of organisms living with minimal metabolic requirements and presents a reduced genome with high-evolutionary rate. Codonusage in Buchnera has been overcome by the high mutational bias towards AT bases. However, several lines of evidence for codon usage selection are given here. A significant correlation was found between tRNA relative abundances and codon composition of Buchnera genes. A significant codon usage bias was found for the choice of rare codons in Buchnera: C-ending codons are preferred in highly expressed genes, whereas G-ending codons are avoided. This bias is not explained by GC skew in the bacteria and might correspond to a selection for perfect matching between codon-anticodon pairs for some essential amino acids in Buchnera proteins. Nutritional stress applied to the aphid host induced a significant overexpression of most of the tRNA isoacceptors in bacteria. Although, molecular regulation of the tRNA operons in Buchnera was not investigated, a correlation between relative expression levels and organization in transcription unit was found in the genome of Buchnera.
Mühlhausen, Stefanie; Findeisen, Peggy; Plessmann, Uwe; Urlaub, Henning; Kollmar, Martin
2016-01-01
The genetic code is the cellular translation table for the conversion of nucleotide sequences into amino acid sequences. Changes to the meaning of sense codons would introduce errors into almost every translated message and are expected to be highly detrimental. However, reassignment of single or multiple codons in mitochondria and nuclear genomes, although extremely rare, demonstrates that the code can evolve. Several models for the mechanism of alteration of nuclear genetic codes have been proposed (including “codon capture,” “genome streamlining,” and “ambiguous intermediate” theories), but with little resolution. Here, we report a novel sense codon reassignment in Pachysolen tannophilus, a yeast related to the Pichiaceae. By generating proteomics data and using tRNA sequence comparisons, we show that Pachysolen translates CUG codons as alanine and not as the more usual leucine. The Pachysolen tRNACAG is an anticodon-mutated tRNAAla containing all major alanine tRNA recognition sites. The polyphyly of the CUG-decoding tRNAs in yeasts is best explained by a tRNA loss driven codon reassignment mechanism. Loss of the CUG-tRNA in the ancient yeast is followed by gradual decrease of respective codons and subsequent codon capture by tRNAs whose anticodon is not part of the aminoacyl-tRNA synthetase recognition region. Our hypothesis applies to all nuclear genetic code alterations and provides several testable predictions. We anticipate more codon reassignments to be uncovered in existing and upcoming genome projects. PMID:27197221
Sablok, Gaurav; Chen, Ting-Wen; Lee, Chi-Ching; Yang, Chi; Gan, Ruei-Chi; Wegrzyn, Jill L; Porta, Nicola L; Nayak, Kinshuk C; Huang, Po-Jung; Varotto, Claudio; Tang, Petrus
2017-06-01
Organelle genomes are widely thought to have arisen from reduction events involving cyanobacterial and archaeal genomes, in the case of chloroplasts, or α-proteobacterial genomes, in the case of mitochondria. Heterogeneity in base composition and codon preference has long been the subject of investigation of topics ranging from phylogenetic distortion to the design of overexpression cassettes for transgenic expression. From the overexpression point of view, it is critical to systematically analyze the codon usage patterns of the organelle genomes. In light of the importance of codon usage patterns in the development of hyper-expression organelle transgenics, we present ChloroMitoCU, the first-ever curated, web-based reference catalog of the codon usage patterns in organelle genomes. ChloroMitoCU contains the pre-compiled codon usage patterns of 328 chloroplast genomes (29,960 CDS) and 3,502 mitochondrial genomes (49,066 CDS), enabling genome-wide exploration and comparative analysis of codon usage patterns across species. ChloroMitoCU allows the phylogenetic comparison of codon usage patterns across organelle genomes, the prediction of codon usage patterns based on user-submitted transcripts or assembled organelle genes, and comparative analysis with the pre-compiled patterns across species of interest. ChloroMitoCU can increase our understanding of the biased patterns of codon usage in organelle genomes across multiple clades. ChloroMitoCU can be accessed at: http://chloromitocu.cgu.edu.tw/. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Efficient Reassignment of a Frequent Serine Codon in Wild-Type Escherichia coli.
Ho, Joanne M; Reynolds, Noah M; Rivera, Keith; Connolly, Morgan; Guo, Li-Tao; Ling, Jiqiang; Pappin, Darryl J; Church, George M; Söll, Dieter
2016-02-19
Expansion of the genetic code through engineering the translation machinery has greatly increased the chemical repertoire of the proteome. This has been accomplished mainly by read-through of UAG or UGA stop codons by the noncanonical aminoacyl-tRNA of choice. While stop codon read-through involves competition with the translation release factors, sense codon reassignment entails competition with a large pool of endogenous tRNAs. We used an engineered pyrrolysyl-tRNA synthetase to incorporate 3-iodo-l-phenylalanine (3-I-Phe) at a number of different serine and leucine codons in wild-type Escherichia coli. Quantitative LC-MS/MS measurements of amino acid incorporation yields carried out in a selected reaction monitoring experiment revealed that the 3-I-Phe abundance at the Ser208AGU codon in superfolder GFP was 65 ± 17%. This method also allowed quantification of other amino acids (serine, 33 ± 17%; phenylalanine, 1 ± 1%; threonine, 1 ± 1%) that compete with 3-I-Phe at both the aminoacylation and decoding steps of translation for incorporation at the same codon position. Reassignments of different serine (AGU, AGC, UCG) and leucine (CUG) codons with the matching tRNA(Pyl) anticodon variants were met with varying success, and our findings provide a guideline for the choice of sense codons to be reassigned. Our results indicate that the 3-iodo-l-phenylalanyl-tRNA synthetase (IFRS)/tRNA(Pyl) pair can efficiently outcompete the cellular machinery to reassign select sense codons in wild-type E. coli.
Renaud, Stéphane; Guerrera, Francesco; Seitlinger, Joseph; Costardi, Lorena; Schaeffer, Mickaël; Romain, Benoit; Mossetti, Claudio; Claire-Voegeli, Anne; Filosso, Pier Luigi; Legrain, Michèle; Ruffini, Enrico; Falcoz, Pierre-Emmanuel; Oliaro, Alberto; Massard, Gilbert
2017-01-01
Introduction The utilization of molecular markers as routinely used biomarkers is steadily increasing. We aimed to evaluate the potential different prognostic values of KRAS exon 2 codons 12 and 13 after lung metastasectomy in colorectal cancer (CRC). Results KRAS codon 12 mutations were observed in 116 patients (77%), whereas codon 13 mutations were observed in 34 patients (23%). KRAS codon 13 mutations were associated with both longer time to pulmonary recurrence (TTPR) (median TTPR: 78 months (95% CI: 50.61–82.56) vs 56 months (95% CI: 68.71–127.51), P = 0.008) and improved overall survival (OS) (median OS: 82 months vs 54 months (95% CI: 48.93–59.07), P = 0.009). Multivariate analysis confirmed that codon 13 mutations were associated with better outcomes (TTPR: HR: 0.40 (95% CI: 0.17–0.93), P = 0.033); OS: HR: 0.39 (95% CI: 0.14–1.07), P = 0.07). Otherwise, no significant difference in OS (P = 0.78) or TTPR (P = 0.72) based on the type of amino-acid substitutions was observed among KRAS codon 12 mutations. Materials and Methods We retrospectively reviewed data from 525 patients who underwent a lung metastasectomy for CRC in two departments of thoracic surgery from 1998 to 2015 and focused on 150 patients that had KRAS exon 2 codon 12/13 mutations. Conclusions KRAS exon 2 codon 13 mutations, compared to codon 12 mutations, seem to be associated with better outcomes following lung metastasectomy in CRC. Prospective multicenter studies are necessary to fully understand the prognostic value of KRAS mutations in the lung metastases of CRC. PMID:27911859
Behura, Susanta K.; Severson, David W.
2014-01-01
The mosquito Aedes aegypti is the primary vector of dengue virus (DENV) infection in most of the subtropical and tropical countries. Besides DENV, yellow fever virus (YFV) is also transmitted by A. aegypti. Susceptibility of A. aegypti to West Nile virus (WNV) has also been confirmed. Although studies have indicated correlation of codon bias between flaviviridae and their animal/insect hosts, it is not clear if codon sequences have any relation to susceptibility of A. aegypti to DENV, YFV and WNV. In the current study, usages of codon context sequences (codon pairs for neighboring amino acids) of the vector (A. aegypti) genome as well as the flaviviral genomes are investigated. We used bioinformatics methods to quantify codon context bias in a genome-wide manner of A. aegypti as well as DENV, WNV and YFV sequences. Mutual information statistics was applied to perform bicluster analysis of codon context bias between vector and flaviviral sequences. Functional relevance of the bicluster pattern was inferred from published microarray data. Our study shows that codon context bias of DENV, WNV and YFV sequences varies in a bicluster manner with that of specific sets of genes of A. aegypti. Many of these mosquito genes are known to be differentially expressed in response to flaviviral infection suggesting that codon context sequences of A. aegypti and the flaviviruses may play a role in the susceptible interaction between flaviviruses and this mosquito. The bias inusages of codon context sequences likely has a functional association with susceptibility of A. aegypti to flaviviral infection. The results from this study will allow us to conduct hypothesis driven tests to examine the role of codon contexts bias in evolution of vector-virus interactions at the molecular level. PMID:24838953
Wang, Weixia; Guo, Qinglan; Xu, Xiaogang; Sheng, Zi-ke; Ye, Xinyu; Wang, Minggui
2014-11-01
Efflux is the most common mechanism of tetracycline resistance. Class A tetracycline efflux pumps, which often have high prevalence in Enterobacteriaceae, are encoded by tet(A) and tet(A)-1 genes. These genes have two potential start codons, GTG and ATG, located upstream of the genes. The purpose of this study was to determine the start codon(s) of the class A tetracycline resistance (tet) determinants tet(A) and tet(A)-1, and the tetracycline resistance level they mediated. Conjugation, transformation and cloning experiments were performed and the genetic environment of tet(A)-1 was analysed. The start codons in class A tet determinants were investigated by site-directed mutagenesis of ATG and GTG, the putative translation initiation codons. High-level tetracycline resistance was transferred from the clinical strain of Klebsiella pneumoniae 10-148 containing tet(A)-1 plasmid pHS27 to Escherichia coli J53 by conjugation. The transformants harbouring recombinant plasmids that carried tet(A) or tet(A)-1 exhibited tetracycline MICs of 256-512 µg ml(-1), with or without tetR(A). Once the ATG was mutated to a non-start codon, the tetracycline MICs were not changed, while the tetracycline MICs decreased from 512 to 64 µg ml(-1) following GTG mutation, and to ≤4 µg ml(-1) following mutation of both GTG and ATG. It was presumed that class A tet determinants had two start codons, which are the primary start codon GTG and secondary start codon ATG. Accordingly, two putative promoters were predicted. In conclusion, class A tet determinants can confer high-level tetracycline resistance and have two start codons. © 2014 The Authors.
Strauss, E G; Levinson, R; Rice, C M; Dalrymple, J; Strauss, J H
1988-05-01
We have sequenced the nsP3 and nsP4 region of two alphaviruses, Ross River virus and O'Nyong-nyong virus, in order to examine these viruses for the presence or absence of an opal termination codon present between nsP3 and nsP4 in many alphaviruses. We found that Ross River virus possesses an in-phase opal termination codon between nsP3 and nsP4, whereas in O'Nyong-nyong virus this termination codon is replaced by an arginine codon. Previous studies have shown that two other alphaviruses, Sindbis virus and Middelburg virus, possess an opal termination codon separating nsP3 and nsP4 [E.G. Strauss, C.M. Rice, and J.H. Strauss (1983), Proc. Natl. Acad. Sci. USA 80, 5271-5275], whereas Semliki Forest virus possesses an arginine codon in lieu of the opal codon [K. Takkinen (1986), Nucleic Acids Res. 14, 5667-5682]. Thus, of the five alphaviruses examined to date, three possess the opal codon and two do not. Production of nsP4 requires readthrough of the opal codon in those alphaviruses that possess this termination codon and the function of the termination codon may be to regulate the amount of nsP4 produced. It is an open question then as to whether alphaviruses with no termination codon use other mechanisms to regulate the activity of this gene. The nsP4s of these five alphaviruses are highly conserved, sharing 71-76% amino acid sequence similarity, and all five contain the Gly-Asp-Asp motif found in many RNA virus replicases. The nsP3s are somewhat less conserved, sharing 52-73% amino acid sequence similarity throughout most of the protein, but each possesses a nonconserved C-terminal domain of 134 to 246 amino acids of unknown function.
Mutation-Specific RAS Oncogenicity Explains N-RAS Codon 61 Selection in Melanoma
Burd, Christin E.; Liu, Wenjin; Huynh, Minh V.; Waqas, Meriam A.; Gillahan, James E.; Clark, Kelly S.; Fu, Kailing; Martin, Brit L.; Jeck, William R.; Souroullas, George P.; Darr, David B.; Zedek, Daniel C.; Miley, Michael J.; Baguley, Bruce C.; Campbell, Sharon L.
2014-01-01
N-RAS mutation at codon 12, 13 or 61 is associated with transformation; yet, in melanoma, such alterations are nearly exclusive to codon 61. Here, we compared the melanoma susceptibility of an N-RasQ61R knock-in allele to similarly designed K-RasG12D and N-RasG12D alleles. With concomitant p16INK4a inactivation, K-RasG12D or N-RasQ61R expression efficiently promoted melanoma in vivo, whereas N-RasG12D did not. Additionally, N-RasQ61R mutation potently cooperated with Lkb1/Stk11 loss to induce highly metastatic disease. Functional comparisons of N-RasQ61R and N-RasG12D revealed little difference in the ability of these proteins to engage PI3K or RAF. Instead, N-RasQ61R showed enhanced nucleotide binding, decreased intrinsic GTPase activity and increased stability when compared to N-RasG12D. This work identifies a faithful model of human N-RAS mutant melanoma, and suggests that the increased melanomagenecity of N-RasQ61R over N-RasG12D is due to heightened abundance of the active, GTP-bound form rather than differences in the engagement of downstream effector pathways. PMID:25252692
Du, Qian; Yang, Xiangdong; Zhang, Jinhua; Zhong, Xiaofang; Kim, Kyung Seok; Yang, Jing; Xing, Guojie; Li, Xiaoyu; Jiang, Zhaoyuan; Li, Qiyun; Dong, Yingshan; Pan, Hongyu
2018-06-01
Phytophthora root and stem rot (PRR) caused by Phytophthora sojae is one of the most devastating diseases reducing soybean (Glycine max) production all over the world. Harpin proteins in many plant pathogenic bacteria were confirmed to enhance disease and insect resistance in crop plants. Here, a harpin protein-encoding gene hrpZpsta from the P. syringae pv. tabaci strain Psta218 was codon-optimized (renamed hrpZm) and introduced into soybean cultivars Williams 82 and Shennong 9 by Agrobacterium-mediated transformation. Three independent transgenic lines over-expressing hrpZm were obtained and exhibited stable and enhanced tolerance to P. sojae infection in T 2 -T 4 generations compared to the non-transformed (NT) and empty vector (EV)-transformed plants. Quantitative real-time PCR (qRT-PCR) analysis revealed that the expression of salicylic acid-dependent genes PR1, PR12, and PAL, jasmonic acid-dependent gene PPO, and hypersensitive response (HR)-related genes GmNPR1 and RAR was significantly up-regulated after P. sojae inoculation. Moreover, the activities of defense-related enzymes such as phenylalanine ammonia lyase (PAL), polyphenoloxidase (PPO), peroxidase, and superoxide dismutase also increased significantly in the transgenic lines compared to the NT and EV-transformed plants after inoculation. Our results suggest that over-expression of the hrpZm gene significantly enhances PRR tolerance in soybean by eliciting resistance responses mediated by multiple defense signaling pathways, thus providing an alternative approach for development of soybean varieties with improved tolerance against the soil-borne pathogen PRR.
Stringent Nucleotide Recognition by the Ribosome at the Middle Codon Position
Liu, Wei; Shin, Dongwon; Ng, Martin; Sanbonmatsu, Karissa Y.; Tor, Yitzhak; Cooperman, Barry S.
2017-01-01
Accurate translation of the genetic code depends on mRNA:tRNA codon:anticodon base pairing. Here we exploit an emissive, isosteric adenosine surrogate that allows direct measurement of the kinetics of codon:anticodon base formation during protein synthesis. Our results suggest that codon:anticodon base pairing is subject to tighter constraints at the middle position than at the 5′- and 3′-positions, and further suggest a sequential mechanism of formation of the three base pairs in the codon:anticodon helix. PMID:28850078
Khrustalev, Vladislav Victorovich
2009-01-01
Guanine is the most mutable nucleotide in HIV genes because of frequently occurring G to A transitions, which are caused by cytosine deamination in viral DNA minus strands catalyzed by APOBEC enzymes. Distribution of guanine between three codon positions should influence the probability for G to A mutation to be nonsynonymous (to occur in first or second codon position). We discovered that nucleotide sequences of env genes coding for third variable regions (V3 loops) of gp120 from HIV1 and HIV2 have different kinds of guanine usage biases. In the HIV1 reference strain and 100 additionally analyzed HIV1 strains the guanine usage bias in V3 loop coding regions (2G>1G>3G) should lead to elevated nonsynonymous G to A transitions occurrence rates. In the HIV2 reference strain and 100 other HIV2 strains guanine usage bias in V3 loop coding regions (3G>2G>1G) should protect V3 loops from hypermutability. According to the HIV1 and HIV2 V3 alignment, insertion of the sequence enriched with 2G (21 codons in length) occurred during the evolution of HIV1 predecessor, while insertion of the different sequence enriched with 3G (19 codons in length) occurred during the evolution of HIV2 predecessor. The higher is the level of 3G in the V3 coding region, the lower should be the immune escaping mutation occurrence rates. This hypothesis was tested in this study by comparing the guanine usage in V3 loop coding regions from HIV1 fast and slow progressors. All calculations have been performed by our algorithms "VVK In length", "VVK Dinucleotides" and "VVK Consensus" (www.barkovsky.hotmail.ru).
Mühlhausen, Stefanie; Findeisen, Peggy; Plessmann, Uwe; Urlaub, Henning; Kollmar, Martin
2016-07-01
The genetic code is the cellular translation table for the conversion of nucleotide sequences into amino acid sequences. Changes to the meaning of sense codons would introduce errors into almost every translated message and are expected to be highly detrimental. However, reassignment of single or multiple codons in mitochondria and nuclear genomes, although extremely rare, demonstrates that the code can evolve. Several models for the mechanism of alteration of nuclear genetic codes have been proposed (including "codon capture," "genome streamlining," and "ambiguous intermediate" theories), but with little resolution. Here, we report a novel sense codon reassignment in Pachysolen tannophilus, a yeast related to the Pichiaceae. By generating proteomics data and using tRNA sequence comparisons, we show that Pachysolen translates CUG codons as alanine and not as the more usual leucine. The Pachysolen tRNACAG is an anticodon-mutated tRNA(Ala) containing all major alanine tRNA recognition sites. The polyphyly of the CUG-decoding tRNAs in yeasts is best explained by a tRNA loss driven codon reassignment mechanism. Loss of the CUG-tRNA in the ancient yeast is followed by gradual decrease of respective codons and subsequent codon capture by tRNAs whose anticodon is not part of the aminoacyl-tRNA synthetase recognition region. Our hypothesis applies to all nuclear genetic code alterations and provides several testable predictions. We anticipate more codon reassignments to be uncovered in existing and upcoming genome projects. © 2016 Mühlhausen et al.; Published by Cold Spring Harbor Laboratory Press.
A Major Controversy in Codon-Anticodon Adaptation Resolved by a New Codon Usage Index
Xia, Xuhua
2015-01-01
Two alternative hypotheses attribute different benefits to codon-anticodon adaptation. The first assumes that protein production is rate limited by both initiation and elongation and that codon-anticodon adaptation would result in higher elongation efficiency and more efficient and accurate protein production, especially for highly expressed genes. The second claims that protein production is rate limited only by initiation efficiency but that improved codon adaptation and, consequently, increased elongation efficiency have the benefit of increasing ribosomal availability for global translation. To test these hypotheses, a recent study engineered a synthetic library of 154 genes, all encoding the same protein but differing in degrees of codon adaptation, to quantify the effect of differential codon adaptation on protein production in Escherichia coli. The surprising conclusion that “codon bias did not correlate with gene expression” and that “translation initiation, not elongation, is rate-limiting for gene expression” contradicts the conclusion reached by many other empirical studies. In this paper, I resolve the contradiction by reanalyzing the data from the 154 sequences. I demonstrate that translation elongation accounts for about 17% of total variation in protein production and that the previous conclusion is due to the use of a codon adaptation index (CAI) that does not account for the mutation bias in characterizing codon adaptation. The effect of translation elongation becomes undetectable only when translation initiation is unrealistically slow. A new index of translation elongation ITE is formulated to facilitate studies on the efficiency and evolution of the translation machinery. PMID:25480780
Song, Jiangning; Wang, Minglei; Burrage, Kevin
2006-07-21
High-quality data about protein structures and their gene sequences are essential to the understanding of the relationship between protein folding and protein coding sequences. Firstly we constructed the EcoPDB database, which is a high-quality database of Escherichia coli genes and their corresponding PDB structures. Based on EcoPDB, we presented a novel approach based on information theory to investigate the correlation between cysteine synonymous codon usages and local amino acids flanking cysteines, the correlation between cysteine synonymous codon usages and synonymous codon usages of local amino acids flanking cysteines, as well as the correlation between cysteine synonymous codon usages and the disulfide bonding states of cysteines in the E. coli genome. The results indicate that the nearest neighboring residues and their synonymous codons of the C-terminus have the greatest influence on the usages of the synonymous codons of cysteines and the usage of the synonymous codons has a specific correlation with the disulfide bond formation of cysteines in proteins. The correlations may result from the regulation mechanism of protein structures at gene sequence level and reflect the biological function restriction that cysteines pair to form disulfide bonds. The results may also be helpful in identifying residues that are important for synonymous codon selection of cysteines to introduce disulfide bridges in protein engineering and molecular biology. The approach presented in this paper can also be utilized as a complementary computational method and be applicable to analyse the synonymous codon usages in other model organisms.
Characterization of codon usage pattern and influencing factors in Japanese encephalitis virus.
Singh, Niraj K; Tyagi, Anuj; Kaur, Rajinder; Verma, Ramneek; Gupta, Praveen K
2016-08-02
Recently, several outbreaks of Japanese encephalitis (JE), caused by Japanese encephalitis virus (JEV), have been reported and it has become cause of concern across the world. In this study, detailed analysis of JEV codon usage pattern was performed. The relative synonymous codon usage (RSCU) values along with mean effective number of codons (ENC) value of 55.30 indicated the presence of low codon usages bias in JEV. The effect of mutational pressure on codon usage bias was confirmed by significant correlations of A3s, U3s, G3s, C3s, GC3s, ENC values, with overall nucleotide contents (A%, U%, G%, C%, and GC%). The correlation analysis of A3s, U3s, G3s, C3s, GC3s, with axis values of correspondence analysis (CoA) further confirmed the role of mutational pressure. However, the correlation analysis of Gravy values and Aroma values with A3s, U3s, G3s, C3s, and GC3s, indicated the presence of natural selection on codon usage bias in addition to mutational pressure. The natural selection was further confirmed by codon adaptation index (CAI) analysis. Additionally, relative dinucleotide frequencies, geographical distribution, and evolutionary processes also influenced the codon usage pattern to some extent. Copyright © 2016 Elsevier B.V. All rights reserved.
Birikh, K R; Lebedenko, E N; Boni, I V; Berlin, Y A
1995-10-27
Synthetic intronless genes, coding for human interleukin 1 alpha (IL 1 alpha) and interleukin 1 receptor antagonist (IL1ra), have been expressed efficiently in a specially designed prokaryotic vector, pGMCE (a pGEM1 derivative), where the target gene forms the second part of a two-cistron system. The first part of the system is a translation enhancer-containing mini-cistron, whose termination codon overlaps the start codon of the target gene. In the case of the IL1 alpha gene, the high expression level is largely due to the direct efficient translation initiation at the second cistron, whereas with the IL1ra gene in the same system, the proximal translation initiation region (TIR) provides a high level of coupled expression of the target gene. Thus, pGMCE is a potentially versatile vector for direct prokaryotic expression.
Ribosomes slide on lysine-encoding homopolymeric A stretches
Koutmou, Kristin S; Schuller, Anthony P; Brunelle, Julie L; Radhakrishnan, Aditya; Djuranovic, Sergej; Green, Rachel
2015-01-01
Protein output from synonymous codons is thought to be equivalent if appropriate tRNAs are sufficiently abundant. Here we show that mRNAs encoding iterated lysine codons, AAA or AAG, differentially impact protein synthesis: insertion of iterated AAA codons into an ORF diminishes protein expression more than insertion of synonymous AAG codons. Kinetic studies in E. coli reveal that differential protein production results from pausing on consecutive AAA-lysines followed by ribosome sliding on homopolymeric A sequence. Translation in a cell-free expression system demonstrates that diminished output from AAA-codon-containing reporters results from premature translation termination on out of frame stop codons following ribosome sliding. In eukaryotes, these premature termination events target the mRNAs for Nonsense-Mediated-Decay (NMD). The finding that ribosomes slide on homopolymeric A sequences explains bioinformatic analyses indicating that consecutive AAA codons are under-represented in gene-coding sequences. Ribosome ‘sliding’ represents an unexpected type of ribosome movement possible during translation. DOI: http://dx.doi.org/10.7554/eLife.05534.001 PMID:25695637
Cladel, Nancy M.; Budgeon, Lynn R.; Hu, Jiafen; Balogh, Karla K.; Christensen, Neil D.
2013-01-01
Papillomaviruses use rare codons with respect to the host. The reasons for this are incompletely understood but among the hypotheses is the concept that rare codons result in low protein production and this allows the virus to escape immune surveillance. We changed rare codons in the oncogenes E6 and E7 of the cottontail rabbit papillomavirus to make them more mammalian-like and tested the mutant genomes in our in vivo animal model. While the amino acid sequences of the proteins remained unchanged, the oncogenic potential of some of the altered genomes increased dramatically. In addition, increased immunogenicity, as measured by spontaneous regression, was observed as the numbers of codon changes increased. This work suggests that codon usage may modify protein production in ways that influence disease outcome and that evaluation of synonymous codons should be included in the analysis of genetic variants of infectious agents and their association with disease. PMID:23433866
Akbari, Fariba; Eskandani, Morteza; Khosroushahi, Ahmad Yari
2014-11-01
Microalgae have been used in food, cosmetic, and biofuel industries as a natural source of lipids, vitamins, pigments and antioxidants for a long time. Green microalgae, as potent photobioreactors, can be considered as an economical expression system to produce recombinant therapeutical proteins at large-scale due to low cost of production and scaling-up capitalization owning to the inexpensive medium requirement, fast growth rate, and the ease of manipulation. These microalgae possess all benefit eukaryotic expression systems including the ability of post-translational modifications required for proper folding and stability of active proteins. Among the many items regarded as recombinant protein production, this review compares the different expression systems with green microalgae like Dunaliella by viewing the nuclear/chloroplast transformation challenges/benefits, related selection markers/reporter genes, and crucial factors/strategies affecting the increase of foreign protein expression in microalgae transformants. Some important factors were discussed regarding the increase of protein yielding in microalgae transformants including: transformation-associated genotypic modifications, endogenous regulatory factors, promoters, codon optimization, enhancer elements, and milking of recombinant protein.
Zucchelli, Silvia; Patrucco, Laura; Persichetti, Francesca; Gustincich, Stefano; Cotella, Diego
2016-01-01
Mammalian cells are an indispensable tool for the production of recombinant proteins in contexts where function depends on post-translational modifications. Among them, Chinese Hamster Ovary (CHO) cells are the primary factories for the production of therapeutic proteins, including monoclonal antibodies (MAbs). To improve expression and stability, several methodologies have been adopted, including methods based on media formulation, selective pressure and cell- or vector engineering. This review presents current approaches aimed at improving mammalian cell factories that are based on the enhancement of translation. Among well-established techniques (codon optimization and improvement of mRNA secondary structure), we describe SINEUPs, a family of antisense long non-coding RNAs that are able to increase translation of partially overlapping protein-coding mRNAs. By exploiting their modular structure, SINEUP molecules can be designed to target virtually any mRNA of interest, and thus to increase the production of secreted proteins. Thus, synthetic SINEUPs represent a new versatile tool to improve the production of secreted proteins in biomanufacturing processes.
Expression of Recombinant Phosphoproteins for Signal Transduction Studies.
Barber, Karl W; Rinehart, Jesse
2017-01-01
Complex signaling cascades are difficult to study in vitro without phosphorylated proteins. Here, we describe a technique for the routine production of recombinant phosphoproteins by directly incorporating phosphoserine as a nonstandard amino acid. This protocol utilizes an optimized phosphoserine orthogonal translation system and an engineered strain of E. coli containing no genomic amber codons. This approach has been used to generate a variety of phosphorylated proteins to understand the role of protein phosphorylation in cell signaling.
2016-02-11
the White- head Genome Technology Core for sequencing . This work was supported by the UCSF Program for Breakthrough Biomedical Research (funded in...landscape of the yeast genome defined by RNA sequencing . Science 320, 1344–1349. Nedialkova, D.D., and Leidel, S.A. (2015). Optimization of Codon Translation... the CC BY license (http://creativecommons.org/licenses/by/4.0/). SUMMARY Ribosome-footprint profiling provides genome -wide snapshots of translation
Management of hyperparathyroidism (PHP) in MEN2 syndromes in Europe.
Alevizaki, Maria
2013-03-14
Hyperparathyroidism occurs in 20-30% of MEN2A syndrome patients. It is usually associated with mild disease and is frequently asymptomatic, especially in younger age. There is genotype/phenotype association and PHP is usually associated with codon 634 mutations; however association with more "rare" mutations has also been reported. The pathology of the parathyroid glands includes hyperplasia, adenoma or a combination of the two. The optimal surgical management of this entity has not been defined yet.
Ranjan, Bibhuti; Satyanarayana, T
2016-02-01
The codon-optimized phytase gene of the thermophilic mold Sporotrichum thermophile (St-Phy) was expressed in Pichia pastoris. The recombinant P. pastoris harboring the phytase gene (rSt-Phy) yielded a high titer of extracellular phytase (480 ± 23 U/mL) on induction with methanol. The recombinant phytase production was ~40-fold higher than that of the native fungal strain. The purified recombinant phytase (rSt-Phy) has the molecular mass of 70 kDa on SDS-PAGE, with K m and V max (calcium phytate), k cat and k cat/K m values of 0.147 mM and 183 nmol/mg s, 1.3 × 10(3)/s and 8.84 × 10(6)/M s, respectively. Mg(2+) and Ba(2+) display a slight stimulatory effect, while other cations tested exert inhibitory action on phytase. The enzyme is inhibited by chaotropic agents (guanidinium hydrochloride, potassium iodide, and urea), Woodward's reagent K and 2,3-bunatedione, but resistant to both pepsin and trypsin. The rSt-Phy is useful in the dephytinization of broiler feeds efficiently in simulated gut conditions of chick leading to the liberation of soluble inorganic phosphate with concomitant mitigation in antinutrient effects of phytates. The addition of vanadate makes it a potential candidate for generating haloperoxidase, which has several applications.
Porting the synthetic D-glucaric acid pathway from Escherichia coli to Saccharomyces cerevisiae.
Gupta, Amita; Hicks, Michael A; Manchester, Shawn P; Prather, Kristala L J
2016-09-01
D-Glucaric acid can be produced as a value-added chemical from biomass through a de novo pathway in Escherichia coli. However, previous studies have identified pH-mediated toxicity at product concentrations of 5 g/L and have also found the eukaryotic myo-inositol oxygenase (MIOX) enzyme to be rate-limiting. We ported this pathway to Saccaromyces cerevisiae, which is naturally acid-tolerant and evaluate a codon-optimized MIOX homologue. We constructed two engineered yeast strains that were distinguished solely by their MIOX gene - either the previous version from Mus musculus or a homologue from Arabidopsis thaliana codon-optimized for expression in S. cerevisiae - in order to identify the rate-limiting steps for D-glucaric acid production both from a fermentative and non-fermentative carbon source. myo-Inositol availability was found to be rate-limiting from glucose in both strains and demonstrated to be dependent on growth rate, whereas the previously used M. musculus MIOX activity was found to be rate-limiting from glycerol. Maximum titers were 0.56 g/L from glucose in batch mode, 0.98 g/L from glucose in fed-batch mode, and 1.6 g/L from glucose supplemented with myo-inositol. Future work focusing on the MIOX enzyme, the interplay between growth and production modes, and promoting aerobic respiration should further improve this pathway. Copyright © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Shen, Xin-Ming; Okuno, Tatsuya; Milone, Margherita; Otsuka, Kenji; Takahashi, Koji; Komaki, Hirofumi; Giles, Elizabeth; Ohno, Kinji; Engel, Andrew G
2016-10-01
We identify two novel mutations in acetylcholine receptor (AChR) causing a slow-channel congenital myasthenia syndrome (CMS) in three unrelated patients (Pts). Pt 1 harbors a heterozygous βV266A mutation (p.Val289Ala) in the second transmembrane domain (M2) of the AChR β subunit (CHRNB1). Pts 2 and 3 carry the same mutation at an equivalent site in the ε subunit (CHRNE), εV265A (p.Val285Ala). The mutant residues are conserved across all AChR subunits of all species and are components of a valine ring in the channel pore, which is positioned four residues above the leucine ring. Both βV266A and εV265A reduce the amino acid size and lengthen the channel opening bursts by fourfold by enhancing gating efficiency by approximately 30-fold. Substitution of alanine for valine at the corresponding position in the δ and α subunit prolongs the burst duration four- and eightfold, respectively. Replacing valine at ε codon 265 either by a still smaller glycine or by a larger leucine also lengthens the burst duration. Our analysis reveals that each valine in the valine ring contributes to channel kinetics equally, and the valine ring has been optimized in the course of evolution to govern channel gating. © 2016 WILEY PERIODICALS, INC.
Shen, Xin-Ming; Okuno, Tatsuya; Milone, Margherita; Otsuka, Kenji; Takahashi, Koji; Komaki, Hirofumi; Giles, Elizabeth; Ohno, Kinji; Engel, Andrew G.
2016-01-01
We identify two novel mutations in acetylcholine receptor (AChR) causing a slow-channel congenital myasthenia syndrome (CMS) in three unrelated patients (Pts). Pt 1 harbors a heterozygous βV266A mutation (p.Val289Ala) in the second transmembrane domain (M2) of the AChR β subunit (CHRNB1). Pts 2 and 3 carry the same mutation at an equivalent site in the ε subunit (CHRNE), εV265A (p.Val285Ala). The mutant residues are conserved across all AChR subunits of all species and are components of a valine ring in the channel pore which is positioned four residues above the leucine ring. Both βV266A and εV265A reduce the amino acid size and lengthen the channel opening bursts by 4.0-fold by enhancing gating efficiency by approximately 30-fold. Substitution of alanine for valine at the corresponding position in the δ and α subunit prolongs the burst duration 4- and 8-fold, respectively. Replacing valine at ε codon 265 either by a still smaller glycine or by a larger leucine also lengthens the burst duration. Our analysis reveals that each valine in the valine ring contributes to channel kinetics equally, and the valine ring has been optimized in the course of evolution to govern channel gating. PMID:27375219
Kakuta, Yoichi; Kawai, Yosuke; Okamoto, Daisuke; Takagawa, Tetsuya; Ikeya, Kentaro; Sakuraba, Hirotake; Nishida, Atsushi; Nakagawa, Shoko; Miura, Miki; Toyonaga, Takahiko; Onodera, Kei; Shinozaki, Masaru; Ishiguro, Yoh; Mizuno, Shinta; Takahara, Masahiro; Yanai, Shunichi; Hokari, Ryota; Nakagawa, Tomoo; Araki, Hiroshi; Motoya, Satoshi; Naito, Takeo; Moroi, Rintaro; Shiga, Hisashi; Endo, Katsuya; Kobayashi, Taku; Naganuma, Makoto; Hiraoka, Sakiko; Matsumoto, Takayuki; Nakamura, Shiro; Nakase, Hiroshi; Hisamatsu, Tadakazu; Sasaki, Makoto; Hanai, Hiroyuki; Andoh, Akira; Nagasaki, Masao; Kinouchi, Yoshitaka; Shimosegawa, Tooru; Masamune, Atsushi; Suzuki, Yasuo
2018-06-19
Despite NUDT15 variants showing significant association with thiopurine-induced adverse events (AEs) in Asians, it remains unclear which variants of NUDT15 or whether additional genetic variants should be tested to predict AEs. To clarify the best pharmacogenetic test to be used clinically, we performed association studies of NUDT15 variants and haplotypes with AEs, genome-wide association study (GWAS) to discover additional variants, and ROC analysis to select the model to predict severe AEs. Overall, 2630 patients with inflammatory bowel disease (IBD) were enrolled and genotyped for NUDT15 codon 139; 1291 patients were treated with thiopurines. diplotypes were analyzed in 970 patients, and GWASs of AEs were performed with 1221 patients using population-optimized genotyping array and imputation. We confirmed the association of NUDT15 p.Arg139Cys with leukopenia and alopecia (p = 2.20E-63, 1.32E-69, OR = 6.59, 12.1, respectively), and found a novel association with digestive symptoms (p = 6.39E-04, OR = 1.89). Time to leukopenia was significantly shorter, and when leukopenia was diagnosed, thiopurine doses were significantly lower in Arg/Cys and Cys/Cys than in Arg/Arg. In GWASs, no additional variants were found to be associated with thiopurine-induced AEs. Despite strong correlation of leukopenia frequency with estimated enzyme activities based on the diplotypes (r 2 = 0.926, p = 0.0087), there were no significant differences in the AUCs of diplotypes from those of codon 139 to predict severe AEs (AUC = 0.916, 0.921, for acute severe leukopenia, AUC = 0.990, 0.991, for severe alopecia, respectively). Genotyping of NUDT15 codon 139 was sufficient to predict acute severe leukopenia and alopecia in Japanese patients with IBD.
Boonyawat, Boonchai; Monsereenusorn, Chalinee; Traivaree, Chanchai
2014-01-01
Background Beta-thalassemia is one of the most common genetic disorders in Thailand. Clinical phenotype ranges from silent carrier to clinically manifested conditions including severe beta-thalassemia major and mild beta-thalassemia intermedia. Objective This study aimed to characterize the spectrum of beta-globin gene mutations in pediatric patients who were followed-up in Phramongkutklao Hospital. Patients and methods Eighty unrelated beta-thalassemia patients were enrolled in this study including 57 with beta-thalassemia/hemoglobin E, eight with homozygous beta-thalassemia, and 15 with heterozygous beta-thalassemia. Mutation analysis was performed by multiplex amplification refractory mutation system (M-ARMS), direct DNA sequencing of beta-globin gene, and gap polymerase chain reaction for 3.4 kb deletion detection, respectively. Results A total of 13 different beta-thalassemia mutations were identified among 88 alleles. The most common mutation was codon 41/42 (-TCTT) (37.5%), followed by codon 17 (A>T) (26.1%), IVS-I-5 (G>C) (8%), IVS-II-654 (C>T) (6.8%), IVS-I-1 (G>T) (4.5%), and codon 71/72 (+A) (2.3%), and all these six common mutations (85.2%) were detected by M-ARMS. Six uncommon mutations (10.2%) were identified by DNA sequencing including 4.5% for codon 35 (C>A) and 1.1% initiation codon mutation (ATG>AGG), codon 15 (G>A), codon 19 (A>G), codon 27/28 (+C), and codon 123/124/125 (-ACCCCACC), respectively. The 3.4 kb deletion was detected at 4.5%. The most common genotype of beta-thalassemia major patients was codon 41/42 (-TCTT)/codon 26 (G>A) or betaE accounting for 40%. Conclusion All of the beta-thalassemia alleles have been characterized by a combination of techniques including M-ARMS, DNA sequencing, and gap polymerase chain reaction for 3.4 kb deletion detection. Thirteen mutations account for 100% of the beta-thalassemia genes among the pediatric patients in our study. PMID:25525381
Sonawane, Kailas D; Kamble, Asmita S; Fandilolu, Prayagraj M
2017-12-27
Deficiency of 5-taurinomethyl-2-thiouridine, τm 5 s 2 U at the 34th 'wobble' position in tRNA Lys causes MERRF (Myoclonic Epilepsy with Ragged Red Fibers), a neuromuscular disease. This modified nucleoside of mt tRNA Lys , recognizes AAA/AAG codons during protein biosynthesis process. Its preference to identify cognate codons has not been studied at the atomic level. Hence, multiple MD simulations of various molecular models of anticodon stem loop (ASL) of mt tRNA Lys in presence and absence of τm 5 s 2 U 34 and N 6 -threonylcarbamoyl adenosine (t 6 A 37 ) along with AAA and AAG codons have been accomplished. Additional four MD simulations of multiple ASL mt tRNA Lys models in the context of ribosomal A-site residues have also been performed to investigate the role of A-site in recognition of AAA/AAG codons. MD simulation results show that, ASL models in presence of τm 5 s 2 U 34 and t 6 A 37 with codons AAA/AAG are more stable than the ASL lacking these modified bases. MD trajectories suggest that τm 5 s 2 U recognizes the codons initially by 'wobble' hydrogen bonding interactions, and then tRNA Lys might leave the explicit codon by a novel 'single' hydrogen bonding interaction in order to run the protein biosynthesis process smoothly. We propose this model as the 'Foot-Step Model' for codon recognition, in which the single hydrogen bond plays a crucial role. MD simulation results suggest that, tRNA Lys with τm 5 s 2 U and t 6 A recognizes AAA codon more preferably than AAG. Thus, these results reveal the consequences of τm 5 s 2 U and t 6 A in recognition of AAA/AAG codons in mitochondrial disease, MERRF.
Wohlin, Åsa
2015-03-21
The distribution of codons in the nearly universal genetic code is a long discussed issue. At the atomic level, the numeral series 2x(2) (x=5-0) lies behind electron shells and orbitals. Numeral series appear in formulas for spectral lines of hydrogen. The question here was if some similar scheme could be found in the genetic code. A table of 24 codons was constructed (synonyms counted as one) for 20 amino acids, four of which have two different codons. An atomic mass analysis was performed, built on common isotopes. It was found that a numeral series 5 to 0 with exponent 2/3 times 10(2) revealed detailed congruency with codon-grouped amino acid side-chains, simultaneously with the division on atom kinds, further with main 3rd base groups, backbone chains and with codon-grouped amino acids in relation to their origin from glycolysis or the citrate cycle. Hence, it is proposed that this series in a dynamic way may have guided the selection of amino acids into codon domains. Series with simpler exponents also showed noteworthy correlations with the atomic mass distribution on main codon domains; especially the 2x(2)-series times a factor 16 appeared as a conceivable underlying level, both for the atomic mass and charge distribution. Furthermore, it was found that atomic mass transformations between numeral systems, possibly interpretable as dimension degree steps, connected the atomic mass of codon bases with codon-grouped amino acids and with the exponent 2/3-series in several astonishing ways. Thus, it is suggested that they may be part of a deeper reference system. Copyright © 2015 The Author. Published by Elsevier Ltd.. All rights reserved.
Montealegre, Maria Camila; La Rosa, Sabina Leanti; Roh, Jung Hyeob; Harvey, Barrett R.
2015-01-01
ABSTRACT The endocarditis and biofilm-associated pili (Ebp) are important in Enterococcus faecalis pathogenesis, and the pilus tip, EbpA, has been shown to play a major role in pilus biogenesis, biofilm formation, and experimental infections. Based on in silico analyses, we previously predicted that ATT is the EbpA translational start codon, not the ATG codon, 120 bp downstream of ATT, which is annotated as the translational start. ATT is rarely used to initiate protein synthesis, leading to our hypothesis that this codon participates in translational regulation of Ebp production. To investigate this possibility, site-directed mutagenesis was used to introduce consecutive stop codons in place of two lysines at positions 5 and 6 from the ATT, to replace the ATT codon in situ with ATG, and then to revert this ATG to ATT; translational fusions of ebpA to lacZ were also constructed to investigate the effect of these start codons on translation. Our results showed that the annotated ATG does not start translation of EbpA, implicating ATT as the start codon; moreover, the presence of ATT, compared to the engineered ATG, resulted in significantly decreased EbpA surface display, attenuated biofilm, and reduced adherence to fibrinogen. Corroborating these findings, the translational fusion with the native ATT as the initiation codon showed significantly decreased expression of β-galactosidase compared to the construct with ATG in place of ATT. Thus, these results demonstrate that the rare initiation codon of EbpA negatively regulates EbpA surface display and negatively affects Ebp-associated functions, including biofilm and adherence to fibrinogen. PMID:26015496
Romero, H; Zavala, A; Musto, H
2000-01-25
It is widely accepted that the compositional pressure is the only factor shaping codon usage in unicellular species displaying extremely biased genomic compositions. This seems to be the case in the prokaryotes Mycoplasma capricolum, Rickettsia prowasekii and Borrelia burgdorferi (GC-poor), and in Micrococcus luteus (GC-rich). However, in the GC-poor unicellular eukaryotes Dictyostelium discoideum and Plasmodium falciparum, there is evidence that selection, acting at the level of translation, influences codon choices. This is a twofold intriguing finding, since (1) the genomic GC levels of the above mentioned eukaryotes are lower than the GC% of any studied bacteria, and (2) bacteria usually have larger effective population sizes than eukaryotes, and hence natural selection is expected to overcome more efficiently the randomizing effects of genetic drift among prokaryotes than among eukaryotes. In order to gain a new insight about this problem, we analysed the patterns of codon preferences of the nuclear genes of Entamoeba histolytica, a unicellular eukaryote characterised by an extremely AT-rich genome (GC = 25%). The overall codon usage is strongly biased towards A and T in the third codon positions, and among the presumed highly expressed sequences, there is an increased relative usage of a subset of codons, many of which are C-ending. Since an increase in C in third codon positions is 'against' the compositional bias, we conclude that codon usage in E. histolytica, as happens in D. discoideum and P. falciparum, is the result of an equilibrium between compositional pressure and selection. These findings raise the question of why strongly compositionally biased eukaryotic cells may be more sensitive to the (presumed) slight differences among synonymous codons than compositionally biased bacteria.
Théberge, M; Lacaze, P; Shareck, F; Morosoli, R; Kluepfel, D
1992-01-01
The endoglucanase isolated from culture filtrates of Streptomyces lividans IAF74 was shown to have an Mr of 46,000 and a pI of 3.3. The specific enzyme activity of 539 IU/mg, determined by the reducing assay method on carboxymethyl cellulose, is among the highest reported in the literature. The cellulase showed typical endo-type activity when reacting on oligocellodextrins. Optimal enzyme activity was obtained at 50 degrees C and pH 5.5. The kinetic constants for this endoglucanase, determined with carboxymethyl cellulose as the substrate, were a Vmax of 24.9 IU/mg of enzyme and a Km of 4.2 mg/ml. Activity was found against neither methylumbelliferyl- nor p-nitrophenyl-cellobiopyranoside nor with xylan. The DNA sequence contains one possible reading frame validated by the N terminus of the mature purified protein. However, neither ATG nor GTG starting codons were identified near the ribosome-binding site. A putative TTG codon was found as a good candidate for the start codon. Comparison of the primary amino acid sequence of the endoglucanase of S. lividans revealed that the N terminus contains a bacterial cellulose-binding domain. The catalytic domain at the C terminus showed similarity to endoglucanases from a Bacillus sp. Thus, the endoglucanase CelA belongs to family A of cellulases as described before (N. R. Gilkes, B. Henrissat, D. G. Kilburn, R. C. Miller, Jr., and R. A. J. Warren, Microbiol. Rev. 55:303-315, 1991. Images PMID:1575483
Species Based Synonymous Codon Usage in Fusion Protein Gene of Newcastle Disease Virus
Kumar, Chandra Shekhar; Kumar, Sachin
2014-01-01
Newcastle disease is highly pathogenic to poultry and many other avian species. However, the Newcastle disease virus (NDV) has also been reported from many non-avian species. The NDV fusion protein (F) is a major determinant of its pathogenicity and virulence. The functionalities of F gene have been explored for the development of vaccine and diagnostics against NDV. Although the F protein is well studied but the codon usage and its nucleotide composition from NDV isolated from different species have not yet been explored. In present study, we have analyzed the factors responsible for the determination of codon usage in NDV isolated from four major avian host species. The F gene of NDV is analyzed for its base composition and its correlation with the bias in codon usage. Our result showed that random mutational pressure is responsible for codon usage bias in F protein of NDV isolates. Aromaticity, GC3s, and aliphatic index were not found responsible for species based synonymous codon usage bias in F gene of NDV. Moreover, the low amount of codon usage bias and expression level was further confirmed by a low CAI value. The phylogenetic analysis of isolates was found in corroboration with the relatedness of species based on codon usage bias. The relationship between the host species and the NDV isolates from the host does not represent a significant correlation in our study. The present study provides a basic understanding of the mechanism involved in codon usage among species. PMID:25479071
An integrated, structure- and energy-based view of the genetic code.
Grosjean, Henri; Westhof, Eric
2016-09-30
The principles of mRNA decoding are conserved among all extant life forms. We present an integrative view of all the interaction networks between mRNA, tRNA and rRNA: the intrinsic stability of codon-anticodon duplex, the conformation of the anticodon hairpin, the presence of modified nucleotides, the occurrence of non-Watson-Crick pairs in the codon-anticodon helix and the interactions with bases of rRNA at the A-site decoding site. We derive a more information-rich, alternative representation of the genetic code, that is circular with an unsymmetrical distribution of codons leading to a clear segregation between GC-rich 4-codon boxes and AU-rich 2:2-codon and 3:1-codon boxes. All tRNA sequence variations can be visualized, within an internal structural and energy framework, for each organism, and each anticodon of the sense codons. The multiplicity and complexity of nucleotide modifications at positions 34 and 37 of the anticodon loop segregate meaningfully, and correlate well with the necessity to stabilize AU-rich codon-anticodon pairs and to avoid miscoding in split codon boxes. The evolution and expansion of the genetic code is viewed as being originally based on GC content with progressive introduction of A/U together with tRNA modifications. The representation we present should help the engineering of the genetic code to include non-natural amino acids. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Charles, Hubert; Calevro, Federica; Vinuelas, José; Fayard, Jean-Michel; Rahbe, Yvan
2006-01-01
Codon usage bias and relative abundances of tRNA isoacceptors were analysed in the obligate intracellular symbiotic bacterium, Buchnera aphidicola from the aphid Acyrthosiphon pisum, using a dedicated 35mer oligonucleotide microarray. Buchnera is archetypal of organisms living with minimal metabolic requirements and presents a reduced genome with high-evolutionary rate. Codonusage in Buchnera has been overcome by the high mutational bias towards AT bases. However, several lines of evidence for codon usage selection are given here. A significant correlation was found between tRNA relative abundances and codon composition of Buchnera genes. A significant codon usage bias was found for the choice of rare codons in Buchnera: C-ending codons are preferred in highly expressed genes, whereas G-ending codons are avoided. This bias is not explained by GC skew in the bacteria and might correspond to a selection for perfect matching between codon–anticodon pairs for some essential amino acids in Buchnera proteins. Nutritional stress applied to the aphid host induced a significant overexpression of most of the tRNA isoacceptors in bacteria. Although, molecular regulation of the tRNA operons in Buchnera was not investigated, a correlation between relative expression levels and organization in transcription unit was found in the genome of Buchnera. PMID:16963497
Three stages during the evolution of the genetic code. [Abstract only
NASA Technical Reports Server (NTRS)
Baumann, U.; Oro, J.
1994-01-01
A diversification of the genetic code based on the number of codons available for the proteinous amino acids is established. Three groups of amino acids during evolution of the code are distinguished. On the basis of their chemical complexity and a small codon number those amino acids emerging later in a translation process are derived. Both criteria indicate that His, Phe, Tyr, Cys and either Lys or Asn were introduced in the second stage, whereas the number of codons alone gives evidence that Trp and Met were introduced in the third stage. The amino acids of stage one use purines rich codons, thus purines have been retained in their third codon position. All the amino acids introduced in the second stage, in contrast, use pyrimidines in this codon position. A low abundance of pyrimidines during early translation is derived. This assumption is supported by experiments on non enzymatic replication and interactions of DNA hairpin loops with a complementary strand. A back extrapolation concludes a high purine content of the first nucleic acids which gradually decreased during their evolution. Amino acids independently available form prebiotic synthesis were thus correlated to purine rich codons. Conclusions on prebiotic replication are discussed also in the light of recent codon usage data.
Jafary, Fariba; Salehi, Mansoor; Sedghi, Maryam; Nouri, Nayereh; Jafary, Farzaneh; Sadeghi, Farzaneh; Motamedi, Shima; Talebi, Maede
2012-01-01
The mismatch repair system (MMR) is a post-replicative DNA repair mechanism whose defects can lead to cancer. The MSH3 protein is an essential component of the system. We postulated that MSH3 gene polymorphisms might therefore be associated with prostate cancer (PC). We studied MSH3 codon 222 and MSH3 codon 1036 polymorphisms in a group of Iranian sporadic PC patients. A total of 60 controls and 18 patients were assessed using the polymerase chain reaction and single strand conformational polymorphism. For comparing the genotype frequencies of patients and controls the chi-square test was applied. The obtained result indicated that there was significantly association between G/A genotype of MSH3 codon 222 and G/G genotype of MSH3 codon 1036 with an increased PC risk (P=0.012 and P=0.02 respectively). Our results demonstrated that MSH3 codon 222 and MSH3 codon 1036 polymorphisms may be risk factors for sporadic prostate cancer in the Iranian population.
Modular Engineering of l-Tyrosine Production in Escherichia coli
Juminaga, Darmawi; Baidoo, Edward E. K.; Redding-Johanson, Alyssa M.; Batth, Tanveer S.; Burd, Helcio; Mukhopadhyay, Aindrila; Petzold, Christopher J.
2012-01-01
Efficient biosynthesis of l-tyrosine from glucose is necessary to make biological production economically viable. To this end, we designed and constructed a modular biosynthetic pathway for l-tyrosine production in E. coli MG1655 by encoding the enzymes for converting erythrose-4-phosphate (E4P) and phosphoenolpyruvate (PEP) to l-tyrosine on two plasmids. Rational engineering to improve l-tyrosine production and to identify pathway bottlenecks was directed by targeted proteomics and metabolite profiling. The bottlenecks in the pathway were relieved by modifications in plasmid copy numbers, promoter strength, gene codon usage, and the placement of genes in operons. One major bottleneck was due to the bifunctional activities of quinate/shikimate dehydrogenase (YdiB), which caused accumulation of the intermediates dehydroquinate (DHQ) and dehydroshikimate (DHS) and the side product quinate; this bottleneck was relieved by replacing YdiB with its paralog AroE, resulting in the production of over 700 mg/liter of shikimate. Another bottleneck in shikimate production, due to low expression of the dehydroquinate synthase (AroB), was alleviated by optimizing the first 15 codons of the gene. Shikimate conversion to l-tyrosine was improved by replacing the shikimate kinase AroK with its isozyme, AroL, which effectively consumed all intermediates formed in the first half of the pathway. Guided by the protein and metabolite measurements, the best producer, consisting of two medium-copy-number, dual-operon plasmids, was optimized to produce >2 g/liter l-tyrosine at 80% of the theoretical yield. This work demonstrates the utility of targeted proteomics and metabolite profiling in pathway construction and optimization, which should be applicable to other metabolic pathways. PMID:22020510
Kille, Sabrina; Acevedo-Rocha, Carlos G; Parra, Loreto P; Zhang, Zhi-Gang; Opperman, Diederik J; Reetz, Manfred T; Acevedo, Juan Pablo
2013-02-15
Saturation mutagenesis probes define sections of the vast protein sequence space. However, even if randomization is limited this way, the combinatorial numbers problem is severe. Because diversity is created at the codon level, codon redundancy is a crucial factor determining the necessary effort for library screening. Additionally, due to the probabilistic nature of the sampling process, oversampling is required to ensure library completeness as well as a high probability to encounter all unique variants. Our trick employs a special mixture of three primers, creating a degeneracy of 22 unique codons coding for the 20 canonical amino acids. Therefore, codon redundancy and subsequent screening effort is significantly reduced, and a balanced distribution of codon per amino acid is achieved, as demonstrated exemplarily for a library of cyclohexanone monooxygenase. We show that this strategy is suitable for any saturation mutagenesis methodology to generate less-redundant libraries.
High throughput protein production screening
Beernink, Peter T [Walnut Creek, CA; Coleman, Matthew A [Oakland, CA; Segelke, Brent W [San Ramon, CA
2009-09-08
Methods, compositions, and kits for the cell-free production and analysis of proteins are provided. The invention allows for the production of proteins from prokaryotic sequences or eukaryotic sequences, including human cDNAs using PCR and IVT methods and detecting the proteins through fluorescence or immunoblot techniques. This invention can be used to identify optimized PCR and WT conditions, codon usages and mutations. The methods are readily automated and can be used for high throughput analysis of protein expression levels, interactions, and functional states.
Liu, Yan-chen; Huang, Ai-long; Hu, Yuan; Hu, Jie-li; Lai, Guo-qi; Zhang, Wen-lu
2011-12-01
To establish a detection method for HBV drug-resistant mutations related to lamivudine, adefovir and entecavir by optimization and assessment of reverse hybridization system. 26 degenerated probes covering 10 drug-resistant hotspots of 3 drugs were synthesized and immobilized on the same positively charged nylon membrane. PCR products labeled with digoxigenin were hybridized with corresponding probes. To improve the sensitivity and specificity, 4 reaction steps of reverse hybridization were optimized including the number of labeled digoxigenin, the energy intensity of UV cross-linking, hybridization and stringency wash conditions. To prove the feasibility, the specificity, sensitivity and accuracy of this system were assessed respectively. Sensitive and specific results are obtained by the optimization of the following 4 reaction steps: the primers labeled with 3 digoxigenin, energy intensity of UV cross-linking for 1500 x 0.1 mJ/cm², hybridization at 42 degrees C and stringency wash with 0.5 x SSC and 0.1% SDS solution at 44 degrees C for 30 min. In the assessment of system, the majority of probes have high specificity. The quantity of PCR product with a concentration of 10 ng/μl or above can be detected by this method. The concordant rate between reverse hybridization and direct sequencing is 93.9% in the clinical sample test. Though the specificity of several probes needs to be improved further, it is a simple, rapid and sensitive method which can detect HBV resistant mutations related to lamivudine, adefovir and entecavir simultaneously. Due to the short distance between 180 and 181, likewise 202 and 204, the sequence of the same probe covers two codon positions, and hybridization will be interfered by each other. To avoid such interference, the possible solution is that probes are designed by arranging and combining various forms of two near codons.
Codon usage bias and phylogenetic analysis of mitochondrial ND1 gene in pisces, aves, and mammals.
Uddin, Arif; Choudhury, Monisha Nath; Chakraborty, Supriyo
2018-01-01
The mitochondrially encoded NADH:ubiquinone oxidoreductase core subunit 1 (MT-ND1) gene is a subunit of the respiratory chain complex I and involved in the first step of the electron transport chain of oxidative phosphorylation (OXPHOS). To understand the pattern of compositional properties, codon usage and expression level of mitochondrial ND1 genes in pisces, aves, and mammals, we used bioinformatic approaches as no work was reported earlier. In this study, a perl script was used for calculating nucleotide contents and different codon usage bias parameters. The codon usage bias of MT-ND1 was low but the expression level was high as revealed from high ENC and CAI value. Correspondence analysis (COA) suggests that the pattern of codon usage for MT-ND1 gene is not same across species and that compositional constraint played an important role in codon usage pattern of this gene among pisces, aves, and mammals. From the regression equation of GC12 on GC3, it can be inferred that the natural selection might have played a dominant role while mutation pressure played a minor role in influencing the codon usage patterns. Further, ND1 gene has a discrepancy with cytochrome B (CYB) gene in preference of codons as evident from COA. The codon usage bias was low. It is influenced by nucleotide composition, natural selection, mutation pressure, length (number) of amino acids, and relative dinucleotide composition. This study helps in understanding the molecular biology, genetics, evolution of MT-ND1 gene, and also for designing a synthetic gene.
Chakraborty, Supriyo; Uddin, Arif; Mazumder, Tarikul Huda; Choudhury, Monisha Nath; Malakar, Arup Kumar; Paul, Prosenjit; Halder, Binata; Deka, Himangshu; Mazumder, Gulshana Akthar; Barbhuiya, Riazul Ahmed; Barbhuiya, Masuk Ahmed; Devi, Warepam Jesmi
2017-12-02
The study of codon usage coupled with phylogenetic analysis is an important tool to understand the genetic and evolutionary relationship of a gene. The 13 protein coding genes of human mitochondria are involved in electron transport chain for the generation of energy currency (ATP). However, no work has yet been reported on the codon usage of the mitochondrial protein coding genes across six continents. To understand the patterns of codon usage in mitochondrial genes across six different continents, we used bioinformatic analyses to analyze the protein coding genes. The codon usage bias was low as revealed from high ENC value. Correlation between codon usage and GC3 suggested that all the codons ending with G/C were positively correlated with GC3 but vice versa for A/T ending codons with the exception of ND4L and ND5 genes. Neutrality plot revealed that for the genes ATP6, COI, COIII, CYB, ND4 and ND4L, natural selection might have played a major role while mutation pressure might have played a dominant role in the codon usage bias of ATP8, COII, ND1, ND2, ND3, ND5 and ND6 genes. Phylogenetic analysis indicated that evolutionary relationships in each of 13 protein coding genes of human mitochondria were different across six continents and further suggested that geographical distance was an important factor for the origin and evolution of 13 protein coding genes of human mitochondria. Copyright © 2017 Elsevier B.V. and Mitochondria Research Society. All rights reserved.
Plinke, Claudia; Walter, Kerstin; Aly, Sahar; Ehlers, Stefan; Niemann, Stefan
2011-06-01
Ethambutol (EMB) is a major component of the first-line therapy of tuberculosis. Mutations in codon 306 of embB (embB306) were suggested as a major resistance mechanism in clinical isolates. To directly analyze the impact of individual embB306 mutations on EMB resistance, we used allelic exchange experiments to generate embB306 mutants of M. tuberculosis H37Rv. The level of EMB resistance conferred by particular mutations was measured in vitro and in vivo after EMB therapy by daily gavage in a mouse model of aerogenic tuberculosis. The wild-type embB306 ATG codon was replaced by embB306 ATC, ATA, or GTG, respectively. All of the obtained embB306 mutants exhibited a 2- to 4-fold increase in EMB MIC compared to the wild-type H37Rv. In vivo, the one selected embB306 GTG mutant required a higher dose of ethambutol to restrict its growth in the lung compared to wild-type H37Rv. These experiments demonstrate that embB306 point mutations enhance the EMB MIC in vitro to a moderate, but significant extent, and reduce the efficacy of EMB treatment in the animal model. We propose that conventional EMB susceptibility testing, in combination with embB306 genotyping, may guide dose adjustment to avoid clinical treatment failure in these low-level resistant strains.
Börner, Kathleen; Niopek, Dominik; Cotugno, Gabriella; Kaldenbach, Michaela; Pankert, Teresa; Willemsen, Joschka; Zhang, Xian; Schürmann, Nina; Mockenhaupt, Stefan; Serva, Andrius; Hiet, Marie-Sophie; Wiedtke, Ellen; Castoldi, Mirco; Starkuviene, Vytaute; Erfle, Holger; Gilbert, Daniel F.; Bartenschlager, Ralf; Boutros, Michael; Binder, Marco; Streetz, Konrad; Kräusslich, Hans-Georg; Grimm, Dirk
2013-01-01
As the only mammalian Argonaute protein capable of directly cleaving mRNAs in a small RNA-guided manner, Argonaute-2 (Ago2) is a keyplayer in RNA interference (RNAi) silencing via small interfering (si) or short hairpin (sh) RNAs. It is also a rate-limiting factor whose saturation by si/shRNAs limits RNAi efficiency and causes numerous adverse side effects. Here, we report a set of versatile tools and widely applicable strategies for transient or stable Ago2 co-expression, which overcome these concerns. Specifically, we engineered plasmids and viral vectors to co-encode a codon-optimized human Ago2 cDNA along with custom shRNAs. Furthermore, we stably integrated this Ago2 cDNA into a panel of standard human cell lines via plasmid transfection or lentiviral transduction. Using various endo- or exogenous targets, we demonstrate the potential of all three strategies to boost mRNA silencing efficiencies in cell culture by up to 10-fold, and to facilitate combinatorial knockdowns. Importantly, these robust improvements were reflected by augmented RNAi phenotypes and accompanied by reduced off-targeting effects. We moreover show that Ago2/shRNA-co-encoding vectors can enhance and prolong transgene silencing in livers of adult mice, while concurrently alleviating hepatotoxicity. Our customizable reagents and avenues should broadly improve future in vitro and in vivo RNAi experiments in mammalian systems. PMID:24049077
Baca, A M; Hol, W G
2000-02-01
Parasite genes often use codons which are rarely used in the highly expressed genes of Escherichia coli, possibly resulting in translational stalling and lower yields of recombinant protein. We have constructed the "RIG" plasmid to overcome the potential codon-bias problem seen in Plasmodium genes. RIG contains the genes that encode three tRNAs (Arg, Ile, Gly), which recognise rare codons found in parasite genes. When co-transformed into E. coli along with expression plasmids containing parasite genes, RIG can greatly increase levels of overexpressed protein. Codon frequency analysis suggests that RIG may be applied to a variety of protozoan and helminth genes.
Jiang, Fan; Huang, Lv-Yin; Chen, Gui-Lan; Zhou, Jian-Ying; Xie, Xing-Mei; Li, Dong-Zhi
2017-01-01
We describe a new β-thalassemic mutation in a Chinese subject. This allele develops by insertion of one nucleotide (+T) between codons 138 and 139 in the third exon of the β-globin gene. The mutation causes a frameshift that leads to a termination codon at codon 139. In the heterozygote, this allele has the phenotype of classical β-thalassemia (β-thal) minor.
Xu, Yi; Ju, Ho-Jong; DeBlasio, Stacy; Carino, Elizabeth J; Johnson, Richard; MacCoss, Michael J; Heck, Michelle; Miller, W Allen; Gray, Stewart M
2018-06-01
Translational readthrough of the stop codon of the capsid protein (CP) open reading frame (ORF) is used by members of the Luteoviridae to produce their minor capsid protein as a readthrough protein (RTP). The elements regulating RTP expression are not well understood, but they involve long-distance interactions between RNA domains. Using high-resolution mass spectrometry, glutamine and tyrosine were identified as the primary amino acids inserted at the stop codon of Potato leafroll virus (PLRV) CP ORF. We characterized the contributions of a cytidine-rich domain immediately downstream and a branched stem-loop structure 600 to 700 nucleotides downstream of the CP stop codon. Mutations predicted to disrupt and restore the base of the distal stem-loop structure prevented and restored stop codon readthrough. Motifs in the downstream readthrough element (DRTE) are predicted to base pair to a site within 27 nucleotides (nt) of the CP ORF stop codon. Consistent with a requirement for this base pairing, the DRTE of Cereal yellow dwarf virus was not compatible with the stop codon-proximal element of PLRV in facilitating readthrough. Moreover, deletion of the complementary tract of bases from the stop codon-proximal region or the DRTE of PLRV prevented readthrough. In contrast, the distance and sequence composition between the two domains was flexible. Mutants deficient in RTP translation moved long distances in plants, but fewer infection foci developed in systemically infected leaves. Selective 2'-hydroxyl acylation and primer extension (SHAPE) probing to determine the secondary structure of the mutant DRTEs revealed that the functional mutants were more likely to have bases accessible for long-distance base pairing than the nonfunctional mutants. This study reveals a heretofore unknown combination of RNA structure and sequence that reduces stop codon efficiency, allowing translation of a key viral protein. IMPORTANCE Programmed stop codon readthrough is used by many animal and plant viruses to produce key viral proteins. Moreover, such "leaky" stop codons are used in host mRNAs or can arise from mutations that cause genetic disease. Thus, it is important to understand the mechanism(s) of stop codon readthrough. Here, we shed light on the mechanism of readthrough of the stop codon of the coat protein ORFs of viruses in the Luteoviridae by identifying the amino acids inserted at the stop codon and RNA structures that facilitate this "leakiness" of the stop codon. Members of the Luteoviridae encode a C-terminal extension to the capsid protein known as the readthrough protein (RTP). We characterized two RNA domains in Potato leafroll virus (PLRV), located 600 to 700 nucleotides apart, that are essential for efficient RTP translation. We further determined that the PLRV readthrough process involves both local structures and long-range RNA-RNA interactions. Genetic manipulation of the RNA structure altered the ability of PLRV to translate RTP and systemically infect the plant. This demonstrates that plant virus RNA contains multiple layers of information beyond the primary sequence and extends our understanding of stop codon readthrough. Strategic targets that can be exploited to disrupt the virus life cycle and reduce its ability to move within and between plant hosts were revealed. Copyright © 2018 American Society for Microbiology.
Selva Kumar, C; Nair, Rahul R; Sivaramakrishnan, K G; Ganesh, D; Janarthanan, S; Arunachalam, M; Sivaruban, T
2012-12-01
Forces that influence the evolution of synonymous codon usage bias are analyzed in six species of three basal orders of aquatic insects. The rationale behind choosing six species of aquatic insects (three from Ephemeroptera, one from Plecoptera, and two from Odonata) for the present analysis is based on phylogenetic position at the basal clades of the Order Insecta facilitating the understanding of the evolution of codon bias and of factors shaping codon usage patterns in primitive clades of insect lineages and their subtle differences in some of their ecological and environmental requirements in terms of habitat-microhabitat requirements, altitudinal preferences, temperature tolerance ranges, and consequent responses to climate change impacts. The present analysis focuses on open reading frames of the 13 protein-coding genes in the mitochondrial genome of six carefully chosen insect species to get a comprehensive picture of the evolutionary intricacies of codon bias. In all the six species, A and T contents are observed to be significantly higher than G and C, and are used roughly equally. Since transcription hypothesis on codon usage demands A richness and T poorness, it is quite likely that mutation pressure may be the key factor associated with synonymous codon usage (SCU) variations in these species because the mutation hypothesis predicts AT richness and GC poorness in the mitochondrial DNA. Thus, AT-biased mutation pressure seems to be an important factor in framing the SCU variation in all the selected species of aquatic insects, which in turn explains the predominance of A and T ending codons in these species. This study does not find any association between microhabitats and codon usage variations in the mitochondria of selected aquatic insects. However, this study has identified major forces, such as compositional constraints and mutation pressure, which shape patterns of codon usage in mitochondrial genes in the primitive clades of insect lineages.
Three stages in the evolution of the genetic code
NASA Technical Reports Server (NTRS)
Baumann, U.; Oro, J.
1993-01-01
A diversification of the genetic code based on the number of codons available for the proteinous amino acids is established. Three groups of amino acids during evolution of the code are distinguished. On the basis of their chemical complexity those amino acids emerging later in a translation process are derived. Codon number and chemical complexity indicate that His, Phe, Tyr, Cys and either Lys or Asn were introduced in the second stage, whereas the number of codons alone gives evidence that Trp and Met were introduced in the third stage. The amino acids of stage 1 use purine-rich codons, while all the amino acids introduced in the second stage, in contrast, use pyrimidines in the third position of their codons. A low abundance of pyrimidines during early translation is derived. This assumption is supported by experiments on non-enzymatic replication and interactions of hairpin loops with a complementary strand. A back extrapolation concludes a high purine content of the first nucleic acids, which gradually decreased during their evolution. Amino acids independently available from prebiotic synthesis were thus correlated to purine-rich codons. Implications on the prebiotic replication are discussed also in the light of recent codon usage data.
Emergent Rules for Codon Choice Elucidated by Editing Rare Arginine Codons in Escherichia coli
2016-09-20
alternative codons are more likely to be viable. To evaluate synonymous and nonsynonymous alternatives to essential AGRs further, we imple- mented a CRISPR ... Crispr -assisted MAGE). First, we designed oligos that changed not only the target AGR codon to NNN but also made several synonymous changes at least 50...nt downstream that would disrupt a 20-bp CRISPR target lo- cus. MAGE was used to replace each AGR with NNN in parallel, and CRISPR /cas9 was used to
Mandal, Debabrata; Köhrer, Caroline; Su, Dan; Babu, I. Ramesh; Chan, Clement T.Y.; Liu, Yuchen; Söll, Dieter; Blum, Paul; Kuwahara, Masayasu; Dedon, Peter C.; RajBhandary, Uttam L.
2014-01-01
Most archaea and bacteria use a modified C in the anticodon wobble position of isoleucine tRNA to base pair with A but not with G of the mRNA. This allows the tRNA to read the isoleucine codon AUA without also reading the methionine codon AUG. To understand why a modified C, and not U or modified U, is used to base pair with A, we mutated the C34 in the anticodon of Haloarcula marismortui isoleucine tRNA (tRNA2Ile) to U, expressed the mutant tRNA in Haloferax volcanii, and purified and analyzed the tRNA. Ribosome binding experiments show that although the wild-type tRNA2Ile binds exclusively to the isoleucine codon AUA, the mutant tRNA binds not only to AUA but also to AUU, another isoleucine codon, and to AUG, a methionine codon. The G34 to U mutant in the anticodon of another H. marismortui isoleucine tRNA species showed similar codon binding properties. Binding of the mutant tRNA to AUG could lead to misreading of the AUG codon and insertion of isoleucine in place of methionine. This result would explain why most archaea and bacteria do not normally use U or a modified U in the anticodon wobble position of isoleucine tRNA for reading the codon AUA. Biochemical and mass spectrometric analyses of the mutant tRNAs have led to the discovery of a new modified nucleoside, 5-cyanomethyl U in the anticodon wobble position of the mutant tRNAs. 5-Cyanomethyl U is present in total tRNAs from euryarchaea but not in crenarchaea, eubacteria, or eukaryotes. PMID:24344322
Yatawara, Lalani; Wickramasinghe, Susiji; Rajapakse, R P V J; Agatsuma, Takeshi
2010-09-01
In the present study, we determined the complete mitochondrial (mt) genome sequence (13,839bp) of parasitic nematode Setaria digitata and its structure and organization compared with Onchocerca volvulus, Dirofilaria immitis and Brugia malayi. The mt genome of S. digitata is slightly larger than the mt genomes of other filarial nematodes. S. digitata mt genome contains 36 genes (12 protein-coding genes, 22 transfer RNAs and 2 ribosomal RNAs) that are typically found in metazoans. This genome contains a high A+T (75.1%) content and low G+C content (24.9%). The mt gene order for S. digitata is the same as those for O. volvulus, D. immitis and B. malayi but it is distinctly different from other nematodes compared. The start codons inferred in the mt genome of S. digitata are TTT, ATT, TTG, ATG, GTT and ATA. Interestingly, the initiation codon TTT is unique to S. digitata mt genome and four protein-coding genes use this codon as a translation initiation codon. Five protein-coding genes use TAG as a stop codon whereas three genes use TAA and four genes use T as a termination codon. Out of 64 possible codons, only 57 are used for mitochondrial protein-coding genes of S. digitata. T-rich codons such as TTT (18.9%), GTT (7.9%), TTG (7.8%), TAT (7%), ATT (5.7%), TCT (4.8%) and TTA (4.1%) are used more frequently. This pattern of codon usage reflects the strong bias for T in the mt genome of S. digitata. In conclusion, the present investigation provides new molecular data for future studies of the comparative mitochondrial genomics and systematic of parasitic nematodes of socio-economic importance. 2010 Elsevier B.V. All rights reserved.
Hwang, Shin-Rong; Garza, Christina Z; Wegrzyn, Jill; Hook, Vivian Y H
2004-08-16
This study demonstrates utilization of the novel GTG initiation codon for translation of a human mRNA transcript that encodes the serpin endopin 2B, a protease inhibitor. Molecular cloning revealed the nucleotide sequence of the human endopin 2B cDNA. Its deduced primary sequence shows high homology to bovine endopin 2A that possesses cross-class protease inhibition of elastase and papain. Notably, the human endopin 2B cDNA sequence revealed GTG as the predicted translation initiation codon; the predicted translation product of 46 kDa endopin 2B was produced by in vitro translation of 35S-endopin 2B with mammalian (rabbit) protein translation components. Importantly, bioinformatic studies demonstrated the presence of the entire human endopin 2B cDNA sequence with GTG as initiation codon within the human genome on chromosome 14. Further evidence for GTG as a functional initiation codon was illustrated by GTG-mediated in vitro translation of the heterologous protein EGFP, and by GTG-mediated expression of EGFP in mammalian PC12 cells. Mutagenesis of GTG to GTC resulted in the absence of EGFP expression in PC12 cells, indicating the function of GTG as an initiation codon. In addition, it was apparent that the GTG initiation codon produces lower levels of translated protein compared to ATG as initiation codon. Significantly, GTG-mediated translation of endopin 2B demonstrates a functional human gene product not previously predicted from initial analyses of the human genome. Further analyses based on GTG as an alternative initiation codon may predict new candidate genes of the human genome.
O’Donoghue, Patrick; Prat, Laure; Heinemann, Ilka U.; Ling, Jiqiang; Odoi, Keturah; Liu, Wenshe R.; Söll, Dieter
2012-01-01
Over 300 amino acids are found in proteins in nature, yet typically only 20 are genetically encoded. Reassigning stop codons and use of quadruplet codons emerged as the main avenues for genetically encoding non-canonical amino acids (NCAAs). Canonical aminoacyl-tRNAs with near-cognate anticodons also read these codons to some extent. This background suppression leads to ‘statistical protein’ that contains some natural amino acid(s) at a site intended for NCAA. We characterize near-cognate suppression of amber, opal and a quadruplet codon in common Escherichia coli laboratory strains and find that the PylRS/tRNAPyl orthogonal pair cannot completely outcompete contamination by natural amino acids. PMID:23036644
Molnár, István; Hill, D. Steven; Zirkle, Ross; Hammer, Philip E.; Gross, Frank; Buckel, Thomas G.; Jungmann, Volker; Pachlatko, Johannes Paul; Ligon, James M.
2005-01-01
The cytochrome P450 monooxygenase Ema1 from Streptomyces tubercidicus R-922 and its homologs from closely related Streptomyces strains are able to catalyze the regioselective oxidation of avermectin into 4"-oxo-avermectin, a key intermediate in the manufacture of the agriculturally important insecticide emamectin benzoate (V. Jungmann, I. Molnár, P. E. Hammer, D. S. Hill, R. Zirkle, T. G. Buckel, D. Buckel, J. M. Ligon, and J. P. Pachlatko, Appl. Environ. Microbiol. 71:6968-6976, 2005). The gene for Ema1 has been expressed in Streptomyces lividans, Streptomyces avermitilis, and solvent-tolerant Pseudomonas putida strains using different promoters and vectors to provide biocatalytically competent cells. Replacing the extremely rare TTA codon with the more frequent CTG codon to encode Leu4 in Ema1 increased the biocatalytic activities of S. lividans strains producing this enzyme. Ferredoxins and ferredoxin reductases were also cloned from Streptomyces coelicolor and biocatalytic Streptomyces strains and tested in ema1 coexpression systems to optimize the electron transport towards Ema1. PMID:16269733
Zhou, Yanrong; Lin, Yanli; Wu, Xiaojie; Xiong, Fuyin; Lv, Yuemeng; Zheng, Tao; Huang, Peitang; Chen, Hongxing
2012-02-01
Transgene expression for the mammary gland bioreactor aimed at producing recombinant proteins requires optimized expression vector construction. Previously we presented a hybrid gene locus strategy, which was originally tested with human lactoferrin (hLF) as target transgene, and an extremely high-level expression of rhLF ever been achieved as to 29.8 g/l in mice milk. Here to demonstrate the broad application of this strategy, another 38.4 kb mWAP-htPA hybrid gene locus was constructed, in which the 3-kb genomic coding sequence in the 24-kb mouse whey acidic protein (mWAP) gene locus was substituted by the 17.4-kb genomic coding sequence of human tissue plasminogen activator (htPA), exactly from the start codon to the end codon. Corresponding five transgenic mice lines were generated and the highest expression level of rhtPA in the milk attained as to 3.3 g/l. Our strategy will provide a universal way for the large-scale production of pharmaceutical proteins in the mammary gland of transgenic animals.
SGDB: a database of synthetic genes re-designed for optimizing protein over-expression.
Wu, Gang; Zheng, Yuanpu; Qureshi, Imran; Zin, Htar Thant; Beck, Tyler; Bulka, Blazej; Freeland, Stephen J
2007-01-01
Here we present the Synthetic Gene Database (SGDB): a relational database that houses sequences and associated experimental information on synthetic (artificially engineered) genes from all peer-reviewed studies published to date. At present, the database comprises information from more than 200 published experiments. This resource not only provides reference material to guide experimentalists in designing new genes that improve protein expression, but also offers a dataset for analysis by bioinformaticians who seek to test ideas regarding the underlying factors that influence gene expression. The SGDB was built under MySQL database management system. We also offer an XML schema for standardized data description of synthetic genes. Users can access the database at http://www.evolvingcode.net/codon/sgdb/index.php, or batch downloads all information through XML files. Moreover, users may visually compare the coding sequences of a synthetic gene and its natural counterpart with an integrated web tool at http://www.evolvingcode.net/codon/sgdb/aligner.php, and discuss questions, findings and related information on an associated e-forum at http://www.evolvingcode.net/forum/viewforum.php?f=27.
Bioluminescence Monitoring of Neuronal Activity in Freely Moving Zebrafish Larvae
Knafo, Steven; Prendergast, Andrew; Thouvenin, Olivier; Figueiredo, Sophie Nunes; Wyart, Claire
2017-01-01
The proof of concept for bioluminescence monitoring of neural activity in zebrafish with the genetically encoded calcium indicator GFP-aequorin has been previously described (Naumann et al., 2010) but challenges remain. First, bioluminescence signals originating from a single muscle fiber can constitute a major pitfall. Second, bioluminescence signals emanating from neurons only are very small. To improve signals while verifying specificity, we provide an optimized 4 steps protocol achieving: 1) selective expression of a zebrafish codon-optimized GFP-aequorin, 2) efficient soaking of larvae in GFP-aequorin substrate coelenterazine, 3) bioluminescence monitoring of neural activity from motor neurons in free-tailed moving animals performing acoustic escapes and 4) verification of the absence of muscle expression using immunohistochemistry. PMID:29130058
Importance of codon usage for the temporal regulation of viral gene expression
Shin, Young C.; Bischof, Georg F.; Lauer, William A.; Desrosiers, Ronald C.
2015-01-01
The glycoproteins of herpesviruses and of HIV/SIV are made late in the replication cycle and are derived from transcripts that use an unusual codon usage that is quite different from that of the host cell. Here we show that the actions of natural transinducers from these two different families of persistent viruses (Rev of SIV and ORF57 of the rhesus monkey rhadinovirus) are dependent on the nature of the skewed codon usage. In fact, the transinducibility of expression of these glycoproteins by Rev and by ORF57 can be flipped simply by changing the nature of the codon usage. Even expression of a luciferase reporter could be made Rev dependent or ORF57 dependent by distinctive changes to its codon usage. Our findings point to a new general principle in which different families of persisting viruses use a poor codon usage that is skewed in a distinctive way to temporally regulate late expression of structural gene products. PMID:26504241
Subramanian, Abhishek; Sarkar, Ram Rup
2015-10-01
Understanding the variations in gene organization and its effect on the phenotype across different Leishmania species, and to study differential clinical manifestations of parasite within the host, we performed large scale analysis of codon usage patterns between Leishmania and other known Trypanosomatid species. We present the causes and consequences of codon usage bias in Leishmania genomes with respect to mutational pressure, translational selection and amino acid composition bias. We establish GC bias at wobble position that governs codon usage bias across Leishmania species, rather than amino acid composition bias. We found that, within Leishmania, homogenous codon context coding for less frequent amino acid pairs and codons avoiding formation of folding structures in mRNA are essentially chosen. We predicted putative differences in global expression between genes belonging to specific pathways across Leishmania. This explains the role of evolution in shaping the otherwise conserved genome to demonstrate species-specific function-level differences for efficient survival. Copyright © 2015 Elsevier Inc. All rights reserved.
NASA Technical Reports Server (NTRS)
Holmquist, R.; Pearl, D.
1980-01-01
Theoretical equations are derived for molecular divergence with respect to gene and protein structure in the presence of genetic events with unequal probabilities: amino acid and base compositions, the frequencies of nucleotide replacements, the usage of degenerate codons, the distribution of fixed base replacements within codons and the distribution of fixed base replacements among codons. Results are presented in the form of tables relating the probabilities of given numbers of codon base changes with respect to the original codon for the alpha hemoglobin, beta hemoglobin, myoglobin, cytochrome c and parvalbumin group gene families. Application of the calculations to the rabbit alpha and beta hemoglobin mRNAs and proteins indicates that the genes are separated by about 425 fixed based replacements distributed over 114 codon sites, which is a factor of two greater than previous estimates. The theoretical results also suggest that many more base replacements are required to effect a given gene or protein structural change than previously believed.
Beilharz, Katrin; van Raaphorst, Renske; Kjos, Morten; Veening, Jan-Willem
2015-10-01
During the last decades, a wide range of fluorescent proteins (FPs) have been developed and improved. This has had a great impact on the possibilities in biological imaging and the investigation of cellular processes at the single-cell level. Recently, we have benchmarked a set of green fluorescent proteins (GFPs) and generated a codon-optimized superfolder GFP for efficient use in the important human pathogen Streptococcus pneumoniae and other low-GC Gram-positive bacteria. In the present work, we constructed and compared four red fluorescent proteins (RFPs) in S. pneumoniae. Two orange-red variants, mOrange2 and TagRFP, and two far-red FPs, mKate2 and mCherry, were codon optimized and examined by fluorescence microscopy and plate reader assays. Notably, protein fusions of the RFPs to FtsZ were constructed by direct transformation of linear Gibson assembly (isothermal assembly) products, a method that speeds up the strain construction process significantly. Our data show that mCherry is the fastest-maturing RFP in S. pneumoniae and is best suited for studying gene expression, while mKate2 and TagRFP are more stable and are the preferred choices for protein localization studies. The RFPs described here will be useful for cell biology studies that require multicolor labeling in S. pneumoniae and related organisms. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Zhao, Yao; Kang, Lin; Gao, Shan; Zhou, Yang; Su, Libo; Xin, Wenwen; Su, Yuxin; Wang, Jinglin
2011-06-01
The alpha and epsilon toxins are 2 of the 4 major lethal toxins of the pathogen Clostridium perfringens. In this study, the expression of the epsilon toxin (etx) gene of C. perfringens was optimized by replacing rare codons with high-frequency codons, and the optimized gene was synthesized using overlapping PCR. Then, the etx gene or the alpha-toxin gene (cpa) was individually inserted into the pTIG-Trx expression vector with a hexahistidine tag and a thioredoxin (Trx) to facilitate their purification and induce the expression of soluble proteins. The recombinant alpha toxin (rCPA) and epsilon toxin (rETX) were highly expressed as soluble forms in the recipient Escherichia coli BL21 strain, respectively. The rCPA and rETX were purified using Ni(2+)-chelating chromatography and size-exclusion chromatography. And the entire purification process recovered about 40% of each target protein from the starting materials. The purified target toxins formed single band at about 42kDa (rCPA) or 31kDa (rETX) in sodium dodecyl sulfate-polyacrylamide gel electrophoresis, and their functional activity was confirmed by bioactivity assays. We have shown that the production of large amounts of soluble and functional proteins by using the pTIG-Trx vector in E. coli is a good alternative for the production of native alpha and epsilon toxins and could also be useful for the production of other toxic proteins with soluble forms. Copyright © 2011 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yokawa, Satoru; School of Pharmacy, Aichi Gakuin University, Nagoya 464-8650; Suzuki, Takahiro
We have firstly visualized glucagon secretion using a method of video-rate bioluminescence imaging. The fusion protein of proglucagon and Gaussia luciferase (PGCG-GLase) was used as a reporter to detect glucagon secretion and was efficiently expressed in mouse pancreatic α cells (αTC1.6) using a preferred human codon-optimized gene. In the culture medium of the cells expressing PGCG-GLase, luminescence activity determined with a luminometer was increased with low glucose stimulation and KCl-induced depolarization, as observed for glucagon secretion. From immunochemical analyses, PGCG-GLase stably expressed in clonal αTC1.6 cells was correctly processed and released by secretory granules. Luminescence signals of the secreted PGCG-GLase frommore » the stable cells were visualized by video-rate bioluminescence microscopy. The video images showed an increase in glucagon secretion from clustered cells in response to stimulation by KCl. The secretory events were observed frequently at the intercellular contact regions. Thus, the localization and frequency of glucagon secretion might be regulated by cell-cell adhesion. - Highlights: • The fused protein of proglucagon to Gaussia luciferase was used as a reporter. • The fusion protein was highly expressed using a preferred human-codon optimized gene. • Glucagon secretion stimulated by depolarization was determined by luminescence. • Glucagon secretion in α cells was visualized by bioluminescence imaging. • Glucagon secretion sites were localized in the intercellular contact regions.« less
Loughran, Gary; Jungreis, Irwin; Tzani, Ioanna; Power, Michael; Dmitriev, Ruslan I.; Ivanov, Ivaylo P.; Kellis, Manolis; Atkins, John F.
2018-01-01
Although stop codon readthrough is used extensively by viruses to expand their gene expression, verified instances of mammalian readthrough have only recently been uncovered by systems biology and comparative genomics approaches. Previously, our analysis of conserved protein coding signatures that extend beyond annotated stop codons predicted stop codon readthrough of several mammalian genes, all of which have been validated experimentally. Four mRNAs display highly efficient stop codon readthrough, and these mRNAs have a UGA stop codon immediately followed by CUAG (UGA_CUAG) that is conserved throughout vertebrates. Extending on the identification of this readthrough motif, we here investigated stop codon readthrough, using tissue culture reporter assays, for all previously untested human genes containing UGA_CUAG. The readthrough efficiency of the annotated stop codon for the sequence encoding vitamin D receptor (VDR) was 6.7%. It was the highest of those tested but all showed notable levels of readthrough. The VDR is a member of the nuclear receptor superfamily of ligand-inducible transcription factors, and it binds its major ligand, calcitriol, via its C-terminal ligand-binding domain. Readthrough of the annotated VDR mRNA results in a 67 amino acid–long C-terminal extension that generates a VDR proteoform named VDRx. VDRx may form homodimers and heterodimers with VDR but, compared with VDR, VDRx displayed a reduced transcriptional response to calcitriol even in the presence of its partner retinoid X receptor. PMID:29386352
DOE Office of Scientific and Technical Information (OSTI.GOV)
Colledge, Danielle; Soppe, Sally; Yuen, Lilly
Premature stop codons in the hepatitis B virus (HBV) surface protein can be associated with nucleos(t)ide analogue resistance due to overlap of the HBV surface and polymerase genes. The aim of this study was to determine the effect of the replication of three common surface stop codon variants on the hepatocyte. Cell lines were transfected with infectious HBV clones encoding surface stop codons rtM204I/sW196*, rtA181T/sW172*, rtV191I/sW182*, and a panel of substitutions in the surface proteins. HBsAg was measured by Western blotting. Proliferation and apoptosis were measured using flow cytometry. All three surface stop codon variants were defective in HBsAg secretion.more » Cells transfected with these variants were less proliferative and had higher levels of apoptosis than those transfected with variants that did not encode surface stop codons. The most cytopathic variant was rtM204I/sW196*. Replication of HBV encoding surface stop codons was toxic to the cell and promoted apoptosis, exacerbating disease progression. - Highlights: •Under normal circumstances, HBV replication is not cytopathic. •Premature stop codons in the HBV surface protein can be selected and enriched during nucleos(t)ide analogue therapy. •Replication of these variants can be cytopathic to the cell and promote apoptosis. •Inadequate antiviral therapy may actually promote disease progression.« less
GC-Content of Synonymous Codons Profoundly Influences Amino Acid Usage
Li, Jing; Zhou, Jun; Wu, Ying; Yang, Sihai; Tian, Dacheng
2015-01-01
Amino acids typically are encoded by multiple synonymous codons that are not used with the same frequency. Codon usage bias has drawn considerable attention, and several explanations have been offered, including variation in GC-content between species. Focusing on a simple parameter—combined GC proportion of all the synonymous codons for a particular amino acid, termed GCsyn—we try to deepen our understanding of the relationship between GC-content and amino acid/codon usage in more details. We analyzed 65 widely distributed representative species and found a close association between GCsyn, GC-content, and amino acids usage. The overall usages of the four amino acids with the greatest GCsyn and the five amino acids with the lowest GCsyn both vary with the regional GC-content, whereas the usage of the remaining 11 amino acids with intermediate GCsyn is less variable. More interesting, we discovered that codon usage frequencies are nearly constant in regions with similar GC-content. We further quantified the effects of regional GC-content variation (low to high) on amino acid usage and found that GC-content determines the usage variation of amino acids, especially those with extremely high GCsyn, which accounts for 76.7% of the changed GC-content for those regions. Our results suggest that GCsyn correlates with GC-content and has impact on codon/amino acid usage. These findings suggest a novel approach to understanding the role of codon and amino acid usage in shaping genomic architecture and evolutionary patterns of organisms. PMID:26248983
CCC CGA is a weak translational recoding site in Escherichia coli.
Shu, Ping; Dai, Huacheng; Mandecki, Wlodek; Goldman, Emanuel
2004-12-08
Previously published experiments had indicated unexpected expression of a control vector in which a beta-galactosidase reporter was in the +1 reading frame relative to the translation start. This control vector contained the codon pair CCC CGA in the zero reading frame, raising the possibility that ribosomes rephased on this sequence, with peptidyl-tRNA(Pro) pairing with CCC in the +1 frame. This putative rephasing might also be exacerbated by the rare CGA Arg codon in the second position due to increased vacancy of the ribosomal A-site. To test this hypothesis, a series of site-directed mutants was constructed, including mutations in both the first and second codons of this codon pair. The results show that interrupting the continuous run of C residues with synonymous codon changes essentially abolishes the frameshift. Further, changing the rare Arg codon to a common Arg codon also reduces the frequency of the frameshift. These results provide strong support for the hypothesis that CCC CGA in the zero frame is indeed a weak translational frameshift site in Escherichia coli, with a 1-2% efficiency. Because the vector sequence also contains another CCC triplet in the +1 reading frame starting within the next codon after the CGA, our data also support possible contribution to expression of a +7 nucleotide ribosome hop into the same +1 reading frame. We also confirm here a previous report that CCC UGA is a translational frameshift site, in these experiments, with about 5% efficiency.
The ribosome as an optimal decoder: a lesson in molecular recognition.
Savir, Yonatan; Tlusty, Tsvi
2013-04-11
The ribosome is a complex molecular machine that, in order to synthesize proteins, has to decode mRNAs by pairing their codons with matching tRNAs. Decoding is a major determinant of fitness and requires accurate and fast selection of correct tRNAs among many similar competitors. However, it is unclear whether the modern ribosome, and in particular its large conformational changes during decoding, are the outcome of adaptation to its task as a decoder or the result of other constraints. Here, we derive the energy landscape that provides optimal discrimination between competing substrates and thereby optimal tRNA decoding. We show that the measured landscape of the prokaryotic ribosome is sculpted in this way. This model suggests that conformational changes of the ribosome and tRNA during decoding are means to obtain an optimal decoder. Our analysis puts forward a generic mechanism that may be utilized broadly by molecular recognition systems. Copyright © 2013 Elsevier Inc. All rights reserved.
GeMS: an advanced software package for designing synthetic genes.
Jayaraj, Sebastian; Reid, Ralph; Santi, Daniel V
2005-01-01
A user-friendly, advanced software package for gene design is described. The software comprises an integrated suite of programs-also provided as stand-alone tools-that automatically performs the following tasks in gene design: restriction site prediction, codon optimization for any expression host, restriction site inclusion and exclusion, separation of long sequences into synthesizable fragments, T(m) and stem-loop determinations, optimal oligonucleotide component design and design verification/error-checking. The output is a complete design report and a list of optimized oligonucleotides to be prepared for subsequent gene synthesis. The user interface accommodates both inexperienced and experienced users. For inexperienced users, explanatory notes are provided such that detailed instructions are not necessary; for experienced users, a streamlined interface is provided without such notes. The software has been extensively tested in the design and successful synthesis of over 400 kb of genes, many of which exceeded 5 kb in length.
Ribosome A and P sites revealed by length analysis of ribosome profiling data
Martens, Andrew T.; Taylor, James; Hilser, Vincent J.
2015-01-01
The high-throughput sequencing of nuclease-protected mRNA fragments bound to ribosomes, a technique known as ribosome profiling, quantifies the relative frequencies with which different regions of transcripts are translated. This technique has revealed novel translation initiation sites with unprecedented scope and has furthered investigations into the connections between codon biases and translation rates. Yet the location of the codon being decoded in ribosome footprints is still unknown, and has been complicated by the recent observation of footprints with non-canonical lengths. Here we show how taking into account the variations in ribosome footprint lengths can reveal the ribosome aminoacyl (A) and peptidyl (P) site locations. These location assignments are in agreement with the proposed mechanisms for various ribosome pauses and further enhance the resolution of the profiling data. We also show that GC-rich motifs at the 5′ ends of footprints are found in yeast, calling into question the anti-Shine-Dalgarno effect's role in ribosome pausing. PMID:25805170
USDA-ARS?s Scientific Manuscript database
In order to characterize the evolutionary adaptations of avian paramyxovirus 1 (APMV-1) genomes, we have compared codon usage and codon adaptation indexes among groups of Newcastle disease viruses that differ in biological, ecological, and genetic characteristics. We have used available GenBank com...
Quach, Tommy; Brooks, Daniel M; Miranda, Hector C
2016-01-01
The complete mitochondrial genome of the Palawan peacock-pheasant Polyplectron napoleonis is 16,710 bp and contains 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes and a control-region. All protein-coding genes use the standard ATG start codon, except for cox1 which has GTG start codon. Seven out of 13 PCGs have TAA stop codons, two have AGG (cox1 and nd6), and three PCGs (nd2, cox2 and nd4) have incomplete stop codon of just T- - nucleotide.
Das, Shibsankar; Roymondal, Uttam; Sahoo, Satyabrata
2009-08-15
Based on the hypothesis that highly expressed genes are often characterized by strong compositional bias in terms of codon usage, there are a number of measures currently in use that quantify codon usage bias in genes, and hence provide numerical indices to predict the expression levels of genes. With the recent advent of expression measure from the score of the relative codon usage bias (RCBS), we have explicitly tested the performance of this numerical measure to predict the gene expression level and illustrate this with an analysis of Yeast genomes. In contradiction with previous other studies, we observe a weak correlations between GC content and RCBS, but a selective pressure on the codon preferences in highly expressed genes. The assertion that the expression of a given gene depends on the score of relative codon usage bias (RCBS) is supported by the data. We further observe a strong correlation between RCBS and protein length indicating natural selection in favour of shorter genes to be expressed at higher level. We also attempt a statistical analysis to assess the strength of relative codon bias in genes as a guide to their likely expression level, suggesting a decrease of the informational entropy in the highly expressed genes.
Abad, Francisco; de la Morena-Barrio, María Eugenia; Fernández-Breis, Jesualdo Tomás; Corral, Javier
2018-06-01
Translation is a key biological process controlled in eukaryotes by the initiation AUG codon. Variations affecting this codon may have pathological consequences by disturbing the correct initiation of translation. Unfortunately, there is no systematic study describing these variations in the human genome. Moreover, we aimed to develop new tools for in silico prediction of the pathogenicity of gene variations affecting AUG codons, because to date, these gene defects have been wrongly classified as missense. Whole-exome analysis revealed the mean of 12 gene variations per person affecting initiation codons, mostly with high (> 0:01) minor allele frequency (MAF). Moreover, analysis of Ensembl data (December 2017) revealed 11,261 genetic variations affecting the initiation AUG codon of 7,205 genes. Most of these variations (99.5%) have low or unknown MAF, probably reflecting deleterious consequences. Only 62 variations had high MAF. Genetic variations with high MAF had closer alternative AUG downstream codons than did those with low MAF. Besides, the high-MAF group better maintained both the signal peptide and reading frame. These differentiating elements could help to determine the pathogenicity of this kind of variation. Data and scripts in Perl and R are freely available at https://github.com/fanavarro/hemodonacion. jfernand@um.es. Supplementary data are available at Bioinformatics online.
Influence of codon usage bias on FGLamide-allatostatin mRNA secondary structure.
Martínez-Pérez, Francisco; Bendena, William G; Chang, Belinda S W; Tobe, Stephen S
2011-03-01
The FGLamide allatostatins (ASTs) are invertebrate neuropeptides which inhibit juvenile hormone biosynthesis in Dictyoptera and related orders. They also show myomodulatory activity. FGLamide AST nucleotide frequencies and codon bias were investigated with respect to possible effects on mRNA secondary structure. 367 putative FGLamide ASTs and their potential endoproteolytic cleavage sites were identified from 40 species of crustaceans, chelicerates and insects. Among these, 55% comprised only 11 amino acids. An FGLamide AST consensus was identified to be (X)(1→16)Y(S/A/N/G)FGLGKR, with a strong bias for the codons UUU encoding for Phe and AAA for Lys, which can form strong Watson-Crick pairing in all peptides analyzed. The physical distance between these codons favor a loop structure from Ser/Ala-Phe to Lys-Arg. Other loop and hairpin loops were also inferred from the codon frequencies in the N-terminal motif, and the first amino acids from the C-terminal motif, or the dibasic potential endoproteolytic cleavage site. Our results indicate that nucleotide frequencies and codon usage bias in FGLamide ASTs tend to favor mRNA folds in the codon sequence in the C-terminal active peptide core and at the dibasic potential endoproteolytic cleavage site. Copyright © 2010 Elsevier Inc. All rights reserved.
Bohmert-Tatarev, Karen; McAvoy, Susan; Daughtry, Sean; Peoples, Oliver P; Snell, Kristi D
2011-04-01
An optimized genetic construct for plastid transformation of tobacco (Nicotiana tabacum) for the production of the renewable, biodegradable plastic polyhydroxybutyrate (PHB) was designed using an operon extension strategy. Bacterial genes encoding the PHB pathway enzymes were selected for use in this construct based on their similarity to the codon usage and GC content of the tobacco plastome. Regulatory elements with limited homology to the host plastome yet known to yield high levels of plastidial recombinant protein production were used to enhance the expression of the transgenes. A partial transcriptional unit, containing genes of the PHB pathway and a selectable marker gene encoding spectinomycin resistance, was flanked at the 5' end by the host plant's psbA coding sequence and at the 3' end by the host plant's 3' psbA untranslated region. This design allowed insertion of the transgenes into the plastome as an extension of the psbA operon, rendering the addition of a promoter to drive the expression of the transgenes unnecessary. Transformation of the optimized construct into tobacco and subsequent spectinomycin selection of transgenic plants yielded T0 plants that were capable of producing up to 18.8% dry weight PHB in samples of leaf tissue. These plants were fertile and produced viable seed. T1 plants producing up to 17.3% dry weight PHB in samples of leaf tissue and 8.8% dry weight PHB in the total biomass of the plant were also isolated.
Wang, Ying; Li, Yue-Zhong
2014-06-07
Overexpression of foreign genes in Escherichia coli cells is an efficient means to obtain recombinant proteins. The technique is, however, often hampered by misfolding, degradation, aggregation and formation in inclusion bodies of products. In this study, we reported that in vivo solubility of overexpressed arginine deiminases (ADI) improved by changing the cultivation conditions. ADI is enzymes that convert L-arginine to L-citrulline. After codon optimization, we synthesized the ADI gene of Pseudomonas putida and constructed it for overexpression in E. coli cells. The rADI products were mainly in inclusion body forms. We performed a series of optimization to enhance solubility of the protein. Co-expression with the GroES-GroEL chaperone team increased approximately 5-fold of the rADI activity. In addition the combination of L-arginine and D-glucose in the Luria-Bertani (LB) growth medium further increased the total activity to about 15 times. Separate L-arginine and D-glucose or the addition of other saccharides or amino acids had no such effects. The solubilization effects of the combination of L-arginine and D-glucose were further confirmed in the overexpression of another ADI from Listeria welshimeri. The enzymatic and conversion characteristics of the rADI products were further determined. Combined addition of L-arginine and D-glucose in the LB medium significantly improved in vivo solubility of rADI proteins. The present study suggested a new strategy to increase the solubilization of overexpressed recombinant proteins in E. coli cells.
NASA Astrophysics Data System (ADS)
Sharma, Ajeet K.; Ahmed, Nabeel; O'Brien, Edward P.
2018-02-01
Ribosome profiling experiments have found greater than 100-fold variation in ribosome density along mRNA transcripts, indicating that individual codon elongation rates can vary to a similar degree. This wide range of elongation times, coupled with differences in codon usage between transcripts, suggests that the average codon translation-rate per gene can vary widely. Yet, ribosome run-off experiments have found that the average codon translation rate for different groups of transcripts in mouse stem cells is constant at 5.6 AA/s. How these seemingly contradictory results can be reconciled is the focus of this study. Here, we combine knowledge of the molecular factors shown to influence translation speed with genomic information from Escherichia coli, Saccharomyces cerevisiae and Homo sapiens to simulate the synthesis of cytosolic proteins in these organisms. The model recapitulates a near constant average translation rate, which we demonstrate arises because the molecular determinants of translation speed are distributed nearly randomly amongst most of the transcripts. Consequently, codon translation rates are also randomly distributed and fast-translating segments of a transcript are likely to be offset by equally probable slow-translating segments, resulting in similar average elongation rates for most transcripts. We also show that the codon usage bias does not significantly affect the near random distribution of codon translation rates because only about 10 % of the total transcripts in an organism have high codon usage bias while the rest have little to no bias. Analysis of Ribo-Seq data and an in vivo fluorescent assay supports these conclusions.
Kamble, Asmita S; Fandilolu, Prayagraj M; Sambhare, Susmit B; Sonawane, Kailas D
2017-01-01
Lack of naturally occurring modified nucleoside 5-taurinomethyluridine (τm5U) at the 'wobble' 34th position in tRNALeu causes mitochondrial myopathy, encephalopathy, lactic acidosis and stroke-like episodes (MELAS). The τm5U34 specifically recognizes UUG and UUA codons. Structural consequences of τm5U34 to read cognate codons have not been studied so far in detail at the atomic level. Hence, 50ns multiple molecular dynamics (MD) simulations of various anticodon stem loop (ASL) models of tRNALeu in presence and absence of τm5U34 along with UUG and UUA codons were performed to explore the dynamic behaviour of τm5U34 during codon recognition process. The MD simulation results revealed that τm5U34 recognizes G/A ending codons by 'wobble' as well as a novel 'single' hydrogen bonding interactions. RMSD and RMSF values indicate the comparative stability of the ASL models containing τm5U34 modification over the other models, lacking τm5U34. Another MD simulation study of 55S mammalian mitochondrial rRNA with tRNALeu showed crucial interactions between the A-site residues, A918, A919, G256 and codon-anticodon bases. Thus, these results could improve our understanding about the decoding efficiency of human mt tRNALeu with τm5U34 to recognize UUG and UUA codons.
Kamble, Asmita S.; Fandilolu, Prayagraj M.; Sambhare, Susmit B.; Sonawane, Kailas D.
2017-01-01
Lack of naturally occurring modified nucleoside 5-taurinomethyluridine (τm5U) at the ‘wobble’ 34th position in tRNALeu causes mitochondrial myopathy, encephalopathy, lactic acidosis and stroke-like episodes (MELAS). The τm5U34 specifically recognizes UUG and UUA codons. Structural consequences of τm5U34 to read cognate codons have not been studied so far in detail at the atomic level. Hence, 50ns multiple molecular dynamics (MD) simulations of various anticodon stem loop (ASL) models of tRNALeu in presence and absence of τm5U34 along with UUG and UUA codons were performed to explore the dynamic behaviour of τm5U34 during codon recognition process. The MD simulation results revealed that τm5U34 recognizes G/A ending codons by ‘wobble’ as well as a novel ‘single’ hydrogen bonding interactions. RMSD and RMSF values indicate the comparative stability of the ASL models containing τm5U34 modification over the other models, lacking τm5U34. Another MD simulation study of 55S mammalian mitochondrial rRNA with tRNALeu showed crucial interactions between the A-site residues, A918, A919, G256 and codon-anticodon bases. Thus, these results could improve our understanding about the decoding efficiency of human mt tRNALeu with τm5U34 to recognize UUG and UUA codons. PMID:28453549
Dass, J Febin Prabhu; Sudandiradoss, C
2012-07-15
5-HT (5-Hydroxy-tryptamine) or serotonin receptors are found both in central and peripheral nervous system as well as in non-neuronal tissues. In the animal and human nervous system, serotonin produces various functional effects through a variety of membrane bound receptors. In this study, we focus on 5-HT receptor family from different mammals and examined the factors that account for codon and nucleotide usage variation. A total of 110 homologous coding sequences from 11 different mammalian species were analyzed using relative synonymous codon usage (RSCU), correspondence analysis (COA) and hierarchical cluster analysis together with nucleotide base usage frequency of chemically similar amino acid codons. The mean effective number of codon (ENc) value of 37.06 for 5-HT(6) shows very high codon bias within the family and may be due to high selective translational efficiency. The COA and Spearman's rank correlation reveals that the nucleotide compositional mutation bias as the major factors influencing the codon usage in serotonin receptor genes. The hierarchical cluster analysis suggests that gene function is another dominant factor that affects the codon usage bias, while species is a minor factor. Nucleotide base usage was reported using Goldman, Engelman, Stietz (GES) scale reveals the presence of high uracil (>45%) content at functionally important hydrophobic regions. Our in silico approach will certainly help for further investigations on critical inference on evolution, structure, function and gene expression aspects of 5-HT receptors family which are potential antipsychotic drug targets. Copyright © 2012 Elsevier B.V. All rights reserved.
Recent evidence for evolution of the genetic code
NASA Technical Reports Server (NTRS)
Osawa, S.; Jukes, T. H.; Watanabe, K.; Muto, A.
1992-01-01
The genetic code, formerly thought to be frozen, is now known to be in a state of evolution. This was first shown in 1979 by Barrell et al. (G. Barrell, A. T. Bankier, and J. Drouin, Nature [London] 282:189-194, 1979), who found that the universal codons AUA (isoleucine) and UGA (stop) coded for methionine and tryptophan, respectively, in human mitochondria. Subsequent studies have shown that UGA codes for tryptophan in Mycoplasma spp. and in all nonplant mitochondria that have been examined. Universal stop codons UAA and UAG code for glutamine in ciliated protozoa (except Euplotes octacarinatus) and in a green alga, Acetabularia. E. octacarinatus uses UAA for stop and UGA for cysteine. Candida species, which are yeasts, use CUG (leucine) for serine. Other departures from the universal code, all in nonplant mitochondria, are CUN (leucine) for threonine (in yeasts), AAA (lysine) for asparagine (in platyhelminths and echinoderms), UAA (stop) for tyrosine (in planaria), and AGR (arginine) for serine (in several animal orders) and for stop (in vertebrates). We propose that the changes are typically preceded by loss of a codon from all coding sequences in an organism or organelle, often as a result of directional mutation pressure, accompanied by loss of the tRNA that translates the codon. The codon reappears later by conversion of another codon and emergence of a tRNA that translates the reappeared codon with a different assignment. Changes in release factors also contribute to these revised assignments. We also discuss the use of UGA (stop) as a selenocysteine codon and the early history of the code.
Hart, Andrew; Cortés, María Paz; Latorre, Mauricio; Martinez, Servet
2018-01-01
The analysis of codon usage bias has been widely used to characterize different communities of microorganisms. In this context, the aim of this work was to study the codon usage bias in a natural consortium of five acidophilic bacteria used for biomining. The codon usage bias of the consortium was contrasted with genes from an alternative collection of acidophilic reference strains and metagenome samples. Results indicate that acidophilic bacteria preferentially have low codon usage bias, consistent with both their capacity to live in a wide range of habitats and their slow growth rate, a characteristic probably acquired independently from their phylogenetic relationships. In addition, the analysis showed significant differences in the unique sets of genes from the autotrophic species of the consortium in relation to other acidophilic organisms, principally in genes which code for proteins involved in metal and oxidative stress resistance. The lower values of codon usage bias obtained in this unique set of genes suggest higher transcriptional adaptation to living in extreme conditions, which was probably acquired as a measure for resisting the elevated metal conditions present in the mine.
Schuster, W; Brennicke, A
1991-01-01
An intact gene for the ribosomal protein S19 (rps19) is absent from Oenothera mitochondria. The conserved rps19 reading frame found in the mitochondrial genome is interrupted by a termination codon. This rps19 pseudogene is cotranscribed with the downstream rps3 gene and is edited on both sides of the translational stop. Editing, however, changes the amino acid sequence at positions that were well conserved before editing. Other strange editings create translational stops in open reading frames coding for functional proteins. In coxI and rps3 mRNAs CGA codons are edited to UGA stop codons only five and three codons, respectively, downstream to the initiation codon. These aberrant editings in essential open reading frames and in the rps19 pseudogene appear to have been shifted to these positions from other editing sites. These observations suggest a requirement for a continuous evolutionary constraint on the editing specificities in plant mitochondria. Images PMID:1762921
Luo, M; Mao, X; Plummer, F A
2005-02-01
We report here four novel HLA-B alleles, B*1590, B*1591, B*2726, and B*4705, identified from an East African population during sequence-based HLA-B typing. The novel alleles were confirmed by sequencing two separate polymerase chain reaction products, and by molecular cloning and sequencing multiple clones. B*1590 is identical to B*1510 at exon 2 and exon 3, except for a difference (GCCGTC) at codon 158. Sequence differences at codon 152 (GAGGTG) and codon 167 (TGGTCG) differentiate B*1591 from B*1503 at exon 3. B*2726 is identical to B*2708 at exon 2 and exon 3, except for a difference (AAGCAG) at codon 70. B*4705 was identified in three Kenyan women. The allele is identical to B*47010101/02 at exon 2 and exon 3, except for differences at codon 97 (AGGAAT) and codon 99 (TTTTAT). These new alleles have been named by the WHO Nomenclature Committee. Identification of these novel HLA-B alleles reflects the genetic diversity of this East African population.
Sroubek, Jakub; Krishnan, Yamini; McDonald, Thomas V.
2013-01-01
Human ether-á-gogo-related gene (HERG) encodes a potassium channel that is highly susceptible to deleterious mutations resulting in susceptibility to fatal cardiac arrhythmias. Most mutations adversely affect HERG channel assembly and trafficking. Why the channel is so vulnerable to missense mutations is not well understood. Since nothing is known of how mRNA structural elements factor in channel processing, we synthesized a codon-modified HERG cDNA (HERG-CM) where the codons were synonymously changed to reduce GC content, secondary structure, and rare codon usage. HERG-CM produced typical IKr-like currents; however, channel synthesis and processing were markedly different. Translation efficiency was reduced for HERG-CM, as determined by heterologous expression, in vitro translation, and polysomal profiling. Trafficking efficiency to the cell surface was greatly enhanced, as assayed by immunofluorescence, subcellular fractionation, and surface labeling. Chimeras of HERG-NT/CM indicated that trafficking efficiency was largely dependent on 5′ sequences, while translation efficiency involved multiple areas. These results suggest that HERG translation and trafficking rates are independently governed by noncoding information in various regions of the mRNA molecule. Noncoding information embedded within the mRNA may play a role in the pathogenesis of hereditary arrhythmia syndromes and could provide an avenue for targeted therapeutics.—Sroubek, J., Krishnan, Y., McDonald, T V. Sequence- and structure-specific elements of HERG mRNA determine channel synthesis and trafficking efficiency. PMID:23608144
High-level expression of a synthetic gene encoding a sweet protein, monellin, in Escherichia coli.
Chen, Zhongjun; Cai, Heng; Lu, Fuping; Du, Lianxiang
2005-11-01
The expression of a synthetic gene encoding monellin, a sweet protein, in E. coli under the control of T7 promoter from phage is described. The single-chain monellin gene was designed based on the biased codons of E. coli so as to optimize its expression. Monellin was produced and accounted for 45% of total soluble proteins. It was purified to yield 43 mg protein per g dry cell wt. The purity of the recombinant protein was confirmed by SDS-PAGE.
Lack of correlation between p53 codon 72 polymorphism and anal cancer risk
Contu, Simone S; Agnes, Grasiela; Damin, Andrea P; Contu, Paulo C; Rosito, Mário A; Alexandre, Claudio O; Damin, Daniel C
2009-01-01
AIM: To investigate the potential role of p53 codon 72 polymorphism as a risk factor for development of anal cancer. METHODS: Thirty-two patients with invasive anal carcinoma and 103 healthy blood donors were included in the study. p53 codon 72 polymorphism was analyzed in blood samples through polymerase chain reaction-restriction fragment length polymorphism and DNA sequencing. RESULTS: The relative frequency of each allele was 0.60 for Arg and 0.40 for Pro in patients with anal cancer, and 0.61 for Arg and 0.39 for Pro in normal controls. No significant differences in distribution of the codon 72 genotypes between patients and controls were found. CONCLUSION: These results do not support a role for the p53 codon 72 polymorphism in anal carcinogenesis. PMID:19777616
Strong Purifying Selection at Synonymous Sites in D. melanogaster
Lawrie, David S.; Messer, Philipp W.; Hershberg, Ruth; Petrov, Dmitri A.
2013-01-01
Synonymous sites are generally assumed to be subject to weak selective constraint. For this reason, they are often neglected as a possible source of important functional variation. We use site frequency spectra from deep population sequencing data to show that, contrary to this expectation, 22% of four-fold synonymous (4D) sites in Drosophila melanogaster evolve under very strong selective constraint while few, if any, appear to be under weak constraint. Linking polymorphism with divergence data, we further find that the fraction of synonymous sites exposed to strong purifying selection is higher for those positions that show slower evolution on the Drosophila phylogeny. The function underlying the inferred strong constraint appears to be separate from splicing enhancers, nucleosome positioning, and the translational optimization generating canonical codon bias. The fraction of synonymous sites under strong constraint within a gene correlates well with gene expression, particularly in the mid-late embryo, pupae, and adult developmental stages. Genes enriched in strongly constrained synonymous sites tend to be particularly functionally important and are often involved in key developmental pathways. Given that the observed widespread constraint acting on synonymous sites is likely not limited to Drosophila, the role of synonymous sites in genetic disease and adaptation should be reevaluated. PMID:23737754
The genomics of oral cancer and wound healing.
Aswini, Y B
2009-01-01
Oral cancer is the most common malignancy in India, where it is epidemiologically linked to the chewing of betel quid and other carcinogens. But various point mutations were detectable in the p53 and p15 genes. Hence, this review was conducted with the aim to find out genetic risks as well as markers for oral cancers and wound healing. Tobacco-related cancers are associated with polymorphisms of the CYP1A1 and GSTM1 genes in terms of genotype frequencies and cigarette smoking dose. Expression of E6/E7 were also found in tumors, most of which were derived from the oropharynx. Presence of homozygous arginine at codon 72 renders p53 about seven times more susceptible to E6-mediated proteolytic degradation. Erythropoietin, vascular permeability factor (VPF, also known as vascular endothelial growth factor or VEGF), and PDGF has been implicated as one of the principal mitogens involved in cutaneous wound healing. Activation of NF-kB is associated with enhanced cell survival. Human papilloma virus status is a significantly favorable prognostic factor in tonsilar cancer and may be used as a marker in order to optimize the treatment of patients with this type of cancer.
Rujito, Lantip; Basalamah, Muhammad; Mulatsih, Sri; Sofro, Abdul Salam M
2015-08-03
Thalassemia is the most prevalent genetic blood disorder worldwide, and particularly prevalent in Indonesia. The purpose of this study was to determine the spectrum of β-thalassemia (β-thal) mutations found in the southern region of Central Java, Indonesia. The subjects of the study included 209 β-thal Javanese patients from Banyumas Residency, a southwest region of Central Java Province. DNA analysis was performed using polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP), amplification refractory mutation system (ARMS), and the direct sequencing method. The results showed that 14 alleles were found in the following order: IVS-I-5 (G > C) (HBB: c.92 + 5G > C) 43.5%, codon 26 (Hb E; HBB: c.79G > A) 28.2%, IVS-I-1 (G > A) (HBB: c.92 + 1G > A) 5.0%, codon 15 (TGG > TAG) (HBB: c.47G > A) 3.8%, IVS-I-1 (G > T) (HBB: c.92 + 1G > T) 3.1%, codon 35 (-C) (HBB: c.110delC) 2.4%. The rest, including codons 41/42 (-TTCT) (HBB: c.126_129delCTTT), codons 8/9 (+G) (HBB: c.27_28insG), codon 19 (AAC > AGC) (HBB: c.59A > G), codon 17 (AAG > TAG) (HBB: c.52A > T), IVS-I-2 (T > C) (HBB: c.92 + 2T > C), codons 123/124/125 (-ACCCCACC) (HBB: c.370_378delACCCCACCA), codon 40 (-G) (HBB: c.123delG) and Cap +1 (A > C) (HBB: c.-50A > C), accounted for up to 1.0% each. The most prevalent alleles would be recommended to be used as part of β-thal screening for the Javanese, one of the major ethnic groups in the country.
Nougairede, Antoine; De Fabritus, Lauriane; Aubry, Fabien; Gould, Ernest A; Holmes, Edward C; de Lamballerie, Xavier
2013-02-01
Large-scale codon re-encoding represents a powerful method of attenuating viruses to generate safe and cost-effective vaccines. In contrast to specific approaches of codon re-encoding which modify genome-scale properties, we evaluated the effects of random codon re-encoding on the re-emerging human pathogen Chikungunya virus (CHIKV), and assessed the stability of the resultant viruses during serial in cellulo passage. Using different combinations of three 1.4 kb randomly re-encoded regions located throughout the CHIKV genome six codon re-encoded viruses were obtained. Introducing a large number of slightly deleterious synonymous mutations reduced the replicative fitness of CHIKV in both primate and arthropod cells, demonstrating the impact of synonymous mutations on fitness. Decrease of replicative fitness correlated with the extent of re-encoding, an observation that may assist in the modulation of viral attenuation. The wild-type and two re-encoded viruses were passaged 50 times either in primate or insect cells, or in each cell line alternately. These viruses were analyzed using detailed fitness assays, complete genome sequences and the analysis of intra-population genetic diversity. The response to codon re-encoding and adaptation to culture conditions occurred simultaneously, resulting in significant replicative fitness increases for both re-encoded and wild type viruses. Importantly, however, the most re-encoded virus failed to recover its replicative fitness. Evolution of these viruses in response to codon re-encoding was largely characterized by the emergence of both synonymous and non-synonymous mutations, sometimes located in genomic regions other than those involving re-encoding, and multiple convergent and compensatory mutations. However, there was a striking absence of codon reversion (<0.4%). Finally, multiple mutations were rapidly fixed in primate cells, whereas mosquito cells acted as a brake on evolution. In conclusion, random codon re-encoding provides important information on the evolution and genetic stability of CHIKV viruses and could be exploited to develop a safe, live attenuated CHIKV vaccine.
Rujito, Lantip; Basalamah, Muhammad; Mulatsih, Sri; Sofro, Abdul Salam M
2015-01-01
Thalassemia is the most prevalent genetic blood disorder worldwide, and particularly prevalent in Indonesia. The purpose of this study was to determine the spectrum of β-thalassemia (β-thal) mutations found in the southern region of Central Java, Indonesia. The subjects of the study included 209 β-thal Javanese patients from Banyumas Residency, a southwest region of Central Java Province. DNA analysis was performed using polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP), amplification refractory mutation system (ARMS), and the direct sequencing method. The results showed that 14 alleles were found in the following order: IVS-I-5 (G > C) (HBB: c.92 + 5G > C) 43.5%, codon 26 (Hb E; HBB: c.79G > A) 28.2%, IVS-I-1 (G > A) (HBB: c.92 + 1G > A) 5.0%, codon 15 (TGG > TAG) (HBB: c.47G > A) 3.8%, IVS-I-1 (G > T) (HBB: c.92 + 1G > T) 3.1%, codon 35 (-C) (HBB: c.110delC) 2.4%. The rest, including codons 41/42 (-TTCT) (HBB: c.126_129delCTTT), codons 8/9 (+G) (HBB: c.27_28insG), codon 19 (AAC > AGC) (HBB: c.59A > G), codon 17 (AAG > TAG) (HBB: c.52A > T), IVS-I-2 (T > C) (HBB: c.92 + 2T > C), codons 123/124/125 (-ACCCCACC) (HBB: c.370_378delACCCCACCA), codon 40 (-G) (HBB: c.123delG) and Cap +1 (A > C) (HBB: c.-50A > C), accounted for up to 1.0% each. The most prevalent alleles would be recommended to be used as part of β-thal screening for the Javanese, one of the major ethnic groups in the country.
Bharill, Shashank; Chen, Chunlai; Stevens, Benjamin; Kaur, Jaskiran; Smilansky, Zeev; Mandecki, Wlodek; Gryczynski, Ignacy; Gryczynski, Zygmunt; Cooperman, Barry S.; Goldman, Yale E.
2011-01-01
Metal enhanced fluorescence (MEF) increased total photon emission of Cy3- and Cy5-labeled ribosomal initiation complexes near 50 nm silver particles 4- and 5.5-fold respectively. Fluorescence intensity fluctuations above shot noise, at 0.1 – 5 Hz, were greater on silver particles. Overall signal to noise ratio was similar or slightly improved near the particles. Proximity to silver particles did not compromise ribosome function, as measured by codon-dependent binding of fluorescent tRNA, dynamics of fluorescence resonance energy transfer between adjacent tRNAs in the ribosome, and tRNA translocation induced by elongation factor G. PMID:21158483
Bharill, Shashank; Chen, Chunlai; Stevens, Benjamin; Kaur, Jaskiran; Smilansky, Zeev; Mandecki, Wlodek; Gryczynski, Ignacy; Gryczynski, Zygmunt; Cooperman, Barry S; Goldman, Yale E
2011-01-25
Metal-enhanced fluorescence (MEF) increased total photon emission of Cy3- and Cy5-labeled ribosomal initiation complexes near 50 nm silver particles 4- and 5.5-fold, respectively. Fluorescence intensity fluctuations above shot noise, at 0.1-5 Hz, were greater on silver particles. Overall signal-to-noise ratio was similar or slightly improved near the particles. Proximity to silver particles did not compromise ribosome function, as measured by codon-dependent binding of fluorescent tRNA, dynamics of fluorescence resonance energy transfer between adjacent tRNAs in the ribosome, and tRNA translocation induced by elongation factor G.
Aris-Brosou, Stéphane; Bielawski, Joseph P
2006-08-15
A popular approach to examine the roles of mutation and selection in the evolution of genomes has been to consider the relationship between codon bias and synonymous rates of molecular evolution. A significant relationship between these two quantities is taken to indicate the action of weak selection on substitutions among synonymous codons. The neutral theory predicts that the rate of evolution is inversely related to the level of functional constraint. Therefore, selection against the use of non-preferred codons among those coding for the same amino acid should result in lower rates of synonymous substitution as compared with sites not subject to such selection pressures. However, reliably measuring the extent of such a relationship is problematic, as estimates of synonymous rates are sensitive to our assumptions about the process of molecular evolution. Previous studies showed the importance of accounting for unequal codon frequencies, in particular when synonymous codon usage is highly biased. Yet, unequal codon frequencies can be modeled in different ways, making different assumptions about the mutation process. Here we conduct a simulation study to evaluate two different ways of modeling uneven codon frequencies and show that both model parameterizations can have a dramatic impact on rate estimates and affect biological conclusions about genome evolution. We reanalyze three large data sets to demonstrate the relevance of our results to empirical data analysis.
Franzo, Giovanni; Tucciarone, Claudia Maria; Cecchinato, Mattia; Drigo, Michele
2017-09-01
Based on virus dependence from host cell machinery, their codon usage is expected to show a strong relation with the host one. Even if this association has been stated, especially for bacteria viruses, the linkage is considered to be less consistent for more complex organisms and a codon bias adaptation after host jump has never been proven. Canine parvovirus type 2 (CPV-2) was selected as a model because it represents a well characterized case of host jump, originating from Feline panleukopenia virus (FPV). The current study demonstrates that the adaptation to specific tissue and host codon bias affected CPV-2 evolution. Remarkably, FPV and CPV-2 showed a higher closeness toward the codon bias of the tissues they display the higher tropism for. Moreover, after the host jump, a clear and significant trend was evidenced toward a reduction in the distance between CPV-2 and the dog codon bias over time. This evidence was not confirmed for FPV, suggesting that an equilibrium has been reached during the prolonged virus-host co-evolution. Additionally, the presence of an intermediate pattern displayed by some strains infecting wild species suggests that these could have facilitated the host switch also by acting on codon bias. Copyright © 2017 Elsevier Inc. All rights reserved.
Hara, A; Ueda, M; Misawa, S; Matsui, T; Furuhashi, K; Tanaka, A
2000-03-01
Development of a transformation system in the n-alkane-assimilating diploid yeast Candida tropicalis requires an antibiotic resistance gene in order to establish a selectable marker. The resistance gene for hygromycin B has often been used as a selectable marker in yeast transformation. However, C. tropicalis harboring the hygromycin resistance gene (HYG) was as sensitive to hygromycin B as the wild-type strain. Nine CTG codons were found in the ORF of the HYG gene. This codon has been reported to be translated as serine rather than leucine in Candida species. Analysis of the tRNA gene in C. tropicalis with the anticodon CAG [tRNA(CAG) gene], which is complementary to the codon CTG, showed that the sequence was highly similar to that of the C. maltosa tRNA(CAG) gene. In C. maltosa, the codon CTG is read as serine and not leucine. These results suggested that the HYG gene was not functional due to the nonuniversal usage of the CTG codon. Each of the nine CTG codons in the ORF of the HYG gene was changed to a CTC codon, which is read as leucine, by site-directed mutagenesis. When a plasmid containing the mutated HYG gene (HYG#) was constructed and introduced into C. tropicalis, hygromycin-resistant transformants were successfully obtained. This mutated hygromycin resistance gene may be useful for direct selection of C. tropicalis transformants.
Properties and determinants of codon decoding time distributions
2014-01-01
Background Codon decoding time is a fundamental property of mRNA translation believed to affect the abundance, function, and properties of proteins. Recently, a novel experimental technology--ribosome profiling--was developed to measure the density, and thus the speed, of ribosomes at codon resolution. Specifically, this method is based on next-generation sequencing, which theoretically can provide footprint counts that correspond to the probability of observing a ribosome in this position for each nucleotide in each transcript. Results In this study, we report for the first time various novel properties of the distribution of codon footprint counts in five organisms, based on large-scale analysis of ribosomal profiling data. We show that codons have distinctive footprint count distributions. These tend to be preserved along the inner part of the ORF, but differ at the 5' and 3' ends of the ORF, suggesting that the translation-elongation stage actually includes three biophysical sub-steps. In addition, we study various basic properties of the codon footprint count distributions and show that some of them correlate with the abundance of the tRNA molecule types recognizing them. Conclusions Our approach emphasizes the advantages of analyzing ribosome profiling and similar types of data via a comparative genomic codon-distribution-centric view. Thus, our methods can be used in future studies related to translation and even transcription elongation. PMID:25572668
Xu, Yanbing; Zheng, Zhaojuan; Xu, Qianqian; Yong, Qiang; Ouyang, Jia
2016-03-30
Inulooligosaccharides (IOS) represent an important class of oligosaccharides at industrial scale. An efficient conversion of inulin to IOS through endoinulinase from Aspergillus niger is presented. A 1482 bp codon optimized gene fragment encoding endoinulinase from A. niger DSM 2466 was cloned into pPIC9K vector and was transformed into Pichia pastoris KM71. Maximum activity of the recombinant endoinulinase, 858 U/mL, was obtained at 120 h of the high cell density fermentation process. The optimal conditions for inulin hydrolysis using the recombinant endoinulinase were investigated. IOS were harvested with a high concentration of 365.1 g/L and high yield up to 91.3%. IOS with different degrees of polymerization (DP, mainly DP 3-6) were distributed in the final reaction products.
Somatic mutations in cancer: Stochastic versus predictable.
Gold, Barry
2017-02-01
The origins of human cancers remain unclear except for a limited number of potent environmental mutagens, such as tobacco and UV light, and in rare cases, familial germ line mutations that affect tumor suppressor genes or oncogenes. A significant component of cancer etiology has been deemed stochastic and correlated with the number of stem cells in a tissue, the number of times the stem cells divide and a low incidence of random DNA polymerase errors that occur during each cell division. While somatic mutations occur during each round of DNA replication, mutations in cancer driver genes are not stochastic. Out of a total of 2843 codons, 1031 can be changed to stop codons by a single base substitution in the tumor suppressor APC gene, which is mutated in 76% of colorectal cancers (CRC). However, the nonsense mutations, which comprise 65% of all the APC driver mutations in CRC, are not random: 43% occur at Arg CGA codons, although they represent <3% of the codons. In TP53, CGA codons comprise <3% of the total 393 codons but they account for 72% and 39% of the mutations in CRC and ovarian cancer OVC, respectively. This mutation pattern is consistent with the kinetically slow, but not stochastic, hydrolytic deamination of 5-methylcytosine residues at specific methylated CpG sites to afford T·G mismatches that lead to C→T transitions and stop codons at CGA. Analysis of nonsense mutations in CRC, OVC and a number of other cancers indicates the need to expand the predictable risk factors for cancer to include, in addition to random polymerase errors, the methylation status of gene body CGA codons in tumor suppressor genes. Copyright © 2017. Published by Elsevier B.V.
Liu, Kaiyu; Li, Yi; Jousset, Françoise-Xavière; Zadori, Zoltan; Szelei, Jozsef; Yu, Qian; Pham, Hanh Thi; Lépine, François; Bergoin, Max; Tijssen, Peter
2011-01-01
The Acheta domesticus densovirus (AdDNV), isolated from crickets, has been endemic in Europe for at least 35 years. Severe epizootics have also been observed in American commercial rearings since 2009 and 2010. The AdDNV genome was cloned and sequenced for this study. The transcription map showed that splicing occurred in both the nonstructural (NS) and capsid protein (VP) multicistronic RNAs. The splicing pattern of NS mRNA predicted 3 nonstructural proteins (NS1 [576 codons], NS2 [286 codons], and NS3 [213 codons]). The VP gene cassette contained two VP open reading frames (ORFs), of 597 (ORF-A) and 268 (ORF-B) codons. The VP2 sequence was shown by N-terminal Edman degradation and mass spectrometry to correspond with ORF-A. Mass spectrometry, sequencing, and Western blotting of baculovirus-expressed VPs versus native structural proteins demonstrated that the VP1 structural protein was generated by joining ORF-A and -B via splicing (splice II), eliminating the N terminus of VP2. This splice resulted in a nested set of VP1 (816 codons), VP3 (467 codons), and VP4 (429 codons) structural proteins. In contrast, the two splices within ORF-B (Ia and Ib) removed the donor site of intron II and resulted in VP2, VP3, and VP4 expression. ORF-B may also code for several nonstructural proteins, of 268, 233, and 158 codons. The small ORF-B contains the coding sequence for a phospholipase A2 motif found in VP1, which was shown previously to be critical for cellular uptake of the virus. These splicing features are unique among parvoviruses and define a new genus of ambisense densoviruses. PMID:21775445
Overcoming codon-usage bias in heterologous protein expression in Streptococcus gordonii.
Lee, Song F; Li, Yi-Jing; Halperin, Scott A
2009-11-01
One of the limitations facing the development of Streptococcus gordonii into a successful vaccine vector is the inability of this bacterium to express high levels of heterologous proteins. In the present study, we have identified 12 codons deemed as rare codons in S. gordonii and seven other streptococcal species. tRNA genes encoding 10 of the 12 rare codons were cloned into a plasmid. The plasmid was transformed into strains of S. gordonii expressing the fusion protein SpaP/S1, the anti-complement receptor 1 (CR1) single-chain variable fragment (scFv) antibody, or the Toxoplasma gondii cyclophilin C18 protein. These three heterologous proteins contained high percentages of amino acids encoded by rare codons. The results showed that the production of SpaP/S1, anti-CR1 scFv and C18 increased by 2.7-, 120- and 10-fold, respectively, over the control strains. In contrast, the production of the streptococcal SpaP protein without the pertussis toxin S1 fragment was not affected by tRNA gene supplementation, indicating that the increased production of SpaP/S1 protein was due to the ability to overcome the limitation caused by rare codons required for the S1 fragment. The increase in anti-CR1 scFv production was also observed in Streptococcus mutans following tRNA gene supplementation. Collectively, the findings in the present study demonstrate for the first time, to the best of our knowledge, that codon-usage bias exists in Streptococcus spp. and the limitation of heterologous protein expression caused by codon-usage bias can be overcome by tRNA supplementation.
Iben, James R.; Maraia, Richard J.
2012-01-01
tRNA genes are interspersed throughout eukaryotic DNA, contributing to genome architecture and evolution in addition to translation of the transcriptome. Codon use correlates with tRNA gene copy number in noncomplex organisms including yeasts. Synonymous codons impact translation with various outcomes, dependent on relative tRNA abundances. Availability of whole-genome sequences allowed us to examine tRNA gene copy number variation (tgCNV) and codon use in four Schizosaccharomyces species and Saccharomyces cerevisiae. tRNA gene numbers vary from 171 to 322 in the four Schizosaccharomyces despite very high similarity in other features of their genomes. In addition, we performed whole-genome sequencing of several related laboratory strains of Schizosaccharomyces pombe and found tgCNV at a cluster of tRNA genes. We examined for the first time effects of wobble rules on correlation of tRNA gene number and codon use and showed improvement for S. cerevisiae and three of the Schizosaccharomyces species. In contrast, correlation in Schizosaccharomyces japonicus is poor due to markedly divergent tRNA gene content, and much worsened by the wobble rules. In japonicus, some tRNA iso-acceptor genes are absent and others are greatly reduced relative to the other yeasts, while genes for synonymous wobble iso-acceptors are amplified, indicating wobble use not apparent in any other eukaryote. We identified a subset of japonicus-specific wobbles that improves correlation of codon use and tRNA gene content in japonicus. We conclude that tgCNV is high among Schizo species and occurs in related laboratory strains of S. pombe (and expectedly other species), and tRNAome-codon analyses can provide insight into species-specific wobble decoding. PMID:22586155
Physical Model for the Evolution of the Genetic Code
NASA Astrophysics Data System (ADS)
Yamashita, Tatsuro; Narikiyo, Osamu
2011-12-01
Using the shape space of codons and tRNAs we give a physical description of the genetic code evolution on the basis of the codon capture and ambiguous intermediate scenarios in a consistent manner. In the lowest dimensional version of our description, a physical quantity, codon level is introduced. In terms of the codon levels two scenarios are typically classified into two different routes of the evolutional process. In the case of the ambiguous intermediate scenario we perform an evolutional simulation implemented cost selection of amino acids and confirm a rapid transition of the code change. Such rapidness reduces uncomfortableness of the non-unique translation of the code at intermediate state that is the weakness of the scenario. In the case of the codon capture scenario the survival against mutations under the mutational pressure minimizing GC content in genomes is simulated and it is demonstrated that cells which experience only neutral mutations survive.
Reassigning stop codons via translation termination: How a few eukaryotes broke the dogma.
Alkalaeva, Elena; Mikhailova, Tatiana
2017-03-01
The genetic code determines how amino acids are encoded within mRNA. It is universal among the vast majority of organisms, although several exceptions are known. Variant genetic codes are found in ciliates, mitochondria, and numerous other organisms. All revealed genetic codes (standard and variant) have at least one codon encoding a translation stop signal. However, recently two new genetic codes with a reassignment of all three stop codons were revealed in studies examining the protozoa transcriptomes. Here, we discuss this finding and the recent studies of variant genetic codes in eukaryotes. We consider the possible molecular mechanisms allowing the use of certain codons as sense and stop signals simultaneously. The results obtained by studying these amazing organisms represent a new and exciting insight into the mechanism of stop codon decoding in eukaryotes. Also see the video abstract here. © 2017 WILEY Periodicals, Inc.
Manuel, Edwin R.; Blache, Céline A.; Paquette, Rebecca; Kaltcheva, Teodora I.; Ishizaki, Hidenobu; Ellenhorn, Joshua D.I.; Hensel, Michael; Metelitsa, Leonid; Diamond, Don J.
2011-01-01
Cancer vaccine therapies have only achieved limited success when focusing on effector immunity with the goal of eliciting robust tumor-specific T cell responses. More recently, there is an emerging understanding that effective immunity can only be achieved by coordinate disruption of tumor-derived immune suppression. Towards that goal, we have developed a potent Salmonella-based vaccine expressing codon-optimized survivin (CO-SVN) referred to as 3342Max. When used alone as a therapeutic vaccine, 3342Max can attenuate growth of aggressive murine melanomas overexpressing SVN. However, under more immunosuppressive conditions, such as those associated with larger tumor volumes, we found that the vaccine was ineffective. Vaccine efficacy could be rescued if tumor-bearing mice were treated initially with Salmonella encoding a shRNA targeting the tolerogenic molecule STAT3 (YS1646-shSTAT3). In vaccinated mice, silencing STAT3 increased the proliferation and granzyme B levels of intratumoral CD4+ and CD8+ T cells. The combined strategy also increased apoptosis in tumors of treated mice, enhancing tumor-specific killing of tumor targets. Interestingly, mice treated with YS1646-shSTAT3 or 3342Max alone were similarly unsuccessful in rejecting established tumors, while the combined regimen was highly potent. Our findings establish that a combined strategy of silencing immunosuppressive molecules followed by vaccination can act synergistically to attenuate tumor growth, and they offer a novel translational direction to improve tumor immunotherapy. PMID:21527558
PCR-RFLP to Detect Codon 248 Mutation in Exon 7 of "p53" Tumor Suppressor Gene
ERIC Educational Resources Information Center
Ouyang, Liming; Ge, Chongtao; Wu, Haizhen; Li, Suxia; Zhang, Huizhan
2009-01-01
Individual genome DNA was extracted fast from oral swab and followed up with PCR specific for codon 248 of "p53" tumor suppressor gene. "Msp"I restriction mapping showed the G-C mutation in codon 248, which closely relates to cancer susceptibility. Students learn the concepts, detection techniques, and research significance of point mutations or…
Codon influence on protein expression in E. coli correlates with mRNA levels
Boël, Grégory; Wong, Kam-Ho; Su, Min; Luff, Jon; Valecha, Mayank; Everett, John K.; Acton, Thomas B.; Xiao, Rong; Montelione, Gaetano T.; Aalberts, Daniel P.; Hunt, John F.
2016-01-01
Degeneracy in the genetic code, which enables a single protein to be encoded by a multitude of synonymous gene sequences, has an important role in regulating protein expression, but substantial uncertainty exists concerning the details of this phenomenon. Here we analyze the sequence features influencing protein expression levels in 6,348 experiments using bacteriophage T7 polymerase to synthesize messenger RNA in Escherichia coli. Logistic regression yields a new codon-influence metric that correlates only weakly with genomic codon-usage frequency, but strongly with global physiological protein concentrations and also mRNA concentrations and lifetimes in vivo. Overall, the codon content influences protein expression more strongly than mRNA-folding parameters, although the latter dominate in the initial ~16 codons. Genes redesigned based on our analyses are transcribed with unaltered efficiency but translated with higher efficiency in vitro. The less efficiently translated native sequences show greatly reduced mRNA levels in vivo. Our results suggest that codon content modulates a kinetic competition between protein elongation and mRNA degradation that is a central feature of the physiology and also possibly the regulation of translation in E. coli. PMID:26760206
On the possible origin and evolution of the genetic code
NASA Technical Reports Server (NTRS)
Jukes, T. H.
1974-01-01
The genetic code is examined for indications of possible preceding codes that existed during early evolution. Eight of the 20 amino acids are coded by 'quartets' of codons with fourfold degeneracy, and 16 such quartets can exist, so that an earlier code could have provided for 15 or 16 amino acids, rather than 20. If twofold degeneracy is postulated for the first position of the codon, there could have been ten amino acids in the code. It is speculated that these may have been phenylalanine, valine, proline, alanine, histidine, glutamine, glutanic acid, aspartic acid, cysteine and glycine. There is a notable deficiency of arginine in proteins, despite the fact that it has six codons. Simultaneously, there is more lysine in proteins than would be expected from its two codons, if the four bases in mRNA are equiprobable and are arranged randomly. It is speculated that arginine is an 'intruder' into the genetic code, and that it may have displayed another amino acid such as ornithine, or may even have displayed lysine from some of its previous codon assignments. As a result, natural selection has favored lysine against the fact that it has only two codons.
Demonstration of GTG as an alternative initiation codon for the serpin endopin 2B-2.
Hwang, Shin-Rong; Garza, Christina Z; Wegrzyn, Jill L; Hook, Vivian Y H
2005-02-18
This study demonstrates GTG as a novel, alternative initiation codon for translation of bovine endopin 2B-2, a serpin protease inhibitor. Molecular cDNA cloning revealed the endopin 2B-1 and endopin 2B-2 isoforms that are predicted to inhibit papain and elastase. Notably, GTG was demonstrated as the initiation codon for endopin 2B-2, whereas endopin 2B-1 possesses ATG as its initiation codon. GTG mediated in vitro translation of 46kDa endopin 2B-2. GTG also mediated translation of EGFP by in vitro translation and by expression in mammalian cells. Notably, mutagenesis of GTG to GTC resulted in the absence of EGFP expression in cells. GTG produced a lower level of protein expression compared to ATG. The use of GTG as an initiation codon to direct translation of endopin 2B, as well as the heterologous protein EGFP, demonstrates the role of GTG in the regulation of mRNA translation in mammalian cells. Significantly, further analyses of mammalian genomes based on GTG as an alternative initiation codon may predict new candidate gene products expressed by mammalian and human genomes.
Nonneutral GC3 and retroelement codon mimicry in Phytophthora.
Jiang, Rays H Y; Govers, Francine
2006-10-01
Phytophthora is a genus entirely comprised of destructive plant pathogens. It belongs to the Stramenopila, a unique branch of eukaryotes, phylogenetically distinct from plants, animals, or fungi. Phytophthora genes show a strong preference for usage of codons ending with G or C (high GC3). The presence of high GC3 in genes can be utilized to differentiate coding regions from noncoding regions in the genome. We found that both selective pressure and mutation bias drive codon bias in Phytophthora. Indicative for selection pressure is the higher GC3 value of highly expressed genes in different Phytophthora species. Lineage specific GC increase of noncoding regions is reminiscent of whole-genome mutation bias, whereas the elevated Phytophthora GC3 is primarily a result of translation efficiency-driven selection. Heterogeneous retrotransposons exist in Phytophthora genomes and many of them vary in their GC content. Interestingly, the most widespread groups of retroelements in Phytophthora show high GC3 and a codon bias that is similar to host genes. Apparently, selection pressure has been exerted on the retroelement's codon usage, and such mimicry of host codon bias might be beneficial for the propagation of retrotransposons.
[Identifying and sequence analysis of HLA-B*2736].
Li, Zhen; Zou, Hong-Yan; Shao, Chao-Peng; Tang, Si; Wang, Da-Ming; Cheng, Liang-Hong
2007-11-01
An unknown HLA-B allele which was similar to HLA-B*270401 was detected by FLOW-SSOPCR-SSP and heterozygous sequence-based typing (SBT) in Chinese Han individual. Its anomalous patterns suggested the possible presence of new allele. Amplifying exon 2-5(include intron 2-4) of the HLA-B*27 allele separately by using allele-specific primers and sequencing in both directions. Identifying the difference between the novel B*27 allele and B*270401. The sequence of novel B*27 from exon 2 to partial exon 5 is 1 815 bp. There are 10 nt changes from B*270401 in exon 3-4, at nt634where A-->C(codon130 AGC-->CGC, 130 S-->R); nt670 where A-->T (codon142 ACC-->TCC, 142 T-->S); nt683 where G-->T (codon146 TGG-->TTG, 146 W-->L); nt698 where A-->T (codon151 GAG-->GTG, 151 E-->V); nt774 where G-->C (codon176 GAG-->GAC, 176 E-->D); nt776 where C-->A (codon177 ACG-->AAG, 177 T-->K); nt781 where C-->G (codon179 CAG-->GAG, 179Q-->E); nt789 where G-->T (codon181 GCG-->GCT) resulting no coding change; nt1438 where C-->T (codon206 GGC-->GGT) resulting no coding change; nt1449 where G-->C (codon210 GGG-->GCG, 210G-->A). In IMGT/HLA database, only three alleles (B*270502/2706/2732) have sequences of introns. The same sequence in intron 2 showed homology between the novel HLA-B*27 allele and B*2706, but their homology could not be supported in intron 3-4. Comparing the sequence of the novel B*27 allele in intron 3 and 4 with B*27 group, it showed there are three mutations at nt106 C-->G, nt179 G-->A, nt536 G-->A and one deletion at nt168 in intron 3 and one mutations at nt82 T-->C in intron 4, but the sequence of the novel B*27 allele in intron 3 and 4 was all the same to B*070201. The sequence was submitted to Gen-Bank and the accession number was DQ915176. The allele has been confirmed as an extension of B*2736 by the WHO Nomenclature committee in November 2006.
Zika Virus Attenuation by Codon Pair Deoptimization Induces Sterilizing Immunity in Mouse Models.
Li, Penghui; Ke, Xianliang; Wang, Ting; Tan, Zhongyuan; Luo, Dan; Miao, Yuanjiu; Sun, Jianhong; Zhang, Yuan; Liu, Yan; Hu, Qinxue; Xu, Fuqiang; Wang, Hanzhong; Zheng, Zhenhua
2018-06-20
Zika virus (ZIKV) infection during the large epidemics in the Americas is related to congenital abnormities or fetal demise. To date, there is no vaccine, antiviral drug, or other modality available to prevent or treat Zika virus infection. Here we designed novel live attenuated ZIKV vaccine candidates using a codon pair deoptimization strategy. Three codon pair-deoptimized ZIKVs (Min E, Min NS1, and Min E+NS1) were de novo synthesized, and recovered by reverse genetics, containing large amounts of underrepresented codon pairs in E gene and/or NS1 gene. Amino acid sequence was 100% unchanged. The codon pair-deoptimized variants had decreased replication fitness in Vero cells (Min NS1 ≫ Min E > Min E+NS1), replicated more efficiently in insect cells than in mammalian cells, and demonstrated diminished virulence in a mouse model. In particular, Min E+NS1, the most restrictive variant, induced sterilizing immunity with a robust neutralizing antibody titer, and a single immunization achieved complete protection against lethal challenge and vertical ZIKV transmission during pregnancy. More importantly, due to the numerous synonymous substitutions in the codon pair-deoptimized strains, reversion to wild-type virulence through gradual nucleotide sequence mutations is unlikely. Our results collectively demonstrate that ZIKV can be effectively attenuated by codon pair deoptimization, highlighting the potential of Min E+NS1 as a safe vaccine candidate to prevent ZIKV infections. IMPORTANCE Due to unprecedented epidemics of Zika virus (ZIKV) across the Americas and the unexpected clinical symptoms including Guillain-Barré syndrome, microcephaly and other birth defects in human, there is an urgent need for ZIKV vaccine development. Here, we provided the first attenuated versions of ZIKV with two important genes (E and/or NS1) that were subjected to codon pair deoptimization. Compared to parental ZIKV, the codon pair-deoptimized ZIKVs were mammalian-attenuated, and preferred insect to mammalian Cells. Min E+NS1, the most restrictive variant, induced sterilizing immunity with a robust neutralizing antibody titer, and achieved complete protection against lethal challenge and vertical virus transmission during pregnancy. More importantly, the massive synonymous mutational approach made it impossible to revert to wild-type virulence. Our results have proven the feasibility of codon pair deoptimization as a strategy to develop live-attenuated vaccine candidates against flavivirues like ZIKV, Japanese encephalitis virus and West Nile virus. Copyright © 2018 American Society for Microbiology.
Dimitrieva, Slavica; Anisimova, Maria
2014-01-01
In protein-coding genes, synonymous mutations are often thought not to affect fitness and therefore are not subject to natural selection. Yet increasingly, cases of non-neutral evolution at certain synonymous sites were reported over the last decade. To evaluate the extent and the nature of site-specific selection on synonymous codons, we computed the site-to-site synonymous rate variation (SRV) and identified gene properties that make SRV more likely in a large database of protein-coding gene families and protein domains. To our knowledge, this is the first study that explores the determinants and patterns of the SRV in real data. We show that the SRV is widespread in the evolution of protein-coding sequences, putting in doubt the validity of the synonymous rate as a standard neutral proxy. While protein domains rarely undergo adaptive evolution, the SRV appears to play important role in optimizing the domain function at the level of DNA. In contrast, protein families are more likely to evolve by positive selection, but are less likely to exhibit SRV. Stronger SRV was detected in genes with stronger codon bias and tRNA reusage, those coding for proteins with larger number of interactions or forming larger number of structures, located in intracellular components and those involved in typically conserved complex processes and functions. Genes with extreme SRV show higher expression levels in nearly all tissues. This indicates that codon bias in a gene, which often correlates with gene expression, may often be a site-specific phenomenon regulating the speed of translation along the sequence, consistent with the co-translational folding hypothesis. Strikingly, genes with SRV were strongly overrepresented for metabolic pathways and those associated with several genetic diseases, particularly cancers and diabetes.
Unbiased Quantitative Models of Protein Translation Derived from Ribosome Profiling Data
Gritsenko, Alexey A.; Hulsman, Marc; Reinders, Marcel J. T.; de Ridder, Dick
2015-01-01
Translation of RNA to protein is a core process for any living organism. While for some steps of this process the effect on protein production is understood, a holistic understanding of translation still remains elusive. In silico modelling is a promising approach for elucidating the process of protein synthesis. Although a number of computational models of the process have been proposed, their application is limited by the assumptions they make. Ribosome profiling (RP), a relatively new sequencing-based technique capable of recording snapshots of the locations of actively translating ribosomes, is a promising source of information for deriving unbiased data-driven translation models. However, quantitative analysis of RP data is challenging due to high measurement variance and the inability to discriminate between the number of ribosomes measured on a gene and their speed of translation. We propose a solution in the form of a novel multi-scale interpretation of RP data that allows for deriving models with translation dynamics extracted from the snapshots. We demonstrate the usefulness of this approach by simultaneously determining for the first time per-codon translation elongation and per-gene translation initiation rates of Saccharomyces cerevisiae from RP data for two versions of the Totally Asymmetric Exclusion Process (TASEP) model of translation. We do this in an unbiased fashion, by fitting the models using only RP data with a novel optimization scheme based on Monte Carlo simulation to keep the problem tractable. The fitted models match the data significantly better than existing models and their predictions show better agreement with several independent protein abundance datasets than existing models. Results additionally indicate that the tRNA pool adaptation hypothesis is incomplete, with evidence suggesting that tRNA post-transcriptional modifications and codon context may play a role in determining codon elongation rates. PMID:26275099
Unbiased Quantitative Models of Protein Translation Derived from Ribosome Profiling Data.
Gritsenko, Alexey A; Hulsman, Marc; Reinders, Marcel J T; de Ridder, Dick
2015-08-01
Translation of RNA to protein is a core process for any living organism. While for some steps of this process the effect on protein production is understood, a holistic understanding of translation still remains elusive. In silico modelling is a promising approach for elucidating the process of protein synthesis. Although a number of computational models of the process have been proposed, their application is limited by the assumptions they make. Ribosome profiling (RP), a relatively new sequencing-based technique capable of recording snapshots of the locations of actively translating ribosomes, is a promising source of information for deriving unbiased data-driven translation models. However, quantitative analysis of RP data is challenging due to high measurement variance and the inability to discriminate between the number of ribosomes measured on a gene and their speed of translation. We propose a solution in the form of a novel multi-scale interpretation of RP data that allows for deriving models with translation dynamics extracted from the snapshots. We demonstrate the usefulness of this approach by simultaneously determining for the first time per-codon translation elongation and per-gene translation initiation rates of Saccharomyces cerevisiae from RP data for two versions of the Totally Asymmetric Exclusion Process (TASEP) model of translation. We do this in an unbiased fashion, by fitting the models using only RP data with a novel optimization scheme based on Monte Carlo simulation to keep the problem tractable. The fitted models match the data significantly better than existing models and their predictions show better agreement with several independent protein abundance datasets than existing models. Results additionally indicate that the tRNA pool adaptation hypothesis is incomplete, with evidence suggesting that tRNA post-transcriptional modifications and codon context may play a role in determining codon elongation rates.
Mora, Liliana; Heurgué-Hamard, Valérie; de Zamaroczy, Miklos; Kervestin, Stephanie; Buckingham, Richard H
2007-12-07
Bacterial release factors RF1 and RF2 are methylated on the Gln residue of a universally conserved tripeptide motif GGQ, which interacts with the peptidyl transferase center of the large ribosomal subunit, triggering hydrolysis of the ester bond in peptidyl-tRNA and releasing the newly synthesized polypeptide from the ribosome. In vitro experiments have shown that the activity of RF2 is stimulated by Gln methylation. The viability of Escherichia coli K12 strains depends on the integrity of the release factor methyltransferase PrmC, because K12 strains are partially deficient in RF2 activity due to the presence of a Thr residue at position 246 instead of Ala. Here, we study in vivo RF1 and RF2 activity at termination codons in competition with programmed frameshifting and the effect of the Ala-246 --> Thr mutation. PrmC inactivation reduces the specific termination activity of RF1 and RF2(Ala-246) by approximately 3- to 4-fold. The mutation Ala-246 --> Thr in RF2 reduces the termination activity in cells approximately 5-fold. After correction for the decrease in level of RF2 due to the autocontrol of RF2 synthesis, the mutation Ala-246 --> Thr reduced RF2 termination activity by approximately 10-fold at UGA codons and UAA codons. PrmC inactivation had no effect on cell growth in rich media but reduced growth considerably on poor carbon sources. This suggests that the expression of some genes needed for optimal growth under such conditions can become growth limiting as a result of inefficient translation termination.
Adams, Philip P.; Flores Avile, Carlos; Jewett, Mollie W.
2017-01-01
Knowledge of the transcriptional responses of vector-borne pathogens at the vector-pathogen interface is critical for understanding disease transmission. Borrelia (Borreliella) burgdorferi, the causative agent of Lyme disease in the United States, is transmitted by the bite of infected Ixodes sp. ticks. It is known that B. burgdorferi has altered patterns of gene expression during tick acquisition, persistence and transmission. Recently, we and others have discovered in vitro expression of RNAs found internal, overlapping, and antisense to annotated open reading frames in the B. burgdorferi genome. However, there is a lack of molecular genetic tools for B. burgdorferi for quantitative, strand-specific, comparative analysis of these transcripts in distinct environments such as the arthropod vector. To address this need, we have developed a dual luciferase reporter system to quantify B. burgdorferi promoter activities in a strand-specific manner. We demonstrate that constitutive expression of a B. burgdorferi codon-optimized Renilla reniformis luciferase gene (rlucBb) allows normalization of the activity of a promoter of interest when fused to the B. burgdorferi codon-optimized Photinus pyralis luciferase gene (flucBb) on the same plasmid. Using the well characterized, differentially regulated, promoters for flagellin (flaBp), outer surface protein A (ospAp) and outer surface protein C (ospCp), we document the efficacy of the dual luciferase system for quantitation of promoter activities during in vitro growth and in infected ticks. Cumulatively, the dual luciferase method outlined herein is the first dual reporter system for B. burgdorferi, providing a novel and highly versatile approach for strand-specific molecular genetic analyses. PMID:28620587
Martínez, Osmarie; Bravo Cruz, Ariana; Santos, Saritza; Ramírez, Maite; Miranda, Eric; Shisler, Joanna; Otero, Miguel
2017-10-20
Smallpox is a disease caused by Variola virus (VARV). Although eradicated by WHO in 1980, the threat of using VARV on a bioterror attack has increased. The current smallpox vaccine ACAM2000, which consists of live vaccinia virus (VACV), causes complications in individuals with a compromised immune system or with previously reported skin diseases. Thus, a safer and efficacious vaccine needs to be developed. Previously, we reported that our virus-free DNA vaccine formulation, a pVAX1 plasmid encoding codon-optimized VACV A27L gene (pA27LOPT) with and without Imiquimod adjuvant, stimulates A27L-specific production of IFN-γ and increases humoral immunity 7days post-vaccination. Here, we investigated the immune response of our novel vaccine by measuring the frequency of splenocytes producing IFN-γ by ELISPOT, the TH1 and TH2 cytokine profiles, and humoral immune responses two weeks post-vaccination, when animals were challenged with VACV. In all assays, the A27-based DNA vaccine conferred protective immune responses. Specifically, two weeks after vaccination, mice were challenged intranasally with vaccinia virus, and viral titers in mouse lungs and ovaries were significantly lower in groups immunized with pA27LOPT and pA27LOPT+Imiquimod. These results demonstrate that our vaccine formulation decreases viral replication and dissemination in a virus-free DNA vaccine platform, and provides an alternative towards a safer an efficacious vaccine. Copyright © 2017 The Authors. Published by Elsevier Ltd.. All rights reserved.
Adams, Philip P; Flores Avile, Carlos; Jewett, Mollie W
2017-01-01
Knowledge of the transcriptional responses of vector-borne pathogens at the vector-pathogen interface is critical for understanding disease transmission. Borrelia ( Borreliella ) burgdorferi , the causative agent of Lyme disease in the United States, is transmitted by the bite of infected Ixodes sp . ticks. It is known that B. burgdorferi has altered patterns of gene expression during tick acquisition, persistence and transmission. Recently, we and others have discovered in vitro expression of RNAs found internal, overlapping, and antisense to annotated open reading frames in the B. burgdorferi genome. However, there is a lack of molecular genetic tools for B. burgdorferi for quantitative, strand-specific, comparative analysis of these transcripts in distinct environments such as the arthropod vector. To address this need, we have developed a dual luciferase reporter system to quantify B. burgdorferi promoter activities in a strand-specific manner. We demonstrate that constitutive expression of a B. burgdorferi codon-optimized Renilla reniformis luciferase gene ( rluc Bb ) allows normalization of the activity of a promoter of interest when fused to the B. burgdorferi codon-optimized Photinus pyralis luciferase gene ( fluc Bb ) on the same plasmid. Using the well characterized, differentially regulated, promoters for flagellin ( flaBp ), outer surface protein A ( ospAp ) and outer surface protein C ( ospCp ), we document the efficacy of the dual luciferase system for quantitation of promoter activities during in vitro growth and in infected ticks. Cumulatively, the dual luciferase method outlined herein is the first dual reporter system for B. burgdorferi , providing a novel and highly versatile approach for strand-specific molecular genetic analyses.
Codon Usage Patterns of Tyrosinase Genes in Clonorchis sinensis.
Bae, Young-An
2017-04-01
Codon usage bias (CUB) is a unique property of genomes and has contributed to the better understanding of the molecular features and the evolution processes of particular gene. In this study, genetic indices associated with CUB, including relative synonymous codon usage and effective numbers of codons, as well as the nucleotide composition, were investigated in the Clonorchis sinensis tyrosinase genes and their platyhelminth orthologs, which play an important role in the eggshell formation. The relative synonymous codon usage patterns substantially differed among tyrosinase genes examined. In a neutrality analysis, the correlation between GC 12 and GC 3 was statistically significant, and the regression line had a relatively gradual slope (0.218). NC-plot, i.e., GC 3 vs effective number of codons (ENC), showed that most of the tyrosinase genes were below the expected curve. The codon adaptation index (CAI) values of the platyhelminth tyrosinases had a narrow distribution between 0.685/0.714 and 0.797/0.837, and were negatively correlated with their ENC. Taken together, these results suggested that CUB in the tyrosinase genes seemed to be basically governed by selection pressures rather than mutational bias, although the latter factor provided an additional force in shaping CUB of the C. sinensis and Opisthorchis viverrini genes. It was also apparent that the equilibrium point between selection pressure and mutational bias is much more inclined to selection pressure in highly expressed C. sinensis genes, than in poorly expressed genes.
Ozen, Filiz; Ozdemir, Semra; Zemheri, Ebru; Hacimuto, Gizem; Silan, Fatma; Ozdemir, Ozturk
2013-02-01
The aim of the current study was to investigate the prevalence and predictive significance of the KRAS and BRAF mutations in Turkish patients with colorectal cancer (CRC). Totally, 53 fresh tumoral tissue specimens were investigated in patients with CRC. All specimens were obtained during routine surgery of patients who were histopathologically diagnosed and genotyped for common KRAS and BRAF point mutations. After DNA extraction, the target mutations were analyzed using the AutoGenomics INFINITI(®) assay, and some samples were confirmed by quantitative real-time polymerase chain reaction fluorescence melting curve analyses. KRAS mutations were found in 26 (49.05%) CRC samples. Twenty-seven samples (50.95%) had wild-type profiles for KRAS codon 12, 13, and 61 in the current cohort. In 17 (65.38%) samples, codon 12; in 7 (26.93%) samples, codon 13; and in 2 (7.69%) samples, codon 61 were found to be mutated, particularly in grade 2 of tumoral tissues. No point mutation was detected in BRAF codon Val600Glu for the studied CRC patients. Our study, based on a representative collection of human CRC tumors, indicates that KRAS gene mutations were detected in 49.05% of the samples, and the most frequent mutation was in the G12D codon. Results also showed that codons 12 and 13 of KRAS are relatively frequently without BRAF mutation in a CRC cohort from the Turkish population.
Ribosome stalling and peptidyl-tRNA drop-off during translational delay at AGA codons
Cruz-Vera, Luis Rogelio; Magos-Castro, Marco Antonio; Zamora-Romo, Efraín; Guarneros, Gabriel
2004-01-01
Minigenes encoding the peptide Met–Arg–Arg have been used to study the mechanism of toxicity of AGA codons proximal to the start codon or prior to the termination codon in bacteria. The codon sequences of the ‘mini-ORFs’ employed were initiator, combinations of AGA and CGA, and terminator. Both, AGA and CGA are low-usage Arg codons in ORFs of Escherichia coli but, whilst AGA is translated by the scarce tRNAArg4, CGA is recognized by the abundant tRNAArg2. Overexpression of minigenes harbouring AGA in the third position, next to a termination codon, was deleterious to the cell and led to the accumulation of peptidyl-tRNAArg4 and of the peptidyl-tRNA cognate to the preceding CGA or AGA Arg triplet. The minigenes carrying CGA in the third position were not toxic. Minigene-mediated toxicity and peptidyl-tRNA accumulation were suppressed by overproduction of tRNAArg4 but not by overproduction of peptidyl-tRNA hydrolase, an enzyme that is only active on substrates that have been released from the ribosome. Consistent with these findings, peptidyl-tRNAArg4 was identified to be mainly associated with ribosomes in a stand-by complex. These and previous results support the hypothesis that the primary mechanism of inhibition of protein synthesis by AGA triplets in pth+ cells involves sequestration of tRNAs as peptidyl-tRNA on the stalled ribosome. PMID:15317870
Johnston, Christopher; Douarre, Pierre E; Soulimane, Tewfik; Pletzer, Daniel; Weingart, Helge; MacSharry, John; Coffey, Aidan; Sleator, Roy D; O'Mahony, Jim
2013-06-01
Subunit and DNA-based vaccines against Mycobacterium avium ssp. paratuberculosis (MAP) attempt to overcome inherent issues associated with whole-cell formulations. However, these vaccines can be hampered by poor expression of recombinant antigens from a number of disparate hosts. The high G+C content of MAP invariably leads to a codon bias throughout gene expression. To investigate if the codon bias affects recombinant MAP antigen expression, the open reading frame of a MAP-specific antigen MptD (MAP3733c) was codon optimised for expression against a Lactobacillus salivarius host. Of the total 209 codons which constitute MAP3733c, 172 were modified resulting in a reduced G+C content from 61% for the native gene to 32.7% for the modified form. Both genes were placed under the transcriptional control of the PnisA promoter; allowing controlled heterologous expression in L. salivarius. Expression was monitored using fluorescence microscopy and microplate fluorometry via GFP tags translationally fused to the C-termini of the two MptD genes. A > 37-fold increase in expression was observed for the codon-optimised MAP3733synth variant over the native gene. Due to the low cost and improved expression achieved, codon optimisation significantly improves the potential of L. salivarius as an oral vaccine stratagem against Johne's disease. © 2013 Federation of European Microbiological Societies. Published by John Wiley & Sons Ltd. All rights reserved.
Characterization and optimization of ArtinM lectin expression in Escherichia coli.
Pranchevicius, Maria-Cristina S; Oliveira, Leandro L; Rosa, José C; Avanci, Nilton C; Quiapim, Andréa C; Roque-Barreira, Maria-Cristina; Goldman, Maria-Helena S
2012-08-02
ArtinM is a d-mannose-specific lectin from Artocarpus integrifolia seeds that induces neutrophil migration and activation, degranulation of mast cells, acceleration of wound healing, induction of interleukin-12 production by macrophages and dendritic cells, and protective T helper 1 immune response against Leishmania major, Leishmania amazonensis and Paracoccidioides brasiliensis infections. Considering the important biological properties of ArtinM and its therapeutic applicability, this study was designed to produce high-level expression of active recombinant ArtinM (rArtinM) in Escherichia coli system. The ArtinM coding region was inserted in pET29a(+) vector and expressed in E. coli BL21(DE3)-Codon Plus-RP. The conditions for overexpression of soluble ArtinM were optimized testing different parameters: temperatures (20, 25, 30 or 37°C) and shaking speeds (130, 200 or 220 rpm) during induction, concentrations of the induction agent IPTG (0.01-4 mM) and periods of induction (1-19 h). BL21-CodonPlus(DE3)-RP cells induced under the optimized conditions (incubation at 20°C, at a shaking speed of 130 rpm, induction with 0.4 mM IPTG for 19 h) resulted in the accumulation of large amounts of soluble rArtinM. The culture provided 22.4 mg/L of rArtinM, which activity was determined by its one-step purification through affinity chromatography on immobilized d-mannose and glycoarray analysis. Gel filtration showed that rArtinM is monomeric, contrasting with the tetrameric form of the plant native protein (jArtinM). The analysis of intact rArtinM by mass spectrometry revealed a 16,099.5 Da molecular mass, and the peptide mass fingerprint and esi-cid-ms/ms of amino acid sequences of peptides from a tryptic digest covered 41% of the total ArtinM amino acid sequence. In addition, circular dichroism and fluorescence spectroscopy of rArtinM indicated that its global fold comprises β-sheet structure. Overall, the optimized process to express rArtinM in E. coli provided high amounts of soluble, correctly folded and active recombinant protein, compatible with large scale production of the lectin.
2014-01-01
Background Down-regulation or silencing of transgene expression can be a major hurdle to both molecular studies and biotechnology applications in many plant species. Sugarcane is particularly effective at silencing introduced transgenes, including reporter genes such as the firefly luciferase gene. Synthesizing transgene coding sequences optimized for usage in the host plant is one method of enhancing transgene expression and stability. Using specified design rules we have synthesised new coding sequences for both the firefly luciferase and Renilla luciferase reporter genes. We have tested these optimized versions for enhanced levels of luciferase activity and for increased steady state luciferase mRNA levels in sugarcane. Results The synthetic firefly luciferase (luc*) and Renilla luciferase (Renluc*) coding sequences have elevated G + C contents in line with sugarcane codon usage, but maintain 75% identity to the native firefly or Renilla luciferase nucleotide sequences and 100% identity to the protein coding sequences. Under the control of the maize pUbi promoter, the synthetic luc* and Renluc* genes yielded 60x and 15x higher luciferase activity respectively, over the native firefly and Renilla luciferase genes in transient assays on sugarcane suspension cell cultures. Using a novel transient assay in sugarcane suspension cells combining co-bombardment and qRT-PCR, we showed that synthetic luc* and Renluc* genes generate increased transcript levels compared to the native firefly and Renilla luciferase genes. In stable transgenic lines, the luc* transgene generated significantly higher levels of expression than the native firefly luciferase transgene. The fold difference in expression was highest in the youngest tissues. Conclusions We developed synthetic versions of both the firefly and Renilla luciferase reporter genes that resist transgene silencing in sugarcane. These transgenes will be particularly useful for evaluating the expression patterns conferred by existing and newly isolated promoters in sugarcane tissues. The strategies used to design the synthetic luciferase transgenes could be applied to other transgenes that are aggressively silenced in sugarcane. PMID:24708613
Chou, Ting-Chun; Moyle, Richard L
2014-04-08
Down-regulation or silencing of transgene expression can be a major hurdle to both molecular studies and biotechnology applications in many plant species. Sugarcane is particularly effective at silencing introduced transgenes, including reporter genes such as the firefly luciferase gene.Synthesizing transgene coding sequences optimized for usage in the host plant is one method of enhancing transgene expression and stability. Using specified design rules we have synthesised new coding sequences for both the firefly luciferase and Renilla luciferase reporter genes. We have tested these optimized versions for enhanced levels of luciferase activity and for increased steady state luciferase mRNA levels in sugarcane. The synthetic firefly luciferase (luc*) and Renilla luciferase (Renluc*) coding sequences have elevated G + C contents in line with sugarcane codon usage, but maintain 75% identity to the native firefly or Renilla luciferase nucleotide sequences and 100% identity to the protein coding sequences.Under the control of the maize pUbi promoter, the synthetic luc* and Renluc* genes yielded 60x and 15x higher luciferase activity respectively, over the native firefly and Renilla luciferase genes in transient assays on sugarcane suspension cell cultures.Using a novel transient assay in sugarcane suspension cells combining co-bombardment and qRT-PCR, we showed that synthetic luc* and Renluc* genes generate increased transcript levels compared to the native firefly and Renilla luciferase genes.In stable transgenic lines, the luc* transgene generated significantly higher levels of expression than the native firefly luciferase transgene. The fold difference in expression was highest in the youngest tissues. We developed synthetic versions of both the firefly and Renilla luciferase reporter genes that resist transgene silencing in sugarcane. These transgenes will be particularly useful for evaluating the expression patterns conferred by existing and newly isolated promoters in sugarcane tissues. The strategies used to design the synthetic luciferase transgenes could be applied to other transgenes that are aggressively silenced in sugarcane.
Enhanced protection against Ebola virus mediated by an improved adenovirus-based vaccine.
Richardson, Jason S; Yao, Michel K; Tran, Kaylie N; Croyle, Maria A; Strong, James E; Feldmann, Heinz; Kobinger, Gary P
2009-01-01
The Ebola virus is transmitted by direct contact with bodily fluids of infected individuals, eliciting death rates as high as 90% among infected humans. Currently, replication defective adenovirus-based Ebola vaccine is being studied in a phase I clinical trial. Another Ebola vaccine, based on an attenuated vesicular stomatitis virus has shown efficacy in post-exposure treatment of nonhuman primates to Ebola infection. In this report, we modified the common recombinant adenovirus serotype 5-based Ebola vaccine expressing the wild-type ZEBOV glycoprotein sequence from a CMV promoter (Ad-CMVZGP). The immune response elicited by this improved expression cassette vector (Ad-CAGoptZGP) and its ability to afford protection against lethal ZEBOV challenge in mice was compared to the standard Ad-CMVZGP vector. Ad-CMVZGP was previously shown to protect mice, guinea pigs and nonhuman primates from an otherwise lethal challenge of Zaire ebolavirus. The antigenic expression cassette of this vector was improved through codon optimization, inclusion of a consensus Kozak sequence and reconfiguration of a CAG promoter (Ad-CAGoptZGP). Expression of GP from Ad-CAGoptZGP was substantially higher than from Ad-CMVZGP. Ad-CAGoptZGP significantly improved T and B cell responses at doses 10 to 100-fold lower than that needed with Ad-CMVZGP. Additionally, Ad-CAGoptZGP afforded full protections in mice against lethal challenge at a dose 100 times lower than the dose required for Ad-CMVZGP. Finally, Ad-CAGoptZGP induced full protection to mice when given 30 minutes post-challenge. We describe an improved adenovirus-based Ebola vaccine capable of affording post-exposure protection against lethal challenge in mice. The molecular modifications of the new improved vaccine also translated in the induction of significantly enhanced immune responses and complete protection at a dose 100 times lower than with the previous generation adenovirus-based Ebola vaccine. Understanding and improving the molecular components of adenovirus-based vaccines can produce potent, optimized product, useful for vaccination and post-exposure therapy.
Disruption of the Opal Stop Codon Attenuates Chikungunya Virus-Induced Arthritis and Pathology.
Jones, Jennifer E; Long, Kristin M; Whitmore, Alan C; Sanders, Wes; Thurlow, Lance R; Brown, Julia A; Morrison, Clayton R; Vincent, Heather; Peck, Kayla M; Browning, Christian; Moorman, Nathaniel; Lim, Jean K; Heise, Mark T
2017-11-14
Chikungunya virus (CHIKV) is a mosquito-borne alphavirus responsible for several significant outbreaks of debilitating acute and chronic arthritis and arthralgia over the past decade. These include a recent outbreak in the Caribbean islands and the Americas that caused more than 1 million cases of viral arthralgia. Despite the major impact of CHIKV on global health, viral determinants that promote CHIKV-induced disease are incompletely understood. Most CHIKV strains contain a conserved opal stop codon at the end of the viral nsP3 gene. However, CHIKV strains that encode an arginine codon in place of the opal stop codon have been described, and deep-sequencing analysis of a CHIKV isolate from the Caribbean identified both arginine and opal variants within this strain. Therefore, we hypothesized that the introduction of the arginine mutation in place of the opal termination codon may influence CHIKV virulence. We tested this by introducing the arginine mutation into a well-characterized infectious clone of a CHIKV strain from Sri Lanka and designated this virus Opal524R. This mutation did not impair viral replication kinetics in vitro or in vivo Despite this, the Opal524R virus induced significantly less swelling, inflammation, and damage within the feet and ankles of infected mice. Further, we observed delayed induction of proinflammatory cytokines and chemokines, as well as reduced CD4 + T cell and NK cell recruitment compared to those in the parental strain. Therefore, the opal termination codon plays an important role in CHIKV pathogenesis, independently of effects on viral replication. IMPORTANCE Chikungunya virus (CHIKV) is a mosquito-borne alphavirus that causes significant outbreaks of viral arthralgia. Studies with CHIKV and other alphaviruses demonstrated that the opal termination codon within nsP3 is highly conserved. However, some strains of CHIKV and other alphaviruses contain mutations in the opal termination codon. These mutations alter the virulence of related alphaviruses in mammalian and mosquito hosts. Here, we report that a clinical isolate of a CHIKV strain from the recent outbreak in the Caribbean islands contains a mixture of viruses encoding either the opal termination codon or an arginine mutation. Mutating the opal stop codon to an arginine residue attenuates CHIKV-induced disease in a mouse model. Compared to infection with the opal-containing parental virus, infection with the arginine mutant causes limited swelling and inflammation, as well as dampened recruitment of immune mediators of pathology, including CD4 + T cells and NK cells. We propose that the opal termination codon plays an essential role in the induction of severe CHIKV disease. Copyright © 2017 Jones et al.
Effect of Estrogen on Mutagenesis in Human Mammary Epithelial Cells
2005-06-01
instability remains undefined in most human cancers, it appears to arise from subtle, intragenic mutations of the genes , whose products play a key role in...cells and is less labor-intensive. A G-G or T-G mismatch was introduced into ATG start codon of the enhanced green fluorescent protein (EGFP) gene ...Repair of the G-G or T-G mismatch to G-C or T-A, respectively in the heteroduplex plasmid generates a functional EGFP gene expression. The heteroduplex
Transformation of NIH3T3 Cells with Synthetic c‐Ha‐ras Genes
Kamiya, Hiroyuki; Miura, Kazunobu; Ohtomo, Noriko; Koda, Toshiaki; Kakinuma, Mitsuaki; Nishimura, Susumu
1989-01-01
Synthetic human c‐Ha‐ras genes in which amino acid codons were altered to those which are frequently used in highly expressed Escherichia coli genes were ligated to the 3′‐end of Rous sarcoma virus long terminal repeat. When NIH3T3 cells were transfected with the plasmids having those genes with valine at codon 12, leucine at codon 61 or arginine at codon 61, transformants were efficiently produced. These results indicated that the synthetic c‐Ha‐ras genes are expressed in a mammalian system even though their codon usage is altered to correspond with that of E. colt. This expression vector system should he useful for studies on the structure‐function relationships of c‐Ha‐ras, since the synthetic gene can be easily modified to have multiple base alterations, and can also be used simultaneously for the production of large amounts of p21 in E. coli for biochemical and biophysical studies. PMID:2542206
RNA Editing in Plant Mitochondria
NASA Astrophysics Data System (ADS)
Hiesel, Rudolf; Wissinger, Bernd; Schuster, Wolfgang; Brennicke, Axel
1989-12-01
Comparative sequence analysis of genomic and complementary DNA clones from several mitochondrial genes in the higher plant Oenothera revealed nucleotide sequence divergences between the genomic and the messenger RNA-derived sequences. These sequence alterations could be most easily explained by specific post-transcriptional nucleotide modifications. Most of the nucleotide exchanges in coding regions lead to altered codons in the mRNA that specify amino acids better conserved in evolution than those encoded by the genomic DNA. Several instances show that the genomic arginine codon CGG is edited in the mRNA to the tryptophan codon TGG in amino acid positions that are highly conserved as tryptophan in the homologous proteins of other species. This editing suggests that the standard genetic code is used in plant mitochondria and resolves the frequent coincidence of CGG codons and tryptophan in different plant species. The apparently frequent and non-species-specific equivalency of CGG and TGG codons in particular suggests that RNA editing is a common feature of all higher plant mitochondria.
An analysis of the metabolic theory of the origin of the genetic code
NASA Technical Reports Server (NTRS)
Amirnovin, R.; Bada, J. L. (Principal Investigator)
1997-01-01
A computer program was used to test Wong's coevolution theory of the genetic code. The codon correlations between the codons of biosynthetically related amino acids in the universal genetic code and in randomly generated genetic codes were compared. It was determined that many codon correlations are also present within random genetic codes and that among the random codes there are always several which have many more correlations than that found in the universal code. Although the number of correlations depends on the choice of biosynthetically related amino acids, the probability of choosing a random genetic code with the same or greater number of codon correlations as the universal genetic code was found to vary from 0.1% to 34% (with respect to a fairly complete listing of related amino acids). Thus, Wong's theory that the genetic code arose by coevolution with the biosynthetic pathways of amino acids, based on codon correlations between biosynthetically related amino acids, is statistical in nature.
Complete mitochondrial genome of the Yellownose skate: Zearaja chilensis (Rajiformes, Rajidae).
Jeong, Dageum; Lee, Youn-Ho
2016-01-01
The complete sequence of mitochondrial DNA of a Yellownose skate, Zearaja chilensis was determined for the first time. It is 16,909 bp in length covering 2 rRNA, 22 tRNA and 13 protein coding genes with the identical gene order and structure as those of other Rajidae species. The nucleotide of L-strand is composed of low G (14.3%), and slightly high A + T (58.9%) nucleotides. The strong codon usage bias against the use of G (6.0%) is found at the third codon positions. Twelve of the 13 protein coding genes use ATG as the start codon while COX1 starts with GTG. As for the stop codon, only ND4 shows an incomplete stop codon TA. This is the first report of the mitogenome for a species in the genus Zearaja, providing a valuable source of genetic information on the evolution of the family Rajidae and the genus Zearaja as well as for establishment of a sustainble fishery management plan of the species.
Chi, Xiaojuan; Wang, Song; Ma, Yanmei; Chen, Jilong
2017-01-01
The classical swine fever virus (CSFV), circulating worldwide, is a highly contagious virus. Since the emergence of CSFV, it has caused great economic loss in swine industry. The envelope glycoprotein E2 gene of the CSFV is an immunoprotective antigen that induces the immune system to produce neutralizing antibodies. Therefore, it is essential to study the codon usage of the E2 gene of the CSFV. In this study, 140 coding sequences of the E2 gene were analyzed. The value of effective number of codons (ENC) showed low codon usage bias in the E2 gene. Our study showed that codon usage could be described mainly by mutation pressure ENC plot analysis combined with principal component analysis (PCA) and translational selection-correlation analysis between the general average hydropathicity (Gravy) and aromaticity (Aroma), and nucleotides at the third position of codons (A3s, T3s, G3s, C3s and GC3s). Furthermore, the neutrality analysis, which explained the relationship between GC12s and GC3s, revealed that natural selection had a key role compared with mutational bias during the evolution of the E2 gene. These results lay a foundation for further research on the molecular evolution of CSFV. PMID:28880881
Chen, Ye; Li, Xinxin; Chi, Xiaojuan; Wang, Song; Ma, Yanmei; Chen, Jilong
2017-01-01
The classical swine fever virus (CSFV), circulating worldwide, is a highly contagious virus. Since the emergence of CSFV, it has caused great economic loss in swine industry. The envelope glycoprotein E2 gene of the CSFV is an immunoprotective antigen that induces the immune system to produce neutralizing antibodies. Therefore, it is essential to study the codon usage of the E2 gene of the CSFV. In this study, 140 coding sequences of the E2 gene were analyzed. The value of effective number of codons (ENC) showed low codon usage bias in the E2 gene. Our study showed that codon usage could be described mainly by mutation pressure ENC plot analysis combined with principal component analysis (PCA) and translational selection-correlation analysis between the general average hydropathicity (Gravy) and aromaticity (Aroma), and nucleotides at the third position of codons (A3s, T3s, G3s, C3s and GC3s). Furthermore, the neutrality analysis, which explained the relationship between GC12s and GC3s, revealed that natural selection had a key role compared with mutational bias during the evolution of the E2 gene. These results lay a foundation for further research on the molecular evolution of CSFV.
rpoB gene mutations among Mycobacterium tuberculosis isolates from extrapulmonary sites.
Khosravi, Azar Dokht; Meghdadi, Hossein; Ghadiri, Ata A; Alami, Ameneh; Sina, Amir Hossein; Mirsaeidi, Mehdi
2018-03-01
The aim of this study was to analyze mutations occurring in the rpoB gene of Mycobacterium tuberculosis (MTB) isolates from clinical samples of extrapulmonary tuberculosis (EPTB). Seventy formalin-fixed, paraffin-embedded samples and fresh tissue samples from confirmed EPTB cases were analyzed. Nested PCR based on the rpoB gene was performed on the extracted DNAs, combined with cloning and subsequent sequencing. Sixty-seven (95.7%) samples were positive for nester PCR. Sequence analysis of the 81 bp region of the rpoB gene demonstrated mutations in 41 (61.2%) of 67 sequenced samples. Several point mutations including deletion mutations at codons 510, 512, 513 and 515, with 45% and 51% of the mutations in codons 512 and 513 respectively were seen, along with 26% replacement mutations at codons 509, 513, 514, 518, 520, 524 and 531. The most common alteration was Gln → His, at codon 513, presented in 30 (75.6%) isolates. This study demonstrated sequence alterations in codon 513 of the 81 bp region of the rpoB gene as the most common mutation occurred in 75.6% of molecularly confirmed rifampin-resistant strains. In addition, simultaneous mutation at codons 512 and 513 was demonstrated in 34.3% of the isolates. © 2018 APMIS. Published by John Wiley & Sons Ltd.
Differential Reprogramming of Isogenic Colorectal Cancer Cells by Distinct Activating KRAS Mutations
2015-01-01
Oncogenic mutations of Ras at codons 12, 13, or 61, that render the protein constitutively active, are found in ∼16% of all cancer cases. Among the three major Ras isoforms, KRAS is the most frequently mutated isoform in cancer. Each Ras isoform and tumor type displays a distinct pattern of codon-specific mutations. In colon cancer, KRAS is typically mutated at codon 12, but a significant fraction of patients have mutations at codon 13. Clinical data suggest different outcomes and responsiveness to treatment between these two groups. To investigate the differential effects upon cell status associated with KRAS mutations we performed a quantitative analysis of the proteome and phosphoproteome of isogenic SW48 colon cancer cell lines in which one allele of the endogenous gene has been edited to harbor specific KRAS mutations (G12V, G12D, or G13D). Each mutation generates a distinct signature, with the most variability seen between G13D and the codon 12 KRAS mutants. One notable example of specific up-regulation in KRAS codon 12 mutant SW48 cells is provided by the short form of the colon cancer stem cell marker doublecortin-like Kinase 1 (DCLK1) that can be reversed by suppression of KRAS. PMID:25599653
Modification of orthogonal tRNAs: unexpected consequences for sense codon reassignment.
Biddle, Wil; Schmitt, Margaret A; Fisk, John D
2016-12-01
Breaking the degeneracy of the genetic code via sense codon reassignment has emerged as a way to incorporate multiple copies of multiple non-canonical amino acids into a protein of interest. Here, we report the modification of a normally orthogonal tRNA by a host enzyme and show that this adventitious modification has a direct impact on the activity of the orthogonal tRNA in translation. We observed nearly equal decoding of both histidine codons, CAU and CAC, by an engineered orthogonal M. jannaschii tRNA with an AUG anticodon: tRNA Opt We suspected a modification of the tRNA Opt AUG anticodon was responsible for the anomalous lack of codon discrimination and demonstrate that adenosine 34 of tRNA Opt AUG is converted to inosine. We identified tRNA Opt AUG anticodon loop variants that increase reassignment of the histidine CAU codon, decrease incorporation in response to the histidine CAC codon, and improve cell health and growth profiles. Recognizing tRNA modification as both a potential pitfall and avenue of directed alteration will be important as the field of genetic code engineering continues to infiltrate the genetic codes of diverse organisms. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Li, Z; Kelley, C; Collins, F; Rouse, D; Morris, S
1998-04-01
The molecular mechanisms associated with the pathogenesis of tuberculosis are not well understood. The present study evaluated the role of catalase-peroxidase as a potential virulence factor for Mycobacterium tuberculosis. Growth and persistence of M. tuberculosis H37Rv in intravenously infected BALB/ c mice were compared with katG-deleted, isoniazid-resistant M. tuberculosis H37RVINHR. Transformation of M. tuberculosis H37Rv (TBkatG) or Mycobacterium intracellulare (MACkatG) genes into M. tuberculosis H37RvINHR restored its catalase-peroxidase activities and the ability of the recombinants to persist in spleens of mice and guinea pigs. Transformation with the TBkatG gene with the codon 463 R-->L mutation also restored catalase-peroxidase activity and enhanced persistence. However, transformants with the codon 275 T-->P mutant expressed low levels of enzymatic activity and failed to persist in guinea pig spleen, although they did survive in mouse tissues. These results indicate that KatG contributes to the ability of M. tuberculosis to grow and survive within the infected host tissues.
2012-01-01
Background To evaluate the value of KRAS codon 13 mutations in patients with advanced colorectal cancer (advanced CRC) treated with oxaliplatin and fluoropyrimidines. Methods Tumor specimens from 201 patients with advanced CRC from a randomized, phase III trial comparing oxaliplatin/5-FU vs. oxaliplatin/capecitabine were retrospectively analyzed for KRAS mutations. Mutation data were correlated to response data (Overall response rate, ORR), progression-free survival (PFS) and overall survival (OS). Results 201 patients were analysed for KRAS mutation (61.2% males; mean age 64.2 ± 8.6 years). KRAS mutations were identified in 36.3% of tumors (28.8% in codon 12, 7.4% in codon 13). The ORR in codon 13 patients compared to codon 12 and wild type patients was significantly lower (p = 0.008). There was a tendency for a better overall survival in KRAS wild type patients compared to mutants (p = 0.085). PFS in all patients was not different in the three KRAS genetic groups (p = 0.72). However, we found a marked difference in PFS between patients with codon 12 and 13 mutant tumors treated with infusional 5-FU versus capecitabine based regimens. Conclusions Our data suggest that the type of KRAS mutation may be of clinical relevance under oxaliplatin combination chemotherapies without the addition of monoclonal antibodies in particular when overall response rates are important. Trial registration number 2002-04-017 PMID:22876876
Mitochondrial genetic codes evolve to match amino acid requirements of proteins.
Swire, Jonathan; Judson, Olivia P; Burt, Austin
2005-01-01
Mitochondria often use genetic codes different from the standard genetic code. Now that many mitochondrial genomes have been sequenced, these variant codes provide the first opportunity to examine empirically the processes that produce new genetic codes. The key question is: Are codon reassignments the sole result of mutation and genetic drift? Or are they the result of natural selection? Here we present an analysis of 24 phylogenetically independent codon reassignments in mitochondria. Although the mutation-drift hypothesis can explain reassignments from stop to an amino acid, we found that it cannot explain reassignments from one amino acid to another. In particular--and contrary to the predictions of the mutation-drift hypothesis--the codon involved in such a reassignment was not rare in the ancestral genome. Instead, such reassignments appear to take place while the codon is in use at an appreciable frequency. Moreover, the comparison of inferred amino acid usage in the ancestral genome with the neutral expectation shows that the amino acid gaining the codon was selectively favored over the amino acid losing the codon. These results are consistent with a simple model of weak selection on the amino acid composition of proteins in which codon reassignments are selected because they compensate for multiple slightly deleterious mutations throughout the mitochondrial genome. We propose that the selection pressure is for reduced protein synthesis cost: most reassignments give amino acids that are less expensive to synthesize. Taken together, our results strongly suggest that mitochondrial genetic codes evolve to match the amino acid requirements of proteins.
Computational Tools and Algorithms for Designing Customized Synthetic Genes
Gould, Nathan; Hendy, Oliver; Papamichail, Dimitris
2014-01-01
Advances in DNA synthesis have enabled the construction of artificial genes, gene circuits, and genomes of bacterial scale. Freedom in de novo design of synthetic constructs provides significant power in studying the impact of mutations in sequence features, and verifying hypotheses on the functional information that is encoded in nucleic and amino acids. To aid this goal, a large number of software tools of variable sophistication have been implemented, enabling the design of synthetic genes for sequence optimization based on rationally defined properties. The first generation of tools dealt predominantly with singular objectives such as codon usage optimization and unique restriction site incorporation. Recent years have seen the emergence of sequence design tools that aim to evolve sequences toward combinations of objectives. The design of optimal protein-coding sequences adhering to multiple objectives is computationally hard, and most tools rely on heuristics to sample the vast sequence design space. In this review, we study some of the algorithmic issues behind gene optimization and the approaches that different tools have adopted to redesign genes and optimize desired coding features. We utilize test cases to demonstrate the efficiency of each approach, as well as identify their strengths and limitations. PMID:25340050
José, Marco V.; Govezensky, Tzipe; García, José A.; Bobadilla, Juan R.
2009-01-01
Herein two genetic codes from which the primeval RNA code could have originated the standard genetic code (SGC) are derived. One of them, called extended RNA code type I, consists of all codons of the type RNY (purine-any base-pyrimidine) plus codons obtained by considering the RNA code but in the second (NYR type) and third (YRN type) reading frames. The extended RNA code type II, comprises all codons of the type RNY plus codons that arise from transversions of the RNA code in the first (YNY type) and third (RNR) nucleotide bases. In order to test if putative nucleotide sequences in the RNA World and in both extended RNA codes, share the same scaling and statistical properties to those encountered in current prokaryotes, we used the genomes of four Eubacteria and three Archaeas. For each prokaryote, we obtained their respective genomes obeying the RNA code or the extended RNA codes types I and II. In each case, we estimated the scaling properties of triplet sequences via a renormalization group approach, and we calculated the frequency distributions of distances for each codon. Remarkably, the scaling properties of the distance series of some codons from the RNA code and most codons from both extended RNA codes turned out to be identical or very close to the scaling properties of codons of the SGC. To test for the robustness of these results, we show, via computer simulation experiments, that random mutations of current genomes, at the rates of 10−10 per site per year during three billions of years, were not enough for destroying the observed patterns. Therefore, we conclude that most current prokaryotes may still contain relics of the primeval RNA World and that both extended RNA codes may well represent two plausible evolutionary paths between the RNA code and the current SGC. PMID:19183813
Modifications modulate anticodon loop dynamics and codon recognition of E. coli tRNA(Arg1,2).
Cantara, William A; Bilbille, Yann; Kim, Jia; Kaiser, Rob; Leszczyńska, Grażyna; Malkiewicz, Andrzej; Agris, Paul F
2012-03-02
Three of six arginine codons are read by two tRNA(Arg) isoacceptors in Escherichia coli. The anticodon stem and loop of these isoacceptors (ASL(Arg1,2)) differs only in that the position 32 cytidine of tRNA(Arg1) is posttranscriptionally modified to 2-thiocytidine (s(2)C(32)). The tRNA(Arg1,2) are also modified at positions 34 (inosine, I(34)) and 37 (2-methyladenosine, m(2)A(37)). To investigate the roles of modifications in the structure and function, we analyzed six ASL(Arg1,2) constructs differing in their array of modifications by spectroscopy and codon binding assays. Thermal denaturation and circular dichroism spectroscopy indicated that modifications contribute thermodynamic and base stacking properties, resulting in more order but less stability. NMR-derived structures of the ASL(Arg1,2) showed that the solution structures of the ASLs were nearly identical. Surprisingly, none possessed the U-turn conformation required for effective codon binding on the ribosome. Yet, all ASL(Arg1,2) constructs efficiently bound the cognate CGU codon. Three ASLs with I(34) were able to decode CGC, whereas only the singly modified ASL(Arg1,2)(ICG) with I(34) was able to decode CGA. The dissociation constants for all codon bindings were physiologically relevant (0.4-1.4 μM). However, with the introduction of s(2)C(32) or m(2)A(37) to ASL(Arg1,2)(ICG), the maximum amount of ASL bound to CGU and CGC was significantly reduced. These results suggest that, by allowing loop flexibility, the modifications modulate the conformation of the ASL(Arg1,2), which takes one structure free in solution and two others when bound to the cognate arginyl-tRNA synthetase or to codons on the ribosome where modifications reduce or restrict binding to specific codons. Copyright © 2011 Elsevier Ltd. All rights reserved.
Bera, Bidhan Ch; Virmani, Nitin; Kumar, Naveen; Anand, Taruna; Pavulraj, S; Rash, Adam; Elton, Debra; Rash, Nicola; Bhatia, Sandeep; Sood, Richa; Singh, Raj Kumar; Tripathi, Bhupendra Nath
2017-08-23
Equine influenza is a major health problem of equines worldwide. The polymerase genes of influenza virus have key roles in virus replication, transcription, transmission between hosts and pathogenesis. Hence, the comprehensive genetic and codon usage bias of polymerase genes of equine influenza virus (EIV) were analyzed to elucidate the genetic and evolutionary relationships in a novel perspective. The group - specific consensus amino acid substitutions were identified in all polymerase genes of EIVs that led to divergence of EIVs into various clades. The consistent amino acid changes were also detected in the Florida clade 2 EIVs circulating in Europe and Asia since 2007. To study the codon usage patterns, a total of 281,324 codons of polymerase genes of EIV H3N8 isolates from 1963 to 2015 were systemically analyzed. The polymerase genes of EIVs exhibit a weak codon usage bias. The ENc-GC3s and Neutrality plots indicated that natural selection is the major influencing factor of codon usage bias, and that the impact of mutation pressure is comparatively minor. The methods for estimating host imposed translation pressure suggested that the polymerase acidic (PA) gene seems to be under less translational pressure compared to polymerase basic 1 (PB1) and polymerase basic 2 (PB2) genes. The multivariate statistical analysis of polymerase genes divided EIVs into four evolutionary diverged clusters - Pre-divergent, Eurasian, Florida sub-lineage 1 and 2. Various lineage specific amino acid substitutions observed in all polymerase genes of EIVs and especially, clade 2 EIVs underwent major variations which led to the emergence of a phylogenetically distinct group of EIVs originating from Richmond/1/07. The codon usage bias was low in all the polymerase genes of EIVs that was influenced by the multiple factors such as the nucleotide compositions, mutation pressure, aromaticity and hydropathicity. However, natural selection was the major influencing factor in defining the codon usage patterns and evolution of polymerase genes of EIVs.
Margonis, Georgios A; Kim, Yuhree; Sasaki, Kazunari; Samaha, Mario; Amini, Neda; Pawlik, Timothy M
2016-09-01
Investigations regarding the impact of tumor biology after surgical management of colorectal liver metastasis have focused largely on overall survival. We investigated the impact of codon-specific KRAS mutations on the rates and patterns of recurrence in patients after surgery for colorectal liver metastasis (CRLM). All patients who underwent curative-intent surgery for CRLM between 2002 and 2015 at Johns Hopkins who had available data on KRAS mutation status were identified. Clinico-pathologic data, recurrence patterns, and recurrence-free survival (RFS) were assessed using univariable and multivariable analyses. A total of 512 patients underwent resection only (83.2%) or resection plus radiofrequency ablation (16.8%). Although 5-year overall survival was 64.6%, 284 (55.5%) patients recurred with a median RFS time of 18.1 months. The liver was the initial recurrence site for 181 patients, whereas extrahepatic recurrence was observed in 162 patients. Among patients with an extrahepatic recurrence, 102 (63%) had a lung recurrence. Although overall KRAS mutation was not associated with overall RFS (P = 0.186), it was independently associated with a worse extrahepatic (P = 0.004) and lung RFS (P = 0.007). Among patients with known KRAS codon-specific mutations, patients with codon 13 KRAS mutation had a worse 5-year extrahepatic RFS (P = 0.01), whereas codon 12 mutations were not associated with extrahepatic (P = 0.11) or lung-specific recurrence rate (P = 0.24). On multivariable analysis, only codon 13 mutation independently predicted worse overall extrahepatic RFS (P = 0.004) and lung-specific RFS (P = 0.023). Among patients undergoing resection of CRLM, overall KRAS mutation was not associated with RFS. KRAS codon 13 mutations, but not codon 12 mutations, were associated with a higher risk for overall extrahepatic recurrence and lung-specific recurrence. Cancer 2016. © 2016 American Cancer Society. Cancer 2016;122:2698-2707. © 2016 American Cancer Society. © 2016 American Cancer Society.
2013-01-01
Background Retrospective analyses in the West suggest that mutations in KRAS codons 61 and 146, BRAF, NRAS, and PIK3CA are negative predictive factors for cetuximab treatment in colorectal cancer patients. We developed a novel multiplex kit detecting 36 mutations in KRAS codons 61 and 146, BRAF, NRAS, and PIK3CA using Luminex (xMAP) assay in a single reaction. Methods Tumor samples and clinical data from Asian colorectal cancer patients treated with cetuximab were collected. We investigated KRAS, BRAF, NRAS, and PIK3CA mutations using both the multiplex kit and direct sequencing methods, and evaluated the concordance between the 2 methods. Objective response, progression-free survival (PFS), and overall survival (OS) were also evaluated according to mutational status. Results In total, 82 of 83 samples (78 surgically resected specimens and 5 biopsy specimens) were analyzed using both methods. All multiplex assays were performed using 50 ng of template DNA. The concordance rate between the methods was 100%. Overall, 49 (59.8%) patients had all wild-type tumors, 21 (25.6%) had tumors harboring KRAS codon 12 or 13 mutations, and 12 (14.6%) had tumors harboring KRAS codon 61, KRAS codon 146, BRAF, NRAS, or PIK3CA mutations. The response rates in these patient groups were 38.8%, 4.8%, and 0%, respectively. Median PFS in these groups was 6.1 months (95% confidence interval (CI): 3.1–9.2), 2.7 months (1.2–4.2), and 1.6 months (1.5–1.7); median OS was 13.8 months (9.2–18.4), 8.2 months (5.7–10.7), and 6.3 months (1.3–11.3), respectively. Statistically significant differences in both PFS and OS were found between patients with all wild-type tumors and those with KRAS codon 61, KRAS codon 146, BRAF, NRAS, or PIK3CA mutations (PFS: 95% CI, 0.11–0.44; P < 0.0001; OS: 95% CI, 0.15–0.61; P < 0.0001). Conclusions Our newly developed multiplex kit is practical and feasible for investigation of a range of sample types. Moreover, mutations in KRAS codon 61, KRAS codon 146, BRAF, NRAS, or PIK3CA detected in Asian patients were not predictive of clinical benefits from cetuximab treatment, similar to the result obtained in European studies. PMID:24006859
Bohmert-Tatarev, Karen; McAvoy, Susan; Daughtry, Sean; Peoples, Oliver P.; Snell, Kristi D.
2011-01-01
An optimized genetic construct for plastid transformation of tobacco (Nicotiana tabacum) for the production of the renewable, biodegradable plastic polyhydroxybutyrate (PHB) was designed using an operon extension strategy. Bacterial genes encoding the PHB pathway enzymes were selected for use in this construct based on their similarity to the codon usage and GC content of the tobacco plastome. Regulatory elements with limited homology to the host plastome yet known to yield high levels of plastidial recombinant protein production were used to enhance the expression of the transgenes. A partial transcriptional unit, containing genes of the PHB pathway and a selectable marker gene encoding spectinomycin resistance, was flanked at the 5′ end by the host plant’s psbA coding sequence and at the 3′ end by the host plant’s 3′ psbA untranslated region. This design allowed insertion of the transgenes into the plastome as an extension of the psbA operon, rendering the addition of a promoter to drive the expression of the transgenes unnecessary. Transformation of the optimized construct into tobacco and subsequent spectinomycin selection of transgenic plants yielded T0 plants that were capable of producing up to 18.8% dry weight PHB in samples of leaf tissue. These plants were fertile and produced viable seed. T1 plants producing up to 17.3% dry weight PHB in samples of leaf tissue and 8.8% dry weight PHB in the total biomass of the plant were also isolated. PMID:21325565
Analysis of amino acid and codon usage in Paramecium bursaria.
Dohra, Hideo; Fujishima, Masahiro; Suzuki, Haruo
2015-10-07
The ciliate Paramecium bursaria harbors the green-alga Chlorella symbionts. We reassembled the P. bursaria transcriptome to minimize falsely fused transcripts, and investigated amino acid and codon usage using the transcriptome data. Surface proteins preferentially use smaller amino acid residues like cysteine. Unusual synonymous codon and amino acid usage in highly expressed genes can reflect a balance between translational selection and other factors. A correlation of gene expression level with synonymous codon or amino acid usage is emphasized in genes down-regulated in symbiont-bearing cells compared to symbiont-free cells. Our results imply that the selection is associated with P. bursaria-Chlorella symbiosis. Copyright © 2015 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Jeong, Hyun-Jeong; Lee, Joong-Bok; Park, Seung-Yong; Song, Chang-Seon; Kim, Bo-Sook; Rho, Jung-Rae; Yoo, Mi-Hyun; Jeong, Byung-Hoon; Kim, Yong-Sun
2007-01-01
Polymorphisms of the prion protein gene (PRNP) have been detected in several cervid species. In order to confirm the genetic variations, this study examined the DNA sequences of the PRNP obtained from 33 captive sika deer (Cervus nippon laiouanus) in Korea. A total of three single-nucleotide polymorphisms (SNPs) at codons 100, 136 and 226 in the PRNP of the sika deer were identified. The polymorphic site located at codon 100 has not been reported. The SNPs detected at codons 100 and 226 induced amino acid substitutions. The SNP at codon 136 was a silent mutation that does not induce any amino acid change. The genotype and allele frequencies were determined for each of the SNPs. PMID:17679779
Long, Xi-Dai; Ma, Yun; Zhou, Yuan-Feng; Ma, Ai-Min; Fu, Guo-Hui
2010-10-01
Genetic polymorphisms in DNA repair genes may influence individual variations in DNA repair capacity, and this may be associated with the risk and outcome of hepatocellular carcinoma (HCC) related to aflatoxin B1 (AFB1) exposure. In this study, we focused on the polymorphism of xeroderma pigmentosum complementation group C (XPC) codon 939 (rs#2228001), which is involved in nucleotide excision repair. We conducted a case-control study including 1156 HCC cases and 1402 controls without any evidence of hepatic disease to evaluate the associations between this polymorphism and HCC risk and prognosis in the Guangxi population. AFB1 DNA adduct levels, XPC genotypes, and XPC protein levels were tested with a comparative enzyme-linked immunosorbent assay, TaqMan polymerase chain reaction for XPC genotypes, and immunohistochemistry, respectively. Higher AFB1 exposure was observed among HCC patients versus the control group [odds ratio (OR) = 9.88 for AFB1 exposure years and OR = 6.58 for AFB1 exposure levels]. The XPC codon 939 Gln alleles significantly increased HCC risk [OR = 1.25 (95% confidence interval = 1.03-1.52) for heterozygotes of the XPC codon 939 Lys and Gln alleles (XPC-LG) and OR = 1.81 (95% confidence interval = 1.36-2.40) for homozygotes of the XPC codon 939 Gln alleles (XPC-GG)]. Significant interactive effects between genotypes and AFB1 exposure status were also observed in the joint-effects analysis. This polymorphism, moreover, was correlated with XPC expression levels in cancerous tissues (r = -0.369, P < 0.001) and with the overall survival of HCC patients (the median survival times were 30, 25, and 19 months for patients with homozygotes of the XPC codon 939 Lys alleles, XPC-LG, and XPC-GG, respectively), especially under high AFB1 exposure conditions. Like AFB1 exposure, the XPC codon 939 polymorphism was an independent prognostic factor influencing the survival of HCC. Additionally, this polymorphism multiplicatively interacted with the xeroderma pigmentosum complementation group D codon 751 polymorphism with respect to HCC risk (OR(interaction) = 1.71). These results suggest that the XPC codon 939 polymorphism may be associated with the risk and outcome of AFB1-related HCC in the Guangxi population and may interact with AFB1 exposure in the process of HCC induction by AFB1.
Molecular Characterization of β-Thalassemia Mutations in Central Vietnam.
Doro, Maria G; Casu, Giuseppina; Frogheri, Laura; Persico, Ivana; Triet, Le Phan Minh; Hoa, Phan Thi Thuy; Hoang, Nguyen Huy; Pirastru, Monica; Mereu, Paolo; Cucca, Francesco; Masala, Bruno
2017-03-01
The molecular basis of β-thalassemia (β-thal) mutations in North and in South Vietnam have been described during the past 15 years, whereas limited data were available concerning the central area of the country. In this study, we describe the molecular characterization and frequency of β-globin gene mutations in the Thua Thien Hue Province of Central Vietnam as the result of a first survey conducted in 22 transfusion-dependent patients, and four unrelated heterozygotes. Nine different known mutations were identified (seven of the β 0 and two of the β + type) in a total of 48 chromosomes. The most common was codon 26 (G>A) or Hb E (HBB: c.79 G>A) accounting for 29.2% of the total studied chromosomes, followed by codon 17 (A>T) (HBB: c.52 A>T) (25.0%), and codons 41/42 (-TTCT) (HBB: c.126_129delCTTT) (18.8%). Other mutations with appreciable frequencies (6.3-8.3%) were IVS-I-1 (G>T) (HBB: c.92+1 G>T), codon 26 (G>T) (HBB: c.79 G>T) and codons 71/72 (+A) (HBB: c.216_217insA). Relatively rarer (2.0%) were the promoter -28 (A>G) (HBB: c.78 A>G) mutation, the codon 95 (+A) (HBB: c.287_288insA), which is reported only in the Vietnamese, and the codons 14/15 (+G) (HBB: c.45_46insG) mutation, thus far observed only in Thailand. Results are relevant for implementing appropriate measures for β-thal prevention and control in the region as well as in the whole country.
Hand gesture recognition by analysis of codons
NASA Astrophysics Data System (ADS)
Ramachandra, Poornima; Shrikhande, Neelima
2007-09-01
The problem of recognizing gestures from images using computers can be approached by closely understanding how the human brain tackles it. A full fledged gesture recognition system will substitute mouse and keyboards completely. Humans can recognize most gestures by looking at the characteristic external shape or the silhouette of the fingers. Many previous techniques to recognize gestures dealt with motion and geometric features of hands. In this thesis gestures are recognized by the Codon-list pattern extracted from the object contour. All edges of an image are described in terms of sequence of Codons. The Codons are defined in terms of the relationship between maxima, minima and zeros of curvature encountered as one traverses the boundary of the object. We have concentrated on a catalog of 24 gesture images from the American Sign Language alphabet (Letter J and Z are ignored as they are represented using motion) [2]. The query image given as an input to the system is analyzed and tested against the Codon-lists, which are shape descriptors for external parts of a hand gesture. We have used the Weighted Frequency Indexing Transform (WFIT) approach which is used in DNA sequence matching for matching the Codon-lists. The matching algorithm consists of two steps: 1) the query sequences are converted to short sequences and are assigned weights and, 2) all the sequences of query gestures are pruned into match and mismatch subsequences by the frequency indexing tree based on the weights of the subsequences. The Codon sequences with the most weight are used to determine the most precise match. Once a match is found, the identified gesture and corresponding interpretation are shown as output.
Mondal, Sunil Kanti; Kundu, Sudip; Das, Rabindranath; Roy, Sujit
2016-08-01
Bacteria and archaea have evolved with the ability to fix atmospheric dinitrogen in the form of ammonia, catalyzed by the nitrogenase enzyme complex which comprises three structural genes nifK, nifD and nifH. The nifK and nifD encodes for the beta and alpha subunits, respectively, of component 1, while nifH encodes for component 2 of nitrogenase. Phylogeny based on nifDHK have indicated that Cyanobacteria is closer to Proteobacteria alpha and gamma but not supported by the tree based on 16SrRNA. The evolutionary ancestor for the different trees was also different. The GC1 and GC2% analysis showed more consistency than GC3% which appeared to below for Firmicutes, Cyanobacteria and Euarchaeota while highest in Proteobacteria beta and clearly showed the proportional effect on the codon usage with a few exceptions. Few genes from Firmicutes, Euryarchaeota, Proteobacteria alpha and delta were found under mutational pressure. These nif genes with low and high GC3% from different classes of organisms showed similar expected number of codons. Distribution of the genes and codons, based on codon usage demonstrated opposite pattern for different orientation of mirror plane when compared with each other. Overall our results provide a comprehensive analysis on the evolutionary relationship of the three structural nif genes, nifK, nifD and nifH, respectively, in the context of codon usage bias, GC content relationship and amino acid composition of the encoded proteins and exploration of crucial statistical method for the analysis of positive data with non-constant variance to identify the shape factors of codon adaptation index.
Increasing the fidelity of noncanonical amino acid incorporation in cell-free protein synthesis.
Gan, Qinglei; Fan, Chenguang
2017-11-01
Cell-free protein synthesis provides a robust platform for co-translational incorporation of noncanonical amino acid (ncAA) into proteins to facilitate biological studies and biotechnological applications. Recently, eliminating the activity of release factor 1 has been shown to increase ncAA incorporation in response to amber codons. However, this approach could promote mis-incorporation of canonical amino acids by near cognate suppression. We performed a facile protocol to remove near cognate tRNA isoacceptors of the amber codon from total tRNAs, and used the phosphoserine (Sep) incorporation system as validation. By manipulating codon usage of target genes and tRNA species introduced into the cell-free protein synthesis system, we increased the fidelity of Sep incorporation at a specific position. By removing three near cognate tRNA isoacceptors of the amber stop codon [tRNA Lys , tRNA Tyr , and tRNA Gln (CUG)] from the total tRNA, the near cognate suppression decreased by 5-fold without impairing normal protein synthesis in the cell-free protein synthesis system. Mass spectrometry analyses indicated that the fidelity of ncAA incorporation was improved. Removal of near cognate tRNA isoacceptors of the amber codon could increase ncAA incorporation fidelity towards the amber stop codon in release factor deficiency systems. We provide a general strategy to improve fidelity of ncAA incorporation towards stop, quadruplet and sense codons in cell-free protein synthesis systems. This article is part of a Special Issue entitled "Biochemistry of Synthetic Biology - Recent Developments" Guest Editor: Dr. Ilka Heinemann and Dr. Patrick O'Donoghue. Copyright © 2016 Elsevier B.V. All rights reserved.
Genotyping of beta thalassemia trait by high-resolution DNA melting analysis.
Saetung, Rattika; Ongchai, Siriwan; Charoenkwan, Pimlak; Sanguansermsri, Torpong
2013-11-01
Beta thalassemia is a common hereditary hemalogogical disease in Thailand, with a prevalence of 5-8%. In this study, we evaluated the high resolution DNA melting (HRM) assay to identify beta thalassemia mutation in samples from 143 carriers of the beta thalassemia traits in at risk couples. The DNA was isolated from venous blood samples and tested for mutation under a series of 5 PCR-HRM (A, B, C, D and E primers) protocols. The A primers were for detection of beta thalassemia mutations in the HBB promoter region, the B primers for mutations in exon I, the C primers for exon II, the D primers for exon III and the E primers for the 3.4 kb deletion mutation. The mutations were diagnosed by comparing the complete melting curve profiles of a wild type control with those for each mutant sample. With the PCR-HRM technique, fourteen types of beta thalassemia mutations were detected. Each mutation had a unique and specific melting profile. The mutations included 36.4% (52 cases) codon 41/42-CTTT, 26.6% (38 cases) codon 17 A-T, 11.2% (16 cases) IVS1-1 G-T, 8.4% (12 cases) codon 71/72 +A, 8.4% (12 cases) of the 3.4 kb deletion and 3.5% (5 cases) -28 A-G. The remainder included one instance each of -87 C-A, -31 A-C, codon 27/28 +C, codon 30 G-A, IVS1-5 G-C, codon 35 C-A, codon 41-C and IVSII -654 C-T. Of the total cases, 85.8% of the mutations could be detected by primers B and C. The PCR-HRM method provides a rapid, simple and highly feasible strategy for mutation screening of beta thalassemia traits.
Levin-Karp, Ayelet; Barenholz, Uri; Bareia, Tasneem; Dayagi, Michal; Zelcbuch, Lior; Antonovsky, Niv; Noor, Elad; Milo, Ron
2013-06-21
Translational coupling is the interdependence of translation efficiency of neighboring genes encoded within an operon. The degree of coupling may be quantified by measuring how the translation rate of a gene is modulated by the translation rate of its upstream gene. Translational coupling was observed in prokaryotic operons several decades ago, but the quantitative range of modulation translational coupling leads to and the factors governing this modulation were only partially characterized. In this study, we systematically quantify and characterize translational coupling in E. coli synthetic operons using a library of plasmids carrying fluorescent reporter genes that are controlled by a set of different ribosome binding site (RBS) sequences. The downstream gene expression level is found to be enhanced by the upstream gene expression via translational coupling with the enhancement level varying from almost no coupling to over 10-fold depending on the upstream gene's sequence. Additionally, we find that the level of translational coupling in our system is similar between the second and third locations in the operon. The coupling depends on the distance between the stop codon of the upstream gene and the start codon of the downstream gene. This study is the first to systematically and quantitatively characterize translational coupling in a synthetic E. coli operon. Our analysis will be useful in accurate manipulation of gene expression in synthetic biology and serves as a step toward understanding the mechanisms involved in translational expression modulation.
Modulation of c-fms proto-oncogene in an ovarian carcinoma cell line by a hammerhead ribozyme.
Yokoyama, Y.; Morishita, S.; Takahashi, Y.; Hashimoto, M.; Tamaya, T.
1997-01-01
Co-expression of macrophage colony-stimulating factor (M-CSF) and its receptor (c-fms) is often found in ovarian epithelial carcinoma, suggesting the existence of autocrine regulation of cell growth by M-CSF. To block this autocrine loop, we have developed hammerhead ribozymes against c-fms mRNA. As target sites of the ribozyme, we chose the GUC sequence in codon 18 and codon 27 of c-fms mRNA. Two kinds of ribozymes were able to cleave an artificial c-fms RNA substrate in a cell-free system, although the ribozyme against codon 18 was much more efficient than that against codon 27. We next constructed an expression vector carrying a ribozyme sequence that targeted the GUC sequence in codon 18 of c-fms mRNA. It was introduced into TYK-nu cells that expressed M-CSF and its receptor. Its transfectant showed a reduced growth potential. The expression levels of c-fms protein and mRNA in the transfectant were clearly decreased with the expression of ribozyme RNA compared with that of an untransfected control or a transfectant with the vector without the ribozyme sequence. These results suggest that the ribozyme against GUC in codon 18 of c-fms mRNA is a promising tool for blocking the autocrine loop of M-CSF in ovarian epithelial carcinoma. Images Figure 2 Figure 3 Figure 5 Figure 6 PMID:9376277
Effects of tRNA modification on translational accuracy depend on intrinsic codon-anticodon strength.
Manickam, Nandini; Joshi, Kartikeya; Bhatt, Monika J; Farabaugh, Philip J
2016-02-29
Cellular health and growth requires protein synthesis to be both efficient to ensure sufficient production, and accurate to avoid producing defective or unstable proteins. The background of misreading error frequency by individual tRNAs is as low as 2 × 10(-6) per codon but is codon-specific with some error frequencies above 10(-3) per codon. Here we test the effect on error frequency of blocking post-transcriptional modifications of the anticodon loops of four tRNAs in Escherichia coli. We find two types of responses to removing modification. Blocking modification of tRNA(UUC)(Glu) and tRNA(QUC)(Asp) increases errors, suggesting that the modifications act at least in part to maintain accuracy. Blocking even identical modifications of tRNA(UUU)(Lys) and tRNA(QUA)(Tyr) has the opposite effect of decreasing errors. One explanation could be that the modifications play opposite roles in modulating misreading by the two classes of tRNAs. Given available evidence that modifications help preorder the anticodon to allow it to recognize the codons, however, the simpler explanation is that unmodified 'weak' tRNAs decode too inefficiently to compete against cognate tRNAs that normally decode target codons, which would reduce the frequency of misreading. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
The effect of tRNA levels on decoding times of mRNA codons.
Dana, Alexandra; Tuller, Tamir
2014-08-01
The possible effect of transfer ribonucleic acid (tRNA) concentrations on codons decoding time is a fundamental biomedical research question; however, due to a large number of variables affecting this process and the non-direct relation between them, a conclusive answer to this question has eluded so far researchers in the field. In this study, we perform a novel analysis of the ribosome profiling data of four organisms which enables ranking the decoding times of different codons while filtering translational phenomena such as experimental biases, extreme ribosomal pauses and ribosome traffic jams. Based on this filtering, we show for the first time that there is a significant correlation between tRNA concentrations and the codons estimated decoding time both in prokaryotes and in eukaryotes in natural conditions (-0.38 to -0.66, all P values <0.006); in addition, we show that when considering tRNA concentrations, codons decoding times are not correlated with aminoacyl-tRNA levels. The reported results support the conjecture that translation efficiency is directly influenced by the tRNA levels in the cell. Thus, they should help to understand the evolution of synonymous aspects of coding sequences via the adaptation of their codons to the tRNA pool. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Seligmann, Hervé
2018-05-01
Genetic codes mainly evolve by reassigning punctuation codons, starts and stops. Previous analyses assuming that undefined amino acids translate stops showed greater divergence between nuclear and mitochondrial genetic codes. Here, three independent methods converge on which amino acids translated stops at split between nuclear and mitochondrial genetic codes: (a) alignment-free genetic code comparisons inserting different amino acids at stops; (b) alignment-based blast analyses of hypothetical peptides translated from non-coding mitochondrial sequences, inserting different amino acids at stops; (c) biases in amino acid insertions at stops in proteomic data. Hence short-term protein evolution models reconstruct long-term genetic code evolution. Mitochondria reassign stops to amino acids otherwise inserted at stops by codon-anticodon mismatches (near-cognate tRNAs). Hence dual function (translation termination and translation by codon-anticodon mismatch) precedes mitochondrial reassignments of stops to amino acids. Stop ambiguity increases coded information, compensates endocellular mitogenome reduction. Mitochondrial codon reassignments might prevent viral infections. Copyright © 2018 Elsevier B.V. All rights reserved.
Mohammadzadeh, Sara; Roohvand, Farzin; Memarnejadian, Arash; Jafari, Anis; Ajdary, Soheila; Salmanian, Ali-Hatef; Ehsani, Parastoo
2016-01-01
Plants transformed by virus-based vectors have emerged as promising tools to rapidly express large amounts and inexpensive antigens in transient condition. We studied the possibility of transient-expression of an HBsAg-fused polytopic construct (HCVpc) [containing H-2d and HLA-A2-restricted CD8+CTL-epitopic peptides of C (Core; aa 132-142), E6 (Envelope2; aa 614-622), N (NS3; aa 1406-1415), and E4 (Envelope2; aa 405-414) in tandem of CE6NE4] in tobacco (Nicotiana tabacum) leaves for the development of a plant-based HCV vaccine. A codon-optimized gene encoding the Kozak sequence, hexahistidine (6×His)-tag peptide, and HCVpc in tandem was designed, chemically synthesized, fused to HBsAg gene, and inserted into Potato virus X (PVX-GW) vector under the control of duplicated PVX coat protein promoter (CPP). The resulted recombinant plasmids (after confirmation by restriction and sequencing analyses) were transferred into Agrobacterium tumefaciens strain GV3101 and vacuum infiltrated into tobacco leaves. The effect of gene-silencing suppressor, p19 protein from tomato bushy stunt virus, on the expression yield of HCVpc-HBsAg was also evaluated by co-infiltration of a p19 expression vector. Codon-optimized gene increased adaptation index (CAI) value (from 0.61 to 0.92) in tobacco. The expression of the HCVpc-HBsAg was confirmed by western blot and HBsAg-based detection ELISA on total extractable proteins of tobacco leaves. The expression level of the fusion protein was significantly higher in p19 co-agroinfiltrated plants. The results indicated the possibility of expression of HCVpc-HBsAg constructs with proper protein conformations in tobacco for final application as a plant-derived HCV vaccine.
Babcock, Gregory J.; Esshaki, Diana J.; Thomas, William D.; Ambrosino, Donna M.
2004-01-01
A novel coronavirus, severe acute respiratory syndrome coronavirus (SARS-CoV), has recently been identified as the causative agent of severe acute respiratory syndrome (SARS). SARS-CoV appears similar to other coronaviruses in both virion structure and genome organization. It is known for other coronaviruses that the spike (S) glycoprotein is required for both viral attachment to permissive cells and for fusion of the viral envelope with the host cell membrane. Here we describe the construction and expression of a soluble codon-optimized SARS-CoV S glycoprotein comprising the first 1,190 amino acids of the native S glycoprotein (S1190). The codon-optimized and native S glycoproteins exhibit similar molecular weight as determined by Western blot analysis, indicating that synthetic S glycoprotein is modified correctly in a mammalian expression system. S1190 binds to the surface of Vero E6 cells, a cell permissive to infection, as demonstrated by fluorescence-activated cell sorter analysis, suggesting that S1190 maintains the biologic activity present in native S glycoprotein. This interaction is blocked with serum obtained from recovering SARS patients, indicating that the binding is specific. In an effort to map the ligand-binding domain of the SARS-CoV S glycoprotein, carboxy- and amino-terminal truncations of the S1190 glycoprotein were constructed. Amino acids 270 to 510 were the minimal receptor-binding region of the SARS-CoV S glycoprotein as determined by flow cytometry. We speculate that amino acids 1 to 510 of the SARS-CoV S glycoprotein represent a unique domain containing the receptor-binding site (amino acids 270 to 510), analogous to the S1 subunit of other coronavirus S glycoproteins. PMID:15078936
Efficient CRISPR/Cas9-based genome editing in carrot cells.
Klimek-Chodacka, Magdalena; Oleszkiewicz, Tomasz; Lowder, Levi G; Qi, Yiping; Baranski, Rafal
2018-04-01
The first report presenting successful and efficient carrot genome editing using CRISPR/Cas9 system. Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)/CRISPR-associated (Cas9) is a powerful genome editing tool that has been widely adopted in model organisms recently, but has not been used in carrot-a model species for in vitro culture studies and an important health-promoting crop grown worldwide. In this study, for the first time, we report application of the CRISPR/Cas9 system for efficient targeted mutagenesis of the carrot genome. Multiplexing CRISPR/Cas9 vectors expressing two single-guide RNA (gRNAs) targeting the carrot flavanone-3-hydroxylase (F3H) gene were tested for blockage of the anthocyanin biosynthesis in a model purple-colored callus using Agrobacterium-mediated genetic transformation. This approach allowed fast and visual comparison of three codon-optimized Cas9 genes and revealed that the most efficient one in generating F3H mutants was the Arabidopsis codon-optimized AteCas9 gene with up to 90% efficiency. Knockout of F3H gene resulted in the discoloration of calli, validating the functional role of this gene in the anthocyanin biosynthesis in carrot as well as providing a visual marker for screening successfully edited events. Most resulting mutations were small Indels, but long chromosome fragment deletions of 116-119 nt were also generated with simultaneous cleavage mediated by two gRNAs. The results demonstrate successful site-directed mutagenesis in carrot with CRISPR/Cas9 and the usefulness of a model callus culture to validate genome editing systems. Given that the carrot genome has been sequenced recently, our timely study sheds light on the promising application of genome editing tools for boosting basic and translational research in this important vegetable crop.
Rahpeyma, Mehdi; Samarbaf-Zadeh, Alireza; Makvandi, Manoochehr; Ghadiri, Ata A; Dowall, Stuart D; Fotouhi, Fatemeh
2017-07-01
Crimean-Congo hemorrhagic fever virus (CCHFV) is a major cause of tick-borne viral hemorrhagic disease in the world. Despite of its importance as a deadly pathogen, there is currently no licensed vaccine against CCHF disease. The attachment glycoprotein of CCHFV (Gn) is a potentially important target for protective antiviral immune responses. To characterize the expression of recombinant CCHFV Gn in an insect-cell-based system, we developed a gene expression system expressing the full-length coding sequence under a polyhedron promoter in Sf9 cells using recombinant baculovirus. Recombinant Gn was purified by affinity chromatography, and the immunoreactivity of the protein was evaluated using sera from patients with confirmed CCHF infection. Codon-optimized Gn was successfully expressed, and the product had the expected molecular weight for CCHFV Gn glycoprotein of 37 kDa. In time course studies, the optimum expression of Gn occurred between 36 and 48 hours postinfection. The immunoreactivity of the recombinant protein in Western blot assay against human sera was positive and was similar to the results obtained with the anti-V5 tag antibody. Additionally, mice were subjected to subcutaneous injection with recombinant Gn, and the cellular and humoral immune response was monitored. The results showed that recombinant Gn protein was highly immunogenic and could elicit high titers of antigen-specific antibodies. Induction of the inflammatory cytokine interferon-gamma and the regulatory cytokine IL-10 was also detected. In conclusion, a recombinant baculovirus harboring CCHFV Gn was constructed and expressed in Sf9 host cells for the first time, and it was demonstrated that this approach is a suitable expression system for producing immunogenic CCHFV Gn protein without any biosafety concerns.
Loiacono, Monica; Martino, Piera A; Albonico, Francesca; Dell'Orco, Francesca; Ferretti, Manuela; Zanzani, Sergio; Mortarino, Michele
2017-09-01
Staphylococcus pseudintermedius is an opportunistic pathogen of dogs and cats. A high-resolution melting analysis (HRMA) protocol was designed and tested on 42 clinical isolates with known fluoroquinolone (FQ) susceptibility and gyrA codon 84 and grlA codon 80 mutation status. The HRMA approach was able to discriminate between FQ-sensitive and FQ-resistant strains and confirmed previous reports that the main mutation site associated with FQ resistance in S. pseudintermedius is located at position 251 (Ser84Leu) of gyrA. Routine, HRMA-based FQ susceptibility profiles may be a valuable tool to guide therapy. The FQ resistance-predictive power of the assay should be tested in a significantly larger number of isolates.
Vanlalruati, Catherine; Mandal, Surajit De; Gurusubramanian, Guruswami; Senthil Kumar, Nachimuthu
2016-07-01
The complete mitochondrial genome of Junonia iphita was determined to be 15,433 bp in length, including 37 typical mitochondrial genes and an AT-rich region. All the protein coding genes (PCGs) are initiated by typical ATN codons, except cox1 gene that is by CGA codon. Eight genes use complete termination codon (TAA), whereas the cox1, cox2 and nad5 genes end with single T; nad4 and nad1 ends with stop codon TA. All the tRNA show secondary cloverleaf structures except trnS1 (AGN). The A + T rich region is 546 bp in length containing ATAGA motif followed by a 18 bp poly-T stretch, two microsatellite-like (TA)9 elements and 8 bp poly-A stretch immediately upstream of trnM gene.
Ivanov, Ivaylo P.; Loughran, Gary; Atkins, John F.
2008-01-01
In a minority of eukaryotic mRNAs, a small functional upstream ORF (uORF), often performing a regulatory role, precedes the translation start site for the main product(s). Here, conserved uORFs in numerous ornithine decarboxylase homologs are identified from yeast to mammals. Most have noncanonical evolutionarily conserved start codons, the main one being AUU, which has not been known as an initiator for eukaryotic chromosomal genes. The AUG-less uORF present in mouse antizyme inhibitor, one of the ornithine decarboxylase homologs in mammals, mediates polyamine-induced repression of the downstream main ORF. This repression is part of an autoregulatory circuit, and one of its sensors is the AUU codon, which suggests that translation initiation codon identity is likely used for regulation in eukaryotes. PMID:18626014
Production of Phloroglucinol, a Platform Chemical, in Arabidopsis using a Bacterial Gene.
Abdel-Ghany, Salah E; Day, Irene; Heuberger, Adam L; Broeckling, Corey D; Reddy, Anireddy S N
2016-12-07
Phloroglucinol (1,3,5-trihydroxybenzene; PG) and its derivatives are phenolic compounds that are used for various industrial applications. Current methods to synthesize PG are not sustainable due to the requirement for carbon-based precursors and co-production of toxic byproducts. Here, we describe a more sustainable production of PG using plants expressing a native bacterial or a codon-optimized synthetic PhlD targeted to either the cytosol or chloroplasts. Transgenic lines were analyzed for the production of PG using gas and liquid chromatography coupled to mass spectroscopy. Phloroglucinol was produced in all transgenic lines and the line with the highest PhlD transcript level showed the most accumulation of PG. Over 80% of the produced PG was glycosylated to phlorin. Arabidopsis leaves have the machinery to glycosylate PG to form phlorin, which can be hydrolyzed enzymatically to produce PG. Furthermore, the metabolic profile of plants with PhlD in either the cytosol or chloroplasts was altered. Our results provide evidence that plants can be engineered to produce PG using a bacterial gene. Phytoproduction of PG using a bacterial gene paves the way for further genetic manipulations to enhance the level of PG with implications for the commercial production of this important platform chemical in plants.
Stable Isotopes, Quantum Computing and Consciousness
NASA Astrophysics Data System (ADS)
Berezin, Alexander A.
2000-10-01
Recent proposals of quantum computing/computers (QC) based on nuclear spins suggest that consciousness (CON) activity may be related (assisted) to subset of C13 atoms incorporated randomly, or quasirandomly, in neural structures. Consider two DNA chains. Even if they are completely identical chemically (same sequence of codons), patterns of 12C and 13C isotopes in them are different (possible origin of personal individuality). Perhaps it is subsystem of nuclear spins of 13C "sublattice" which forms dynamical system capable of QC and on which CON is "spanned". Some issues related to this hypothesis are: (1) existence of CON-driven positional correlations among C13 atoms, (2) motion (hopping) of C13 via enhanced neutron tunneling, cf. quantum "anti Zeno-effect", (3) possible optimization of concentration of QC-active C13 atoms above their standard isotopic abundance, (4) characteristic time-scales for operation of C13-based QC (perrhaps, broad range of scales), (5) reflection of QC dynamics of C13 on CON, (6) possibility that C13-based QC operates "above" level of "regular" CON (perhaps, Jungian sub/super-CON), (7) isotopicity as connector to universal Library of Patterns ("Platonic World"), (8) self-stabilization of coherence in C13 (sub)system. Some of this questions are, in principle, experimentally addressable through shifting of isotopic abundances.
Herrera, Victoria L M; Steffen, Martin; Moran, Ann Marie; Tan, Glaiza A; Pasion, Khristine A; Rivera, Keith; Pappin, Darryl J; Ruiz-Opazo, Nelson
2016-06-14
In contrast to rat and mouse databases, the NCBI gene database lists the human dual-endothelin1/VEGFsp receptor (DEspR, formerly Dear) as a unitary transcribed pseudogene due to a stop [TGA]-codon at codon#14 in automated DNA and RNA sequences. However, re-analysis is needed given prior single gene studies detected a tryptophan [TGG]-codon#14 by manual Sanger sequencing, demonstrated DEspR translatability and functionality, and since the demonstration of actual non-translatability through expression studies, the standard-of-excellence for pseudogene designation, has not been performed. Re-analysis must meet UNIPROT criteria for demonstration of a protein's existence at the highest (protein) level, which a priori, would override DNA- or RNA-based deductions. To dissect the nucleotide sequence discrepancy, we performed Maxam-Gilbert sequencing and reviewed 727 RNA-seq entries. To comply with the highest level multiple UNIPROT criteria for determining DEspR's existence, we performed various experiments using multiple anti-DEspR monoclonal antibodies (mAbs) targeting distinct DEspR epitopes with one spanning the contested tryptophan [TGG]-codon#14, assessing: (a) DEspR protein expression, (b) predicted full-length protein size, (c) sequence-predicted protein-specific properties beyond codon#14: receptor glycosylation and internalization, (d) protein-partner interactions, and (e) DEspR functionality via DEspR-inhibition effects. Maxam-Gilbert sequencing and some RNA-seq entries demonstrate two guanines, hence a tryptophan [TGG]-codon#14 within a compression site spanning an error-prone compression sequence motif. Western blot analysis using anti-DEspR mAbs targeting distinct DEspR epitopes detect the identical glycosylated 17.5 kDa pull-down protein. Decrease in DEspR-protein size after PNGase-F digest demonstrates post-translational glycosylation, concordant with the consensus-glycosylation site beyond codon#14. Like other small single-transmembrane proteins, mass spectrometry analysis of anti-DEspR mAb pull-down proteins do not detect DEspR, but detect DEspR-protein interactions with proteins implicated in intracellular trafficking and cancer. FACS analyses also detect DEspR-protein in different human cancer stem-like cells (CSCs). DEspR-inhibition studies identify DEspR-roles in CSC survival and growth. Live cell imaging detects fluorescently-labeled anti-DEspR mAb targeted-receptor internalization, concordant with the single internalization-recognition sequence also located beyond codon#14. Data confirm translatability of DEspR, the full-length DEspR protein beyond codon#14, and elucidate DEspR-specific functionality. Along with detection of the tryptophan [TGG]-codon#14 within an error-prone compression site, cumulative data demonstrating DEspR protein existence fulfill multiple UNIPROT criteria, thus refuting its pseudogene designation.
Polymorphism at codon 36 of the p53 gene.
Felix, C A; Brown, D L; Mitsudomi, T; Ikagaki, N; Wong, A; Wasserman, R; Womer, R B; Biegel, J A
1994-01-01
A polymorphism at codon 36 in exon 4 of the p53 gene was identified by single strand conformation polymorphism (SSCP) analysis and direct sequencing of genomic DNA PCR products. The polymorphic allele, present in the heterozygous state in genomic DNAs of four of 100 individuals (4%), changes the codon 36 CCG to CCA, eliminates a FinI restriction site and creates a BccI site. Including this polymorphism there are four known polymorphisms in the p53 coding sequence.
Genomic adaptation of the ISA virus to Salmo salar codon usage
2013-01-01
Background The ISA virus (ISAV) is an Orthomyxovirus whose genome encodes for at least 10 proteins. Low protein identity and lack of genetic tools have hampered the study of the molecular mechanism behind its virulence. It has been shown that viral codon usage controls several processes such as translational efficiency, folding, tuning of protein expression, antigenicity and virulence. Despite this, the possible role that adaptation to host codon usage plays in virulence and viral evolution has not been studied in ISAV. Methods Intergenomic adaptation between viral and host genomes was calculated using the codon adaptation index score with EMBOSS software and the Kazusa database. Classification of host genes according to GeneOnthology was performed using Blast2go. A non parametric test was applied to determine the presence of significant correlations among CAI, mortality and time. Results Using the codon adaptation index (CAI) score, we found that the encoding genes for nucleoprotein, matrix protein M1 and antagonist of Interferon I signaling (NS1) are the ISAV genes that are more adapted to host codon usage, in agreement with their requirement for production of viral particles and inactivation of antiviral responses. Comparison to host genes showed that ISAV shares CAI values with less than 0.45% of Salmo salar genes. GeneOntology classification of host genes showed that ISAV genes share CAI values with genes from less than 3% of the host biological process, far from the 14% shown by Influenza A viruses and closer to the 5% shown by Influenza B and C. As well, we identified a positive correlation (p<0.05) between CAI values of a virus and the duration of the outbreak disease in given salmon farms, as well as a weak relationship between codon adaptation values of PB1 and the mortality rates of a set of ISA viruses. Conclusions Our analysis shows that ISAV is the least adapted viral Salmo salar pathogen and Orthomyxovirus family member less adapted to host codon usage, avoiding the general behavior of host genes. This is probably due to its recent emergence among farmed Salmon populations. PMID:23829271
Genomic adaptation of the ISA virus to Salmo salar codon usage.
Tello, Mario; Vergara, Francisco; Spencer, Eugenio
2013-07-05
The ISA virus (ISAV) is an Orthomyxovirus whose genome encodes for at least 10 proteins. Low protein identity and lack of genetic tools have hampered the study of the molecular mechanism behind its virulence. It has been shown that viral codon usage controls several processes such as translational efficiency, folding, tuning of protein expression, antigenicity and virulence. Despite this, the possible role that adaptation to host codon usage plays in virulence and viral evolution has not been studied in ISAV. Intergenomic adaptation between viral and host genomes was calculated using the codon adaptation index score with EMBOSS software and the Kazusa database. Classification of host genes according to GeneOnthology was performed using Blast2go. A non parametric test was applied to determine the presence of significant correlations among CAI, mortality and time. Using the codon adaptation index (CAI) score, we found that the encoding genes for nucleoprotein, matrix protein M1 and antagonist of Interferon I signaling (NS1) are the ISAV genes that are more adapted to host codon usage, in agreement with their requirement for production of viral particles and inactivation of antiviral responses. Comparison to host genes showed that ISAV shares CAI values with less than 0.45% of Salmo salar genes. GeneOntology classification of host genes showed that ISAV genes share CAI values with genes from less than 3% of the host biological process, far from the 14% shown by Influenza A viruses and closer to the 5% shown by Influenza B and C. As well, we identified a positive correlation (p<0.05) between CAI values of a virus and the duration of the outbreak disease in given salmon farms, as well as a weak relationship between codon adaptation values of PB1 and the mortality rates of a set of ISA viruses. Our analysis shows that ISAV is the least adapted viral Salmo salar pathogen and Orthomyxovirus family member less adapted to host codon usage, avoiding the general behavior of host genes. This is probably due to its recent emergence among farmed Salmon populations.
YrdC exhibits properties expected of a subunit for a tRNA threonylcarbamoyl transferase.
Harris, Kimberly A; Jones, Victoria; Bilbille, Yann; Swairjo, Manal A; Agris, Paul F
2011-09-01
The post-transcriptional nucleoside modifications of tRNA's anticodon domain form the loop structure and dynamics required for effective and accurate recognition of synonymous codons. The N(6)-threonylcarbamoyladenosine modification at position 37 (t(6)A(37)), 3'-adjacent to the anticodon, of many tRNA species in all organisms ensures the accurate recognition of ANN codons by increasing codon affinity, enhancing ribosome binding, and maintaining the reading frame. However, biosynthesis of this complex modification is only partially understood. The synthesis requires ATP, free threonine, a single carbon source for the carbamoyl, and an enzyme yet to be identified. Recently, the universal protein family Sua5/YciO/YrdC was associated with t(6)A(37) biosynthesis. To further investigate the role of YrdC in t(6)A(37) biosynthesis, the interaction of the Escherichia coli YrdC with a heptadecamer anticodon stem and loop of lysine tRNA (ASL(Lys)(UUU)) was examined. YrdC bound the unmodified ASL(Lys)(UUU) with high affinity compared with the t(6)A(37)-modified ASL(Lys)(UUU) (K(d) = 0.27 ± 0.20 μM and 1.36 ± 0.39 μM, respectively). YrdC also demonstrated specificity toward the unmodified versus modified anticodon pentamer UUUUA and toward threonine and ATP. The protein did not significantly alter the ASL architecture, nor was it able to base flip A(37), as determined by NMR, circular dichroism, and fluorescence of 2-aminopuine at position 37. Thus, current data support the hypothesis that YrdC, with many of the properties of a putative threonylcarbamoyl transferase, most likely functions as a component of a heteromultimeric protein complex for t(6)A(37) biosynthesis.
The future of human DNA vaccines
Li, Lei; Saade, Fadi; Petrovsky, Nikolai
2012-01-01
DNA vaccines have evolved greatly over the last 20 years since their invention, but have yet to become a competitive alternative to conventional protein or carbohydrate based human vaccines. Whilst safety concerns were an initial barrier, the Achilles heel of DNA vaccines remains their poor immunogenicity when compared to protein vaccines. A wide variety of strategies have been developed to optimize DNA vaccine immunogenicity, including codon optimization, genetic adjuvants, electroporation and sophisticated prime-boost regimens, with each of these methods having its advantages and limitations. Whilst each of these methods has contributed to incremental improvements in DNA vaccine efficacy, more is still needed if human DNA vaccines are to succeed commercially. This review foresees a final breakthrough in human DNA vaccines will come from application of the latest cutting-edge technologies, including “epigenetics” and “omics” approaches, alongside traditional techniques to improve immunogenicity such as adjuvants and electroporation, thereby overcoming the current limitations of DNA vaccines in humans PMID:22981627
van der Woude, Aniek D; Perez Gallego, Ruth; Vreugdenhil, Angie; Puthan Veetil, Vinod; Chroumpi, Tania; Hellingwerf, Klaas J
2016-04-08
Erythritol is a polyol that is used in the food and beverage industry. Due to its non-caloric and non-cariogenic properties, the popularity of this sweetener is increasing. Large scale production of erythritol is currently based on conversion of glucose by selected fungi. In this study, we describe a biotechnological process to produce erythritol from light and CO2, using engineered Synechocystis sp. PCC6803. By functionally expressing codon-optimized genes encoding the erythrose-4-phosphate phosphatase TM1254 and the erythrose reductase Gcy1p, or GLD1, this cyanobacterium can directly convert the Calvin cycle intermediate erythrose-4-phosphate into erythritol via a two-step process and release the polyol sugar in the extracellular medium. Further modifications targeted enzyme expression and pathway intermediates. After several optimization steps, the best strain, SEP024, produced up to 2.1 mM (256 mg/l) erythritol, excreted in the medium.
Expression and refolding of tobacco anionic peroxidase from E. coli inclusion bodies.
Hushpulian, D M; Savitski, P A; Rojkova, A M; Chubar, T A; Fechina, V A; Sakharov, I Yu; Lagrimini, L M; Tishkov, V I; Gazaryan, I G
2003-11-01
Coding DNA of the tobacco anionic peroxidase gene was cloned in pET40b vector. The problem of 11 arginine codons, rare in procaryotes, in the tobacco peroxidase gene was solved using E. coli BL21(DE3) Codon Plus strain. The expression level of the tobacco apo-peroxidase in the above strain was approximately 40% of the total E. coli protein. The tobacco peroxidase refolding was optimized based on the earlier developed protocol for horseradish peroxidase. The reactivation yield of recombinant tobacco enzyme was about 7% with the specific activity of 1100-1200 U/mg towards 2,2;-azino-bis(3-ethylbenzothiazoline-6-sulfonate) (ABTS). It was shown that the reaction of ABTS oxidation by hydrogen peroxide catalyzed by recombinant tobacco peroxidase proceeds via the ping-pong kinetic mechanism as for the native enzyme. In the presence of calcium ions, the recombinant peroxidase exhibits a 2.5-fold decrease in the second order rate constant for hydrogen peroxide and 1.5-fold decrease for ABTS. Thus, calcium ions have an inhibitory effect on the recombinant enzyme like that observed earlier for the native tobacco peroxidase. The data demonstrate that the oligosaccharide part of the enzyme has no effect on the kinetic properties and calcium inhibition of tobacco peroxidase.
Cytochrome oxidase subunit II gene in mitochondria of Oenothera has no intron
Hiesel, Rudolf; Brennicke, Axel
1983-01-01
The cytochrome oxidase subunit II gene has been localized in the mitochondrial genome of Oenothera berteriana and the nucleotide sequence has been determined. The coding sequence contains 777 bp and, unlike the corresponding gene in Zea mays, is not interrupted by an intron. No TGA codon is found within the open reading frame. The codon CGG, as in the maize gene, is used in place of tryptophan codons of corresponding genes in other organisms. At position 742 in the Oenothera sequence the TGG of maize is changed into a CGG codon, where Trp is conserved as the amino acid in other organisms. Homologous sequences occur more than once in the mitochondrial genome as several mitochondrial DNA species hybridize with DNA probes of the cytochrome oxidase subunit II gene. ImagesFig. 5. PMID:16453484
Model for Codon Position Bias in RNA Editing
NASA Astrophysics Data System (ADS)
Liu, Tsunglin; Bundschuh, Ralf
2005-08-01
RNA editing can be crucial for the expression of genetic information via inserting, deleting, or substituting a few nucleotides at specific positions in an RNA sequence. Within coding regions in an RNA sequence, editing usually occurs with a certain bias in choosing the positions of the editing sites. In the mitochondrial genes of Physarum polycephalum, many more editing events have been observed at the third codon position than at the first and second, while in some plant mitochondria the second codon position dominates. Here we propose an evolutionary model that explains this bias as the basis of selection at the protein level. The model predicts a distribution of the three positions rather close to the experimental observation in Physarum. This suggests that the codon position bias in Physarum is mainly a consequence of selection at the protein level.
A model for codon position bias in RNA editing
NASA Astrophysics Data System (ADS)
Bundschuh, Ralf; Liu, Tsunglin
2006-03-01
RNA editing can be crucial for the expression of genetic information via inserting, deleting, or substituting a few nucleotides at specific positions in an RNA sequence. Within coding regions in an RNA sequence, editing usually occurs with a certain bias in choosing the positions of the editing sites. In the mitochondrial genes of Physarum polycephalum, many more editing events have been observed at the third codon position than at the first and second, while in some plant mitochondria the second codon position dominates. Here we propose an evolutionary model that explains this bias as the basis of selection at the protein level. The model predicts a distribution of the three positions rather close to the experimental observation in Physarum. This suggests that the codon position bias in Physarum is mainly a consequence of selection at the protein level.
Molecular investigations of β-thalassemic children in Erbil governorate
NASA Astrophysics Data System (ADS)
Hasan, Ahmad N.; Al-Attar, Mustafa S.
2017-09-01
The present work studies the molecular investigation of 40 thalassemic carriers using polymerase chain reaction. Forty thalassemic carriers who were registered and treated at Erbil thalassemic center and twenty apparently healthy children have been included in the present study. Ages of both groups ranged between 1-18 years. Four primers used to detect four different beta thalassemia mutations they were codon 8/9, codon 8, codon 41/42 and IVS-1-5. The two most common mutations detected among thalassemia group were Cd8/9 with 8 cases (20%) and Cd-8 with 6 cases (15%) followed by codon 41/42 with 4 cases (10%) which investigated and detected for the first time in Erbil governorate through the present study and finally IVS-1-5 with 3 cases (7.5%), while no any cases detected among control group.
Ang, S B L; Hing, W C; Tung, S Y; Park, T
2014-07-01
The Codonics Safe Labeling System(™) (http://www.codonics.com/Products/SLS/flash/) is a piece of equipment that is able to barcode scan medications, read aloud the medication and the concentration and print a label of the appropriate concentration in the appropriate colour code. We decided to test this system in our facility to identify risks, benefits and usability. Our project comprised a baseline survey (25 anaesthesia cases during which 212 syringes were prepared from 223 drugs), an observational study (47 cases with 330 syringes prepared) and a user acceptability survey. The baseline compliance with all labelling requirements was 58%. In the observational study the compliance using the Codonics system was 98.6% versus 63.8% with conventional labelling. In the user acceptability survey the majority agreed the Codonics machine was easy to use, more legible and adhered with better security than the conventional preprinted label. However, most were neutral when asked about the likelihood of flexibility and customisation and were dissatisfied with the increased workload. Our findings suggest that the Codonics labelling machine is user-friendly and it improved syringe labelling compliance in our study. However, staff need to be willing to follow proper labelling workflow rather than batch label during preparation. Future syringe labelling equipment developers need to concentrate on user interface issues to reduce human factor and workflow problems. Support logistics are also an important consideration prior to implementation of any new labelling system.
Romero, Héctor; Zavala, Alejandro; Musto, Héctor
2000-01-01
The patterns of synonymous codon choices of the completely sequenced genome of the bacterium Chlamydia trachomatis were analysed. We found that the most important source of variation among the genes results from whether the sequence is located on the leading or lagging strand of replication, resulting in an over representation of G or C, respectively. This can be explained by different mutational biases associated to the different enzymes that replicate each strand. Next we found that most highly expressed sequences are located on the leading strand of replication. From this result, replicational-transcriptional selection can be invoked. Then, when the genes located on the leading strand are studied separately, the correspondence analysis detects a principal trend which discriminates between lowly and highly expressed sequences, the latter displaying a different codon usage pattern than the former, suggesting selection for translation, which is reinforced by the fact that Ks values between orthologous sequences from C.trachomatis and Chlamydia pneumoniae are much smaller in highly expressed genes. Finally, synonymous codon choices appear to be influenced by the hydropathy of each encoded protein and by the degree of amino acid conservation. Therefore, synonymous codon usage in C.trachomatis seems to be the result of a very complex balance among different factors, which rises the problem of whether the forces driving codon usage patterns among microorganisms are rather more complex than generally accepted. PMID:10773076
Romero, H; Zavala, A; Musto, H
2000-05-15
The patterns of synonymous codon choices of the completely sequenced genome of the bacterium Chlamydia trachomatis were analysed. We found that the most important source of variation among the genes results from whether the sequence is located on the leading or lagging strand of replication, resulting in an over representation of G or C, respectively. This can be explained by different mutational biases associated to the different enzymes that replicate each strand. Next we found that most highly expressed sequences are located on the leading strand of replication. From this result, replicational-transcriptional selection can be invoked. Then, when the genes located on the leading strand are studied separately, the correspondence analysis detects a principal trend which discriminates between lowly and highly expressed sequences, the latter displaying a different codon usage pattern than the former, suggesting selection for translation, which is reinforced by the fact that Ks values between orthologous sequences from C. trachomatis and Chlamydia pneumoniae are much smaller in highly expressed genes. Finally, synonymous codon choices appear to be influenced by the hydropathy of each encoded protein and by the degree of amino acid conservation. Therefore, synonymous codon usage in C.trachomatis seems to be the result of a very complex balance among different factors, which rises the problem of whether the forces driving codon usage patterns among microorganisms are rather more complex than generally accepted.
Recent advances in the production of recombinant subunit vaccines in Pichia pastoris
Wang, Man; Jiang, Shuai; Wang, Yefu
2016-01-01
ABSTRACT Recombinant protein subunit vaccines are formulated using defined protein antigens that can be produced in heterologous expression systems. The methylotrophic yeast Pichia pastoris has become an important host system for the production of recombinant subunit vaccines. Although many basic elements of P. pastoris expression system are now well developed, there is still room for further optimization of protein production. Codon bias, gene dosage, endoplasmic reticulum protein folding and culture condition are important considerations for improved production of recombinant vaccine antigens. Here we comment on current advances in the application of P. pastoris for the synthesis of recombinant subunit vaccines. PMID:27246656
Optimizing Restriction Site Placement for Synthetic Genomes
NASA Astrophysics Data System (ADS)
Montes, Pablo; Memelli, Heraldo; Ward, Charles; Kim, Joondong; Mitchell, Joseph S. B.; Skiena, Steven
Restriction enzymes are the workhorses of molecular biology. We introduce a new problem that arises in the course of our project to design virus variants to serve as potential vaccines: we wish to modify virus-length genomes to introduce large numbers of unique restriction enzyme recognition sites while preserving wild-type function by substitution of synonymous codons. We show that the resulting problem is NP-Complete, give an exponential-time algorithm, and propose effective heuristics, which we show give excellent results for five sample viral genomes. Our resulting modified genomes have several times more unique restriction sites and reduce the maximum gap between adjacent sites by three to nine-fold.
Gu, Yang; Deng, Jieying; Liu, Yanfeng; Li, Jianghua; Shin, Hyun-Dong; Du, Guocheng; Chen, Jian; Liu, Long
2017-10-01
N-acetylglucosamine (GlcNAc) is an important amino sugar extensively used in the healthcare field. In a previous study, the recombinant Bacillus subtilis strain BSGN6-P xylA -glmS-pP43NMK-GNA1 (BN0-GNA1) had been constructed for microbial production of GlcNAc by pathway design and modular optimization. Here, the production of GlcNAc is further improved by rewiring both the glucose transportation and central metabolic pathways. First, the phosphotransferase system (PTS) is blocked by deletion of three genes, yyzE (encoding the PTS system transporter subunit IIA YyzE), ypqE (encoding the PTS system transporter subunit IIA YpqE), and ptsG (encoding the PTS system glucose-specific EIICBA component), resulting in 47.6% increase in the GlcNAc titer (from 6.5 ± 0.25 to 9.6 ± 0.16 g L -1 ) in shake flasks. Then, reinforcement of the expression of the glcP and glcK genes and optimization of glucose facilitator proteins are performed to promote glucose import and phosphorylation. Next, the competitive pathways for GlcNAc synthesis, namely glycolysis, peptidoglycan synthesis pathway, pentose phosphate pathway, and tricarboxylic acid cycle, are repressed by initiation codon-optimization strategies, and the GlcNAc titer in shake flasks is improved from 10.8 ± 0.25 to 13.2 ± 0.31 g L -1 . Finally, the GlcNAc titer is further increased to 42.1 ± 1.1 g L -1 in a 3-L fed-batch bioreactor, which is 1.72-fold that of the original strain, BN0-GNA1. This study shows considerably enhanced GlcNAc production, and the metabolic engineering strategy described here will be useful for engineering other prokaryotic microorganisms for the production of GlcNAc and related molecules. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Jasik, Agnieszka; Reichert, Michal
2006-05-01
This study presents preliminary data on the polymorphism in the prion protein gene of Swiniarka sheep using temperature gradient gel electrophoresis (TGGE). Available data indicate that sensitivity to scrapie is associated with polymorphisms in three codons of prion protein gene: 136,154, and 171. The TGGE method was used to detect point mutations in these codons responsible for sensitivity or resistance to scrapie. This study revealed presence of an allele encoding valine (V) in codon 136, which is associated with high sensitivity to scrapie and occurred in the form of heterozygous allele together with alanine (AV). The highest variability was observed in codon 171, with presence of arginine (R) and glutamine (Q) in the homozygous (RR or QQ) as well as the heterozygous form (RQ). The results of examination of fifty sheep DNA samples with mutations in codons 136, 154, and 171 demonstrated that TGGE can be used as a simple and rapid method to detect mutations in the PrP gene of sheep. Several samples can be run at the same time, making TGGE ideal for the screening of large numbers of samples.
Roymondal, Uttam; Das, Shibsankar; Sahoo, Satyabrata
2009-01-01
We present an expression measure of a gene, devised to predict the level of gene expression from relative codon bias (RCB). There are a number of measures currently in use that quantify codon usage in genes. Based on the hypothesis that gene expressivity and codon composition is strongly correlated, RCB has been defined to provide an intuitively meaningful measure of an extent of the codon preference in a gene. We outline a simple approach to assess the strength of RCB (RCBS) in genes as a guide to their likely expression levels and illustrate this with an analysis of Escherichia coli (E. coli) genome. Our efforts to quantitatively predict gene expression levels in E. coli met with a high level of success. Surprisingly, we observe a strong correlation between RCBS and protein length indicating natural selection in favour of the shorter genes to be expressed at higher level. The agreement of our result with high protein abundances, microarray data and radioactive data demonstrates that the genomic expression profile available in our method can be applied in a meaningful way to the study of cell physiology and also for more detailed studies of particular genes of interest. PMID:19131380
Rous Sarcoma Virus RNA Stability Element Inhibits Deadenylation of mRNAs with Long 3′UTRs
Balagopal, Vidya; Beemon, Karen L.
2017-01-01
All retroviruses use their full-length primary transcript as the major mRNA for Group-specific antigen (Gag) capsid proteins. This results in a long 3′ untranslated region (UTR) downstream of the termination codon. In the case of Rous sarcoma virus (RSV), there is a 7 kb 3′UTR downstream of the gag terminator, containing the pol, env, and src genes. mRNAs containing long 3′UTRs, like those with premature termination codons, are frequently recognized by the cellular nonsense-mediated mRNA decay (NMD) machinery and targeted for degradation. To prevent this, RSV has evolved an RNA stability element (RSE) in the RNA immediately downstream of the gag termination codon. This 400-nt RNA sequence stabilizes premature termination codons (PTCs) in gag. It also stabilizes globin mRNAs with long 3′UTRs, when placed downstream of the termination codon. It is not clear how the RSE stabilizes the mRNA and prevents decay. We show here that the presence of RSE inhibits deadenylation severely. In addition, the RSE also impairs decapping (DCP2) and 5′-3′ exonucleolytic (XRN1) function in knockdown experiments in human cells. PMID:28763028
Identification of the initiation site of poliovirus polyprotein synthesis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dorner, A.J.; Dorner, L.F.; Larsen, G.R.
1982-06-01
The complete nucleotide sequence of poliovirus RNA has a long open reading frame capable of encoding the precursor polyprotein NCVPOO. The first AUG codon in this reading frame is located 743 nucleotides from the 5' end of the RNA and is preceded by eight AUG codons in all three reading frames. Because all proteins that map at the amino terminus of the polyprotein (P1-1a, VPO, and VP4) are blocked at their amino termini and previous studies of ribosome binding have been inconclusive, direct identification of the initiation site of protein synthesis was difficult. We separated and identified all of themore » tryptic peptides of capsid protein VP4 and correlated these peptides with the amino acid sequence predicted to follow the AUG codon at nucleotide 743. Our data indicate that VP4 begins with a blocked glycine that is encoded immediately after the AUG codon at nucleotide 743. An S1 nuclease analysis of poliovirus mRNA failed to reveal a splice in the 5' region. We concluded that synthesis of poliovirus polyprotein is initiated at nucleotide 743, the first AUG codon in the long open reading frame.« less
Selenocysteine incorporation: A trump card in the game of mRNA decay
Shetty, Sumangala P.; Copeland, Paul R.
2015-01-01
The incorporation of the 21st amino acid, selenocysteine (Sec), occurs on mRNAs that harbor in-frame stop codons because the Sec-tRNASec recognizes a UGA codon. This sets up an intriguing interplay between translation elongation, translation termination and the complex machinery that marks mRNAs that contain premature termination codons for degradation, leading to nonsense mediated mRNA decay (NMD). In this review we discuss the intricate and complex relationship between this key quality control mechanism and the process of Sec incorporation in mammals. PMID:25622574
Okombo, John; Mwai, Leah; Kiara, Steven M.; Pole, Lewa; Tetteh, Kevin K. A.; Nzila, Alexis; Marsh, Kevin
2014-01-01
The mechanisms of drug resistance development in the Plasmodium falciparum parasite to lumefantrine (LUM), commonly used in combination with artemisinin, are still unclear. We assessed the polymorphisms of Pfmspdbl2 for associations with LUM activity in a Kenyan population. MSPDBL2 codon 591S was associated with reduced susceptibility to LUM (P = 0.04). The high frequency of Pfmspdbl2 codon 591S in Kenya may be driven by the widespread use of lumefantrine in artemisinin combination therapy (Coartem). PMID:25534732
Álvaro-Benito, Miguel; de Abreu, Miguel; Portillo, Francisco; Sanz-Aparicio, Julia; Fernández-Lobato, María
2010-01-01
Schwanniomyces occidentalis β-fructofuranosidase (Ffase) releases β-fructose from the nonreducing ends of β-fructans and synthesizes 6-kestose and 1-kestose, both considered prebiotic fructooligosaccharides. Analyzing the amino acid sequence of this protein revealed that it includes a serine instead of a leucine at position 196, caused by a nonuniversal decoding of the unique mRNA leucine codon CUG. Substitution of leucine for Ser196 dramatically lowers the apparent catalytic efficiency (kcat/Km) of the enzyme (approximately 1,000-fold), but surprisingly, its transferase activity is enhanced by almost 3-fold, as is the enzymes' specificity for 6-kestose synthesis. The influence of 6 Ffase residues on enzyme activity was analyzed on both the Leu196/Ser196 backgrounds (Trp47, Asn49, Asn52, Ser111, Lys181, and Pro232). Only N52S and P232V mutations improved the transferase activity of the wild-type enzyme (about 1.6-fold). Modeling the transfructosylation products into the active site, in combination with an analysis of the kinetics and transfructosylation reactions, defined a new region responsible for the transferase specificity of the enzyme. PMID:20851958
Akhmaloka; Susilowati, Prima Endang; Subandi; Madayanti, Fida
2008-01-01
Termination translation in Saccharomyces cerevisiae is controlled by two interacting polypeptide chain release factors, eRF1 and eRF3. Two regions in human eRF1, position at 281-305 and position at 411-415, were proposed to be involved on the interaction to eRF3. In this study we have constructed and characterized yeast eRF1 mutant at position 410 (correspond to 415 human eRF1) from tyrosine to serine residue resulting eRF1(Y410S). The mutations did not affect the viability and temperature sensitivity of the cell. The stop codons suppression of the mutant was analyzed in vivo using PGK-stop codon-LACZ gene fusion and showed that the suppression of the mutant was significantly increased in all of codon terminations. The suppression on UAG codon was the highest increased among the stop codons by comparing the suppression of the wild type respectively. In vitro interaction between eRF1 (mutant and wild type) to eRF3 were carried out using eRF1-(His)6 and eRF1(Y410S)-(His)6 expressed in Escherichia coli and indigenous Saccharomyces cerevisiae eRF3. The results showed that the binding affinity of eRF1(Y410S) to eRF3 was decreased up to 20% of the wild type binding affinity. Computer modeling analysis using Swiss-Prot and Amber version 9.0 programs revealed that the overall structure of eRF1(Y410S) has no significant different with the wild type. However, substitution of tyrosine to serine triggered the structural change on the other motif of C-terminal domain of eRF1. The data suggested that increasing stop codon suppression and decreasing of the binding affinity of eRF1(Y410S) were probably due to the slight modification on the structure of the C-terminal domain. PMID:18463713
Speed Controls in Translating Secretory Proteins in Eukaryotes - an Evolutionary Perspective
Mahlab, Shelly; Linial, Michal
2014-01-01
Protein translation is the most expensive operation in dividing cells from bacteria to humans. Therefore, managing the speed and allocation of resources is subject to tight control. From bacteria to humans, clusters of relatively rare tRNA codons at the N′-terminal of mRNAs have been implicated in attenuating the process of ribosome allocation, and consequently the translation rate in a broad range of organisms. The current interpretation of “slow” tRNA codons does not distinguish between protein translations mediated by free- or endoplasmic reticulum (ER)-bound ribosomes. We demonstrate that proteins translated by free- or ER-bound ribosomes exhibit different overall properties in terms of their translation efficiency and speed in yeast, fly, plant, worm, bovine and human. We note that only secreted or membranous proteins with a Signal peptide (SP) are specified by segments of “slow” tRNA at the N′-terminal, followed by abundant codons that are considered “fast.” Such profiles apply to 3100 proteins of the human proteome that are composed of secreted and signal peptide (SP)-assisted membranous proteins. Remarkably, the bulks of the proteins (12,000), or membranous proteins lacking SP (3400), do not have such a pattern. Alternation of “fast” and “slow” codons was found also in proteins that translocate to mitochondria through transit peptides (TP). The differential clusters of tRNA adapted codons is not restricted to the N′-terminal of transcripts. Specifically, Glycosylphosphatidylinositol (GPI)-anchored proteins are unified by clusters of low adapted tRNAs codons at the C′-termini. Furthermore, selection of amino acids types and specific codons was shown as the driving force which establishes the translation demands for the secretory proteome. We postulate that “hard-coded” signals within the secretory proteome assist the steps of protein maturation and folding. Specifically, “speed control” signals for delaying the translation of a nascent protein fulfill the co- and post-translational stages such as membrane translocation, proteins processing and folding. PMID:24391480
Automated design of degenerate codon libraries.
Mena, Marco A; Daugherty, Patrick S
2005-12-01
Degenerate codon libraries are frequently used in protein engineering and evolution studies but are often limited to targeting a small number of positions to adequately limit the search space. To mitigate this, codon degeneracy can be limited using heuristics or previous knowledge of the targeted positions. To automate design of libraries given a set of amino acid sequences, an algorithm (LibDesign) was developed that generates a set of possible degenerate codon libraries, their resulting size, and their score relative to a user-defined scoring function. A gene library of a specified size can then be constructed that is representative of the given amino acid distribution or that includes specific sequences or combinations thereof. LibDesign provides a new tool for automated design of high-quality protein libraries that more effectively harness existing sequence-structure information derived from multiple sequence alignment or computational protein design data.
Position-dependent termination and widespread obligatory frameshifting in Euplotes translation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lobanov, Alexei V.; Heaphy, Stephen M.; Turanov, Anton A.
2016-11-21
The ribosome can change its reading frame during translation in a process known as programmed ribosomal frameshifting. These rare events are supported by complex mRNA signals. However, we found that the ciliates Euplotes crassus and Euplotes focardii exhibit widespread frameshifting at stop codons. 47 different codons preceding stop signals resulted in either +1 or +2 frameshifts, and +1 frameshifting at AAA was the most frequent. The frameshifts showed unusual plasticity and rapid evolution, and had little influence on translation rates. The proximity of a stop codon to the 3' mRNA end, rather than its occurrence or sequence context, appeared tomore » designate termination. Thus, a ‘stop codon’ is not a sufficient signal for translation termination, and the default function of stop codons in Euplotes is frameshifting, whereas termination is specific to certain mRNA positions and probably requires additional factors.« less
Schematic for efficient computation of GC, GC3, and AT3 bias spectra of genome
Rizvi, Ahsan Z; Venu Gopal, T; Bhattacharya, C
2012-01-01
Selection of synonymous codons for an amino acid is biased in protein translation process. This biased selection causes repetition of synonymous codons in structural parts of genome that stands for high N/3 peaks in DNA spectrum. Period-3 spectral property is utilized here to produce a 3-phase network model based on polyphase filterbank concepts for derivation of codon bias spectra (CBS). Modification of parameters in this model can produce GC, GC3, and AT3 bias spectra. Complete schematic in LabVIEW platform is presented here for efficient and parallel computation of GC, GC3, and AT3 bias spectra of genomes alongwith results of CBS patterns. We have performed the correlation coefficient analysis of GC, GC3, and AT3 bias spectra with codon bias patterns of CBS for biological and statistical significance of this model. PMID:22368390
Schematic for efficient computation of GC, GC3, and AT3 bias spectra of genome.
Rizvi, Ahsan Z; Venu Gopal, T; Bhattacharya, C
2012-01-01
Selection of synonymous codons for an amino acid is biased in protein translation process. This biased selection causes repetition of synonymous codons in structural parts of genome that stands for high N/3 peaks in DNA spectrum. Period-3 spectral property is utilized here to produce a 3-phase network model based on polyphase filterbank concepts for derivation of codon bias spectra (CBS). Modification of parameters in this model can produce GC, GC3, and AT3 bias spectra. Complete schematic in LabVIEW platform is presented here for efficient and parallel computation of GC, GC3, and AT3 bias spectra of genomes alongwith results of CBS patterns. We have performed the correlation coefficient analysis of GC, GC3, and AT3 bias spectra with codon bias patterns of CBS for biological and statistical significance of this model.
Engqvist, Martin K M; Nielsen, Jens
2015-08-21
The Ambiguous Nucleotide Tool (ANT) is a desktop application that generates and evaluates degenerate codons. Degenerate codons are used to represent DNA positions that have multiple possible nucleotide alternatives. This is useful for protein engineering and directed evolution, where primers specified with degenerate codons are used as a basis for generating libraries of protein sequences. ANT is intuitive and can be used in a graphical user interface or by interacting with the code through a defined application programming interface. ANT comes with full support for nonstandard, user-defined, or expanded genetic codes (translation tables), which is important because synthetic biology is being applied to an ever widening range of natural and engineered organisms. The Python source code for ANT is freely distributed so that it may be used without restriction, modified, and incorporated in other software or custom data pipelines.
Zhao, Yongzhong; Epstein, Richard J
2013-01-01
Methylation-prone CpG dinucleotides are strongly conserved in the germline, yet are also predisposed to somatic mutation. Here we quantify the relationship between germline codon mutability and somatic carcinogenesis by comparing usage of the nonsense-prone CGA (→TGA) codons in gene groups that differ in apoptotic function; to this end, suppressor genes were subclassified as either apoptotic (gatekeepers) or repair (caretakers). Mutations affecting CGA codons in sporadic tumors proved to be highly asymmetric. Moreover, nonsense mutations were 3-fold more likely to affect gatekeepers than caretakers. In addition, intragenic CGA clustering nonrandomly affected functionally critical regions of gatekeepers. We conclude that human gatekeeper suppressor genes are enriched for nonsense-prone codons, and submit that this germline vulnerability to tumors could reflect in utero selection for a methylation-dependent capability to short-circuit environmental insults that otherwise trigger apoptosis and fetal loss.
The role of modifications in codon discrimination by tRNA(Lys)UUU.
Murphy, Frank V; Ramakrishnan, Venki; Malkiewicz, Andrzej; Agris, Paul F
2004-12-01
The natural modification of specific nucleosides in many tRNAs is essential during decoding of mRNA by the ribosome. For example, tRNA(Lys)(UUU) requires the modification N6-threonylcarbamoyladenosine at position 37 (t(6)A37), adjacent and 3' to the anticodon, to bind AAA in the A site of the ribosomal 30S subunit. Moreover, it can only bind both AAA and AAG lysine codons when doubly modified with t(6)A37 and either 5-methylaminomethyluridine or 2-thiouridine at the wobble position (mnm(5)U34 or s(2)U34). Here we report crystal structures of modified tRNA anticodon stem-loops bound to the 30S ribosomal subunit with lysine codons in the A site. These structures allow the rationalization of how modifications in the anticodon loop enable decoding of both lysine codons AAA and AAG.
Disruption of the Opal Stop Codon Attenuates Chikungunya Virus-Induced Arthritis and Pathology
Jones, Jennifer E.; Long, Kristin M.; Whitmore, Alan C.; Sanders, Wes; Thurlow, Lance R.; Brown, Julia A.; Morrison, Clayton R.; Vincent, Heather; Browning, Christian; Moorman, Nathaniel; Lim, Jean K.
2017-01-01
ABSTRACT Chikungunya virus (CHIKV) is a mosquito-borne alphavirus responsible for several significant outbreaks of debilitating acute and chronic arthritis and arthralgia over the past decade. These include a recent outbreak in the Caribbean islands and the Americas that caused more than 1 million cases of viral arthralgia. Despite the major impact of CHIKV on global health, viral determinants that promote CHIKV-induced disease are incompletely understood. Most CHIKV strains contain a conserved opal stop codon at the end of the viral nsP3 gene. However, CHIKV strains that encode an arginine codon in place of the opal stop codon have been described, and deep-sequencing analysis of a CHIKV isolate from the Caribbean identified both arginine and opal variants within this strain. Therefore, we hypothesized that the introduction of the arginine mutation in place of the opal termination codon may influence CHIKV virulence. We tested this by introducing the arginine mutation into a well-characterized infectious clone of a CHIKV strain from Sri Lanka and designated this virus Opal524R. This mutation did not impair viral replication kinetics in vitro or in vivo. Despite this, the Opal524R virus induced significantly less swelling, inflammation, and damage within the feet and ankles of infected mice. Further, we observed delayed induction of proinflammatory cytokines and chemokines, as well as reduced CD4+ T cell and NK cell recruitment compared to those in the parental strain. Therefore, the opal termination codon plays an important role in CHIKV pathogenesis, independently of effects on viral replication. PMID:29138302
Cassette Series Designed for Live-Cell Imaging of Proteins and High Resolution Techniques in Yeast
Young, Carissa L.; Raden, David L.; Caplan, Jeffrey; Czymmek, Kirk; Robinson, Anne S.
2012-01-01
During the past decade, it has become clear that protein function and regulation are highly dependent upon intracellular localization. Although fluorescent protein variants are ubiquitously used to monitor protein dynamics, localization, and abundance; fluorescent light microscopy techniques often lack the resolution to explore protein heterogeneity and cellular ultrastructure. Several approaches have been developed to identify, characterize, and monitor the spatial localization of proteins and complexes at the sub-organelle level; yet, many of these techniques have not been applied to yeast. Thus, we have constructed a series of cassettes containing codon-optimized epitope tags, fluorescent protein variants that cover the full spectrum of visible light, a TetCys motif used for FlAsH-based localization, and the first evaluation in yeast of a photoswitchable variant – mEos2 – to monitor discrete subpopulations of proteins via confocal microscopy. This series of modules, complete with six different selection markers, provides the optimal flexibility during live-cell imaging and multicolor labeling in vivo. Furthermore, high-resolution imaging techniques include the yeast-enhanced TetCys motif that is compatible with diaminobenzidine photooxidation used for protein localization by electron microscopy and mEos2 that is ideal for super-resolution microscopy. We have examined the utility of our cassettes by analyzing all probes fused to the C-terminus of Sec61, a polytopic membrane protein of the endoplasmic reticulum of moderate protein concentration, in order to directly compare fluorescent probes, their utility and technical applications. Our series of cassettes expand the repertoire of molecular tools available to advance targeted spatiotemporal investigations using multiple live-cell, super-resolution or electron microscopy imaging techniques. PMID:22473760
Reyes-Guzmán, Edwin Alfredo; Poutou-Piñales, Raúl A.; Reyes-Montaño, Edgar Antonio; Pedroza-Rodríguez, Aura Marina; Rodríguez-Vázquez, Refugio; Cardozo-Bernal, Ángela M.
2015-01-01
Lacasses are multicopper oxidases that can catalyze aromatic and non-aromatic compounds concomitantly with reduction of molecular oxygen to water. Fungal laccases have generated a growing interest due to their biotechnological potential applications, such as lignocellulosic material delignification, biopulping and biobleaching, wastewater treatment, and transformation of toxic organic pollutants. In this work we selected fungal genes encoding for laccase enzymes GlLCC1 in Ganoderma lucidum and POXA 1B in Pleurotus ostreatus. These genes were optimized for codon use, GC content, and regions generating secondary structures. Laccase proposed computational models, and their interaction with ABTS [2, 2′-azino-bis(3-ethylbenzothiazoline-6-sulphonic acid)] substrate was evaluated by molecular docking. Synthetic genes were cloned under the control of Pichia pastoris glyceraldehyde-3-phosphate dehydrogenase (GAP) constitutive promoter. P. pastoris X-33 was transformed with pGAPZαA-LaccGluc-Stop and pGAPZαA-LaccPost-Stop constructs. Optimization reduced GC content by 47 and 49% for LaccGluc-Stop and LaccPost-Stop genes, respectively. A codon adaptation index of 0.84 was obtained for both genes. 3D structure analysis using SuperPose revealed LaccGluc-Stop is similar to the laccase crystallographic structure 1GYC of Trametes versicolor. Interaction analysis of the 3D models validated through ABTS, demonstrated higher substrate affinity for LaccPost-Stop, in agreement with our experimental results with enzymatic activities of 451.08 ± 6.46 UL-1 compared to activities of 0.13 ± 0.028 UL-1 for LaccGluc-Stop. This study demonstrated that G. lucidum GlLCC1 and P. ostreatus POXA 1B gene optimization resulted in constitutive gene expression under GAP promoter and α-factor leader in P. pastoris. These are important findings in light of recombinant enzyme expression system utility for environmentally friendly designed expression systems, because of the wide range of substrates that laccases can transform. This contributes to a great gamut of products in diverse settings: industry, clinical and chemical use, and environmental applications. PMID:25611746
Rivera-Hoyos, Claudia M; Morales-Álvarez, Edwin David; Poveda-Cuevas, Sergio Alejandro; Reyes-Guzmán, Edwin Alfredo; Poutou-Piñales, Raúl A; Reyes-Montaño, Edgar Antonio; Pedroza-Rodríguez, Aura Marina; Rodríguez-Vázquez, Refugio; Cardozo-Bernal, Ángela M
2015-01-01
Lacasses are multicopper oxidases that can catalyze aromatic and non-aromatic compounds concomitantly with reduction of molecular oxygen to water. Fungal laccases have generated a growing interest due to their biotechnological potential applications, such as lignocellulosic material delignification, biopulping and biobleaching, wastewater treatment, and transformation of toxic organic pollutants. In this work we selected fungal genes encoding for laccase enzymes GlLCC1 in Ganoderma lucidum and POXA 1B in Pleurotus ostreatus. These genes were optimized for codon use, GC content, and regions generating secondary structures. Laccase proposed computational models, and their interaction with ABTS [2, 2'-azino-bis(3-ethylbenzothiazoline-6-sulphonic acid)] substrate was evaluated by molecular docking. Synthetic genes were cloned under the control of Pichia pastoris glyceraldehyde-3-phosphate dehydrogenase (GAP) constitutive promoter. P. pastoris X-33 was transformed with pGAPZαA-LaccGluc-Stop and pGAPZαA-LaccPost-Stop constructs. Optimization reduced GC content by 47 and 49% for LaccGluc-Stop and LaccPost-Stop genes, respectively. A codon adaptation index of 0.84 was obtained for both genes. 3D structure analysis using SuperPose revealed LaccGluc-Stop is similar to the laccase crystallographic structure 1GYC of Trametes versicolor. Interaction analysis of the 3D models validated through ABTS, demonstrated higher substrate affinity for LaccPost-Stop, in agreement with our experimental results with enzymatic activities of 451.08 ± 6.46 UL-1 compared to activities of 0.13 ± 0.028 UL-1 for LaccGluc-Stop. This study demonstrated that G. lucidum GlLCC1 and P. ostreatus POXA 1B gene optimization resulted in constitutive gene expression under GAP promoter and α-factor leader in P. pastoris. These are important findings in light of recombinant enzyme expression system utility for environmentally friendly designed expression systems, because of the wide range of substrates that laccases can transform. This contributes to a great gamut of products in diverse settings: industry, clinical and chemical use, and environmental applications.
2013-01-01
Background Segment 6 of the ISA virus codes for hemoagglutinin-esterase (HE). This segment is highly variable, with more than 26 variants identified. The major variation is observed in what is called the high polymorphism region (HPR). The role of the different HPR zones in the viral cycle or evolution remains unknown. However viruses that present the HPR0 are avirulent, while viruses with important deletions in this region have been responsible for outbreaks with high mortality rates. In this work, using bioinformatic tools, we examined the influence of different HPRs on the adaptation of HE genes to the host translational machinery and the relationship to observed virulence. Methods Translational efficiency of HE genes and their HPR were estimated analyzing codon-pair bias (CPB), adaptation to host codon use (codon adaptation index - CAI) and the adaptation to available tRNAs (tAI). These values were correlated with reported mortality for the respective ISA virus and the ΔG of RNA folding. tRNA abundance was inferred from tRNA gene numbers identified in the Salmo salar genome using tRNAScan-SE. Statistical correlation between data was performed using a non-parametric test. Results We found that HPR0 contains zones with codon pairs of low frequency and low availability of tRNA with respect to salmon codon-pair usage, suggesting that HPR modifies HE translational efficiency. Although calculating tAI was impossible because one third of tRNAs (~60.000) were tRNA-ala, translational efficiency measured by CPB shows that as HPR size increases, the CPB value of the HE gene decreases (P = 2x10-7, ρ = −0.675, n = 63) and that these values correlate positively with the mortality rates caused by the virus (ρ = 0.829, P = 2x10-7, n = 11). The mortality associated with different virus isolates or their corresponding HPR sizes were not related with the ΔG of HPR RNA folding, suggesting that the secondary structure of HPR RNA does not modify virulence. Conclusions Our results suggest that HPR size affects the efficiency of gene translation, which modulates the virulence of the virus by a mechanism similar to that observed in production of live attenuated vaccines through deoptimization of codon-pair usage. PMID:23742749
Oliver, J L; Marín, A; Martínez-Zapater, J M
1990-01-01
During plant evolution, some plastid genes have been moved to the nuclear genome. These transferred genes are now correctly expressed in the nucleus, their products being transported into the chloroplast. We compared the base compositions, the distributions of some dinucleotides and codon usages of transferred, nuclear and chloroplast genes in two dicots and two monocots plant species. Our results indicate that transferred genes have adjusted to nuclear base composition and codon usage, being now more similar to the nuclear genes than to the chloroplast ones in every species analyzed. PMID:2308837
Ochola-Oyier, Lynette Isabella; Okombo, John; Mwai, Leah; Kiara, Steven M; Pole, Lewa; Tetteh, Kevin K A; Nzila, Alexis; Marsh, Kevin
2015-03-01
The mechanisms of drug resistance development in the Plasmodium falciparum parasite to lumefantrine (LUM), commonly used in combination with artemisinin, are still unclear. We assessed the polymorphisms of Pfmspdbl2 for associations with LUM activity in a Kenyan population. MSPDBL2 codon 591S was associated with reduced susceptibility to LUM (P = 0.04). The high frequency of Pfmspdbl2 codon 591S in Kenya may be driven by the widespread use of lumefantrine in artemisinin combination therapy (Coartem). Copyright © 2015, Ochola-Oyier et al.
Shao, Yuan-jun; Hu, Xian-qiong; Peng, Guang-da; Wang, Rui-xian; Gao, Rui-na; Lin, Chao; Shen, Wei-de; Li, Rui; Li, Bing
2012-12-01
The first complete mitochondrial genome (mitogenome) of Tachinidae Exorista sorbillans (Diptera) is sequenced by PCR-based approach. The circular mitogenome is 14,960 bp long and has the representative mitochondrial gene (mt gene) organization and order of Diptera. All protein-coding sequences are initiated with ATN codon; however, the only exception is Cox I gene, which has a 4-bp ATCG putative start codon. Ten of the thirteen protein-coding genes have a complete termination codon (TAA), but the rest are seated on the H strand with incomplete codons. The mitogenome of E. sorbillans is biased toward A+T content at 78.4 %, and the strand-specific bias is in reflection of the third codon positions of mt genes, and their T/C ratios as strand indictor are higher on the H strand more than those on the L strand pointing at any strain of seven Diptera flies. The length of the A+T-rich region of E. sorbillans is 106 bp, including a tandem triple copies of a13-bp fragment. Compared to Haematobia irritans, E. sorbillans holds distant relationship with Drosophila. Phylogenetic topologies based on the amino acid sequences, supporting that E. sorbillans (Tachinidae) is clustered with strains of Calliphoridae and Oestridae, and superfamily Oestroidea are polyphyletic groups with Muscidae in a clade.
Sequence similarity is more relevant than species specificity in probabilistic backtranslation.
Ferro, Alfredo; Giugno, Rosalba; Pigola, Giuseppe; Pulvirenti, Alfredo; Di Pietro, Cinzia; Purrello, Michele; Ragusa, Marco
2007-02-21
Backtranslation is the process of decoding a sequence of amino acids into the corresponding codons. All synthetic gene design systems include a backtranslation module. The degeneracy of the genetic code makes backtranslation potentially ambiguous since most amino acids are encoded by multiple codons. The common approach to overcome this difficulty is based on imitation of codon usage within the target species. This paper describes EasyBack, a new parameter-free, fully-automated software for backtranslation using Hidden Markov Models. EasyBack is not based on imitation of codon usage within the target species, but instead uses a sequence-similarity criterion. The model is trained with a set of proteins with known cDNA coding sequences, constructed from the input protein by querying the NCBI databases with BLAST. Unlike existing software, the proposed method allows the quality of prediction to be estimated. When tested on a group of proteins that show different degrees of sequence conservation, EasyBack outperforms other published methods in terms of precision. The prediction quality of a protein backtranslation methis markedly increased by replacing the criterion of most used codon in the same species with a Hidden Markov Model trained with a set of most similar sequences from all species. Moreover, the proposed method allows the quality of prediction to be estimated probabilistically.
Zaborske, John M.; Bauer DuMont, Vanessa L.; Wallace, Edward W. J.; Pan, Tao; Aquadro, Charles F.; Drummond, D. Allan
2014-01-01
Natural selection favors efficient expression of encoded proteins, but the causes, mechanisms, and fitness consequences of evolved coding changes remain an area of aggressive inquiry. We report a large-scale reversal in the relative translational accuracy of codons across 12 fly species in the Drosophila/Sophophora genus. Because the reversal involves pairs of codons that are read by the same genomically encoded tRNAs, we hypothesize, and show by direct measurement, that a tRNA anticodon modification from guanosine to queuosine has coevolved with these genomic changes. Queuosine modification is present in most organisms but its function remains unclear. Modification levels vary across developmental stages in D. melanogaster, and, consistent with a causal effect, genes maximally expressed at each stage display selection for codons that are most accurate given stage-specific queuosine modification levels. In a kinetic model, the known increased affinity of queuosine-modified tRNA for ribosomes increases the accuracy of cognate codons while reducing the accuracy of near-cognate codons. Levels of queuosine modification in D. melanogaster reflect bioavailability of the precursor queuine, which eukaryotes scavenge from the tRNAs of bacteria and absorb in the gut. These results reveal a strikingly direct mechanism by which recoding of entire genomes results from changes in utilization of a nutrient. PMID:25489848
Hussmann, Jeffrey A; Patchett, Stephanie; Johnson, Arlen; Sawyer, Sara; Press, William H
2015-12-01
Ribosome profiling produces snapshots of the locations of actively translating ribosomes on messenger RNAs. These snapshots can be used to make inferences about translation dynamics. Recent ribosome profiling studies in yeast, however, have reached contradictory conclusions regarding the average translation rate of each codon. Some experiments have used cycloheximide (CHX) to stabilize ribosomes before measuring their positions, and these studies all counterintuitively report a weak negative correlation between the translation rate of a codon and the abundance of its cognate tRNA. In contrast, some experiments performed without CHX report strong positive correlations. To explain this contradiction, we identify unexpected patterns in ribosome density downstream of each type of codon in experiments that use CHX. These patterns are evidence that elongation continues to occur in the presence of CHX but with dramatically altered codon-specific elongation rates. The measured positions of ribosomes in these experiments therefore do not reflect the amounts of time ribosomes spend at each position in vivo. These results suggest that conclusions from experiments in yeast using CHX may need reexamination. In particular, we show that in all such experiments, codons decoded by less abundant tRNAs were in fact being translated more slowly before the addition of CHX disrupted these dynamics.
Hussmann, Jeffrey A.; Patchett, Stephanie; Johnson, Arlen; Sawyer, Sara; Press, William H.
2015-01-01
Ribosome profiling produces snapshots of the locations of actively translating ribosomes on messenger RNAs. These snapshots can be used to make inferences about translation dynamics. Recent ribosome profiling studies in yeast, however, have reached contradictory conclusions regarding the average translation rate of each codon. Some experiments have used cycloheximide (CHX) to stabilize ribosomes before measuring their positions, and these studies all counterintuitively report a weak negative correlation between the translation rate of a codon and the abundance of its cognate tRNA. In contrast, some experiments performed without CHX report strong positive correlations. To explain this contradiction, we identify unexpected patterns in ribosome density downstream of each type of codon in experiments that use CHX. These patterns are evidence that elongation continues to occur in the presence of CHX but with dramatically altered codon-specific elongation rates. The measured positions of ribosomes in these experiments therefore do not reflect the amounts of time ribosomes spend at each position in vivo. These results suggest that conclusions from experiments in yeast using CHX may need reexamination. In particular, we show that in all such experiments, codons decoded by less abundant tRNAs were in fact being translated more slowly before the addition of CHX disrupted these dynamics. PMID:26656907
Masuda, Isao; Matsuzaki, Motomichi; Kita, Kiyoshi
2010-10-01
Diverse mitochondrial (mt) genetic systems have evolved independently of the more uniform nuclear system and often employ modified genetic codes. The organization and genetic system of dinoflagellate mt genomes are particularly unusual and remain an evolutionary enigma. We determined the sequence of full-length cytochrome c oxidase subunit 1 (cox1) mRNA of the earliest diverging dinoflagellate Perkinsus and show that this gene resides in the mt genome. Apparently, this mRNA is not translated in a single reading frame with standard codon usage. Our examination of the nucleotide sequence and three-frame translation of the mRNA suggest that the reading frame must be shifted 10 times, at every AGG and CCC codon, to yield a consensus COX1 protein. We suggest two possible mechanisms for these translational frameshifts: a ribosomal frameshift in which stalled ribosomes skip the first bases of these codons or specialized tRNAs recognizing non-triplet codons, AGGY and CCCCU. Regardless of the mechanism, active and efficient machinery would be required to tolerate the frameshifts predicted in Perkinsus mitochondria. To our knowledge, this is the first evidence of translational frameshifts in protist mitochondria and, by far, is the most extensive case in mitochondria.
Banda, Malathi; Recio, Leslie; Parsons, Barbara L
2013-10-01
Furan is a rodent liver carcinogen, but the mode of action for furan hepatocarcinogenicity is unclear. H-ras codon 61 mutations have been detected in spontaneous liver tumors of B6C3F1 mice, and the fraction of liver tumors carrying H-ras codon 61 CAA to AAA mutation increased in furan-treated mice. Allele-specific competitive blocker PCR (ACB-PCR) has been used previously to quantify early, carcinogen-induced increases in tumor-associated mutations. The present pilot study investigated whether furan drives clonal expansion of pre-existing H-ras mutant cells in B6C3F1 mouse liver. H-ras codon 61 CAA to CTA and CAA to AAA mutations were measured in DNA isolated from liver tissue of female mice treated with 0, 1, 2, 4, or 8 mg furan/kg body weight, five days per week for three weeks, using five mice per treatment group. Spontaneous levels of mutation were low, with two of five control mice having an H-ras codon 61 CTA or AAA mutant fraction (MF) greater than 10(-5) . Several furan-treated mice had H-ras codon 61 AAA or CTA MFs greater than those measured in control mice and lower bound estimates of induced MF were calculated. However, no statistically-significant differences were observed between treatment groups. Therefore, while sustained exposure to furan is carcinogenic, at the early stage of carcinogenesis examined in this study (three weeks), there was not a significant expansion of H-ras mutant cells. Copyright © 2013 Wiley Periodicals, Inc.
L-MPZ, a Novel Isoform of Myelin P0, Is Produced by Stop Codon Readthrough*
Yamaguchi, Yoshihide; Hayashi, Akiko; Campagnoni, Celia W.; Kimura, Akio; Inuzuka, Takashi; Baba, Hiroko
2012-01-01
Myelin protein zero (P0 or MPZ) is a major myelin protein (∼30 kDa) expressed in the peripheral nervous system (PNS) in terrestrial vertebrates. Several groups have detected a P0-related 36-kDa (or 35-kDa) protein that is expressed in the PNS as an antigen for the serum IgG of patients with neuropathy. The molecular structure and function of this 36-kDa protein are, however, still unknown. We hypothesized that the 36-kDa protein may be derived from P0 mRNA by stop codon readthrough. We found a highly conserved region after the regular stop codon in predicted sequences from the 3′-UTR of P0 in higher animals. MS of the 36-kDa protein revealed that both P0 peptides and peptides deduced from the P0 3′-UTR sequence were found among the tryptic fragments. In transfected cells and in an in vitro transcription/translation system, the 36-kDa molecule was also produced from the identical mRNA that produced P0. We designated this 36-kDa molecule as large myelin protein zero (L-MPZ), a novel isoform of P0 that contains an additional domain at the C terminus. In the PNS, L-MPZ was localized in compact myelin. In transfected cells, just like P0, L-MPZ was localized at cell-cell adhesion sites in the plasma membrane. These results suggest that L-MPZ produced by the stop codon readthrough mechanism is potentially involved in myelination. Since this is the first finding of stop codon readthrough in a common mammalian protein, detailed analysis of L-MPZ expression will help to understand the mechanism of stop codon readthrough in mammals. PMID:22457349
José, Marco V; Morgado, Eberto R; Govezensky, Tzipe
2011-07-01
Herein, we rigorously develop novel 3-dimensional algebraic models called Genetic Hotels of the Standard Genetic Code (SGC). We start by considering the primeval RNA genetic code which consists of the 16 codons of type RNY (purine-any base-pyrimidine). Using simple algebraic operations, we show how the RNA code could have evolved toward the current SGC via two different intermediate evolutionary stages called Extended RNA code type I and II. By rotations or translations of the subset RNY, we arrive at the SGC via the former (type I) or via the latter (type II), respectively. Biologically, the Extended RNA code type I, consists of all codons of the type RNY plus codons obtained by considering the RNA code but in the second (NYR type) and third (YRN type) reading frames. The Extended RNA code type II, comprises all codons of the type RNY plus codons that arise from transversions of the RNA code in the first (YNY type) and third (RNR) nucleotide bases. Since the dimensions of remarkable subsets of the Genetic Hotels are not necessarily integer numbers, we also introduce the concept of algebraic fractal dimension. A general decoding function which maps each codon to its corresponding amino acid or the stop signals is also derived. The Phenotypic Hotel of amino acids is also illustrated. The proposed evolutionary paths are discussed in terms of the existing theories of the evolution of the SGC. The adoption of 3-dimensional models of the Genetic and Phenotypic Hotels will facilitate the understanding of the biological properties of the SGC.
Analysis of codon usage in beta-tubulin sequences of helminths.
von Samson-Himmelstjerna, G; Harder, A; Failing, K; Pape, M; Schnieder, T
2003-07-01
Codon usage bias has been shown to be correlated with gene expression levels in many organisms, including the nematode Caenorhabditis elegans. Here, the codon usage (cu) characteristics for a set of currently available beta-tubulin coding sequences of helminths were assessed by calculating several indices, including the effective codon number (Nc), the intrinsic codon deviation index (ICDI), the P2 value and the mutational response index (MRI). The P2 value gives a measure of translational pressure, which has been shown to be correlated to high gene expression levels in some organisms, but it has not yet been analysed in that respect in helminths. For all but two of the C. elegans beta-tubulin coding sequences investigated, the P2 value was the only index that indicated the presence of codon usage bias. Therefore, we propose that in general the helminth beta-tubulin sequences investigated here are not expressed at high levels. Furthermore, we calculated the correlation coefficients for the cu patterns of the helminth beta-tubulin sequences compared with those of highly expressed genes in organisms such as Escherichia coli and C. elegans. It was found that beta-tubulin cu patterns for all sequences of members of the Strongylida were significantly correlated to those for highly expressed C. elegans genes. This approach provides a new measure for comparing the adaptation of cu of a particular coding sequence with that of highly expressed genes in possible expression systems.Finally, using the cu patterns of the sequences studied, a phylogenetic tree was constructed. The topology of this tree was very much in concordance with that of a phylogeny based on small subunit ribosomal DNA sequence alignments.
[Distiller Yeasts Producing Antibacterial Peptides].
Klyachko, E V; Morozkina, E V; Zaitchik, B Ts; Benevolensky, S V
2015-01-01
A new method of controlling lactic acid bacteria contamination was developed with the use of recombinant Saccharomyces cerevisiae strains producing antibacterial peptides. Genes encoding the antibacterial peptides pediocin and plantaricin with codons preferable for S. cerevisiae were synthesized, and a system was constructed for their secretory expression. Recombinant S. cerevisiae strains producing antibacterial peptides effectively inhibit the growth of Lactobacillus sakei, Pediacoccus pentasaceus, Pediacoccus acidilactici, etc. The application of distiller yeasts producing antibacterial peptides enhances the ethanol yield in cases of bacterial contamination. Recombinant yeasts producing the antibacterial peptides pediocin and plantaricin can successfully substitute the available industrial yeast strains upon ethanol production.
Kumar, Vidya Pradeep; Kolte, Atul P; Dhali, Arindam; Naik, Chandrashekar; Sridhar, Manpal
2018-04-25
Utilization of energy-rich crop residues by ruminants is restricted by the presence of lignin, which is recalcitrant to digestion. Application of lignin degrading enzymes on the lignocellulosic biomass exposes the cellulose for easy digestion by ruminants. Laccases have been found to be considerably effective in improving the digestibility by way of delignification. However, laccase yields from natural hosts are not sufficient for industrial scale applications, which restricts their use. A viable option would be to express the laccase gene in compatible hosts to achieve higher production yields. A codon-optimized synthetic variant of Schizophyllum commune laccase gene was cloned into a pPIC9K vector and expressed in P. pastoris GS115 (his4) under the control of an alcohol oxidase promoter. Colonies were screened for G418 resistance and the methanol utilization phenotype was established. The transformant yielded a laccase activity of 344 U·mL -1 after 5 days of growth at 30°C (0.019 g·mL -1 wet cell weight). The laccase protein produced by the recombinant Pichia clone was detected as two bands with apparent molecular weights of 55 kDa and 70 kDa on SDS-PAGE. Activity staining on native PAGE confirmed the presence of bioactive laccase. Treatment of five common crop residues with recombinant laccase recorded a lignin loss ranging between 1.64% in sorghum stover, to 4.83% in finger millet, with an enhancement in digestibility ranging between 8.71% in maize straw to 24.61% in finger millet straw. Treatment with recombinant laccase was effective in enhancing the digestibility of lignocellulosic biomass for ruminant feeding through delignification. To date, a number of hosts have been adventured to produce laccase in large quantities, but, to our knowledge, there are no reports of the expression of laccase protein from Schizophyllum commune in Pichia pastoris, and also on the treatment of crop residues using recombinant laccase for ruminant feeding.
Ovine Reference Materials and Assays for Prion Genetic Testing
USDA-ARS?s Scientific Manuscript database
Codon variants implicated in scrapie susceptibility or disease progression include those at amino acid positions 112, 136, 141, 154, and 171. Nine single nucleotide polymorphisms (SNPs) determine which residues are encoded by the five implicated codons and accurately scoring these SNPs is essential...
Chan, Ying; Zhu, Baosheng; Jiang, Hongguo; Zhang, Jinman; Luo, Ying; Tang, Wenru
2016-01-01
To evaluate the association of the TP53 codon 72 (rs 1042522) alone or in combination with HDM2 SNP309 (rs 2279744) polymorphisms with human infertility and IVF outcome, we collected 1450 infertility women undergoing their first controlled ovarian stimulation for IVF treatment and 250 fertile controls in the case-control study. Frequencies, distribution, interaction of genes, and correlation with infertility and IVF outcome of clinical pregnancy were analyzed. We found a statistically significant association between TP53 codon 72 polymorphism and IVF outcome (52.10% vs. 47.40%, OR = 0.83, 95%CI:0.71-0.96, p = 0.01). No significant difference was shown between TP53 codon 72, HDM2 SNP309 polymorphisms, human infertility, and between the combination of two genes polymorphisms and the clinical pregnancy outcome of IVF. The data support C allele as a protective factor for IVF pregnancy outcome. Further researches should be focused on the mechanism of these associations.
Scherer, N M; Basso, D M
2008-09-16
DNATagger is a web-based tool for coloring and editing DNA, RNA and protein sequences and alignments. It is dedicated to the visualization of protein coding sequences and also protein sequence alignments to facilitate the comprehension of evolutionary processes in sequence analysis. The distinctive feature of DNATagger is the use of codons as informative units for coloring DNA and RNA sequences. The codons are colored according to their corresponding amino acids. It is the first program that colors codons in DNA sequences without being affected by "out-of-frame" gaps of alignments. It can handle single gaps and gaps inside the triplets. The program also provides the possibility to edit the alignments and change color patterns and translation tables. DNATagger is a JavaScript application, following the W3C guidelines, designed to work on standards-compliant web browsers. It therefore requires no installation and is platform independent. The web-based DNATagger is available as free and open source software at http://www.inf.ufrgs.br/~dmbasso/dnatagger/.
Methylation of class I translation termination factors: structural and functional aspects.
Graille, Marc; Figaro, Sabine; Kervestin, Stéphanie; Buckingham, Richard H; Liger, Dominique; Heurgué-Hamard, Valérie
2012-07-01
During protein synthesis, release of polypeptide from the ribosome occurs when an in frame termination codon is encountered. Contrary to sense codons, which are decoded by tRNAs, stop codons present in the A-site are recognized by proteins named class I release factors, leading to the release of newly synthesized proteins. Structures of these factors bound to termination ribosomal complexes have recently been obtained, and lead to a better understanding of stop codon recognition and its coordination with peptidyl-tRNA hydrolysis in bacteria. Release factors contain a universally conserved GGQ motif which interacts with the peptidyl-transferase centre to allow peptide release. The Gln side chain from this motif is methylated, a feature conserved from bacteria to man, suggesting an important biological role. However, methylation is catalysed by completely unrelated enzymes. The function of this motif and its post-translational modification will be discussed in the context of recent structural and functional studies. Copyright © 2012 Elsevier Masson SAS. All rights reserved.
Novel mutations responsible for α-thalassemia in Iranian families.
Bayat, Nooshin; Farashi, Samaneh; Hafezi-Nejad, Nima; Faramarzi, Negin; Ashki, Mehri; Vakili, Shadi; Imanian, Hashem; Khosravi, Mohsen; Azar-Keivan, Azita; Najmabadi, Hossein
2013-01-01
α-Thalassemia (α-thal) is usually caused by deletions on the α-globin gene cluster and the role of point mutations is less well investigated. In the present study, a total of 1048 individuals with hypochromic microcytic anemia, who did not present the most common α-thal deletions, were referred for α-globin gene DNA sequencing. The nucleotide changes were studied and a total of five new mutations was identified, of which three were located on the α2 gene [codon7 (Lys→Stop), codon 34 (Leu→Pro) and codon 83 (Leu→Arg)] and two on the α1 gene [IVS-I-116 (A>G) and codon 44 (+C)]. These novel mutations not only explain new findings by molecular analysis of the α-globin gene but also have clinical importance due to their changes in α-globin production in means of decreased hemoglobin (Hb) related values. Moreover, considerations of its role in combination with other mutations, and the possibility of causing Hb H (β4) are yet to be studied.
A method for multi-codon scanning mutagenesis of proteins based on asymmetric transposons.
Liu, Jia; Cropp, T Ashton
2012-02-01
Random mutagenesis followed by selection or screening is a commonly used strategy to improve protein function. Despite many available methods for random mutagenesis, nearly all generate mutations at the nucleotide level. An ideal mutagenesis method would allow for the generation of 'codon mutations' to change protein sequence with defined or mixed amino acids of choice. Herein we report a method that allows for mutations of one, two or three consecutive codons. Key to this method is the development of a Mu transposon variant with asymmetric terminal sequences. As a demonstration of the method, we performed multi-codon scanning on the gene encoding superfolder GFP (sfGFP). Characterization of 50 randomly chosen clones from each library showed that more than 40% of the mutants in these three libraries contained seamless, in-frame mutations with low site preference. By screening only 500 colonies from each library, we successfully identified several spectra-shift mutations, including a S205D variant that was found to bear a single excitation peak in the UV region.